At the start, we knew the Two-to-Two recombination would be difficult.
Previous systems like ORBIT reached only ~1% efficiency, and even CRISPR/Cas9 + λ-Red rarely exceeded 40% for chromosomal edits.
So, achieving dual-site integration in one step felt nearly impossible.
By week three, after four failed attempts, frustration had set in.
We had the right construct and plasmid — but integration simply wouldn’t work.
Instead of guessing again, we built a tool to predict what was wrong before the next experiment.
That tool became GenOMe Navigator.
Weeks later, it showed us the answer: our induction window was too short.
We extended the timing — and the next experiment worked exactly as predicted.
What once seemed impossible became an 80% success rate, proving that data-driven design can replace trial-and-error.
GenOMe Navigator transformed our workflow from trial-and-error to data-driven design.
Before using the software, we experienced four consecutive failures in the Two-to-Two recombination, with a success rate close to zero.
After applying its modeling recommendations, the first attempt succeeded with an integration efficiency of ~80%. Specifically, Mode A (site selection) improved from 0% to ~0.0195%, while Mode B (production integration) consistently reached ~82%.
Experimental cost was greatly reduced by minimizing failed trials and optimizing induction parameters, saving approximately 10 days and 300USD in reagents. By turning uncertainty into quantitative prediction (±3% accuracy), GenOMe Navigator replaced frustration with confidence — demonstrating real, measurable impact on experimental design.
| Layer | Function | Key Determinant |
|---|---|---|
| Locus Layer | Defines the attainable upper limit of efficiency | Replication accessibility (oriC proximity) |
| DNA Geometry Layer | Defines the rate of decline in efficiency | att-site coordination and insert length |
| Protein Layer | Determines whether the upper limit is reached | Induction timing and expression dynamics |
GenOMe Navigator saved approximately 10 days and 300USD in reagents, transforming our workflow from frustration to confidence.
GenOMe Navigator quantitatively deconstructs Bxb1 genome integration into three mechanistic layers:
Calibrated with experimental data, the model outputs success-rate predictions and standard operating procedure (SOP) recommendations that guide users from in silico planning to wet-lab execution.
Under 0.3–1.0 µM Bxb1 and a 30-minute reaction window, fragment length (1–2 kb) becomes the dominant factor, yielding integration efficiencies of at least 80%.
Problem: Repeated integration failures despite correct constructs. Navigator diagnosis: At 30 minutes, active Bxb1 fraction = 12% (below the 40% threshold). Recommendation: Extend induction to 240–300 minutes. Result: The next experiment succeeded as predicted.
Problem: Unclear how many colonies to plate. Navigator calculation: For a 1.5 kb fragment, predicted success = 82%; recommended 2–3 plates (30 CFU each) to obtain ~50 positives. Result: Two plates yielded 49 positives, saving both time and materials.
Before Navigator: “Should we try 0.8 kb or 2.5 kb?”—leading to more unnecessary trials.
Navigator output:
| Fragment Length | Predicted Success |
|---|---|
| 0.8 kb | 92% |
| 1.5 kb | 82% |
| 2.5 kb | 58% |
| 3.5 kb | 31% |
Decision: Test DNA fragments in the 0.8–1.5 kb range based on Navigator’s prediction.
Result: Both fragments integrated successfully on the first attempt, confirming the predicted optimal range.
The logical structure of GenOMe Navigator is illustrated below.
Each prediction integrates three interpretable modules:
Together, these modules convert empirical trial-and-error into quantitative, design-driven decision-making.
Each prediction guided the wet-lab team in adjusting induction time, protein expression, and plating effort, forming a continuous feedback loop between modeling and experimentation.
Mode A was developed to address the first-round integration, where both protein activity and genomic context strongly determine success.
Early experiments repeatedly failed despite correct constructs. Through GenOMe Navigator’s protein-layer module, we realized that the induction time being used was too short—the predicted active protein fraction had not yet reached the productive range.
After extending the induction period and increasing the effective protein concentration, the following experiments succeeded exactly as the model predicted.
Mode A also quantified how genomic factors affect success: loci farther from oriC or with unbalanced GC content exhibited lower probabilities.
While the two-to-two configuration requires coordinated recombination at both ends, it is the only route to achieve complete cassette integration. Our model demonstrated that, once optimized, its predicted efficiency approaches that of one-to-one events—proving that high-efficiency dual-end recombination is not only feasible but practical for production-level genome engineering.
Mode B represents the second-round two-to-two process, where both att-site pairs are already pre-installed.
In this mode, the Navigator showed that fragment length rather than protein concentration became the dominant determinant of success.
Under moderate induction (≈30 min), fragments between 1–2 kb consistently achieved ≥ 80 % predicted efficiency, matching experimental observations.
The Quick Lookup Table converts model outputs into concrete lab instructions predicted success rate, estimated positive colonies, and recommended plate numbers.
When Mode A predicted very low success at certain loci, we plated four to five times more colonies to ensure enough positives in one round.
When Mode B predicted high success, we reduced plating to save materials.
Each visualization corresponds to one of the three mechanistic layers of GenOMe Navigator:
| Layer | Description | Output |
|---|---|---|
| Protein | Induction duration and active fraction of recombinase proteins | φtot map |
| DNA | Sequence- and locus-specific determinants | Site-selection heatmap |
| Population | Transformation variability and plating success | Predicted colony yield |
Through this integrated workflow:
Mode A helped the wet-lab team correct protein induction conditions, turning repeated failures into success.
Mode B enabled precise plate-number planning and fragment-length optimization.
Predictions and outcomes were consistent within ±0.4 % MAE across all validations.
GenOMe Navigator thus functioned not as an auxiliary simulator but as an active experimental co-designer, enabling the team to iterate faster and more accurately across both dry- and wet-lab cycles.
GitLab(1): GenOMe Navigator
GitLab(2): GenOMe Navigator
GitHub:GenOMe Navigator
Main modules:
m5_tool.py – main calculator
plot_figures.py – figure generator
calibrate.py – cross-strain calibration
params.json – parameter list
data/m5_observed.csv – validation data
All modules are self-contained and runnable via command line or web interface.
Installation (< 5 minutes)
git clone https://gitlab.com/NYCU-Formosa/GenOMe-Navigator.git
cd GenOMe-Navigator
pip install -r requirements.txt
streamlit run app.py
Try it now: Input L=1.5 kb, C=0.387 µM, t=30 min
→ Predicted success: 82%
→ Recommended plates: 2-3
This reproduces our actual experimental design.
$$ \text{logit}(P) = a - bL, \quad P = \frac{1}{1 + e^{-(a-bL)}} $$ For Mode A: $$ P \propto e^{-\gamma(gc-0.5)^2} \cdot e^{-k_d d} $$ Population layer: $$ E[\text{positives}] = N_{\text{CFU}} \times \alpha_{\text{pop}} \times P $$
These equations bridge molecular-scale parameters with observable experimental outcomes, predicting how insert length, GC content, and genomic position jointly determine success rates.
To adapt GenOMe Navigator to a new strain or recombination system:
This ensures reproducibility and cross-lab transferability.
While our validation focuses on the Bxb1-E. coli system, the three-layer architecture is designed to be recombinase-agnostic.
The calibration protocol requires only 3 validation experiments to adapt the model to new systems (e.g., ΦC31, Cre-lox).
Design principle: Mechanistic models with few parameters generalize better than black-box ML on small datasets.
License: MIT (Open Source Initiative–approved)
We invite contributions through GitLab issues and pull requests:
Submission of new strain datasets or recombinase parameters (e.g., ΦC31, TP901-1)
Improvements to the Streamlit interface or localization (English / Chinese)
Expansion to new model layers (e.g., host growth or metabolic burden)
All datasets and code are open and version-controlled for transparency and reproducibility.
GenOMe Navigator represents a paradigm shift in genome engineering:
From intuition to prediction.
From trial-and-error to design.
From black boxes to interpretable models.
We built this tool because we needed it, and we are sharing it because others will too.