Skip to content

Model

On this page, we describe a model of dynamic protein Timer in yeast, with design recommendations independently validated by AI-assisted reasoning, before any wet-lab experiment.

📑Contents
No headers found on this page

Highlights — A New Paradigm for Synthetic Biology in the AI Era

Traditional synthetic biology relies on iterative Design–Build–Test–Learn (DBTL) cycles, where modeling informs initial designs, but multiple rounds of wet-lab experimentation are often needed to refine parameters and achieve functional outcomes. While effective, this process can be time-consuming and resource-intensive, especially when initial model predictions lack sufficient biological fidelity.

Our work reimagines this cycle by introducing an AI-augmented modeling framework that dramatically increases the predictive accuracy of in silico design—enabling first-attempt success in wet-lab experiments. Centered on a biophysically grounded model of fluorescent timer (Fast-FT) dynamics in yeast, we systematically screen critical parameters—such as Ash1_AIpro[1] promoter pulse width (10–15 min), promoter strength (1×), and maturation kinetics—under realistic cellular conditions (30°C, YPD medium). Crucially, these model-derived recommendations were independently validated by two large language models (DeepSeek and Qwen), which—when prompted only with biological first principles—converged on the same optimal design choices.

This convergence between mechanistic modeling and AI reasoning provides unprecedented confidence in pre-experimental parameter selection. Rather than replacing the DBTL cycle, our approach supercharges the “Design” phase, minimizing failed builds and accelerating the path to reliable, interpretable results. By demonstrating that AI can serve as a “AI reasoning partner” in hypothesis generation and experimental planning, we offer a novel and replicable framework for future research, where computationally guided design paves a reliable and efficient path to experimental realization.

Why this Model?

In synthetic biology and cellular timing studies, constructing a reliable and readable intracellular timer is crucial for understanding cellular life cycles, lineage relationships, and dynamic processes. Fluorescent Timer (FT) proteins, which change color over time, have become a powerful tool for achieving this goal. However, in practical applications, the performance of FT proteins is influenced by multiple factors, including expression strategies, maturation dynamics, cell division inheritance, and environmental conditions. To systematically evaluate and optimize these factors before conducting experiments, we have developed this mathematical model. It aims to theoretically screen feasible parameter ranges, validate the reasonableness of promoter selection and expression control, and provide clear design guidelines for subsequent wet-lab experiments.

Introduction

This model is centered around the "single pulse → three-step irreversible maturation chain (C→B→I→R)" and simulates the entire process from promoter expression to the complete maturation of the fluorescent protein. It uses and as criteria to evaluate the readability and monotonicity of the timer. By adjusting the pulse width (the duration of Ash1 pulses), maturation time constants (), and pulse intensity (adjusting promoter strength), we first screen for "usable intervals" mathematically, and then provide reproducible target conditions for wet-lab experiments. The model compares periodic promoters (Ash1) with constitutive promoters: the former provides short and clean expression windows, improving and monotonicity; the latter, under the same maturation parameters, tends to dilute time resolution due to continuous new molecule generation, and thus is used as a control to demonstrate the necessity of pulse gating. To address cases where promoter expression is either too high or too low, the model incorporates promoter strength () and intergenerational inheritance (inheritance fraction, ) into sensitivity analysis. By controlling promoter strength, we ensure that the system provides an appropriate level of protein expression while avoiding premature saturation or insufficient signaling. In summary, the model systematically examines and quantifies two critical factors: promoter selection and promoter strength, validating the reasonableness of selecting the Ash1 promoter and an appropriate promoter strength from both the mathematical and model perspectives before conducting wet-lab experiments.

Model Design

In a yeast multicellular system, the timer fluorescent protein (FT) is introduced to represent the time process between the birth and maturation of cells. FT consists of an irreversible maturation chain: C→B→I→R, where:

  • C represents the immature protein;
  • B represents the blue state;
  • I represents the intermediate state;
  • R represents the red state.

The color change reflects the time elapsed since the cell's birth.

This model is based on the following experimental settings:

  • Cell cycle: approximately 87 minutes[2] (source: BioNumbers);
  • Fluorescent protein: Fast-FT[3] is selected;
  • Expression trigger: using a single pulse (based on the later comparison of periodic and constitutive promoters).

Model Parameters

Table 1. Parameters (global variables) for the model
ParameterMeaningDefault ValueUnit
Cell cycle length87[2:1]min
Single-cell simulation duration20160 (customizable and modifiable)min
Integration step size1.0 (customizable and modifiable)min
Pulse duration15 (as demonstrated below)min
Pulse amplitude1.0 (as demonstrated below)-
mRNA degradation rate[4]min-1
C to B time constant12.0[3:1]min
B to I time constant45.0[3:2]min
I to R time constant720.0[3:3]min
Red protein half-life247.2 (estimated, assumed to be stable, with no significant degradation compared to the experimental window)h
Inheritance fraction of C-state protein1.0 (estimated, to simplify the model, modifiable within [0.0, 1.0])-
C to B rate constant (calculate from known value )min-1
B to I rate constant (calculate from known value )min-1
I to R rate constant (calculate from known value )min-1
Red protein degradation rate (calculate from estimated value )min-1

Fast-FT Time Parameter Calculation

1. Basis for Calculation and Core Logic

The fluorescence maturation of Fast-FT (from blue to red) is essentially a chemical process involving chromophore oxidation and conformational rearrangement. The rate of this process is strictly dependent on temperature and follows the biochemistry "temperature coefficient (Q10)" rule — that is, for every 10°C increase, the reaction rate increases by 1.5–2.5 times. The typical Q10 value for FT-like proteins in the literature is close to 2.0.

1.1 Literature Data Summary

The core data for the calculation is derived from literature on Fast-FT key time parameters at 25°C (in vitro purified + intracellular, Drosophila S2 cells) and 37°C (in vitro purified + intracellular, HeLa cells). Specific data are shown below (data source[3:4]):

Table 2. Data summary for Fast-FT time parameter
TemperatureExperimental SystemBlue Fluorescence Peak Time (h)Red Fluorescence Half-Peak Time (h)Remarks
25°CIn vitro purified protein0.5818.0No cellular environment, reflects chromophore maturation
25°CIntracellular (S2 cells)13.038.0 (estimated)Includes mRNA translation delay and protein degradation
37°CIn vitro purified protein0.257.1Higher temperature accelerates maturation, much faster than at 25°C
37°CIntracellular (HeLa cells)17.0 (trend value)41.0 (trend value)Longer blue/red time compared to in vitro

1.2 Q10 Calculation (Temperature Coefficient)

Q10 is the core index for quantifying the impact of temperature on the reaction rate. The formula (Arrhenius equation[5]) is:

Where:

  • and are the reaction rates at temperatures T1 and T2;
  • T1 and T2 are the two temperature points (°C).

We calculate Q10 based on the in vitro data at 25°C (T1) and 37°C (T2):

  • T1=25°C, red half-peak time , rate ;
  • T2=37°C, red half-peak time , rate ;
  • Temperature difference .

Substituting into the formula:

This result aligns with the typical Q10 range (1.8–2.2) for FT-like proteins in literature, confirming the reliability of the subsequent calculations.


1.3 Parameter Calculation at 30°C

For yeast experiments, the target temperature is 30°C (T3). We will use the data at 25°C (T1) for intracellular conditions (aligned with the yeast physiological environment) and a Q10 value of 2.0 for calculations:

(1) Core Formula

When the temperature rises from T1 (25°C) to T3 (30°C), the temperature difference is , and the rate increase factor and time shortening ratio are:

Thus, the 30°C time can be calculated as:

(2) Specific Parameter Calculation
Table 3. Specific parameter calculation at 30°C
Metric25°C Intracellular Time (h)30°C Estimated Time (h)Calculation Logic
Blue Fluorescence Peak Time13.09.213.0 / 1.41 ≈ 9.2, considering differences in translation rate between yeast and S2 cells
Red Fluorescence Half-Peak Time38.030.038.0 / 1.41 ≈ 30.0, more stable as red maturation is less influenced by intracellular factors
Red Fluorescence Peak Time200.0 (S2 cells)141.8200.0 / 1.41 ≈ 141.8, used for long-term tracking experiments (e.g., intergenerational lineage analysis)

1.4 Reasonableness Check
  • Temperature Range Match: 30°C is between 25°C (lower limit for yeast) and 37°C (mammalian body temperature), so the estimated result is free from extrapolation errors.
  • Cellular Environment Consistency: The translation and protein degradation rates in S2 cells at 25°C align closely with yeast, and since both are lower eukaryotes, the biochemical interference is minimal, ensuring the chemical maturation and biological interference total time is comparable.
  • Non-Core Variables Exclusion: The pH of YPD 6.0–6.5 is within the fluorescent stability range of Fast-FT (pH 5.4–7.4), and the YPD medium only provides nutrients without affecting chromophore chemical maturation. Hence, no parameter adjustments are needed.

Model Assumptions

  1. Single Pulse: Each cell expresses the fluorescent protein once during the late M phase of the cell cycle and stops expression after the pulse.
  2. Protein Transfer Mechanism: The immature C-state protein generated in the mother cell enters the daughter cell and matures within the daughter cell.
  3. Irreversible Maturation Chain: The maturation process from C (immature protein) to B (blue state), I (intermediate state), and R (red state) is irreversible.
  4. Red-state Degradation: Red-state protein has a degradation rate controlled by its half-life, .

Model Establishment

1) Input

The input function represents a "pulse starting at the birth time, lasting for minutes" with an amplitude :

where indicates that when is within the duration of the pulse, ; otherwise, it is 0. This is specifically implemented as u_birth_pulse.

python
def u_birth_pulse(t_since_div: float, pulse_width: float, pulse_amp: float, 
                  start_offset: float, Tcc: float) -> float:
    """Generate cell division-related expression pulse"""
    # Calculate time position within cell cycle (0 to Tcc)
    cycle_time = t_since_div % Tcc
    # pulse starts near end of cell cycle (from start_offset to Tcc)
    if start_offset <= cycle_time <= Tcc:
        # maintain amplitude within pulse duration, else 0
        return pulse_amp if (cycle_time - start_offset) <= pulse_width else 0.0
    return 0.0

2) State Variables

  • m: mRNA concentration

  • C: Immature protein concentration

  • B: Blue-state concentration

  • I: Intermediate-state concentration

  • R: Red-state concentration

  • Output reading:

3) Kinetic Equations

The system's kinetic equations describe mRNA synthesis, translation, and maturation processes:

These equations are computed in the inline function f(t, y) in simulate_single_cell, and then advanced using a numerical integration method (such as RK4)[6][7].

python
def f(t: float, y: np.ndarray) -> np.ndarray:
    m, C, B, I, R, v, t_since_div = y
    
    # Cell division logic (concentration dilution and state reset)
    division_occurred = False
    if t_since_div >= p.Tcc_min or v >= p.v0 * p.division_volume_ratio:
        # divide all concentrations by 2 (dilution)
        m = m / 2.0
        C = C / 2.0
        B = B / 2.0
        I = I / 2.0
        R = R / 2.0
        v = p.v0  # reset volume
        t_since_div = 0.0  # reset division timer
        division_occurred = True
    
    # Promoter pulse input
    u = u_birth_pulse(t_since_div, p.pulse_width_min, p.pulse_amp, p.pulse_start_offset, p.Tcc_min)
    k_tx_base, k_tl_base = get_transcription_translation_rates(t_since_div, p.Tcc_min)
    
    # Calculate crowding effect
    total_conc = m + C + B + I + R
    crowding_factor = crowding_correction(total_conc, p.crowding_threshold, p.crowding_strength)
    
    # Apply crowding correction to rates
    k_tx = k_tx_base * crowding_factor
    k_tl = k_tl_base * crowding_factor
    k_B = p.k_B * crowding_factor
    k_I = p.k_I * crowding_factor
    k_R = p.k_R * crowding_factor

    # Calculate derivatives with volume correction
    dm_dt_amount = k_tx * u - p.k_dm * m
    dm_dt = (dm_dt_amount / v) - (m / v) * (p.growth_rate * v)  # mRNA rate
    
    dC_dt_amount = k_tl * m - k_B * C
    dC_dt = (dC_dt_amount / v) - (C / v) * (p.growth_rate * v)  # C-state rate
    
    dB_dt_amount = k_B * C - k_I * B
    dB_dt = (dB_dt_amount / v) - (B / v) * (p.growth_rate * v)  # B-state rate
    
    dI_dt_amount = k_I * B - k_R * I
    dI_dt = (dI_dt_amount / v) - (I / v) * (p.growth_rate * v)  # I-state rate
    
    dR_dt_amount = k_R * I
    if include_decay:
        dR_dt_amount -= p.k_DR * R
    dR_dt = (dR_dt_amount / v) - (R / v) * (p.growth_rate * v)  # R-state rate
    
    dv_dt = p.growth_rate * v  # volume growth rate
    dt_since_div_dt = 1.0      # division timer increment
    
    return np.array([dm_dt, dC_dt, dB_dt, dI_dt, dR_dt, dv_dt, dt_since_div_dt])

4) Inheritance Logic

Only the immature C-state protein is inherited, while B, I, and R are not inherited. The initial conditions for daughter cells are:

python
# Daughter cell inherits mother's C-state protein
if t[-1] >= p.Tcc_min:
    # interpolate C-state concentration at division time
    C_T = float(np.interp(p.Tcc_min, t, out["C"]))
else:
    C_T = out["C"][-1]
# calculate initial C-state for daughter cell by inheritance ratio
C_inherit = p.inherit_frac_C * C_T

# Initial state of daughter cell (B/I/R are 0)
# B_inherit = 0.0
# I_inherit = 0.0
# R_inherit = 0.0

Modeling Results

Single-cell time series

Objective match: We verify whether the chosen maturation constants yield the expected phase ordering (blue rises first, red later), and whether the readout is monotonic.

Observations:

  • peaks at approximately and then decays.
  • exhibits a delayed rise and accumulates markedly only after several tens of hours.
  • As , .

Consistency check: The curve shapes are consistent with a three-step irreversible chain .

Conclusion: The relative ordering of maturation times is physically reasonable and reproduces the expected waveform morphology.

Figure 1. Single-cell- B-R (left) & r (right)

Pulse width vs

Objective match: For the window under the current maturation times, assess how temporal resolvability varies with pulse width .

Observations (quantified): As increases, decreases approximately linearly:

  • give ;
  • give .

Criterion: With a typical quantitative imaging noise , the “3σ” line is . Thus:

  • passes;
  • is borderline;
  • fails.

Conclusion: Under the present Fast-FT kinetics, constraining the birth pulse to yields in the window and is therefore readable.

Figure 2. δr(65→145 min) vs pulse width τ

Lineage analysis

Birth-aligned lineage heatmaps show that increases roughly monotonically over time in each generation, with the largest increment concentrated in . Across generations, the signal exhibits a unimodal pattern: it grows from to (maximal at ), then declines from onward, reaching its lowest level by . Row-wise baseline correction within each generation () enhances contrast and emphasizes the same inter-generational ordering and the slope differences in the interval, without altering the overall temporal trend.

Figure 3. Lineage r heatmap (birth-aligned)


Figure 4. Lineage r' heatmap (baseline-corrected)

Results and Discussion

Periodic Promoter vs Constitutive Promoter

vs Pulse Width

  • Observation: Under fixed expression strength, decreases monotonically as the pulse width increases: is superior to , and both are much better than or ON (continuous expression), namely > ≫ ON.

    The readings are approximately as follows:

    • , (significantly above the threshold );

    • , , lower than 0.09;

    • ON expression gives , the worst performance.

  • Criterion: measures the dynamic range within the window. Using a typical noise level of , the 3 threshold is .

    • passes;
    • is borderline;
    • fails.
  • Conclusion: Shorter pulses, such as , effectively compress the expression near cell birth, followed by the maturation process, yielding a clear signal within the window. Meanwhile, continuous expression dilutes the signal, making it difficult to distinguish changes in .

Figure 5. Δr vs τ (10-90 min window)

Overlaid Curves

  • Observation: All curves show monotonic increase within , but the slope and final value decrease significantly with increasing , with the ON expression showing the flattest rise.

  • Interpretation: While monotonicity is achieved (a key feature of the timer), readability (slope and amplitude) is strongest in shorter pulses:

  • and show the fastest increase in from with the highest final value;

    • The ON curve rises the slowest.

This pattern occurs because after a short pulse, the system almost no longer adds new blue protein, and the red protein accumulates, making increase rapidly. In contrast, continuous expression constantly adds new blue proteins, dampening .

Figure 6. r(t) overlay (fairness- equal_intensity)

Lineage Heatmap Comparison

Lineage Heatmap Comparison
  • Observation: The left side of all heatmaps (near birth) shows a "bright band," indicating inheritance of the red baseline across generations. Comparing the three conditions:
    • : After birth, rises quickly, and the "bright band" dims, followed by a rapid brightening in later generations, showing strong temporal resolution.

    • : The gradient is much shallower.

    • ON: The gradient is the flattest and overall dimmer, indicating poor resolution within .

  • Interpretation: Shorter pulses result in stronger intra-generational dynamics, while ON expression leads to weak signal contrast within generations, making inter-generational analysis less effective.
(Baseline Subtracted) Lineage Heatmap Comparison
  • Observation: After baseline subtraction, all three heatmaps show the "pure contemporary increment". The color scale upper limit reflects the dynamic range:

    • : The upper limit is with the steepest gradient across generations.

    • : The upper limit is , with a gentler gradient.

    • ON: The upper limit is , with the flattest gradient.

  • Interpretation: Even after baseline correction, shorter pulses maintain strong temporal resolution within generations, while ON expression fails to distinguish subtle age differences.

Figure 7. Lineage Heatmap Comparison

Single-Chain Four-Generation Overlaid Curves

  • Observation: In the time series of the mother cell (G0) to the fourth generation (G4):

    • : Each generation's curve shows significant bending and steepening in the window.

    • ON: Each generation’s curve rises more slowly, with weaker bending and a smoother curve in the early stage.

  • Interpretation: Over multiple generations, maintains high readability, while ON expression causes a slow start, making it difficult to finely resolve age differences between generations.

Figure 8. Four-generation rprime chains - fairness equal_intensity

In summary, using a periodic promoter (Ash1) with a pulse width controlled between 10–15 minutes provides a reliable and distinguishable timing signal; in contrast, a constitutive promoter is not suitable for use as a timer input.


Strong Promoter vs Weak Promoter

In this model, the red-to-blue ratio is used as the key indicator for evaluating timer performance, representing the ratio of red-state protein (R) to blue-state protein (B) at any given time, as described by the formula:

This ratio reflects the maturation process and transition dynamics of the protein, serving as a crucial representation of the time evolution. The strength of the promoter has a different impact on the timer performance depending on the pulse amplitude and pulse width. [8]Changing the pulse amplitude affects the overall protein expression level in the system, while the pulse width directly influences the time distribution of protein production. To balance these two factors, we compare the performance of strong and weak promoters.

1. Advantages and Limitations of Strong Promoters

Strong promoters (such as Factor 3x and Factor 5x) exhibit rapid dynamic responses in the red-to-blue ratio , allowing the system to quickly reach equilibrium (close to 1) within a short time. This is because stronger promoters provide higher protein expression, enabling the rapid conversion of blue-state protein (B) to red-state protein (R). However, strong promoters may lead to rapid saturation of the system, which is a drawback for their function as timers. A timer needs to maintain the variation in the red-to-blue ratio over a longer period, but strong promoters may cause the system to stabilize at the saturation point too quickly, resulting in reaching 1 too soon and losing the ability to track time changes effectively.

2. Advantages and Limitations of Weak Promoters

In contrast, weak promoters (such as Factor 0.5x and Factor 0.75x) produce lower expression levels, but they drive the red-to-blue ratio change more gradually. Under these conditions, the rise in the red-to-blue ratio is slower, allowing the system to provide continuous dynamic changes over a longer period, making more suitable for use as a timer. Weak promoters effectively prevent rapid saturation and can steadily reflect the cell's time evolution, providing enough variation to infer the cell’s age or state. However, weak promoters have clear limitations. First, lower protein expression results in weaker signals, which affects the signal-to-noise ratio (SNR). In low-signal conditions, measurement noise significantly impacts , especially over longer experimental durations, where signal fluctuations may become more pronounced, potentially causing noise to obscure the real signal and affect the accuracy and readability of the timer. Additionally, weak promoters are more prone to interference from experimental noise due to their lower expression levels. The weak response makes the system more sensitive to noise, leading to larger fluctuations in , thus decreasing the system's stability and repeatability. Therefore, although weak promoters provide smoother dynamic changes, their higher noise sensitivity makes them less suitable than strong or medium-strength promoters for high-precision timer applications.

3. Balance of Medium Strength Promoters (1x)

After comparing various promoter strengths, a medium-strength promoter (such as the 1x promoter, shown by the green curve in the graph) demonstrates the best balance:

  • Moderate Dynamic Response: With the 1x promoter, the red-to-blue ratio increases at a moderate rate, enabling quick response without premature saturation, providing smoother and more stable changes.
  • Higher Signal-to-Noise Ratio (SNR): The 1x promoter provides a higher SNR, meaning the experimental results are clearer and more reliable, making it ideal for high-precision timer applications.
  • Avoidance of Excessive Inheritance Effects: Strong promoters (like Factor 3x and 5x) can cause significant inheritance effects, resulting in high baseline red-state protein levels in the previous generation, which may affect the measurement accuracy in the next generation. The 1x promoter provides better control over this issue.

4. Conclusion

  • Weak Promoters (0.5x and 0.75x): These are better suited for timers requiring slower and continuous changes, but they may perform poorly in high-precision applications due to their weak signal.
  • Strong Promoters (3x and 5x): While strong promoters have advantages in fast response and high expression, their rapid saturation makes them unsuitable for precise timers, as they lose the ability to maintain continuous variation.
  • 1x Medium Promoter: Provides the best balance, ensuring moderate response speed, high-quality signal, and low noise, making it the ideal choice for timers, particularly in applications that require smooth changes and high precision.

Therefore, choosing the 1x medium promoter is a reasonable decision that balances signal strength, dynamic response, and noise control, while avoiding rapid saturation and unstable system performance. It is especially suitable as a timer for measuring cellular lifespan.

Figure 9. Strong Promoter vs Weak Promoter

Visualization

To enhance the interpretability and accessibility of our modeling framework, we developed two interactive 3D visualization tools as part of the YeastVerse virtual experiment platform. These tools simulate key aspects of the multicellular yeast chassis and fluorescent timer dynamics, providing intuitive insights into system behavior under various parameters. Built using React.js and Three.js, they allow users to explore spatial growth patterns and temporal fluorescence changes in real-time, bridging abstract mathematical predictions with visual, biologically grounded representations.

Try YeastVerse Visualization at https://yeastverse-99735e.igem.wiki/

Source code at https://gitlab.igem.org/2025/fudan/-/tree/main/model/YeastVerse

3D Yeast Growth Simulation

This module simulates the three-dimensional growth of the multicellular "Grape Yeast" cluster, modeling cell division, elongation, and cluster formation under controlled environmental conditions (e.g. temperature at 30°C, YPD medium). Starting from a single ancestral cell, the simulation applies deterministic branching rules inspired by Pascal’s triangle developmental pattern[9], incorporating asynchronous division to generate geometrically defined, clonal clusters. Users can toggle between "Grape" and "normal" yeast types, adjusting simulation speed (0.5–3×) to observe morphological transitions and cluster expansion. This interactive visualization gives us a more graphic understanding of "Grape Yeast's'' growth, thereby guiding the design of our multicellular chassis to enable precise, real-time tracking of drug resistance evolution in the DR.sTraTeGY platform.

Table 4. Parameters (local variables) for 3D Yeast Growth Simulation
ParameterMeaningDefault Value*Unit
max_length_ratiolong/short axis ratio for cell elongation under low oxygen/stress1.8 (estimated)[9:1][10]-
nucleus sizeRelative nucleus radius in cell model0.3 (estimated)-
produceTwoCells probProbability of multi-bud formation (bipolar mode)0.3–0.6 (estimated)-
maxAngleBudding cone angle limitGrape: 20°; normal: 60° (estimated)degree
Note. *The term "estimated" means that few corresponding literature data was found during the modeling process, but subsequent wet-lab experiments have provided some measurements for this value.

View our Measurement and Results.

Other minor parameters are detailed in the source code.

Figure 10. Animation demo of 3D Yeast Growth Simulation


3D Fluorescent Timer Maturation

This module visualizes the spatiotemporal maturation of Fast-FT proteins within the "Grape Yeast" cluster, integrating the core ordinary differential equation framework for mRNA transcription, protein synthesis, and state transitions (C → B → I → R). It simulates fluorescence color shifts over cell cycles. Users interact via OrbitControls for 3D navigation, clicking cells to inspect maturation states (e.g. r-ratio). The tool supports speed adjustments (1× default) and displays real-time stats like average maturation stage and total cells, facilitating optimization of promoter pulses and inheritance fractions for lineage tracing in multicellular contexts.

Figure 11. Animation demo of 3D Fluorescent Timer Maturation

To set up and explore these visual tools locally, please follow the step-by-step guide in the README documentation to deploy the program on your own computer.

AI-Aided Validation of Model Predictions

To further validate the robustness and generalizability of our model, we leveraged the reasoning capabilities of two state-of-the-art Chinese large language models, DeepSeek and Qwen, both of which have demonstrated strong, comprehensive capabilities. These two AI models were independently prompted with specific design parameters for fluorescent timers (FT) optimized for tracking yeast cell lineage and age. The prompts were structured to guide the models toward biological first principles, specifically targeting the selection of FT variants, promoter strategies, pulse characteristics, and expression levels in the context of the yeast cell cycle.

Remarkably, the answers provided by both AI models converged with the key conclusions drawn from our mathematical model, reinforcing the credibility of our findings. The models independently highlighted the following optimal choices:

  1. Fast-FT as the preferred timer variant, aligning with our analysis of maturation times relative to the yeast cell cycle (~87 minutes).
  2. A periodic promoter strategy, exemplified by the Ash1_AIpro[1:1] promoter, which expresses during the late M phase and enables precise "timestamping" of daughter cells at division.
  3. The necessity for short pulse widths (≈10–15 min), which ensures optimal temporal resolution within a single cell cycle.
  4. A medium-strength promoter, which provides the best balance between signal intensity and biological burden, ensuring reliable signal detection without saturating the fluorescence detectors.

This alignment between our mathematical model and independent AI reasoning provides an additional layer of validation for our experimental design. The convergence of these independent pathways of hypothesis generation — one computational and one AI-based — not only supports the validity of our conclusions but also underscores the potential for AI to act as a powerful reasoning partner in hypothesis-driven research. [11]Notably, the use of models like DeepSeek and Qwen, with their deep knowledge and reasoning capabilities, enhances the persuasiveness of our model’s predictions, presenting a novel approach for integrating AI into experimental design in synthetic biology. However, this is not a one-size-fits-all AI solution, but one use case demonstrating its value in targeted validation.

The full AI conversation logs, including model reasoning and conclusions, are available in the supplementary materials and can be accessed via Code and Data Accessibility.

Conclusion

The model successfully predicted two key components: the promoter and the Fluorescent Timer. Guided by these predictions, we selected these core elements for our experimental design, as detailed on our Improved Parts page.

  • Promoter Selection: The model recommends using a pulsed, medium-intensity expressed promoter, providing a clear direction for experimental screening. Ultimately, we selected the ASH1_AIpro promoter[1:2], optimized through deep learning algorithms. This promoter is induced only during the late M phase, enabling pulsed expression and ensuring that mRNA is synthesized at the appropriate time, with precise localization of the mRNA in the daughter cell through the ASH1 3' UTR[12].
  • Fluorescent Timer Selection: The model suggested the use of Fast-FT, leading to our experimental choice of Fast-FT (modified mCherry). This fluorescent protein undergoes a time-dependent color shift: initially emitting blue fluorescence (excitation/emission peaks: 403/466 nm) and gradually transitioning to red (excitation/emission peaks: 583/606 nm), thereby enabling visual tracking of temporal progression.

In the actual wet-lab experiments, we assembled the TU Timer, which consists of: Ash1_AIpro, modified mCherry fluorescent protein, Ash1 3' UTR, and the ScENO1 terminator. After introducing the TU Timer into yeast cells, we analyzed the red and blue fluorescence intensity changes over time using fluoresence microscopy.

The successful coupling of model predictions with experimental validation demonstrates that our proposed AI-enhanced modeling framework significantly improves the predictability of designs, achieving "first-attempt success". This provides a reproducible new paradigm for synthetic biology, greatly optimizing the traditional Design-Build-Test-Learn cycle.

Improvement Log

To systematically improve our model, we adopted the Design–Build–Test–Learn (DBTL) cycle, a core methodology in synthetic biology. This iterative process allowed us to refine model parameters and mechanisms based on both literature and values obtained from our own wet-lab experiments. Below, we describe two rounds of DBTL cycles that led to the current robust model.

DBTL Round 1

Literature-Based Initial Model and Identification of Gaps

Objective: Develop a preliminary model using literature-derived parameters to simulate fluorescent timer (FT) dynamics in yeast.

  • Design (2025.06): The initial model was designed based on published data for Fast-FT proteins, including maturation kinetics from in vitro studies (e.g., blue peak time and red half-peak time at 25°C and 37°C). Parameters such as maturation time constants () were sourced from literature [3:5], and the model assumed a generic promoter expression pattern without cell-cycle specificity.

  • Build (2025.06): We implemented the model using ordinary differential equations (ODEs) to describe the irreversible maturation chain (C→B→I→R). Key parameters were set based on in vitro values, such as =12 min and =720 min, and the Ash1 promoter was initially erroneously modeled to express at the start of the cell cycle.

  • Test (2025.07): Simulation results revealed discrepancies with expected cellular behaviors. The in vitro-based maturation kinetics caused the fluorescent proteins to mature from blue to red far too rapidly. This led to a premature saturation of the red signal, where the r(t) ratio approached its maximum too quickly within a single cell cycle, influencing the time gradient needed for resolution. Consequently, over just one or two generations, all cells accumulated a similarly high red signal, making it impossible to distinguish young daughter cells from old mother cells and causing the timer to lose its core function.

  • Learn (2025.07): We identified that in vitro data did not account for cellular factors like translation delays, chaperone interactions, and metabolic context. This highlighted the need for intracellular-specific parameters and better alignment with yeast physiology. Additionally, the promoter expression timing and protein inheritance logic required biological validation from our wet-lab experiments.

DBTL Round 2

Integration of Wet-Lab Data and Model Validation

Objective: To redesign the model using an AI-augmented framework that leverages biological first principles and the learnings from Round 1, with the goal of validating both the model's predictive power and the reliability of this research paradigm through data from wet-lab experiments.

  • Design (2025.07-08): Informed by the failures of Round 1, we initiated a redesigned DBTL cycle centered on computational prediction. We employed two independent pathways to converge on optimal parameters:

    • Mechanistic Modeling: We refined the model structure to correct the identified biological inaccuracies. This included setting the Ash1 promoter pulse to occur near the end of the cell cycle and revising the protein inheritance logic to allow for near-complete transfer of immature protein to the daughter cell.

    • AI-Assisted Reasoning: We prompted two large language models (DeepSeek and Qwen) with the core design problem—optimizing a fluorescent timer for yeast lineage tracking—guiding them with biological first principles but without providing our model's interim results. This served as an independent validation of our design logic.

  • Build (2025.08): Based on Round 1 DBTL learnings, we rebuilt the model to incorporate intracellular parameters from logical calculations and these data are later supported by our wet-lab experiments. This included:

    • Using temperature-dependent maturation kinetics derived from Q10 calculations to adjust Fast-FT[3:6] times for 30°C.
    • Adjusting the Ash1_AIpro[1:3] promoter to express during the late M phase in our model to match biological evidence.
    • Revising protein inheritance logic to allow immature C-state proteins to be almost fully transferred to daughter cells, and mature in the daughter cells produced after a cell division[13][12:1].
  • Test (2025.08–10): The predictions of the refined model were subsequently tested through wet-lab experiments. The results confirmed the computational forecasts:

    • The experimentally observed blue-to-red transition timeline closely matched the model's prediction, providing a clear r(t) gradient.
    • The chosen pulse width and promoter strength yielded a strong and distinguishable signal, enabling precise lineage tracking over multiple generations.
    • The corrected inheritance logic was validated, as daughter cells showed the expected timing signal that was distinct from the mother cell's baseline.
  • Learn (2025.10): The convergence between our AI-augmented model predictions and the experimental outcomes demonstrated the power of "Design" phase. This approach minimized the traditional DBTL iterations, as the parameters defined in silico proved to be functionally accurate in vivo. It validated that integrating mechanistic modeling with AI reasoning can dramatically increase pre-experimental confidence and serve as a blueprint for first-attempt success in synthetic biology.

How to Use our Model?

This model provides a complete theoretical framework and computational implementation for simulating the dynamic behavior of fluorescent timer proteins (FT) in yeast cells. Through systematic parameter scanning and virtual experiments, users can optimize experimental designs, verify hypotheses, and predict potential outcomes before conducting actual wet-lab experiments. Below are the detailed usage guidelines:

1. Basic Simulation

Single-cell Dynamics Analysis

Use Case: Understand the fundamental behavior of the timer within a single cell cycle and validate the core dynamics of the model.

Single-cell simulations form the foundation for understanding FT behavior. By observing the full cycle of a single cell from birth to division, users can verify whether the maturation chain (C→B→I→R) behaves as expected, check the monotonicity of the red-to-blue ratio , and assess whether the time evolution of each protein state aligns with biological intuition. This level of analysis is especially useful for users encountering the model for the first time, helping them develop an intuitive understanding of the system behavior.

When analyzing the results, focus on the following aspects:

  • The curve should increase monotonically, reflecting the irreversible conversion of the protein from blue to red. Any non-monotonic behavior may indicate issues with parameter settings.
  • should peak early and then decay, while should show a delayed rise, a crucial feature for the proper functioning of the timer.
  • The system should reach a stable state within a reasonable time scale, typically aligned with the cell cycle duration.
python
# Basic parameter setup
p = Params()
p.pulse_width_min = 15.0    # Pulse width in minutes
p.pulse_amp = 1.0           # Promoter strength

# Run simulation
t, out = simulate_single_cell(p, include_decay=True)

# Extract key metrics
r_ratio = out["r"]          # Red/blue ratio time series
B_blue = out["B"]           # Blue-state protein concentration  
R_red = out["R"]            # Red-state protein concentration

Output Interpretation:

  • The curve should exhibit a smooth, monotonic increase, reflecting the irreversible maturation process from blue to red. Any non-monotonic behavior suggests an issue with the parameters.
  • should peak early and then decline, while should show a delayed increase. This phase relationship is critical for the proper functioning of the timer.
  • Ensure the curve reaches a stable state within a reasonable time scale; premature saturation may limit the effective time window for the timer.

2. Parameter Optimization

Pulse Width Screening Analysis

Use Case: Identify the optimal promoter expression window to achieve maximum temporal resolution and provide quantifiable guidelines for experimental design.

Pulse width is one of the most crucial parameters influencing timer performance. If the pulse width is too wide, new blue proteins are continuously synthesized, diluting the changes in the red-to-blue ratio. If the pulse width is too narrow, the signal may not be strong enough. By systematically scanning different pulse widths and calculating the corresponding values, users can find the optimal balance between signal strength and dynamic range.

This model uses a quantitative criterion based on noise levels: assuming typical imaging noise , the threshold for distinguishability is set at . This approach turns subjective design decisions into objective numerical standards, significantly improving the scientific rigor of experimental design.

python
# Scan different pulse widths
tau_values = [10, 15, 20, 25, 30]  # minutes
delta_r_results = []

for tau in tau_values:
    p.pulse_width_min = tau
    dr = delta_r_custom(p, t_start_min=24*60, t_end_min=48*60)
    delta_r_results.append(dr)
    
# Evaluate based on 3σ criterion (σ≈0.03)
threshold = 0.09  # 3σ threshold
optimal_tau = max([tau for tau, dr in zip(tau_values, delta_r_results) if dr > threshold])

Decision Criteria:

  • : Distinguishable (recommended), indicating that the timer provides a clear age signal.
  • : Borderline, indicating a need for further parameter optimization or acceptance of lower temporal resolution.
  • : Undistinguishable, avoid using these parameter settings.

3. Promoter Strategy Comparison

Periodic vs Constitutive Expression

Use Case: Evaluate the impact of different expression strategies on timer performance and provide theoretical justification for promoter selection.

The choice of expression strategy directly affects the core performance of the timer. Periodic promoters (such as Ash1) produce short bursts of expression at specific stages of the cell cycle, creating a clear "express-silence" alternating pattern that is beneficial for generating a sharp age gradient. In contrast, constitutive promoters express continuously throughout the cell cycle, producing new blue proteins constantly, which may "dilute" the changes in the red-to-blue ratio, reducing temporal resolution.

By comparing the values, lineage heatmap patterns, and intergenerational behaviors under different strategies, users can gain an intuitive understanding of whether periodic promoters are more suitable for timer applications. This comparative analysis is valuable because it not only shows "which strategy is better," but also explains "why it is better," making it particularly useful for educational purposes or for explaining design choices to collaborators unfamiliar with synthetic biology.

python
# Run comparative analysis (using provided script)
# python ft_compare_promoters.py

# Key observation metrics:
# - Δr: Dynamic range, quantitative indicator of temporal resolution
# - Lineage heatmap gradient: Visual representation of intra-generational time resolution
# - Baseline-corrected r': Pure signal after removing inheritance bias

Selection Recommendations:

  • Periodic Promoters (Ash1): Provide clear age signals and good intra-generational contrast, highly recommended for precise time-tracking applications.
  • Constitutive Promoters: Provide smooth signals with lower temporal resolution, typically used as a negative control or for validating the advantages of periodic expression.

4. Lineage Tracking Analysis

Multi-generational Cell Behavior Study

Use Case: Study the behavior of the timer across multiple generations, verify the inheritance logic, and assess the feasibility of long-term culture.

In real applications, cell timers often need to maintain functionality across multiple generations. Lineage analysis simulates protein inheritance and dilution during continuous cell division, revealing the timer's long-term stability. Baseline correction techniques (calculating ) are particularly important because they remove the inheritance bias from the mother cell's signal, leaving only the contribution from newly matured proteins in the current generation.

By analyzing the heatmap patterns across generations, users can assess the long-term stability of the timer, detect potential signal decay or accumulation effects, and provide guidance for experimental imaging time windows. For example, if the model shows a significant signal decay after the third generation, experimental design should focus on observing earlier generations.

python
# Generate multi-generation data
mat, trow = lineage_matrix(p, generations=6, include_decay=True)

# Baseline correction (remove inheritance bias)
mat_corrected = mat - mat[:, 0:1]

# Analyze inter-generational patterns
generation_trends = np.mean(mat_corrected, axis=1)  # Average signal intensity per generation

Analysis Focus:

  • Check monotonicity within each generation to ensure age gradients are clear within a single cell cycle.
  • Observe signal decay or enhancement patterns between generations to evaluate the feasibility of long-term use.

5. Environmental Parameter Adjustment and Condition Optimization

Use Case: Adapt the model to different growth conditions or optimize experimental settings, evaluating the influence of environmental factors on timer performance.

Biological systems are highly sensitive to environmental conditions, such as temperature and pH, which can affect protein maturation dynamics. This model incorporates the impact of temperature using the Arrhenius equation with the Q10 temperature coefficient, allowing users to predict timer behavior under different cultivation temperatures. This capability is especially valuable because actual experimental conditions may differ from the standard conditions reported in the literature.

6. Advanced Customization and Function Extension

Use Case: Modify the model structure to meet specific needs, add new features, or integrate with other systems.

For advanced users with specific requirements, the modular design of the model supports a variety of custom modifications. Users can modify the maturation pathway (e.g., skipping the intermediate state I), add new regulatory modules (such as optogenetic or chemical induction systems), or integrate other cellular process models.

This flexibility makes the model not only a specific FT simulator but also a versatile platform for modeling various cellular processes. For example, users could incorporate the effects of cellular metabolic state on protein expression or integrate cell fate decision models to study timer behavior during differentiation.

python
# Modify maturation pathway (e.g., skip intermediate state)
def custom_maturation_chain(t, y, p):
    m, C, B, R = y  # Remove I state
    # Custom kinetic equations...
    return np.array([dm, dC, dB, dR])

# Add new regulation modules
def additional_regulation(t, y, p):
    # Implement additional regulation like optogenetics or chemical induction
    regulation_factor = calculate_regulation(t)
    return regulation_factor

The entire model is built with a modular design, ensuring good readability and extensibility. Users can flexibly adjust parameters or modify functional modules according to their specific needs. It is recommended to conduct a systematic virtual screening before major experimental investments, as this can save valuable experimental resources and time and significantly improve research efficiency by "failing on silicon" rather than in the laboratory.

7. Interactive 3D Visualization Exploration

Use Case: Visually explore the 3D growth dynamics of multicellular yeast and the spatiotemporal maturation of fluorescent timer proteins, providing an intuitive understanding of system behavior and facilitating hypothesis generation.

The YeastVerse platform offers two interactive 3D visualization tools that allow users to simulate and observe key aspects of the multicellular yeast chassis and fluorescent timer dynamics in real-time. These tools are built using React.js and Three.js, and they help bridge the gap between abstract mathematical models and biological intuition by providing visual, spatially explicit representations.

Example Workflow:

  • For 3D Yeast Growth Simulation:

    • Start the simulation and observe how the yeast cluster grows from a single cell to a multicellular structure.
    • Note the branching patterns and cell arrangements in "Grape" vs. "normal" yeast, which can inform the design of the multicellular chassis for drug resistance evolution studies.
    • Key growth parameters are user-adjustable, allowing for the customization of simulations to explore morphological outcomes under different hypothetical conditions.
  • For 3D Fluorescent Timer Maturation:

    • Run the maturation simulation and watch the color changes in the population over time.
    • Click on cells at different positions in the cluster to compare their maturation states, which can help validate the inheritance and maturation logic in a spatial context.
    • The underlying maturation kinetics and promoter parameters can be modified, enabling researchers to tailor the simulation for different fluorescent protein variants or experimental scenarios.

For detailed setup instructions and technical requirements, please refer to the YeastVerse README. The visualization platform runs directly in modern web browsers, making it accessible to researchers without specialized computational resources.

Code and Data Accessibility

Periodic Promoter vs Constitutive Promoter Part: ft_compare_promoters.py

Strong Promoter vs Weak Promoter Part: df_strength.py

Result Part: FT.py

AI-Aided Validation: DeepSeek Conversation JSON, Qwen Conversation JSON

3D Visualization: YeastVerse

References


  1. Kari, H., Bandi, S. M. S., Kumar, A., & Yella, V. R. (2023). DeePromClass: Delineator for eukaryotic core promoters employing deep neural networks. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 20(1), 802–807. DOI: 10.1109/TCBB.2022.3163418 ↩︎ ↩︎ ↩︎ ↩︎

  2. Di Talia, S., Skotheim, J. M., Bean, J. M., Siggia, E. D., & Cross, F. R. (2007). The effects of molecular noise and size control on variability in the budding yeast cell cycle. Nature, 448(7156), 947–951. DOI: 10.1038/nature06072 ↩︎ ↩︎

  3. Subach, F. V., Subach, O. M., Gundorov, I. S., Morozova, K. S., Piatkevich, K. D., Cuervo, A. M., & Verkhusha, V. V. (2009). Monomeric fluorescent timers that change color from blue to red report on cellular trafficking. Nature Chemical Biology, 5(2), 118–126. DOI: 10.1038/nchembio.138 ↩︎ ↩︎ ↩︎ ↩︎ ↩︎ ↩︎ ↩︎

  4. Alalam, H., Zepeda-Martínez, J. A., & Sunnerhagen, P. (2022). Global SLAM-seq for accurate mRNA decay determination and identification of NMD targets. RNA, 28(6), 905-915. DOI: 10.1261/rna.079077.121 ↩︎

  5. Arrhenius, S. (1889). Über die Reaktionsgeschwindigkeit bei der Inversion von Rohrzucker durch Säuren. Zeitschrift für physikalische Chemie, 4(1), 226-248. ↩︎

  6. Kutta, W. (1901). Beitrag zur näherungsweisen Integration totaler Differentialgleichungen. Teubner. ↩︎

  7. Butcher, J. C. (2000). Numerical methods for ordinary differential equations in the 20th century. Journal of Computational and Applied Mathematics, 125(1-2), 1-29. DOI: 10.1016/S0377-0427(00)00455-6 ↩︎

  8. Lee, M. E., DeLoache, W. C., Cervantes, B., & Dueber, J. E. (2015). A highly characterized yeast toolkit for modular, multipart assembly. ACS Synthetic Biology, 4, 975–986. DOI: 10.1021/sb500366v ↩︎

  9. Ratcliff, W., Fankhauser, J., Rogers, D. et al. (2015). Origins of multicellular evolvability in snowflake yeast. Nat Commun, 6, 6102. DOI: 10.1038/ncomms7102 ↩︎ ↩︎

  10. Fukuda, N. (2023). Apparent diameter and cell density of yeast strains with different ploidy. Sci Rep 13, 1513. DOI: 10.1038/s41598-023-28800-z ↩︎

  11. Penadés, J. R., Gottweis, J., He, L., Patkowski, J. B., Daryin, A., Weng, W.-H., Tu, T., Palepu, A., Myaskovsky, A., Pawlosky, A., Natarajan, V., Karthikesalingam, A., & Costa, T. R. D. (2025). AI mirrors experimental science to uncover a mechanism of gene transfer crucial to bacterial evolution. Cell, 188(5), 1–12. DOI: 10.1016/j.cell.2025.08.018 ↩︎

  12. Bertrand, E., Chartrand, P., Schaefer, M., Shenoy, S. M., Singer, R. H., & Long, R. M. (1998). Localization of ASH1 mRNA particles in living yeast. Molecular Cell, 2(4), 437–445. DOI: 10.1016/S1097-2765(00)80143-4 ↩︎ ↩︎

  13. Brodsky, A. S., & Silver, P. A. (2000). Pre-mRNA processing factors are required for nuclear export. RNA (New York, N.Y.), 6(12), 1737–1749. DOI: 10.1017/s1355838200001059 ↩︎