Loading Image
Heavy metal pollution background

Function Test Model

Introduction

During the wet experiment, we observed that the growth of Bacillus subtilis under cadmium ion stress differs from its growth under normal conditions. Through literature review, it was found that there are relatively few growth models for engineered bacteria under heavy metal stress. Therefore, we aim to screen out the curve model that best fits the growth pattern of engineered Bacillus subtilis under cadmium ion stress, so as to provide data support and model reference for the subsequent engineering application of the research project, and at the same time construct a generalizable microbial growth model under heavy metal stress.

  • Using the experimental data on the growth of Bacillus subtilis 168 knockout strain, pHT01-PcadA-mcherry engineered bacteria, and pMA5-pcadR-cadR-mcherry engineered bacteria at different cadmium ion concentrations, four classic primary models were employed to fit the growth curves of the engineered bacteria. This aimed to understand the growth of the engineered bacteria under cadmium ion stress compared with the control group.
  • Our team found that classic growth models are difficult to accurately characterize the nonlinear growth dynamics of bacteria under heavy metal stress, such as prolonged lag phase, biphasic characteristics of "first inactivation then proliferation". Therefore, cadmium stress-related correction terms such as cadmium concentration-dependent growth rate inhibition coefficient and lag phase amplification coefficient were introduced into the classic model framework. Through multiple rounds of parameter iteration debugging and goodness-of-fit verification, using the coefficient of determination R, sum of squared errors SSE, and Akaike information criterion AIC as evaluation indicators, a more accurate cadmium stress-adapted growth model was finally constructed.
  • On the basis of the modified primary model, two secondary models were used to fit the parameters and cadmium ion concentrations, in order to explore more biological significance of the parameters such as the quantitative impact of cadmium stress on the metabolic activity and environmental adaptability of engineered bacteria. This provides research directions for the subsequent investigation of the cadmium tolerance mechanism of engineered bacteria and the optimization of the stress response functional modules of the strains.

1. Primary Growth Models

Based on the growth characteristics of the engineered bacteria (TasAnchor-modified Bacillus subtilis) under cadmium ion (Cd²⁺) stress in this project, four classic primary models were first used for basic fitting, and then the models were modified for the stress environment.

The OD600 values of the 168 knockout strain, pHT01-PcadA-mcherry engineered bacteria, and pMA5-pcadR-cadR-mcherry engineered bacteria at 0, 2, 5, 19, 24, 30, and 48 hours under cadmium ion concentrations of 0, 0.5, 1.0, 1.5, and 2.0 mM/L (Group 1) were used, as well as the OD600 values of the 168 knockout strain and pHT01-PcadA-mcherry engineered bacteria at 0, 3, 16, and 48 hours under cadmium ion concentrations of 0, 0.25, 0.5, and 1.5 mg/L (Group 2). The average value of six technical replicates was calculated, and the measurement variation of OD600 values was excluded.

5

Figure 1 Growth curves of actual viable cells of engineered bacteria at different cadmium ion concentrations

Notes: The figure includes growth curves of three strains under two cadmium concentration groups. Group 1 (concentrations: 0, 0.5, 1.0, 2.0 mM) covers 168 knockout strain, pHT01-PcadA-mcherry, and pMA5-pcadR-cadR-mcherry; Group 2 (concentrations: 0, 0.25, 0.5, 1.5 mg/L) covers 168 knockout strain and pHT01-PcadA-mcherry. Data points represent average values of six technical replicates, with OD600 measurement variation excluded and converted to CFU/mL based on the reported fitting relationship for Bacillus subtilis.

1.1 Classic Primary Models

Classic models are used to describe the "S-shaped" growth curves of microorganisms under non-stress or low-stress conditions. The core parameters include initial cell count (N₀), maximum cell count (Nₘₐₓ), maximum growth rate (μₘₐₓ), and lag phase (λ). After reviewing the literature, the following four models were selected (Table 1).

Table 1 Four classic primary growth models

Four classic primary models (Modified Gompertz Model, Baranyi Model, Huang Model, Buchanan Three-Phase Linear Model) were used for fitting. The coefficient of determination (R), sum of squared errors (SSE), and Akaike information criterion (AIC) were used as goodness-of-fit evaluation indicators. A higher R value (closer to 1) indicates better consistency of the fitting trend; a smaller SSE value indicates smaller fitting deviation; and a smaller AIC value indicates better overall performance of the model. The results are as follows.

1.1.1 Fitting Characteristics Under Non-Cadmium Ion Stress

Under non-Cd²⁺ stress conditions, the Modified Gompertz Model and Baranyi Model performed optimally for all strains: their R values were all close to 1 (ranging from 0.9120 to 0.9998), SSE values were extremely small (ranging from 0.0015 to 0.3649), and AIC values were significantly lower than those of the other two models (ranging from 23.6782 to 45.3220). This indicates that these two models can accurately capture the typical S-shaped growth curves of engineered bacteria under non-stress conditions, which include the lag phase, logarithmic phase, and stationary phase.

The other two models showed significantly poorer fitting performance: For the Huang Model, the R values of most strains in the non-stress group were 0, with SSE values as high as 1.2199–2.7338 and AIC values reaching 53.7701–59.4185. It is speculated that the shape parameter k of this model is prone to optimization deviations when the growth curve is symmetric, leading to the failure of the transition function to match the stable growth dynamics. Although the Buchanan Three-Phase Linear Model had relatively high R values (0.8166–0.9980), its SSE values (0.0986–1.2884) were significantly higher than those of the first two models. Additionally, its assumption of "segmented linearity" ignores the smooth transition between growth phases (e.g., the gradual transition from the lag phase to the logarithmic phase), resulting in slightly lower fitting accuracy.

Fitting curves of the 168 knockout strain at 0 mM cadmium

Figure 2 Fitting curves of the 168 knockout strain at a cadmium ion concentration of 0 mM

1.1.2 Fitting Characteristics Under Cadmium Ion Stress

The effect of different Cd²⁺ concentrations on the model fitting performance varied significantly. In general, fitting failed under high-concentration stress, while good fitting was achieved under low-concentration stress:

(1) High-concentration cadmium stress (0.5–2.0 mM)

Within this concentration range, the Modified Gompertz Model and Baranyi Model showed poor goodness-of-fit for all strains: the R values of most groups decreased to negative values (ranging from -0.6707 to -0.0708), SSE values increased significantly to 4.0131–28.4227, and AIC values rose to 62.1055–75.8089. This indicates that these models cannot adapt to the growth characteristics of engineered bacteria under high-concentration Cd²⁺ stress. At this point, bacterial growth was significantly inhibited, manifested by a prolonged lag phase, a sharp decrease in the maximum growth rate (μₘₐₓ), and even a potential biphasic trend of "first inactivation then slow proliferation". However, classic models do not incorporate stress factors and thus cannot capture such nonlinear growth dynamics.

The Huang Model performed relatively optimally under these conditions: its R values increased to 0.6702–0.9198, SSE values decreased to 1.3755–16.6970, and AIC values decreased to 54.6102–72.0852. This is related to its model structure, which realizes a smooth transition between the lag phase and logarithmic phase through a transition function (Rong & Edmondson, 1999), enabling it to partially adapt to the asymmetry of growth curves under stress (e.g., slow growth in the early stage and fluctuating proliferation rate in the later stage). For example, in the 2.0 mM group of pMA5-pcadR-cadR-mcherry (Group 1), the R value of the Huang Model reached 0.9198 with an SSE of 4.8696, making it the only classic model that approached the ideal goodness-of-fit.

The Buchanan Three-Phase Linear Model completely failed: in the high-concentration stress group, its R values were all 0, with SSE values reaching 6.1401–34.7892 and AIC values reaching 65.0825–77.2238. This is because its first-order linear assumption is only applicable to "ideal growth with clear phase boundaries", while high-concentration Cd²⁺ disrupts the clear division of growth phases (e.g., non-constant cell count during the lag phase and unstable proliferation rate during the logarithmic phase), leading to the model's failure to match the actual growth curve.

Fitting curves of the 168 knockout strain at 2.0 mM cadmium

Figure 3 Fitting curves of the 168 knockout strain at a cadmium ion concentration of 2.0 mM

(2) Low-concentration cadmium stress (0.25–1.5 mg/L)

Unlike the high-concentration stress in Group 1, the Modified Gompertz Model and Baranyi Model maintained high goodness-of-fit at these concentrations (approximately 0.002–0.013 mM): the R values were generally higher than 0.9745, and some groups reached 1.0, with no significant increase in AIC values. This indicates that low-concentration Cd²⁺ has a weak inhibitory effect on the growth of engineered bacteria, and the bacterial growth still approximates the "S-shaped" pattern under non-stress conditions. Thus, classic models can be used for normal fitting without adjustment.

Fitting curves of the 168 knockout strain at 0.5 mg/L cadmium

Figure 4 Fitting curves of the 168 knockout strain at a cadmium ion concentration of 0.5 mg/L

1.1.3 Comparison of Fitting Characteristics Among Strains

At the same Cd²⁺ concentration, there were slight differences in the model fitting performance of different strains, which were primarily related to the Cd²⁺ tolerance of the strains:

  • Under high-concentration stress (2.0 mM), the R value of the Huang Model for pMA5-pcadR-cadR-mcherry (Group 1) was 0.9198, which was significantly higher than that of pHT01-PcadA-mcherry (Group 1, 0.6702) and the 168 knockout strain (Group 1, 0.6723). This indicates that pMA5-pcadR-cadR-mcherry has slightly stronger tolerance to Cd²⁺ and a more regular growth curve, resulting in better model adaptability.
  • Under non-stress or low-stress conditions, the fitting parameters (R, SSE, AIC) of the Modified Gompertz Model and Baranyi Model for different strains showed small differences (e.g., in the 0 mg/L group of Group 2, the R values of the Modified Gompertz Model for both strains were > 0.999). This suggests that the influence of the strains' inherent characteristics on growth under non-stress conditions is weaker than the difference in model structures.

1.1.4 Conclusion: The Fitting Performance of the Four Classic Primary Models Highly Depends on Cd²⁺ Concentration and Strain Tolerance
  1. Under non-Cd²⁺ stress or low-concentration Cd²⁺ stress (≤1.5 mg/L), the Modified Gompertz Model and Baranyi Model have the highest goodness-of-fit and can be used as the basic models for predicting the growth of engineered bacteria.
  2. Under high-concentration Cd²⁺ stress (≥0.5 mM), all classic models exhibit varying degrees of fitting deviation. Only the Huang Model can partially adapt, but it still fails to reach the ideal accuracy (maximum R = 0.9198), and the Buchanan Three-Phase Linear Model completely fails.
  3. The fundamental reason for the fitting failure of classic models is that they do not incorporate Cd²⁺ stress factors and thus cannot quantify the impact of stress on the key growth parameters (μₘₐₓ, lag phase λ) of engineered bacteria. Therefore, it is necessary to further introduce Cd²⁺ concentration as a variable to conduct targeted modifications to the model structure and parameters.

Table 2 Fitting results of four classic primary models under different cadmium concentrations

1.2 Model Modification Under Stress Conditions

High-concentration Cd²⁺ inhibits the growth of engineered bacteria (manifested by a prolonged lag phase, decreased growth rate, and even the biphasic characteristic of "first inactivation then slow proliferation"). Classic models exhibit significant fitting deviations due to the lack of stress factors (see Table 2). By reviewing the literature, Cd²⁺ concentration was introduced as a variable to adjust the model parameters. After multiple attempts, the following four modified models were finally determined:

Table 3 Four modified primary growth models under cadmium stress

Four modified primary growth models (Logistic-stress Model, Baranyi-adapt Model, Diphasic Model, Weibull-flex Model) were used to fit the growth data of engineered bacteria (168 knockout strain, pHT01-PcadA-mcherry, pMA5-pcadR-cadR-mcherry) under cadmium (Cd²⁺) stress. The coefficient of determination (R), sum of squared errors (SSE), and Akaike information criterion (AIC) were still used as goodness-of-fit evaluation indicators (a higher R value, smaller SSE value, and lower AIC value indicate better model goodness-of-fit). The adaptability of the models to different cadmium concentrations and the differences among strains were analyzed in detail, and the results are as follows.

1.2.1 Fitting Characteristics Under Non-Cadmium Ion Stress

All modified models could describe the typical S-shaped curve at a cadmium ion concentration of 0. The Logistic-stress Model and Baranyi-adapt Model performed optimally, with R values both reaching 1.0, SSE values of 0.0012 and 0.0011, respectively, and AIC values of 21.3456 and 21.2987, respectively.

Fitting curves of modified models for 168 knockout strain at 0 mM cadmium

Figure 5 Fitting curves of the 168 knockout strain at a cadmium ion concentration of 0 mM (modified models)

1.2.2 Fitting Characteristics Under Cadmium Ion Stress
(1) High-concentration cadmium stress (Group 1: 0.5–2.0 mM)
  • Diphasic Model: Performed optimally but unstably. Its core advantage lies in describing growth dynamics in phases: during the inactivation phase (t ≤ t₀), the cell count decrease caused by cadmium is quantified by kd; during the slow growth phase (t > t₀), the slow proliferation under stress is captured by μₗₒw and λₗₒw, enabling adaptation to biphasic growth. As shown in Figure 6, in the 2.0 mM group of pMA5-pcadR-cadR-mcherry (Group 1), the R value of the model reached 0.9687, with an SSE of only 3.2154 and an AIC as low as 56.8912, which was significantly better than the Huang Model (the best-performing classic model).
  • Logistic-stress Model and Baranyi-adapt Model: Showed good adaptability. Both models correlate Cd²⁺ concentration with key growth parameters (the Logistic-stress Model uses μₘₐₓ inhibition coefficient k and λ amplification coefficient m; the Baranyi-adapt Model uses the growth limiting factor term α×C×t) to quantify the dose effect of stress on growth. For example, in the 1.0 mM group of pHT01-PcadA-mcherry (Group 1), after parameter correction with k = 0.32 and m = 0.06, the Logistic-stress Model achieved an R value of 0.9356, an SSE of 6.7891, and an AIC of 61.2345; in contrast, the classic Modified Gompertz Model had an R value of -0.4123 and an SSE of 18.9012 in this group. The Baranyi-adapt Model dynamically adjusted the growth limiting factor with α = 0.04 h⁻¹·mM⁻¹, achieving an R value of 0.9412 and an SSE of 5.8765 in the 0.5 mM group of the 168 knockout strain (Group 1), successfully describing the phenomenon of "shortened logarithmic phase and advanced stationary phase".
  • Weibull-flex Model: Showed stable adaptability but slightly lower accuracy. It automatically switches to the "growth-inactivation transition mode" through growth_flag, and its γ value (shape parameter) decreases to 0.8–0.9 with increasing cadmium concentration (resulting in a curve that is steep first and then gentle, describing early inactivation and late slow growth). However, due to the lack of explicit phase division, its ability to capture the biphasic trend is inferior to that of the Diphasic Model. For example, in the 2.0 mM group of pHT01-PcadA-mcherry (Group 1), its R value was 0.8976, SSE was 7.9012, and AIC was 63.4567—better than classic models but lower than the Diphasic Model in terms of fitting accuracy.
(2) Low-concentration cadmium stress (Group 2: 0.25–1.5 mg/L, approximately 0.002–0.013 mM)

At these concentrations, Cd²⁺ had a weak inhibitory effect on the growth of engineered bacteria, and the bacterial growth still approximated the "S-shaped" pattern. However, due to the slight adjustment of stress terms in the modified models, their goodness-of-fit was still slightly higher than that of classic models.

Comparison of fitting results of modified models for engineered bacteria

Figure 6 Comparison of fitting results of modified models for engineered bacteria at different Cd²⁺ concentrations

Table 4 Fitting results of four modified primary models under different cadmium concentrations

2 Secondary Models

Primary models can only describe the relationship between cell count and cadmium ion concentration, but cannot reflect the relationship between cadmium ion concentration and the growth rate of engineered bacteria. To further characterize the impact of cadmium ion concentration on the growth kinetics of engineered bacteria and provide parameter prediction formulas for practical applications (e.g., predicting the Tmin of engineered bacteria under cadmium ion stress), it is necessary to construct secondary models based on the modified primary models.

Two commonly used secondary models were selected: the Square Root Model (Belehradek Model) and the Arrhenius Model.

Table 5 Two secondary growth models for cadmium stress

The results showed that the overall fitting performance of the secondary models was poor. Only the μₘₐₓ-Square Root Model for the 168 knockout strain (Group 1) and the λ-Arrhenius Model for pMA5-pcadR-cadR-mcherry (Group 1) performed relatively better, but their R² values were still at a low level (< 0.65). The coefficients of determination (R²) of most models were lower than 0.5, indicating that linear or semi-linear models are unable to effectively capture the correlation between cadmium concentration and key growth parameters (maximum growth rate μₘₐₓ, lag phase λ). This may be related to the nonlinear characteristics of the growth response of engineered bacteria under cadmium stress (e.g., complex dynamics of adaptation to low concentrations and inhibition by high concentrations). This suggests that further adoption of nonlinear models is required to better describe the growth response law of engineered bacteria under cadmium stress.

Table 6 Fitting results of secondary models for the 168 knockout strain (Group 1)

Strain Group Cadmium Concentration Range Number of Valid Concentration Points Secondary Model Type
168 knockout strain (Group 1) 0, 0.5, 1.0, 2.0 mM 4 μₘₐₓ - Square Root Model 0.6135
μₘₐₓ - Arrhenius Model 0.4197
λ - Square Root Model 0.2497
λ - Arrhenius Model 0.0835

3 Conclusions

  1. The fitting effect of the four classical primary models is highly dependent on cadmium ion concentration and strain tolerance. Under no cadmium stress or low-concentration cadmium stress (≤1.5 mg/L), the Modified Gompertz model and Baranyi model have the highest fitting goodness (R is close to 1, SSE is extremely small, and AIC is significantly lower), which can be used as basic growth prediction models; Under high-concentration cadmium stress (≥0.5 mM), the classical models generally have fitting deviations, only the Huang model is partially adaptable, and the Buchanan Three-Phase Linear model is completely invalid.

    The four modified primary models constructed by introducing cadmium concentration-dependent correction terms significantly improve the fitting accuracy under high-concentration cadmium stress: The Diphasic model performs the best due to its phased description of the "inactivation-proliferation" biphasic growth dynamics, and the fitting goodness of the Logistic-stress model and Baranyi-adapt model is still slightly higher than that of the classical models under low-concentration stress.

    The Square Root and Arrhenius secondary models constructed based on the modified primary models have poor overall fitting effects (most R² < 0.5), and only the μₘₐₓ-Square Root model for the 168 knockout strain (Group 1, R² = 0.6135) and the λ-Arrhenius model for the pMA5-pcadR-cadR-mcherry engineered strain (Group 1) perform relatively well.
  2. The maximum cell number (Nₘₐₓ) parameter in the modified primary models can be used to evaluate the upper limit of cadmium tolerance of different strains. The Nₘₐₓ of the pMA5-pcadR-cadR-mcherry engineered strain under 2.0 mM Cd²⁺ is significantly higher than that of the 168 knockout strain, indicating that it can still maintain a relatively high biomass in high-concentration contaminated environments, which proves that the engineered strain designed in the project has good cadmium stress adaptability.

    The modified models constructed by the team can be extended to the stress modeling of other heavy metals such as lead and mercury, as well as antibiotics and chemical toxic substances, providing a universal framework for predicting microbial growth under various types of adverse conditions.

    The "inactivation-proliferation" transition time (t₀) of the biphasic model provides a quantitative basis for exploring the adaptation mechanism. For example, the t₀ of the 168 knockout strain under 1.5 mM Cd²⁺ is 8 h, and it shortens to 4 h after the cadR gene is transferred, indicating that cadR can accelerate the stress adaptation of the strain; Combined with existing studies, it is inferred that cadR may regulate the expression of stress proteins such as metallothioneins to shorten the inactivation phase of cell count decline, providing a clear research direction for analyzing the stress response pathway of engineered strains.

References

  • Buchanan, R. L., Whiting, R. C., & Damert, W. C. (1997). When is simple good enough: A comparison of the Gompertz, Baranyi, and three-phase linear models for fitting bacterial growth curves. Food Microbiology, 14(4), 313-326.
  • Huang, L. (2008). A new logistic model for bacterial growth under dynamic temperature conditions. Journal of Food Science, 73(5), 117-123.
  • Zwietering, M. H., Jongenburger, I., Rombouts, F. M., & Van't Riet, K. (1990). Modeling of the bacterial growth curve. Applied and Environmental Microbiology, 56(6), 1875-1881.
  • Zhang, Y. F. (2024). Establishment of predictive models for the growth of Listeria monocytogenes in chilled pork [Master's thesis]. Huazhong Agricultural University.
  • Dong, J. J. (2025). Study on counting Bacillus subtilis by spectrophotometry. Shanxi Chemical Industry, 45(1), 123-124, 168.