Rotating Image 1 Rotating Image 2 Rotating Image 3 Rotating Image 4 Rotating Image 5 Rotating Image 6
loading . . .

1. Background

In the optogenetics design segment of the iRIS project, the blue light optogenetic system serves as the core module for regulating dye synthesis. The downstream enzymes EcTnaA and MaFMO, whose production it controls, play crucial roles in the continuous catalysis of tryptophan, 6-bromotryptophan, etc. Consequently, these two enzymes are significant in the synthesis of indigo, purple, and green pigments. Thus, verifying the proper functioning of the blue light optogenetic system is essential. Since the red and green light optogenetic systems share overlaps with the blue light system, modeling is not repeated for them.

Furthermore, the blue light optogenetic system employs a dual-plasmid expression system to achieve the functions of "light-sensitive regulation" and "pigment synthesis." As the blue light system is designed for constitutive expression under dark conditions and termination of expression upon blue light illumination, we introduced a PhlF inverter to reverse the response logic. PhlF, functioning as a repressor, effectively achieves this. Under dark conditions, the photosensory protein induces the expression of PhlF, which inhibits the downstream gene. Under light conditions, the inactivation of the photosensory protein leads to the cessation of PhlF expression, thereby lifting the inhibition. Therefore, its concentration change is directly related to the expression rate of the downstream gene. Additionally, the first complete plasmid construct also leads up to PhlF, which is why modeling prioritizes focusing on PhlF. Verifying the concentration change of PhlF can validate the rationality of the upstream plasmid design and construction.

Figure 1: Map of pCDFDuet-1-PJ23100-YF1-fixJ-PhlF-B0015

2. Design Logic

Based on relevant literature[1], we can summarize the following logical pathway:

Figure 2: The general logic of blue light induced system

First, the YF1 gene is transcribed and translated to produce monomeric YF1 protein:

Formula block 1

YF1 monomers then spontaneously dimerize to form the YF1 dark-state dimer, which can also dissociate back into monomers:

Formula block 2

The dark-state dimer binds ATP to become the phosphorylated dark-state dimer YF1-dark-P parameter:

Formula block 3

Blue light drives the conformational change of YF1-dark parameter to YF1-light parameter. YF1-light parameter can also be phosphorylated:

Formula block 4

Simultaneously, the FixJ gene is transcribed and translated to produce the FixJ protein:

Formula block 5

YF1-dark-P parameter can efficiently phosphorylate FixJ to FixJ~P, while YF1-light-P parameter only performs inefficient phosphorylation:

Formula block 6

However, YF1-light-P parameter acts as a potent phosphatase, and FixJ-P also undergoes spontaneous dephosphorylation:

Formula block 7

FixJ~P stimulates the downstream promoter Pfix-K2 to activate the PhlF gene, while FixJ lacks this function:

Formula block 8

3. Concentration Equations

Based on the sections above, the concentration equations can be listed as follows:

Formula block 9

4.Parameter Settings

We obtained the following parameter values by consulting relevant literature and making estimations:

5. Results and Discussion

After solving the equations above, the following result curves are obtained:

Figure 3: Result curves related to PhlF under constant conditions

It can be observed that under both constant dark and constant light conditions, the concentration of PhlF protein initially increases gently and then rises more steeply. However, the comparison shows that the PhlF concentration increases more rapidly under dark conditions, whereas the accumulation of PhlF under light conditions is considerably slower. From the final comparison graph, it is evident that the ratio of PhlF concentration in the dark to that in the light exceeds 1000 within less than ten minutes and continues to grow.

These results demonstrate that the blue light optogenetic system effectively achieves the basic regulatory logic of "darkness promotes PhlF expression, light inhibits PhlF expression." The regulatory effect strengthens over time, and the optogenetic "switch" characteristic is clear and efficient.

Figure 4: Result curves related to PhlF under switching conditions

These three graphs show the PhlF concentration curves after switching the condition to light following dark accumulation periods of 10min, 30min, and 60min, respectively. The trend shows that the PhlF concentration increases rapidly in the dark and decreases after light treatment. This fully aligns with the design intention of the blue light optogenetic system: high PhlF concentration in the dark inhibits downstream gene expression; low PhlF concentration under light allows substantial downstream gene expression. This proves that the blue light optogenetic system is an effective biological tool for precise, on-demand regulation of target protein expression.

Limited by the short iGEM season, we have not yet fully validated the effect of the blue light optogenetic system in wet lab experiments. Through modeling, we have confirmed the system's effectiveness and gained clearer insights into the timing and efficiency of its operation. Based on different dark treatment durations, subsequent wet lab work could involve appropriately extending dark treatment times to enhance PhlF accumulation, ensuring inhibitory effects during dark periods, and improving the accuracy of optogenetic patterning.

[1]Moglich, A., Ayers, R.A. & Moffat, K. Design and signaling mechanism of light-regulated histidine kinases. J Mol Biol 385, 1433-1444 (2009).

[2]Zhang, P., Yang, J., Cho, E. & Lu, Y. Bringing Light into Cell-Free Expression. ACS Synthetic Biology 9, 2144-2153 (2020).

[3]Berntsson, O. et al. Sequential conformational transitions and α-helical supercoiling regulate a sensor histidine kinase. Nat. Commun. 8 (2017).

[4]Mesa, S. et al. Comprehensive Assessment of the Regulons Controlled by the FixLJ-FixK2-FixK1Cascade inBradyrhizobium japonicum. J. Bacteriol. 190, 6568-6579 (2008).

[5]Abbas, A. et al. Characterization of Interactions between the Transcriptional Repressor PhlF and Its Binding Site at the phlA Promoter in Pseudomonas fluorescens F113. J. Bacteriol. 184, 3008-3016 (2002).

1. Project Background

In our experimental group's project design, we plan to use L-tryptophan as a substrate and employ TxtE—a cytochrome P450 enzyme reported to specifically nitrate the carbon at position 4 of L-tryptophan—to catalyze the production of 4-nitrotryptophan [1], which can subsequently be converted to green 4,4'-dinitroindigo [1]. However, TxtE requires electron supply during catalysis, and the natural redox partner for this P450 enzyme has not been reported in the literature.

Existing studies have utilized exogenous redox partners (e.g., spinach-derived ferredoxin and ferredoxin reductase) [1] to achieve electron transfer to TxtE in vitro. However, establishing a functional heterologous redox partner system for TxtE within Escherichia coli cells lacks clear literature guidance. Therefore, we extracted the coding sequences for ferredoxin (Fdx) and ferredoxin reductase (Fdr) from the gene cluster of Mycobacterium sp. HXN-1500 [2]. This native gene cluster includes a P450 enzyme, Fdx, and Fdr, and has been successfully heterologously expressed in Pseudomonas putida and E. coli [2]. We plan to co-express TxtE with these two redox partner genes in E. coli to attempt constructing a complete intracellular electron transfer chain.

As no literature currently confirms whether this heterologous system can assemble and function effectively in E. coli, and relevant wet-lab experiments have not yet progressed to this stage, we decided to employ computational modeling for theoretical validation of the system. Specifically, we focus on the interaction mechanism between the P450 enzyme and its redox partner (e.g., ferredoxin Fdx as a single-electron carrier).

We will adopt a protein-protein docking strategy, using the TxtE structure as the receptor and the redox partner protein as the ligand, performing docking simulations and molecular dynamics simulations using professional software. Through this modeling analysis, we hope to preliminarily verify, from a structural perspective, whether the designed heterologous redox system can dock stably, thereby inferring its potential to achieve electron transfer.

2. Protein Structure Acquisition

For the TxtE protein, we found its structure in the RCSB PDB and downloaded the PDB file named 5D3U. As it is a dimeric structure, we selected the first protein chain, which is the required TxtE protein, and used ChimeraX to create a dynamic representation:

Dynamic representation of the TxtE protein structure

Figure 1: Dynamic representation of the TxtE protein structure

For the Fdx protein, we did not find literature reporting its structure. Therefore, we used AlphaFold3 [3] for modeling, with the result shown below:

AlphaFold3 predicted structure of the Fdx protein

Figure 2: AlphaFold3 predicted structure of the Fdx protein

3. Protein-Protein Docking

3.1 Protein-Protein Docking Method

To investigate the binding region and interaction mode between the TxtE and Fdx proteins, this study employed the HDOCK program for protein docking experiments. HDOCK is a professional protein docking tool developed by Professor Huang Shengyou's team at Huazhong University of Science and Technology [4]. After completing the docking calculation, we selected the structure with the best docking score as the standard result for subsequent protein-protein interaction analysis.

The HDOCK docking score is calculated based on the ITScorePP or ITScorePR iterative scoring function. The core principle of this scoring system is: a more negative docking score indicates a higher probability of binding for the corresponding model and stronger interactions between the proteins.

To further quantify the binding potential of the two proteins, HDOCK provides a "confidence score" formula based on the docking score, used to directly assess the likelihood of binding between the two molecules. The specific formula is as follows:

Confidence Score Formula

When the confidence score is above 0.7, the two molecules are highly likely to bind; when the confidence score is between 0.5 and 0.7, the molecules are likely to bind; when the score is below 0.5, the molecules are unlikely to bind (HDOCK Help).

3.2 Protein-Protein Docking Results

From the docking results, the best docking score obtained was -227.77, with a Confidence Score of 0.8257:

The 10 best docking scores from HDOCK

Figure 3: The 10 best docking scores from HDOCK

This result indicates that the docked complex model has high reliability. The optimal docked model of the TxtE-Fdx complex is shown below; we used ChimeraX to create a dynamic representation, with TxtE in green and Fdx in blue.

Optimal docked model of the TxtE-Fdx complex

Figure 4: Optimal docked model of the TxtE-Fdx complex

3.3 TxtE-Fdx Protein Interactions

We used the PLIP program to analyze the interactions between TxtE and Fdx. PLIP (Protein-Ligand Interaction Profiler) is an analysis tool for identifying and visualizing multiple types of interactions between proteins and ligands [5]. The results were visualized using PyMOL, as shown below:

PLIP analysis of interactions between TxtE and Fdx proteins

Figure 5: PLIP analysis of interactions between TxtE and Fdx proteins

The analysis identified 7 hydrophobic interactions, 3 hydrogen bonds, and 2 salt bridges. These results verify the molecular basis for stable binding between the two proteins, involving various non-covalent interactions. The specificity and stability of the protein-protein binding provide the physical contact foundation necessary for electron transfer.

4. Molecular Dynamics Simulation

To address the limitations of static docking results and make the predictions closer to the real state of biomolecules in a physiological environment, we introduced molecular dynamics simulations following the protein docking analysis.

4.1 Molecular Dynamics Simulation Method

This study selected the highest-scoring model from HDOCK for molecular dynamics simulation. The simulations were performed using Gromacs 2022.4 software [6]. The force field used for the simulation system was amber14sb [7], the water model was TIP3P, counterions were sodium (Na⁺) and chloride (Cl⁻) ions, and the salt concentration was set to 0.15 mol/L.

The specific simulation workflow was as follows: First, the system underwent two steps of energy minimization: the first step used the steepest descent method (steep) for 10,000 iterations, and the second step used the conjugate gradient method (cg) for 5,000 iterations. After energy optimization, the system underwent 500 ps of NVT simulation followed by 500 ps of NPT simulation. After the NPT simulation, a 100 ns production simulation was conducted.

Simulation parameters were set as follows: the temperature coupling algorithm was V-rescale, the pressure coupling algorithm was Parrinello-Rahman, the simulation time step was 2 fs, and the temperature was maintained at 298.15 K. Electrostatic interactions were calculated using the Particle Mesh Ewald (PME) algorithm. The cutoff radii for both Coulombic and van der Waals interactions were set to 12 Å. Hydrogen bonds were constrained using the LINCS algorithm.

Analysis of the dynamics trajectory was performed using built-in Gromacs commands. The binding free energy was calculated using the gmx_MMPBSA tool [8].

4.2 Dynamics Simulation Results

4.2.1 RMSD Results

The Root Mean Square Deviation (RMSD) is a key metric for assessing the overall atomic positional deviation of a protein structure relative to a reference conformation. It effectively reflects the structural stability and convergence of the simulated system. The RMSD of the TxtE-Fdx protein complex backbone atoms over simulation time is shown below:

RMSD of TxtE-Fdx protein complex backbone atoms

Figure 6: RMSD of TxtE-Fdx protein complex backbone atoms

In the early simulation stage (first ~20 ns), the backbone RMSD increased rapidly, reflecting significant conformational adjustment of the complex structure. Subsequently, the RMSD gradually decreased and fluctuated. After ~50 ns, the RMSD stabilized, fluctuating around a mean value of 0.18 nm during the 50 ns to 100 ns period. This trend indicates that the TxtE-Fdx complex structure reached a stable state in the later stages of the simulation.

4.2.2 RMSF Results

The Root Mean Square Fluctuation (RMSF) reflects the flexibility of individual residues during the molecular dynamics simulation. A higher value indicates greater spatial fluctuation and flexibility of the residue. The RMSF analysis results for TxtE and Fdx residues are as follows:

RMSF fluctuations of TxtE and Fdx protein residues

Figure 7: RMSF fluctuations of TxtE and Fdx protein residues

For the TxtE protein, most residues exhibited low RMSF values, indicating good overall stability. For the Fdx protein, the RMSF fluctuations of all residues were also generally at a low level, indicating good structural stability.

4.2.3 Radius of Gyration Results

The Radius of Gyration (Rg) is a key indicator measuring the compactness of a protein structure. The change in the Rg value of the TxtE-Fdx protein complex over simulation time is shown below:

Radius of gyration of the TxtE-Fdx protein complex

Figure 8: Radius of gyration of the TxtE-Fdx protein complex

After ~50 ns, the Rg stabilized, fluctuating around a mean value of 2.25 nm. This trend indicates that the structure of the TxtE-Fdx complex gradually reached a stable and compact state.

4.2.4 Solvent Accessible Surface Area Results

The Solvent Accessible Surface Area (SASA) characterizes the extent of contact between a protein and the solvent. The change in SASA of the TxtE-Fdx protein complex over simulation time is shown below:

Solvent Accessible Surface Area of the TxtE-Fdx protein complex

Figure 9: Solvent Accessible Surface Area of the TxtE-Fdx protein complex

After ~50 ns, the SASA stabilized, fluctuating around a mean value of 208 nm². This indicates that the solvent-accessible surface area of the complex gradually decreased and eventually reached a dynamic equilibrium, suggesting the formation of a relatively stable complex structure.

4.2.5 Hydrogen Bond Results

The analysis of the number of hydrogen bonds formed between TxtE and Fdx proteins over simulation time is shown below:

Hydrogen bonds between TxtE and Fdx proteins

Figure 10: Hydrogen bonds between TxtE and Fdx proteins during simulation

During the simulation, the number of hydrogen bonds began to increase around 50 ns and subsequently gradually reached a dynamic equilibrium state. During the 50 ns to 100 ns period, the average number of hydrogen bonds fluctuated around 12.

4.2.6 Binding Free Energy

To quantify the binding strength between TxtE and Fdx proteins, 300 frames were extracted from the 70 ns-100 ns molecular dynamics simulation trajectory, and the binding free energy was calculated. The results are shown in the table below:

Figure 11: Binding Free Energy between TxtE and Fdx proteins (kcal/mol)

 

ΔVDWAALS

ΔEEL

ΔEPB

ΔENPOLAR

ΔGGAS

ΔGSOLV

ΔTOTAL

Energy

-87.75

-61.12

122.01

-10.24

-148.87

111.77

-37.10

From the data, the total binding free energy between TxtE and Fdx proteins is -37.10 kcal/mol. The negative value indicates that the binding process is thermodynamically favorable and the stable complex can form spontaneously.

5. Summary

This modeling study, through methods such as protein docking and molecular dynamics simulations, comprehensively demonstrates the dynamic stability and physiological validity of the docked conformation. The stable binding between the proteins provides the spatial foundation for electron transfer. Furthermore, the dynamic adaptation of key functional sites and specific interactions that fix the binding orientation further support the efficiency of electron transfer. This provides a reliable theoretical basis for subsequent in-depth investigation of the functional mechanism of inter-protein electron transfer.

[1] BARRY S M, KERS J A, JOHNSON E G, et al. Cytochrome P450–catalyzed L-tryptophan nitration in thaxtomin phytotoxin biosynthesis [J]. Nature Chemical Biology, 2012, 8(10): 814-6.

[2] HAO Y, LIU M, FORDJOUR E, et al. Engineering Escherichia coli for Perillyl Alcohol Production with Reduced Endogenous Dehydrogenation [J]. ACS Synthetic Biology, 2025, 14(5): 1594-605.

[3] ABRAMSON J, ADLER J, DUNGER J, et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3 [J]. Nature, 2024, 630(8016): 493-500.

[4] YAN Y, TAO H, HE J, et al. The HDOCK server for integrated protein–protein docking [J]. Nature Protocols, 2020, 15(5): 1829-52.

[5] SCHAKE P, BOLZ S N, LINNEMANN K, et al. PLIP 2025: introducing protein–protein interactions to the protein–ligand interaction profiler [J]. Nucleic Acids Research, 2025, 53(W1): W463-W5.

[6] ABRAHAM M J, MURTOLA T, SCHULZ R, et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers [J]. SoftwareX, 2015, 1-2: 19-25.

[7] MAIER J A, MARTINEZ C, KASAVAJHALA K, et al. ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB [J]. Journal of Chemical Theory and Computation, 2015, 11(8): 3696-713.

[8] Valdés-Tresanco, M.S., Valdés-Tresanco, M.E., Valiente, P.A. and Moreno, E., 2021. gmx_MMPBSA: A new tool to perform MM/PBSA calculations in GROMACS trajectories. Journal of chemical theory and computation, 17(10), pp.6281-6291.

1. Project Background

In the experimental phase, the team engineered E.coli using synthetic biology approaches to produce pigments during its growth process. This enables the bacteria to "paint" unique patterns, presenting life activities in a visual manner.

However, the team was not satisfied with merely "seeing" the trajectory of life. They further wondered: if the growth of E.coli can be seen, can it also be "heard"? To allow more people, especially visually impaired individuals, to intuitively experience the beauty of science, the team proposed the concept of "Microbial Music"— transforming the growth process of E.coli into melodies.

In this model, the Petri dish serves as a musical score, the spread of bacterial colonies turns into musical notes, and the rhythm of life is presented in the form of sound. This not only expands the artistic boundaries of synthetic biology but also allows science to be perceived and understood in a more inclusive and multi-sensory way.

2. Technical Route

Our core process is divided into four steps, forming a technical path of "Experimental Image → Intelligent Recognition → Data Analysis → Artistic Translation". The overall process is shown in the following figure:

Schematic Diagram of the Four-Step Core Experimental Process

Figure 1: Schematic Diagram of the Four-Step Core Experimental Process

3. Specific Operation Process and Results

3.1 Experimental Images

3.1.1 Experimental Plating and Image Acquisition

The spread plate experiment takes E.coli as the research object. The plating operation is completed on the aseptic operating table in the laboratory. LB solid medium is used to support the growth of bacterial colonies. After plating, the plates are inverted and placed in a 37°C constant-temperature incubator for cultivation. During the cultivation period, photos are taken at regular intervals to record the growth of bacterial colonies, and the following precautions must be strictly followed during shooting:

① The same camera shall be used for shooting and recording throughout the process, and the shooting distance and storage format shall be fixed to avoid the impact of differences in equipment parameters on subsequent recognition.

② Shooting shall be conducted in an environment with uniform light, without direct strong light. The lens shall be perpendicular to the surface of the medium and fully cover the area of the Petri dish.

③ Place the Petri dish on a solid-color, pattern-free background board to eliminate interferences such as background variegation and shadows, ensuring that the outline of bacterial colonies is clearly distinguishable.

④ Take the moment when the plate is placed into the incubator as the "0h" starting point, take photos at irregular intervals to record the growth of E.coli, and finally obtain 15-20 images covering the entire growth cycle.

⑤ Name the images according to the "cultivation time" and record the shooting time to facilitate subsequent data organization.

3.1.2 Image Processing

To eliminate differences in angle and field of view among colony images taken at different times and ensure unified scales during subsequent target detection, we perform standardization processing on the images. Through methods such as rotation (adjusting the image angle to make the edge direction of the medium consistent) and cropping (taking the edge of the medium as a fixed control reference to crop out the excess background area), all processed images have a unified scale and consistent visual starting point, preventing shooting deviations from affecting the accuracy of colony feature extraction. The processed images are shown in the figure below:

Processed Plate Images

Figure 2: Processed Plate Images

The unit of the image names is "hours", corresponding to the growth time of E.coli.

3.2 Intelligent Recognition

Intelligent recognition is the core link connecting "standardized images" and "quantified colony features". Its purpose is to extract key information that can be used for music translation, such as the position, length, and width of colonies, from the processed E.coli growth images. To avoid the efficiency bottlenecks and accuracy deviations caused by manual operations, this module adopts the target detection technology based on YOLOv8s to realize the automatic recognition of colony information, providing stable and batch structured data support for subsequent data analysis and music translation.

3.2.1 Data and Preprocessing

① Dataset

The dataset used for training in this module is derived from the free part of the AGAR dataset [C1] and the dataset provided in relevant literatures [C2]. A total of 140 images and corresponding data have been sorted out, mainly involving five core colony categories: B.subtilis, C.albicans, E.coli, P.aeruginosa, S.aureus, as well as a small number of mixed colonies. We sincerely appreciate the efforts and contributions of relevant researchers and staff.

② Format Conversion Processing

Subsequently, we need to perform format processing on the obtained dataset. The process involves converting the original JSON annotations into XML files in VOC format first, and then further converting them into txt files dedicated to YOLO. While standardizing the bounding box coordinates into normalized values required for model training, the automatic division of the training set (80%) and validation set (20%) is completed, providing standardized and directly readable input data for subsequent data processing and efficient training of the YOLOv8s model.

③ Image Slicing and Algorithm Optimization Processing

We first conducted a training session using the existing dataset, but we found that the training results had problems in recognizing extremely small colonies in practical instances. Therefore, after converting JSON to XML files, we implemented algorithm adaptability optimization through image slicing processing. Large-sized or irregular images are converted into sub-regions of fixed size, and label information is adjusted synchronously to ensure that the colony features in the slices are complete and the label coordinates are accurate.

We adopt the sliding window method for slicing: using a preset fixed size as the window, calculating the sliding step according to the set overlap rate, and successively intercepting sub-regions on the original image; synchronously processing the VOC format labels, retaining only the colony targets completely located within the window, converting their coordinates into relative coordinates within the slices and generating new labels; performing grayscale processing on some truncated colony regions to weaken the features of incomplete targets and eliminate interferences; finally outputting label files corresponding to the slices one-to-one to maintain the consistency of the data structure, laying a foundation for the YOLOv8s model to efficiently learn colony features.

Effect Diagram of Slicing and Grayscale Masking

Figure 3 : Effect Diagram of Slicing and Grayscale Masking

3.2.2 YOLOv8s Model

YOLOv8 is a major updated version based on YOLOv5, released by Ultralytics in January 2023. On the basis of previous YOLO versions, it introduces new functions to further improve the performance and flexibility of the model. The following figure shows the overall framework of YOLOv8:

Model structure of YOLOv8

Figure 4: Model structure of YOLOv8 detection models from RangeKing@github[C3]

① Backbone

The Backbone of YOLOv8 is composed of three major modules (Conv, C2f, and SPPF) combined in a logical manner. The Conv module is the foundation, which realizes downsampling through the structure of "2D Convolution + BatchNorm2d + SiLU" to provide fixed scales for subsequent feature extraction. C2f is a key improvement, which has fewer parameters and stronger feature extraction capabilities than the C3 module of YOLOv5. Through the process of "ConvModule Processing → Split into Two Paths → One Path Directly Connected, the Other Path Passing Through DarknetBottleneck with Residual → Concat Concatenation → ConvModule Output", it reduces the loss of shallow features and captures fine-grained information. Consistent with YOLOv5, SPPF fuses features through multi-scale pooling and retains global semantics. Overall, it consists of 5 Conv modules, 4 C2f modules with different parameters, and 1 SPPF module, forming a complete feature extraction chain from underlying textures to high-level semantics.

YOLOv8 Backbone Structure Part 1 YOLOv8 Backbone Structure Part 2 YOLOv8 Backbone Structure Part 3

Figure 5: Schematic Diagram of the YOLOv8 Backbone Structure

② Neck

The Neck adopts a combined structure of "FPN + PAN", whose core is to realize the bidirectional fusion of high-level and low-level features. FPN (Feature Pyramid Network) works in a "top-down" manner: after upsampling the high-level small-sized feature maps of the Backbone, it sequentially concatenates them with the middle-level and low-level feature maps, followed by C2f processing, to transmit high-level semantic features and solve the problem of low-level semantic ambiguity. PAN (Path Aggregation Network) operates in a "bottom-up" way: after downsampling the low-level feature maps fused by FPN, it sequentially concatenates them with the middle-level and high-level feature maps, followed by C2f processing, to transmit low-level positioning features and make up for the insufficient positioning of high-level features. Finally, 3 optimized feature maps corresponding to different scales are output.

Schematic Diagram of the Feature Pyramid Network Structure

Figure 6: Schematic Diagram of the Feature Pyramid Network Structure

③ Head

The Head adopts the innovative design of "Anchor-Free + Decoupled-Head" to improve the flexibility and accuracy of prediction. Anchor-Free does not require preset anchor boxes; it directly infers the position and size of the target from the feature maps of the Neck, adapts to changes in object size, and avoids missed detection and repeated detection caused by unreasonable anchor boxes. The Decoupled-Head splits "classification + regression" into independent branches, both of which are processed through "4 3×3 convolutions + 2 1×1 convolutions". The classification branch uses BCE (Binary Cross-Entropy) loss to optimize category judgment, and the regression branch uses CIOU (Complete Intersection over Union), WIOU (Weighted Intersection over Union), and DFL (Distribution Focal Loss) to optimize the bounding box coordinates. Finally, the target box coordinates and category probabilities are output, and the results are obtained through non-maximum suppression. Additionally, it outputs 3 feature maps corresponding to the Neck, realizing accurate prediction of targets of different sizes.

Schematic Diagram of the YOLOv8 Head Structure

Figure 7: Schematic Diagram of the YOLOv8 Head Structure, from MMYOLO[C4]

④ Model Performance

The following figure shows the performance curves of multiple YOLO series models:

YOLOv8 Performance Curve

Figure 8: YOLOv8 Performance Curve [C5]

Considering the size of our training set, we decided to select YOLOv8s as our training model.

3.2.3 Training Results

Based on the preprocessed dataset and the YOLOv8s model, we conducted 300 rounds of training on 1421 images in the training set and 356 images in the validation set. The results are as follows:

Statistical Distribution Visualization of the Colony Dataset

Figure 9: Statistical Distribution Visualization of the Colony Dataset

From the statistical distribution visualization of the dataset, it can be seen that the sample sizes of the five core colony categories are sufficient, providing rich feature learning materials for model training, which is an important foundation for the model's excellent recognition performance on the core categories. At the same time, the morphological characteristics of colonies, such as random distribution in images and positive correlation between width and height, also provide data-level support for model training.

F1-Confidence Curve Precision-Confidence Curve Precision-Recall Curve Recall-Confidence Curve

Figure 10: Diagram of Machine Training Effects

These four figures are the F1-Confidence Curve, Precision-Confidence Curve, Precision-Recall Curve, and Recall-Confidence Curve respectively. Based on the comprehensive analysis of these four evaluation curves, it can be seen that our trained model exhibits excellent colony detection performance: in the Precision-Recall Curve, the overall mAP@0.5 (mean Average Precision at IoU=0.5) is as high as 0.982, and the average precision of core colonies such as B. subtilis, C. albicans, and S. aureus is close to 1.0, indicating that the model has extremely high recognition accuracy for target colonies. The Recall-Confidence and F1-Confidence curves further verify that when the confidence threshold is approximately 0.380, the model can achieve the optimal balance between recall and precision (with an F1-score of 0.95 at this point).

Normalized Confusion Matrix Original Confusion Matrix

Figure 11: Confusion Matrix Diagram of Training Results

From the results of the confusion matrix, the model shows good recognition performance for core colonies: in the normalized confusion matrix, the correct classification ratio of these categories is close to or reaches 1.0; in the original confusion matrix, the number of correctly classified samples of core categories is large, and the number of misclassified samples is small, indicating that the model can accurately and stably distinguish core colonies.

Loss Curves and Evaluation Metric Curves

Figure 12: Loss Curves and Evaluation Metric Curves

These curves indicate that during the training process, various types of losses converge rapidly and eventually stabilize; core metrics such as precision, recall, and mAP continue to rise and tend to saturate. This shows that the model converges well on the colony detection task, being able to accurately classify colony categories and regress bounding boxes, and has good adaptability to different detection scenarios.

3.2.4 Practical Application

After completing the model training, we used the best training weights obtained from the training to perform colony detection on the images acquired from the experimental plating. Due to issues such as poor detection results and small differences in colony states between adjacent time points for some images, which provide no substantial help for subsequent data analysis, we excluded these images from the candidate list. Finally, we retained a batch of images with high detection accuracy that can clearly reflect the dynamic growth process of colonies, and added a confidence filter (confidence = 0.2). The detection results are as follows:

Diagram of Practical Application

Figure 13: Diagram of Practical Application

(Specific colony information is displayed in the "Data Analysis" module)

3.2.5 Discussion

From the results, although most colonies can be recognized and the boundary recognition is relatively accurate, the following problems still exist:

1. A small number of colonies cannot be detected;

2. Unclear large detection boxes appear in blank areas without colonies;

3. Although the machine training effect diagram is good, there are still cases of incorrect colony type recognition in practical applications.

Based on the above problems and combined with the foundation of our project, we speculate that the possible reasons are as follows:

① Since our experiment involves the growth of E.coli from small to large, there are E.coli of various sizes in different growth stages. Moreover, our dataset is not particularly large, which may make it difficult to fully cover the morphologies of E.coli in all stages. At the same time, the appearances of other types of colonies are similar to those of E.coli in these stages, leading to machine misjudgment.

② At the same time, considering that some bacteria have similar morphological characteristics themselves, and under the interference of shooting methods or the environment, the core visual features of colony morphologies are highly similar. Relying solely on the recognition method of "image morphology" is prone to confusion.

③ In addition, during the shooting process, due to factors such as light or reflection, some blank areas without colonies may present "pseudo-features" similar to colonies, leading to machine misjudgment. It may also be due to the Anchor-Free design adopted by YOLOv8s: although it is suitable for targets of different sizes, the judgment threshold for non-target areas is relatively low. When there are slight visual changes in blank areas, such as background color gradients and tiny impurities, the model tends to incorrectly calculate large-sized bounding boxes. If the subsequent non-maximum suppression parameters are set too loosely to filter out these false-positive large boxes, they may be retained in the detection results.

Combining the current machine training results and possible existing problems, our future optimization directions will focus on targeted dataset supplementation, experimental scenario expansion, algorithm optimization, and so on.

4 Data Analysis

After completing the practical application, we extracted the prediction results. Due to the excessive number of bacterial colonies, we selected a portion of colonies with clear data and organized the numerical information of their x , y ,width, and height at corresponding time points for subsequent artistic translation. The results are as follows:

5 Artistic Translation

After extracting and filtering the required data, we initiated the process of artistic translation. The primary goal is to convert this data into corresponding musical information following specific rules and present it alongside vivid visuals. Our aim is to allow the general public to perceive the rhythm of life through hearing, and more importantly, to provide visually impaired individuals with a new way to experience the beauty of science, thereby fulfilling the core original intention of the project—"multisensory inclusiveness".

5.1 Music Composition

With the assistance of Professor Zhou [see Professor Zhou's profile here], we completed the composition of an original piece of music based on E.coli growth data using the SuperCollider audio programming software. The music adopts a collaborative design of two vocal layers, utilizing data from the two dimensions of "colony group" and "individual colony" respectively to ensure that the sound expression of the growth process is both comprehensive and hierarchical. The specific data utilization logic is as follows:

The first layer of code focuses on the group growth law of E.coli, taking the width values of 9 bacterial strains at 9 time points (30-270 minutes) as the core data source. It adjusts the number of active bacterial strains according to the progress of growth time, thereby presenting the overall growth rhythm changes of the E.coli colony from the lag phase to the logarithmic phase and then to the stationary phase.

The second layer focuses on the individual morphological data of E.coli, using the spatial coordinates and sizes of 9 colonies at 9 time points as the core data source. Through real-time data updates and smooth transition processing between time periods, it restores the continuous growth changes of a single colony in spatial position and morphology over time.

During synchronous playback, the music synergistically utilizes E.coli growth data from the two core dimensions of "overall growth dynamics of the bacterial colony" and "spatial morphology of individual colonies", jointly realizing the comprehensive audibility of the growth process.

5.2 Animation Production

To better showcase our music, we used TouchDesigner software for modeling. In terms of design, we balanced scientific authenticity and artistic appreciation—using rotating and expanding spheres to simulate E.coli. The distribution positions of each sphere are determined based on the approximate growth positions of E.coli, and colors that gradient over time are added to reflect the vitality of the E.coli growth process:

Dynamic display of a single sphere

Figure 14: Dynamic display of a single sphere

Up to this point, we have completed the integration of the music and animation, and finally realized the entire creation of the artistic translation section. The relevant results can be viewed through the following link:

https://video.igem.org/w/vd9RaZ9JN6xDfSbbrJzRHa

[1] Majchrowska, S., Pawłowski, J., Guła, G., Bonus, T., Hanas, A., Loch, A., Pawlak, A., Roszkowiak, J., Golan, T., & Drulis-Kawa, Z. (2021). AGAR a microbial colony dataset for deep learning detection. arXiv preprint arXiv:2108.01234. https://arxiv.org/abs/2108.01234

[2] Pawłowski, J., Majchrowska, S. & Golan, T. Generation of microbial colonies dataset with deep learning style transfer. Scientific Reports 12 (2022).

[3] It can be found at https://github.com/ultralytics/ultralytics/issues/189

[4] It can be found at
https://mmyolo.readthedocs.io/zh-cn/latest/recommended_topics/algorithm_descriptions/yolov8_description.html

[5] It can be found at https://docs.ultralytics.com/zh/models/yolov8/

I. Microbial Music

In the "Microbial Music" module, it is necessary to translate the growth dynamics of E. coli (such as colony size, density, and growth rate) into rhythmic and melodic music through algorithms. In the current experiment, the music generation algorithm is based on the initially observed growth parameters, and there are issues such as the need to expand data sampling.

1. Technical Deepening and Iteration

Dataset Expansion: Supplement image data of E. coli at different growth stages in a targeted manner, and simultaneously include more colony samples with morphological similarity to *E. coli* to improve the model's discrimination ability for the target strain.

Algorithm Improvement: Refine the target detection model to enhance the accuracy of recognition.

2. Innovation in Artistic Translation

Multi-dimensional Data Mapping: On the basis of the existing mapping logic, add other mapping relationships to create more music pieces.

Through multi-dimensional optimization (including technical deepening and iteration, as well as innovation in artistic translation), microbial music can be improved in the future.

II. Molecular Dynamics Simulation

In the core goal of "Microbial Painting", the indigo pigment synthesized by *E. coli* needs to diffuse in the sodium alginate hydrogel matrix. The diffusion coefficient is a key quantitative index, and its core role is reflected in multiple aspects such as accurately controlling the painting time, optimizing the hydrogel performance, and realizing the collaborative painting with multiple pigments.

In the current experiment, the pigment diffusion process lacks the support of quantitative data at the microscopic level. The diffusion coefficient can be calculated through GROMACS molecular dynamics simulation, which provides a scientific basis for the optimization of the experimental scheme and reduces the cost of trial and error.

1. Core Technical Route and Implementation Steps

1.1 Construction of Sodium Alginate Hydrogel Model (Based on Tools such as Avogadro)

1.1.1 Design and Construction of Single-Chain Structure

1. Monomer Selection and Sequence Design: Sodium alginate is composed of β-D-mannuronic acid (M) and α-L-guluronic acid (G) linked by 1,4-glycosidic bonds, among which the G-unit enriched region (G-block) is the core site for calcium ion cross-linking. With reference to the structural characteristics of natural sodium alginate, a single-chain sequence is designed: 2 segments of G-blocks are set, each containing 10 consecutive G units, and the remaining regions adopt a random mixed arrangement of M and G.

2. Structure Optimization and Charge Assignment: Use Avogadro to minimize the energy of the single chain and optimize the geometric structure. According to the dissociation characteristics of sodium alginate, assign a charge of -1 to the carboxyl group of each G and M unit to simulate the molecular state in the physiological environment.

1.1.2 Construction of Cross-Linked Network (mediated by Calcium Ions)

1. Establishment of Multi-Chain System: Place 8-10 sodium alginate single chains in a periodic box, and set the minimum distance between the single chains to 0.5 nm to avoid overlapping of initial structures.

2. Calcium Ion Cross-Linking and Grid Formation: Add Ca²⁺ ions. Based on the principle of the "egg-box model", guide Ca²⁺ to coordinate with the carboxyl oxygen atoms of G-blocks in different single chains. Fix the coordination bonds between Ca²⁺ and G-blocks by means of energy constraint, so as to promote the formation of cross-linking points between different single chains through Ca²⁺. Remove the free Ca²⁺ that does not participate in cross-linking, and finally form a simplified periodic hydrogel grid model, which not only retains the core characteristics of "cross-linking points - grid pores" but also reduces the calculation load of subsequent simulations.

2.2 Simulation of Indigo Pigment-Hydrogel Diffusion (Based on GROMACS)

2.2.1 Preparation of Simulation System

1. Construction of Pigment Molecules and Force Field Parameters: Obtain files of indigo molecules and other substances from PubChem, generate topology files corresponding to the force field through tools, and place the indigo pigment molecules in the center of the hydrogel grid.

2. Addition of Solvent and Ions: Adopt the SPC/E water molecule model to fill the periodic box to simulate the high water content environment of the hydrogel, and add gdl ions to neutralize the total charge of the system to avoid charge imbalance of the system.

2.2.2 Simulation Process and Parameter Setting

After pre-equilibration, carry out the production simulation of molecular dynamics, save the trajectory file, and record the coordinate changes of indigo pigment and hydrogel molecules for the subsequent analysis of diffusion coefficient.

2. Calculation of Diffusion Coefficient and Result Analysis

1. Calculation of Mean Squared Displacement (MSD): Use the `gmx msd` command of GROMACS to extract the centroid coordinates of indigo pigment molecules from the trajectory file, calculate the MSD at different time intervals, perform linear fitting on the MSD data, and calculate the D value (diffusion coefficient) through the slope according to the formula MSD = 6Dt.

2. Result Verification and Mechanism Analysis: Repeat the simulation 3 times, calculate the average value and standard deviation of the D value to ensure the reliability of the results; Combine the trajectory animation (visualized by VMD software) to observe the diffusion path of indigo pigment in the hydrogel grid; Based on the diffusion coefficient data, proposals for "Microbial Painting" can be put forward to reduce the number of experimental trials and errors, and at the same time provide specific directions for the optimization of hydrogels in the future.