Computational analyses of wild-type (WT) Cecropin B and its mutants demonstrated that the LLM-derived optimization strategies exhibited pronounced target specificity. By decomposing the binding free energies and their energetic components (Table 2), the mechanisms underlying the distinct optimization strategies were clearly elucidated.
For the LptD--LptE (4RHB) target, the charge-enhancement strategy proved to be effective. The binding free energy of wild-type Cecropin B was -13.5 kJ/mol, whereas both charge-enhanced mutants exhibited significant affinity improvements. CecB-E10K improved the binding energy to -18.5 kJ/mol, while CecB-N15K showed the best performance, further enhancing it to -21.0 kJ/mol. Energy decomposition analysis (using N15K as an example) revealed that this gain in affinity was primarily driven by a substantial contribution from electrostatic energy (ΔE_EL), which sharply decreased from -955.1 kJ/mol to -1180.7 kJ/mol, strongly indicating that the newly introduced lysine side chain established stronger electrostatic interactions with the target surface.
In contrast, for the DppA (1DPE) target, the hydrophobic/structural optimization strategy yielded the best effect. The binding free energy of the wild-type was -10.4 kJ/mol, whereas the hydrophobic mutant CecB-V29W significantly improved binding to -17.6 kJ/mol. The underlying mechanism was clearly evident in the energy decomposition: the major driving force arose from van der Waals energy (ΔE_VDW), which decreased substantially from -95.6 kJ/mol to -145.2 kJ/mol. This suggests that the introduction of the bulky tryptophan (Trp) side chain strengthened hydrophobic interactions and enhanced shape complementarity within the target binding groove.
Cross-validation of these results further emphasized the necessity of target-guided design. Charge-enhanced mutants (E10K and N15K) performed poorly in binding to DppA (-9.5 kJ/mol and -8.4 kJ/mol, respectively), while the hydrophobic-optimized V29W showed no appreciable improvement in binding to 4RHB (-12.2 kJ/mol). These findings clearly indicate that distinct optimization strategies are best suited for specific structural features of different targets, thereby confirming the effectiveness of our computational framework in achieving target-specific optimization.
Table 2. Energy decomposition of Cecropin B (WT) and key mutants in binding to targets (kJ/mol)
| System |
ΔVDW |
ΔEEL |
ΔG_solv |
ΔH |
-TΔS |
ΔG binding |
| WT + 4RHB |
-128.3 |
-955.1 |
998.6 |
-84.8 |
71.3 |
-13.5 |
| CecB-E10K + 4RHB |
-127.5 |
-1102.3 |
1126.3 |
-103.5 |
85.0 |
-18.5 |
| CecB-N15K + 4RHB |
-125.9 |
-1180.7 |
1205.1 |
-101.5 |
80.5 |
-21.0 |
| CecB-V29W + 4RHB |
-129.8 |
-950.4 |
996.5 |
-83.7 |
71.5 |
-12.2 |
| WT + 1DPE |
-95.6 |
-680.2 |
715.3 |
-60.5 |
50.1 |
-10.4 |
| CecB-E10K + 1DPE |
-93.8 |
-695.1 |
729.4 |
-59.5 |
50.0 |
-9.5 |
| CecB-N15K + 1DPE |
-90.1 |
-705.8 |
742.5 |
-53.4 |
45.0 |
-8.4 |
| CecB-V29W + 1DPE |
-145.2 |
-675.5 |
720.8 |
-99.9 |
82.3 |
-17.6 |
Figure 6. Calculated binding free energies of Wild-type Cecropin B (WT) and its mutants to the receptor.
2.1.1 Experimental Challenges and a Shift in Research Focus
While the computational modeling presented in Section 2.1 strongly suggested that LLM-guided mutations could significantly enhance the target-specific binding affinity of Cecropin B, our parallel experimental validation encountered substantial challenges. As detailed in our engineering efforts, attempts to express both the wild-type and the optimized Cecropin B mutants in a standard E. coli expression system resulted in extremely low or undetectable protein yields. Consequently, subsequent inhibition zone assays and minimum inhibitory concentration (MIC) tests failed to demonstrate the expected antibacterial activity.
This significant discrepancy between in silico predictions and experimental outcomes is likely attributable to the inherent cytotoxicity of the Cecropin B variants to the expression host, a bottleneck that proved difficult to overcome despite various engineering strategies (such as the addition of an anionic protective peptide and a signal peptide). This practical roadblock underscored the value of our dual-track AI framework. Faced with the experimental impasse on the Cecropin B optimization track, we strategically shifted our focus to the parallel, deep-learning-driven genomic mining and assembly track. This alternative approach, aimed at mining novel monomeric sequences from an extremophilic microbial library and engineering them into a functional peptide, offered a promising path forward and led directly to the investigation of the candidate peptide AMP-NJY-LYY-742.