Dry Lab Modeling Banner

Overview

Interleukin-10 (IL-10) is a key anti-inflammatory cytokine that plays an essential role when treating inflammatory bowel disease symptoms. However, its function is greatly limited in the extreme pH and body temperatures accelerated by inflammatory bowel disease. The structure of IL-10 has been highly studied and many studies report that the biologically active form of IL-10 can be thermally unstable in its homodimer form, as it has a short half-life and can be easily degraded in vivo. To improve the therapeutic efficacy of IL-10 for treating inflammatory bowel disease, we have designed two novel forms of IL-10 that utilize highly specific point mutations that improve the protein’s thermal stability. These point mutations were optimized for protein thermostability using ThermoMPNN, a deep neural network designed to predict amino acid mutations while preserving IL-10’s homodimer’s structural integrity and crucial binding properties.

Identifying Mutations

Introduction

Interleukin-10 (IL-10) is a critical anti-inflammatory cytokine for treating inflammatory bowel disease (IBD). However, in the inflamed gut, extreme pH and higher local temperatures reduce IL-10’s activity. The structure of IL-10 has been highly studied and many studies report that the biologically active form of IL-10 can be thermally unstable in its homodimer form, as it has a short half-life and can be easily degraded in vivo.

To address this, our Dry Lab aimed to engineer IL-10 variants with improved thermostability. We designed two optimized IL-10 variants (M1 and M2) using a computational pipeline that integrates ThermoMPNN, PyMOL, AlphaFold3, and ESM modeling. This approach allowed us to target specific point mutations predicted to stabilize IL-10’s fold without disrupting its dimeric structure or receptor binding.

ThermoMPNN

ThermoMPNN is a deep neural network trained on vast datasets of experimental stability measurements, allowing it to learn the complex relationships between sequence, structure, and stability. Based on the well documented ProteinMPNN framework, ThermoMPNN analyzes a protein's 3D structure to predict how changes in its amino acid sequence (point mutations) will impact a protein’s entire thermal stability as denoted through the point mutation’s resulting change in -ΔΔG.

We began with the high-resolution crystal structure of IL-10 (PDB 1ILK) and ran ThermoMPNN via Google Colab.

Figure 1

Figure 1. Workflow of the ThermoMPNN model used to identify stabilizing point mutations. The pipeline combines structural features of IL-10 (e.g., backbone distances) with sequence embedding processed by an encoder–decoder network. The output is passed through a stability-prediction module (LA and MLP blocks) to calculate the predicted change in folding free energy (ΔΔG) for each possible amino-acid substitution. Negative ΔΔG values (blue on the heat-map) indicate mutations predicted to increase protein stability, whereas positive ΔΔG values (red) suggest destabilizing effects. This method builds on the design principles described in Mega-scale experimental analysis of protein folding stability in biology and design, https://www.nature.com/articles/s41586-023-06328-6

A brief summary of the process is as follows:

1. Uploaded the PDB file and specified chains for design

2.Toggled off “Include Cysteines” to avoid disulfide-driven misfolding.

-Turning off cysteine design is a common precaution in therapeutic protein engineering, because disulfide bonds can increase stability but often cause aggregation or folding problems.

3. Tested the “Multichain Interface” option to account for the homodimer interface; as expected for a symmetric dimer, this setting made little difference.

4. Used the standard mode for predictions and post-processed results to remove duplicates, low-confidence hits, and buried polar residues.

Further detail is provided below:

Starting with a Research Collaboratory for Structural Bioinformatics protein data bank (RCSB PDB) file of our desired protein IL-10. We chose the crystal structure PDB 1ilk-IL10, of IL-10, derived using X-Ray diffraction with a high PDB validation report. To process the ThermoMPNN software, we used Google Collab’s implementation.

Thermo Step 1

The app is organized into easy to follow steps used for running the software. After confirming each step, the user hits the play key on the left side of the platform.

Thermo Step 3

Following, the user uploads a desired PDB file and chooses which protein chains for the software to focus on.

Thermo Step 3

Standard ThermoMPNN mode was used for our dry lab. Importantly, running the “Include Cysteines” option determines whether the model can introduce cysteine residues into designed sequences. Allowing cysteines takes into account potential disulfide bonds, which can greatly improve protein thermostability but also risk misfolding or unwanted aggregation strain on the structure. For IL-10 engineering, our group consulted with Maggie Horst postdoc and chose to toggle this setting off as a cysteine-free sequence is safer for expression and folding.

Thermo Step 5

Next, the multichain interference was experimented with in our dry lab outputs. The multichain interference allows the model to account for the interface between the two chains. For IL-10 this prevents designs that stabilize one monomer but destabilize the dimer as a whole. Our results when toggling this option on versus off did not differ much or introduce new mutations that were significant in improving thermostability. This is logical as the IL-10 chains mirror one another.

Thermo Step 5.5

Lastly, post-processing cleans up the raw sequence output by removing duplicates and low-confidence mutations. It also filters against unwanted motifs, like buried polar residues or problem cysteines, that could cause instability or misfolding. For example sample output could list “Identified the following disulfide engaged residues: [].” Finally, it ranks the remaining candidates by predicted thermostability so only the most promising variants move forward.

Interpreting Results

Raw Data

ThermoMPNN produced a heat-map of ~7,000 candidate substitutions, showing stabilizing (blue) and destabilizing (red) mutations.

ThermoMPNN Heat Map

Figure 2. Thermostability heat map of IL-10 single-site mutations generated using ThermoMPNN.

The heat map illustrates predicted changes in folding free energy (ΔΔG, kcal/mol) across all amino-acid substitutions and positions of IL-10. Negative ΔΔG values (blue) correspond to mutations predicted to enhance thermostability, while positive ΔΔG values (red) suggest destabilizing effects. Although most substitutions showed little improvement, we observed that mutations clustered in positions 32–38 and 133–139 displayed a distinct blue gradient.

Although these effects are subtle, we believe they may contribute to incremental increases in protein stability. Notably, because IL-10 is a homodimer, these regions exhibit symmetry in their interactions, enabling us to identify pairs of mutations across the dimer interface that could act synergistically to enhance thermostability.

Exported Data (to Google Sheets)

We exported the raw data to Google Sheets for further curation. Only ~1,018 mutations had negative ΔΔG values. Among the viable hits, R97V and R97I ranked highest for stabilizing effect and became focus points for deeper analysis.

ThermoMPNN Data Results on Google Sheets

Figure 3. ThermoMPNN’s data results were organized into Google Sheets. The ddG (kcal/mol) value represents the predicted change in protein stability when a mutation is introduced compared to the wild type. A negative ddG suggests the mutation is stabilizing the protein, while a positive value indicates it may destabilize the structure.

In total, close to 7 thousand point mutations were made, however, only the first 1018 mutations had a stabilizing ddG. These 1018 became the focus of our selection. Here, many mutations that could yield high thermostability results were ruled out by marking “X.” The “X” denotes that ThermoMPNN could not map that mutation to a resolved residue position in the structure used. This usually happens when the residue exists in the input sequence but is missing from the PDB file, or when the alignment between sequence and structure does not line up cleanly due to gaps or numbering mismatches. Another possibility is that the residue was filtered out during preprocessing because no backbone coordinates were available to calculate stability. After establishing this, we focused on the available mutations that we would be able to clearly see the PDB and thermoMPNN mutation position it corresponded to. Mutations at the top of the list such as R97V and R97I immediately caught our attention. These had the most significant stabilizing impact on IL-10’s thermostability and needed more confirmations to decide on its viability.

Our dry lab addresses the problem of wild-type IL-10 being naturally unstable with a ΔG of positive folding value of ~4.36 kcal/mol, predicted by ThermoMPNN. This explains its limited half-life and limited therapeutic potential. Screening of nearly 7,000 mutations, our group utilizedThermoMPNN to predict over 1,000 candidates that have thermally stabilizing influences on the structure of IL-10, with clusters between residues 32–38 and 133–139 showing uniform improvements.


Adding to further confirmations, we modeled all suggested mutations with high thermostability using PyMOL for visualization and biochemical interaction analysis. This allowed us to rule out mutations that imposed steric strain, destroyed salt bridge interactions, or decreased dimer geometry. Moreover, this analysis revealed variants that combined positive rotamers with strong ΔΔG scores. Various combinations of mutations with these affirmed confluences were tested for their ability to function as whole through ESM-generative modeling. These results were successful in showing lower ΔG scores from the original sequence. AlphaFold3 modeling further showed that high-priority mutations preserved IL-10's key receptor interactions with IL-10RA and IL-10RB, evidenced by stable ipTM and pTM values across mutant complexes. Together, these findings suggest that targeted point mutations can greatly enhance IL-10 thermostability without compromising structural integrity or receptor affinity, allowing the feasibility of more stable therapeutic variants.

Quantifying Predicted Stability

We next used the ESM protein language model, which predicts folding free energy (ΔG) based on evolutionary patterns in protein sequences.

ESM Model

The Evolutionary Scale Model (ESM) is a protein language model; essentially it is a deep-learning framework trained on millions of natural protein sequences. By learning the patterns that nature has used to build stable, functional proteins, the model can infer structure- and stability-related signals from an amino-acid sequence alone.

One key output from ESM-based predictors is an estimate of the folding free energy (ΔG). In this context, a lower (more negative) ΔG means the folded state of the protein is energetically more favorable, and therefore the protein is expected to be more stable against unfolding. Conversely, higher (positive) ΔG suggests the protein tends to be less stable and may unfold more easily. By running our IL-10 sequences through this model, we can quantify thermodynamic stability, with negative values indicating a favorable folded state.

Baseline Stability of IL-10

When we analyzed the wild-type (unmutated) IL-10 monomer, the ESM predictor estimated a folding free energy of approximately ΔG ≈ +4.36 kcal/mol. A positive ΔG implies that the isolated monomer, on its own, would not be strongly stable in the folded state. However, IL-10 is naturally a homodimer, meaning two identical subunits pair together. This dimerization greatly improves the effective stability of the protein in vivo, so IL-10 as a whole is functional despite its intrinsically less stable monomer.

Baseline ESM-Predicted Folding Free Energy for Wild Type

Figure 4. Baseline ESM-predicted folding free energy (ΔG) for wild-type IL-10. The ESM model estimated the folding free energy of unmutated IL-10 to be ΔG ≈ +4.36 kcal/mol, indicating that the isolated monomer is relatively unstable and tends to unfold at equilibrium. This baseline value served as the reference for evaluating whether our engineered mutations could lower ΔG and thus improve the protein’s predicted thermostability.

Application of ESM

Using an ESM-based predictor, unmutated IL-10 shows ΔG fold ​ ≈ +4.36 kcal/mol, which indicates that the isolated monomer would be predominantly unfolded at equilibrium, signifying weakness in thermostability. IL-10 functions as a homodimer in vivo, and the dimerization free energy contributes substantially to stabilization such that the dimer can be thermodynamically favored despite the weak intrinsic monomer stability. ESM estimates are derived from statistical patterns in large protein sequence datasets rather than physics-based simulations, so we report values together with the sign convention and interpret them as overall thermostability predictions.

To validate whether our collection of point mutations addresses these problems in functional stability, we ran our point mutations through the software to find the combinations of them that would decrease the predicted ΔG the most.

These single-residue changes may push this ΔG downward, making the protein more likely to hold its folded shape under heat or stress. In practice, decreasing the ΔG means the protein should resist unfolding better, which is the goal when trying to engineer higher thermostability.

Using the ThermoMPNN predicted results, we used visualization softwares to filter for practical mutations that would still be impactful for thermostability improvements (see protein folding). Then, running through the ESM-predictive model, our best combination mutations 1 and mutation 2 of IL-10 yielded results of 4.2645 and 4.25949 ΔG, a significant decrease and improvement from the original IL-10 value of 4.357622. Because ESM stability scores are statistical proxies, not calorimetric measurements it is difficult to estimate the improvement in degrees of these predictions. Hence, wet lab confirmation of this order-of-magnitude prediction is required.

Baseline ESM-Predicted for selected il-10 mutation set

Figure 5. ESM-predicted folding free energy (ΔG) for selected IL-10 mutation set. The mutation combination N148M, R32K, E45M, N126M, T26Q, and Q33D was evaluated using the ESM predictor. Chain A: ΔG ≈ 4.309 kcal/mol, showing a notable reduction compared to the wild-type IL-10 baseline of 4.36 kcal/mol; Chain B: ΔG ≈ 4.378 kcal/mol, which is comparable to the wild type.

These results suggest that the engineered substitutions can incrementally improve the stability of Chain A, which may be beneficial in the context of IL-10’s dimeric structure, although experimental validation is required to confirm these computational predictions.

Figure 6 image.

Figure 6. ESM-predicted folding free energy (ΔG) for alternative IL-10 mutation set. The mutation combination N36C, E45M, S2M, N139M, Q33D, and D19E was evaluated using the ESM predictor. The predicted folding free energy was ΔG ≈ 4.2595 kcal/mol, which represents a greater reduction compared to the wild-type IL-10 baseline of 4.36 kcal/mol than the first mutation set achieved. This result suggests that this alternative combination may offer a more pronounced predicted gain in thermostability, reinforcing its potential as a candidate for experimental validation in improving IL-10’s stability.

Summary

Given above are the mutation lists and their respective ESM-generative model results. These mutations are combinations of those that passed through pyMOL visualization and biochemical analysis of R-group interactions. As shown above, both sets of mutations show a reduction in the deltaG predicted value from 4.36 in the original amino acid sequence down to 4.30895 and 4.2594 on chain A respectively from both groups. These reductions, although numerically small, are meaningful because even fractional shifts in ΔG can translate to noticeable changes in folding stability under physiological conditions. In practice, such improvements often correlate with higher melting temperatures (Tm), allowing the protein to remain functional and resist unfolding at elevated temperatures. This kind of stability is especially valuable for therapeutic proteins like IL-10. While experimental validation is necessary, the computational trend points toward a practical gain in thermostability that could enhance the reliability of IL-10 in clinical applications.

Software & Additional Sources

ThermoMPNN:
https://github.com/Kuhlman-Lab/ThermoMPNN

ThermoMPNN is a computational tool that uses graph-based neural networks to predict how individual amino-acid substitutions affect a protein’s thermostability (ΔΔG). By analyzing both the 3-D structure and sequence context, it highlights candidate mutations that are likely to stabilize or destabilize the protein.

Google Colab:
https://colab.research.google.com/drive/1OcT4eYwzxUFNlHNPk9_5uvxGNMVg3CFA

Google Colab is a cloud-based coding platform that allows researchers to run Python notebooks and computational tools like ThermoMPNN or ESM without needing local installations. It provides an accessible environment for running models, visualizing data, and sharing code reproducibly.

ESM Predictive Modeling
https://github.com/facebookresearch/esm

ESM is a family of protein language models developed by Meta AI that learns evolutionary patterns from large protein sequence datasets. Its predictive tools can estimate properties such as folding free energy (ΔG), helping researchers assess how sequence changes may affect stability and structure.

Documentation
Google Sheets (IL-10 mutations)

Documentation of our IL-10 mutations after being transferred onto Google Sheets; further analysis and details can be found by accessing the link above.

Introduction: Visualization & Binding Tests

Interleukin-10 (IL-10) point-mutation design began with structure-guided inspection in PyMOL, where we loaded available IL-10 homodimer structures and mapped candidate sites from our stability screen onto the 3-D model. PyMOL’s mutagenesis tools let us try specific side-chain replacements, choose low-clash rotamers, and measure distances across the dimer interface to protect key contacts. We used surface coloring and vacuum electrostatics to visualize polarity, salt bridges, and hydrogen-bond networks, then inspected hydrophobic cores to avoid burying polar atoms or exposing non-polar patches that could destabilize folding. Side-chain interactions such as π–π and cation–π stacking were checked alongside van der Waals overlaps to catch subtle steric issues.

This first pass produced a short list of practical mutations that preserved packing, interface geometry, and solvent exposure while aligning with our ΔΔG ranking.

Amino Acid Chemistry Reference

Figure 1. Amino Acid Chemistry Reference; A reference chart of the 21 proteinogenic amino acids, organized by their side-chain properties: charged, polar, hydrophobic, or special cases. This guided our initial understanding of which amino acid substitutions (e.g., polar → hydrophobic) might enhance stability or create clashes when buried.

Finally, we used AlphaFold3 to model the folded mutants in both monomeric and homodimeric states and to predict their complexes with the IL-10 receptor chains IL-10RA and IL-10RB (referred to as receptors A and B). We reviewed confidence metrics such as pLDDT and predicted interface accuracy (ipTM), compared mutant contact patterns to the wild-type complex, and flagged any design that weakened the receptor-binding epitope. Mutations that passed all four gates, (1) PyMOL clash and interaction checks, (2) ThermoMPNN stability predictions, (3) past documentation of successful or failed point mutations, and (4) AlphaFold3 complex prediction, were prioritized as the most likely to maintain correct folding and signaling while improving thermal stability.

PyMOL Visualization

PyMOL visualization allowed us to analyze point mutations at the biochemical and structural level. By focusing on the different R-group interactions, we could compare several key metrics such as steric clashes, hydrogen bonds, salt-bridge formation, and side-chain rotamer strain. Using the mutations that were already thermally favored by ThermoMPNN, we plotted them onto the IL-10 structure in PyMOL to check if these changes would disrupt the protein’s fold or its receptor-binding interface.


Our screening guidelines required that the mutation:

  • cause no severe steric clashes,
  • avoid loss of key H-bonds or salt bridges,
  • avoid disrupting secondary-structure elements,
  • maintain receptor-binding contacts (when near the interface),
  • adopt a low-strain, favorable rotamer conformation

Collectively, we decided to focus on the top ~400 most thermally improved mutations.
We split these into six sub-groups, each responsible for inspecting ~60–70 candidates. The following series of tables summarizes the screening pipeline we used.

PyMOL Clash Visualization

Figure 2. PyMOL Clash Visualization; Close-up PyMOL rendering illustrating a steric clash (red overlap) created by the R106V substitution. This example shows how apparently promising mutations from the initial stability screen were rejected after structural inspection because they disrupted local packing.

Above is a sample Steric clash formation from an originally interesting mutation due to low negative ddG value–R106V. The substitution of arginine with valine at position 106 introduces unfavorable steric effects within the local environment of the protein. In the native structure, arginine provides an extended, flexible, and charged side chain capable of forming stabilizing interactions such as hydrogen bonds or salt bridges. Replacing this residue with valine, a shorter hydrophobic side chain, eliminates these stabilizing contacts and instead introduces van der Waals overlap with nearby atoms. The resulting steric clashes create local strain and disrupt packing, which is predicted to reduce overall stability. Consequently, the R106V mutation is not an ideal candidate for engineering increased thermostability, as it both removes a favorable electrostatic contribution and introduces geometric crowding within the folded state.

Figure 3

Figure 3. Excerpt of a team’s rotamer analysis table; comparing candidate substitutions by rotamer strain, ability to form salt bridges, and proximity to key receptor sites. Favorable rotamers (green-highlighted) indicated that the new side-chain could fit into the existing packing without high energy penalty.

Figure 4

Figure 4. Example PyMOL rotamer-comparison snapshots, showing two candidate residues at the IL-10 dimer interface. Visualization helped confirm that favorable rotamers could pack smoothly without disrupting neighboring residues.

Figure 5

Figure 5. Representative mutation-screening spreadsheet from one team summarizing steric-clash status, rotamer strain, salt-bridge retention, and receptor-site proximity for each candidate. Green rows highlight favorable substitutions that passed all screening checks.

Figure 6

Figure 6. A similar screening spreadsheet from another team, supporting consensus on several high-priority mutations.

Figure 7

Figure 7. Snapshot of the ESM stability-ranking table, showing computational ΔG scores used to cross-validate the structural screening after identifying rotamers. Mutations that were both structurally acceptable and predicted by ESM to reduce ΔG (improve folding stability) were prioritized.

AlphaFold3

Binding to receptors A and B is central to IL-10’s signaling cascade, since engagement with both is required to trigger downstream anti-inflammatory pathways. Without stable binding at these sites, IL-10 cannot effectively suppress pro-inflammatory cytokine release, which is one of its primary roles in immune regulation. Maintaining strong receptor interactions is therefore essential to preserve the therapeutic potential of IL-10 in controlling excessive immune responses.

To ensure that these mutations do not inhibit binding in IL-10’s receptors, mutated amino acid sequences were tested through AlphaFold3. AlphaFold3 is a deep learning model that predicts three-dimensional protein structures directly from amino acid sequences with remarkable accuracy. By modeling how mutations alter the overall fold, it provides a way to anticipate whether local substitutions disturb critical receptor binding sites. This approach is valuable in the context of IL-10 because it allows mutations to be screened for structural compatibility before investing in experimental assays.

Figure 8

Figure 8. AlphaFold3 server interface where wild-type and mutant IL-10 sequences were submitted for complex-structure prediction with receptor A.

Figure 9

Figure 9. AlphaFold3 prediction of the wild-type IL-10 + receptor A complex, showing ipTM = 0.75 and pTM = 0.56. The relatively high interface score indicates a reliable prediction for the receptor-binding interface even though flexible loops lowered the global confidence score.

Figure 10

Figure 10. AlphaFold3 prediction of a top-ranked IL-10 mutant + receptor A complex, showing nearly unchanged ipTM = 0.74 and pTM = 0.56. The similarity to the wild-type scores confirms that the introduced mutations (although aimed at improving thermostability) are not predicted to disrupt receptor binding.*Later named Mutation 1 IL-10 (M1IL10)

Figure 11

Figure 11. AlphaFold3 prediction of the IL-10 mutant in complex with receptor A. The ipTM score of 0.74 and pTM score of 0.57 closely match those of the wild-type IL-10–receptor A complex, indicating that the introduced mutation set is not predicted to disrupt receptor binding or global fold. The consistent interface score further supports that thermostability-oriented substitutions can be introduced without compromising IL-10’s essential receptor-binding interactions (similar to figure 10). * Later named Mutation 2 IL-10 (M2IL10)

Summary

By combining PyMOL visualization, rotamer and steric-clash screening, ESM stability scoring, and AlphaFold3 receptor-binding checks, we systematically narrowed thousands of theoretically possible IL-10 point mutations to a handful of practically viable, structurally consistent, and function-preserving candidates for wet-lab validation.

This integrated computational pipeline reassured us that the selected mutations were both thermostability-oriented and biologically safe, increasing the likelihood of successful experimental testing.