Antifreezing proteins

Through comprehensive literature review and with guidance from Prof. Lei Zhang's research group, we acquired 11 phylogenetically diverse, family-representative antifreeze proteins (AFPs) as chassis scaffolds. These proteins underwent systematic evaluation via fermentation-based expression, ice recrystallization inhibition (IRI) assays, and differential scanning calorimetry (DSC) characterization. Subsequent rational protein engineering and directed evolution enabled screening of variants with enhanced cryoprotective activity for functional applications in: Cosmeceutical formulations (e.g., skin cryoprotectants), dermatological barrier products (e.g., frost-protective hand creams), bioinspired anti-icing coatings, to mitigate low-temperature challenges in textile and fashion industries.

Table 1. Introduction to 11 Types of AFPs
Tianjin 2025 Model Framework Diagram
Tianjin 2025 Model Framework Diagram

We first obtained the expression plasmid pET28a-AFPs, which carries the antifreeze protein gene, through commercial DNA synthesis. This plasmid was then transformed into Escherichia coli strains. After fermentation and induction of protein expression, the bacterial cells were lysed to extract the target antifreeze proteins. The antifreeze activity of the obtained proteins was further characterized using the Ice Recrystallization Inhibition (IRI) assay, and differential scanning calorimetry (DSC) characterization.

Modification and Design of AFPs

After obtaining the natural antifreeze protein, we implemented rational protein engineering of the AFPs to enhance cryoprotective efficacy. This optimization strategy was guided by in silico predictions from molecular dynamics simulations and binding energy calculations, enabling targeted modifications to improve ice-binding functionality.

Site-Directed Mutagenesis

The work of our Dry lab leveraged the ESM-1v protein language model to identify mutation sites, distinguishing: evolutionarily conserved core residues critical for structural stability, surface-exposed variable residues as prime targets for functional optimization.

We prioritized directional mutagenesis on surface variable regions while preserving global folding integrity. Four helical AFPs (PDB: 3ULT, 3WP9, 4NU2, 6A8K) were selected for mutational simulation. Computational predictions (see Model section) confirmed the feasibility of enhancing ice-binding activity without compromising structural integrity.

Yeast surface display system schematic
Figure 1. ESM-1v Prediction Outcomes

Guided by in silico results, we introduced site-directed mutations into the gene sequences of: 3ULT (named as MUT4), 3WP9 (named as MUT8), 4NU2 (named as MUT9) and 6A8K (named as MUT10).

We obtain gene sequences with the pET28a vector through gene synthesis. Then transformed it into E. coli for fermentation. The crude enzyme and crude enzyme extract were characterized for antifreeze activity using IRI experiments.

Yeast surface display system schematic
Figure 2. Site-Directed Mutagenesis Genetic Circuits for 3ULT, 3WP9, 4NU, and 6A8K
Inverse Folding

Using an inverse folding approach, we employed 3ULT, 3WP9, 4NU2, and 6A8K as initial templates. Based on a multi-objective optimization framework, we integrated deep learning with molecular simulation techniques to achieve systematic improvement of AFP performance, resulting in optimized novel sequences for AFPs.

We obtain gene sequences through gene synthesis. Expression was induced through fermentation, and antifreeze performance was characterized via IRI assays.

Design and Construction of Novel AFPs

De novo Design

Beyond mutational modification of existing antifreeze proteins as templates, we also designed novel antifreeze proteins in De novo way. First, computational design and prediction were performed from three perspectives: structural periodicity, ice lattice matching, and AI-generated backbones. Computational models including RFdiffusion2 and ProteinMPNN were sequentially applied to provide theoretical basis and design templates for engineering antifreeze functionality in strains. Based on computational predictions, we aim to construct novel antifreeze proteins with different functional priorities, with detailed workflows and simulation methods provided in Model.

Yeast surface display system schematic
Figure 3. Simplified Process of Designing Strategies

Based on computational design results, the output sequences will be used to synthesize novel antifreeze protein genes via gene synthesis. This will enable construction of expression-ready pET28a-De novo series plasmids. Antifreeze performance will then be characterized through IRI assays.

Protein Domain Fusion

In preliminary studies, we observed significant structural differences in antifreeze proteins from different biological sources. For example, Type I AFPs are predominantly α-helical, while Type III AFPs mainly consist of β-sheets. Existing research has demonstrated that domain fusion of glucoamylase genes can yield chimeric enzymes with novel properties and improved thermostability[12]. Inspired by this, we hypothesize that recombining structural domains from different antifreeze proteins may influence their antifreeze performance.

We conducted systematic domain fusion design to further enhance the functional synergy of the chimeric proteins. By rationally recombining structural domains from AFPs of different origins using flexible linkers (GGGGSGGG) and rigid linkers (EAAAKEAAAKEAAAK), we optimized their spatial orientation, enabling more efficient interaction with ice crystal surfaces and thereby improving antifreeze performance.

We will obtain genes encoding domain-fusion proteins through gene synthesis and construct pET28a-wpy series plasmids for surface display of these fusion proteins. These plasmids will be transformed into E.coli for fermentation-induced expression. Antifreeze performance will then be characterized via IRI assays.

Surface Display System

Based on industry consultations with SNEFE Research Institute and Bloomage Biotechnology Co., Ltd. (detailed in Human Practices), we identified significant challenges in scale-up AFP production: extensive protein purification steps incur substantial time-cost expenditures while yielding suboptimal purity. To address these industrial constraints, we engineered an AFP chassis strain incorporating a yeast surface display system.

The EBY100 platform utilizes the Saccharomyces cerevisiae α-agglutinin anchoring mechanism. Genomically expressed Aga1p covalently binds β-glucans in the cell wall extracellular matrix. Plasmid-encoded Aga2p (expressed from pYD1 vector) forms disulfide bonds with Aga1p, enabling surface localization. Target AFP fusion constructs are thus displayed via C-terminal fusion to Aga2p[13].

AFP dataset curation pipeline overview
Figure 4. Mechanism of Surface Display System
11 Types of Natural AFPs

We genetically engineered the Saccharomyces cerevisiae EBY100 strain by fusing 11 antifreeze protein (AFP) genes to the Aga2p coding sequence via a glycine-serine linker (GS linker), constructing AFP-Aga2p fusion cassettes. Post-translationally, the fusion proteins form disulfide bonds with cell wall-anchored Aga1p, resulting in extracellular localization of AFPs on the yeast surface. This configuration enables simultaneous intra- and extracellular cryoprotection, establishing a prototype antifreeze chassis strain with dual-phase cryopreservation functionality.

AFP dataset curation pipeline overview
Figure 5. Gene Circuit for Surface Display of AFPs
Site-directed mutagenesis

Based on the results, we selected high-performance point-mutated AFP genes and seamlessly cloned them into the pYD1 vector to construct the plasmid PR-DK, which enables surface display of the mutated AFPs. This plasmid was then transformed into the S. cerevisiae strain EBY100 for induced fermentation.

AFP dataset curation pipeline overview
Figure 6. Gene Circuit for Surface Display of AFPs

The antifreeze performance of the engineered yeast chassis was evaluated through IRI experiments and yeast freezing-thaw survival rate assays. A comparison with the original AFP-based chassis allows us to assess the effectiveness of the site-directed mutations.

Helical domain replacement

During structural analysis of 11 AFPs for point mutation design, we observed that Type I antifreeze proteins contain multiple aggregated α-helical structures. Given that ice-binding surfaces are enriched with alanine[14] and abundant alanine facilitates α-helix formation, we hypothesized that enhancing the α-helical content of antifreeze proteins could improve cryoprotective capability. Based on this, we propose that either truncating the α-helical regions of antifreeze proteins or introducing α-helical fragments with superior antifreeze activity from other species may modulate their cryoprotective performance.

We selected the 4MTJ protein as the engineering template, incorporating α-helices from 4NU2 and 6A8K for substitution. Concurrently, we computationally designed de novo α-helical structures (Helix A and B) conforming to antifreeze protein characteristics. Through PCR primer-mediated mutagenesis, we introduced mutations into the 4MTJ gene sequence to achieve helical replacements, thereby integrating α-helical structures predicted to enhance ice-binding activity into the 4MTJ protein architecture.

AFP dataset curation pipeline overview
Figure 7. Schematic of Helical Domain Replacement Genetic Circuit for 4MTJ

Trehalose Synthesis Pathway

After preliminary construction and characterization, we have initially obtained antifreeze chassis systems based on natural antifreeze proteins, as well as improved or even de novo designed chassis systems with enhanced antifreeze performance. During discussions with industry partners, concerns were raised regarding the limited antifreeze efficacy of single AFP, as well as the narrow scope of the evaluation criteria for cold resistance. In response to these concerns, under the guidance of Prof. Yi Wu, we obtained a yeast strain with high stress tolerance from his laboratory. The stress resistance of this strain is attributed to the endogenous synthesis of multiple protective metabolites. After comprehensive evaluation, we selected trehalose as a key functional component for its ability to effectively inhibit ice recrystallization and its excellent water-retention properties, which contribute to enhanced cell viability under low-temperature stress. We aim to further amplify its biosynthetic pathway and integrate it into our antifreeze chassis strain, thereby achieving a synergistic effect of antifreeze and hydration through the combined action of trehalose and AFP.

In collaboration with Professor Wu’s team, we utilized the yeast strain to perform chromosomal aneuploidy amplification of chromosome III under environmental stress conditions. Through chromosomal rearrangement and a stepwise screening strategy, we progressively narrowed down the functionally relevant genomic regions. Ultimately, we identified five key genes associated with the regulation of trehalose synthesis.

Building on this foundation, we further propose a synergistic mechanism between trehalose and antifreeze proteins, aiming to enhance the anti-freezing capacity of the chassis strain.

AFP dataset curation pipeline overview
Figure 8. Trehalose Synthesis Genetic Circuit

We obtained five key genes for endogenous trehalose synthesis via PCR amplification from yeast strains. To streamline downstream construction, we first ligated these genes into two fragments using Overlap Extension PCR: a 4902bp fragment (gene1-gene2-gene3) and a 4002bp fragment (gene4-gene5), with homologous arms incorporated via primers. Subsequently, both fragments were assembled with the pRS413 vector backbone through Gibson Assembly, yielding the recombinant plasmid pRS413-trehalose containing all five target genes. This plasmid was co-transformed separately with each of the pR_DK05, pR_DK09, pR_DK10, and pR_DK11 plasmids into Saccharomyces cerevisiae EBY100, generating engineered chassis strains capable of simultaneous endogenous trehalose synthesis and AFP expression under freezing Stress. Following fermentation-induced expression, cryoprotective effects were characterized through IRI assays of cell suspensions and freeze-thaw survival rate quantification of yeast cells.

Performance was systematically compared to baseline strains expressing only surface-displayed native AFPs without trehalose synergy, thus demonstrating the cooperative cryoprotection conferred by integrating trehalose biosynthesis with AFP expression.

AFP dataset curation pipeline overview
Figure 9. Trehalose-Antifreeze Protein Synergistic Genetic Circuit

Multiplexed Site-Specific Rearrangement System

Recognizing that co-expression of multiple AFPs may enhance cryoprotective performance, we implemented the Cre/loxP stochastic recombination system—inspired by existing literature—to screen for optimal AFP combinations.

This system employs Cre recombinase (a tyrosine recombinase) and loxP sites to enable site-specific genomic rearrangements including deletion, inversion, and recombination under controlled conditions[15].

To construct the required genetic tool, we amplified the Cre recombinase gene from a BioBrick template via PCR, and subsequently assembled it with the pET28a vector backbone using Gibson Assembly, yielding the expression plasmid pET-Cre for inducible Cre production.

AFP dataset curation pipeline overview
Figure 10. Genetic Circuit for Cre Recombinase Expression

For vector construction, we performed PCR amplification on the BioBrick pSC101 element to generate two fragments: the Rep101 protein gene-origin of replication and the araC protein gene. Concurrently, the AmpR resistance gene fragment was amplified from the pYD1 vector. These three fragments were then assembled via Overlap Extension PCR (OE-PCR) to construct the complete vector backbone (AmpR-Rep101-Ori-araC).

Regarding the AFP expression cassettes, each AFP cassette was designed as a rearrangeable unit. To achieve this, we flanked each AFP gene with a promoter and terminator, while incorporating loxP sites at both ends through primer design. This configuration enables Cre-mediated cassette rearrangement.

In this system, we selected the arabinose-inducible promoter from pBAD and a conventional T7 terminator. Through PCR amplification of the BioBrick pBAD element, we obtained the BAD promoter and T7 terminator fragments. Homologous arms were introduced via primers to connect terminator of the current protein, promoter of the subsequent protein gene and the target AFP gene.

Using OE-PCR, these elements were assembled into 10 distinct rearrangement units. Subsequent pairwise OE-PCR connections yielded a full-length gene fragment ready for cassette rearrangement under Cre recombinase control.

AFP dataset curation pipeline overview
Figure 11. Technical workflow summary
AFP dataset curation pipeline overview
Figure 12. Genetic circuit for AFP expression cassette rearrangement

The full-length gene fragment was assembled with the engineered vector via Gibson Assembly to construct the expression plasmid. This plasmid was then co-transformed with the pET-Cre plasmid into Saccharomyces cerevisiae cells. Upon induction, Cre recombinase facilitated stochastic rearrangement of the AFP expression cassettes. Transformants were screened through freeze-thaw survival rate assays, identifying clones with superior cryoprotective phenotypes. Selected high-performance clones underwent Sanger sequencing to determine the optimal AFP gene combination.

AFP dataset curation pipeline overview

Figure 13. Map of Cre-loxP Rearrangement Plasmid pTJ25-CL

Multiplexed site-specific inversion system

Through the Cre/loxP stochastic rearrangement system, we can obtain optimal AFP combinations. However, differential expression intensity ratios among these genes may significantly impact the cryoprotective capacity of the chassis strain. Inspired by the work of Prof. Wu Yi's team[16], we propose implementing a multiplexed inversion system to screen optimal gene expression ratios, enabling precise control of target protein expression levels.

As a first step, we acquired the plasmid pRS415-GAL1-Rci from Prof. Wu Yi's laboratory, which expresses the DNA invertase Rci.

AFP dataset curation pipeline overview

Figure 14. Map of Rci invertase expression plasmid pRS415-GAL1-Rci

During the construction of the invertible plasmid, we first established a promoter library with varying strengths using PCR and OE-PCR techniques. The promoters included strong promoters (pGMP1, pTDH3, pTEF1), medium-strength promoters (pPGK1, pPDC1, pTEF2, pTPI1), and weak promoters (pCHO1, pADH1, pCYC1). Each promoter was flanked by sfxa101 sites at both the upstream and downstream ends, forming an invertible cassette. This cassette serves as a modular element for subsequent expression regulation and screening. These promoters regulate target gene expression to modulate intracellular protein yield.

Subsequently, we amplified AFP gene fragments by PCR, adding 40bp homology arms to both ends for seamless fusion with promoters and terminators. These fragments serve as building blocks for inversion cassettes, using pRS413 as the vector backbone.

Next, we employed OE-PCR to assemble each AFP gene with promoter and terminator modules, forming complete inversion cassettes (basic functional units).

The plasmids pTJ25-MI and pRS415-GAL1-Rci were co-transformed into the engineered yeast strain. Under the action of the Rci enzyme, the promoter library fragment in pTJ25-MI undergoes multiple inversions, with each inversion state corresponding to a distinct expression strength. Clones with superior antifreeze performance were identified and characterized through yeast freezing survival assays. This system enables reversible regulation of AFP expression under different conditions, facilitating the optimization and screening of expression levels.

AFP dataset curation pipeline overview
Figure 15. Promoter library invertible units gene circuit
AFP dataset curation pipeline overview

Figure 16. Map of multiplex inversion plasmid pTJ25-MI

References

[1] Sun T, Lin F, et al. An Antifreeze Protein Folds with an Interior Network of More Than 400 Semi-Clathrate Waters. Science. 2014;343(6172):795-798. doi:10.1126/science.1247407
[2] Nishimiya Y, Kondo H, et al. Crystal structure and mutational analysis of Ca²⁺-independent type II antifreeze protein from longsnout poacher, Brachyopsis rostratus. J Mol Biol. 2008;382(3):734-746. doi:10.1016/j.jmb.2008.07.042
[3] Ko T, et al. The Refined Crystal Structure of an Eel Pout Type III Antifreeze Protein RD1 at 0.62-Å Resolution Reveals Structural Microheterogeneity of Protein and Solvation. Biophys J. 2003;84(2):1228-1237. doi:10.1016/S0006-3495(03)74938-8
[4] Middleton AJ, Marshall CB, et al. Antifreeze protein from freeze-tolerant grass has a beta-roll fold with an irregularly structured ice-binding site. J Mol Biol. 2012;416(5):713-724. doi:10.1016/j.jmb.2012.01.032
[5] Leinala EK, Davies PL, et al. A β-helical antifreeze protein isoform with increased activity: structural and functional insights. J Biol Chem. 2002;277(37):33349-33352. doi:10.1074/jbc.M205575200
[6] Pentelute BL, Gates ZP, et al. X-ray Structure of Snow Flea Antifreeze Protein Determined by Racemic Crystallization of Synthetic Protein Enantiomers. J Am Chem Soc. 2008;130(30):9695-9701. doi:10.1021/ja8013538
[7] Garnham CP, Campbell RL, Davies PL. Anchored clathrate waters bind antifreeze proteins to ice. Proc Natl Acad Sci U S A. 2011;108(18):7363-7367. doi:10.1073/pnas.1100429108
[8] Hanada Y, Nishimiya Y, et al. Hyperactive antifreeze protein from an Antarctic sea ice bacterium Colwellia sp. has a compound ice-binding site without repetitive sequences. FEBS J. 2014;281(16):3576-3590. doi:10.1111/febs.12878
[9] Do H, Kim SJ, et al. Structure-based characterization and antifreeze properties of a hyperactive ice-binding protein from the Antarctic bacterium Flavobacterium frigoris PS1. Acta Crystallogr D Biol Crystallogr. 2014;70(Pt 4):1061-1073. doi:10.1107/S1399004714000996
[10] Kondo H, Mochizuki K, Bayer-Giraldi M. Multiple binding modes of a moderate ice-binding protein from a polar microalga. Phys Chem Chem Phys. 2018;20(38):25295-25303. doi:10.1039/C8CP04727H
[11] O'Brien JR, Raynaud C, et al. Insight into the Mechanism of the B₁₂-Independent Glycerol Dehydratase from Clostridium butyricum: Preliminary Biochemical and Structural Characterization. Biochemistry. 2004;43(16):4635-4645. doi:10.1021/bi035930k
[12] Chen Z, Wang L, et al. Improving Thermostability of Chimeric Enzymes Generated by Domain Shuffling Between Two Different Original Glucoamylases. Front Bioeng Biotechnol. 2022;10:881421. doi:10.3389/fbioe.2022.881421
[13] Boder ET, Wittrup KD. Yeast surface display for screening combinatorial polypeptide libraries. Nat Biotechnol. 1997;15(6):553-557. doi:10.1038/nbt0697-553
[14] Davies PL, Baardsnes J, et al. Structure and function of antifreeze proteins. Philos Trans R Soc Lond B Biol Sci. 2002;357(1423):927-935. doi:10.1098/rstb.2002.1081
[15] Lin Q, Qi H, et al. Robust orthogonal recombination system for versatile genomic elements rearrangement in yeast Saccharomyces cerevisiae. Sci Rep. 2015;5:15249. doi:10.1038/srep15249
[16] Li J, Gong S, et al. Creation of a eukaryotic multiplexed site-specific inversion system and its application for metabolic engineering. Nat Commun. 2025;16:1918. doi:10.1038/s41467-025-57206-w