The Major Histocompatibility Complex (MHC) is one of the most polymorphic systems in the human genome. In humans, MHC proteins are encoded by Human Leukocyte Antigen (HLA) genes.
HLA proteins are found on the surface of most human cells and play a central role in regulating the immune system. They act as display platforms, presenting small fragments of proteins (antigens) to immune cells. In this way, HLA enables the immune system to distinguish between the body's own proteins and potentially harmful invaders, such as viruses, bacteria, or abnormal cancer cells. HLA is essentially the human version of the MHC, which is present in many animals (1).
One defining feature of HLA genes is their extraordinary diversity. HLA genes are highly polymorphic, with thousands of different variants existing among humans (4). This genetic variability is a crucial defense mechanism: if everyone carried the same HLA type, a single pathogen could adapt and devastate the population. Instead, diversity ensures that pathogens encounter different immune filters in different individuals, making it far more difficult for them to spread universally.
However, this same variability also creates significant challenges for medicine. Treatments that depend on precise antigen recognition --- such as vaccines, organ transplants, or cancer immunotherapies --- can behave very differently depending on a patient's HLA type. For example, in cancer immunotherapy, immune cells must recognize tumor-specific antigens presented by HLA molecules. Since not all patients display the same antigens in the same way, therapeutic responses can vary greatly between individuals.
MHC molecules are grouped into two main classes:
Figure 1. Illustration of different signalling pathways for MHC classes I & II.(Image Source: Biorender.com)
Our project focuses on MHC Class I, specifically the HLA-A*02:01 allele, one of the most common and well-studied alleles worldwide. This allele has been extensively characterized in structural and immunological studies (5--7) and is widely used in cancer immunotherapy research (8). By selecting HLA-A*02:01, our results can be broadly applicable and comparable with findings from other studies. Focusing on such a common allele also increases the translational potential of our work, ensuring its relevance to real-world medical applications.
Looking forward, researchers are even exploring the idea of a "universal HLA" --- an engineered form that could overcome the challenge of HLA diversity and provide broader therapeutic coverage. While this remains a long-term vision, our project contributes to this larger effort by testing whether engineered HLA molecules can enhance antigen presentation and improve cancer immunotherapy outcomes.
We selected HLA-A*02:01 (HLA-A2) as our model allele for several reasons:
By working with HLA-A*02:01, our results are not only relevant for basic science but also have potential implications for ongoing cancer immunotherapy efforts.
The functional MHC Class I complex is composed of two chains:
While the heavy chain forms the peptide-binding groove, B2M plays a stabilizing role:
Without B2M, the heavy chain cannot achieve its proper conformation, and peptide presentation fails.
To experimentally evaluate our approach, we constructed and used four plasmids, each encoding a key component of the system:
By testing two different mutation strategies against the wild-type, and holding B2M constant, we designed a system that allows direct comparison of wild-type vs. engineered variants under otherwise identical conditions.
To advance the field of synthetic immunology, our team designed and validated a complete set of interoperable parts collectively named the HLA Engineering Toolkit. This toolkit was developed to pursue the Best New Parts Collection special prize and demonstrates a unified, end-to-end pipeline for engineering, refolding, and characterizing human immune display systems in E. coli.
The collection includes four standardized, Registry-compatible plasmids: HLA Engineering Toolkit (New Parts Collection): c64ee9f8-f6ec-4ff4-9da1-38ee16033a42
Together, these parts form a complete, modular MHC Class I system that allows direct comparison of engineered variants under standardized experimental conditions. This integrated structure — rather than a single part — is what defines our eligibility for the Best New Parts Collection award.
Our aim was not simply to create new parts, but to build a rational engineering framework for immune display systems.
The HLA Engineering Toolkit was specifically designed to:
By meeting these goals, the toolkit moves synthetic biology closer to rational immune system engineering — bridging molecular design, wet-lab assembly, and translational application.
Each construct was designed to balance experimental practicality and biological relevance:
By combining these design choices, we produced an experimentally robust and biologically relevant toolkit that models how mutational design influences antigen presentation.
All four constructs were expressed and validated using a unified, reproducible protocol. The full process includes:
Each construct was verified to produce correctly folded, biochemically active complexes under these conditions — confirming that the parts function as a single, coherent system.
We characterized our toolkit collectively and comparatively:
Each construct was verified to produce correctly folded, biochemically active complexes under these conditions — confirming that the parts function as a single, coherent system.
The HLA Engineering Toolkit is a versatile platform with broad applicability:
This toolkit transforms HLA research from a specialized immunology task into a standardized synthetic biology platform — accelerating innovation in immune engineering.
Through iterative design, expression, and refolding, our project validated a complete experimental ecosystem for HLA engineering. Key insights include:
Beyond technical success, our project demonstrates that HLA molecules can be rationally redesigned — paving the way for next-generation synthetic immunology tools.
The HLA Engineering Toolkit is more than just a collection of plasmids — it represents a complete, practical system for studying and engineering human immune proteins in a synthetic biology setting. Through this project, we showed that complex human molecules such as HLA Class I can be expressed, refolded, and functionally tested in E. coli using standardized methods. This achievement is important because it proves that even difficult, multi-domain human proteins can be handled through a systematic Design–Build–Test–Learn approach. By integrating computational predictions, controlled mutagenesis, and careful experimental validation, our toolkit bridges the gap between theoretical protein design and real, testable biological function.
The significance of this project lies in how it transforms HLA biology into something that can be engineered and studied with the same logic used in synthetic biology. Each component — wild type, mutants, and β2M — plays a specific role, but their real strength appears when used together as an integrated system. This structure allows direct comparison of engineered variants, helping researchers understand how specific mutations influence folding, stability, and peptide binding. Because the entire workflow is open and repeatable, future iGEM teams can reuse these parts, adapt the design for other HLA alleles, or apply the same principles to different protein families.
More broadly, the HLA Engineering Toolkit demonstrates how synthetic biology can contribute to medicine and biotechnology. It provides a foundation for developing improved cancer immunotherapies, vaccine candidates, and other applications where antigen presentation is important. At the same time, it serves as an educational model — a hands-on example of how computational design, molecular biology, and experimental testing come together in one continuous cycle. In this sense, the toolkit embodies the spirit of iGEM: using engineering principles to make complex biology understandable, reusable, and ultimately, useful for solving real-world challenges.
In addition to iGEM Parts Registry, full-length sequences have been organized into PDF files as below. All of the sequences were analyzed using SnapGene software.