SPARC

Loading page...

MODEL

Our model, SPARC, is a powerful tool designed to accelerate the development of personalized cancer therapies by optimizing synthetic immune cell systems. By integrating deep learning-based de novo binder design with physics-based molecular dynamics (MD) simulations, SPARC enables the rapid prototyping of protein binders, receptor configurations, and synthetic circuits. Our integrated MD-based pipeline provides a reliable pre-screening metric for binder selection, outperforming traditional methods and reducing experimental costs. We then successfully validated novel binders against tumor-relevant ligands in the wet lab, showcasing our model's predictive capabilities.

Integrating these findings into our mathematical framework of receptor dimerization and intracellular phosphorylation dynamics aided us in our experimental design. This enabled us to not only navigate the design space of our part collection, but also optimize our cells to sense stimulatory and inhibitory ligands simultaneously. This is complemented by a digital database that maps the behavior of various circuit architectures to target diverse tumor types. By streamlining the design process, SPARC offers a safe, adaptable and scalable solution for personalized cancer therapies, ultimately accelerating the path to clinical application.

Introduction

Our PHOENICS system is a cutting-edge synthetic biology platform designed to respond to specific tumor microenvironments by leveraging modular receptor systems and logic-based phosphorylation circuits. By equipping tumor-homing chassis cells with a retargetable synthetic receptor architecture and an effector-coupled intracellular processing unit, PHOENICS cells have the potential to precisely detect and respond to multiple cancer-related cues. This allows for targeted therapeutic strategies that can be adapted to diverse tumor types, representing a significant advancement in personalized oncology.

A key challenge in developing such a sophisticated system is understanding the complex interactions within the PHOENICS cell. The intricacies of receptor dimerization, ligand binding, and intracellular signaling require accurate prediction and optimization before experimental testing. This is why we have created SPARC – the Simulator for Phosphorylation And Receptor Characterization. This framework allows us to build a computational replica of the PHOENICS cell system, enabling us to simulate and analyze its behavior under varying conditions. At its core, our model serves three main purposes:

Binder Development

SPARC plays a crucial role in expanding the range of tumor ligands our system can target. By designing and testing novel binders in silico, we can enhance the modularity of our receptors and ensure their ability to sense a broader array of TME-associated proteins.

Enhanced Understanding

Our digital twin provides deep insights into the complex biochemical processes that govern receptor-ligand interactions and downstream signaling, giving us a clearer understanding of the system's dynamics and switching characteristics.

Designing Optimal Cellular Architectures

SPARC allows us to explore different configurations of receptor and signaling components, helping us identify the most effective designs for a desired therapeutic outcome.

The development of the PHOENICS toolkit and its digital twin was built on several core components in our dry lab modeling efforts. We started with the design and generation of novel binders leveraging the cutting-edge de novo protein design tool BindCraft. Novel binders were then characterized using advanced molecular dynamics simulations, which provided accurate rankings of the binder candidates based on their binding affinities. To understand the dimerization behavior of our MESA receptors in the membrane, we leveraged coarse-grained simulations, which allowed us to optimize transmembrane domain selection based on their background dimerization rates. Finally, the entire PHOENICS system was integrated into a mathematical framework that simulates receptor-ligand binding, receptor dimerization, and phosphorylation-based signaling, helping us to predict system behavior under various ligand concentrations.

Our modeling pipeline is tightly integrated with our wet lab efforts, where we validated key findings through experimental assays such as the Cell-Free Two-Hybrid binding assay and flow cytometry. This iterative feedback loop between computational modeling and experimental validation ensured that the digital twin accurately captures the system's biological behavior, while simultaneously optimizing our part designs for in vitro testing.

Expanding Tumor Targeting Potential

De novo Protein Binder Generation

A crucial aim of our dry lab efforts was to expand the targetable ligand space that our synthetic receptor can sense beyond those initially tested in our wet lab. Leveraging the modular architecture of our sensing platform, we focused on generating novel binding domains for a variety of TME-associated proteins. This approach enhances the system's ability to adapt to and target different tumor types, paving the way for more versatile and personalized therapies.

While antibody fragment engineering has long been a standard approach for target recognition, generating large panels of antibody fragments is time-consuming and often not feasible (Wang et al., 2025). In recent years, de novo protein binder design has emerged as a powerful alternative, driven especially by generative deep learning models such as RFdiffusion and ProteinMPNN (Bennett et al., 2023; Watson et al., 2023). Earlier this year, BindCraft was published as a next-generation platform, capable of generating protein binders with nanomolar affinities (Pacesa et al., 2025). Its neural network-based hallucination model leverages the predictive power of AlphaFold, and offers both accessibility and efficiency for rapid binder prototyping. Within SPARC's modeling framework, we integrated BindCraft to extend the modularity of our receptor platform, particularly due to its proven, strong performance coupled with ease of use and tunability.

We focused our binder generation efforts on ligands of high relevance in the tumor microenvironment (TME). These included the homodimer growth/differentiation factor 15 (GDF15), which is upregulated in the pancreatic TME (Zhu et al., 2024) as well as lymphocyte antigen 6 family member K (LY6K), a protein overexpressed in cervical cancer (Luo et al., 2016). We furthermore sought to target the monomeric ligand osteoglycin (OGN), which exhibits increased expression in pancreatic cancer, however is downregulated in cervical cancer (Robinson et al., 2019).

Results

To initiate binder design, we predicted 3D structures of the target ligands using AlphaFold3 (Abramson et al., 2024). These predicted structures were visually analyzed in PyMOL, where unstructured loops and low-confidence regions were trimmed (Schrödinger, LLC, 2015). Stable regions with high prediction confidence (pLDDT > 90) were defined as optimal binder interfaces. Using standard BindCraft settings (c.f. SPARC user manual), we generated 10 binders for each target with sequence lengths within 60–150 amino acids. These binders were successfully docked to the selected interface regions, and the interface predicted template modeling (i_pTM) score was used as an initial ranking metric by BindCraft, estimating binding affinity (Figure 1 A-C). Binders with an i_pTM score of above 0.8 were then selected for experimental validation in the Cell-free Two-Hybrid wet lab binding assay (click to see our results).

A

Frame 0 / 0

B

Frame 0 / 0

C

Frame 0 / 0

D

Frame 0 / 0

Figure 1: 3D structure of target-binder complex. The rotatable structure of BindCraft-generated binders docked to the target (A) GDF15, (B) OGN and (C) LY6K. (D) 3D structure of arodyn-fused binder (6 streically unhindered residues shown in red) bound to GDF15 for implementation in GPCR-based PAGER architecture.

To adapt de novo generation of binders to our GPCR-based receptor architecture, an additional modification of the BindCraft tool was required. This involved integrating an arodyn moiety at the N-terminus of the binder to meet the functional requirements for this receptor design, which would enable effective protein-gating for receptor activation (Kalogriopoulos et al., 2025). While BindCraft does not natively support this modification, we successfully tuned the tool's sequence initialization and backpropagation functionalities. BindCraft's hallucination model now accounts for the presence of the arodyn-terminus and optimizes the binder sequence accordingly (c.f. SPARC user manual). This adaptation allowed us to generate a library of arodyn-fused binders targeting GDF15, specifically optimized for our GPCR receptor component. Importantly, this modification did not compromise the strengths of BindCraft, preserving its ability to generate high-affinity, well-structured binders, thereby extending the sensing potential of the PHOENICS cell (Figure 1 D).

Outlook

Although BindCraft provides the i_pTM metric as an internal ranking criterion, we found that this score is not a reliable predictor of binding affinity for further experimental characterization. This was additionally emphasized by Dr. Martin Pacesa, the first author of the BindCraft publication, who underlined the fact that the i_pTM score is rather a binary estimator than a ranking metric. Therefore, to streamline the process from in silico generation to wet lab testing, our modeling framework required a more robust and trustworthy predictor of binding affinities. Developing such a metric was crucial for pre-filtering candidates and prioritizing those most likely to succeed in downstream characterization assays like \( K_D \)-value measurement.

Affinity-based Binder Characterization

To further refine our binder selection, we identified the need for a more robust characterization of the binding affinities of the BindCraft generated binders. To address this, we leveraged molecular dynamics simulations, particularly the umbrella sampling (US) method, to predict the free energy profiles of our binder-target complexes, from which a thermodynamically relevant dissociation constant can be derived. While alternative methods, such as alchemical calculations, could have been employed, they do not adequately capture entropic effects and lack the precision required for our application (Govind Kumar et al., 2023). In contrast, our physics-based US approach offers a more reliable ranking of binders (Deng & Roux, 2009), facilitating the pre-selection of candidates for further experimental characterization with techniques like surface plasmon resonance (SPR) or quartz crystal microbalance with dissipation monitoring (QCM-D). Though essential for obtaining accurate binding affinity measurements, these methods are costly and impractical for large-scale testing, which further emphasizes the necessity of a validated pre-screening pipeline (Easley et al., 2022; Kartal et al., 2021).

Besides US, we also tried to estimate binding affinities using the open-source machine learning tool Prodigy, a predictor which is solely based upon interfacial contacts for a protein-protein interaction (Vangone & Bonvin, 2017). With this approach, we also sought to compare and interpret the utility of both ML-based and physics-based methods, to truly find out the optimal pre-screening tool for integration into SPARC's complete pipeline.

Results

The umbrella sampling protocol showed successful steered MD simulation of the binder and target for all cases, which yielded satisfactory histogram overlaps of the unbiased count distributions \( H(\xi) \) (Figure 2 A-B). The resulting PMF showed convergence after a COM distance of \( 7\ nm \), as seen by the plateau in the unbound state (Figure 2 C) for binder 3 for GDF15.

A

Click to load molecular trajectory
Frame 0 / 0

B

Histogram US

C

PMF Curve US

Figure 2: Steered MD and umbrella sampling of GDF15 and Binder 2. (A) The trajectory of the 10 ns steered MD simulation (\( k = 1000\frac{kJ}{nm^2} \)) is visualized. (B) The unbiased count distributions \( H(\xi) \) of each umbrella sampling window at the RC \( \xi_0 \) are shown, displaying significant overlap. (C) Resulting PMF curve \( G(\xi) \) generated using WHAM for the umbrella sampling simulation. The binding free energy of \( -25.4\frac{kJ}{mol} \) was calculated from the energy in the bound state \( \Delta G_{bound}(\xi) \) and unbound state \( \Delta G_{unbound}(\xi) \).

All generated binders were characterized using the US approach, and their \( K_D \) values were calculated from the corrected \( \Delta G° \) values (Tables 1-3). Additionally, Tables 1-3 show the dissociation constants predicted by Prodigy, which generally estimates most binders to be in the nanomolar range.

Table 1: Corrected standard binding free energies \( \Delta G° \) and dissocition constants \( K_D \) for GDF15 binders estimated from US and Prodigy. The resulting corrected \( \Delta G° \) is calculated according to (Equation \eqref{eq:P5}) using the restraint correction \( \Delta G_R \) and standard state volume correction \( \Delta G_V \). All binding energies are given in \( [\frac{kJ}{mol}] \), for binders 1, 3, 5 and 7 the mean and standard error of mean are given for measured triplicates. \( K_D \) values derived from US are calculated using (Equation \eqref{eq:P6}) and the Prodigy predicted values are also also shown for comparison.
Binder \(\Delta G_{\text{PMF}}\) \(\Delta G_{\text{V}}\) \(\Delta G_{\text{R}}\) \(\Delta G°\) \(K_{\text{D}}\) [μM] \(K_{\text{D}}\) (Prodigy) [μM]
1 \(-18.99 \pm 4.29\) \(-10.26 \pm 0.63\) \(19.6 \pm 0.22\) \(-9.64 \pm 4.42\) \((4.15 \pm 4.00) \times 10^{4}\) \(1.14 \times 10^{-2}\)
2 \(-25.8\) \(-11.19\) \(21.84\) \(-15.86\) \(1.73 \times 10^{3}\) \(2.23 \times 10^{-1}\)
3 \(-24.74 \pm 3.10\) \(-11.25 \pm 0.10\) \(20.25 \pm 0.20\) \(-15.74 \pm 3.10\) \((2.88 \pm 3.04) \times 10^{3}\) \(1.52 \times 10^{-1}\)
4 \(-19.38\) \(-11.26\) \(20.15\) \(-10.49\) \(1.49 \times 10^{4}\) \(1.11 \times 10^{-1}\)
5 \(-18.20 \pm 2.52\) \(-10.23 \pm 0.15\) \(19.23 \pm 0.25\) \(-9.19 \pm 2.48\) \((3.24 \pm 2.15) \times 10^{4}\) \(3.72 \times 10^{-1}\)
6 \(-22.57\) \(-9\) \(18.91\) \(-12.67\) \(6.23 \times 10^{3}\) \(5.79 \times 10^{-1}\)
7 \(-49.5 \pm 0.85\) \(-10.46 \pm 0.28\) \(19.53 \pm 0.28\) \(-40.44 \pm 0.85\) \((9.37 \pm 3.13) \times 10^{-2}\) \(1.39 \times 10^{-1}\)
8 \(-17.17\) \(-8.02\) \(19.65\) \(-5.54\) \(1.09 \times 10^{5}\) \(4.76 \times 10^{-1}\)
9 \(-17.18\) \(-10.26\) \(19.6\) \(-7.83\) \(4.32 \times 10^{4}\) \(1.51 \times 10^{-1}\)
10 \(-10.57\) \(-9.06\) \(19.68\) \(0.05\) \(1.02 \times 10^{6}\) \(1.90 \times 10^{-1}\)

Results: Wet lab binder and metric validation

In the wet lab, we additionally tested the binding strength of five GDF15 binders using the fluorescence readout-based Cell-free Two-Hybrid assay: Here, binder 3 yielded the highest signal, suggesting a promising de novo designed binder for GDF15 (Figure 3).

Description of your image

Figure 3: Functional validation of BindCraft-generated de novo binders with CF2H assay. Neg. ctrl: No DBD-binder encoding DNA. Pos. ctrl: DNA encoding for NbALFA nanobody paired cognate ALFA-tag epitope. sfGFP fluorescence [AU] was measured for induction with GDF15 and compared to fluorescence without target protein, displaying mean ± SEM. Fluorescence was analyzed by two-way ANOVA. Adjusted p-values are shown for each binder and used as a ranking metric.

The ranking of the experimentally validated binders was subsequently compared to BindCraft-derived i_pTM scores, as well as \( K_D \)-based rankings acquired through our US simulations and Prodigy. For this, the Kendall's \( \tau \) as well as Spearman correlation metric were utilized, whereas Kendall's \( \tau \) is better suited for quantifying ranking similarity for the small sample size of \( n\ =\ 5 \) binders (Kendall, 1938). We found that both the US-derived ranking showed the better alignment with the experimental data than the BindCraft i_pTM ranking, with a Kendall's \( \tau_{US}\ =\ 0.80 \) compared to \( \tau_{BindCraft}\ =\ -0.20 \) (Table 4), whereas Prodigy also performed relatively well with a Kendall's \( \tau_{\text{Prodigy}}~=~0.40 \).

Table 4: Comparison of in silico characterized and wet lab measured GDF15 binder rankings. Binders 1, 2, 3, 5 and 7 were experimentally characterized using the CF2H assay to obtain a ranking based on fluorescence readout fold-change between the induced and uninduced conditions (c.f. Figure 3). This ranking was compared to the rankings obtained from BindCraft (based on i_pTM scores), umbrella sampling and Prodigy (based on predicted \( K_D \) values) using the Kendall's \( \tau \) and Spearman's \( \rho \) correlation coefficients.
Method Ranking of Binders Kendall's \(\tau\) Spearman's \(\rho\)
Wetlab (CF2H assay) B3 \( > \) B7 \( > \) B2 \( > \) B5 \( > \) B1 - -
Bindcraft B1 \( > \) B2 \( > \) B3 \( > \) B5 \( > \) B7 \(-0.20\) \(-0.30\)
Umbrella sampling B7 \( > \) B2 \( > \) B3 \( > \) B5 \( > \) B1 \(0.80\) \(0.90\)
Prodigy B1 \( > \) B7 \( > \) B3 \( > \) B2 \( > \) B5 \(0.40\) \(0.50\)

Outlook

The results evidently highlight the fact that the BindCraft i_pTM-based ranking is insufficient to specifically characterize the binder affinities among themselves, as seen by the negative Kendall's \( \tau \) value. This demonstrates the power of both of our pre-filtering approaches in drastically reducing the work, cost, and time associated with testing all generated binders using methods such as SPR or QCMD to obtain biologically relevant \( K_D \) values (Easley et al., 2022; Kartal et al., 2021). However, we rationally decide to choose the MD-based estimator as SPARC's primary method in the overall workflow over Prodigy. This is explained by the fact that the US method is based on a more realistic, thermodynamic approach that takes into account more parameters than Prodigy. Still, both validated methods allowed us to make the binder selection process much more efficient and accurate for downstream use in SPARC's mathematical model.

Simulating Membrane-associated Receptor Dimerization

In addition to the binder characterization, a critical component of SPARC's work is focused on the characterization of MESA receptor dimerization within the membrane. This aspect is crucial for guiding the design of the wet lab experiments and identifying optimal transmembrane (TM) domains that minimize background dimerization. Since all-atom simulations cannot efficiently model such large membrane-protein systems due to their high computational cost, we turned to coarse-grained (CG) simulations. Here, the atoms of a molecular structure are grouped into CG beads based on their physical properties and parameterized using special force fields, which drastically reduces the degrees of freedom of the whole system (Borges-Araújo et al., 2023). For this we decided to use the Martini3 force field upon talking to experts, such as Dr. Fabian Grünewald (Souza et al., 2021). This allows us to perform longer simulations of biologically meaningful durations (2-10 µs) of membrane-protein complexes, providing insights into the dynamic behavior of the receptors. The goal was to simulate receptor halves of the FK506 binding protein (FKBP) and the FKBP-rapamycin binding domain (FRB) embedded in the membrane, without the dimerization-inducing rapalog ligand, to evaluate the background dimerization behavior of various TM domains for implementation in the wet lab (Levintov et al., 2024).

Results

The dimerization behavior of the two TM domain pairs (CD28/CD28 and CD28/CD28(M3)) was monitored by visualizing the MD trajectory as well as plotting the center-of-mass (COM) distance between the receptor halves over time (Figure 4 A). From the trajectory, it becomes evident that both TM domain pairs exhibit some level of background dimerization in the absence of the rapalog ligand, which is in line with literature stating high levels of baseline signal for the MESA receptor architecture (Zhou et al., 2023).

A

B

Distance Plot

Figure 4: Coarse-grained background dimerization of MESA receptors for different TM domain pairs. (A) The trajectory for the background membrane dimerization of CD28/CD28 TM domain pair in the absence of rapalog is visualized in a mammalian plasma membrane in a coarse-grained simulation with the Martini3 force field. (B) The distance plot shows the center-of-mass (COM) distance between the two receptor halves over the course of the \( 2\ \mu s \) simulation for the CD28/CD28 (orange) and CD28/CD28(M3) (purple) domain pairs.

As shown in the plot, the CD28/CD28 pair dimerized more quickly than the CD28/CD28(M3) pair, indicating a higher background dimerization rate. These results were leveraged to improve the induction of MESA receptors using our expression readout system. The wet lab experiment comparing both TM domain pairs showed that the CD28/CD28(M3) pair indeed has a significantly lower background dimerization in the uninduced state and higher fold-change in response to rapamycin-induced expression (Figure 4 B).

Using SPARC's coarse-grained dimerization simulation, we were able to confirm that the CD28/CD28(M3) TM pair has more favorable properties for the MESA receptor design, which guided the engineering cycle for receptor optimization for the PHOENICS cell in the wet lab.

Outlook

The validated coarse-grained Martini simulations have shown promising results and hold great potential for further development and improvement of SPARC's predictive capabilities. In the future, by combining umbrella sampling with CG models, these simulations could be used to estimate kinetic parameters, such as background dimerization rates and binding affinities, which can be integrated into SPARC's mathematical model (Patel & Ytreberg, 2018). This approach could further enhance our pipeline, improving both accuracy and precision through the incorporation of membrane-based dynamics, while also accelerating binder screening by complementing time-intensive all atom simulations

Mathematical Modeling of System Processes

For a deeper understanding of the underlying processes in a PHOENICS cell, our approach was to build a mathematical model into SPARC, simulating the biochemical reactions of our entire system. This model predicts the concentration of phosphorylated substrate for varying input concentrations of both stimulatory and inhibitory tumor ligands. The cellular architecture is represented by a set of kinetic and thermodynamic parameters of our wet lab components, which determine the system's behavior. This defines the model's end-to-end input-output mapping, which we used to guide and optimize experimental assays throughout the project.

To obtain expert perspectives and counsel, we engaged with multiple specialists in systems biology and information processing. Based on their feedback, we developed the model in two segments, reflecting our system's modules: a sensing layer that captures ligand binding and receptor dimerization, which then feeds into a processing layer that describes intracellular phosphorylation and dephosphorylation. This modular approach allowed us to develop each element independently before integrating them into a complete, but more intricate framework.

Underlying Assumptions

To strike a balance between feasibility and an accurate reflection of the biological complexity, we adopted a set of assumptions, which we developed in close collaboration with various experts in systems biology. Most importantly, our central assumption is that the system operates at steady-state, which is a common simplification in mathematical biology, assuming equilibrium concentrations of components that remain constant over time (Mathematical Biology. Vol. 1, c.f. Interview Dr. Pahle). This is appropriate, because our system is based on rapid phosphorylation and dephosphorylation of existing cellular components, instead of relying on significantly slower expression. Therefore, switching between the cell's inactive and active state occurs dynamically, quickly establishing equilibrium conditions that allow for steady-state simplification (Goldbeter & Koshland, 1984). Additionally, the modeled pathway behaves in a feed-forward manner with no engineered feedback loops, minimizing the likelihood of oscillations or multistability (Ferrell, 2002). Together, these factors ensure that the system quickly reaches a quasi-equilibrium state upon an imposed change in condition, supporting the steady-state approximation.

Sensing Layer: Ligand Binding and Receptor Dimerization

The sensing layer is responsible for translating the extracellular signal in terms of ligand concentrations into intracellular signaling. This process involves the dimerization of two receptor halves upon ligand binding, leading to spatial co-localization on the membrane. This induces spatial proximity between the intracellular leucine zipper and the enzyme coupled to the intracellular domain on each receptor half. We chose to model a receptor system capable of both hetero- and homodimerization, which increases the flexibility of possible receptor architectures.

Our mathematical model for receptor dimerization is based on existing work that describes the rapamycin-induced heterodimerization of FKBP and FRB (Lu & Wang, 2017). This framework aligns with the well established MESA-receptor system used in our wet lab approach. The receptors' reaction mechanism follows a two-step process, where the intermediate, a ligand molecule bound to a single receptor half, can stably exist (Figure 5).

Description of your image

Figure 5: Reaction scheme of receptor heterodimerization. Ligand binding to free receptor halves (top-left) can occur under the dissociation constants \( K_{D,1} \) or \( K_{D,2} \). Once one receptor half is bound to the ligand (top right and bottom left), the second receptor half can bind under the adjusted dissociation constants \( K_3 = \frac{K_1}{\alpha} \) and \( K_4 = \frac{K_2}{\alpha} \), where \( \alpha \) is the cooperativity factor (bottom right). In the case of homodimerization, \( K_1 \) and \( K_2 \) are set to identical values.

Starting with free-floating ligand \( L \), receptor half 1 (\( R_1 \)), and receptor half 2 (\( R_2 \)), the ligand can bind either to \( R_1 \) or \( R_2 \). Binding occurs at the underlying kinetic on-rates \( k^+_1 \) and \( k^+_2 \), respectively, while dissociation happens at off-rates \( k^-_1 \) and \( k^-_2 \). Once the ligand binds to one receptor half, a receptor chain of the other species can engage the already bound ligand, resulting in a complex of both receptor halves and the ligand molecule (\( R_1R_2L \)). The involved reaction steps up to this point are described in the following reactions:

(R.1)
(R.2)
(R.3)
(R.4)

The equilibria for these reactions are defined by the respective dissociation constants, which are the ratios of the off-rate (\( k^- \)) to the on-rate (\( k^+ \)):

\begin{equation} K_D = \frac{k^-}{k^+} \label{eq:kd} \end{equation}

While this model is primarily designed to represent a heterodimerization process, it can also account for homodimerizations by setting \( K_1 \) and \( K_2 \) to identical values. The dissociation constants \( K_3 \) and \( K_4 \) for the second reaction step, corresponding to the binding of \( R_1 \) to \( R_2L \) and \( R_2 \) to \( R_1L \), are defined as the dissociation constants \( K_1 \) and \( K_2 \) from the first step, respectively, divided by the cooperativity factor (\( \alpha \)). This adjustment to the dissociation constants reflects a change in the favorability of receptor half binding when the other half is already bound to the ligand. If \( \alpha \) is greater than one, as is the case for rapamycin-induced FKBP-FRB dimerization, binding is more favorable when the ligand is already bound by the other receptor half (c.f. Figure 5) (Wang et al., 2019). Based on the explained biochemical reactions, a solvable polynomial can be defined to calculate the concentration of the dimerized receptor. Expand the section below to view the detailed derivation.

A quintic equation for the concentration of the free ligand \( [L] \) can be derived:

\begin{equation} a_0[L]^5 + a_1[L]^4 + a_2[L]^3 + a_3[L]^2 + a_4[L]^1 + a_5 = 0 \label{eq:17} \end{equation}

with the coefficients

\begin{equation} a_0 = 1- \frac{1}{\alpha} \label{eq:18} \end{equation} \begin{equation} a_1 = -[L]_T + (1- \frac{1}{\alpha})\bigl(K_{D, 1} + K_{D, 2} - [L]_T + [R_1]_T + [R_2]_T\bigr) \label{eq:19} \end{equation} \begin{equation} \begin{split} a_2 & = (K_{D, 1} - [L]_T + [R_1]_T)(K_{D, 2} - [L]_T+[R_2]_T) \\ & -(1- \frac{1}{\alpha})[K_{D, 2}([L]_T-[R_1]_T)+K_{D, 1}([L]_T-[R_2]_T)] \end{split} \label{eq:20} \end{equation} \begin{equation} \begin{split} a_3 & = K_{D, 2}(K_{D,1}-[L]_T+[R_1]_T)(\frac{K_{D, 1}}{\alpha} -[L]_T + [R_1]_T)+K_{D,1}(K_{D,2} \\ & -[L]_T+[R_2]_T)(\frac{K_{D, 2}}{\alpha}-[L]_T + [R_2]_T) + 2 \frac{K_{D, 1}K_{D,2}}{\alpha}[L]_T \end{split} \label{eq:21} \end{equation} \begin{equation} \begin{split} a_4 & = K_{D, 1}K_{D,2}(\frac{K_{D, 1}}{\alpha} - [L]_T + [R_1]_T)(\frac{K_{D, 2}}{\alpha} \\ & - [L]_T + [R_2]_T) + (1- \frac{1}{\alpha})\frac{1}{\alpha}K_{D,1}^2K_{D,2}^2 \end{split} \label{eq:22} \end{equation} \begin{equation} a_5 = - \frac{1}{\alpha}K_{D,1}^2K_{D,2}^2[L]_T \label{eq:23} \end{equation}

Solving this quintic equation yields the concentration of the free ligand at equilibrium. By definition, a polynomial of degree five has five solutions, but only one of these solutions is biologically relevant. Selecting the correct root is crucial to ensure that the model calculates the proper concentration. To select the appropriate root, we filter out any roots with a negative real part, leaving only plausible positive solutions. If multiple positive roots remain after this selection, the root with the smallest imaginary part is chosen, and its imaginary component is projected to the real part. This assumes that the physically meaningful solution is the nearly real root with a positive real part.

With the concentration of free ligand now known, we could calculate the concentrations of all other species using equations (15)-(19). This allowed the model to compute the concentration of the dimerized receptor complex \( [R_1R_2L] \) at equilibrium.

Results

Given a set of parameters, \( [R_1]_T \), \( [R_2]_T \), \(K_1 \), \( K_2 \), and \( \alpha \), the dimerization behavior of the receptor can be investigated by increasing the ligand concentration \( [L] \) (Figure 6). The concentration of the \( R_1R_2L \) complex in steady-state increases with ligand concentration until it reaches a maximum. Beyond this point, increasing the ligand concentration further results in a decline in the dimerized receptor amount. This behavior is expected, as increasing ligand concentrations saturate the receptor halves, preventing them from binding to ligands already bound to other receptor halves (Han, 2020). As a result, the concentration of the dimerized complex decreases to nearly zero, indicating that nearly all receptor halves are occupied by a single ligand. This behavior aligns with biological expectations, validating the receptor dimerization model and completing the first module of the overall model (Han, 2020; Ross et al., 2020).

Description of your image

Figure 6: Predicted receptor dimerization levels for various receptor ratios. The dimerized receptor amount \( [R_1R_2L] \) for \( R_2\ = \ 50nM\) is shown for increasing ligand concentrations \( [L] \) at different \( {R_1} \) values: \( 10nM \) (dark purple), \( 50nM \) (orange), \( 250nM \) (yellow) and \( 1500nM \) (purple). Other model parameters were set to \( K_1 = 25nM \), \( K_1 = 250nM \) and \( \alpha = 10 \).

Processing Layer: Phosphorylation Circuit

The processing layer is the core of the PHOENICS system, integrating multiple inputs into a single output through a carefully designed phosphorylation cycle. The central component of this circuit is the substrate (S), which is phosphorylated and dephosphorylated by an orthogonal kinase and phosphatase, respectively. A leucine zipper (LZ) connected to the substrate localizes it to the enzymes, which each have a complementary LZ attached. Once the substrate is positioned near the enzyme via leucine zipper dimerization, the proximity-induced enzymatic reaction can occur. When phosphorylated, the substrate is in its "activated" form, serving as the starting point for any downstream effector pathways. The primary objective of this module is to predict the concentration of the phosphorylated (active) substrate based on the given concentrations of the kinase and phosphatase. The model of the phosphorylation cycle is derived from the mathematical description of the system on which our circuit is based, with modifications to suit our specific needs (Yang et al., 2025).

The substrate, whether phosphorylated (\( S_p \)) or unphosphorylated (\( S_u \)), can bind either to the leucine zipper of the kinase (\( K \)) or the phosphatase (\( P \)) (Figure 7). The reaction equilibria for these interactions are defined by the dissociation constants \( K_{LZ,s} \) for the substrate-kinase LZ dimerization and \( K_{LZ,i} \) for the substrate-phosphatase LZ dimerization. The following reactions can be described:

(R.5)
(R.6)
(R.7)
(R.8)
Description of your image

Figure 7: Reaction scheme of intracellular phosphorylation circuit. The substrate can bind reversibly to complementary LZs of the kinase and phosphatase receptor halves under the dissociation constants \( K_{LZ,s} \) and \( K_{LZ,i} \), respectively. When in proximity to the kinase, the unphosphorylated substrate can be irreversibly phosphorylated at a rate of \( \tilde{k}_K \) or when near the phosphatase, it can be dephosphorylated at a rate of \( \tilde{k}_P \). Additionally, background phosphorylation occurs at a rate of \( \tilde{k}_{bg} \). All phosphorylation rates are normalized to the background dephosphorylation rate \( {k}_{bg,dephos}.\)

When bound to the LZ near the enzyme, the substrate becomes available for enzymatic conversion. A reaction can only occur if an unphosphorylated substrate is associated with a kinase (Eq. R.5), or if a phosphorylated substrate is associated with a phosphatase (Eq. R.8). Phosphorylated and unphosphorylated substrates will still bind to the leucine zippers of the kinase and phosphatase, respectively (Eq. (R.7), (R.6)), but will dissociate again without a reaction having occurred (Figure 7). The reaction rates for the phosphorylation of the unphosphorylated substrate and the dephosphorylation of the phosphorylated substrate are defined as \( \tilde{k}_K \) and \( \tilde{k}_P \), respectively. In addition to the conversion catalyzed by the synthetic enzymes, the substrate is also phosphorylated by background phosphorylation activity, denoted as \( \tilde{k}_{bg} \). The tilde (~) indicates that the enzymatic rates are normalized to the background dephosphorylation rate, thereby eliminating the need to separately account for this additional reaction. The enzymatic reactions are represented as follows:

(R.9)
(R.10)
(R.11)

Based on these set of reactions, a polynomial can be defined, and its solution can be used to calculate the concentration of the phosphorylated substrate. To view the full derivation, expand the section below.

By solving the following cubic polynomial, the concentration of the free substrate at steady-state can be derived:

\begin{equation} 0 = \biggl(\frac{[S]}{\sqrt{K_{LZ,s}K_{LZ,i}}}\biggr)^3 + \biggl(\frac{K_{LZ,s} + K_{LZ,i}}{\sqrt{K_{LZ,s}K_{LZ,i}}} + \frac{[K]_T + [P]_T - [S]_T}{\sqrt{K_{LZ,s}K_{LZ,i}}} \biggl)\biggl(\frac{[S]}{\sqrt{K_{LZ,s}K_{LZ,i}}}\biggr)^2 + \biggr(1+\frac{[K]_T - [S]_T}{K_{LZ,s}} + \frac{[P]_T - [S]_T}{K_{LZ,i}}\biggr)\biggl(\frac{[S]}{\sqrt{K_{LZ,s}K_{LZ,i}}}\biggr) -\frac{[S]_T}{\sqrt{K_{LZ,s}K_{LZ,i}}} \label{eq:54} \end{equation}

With the known concentration of the free substrate, all other unknown concentrations, including the total concentration of the phosphorylated substrate \( [S_p]_T \), can be calculated.

Results: Model Predictions

The model successfully computed the concentration of phosphorylated substrate based on the kinase and phosphatase levels. Using this framework, we generated dose–response curves by fixing the concentration of either the kinase or phosphatase and varying the other enzyme. For each sampling point, we calculated both the phosphorylation rate (\( \frac{[S_p]_T}{[S]_T} \)) and the kinase-to-phosphatase ratio. Plotting the phosphorylation rate against this ratio produced plausible sigmoid activation curves, indicative of switching behavior within the circuit (Figure 8). Phosphorylation remained near baseline levels when the phosphatase predominated, but increased as the kinase concentration rose, eventually plateauing at the maximum phosphorylation rate under kinase-dominant conditions (Porter & Miller, 2012). We further observed an increase in the relative phosphorylation level, when modeling a kinase variant with a higher enzymatic rate (\( \tilde{k}_K \)), as well as a shift in the activation point when using a phosphatase variant with a lower enzymatic rate (\( \tilde{k}_P \)) (Figure 8). These changes shift in the dose-response curve towards higher phosphorylation levels and lower kinase-to-phosphatase ratios demonstrates the model's sensitivity to changes in enzymatic activity.

Description of your image

Figure 8: Predicted dose-response curves for different ABL kinase and PTPN1 phosphatase variants. The relative phosphorylated substrate amount \( \frac{[S_p]_T}{[S]_T} \) is computed for increasing kinase-to-phosphatase ratios \( \frac{[K]_T}{[P]_T} \) (\( [P]_T\ =\ 5000nM \)) for different configurations of strong (\( \tilde{k}_K\ =\ 2.9 \)) and medium (\( \tilde{k}_K\ =\ 0.41 \)) ABL and strong (\( \tilde{k}_P\ =\ 8.0 \)) and weak (\( \tilde{k}_P\ =\ 0.49 \)) PTPN1 variants. Other constant model parameters were set to \( K_{LZ,s}\ =\ K_{LZ,i}\ =\ 0.0028 nM \), \( \tilde{k}_{bg}\ =\ 0.0016 \) and \( [S]_T\ =\ 2190nM \).

Results: Wet Lab Validation of Circuit Model Predictions

While the behavior of the circuit model appeared to be biologically sound, we sought to verify its performance by comparing it with our wet-lab-generated data. For this, we designed an experiment in which cells were transfected with constant amounts of our ABL kinase and varying amounts of the PTPN1 phosphatase. We then set the model parameters to values corresponding to the specific circuit components. The reaction rates of the enzymes and the background phosphorylation were known, as were the dissociation constants of the LZs attached to the enzymes. Since the total intracellular protein concentrations were unknown, we assumed that the relative transfection ratios proportionally reflected the ratio of intracellular concentrations.

When overlaying measured and predicted phosphorylation ratios and scaling the outputs to a comparable range, we observed that the experimental switching point of the synthetic cells aligned well (\( R^2\ \approx\ 0.93 \)) with model predicted data (Figure 9).

Description of your image

Figure 9: Validation of model-predicted data with wet lab titration assay. The total predicted phosphorylated substrate concentration \( [S_p]_T \) (purple curve) is shown for increasing kinase-to-phosphatase ratios \( \frac{[K]_T}{[P]_T} \) with constant \( [W]_T\ =\ 2000nM\) . The experimentally measured phosphorylation levels with flow cytometry (> 33000 single cells per condition, values display geometric mean of SplitFast fluorescence (RFU), geometric SEM < 2% (not displayed)) for various ABL-to-PTPN1 transfected ratios \( \frac{[K_{trans.}]_T}{[P_{trans.}]_T} \) with constant amount of transfected ABL kinase \( [K_{trans.}]_T\ =\ 100ng \) are overlaid after scaling to a comparable range (orange points). The model parameters were set to match the experimental setup, with \( \tilde{k}_K\ =\ 0.41 \), \( \tilde{k}_P\ =\ 8.0 \), \( K_{LZ,s}\ =\ K_{LZ,i}\ =\ 0.0028 nM \), \( \tilde{k}_{bg}\ =\ 0.0016s \) and \( [S]_T\ =\ 500nM \).

Together, these results demonstrate that the phosphorylation circuit model accurately captures the experimentally observed circuit behavior. Even without knowledge of the absolute intracellular concentrations, the model performed reliably based on the provided enzyme ratios. This validation experiment confirms the intracellular phosphorylation model, thereby completing the second module of our overall system model.

Integrated Modeling of the PHOENICS Cell

After developing the models for the sensing and processing layers, we combined them into a unified system. To do this, we duplicated the receptor model to provide mathematical descriptions for both the stimulatory and inhibitory input-sensing units. These modules calculate receptor dimerization based on the concentrations of two distinct ligands. Since dimerization of receptor halves induces enzymatic activity toward the substrate, we assumed that the concentrations of dimerized receptors for stimulatory and inhibitory ligands directly correspond to the concentrations of active kinase and phosphatase, respectively. The outputs from the first module thus serve as inputs for the second, resulting in a seamless pipeline. Additionally, we assumed that the enzymatic rates of the receptor-attached kinase and phosphatase match those of their cytosolic counterparts. This allowed us to infer the steady-state concentration of the phosphorylated substrate based solely on ligand concentrations in the extracellular environment.

Results: Model Predictions

The behavior of a specific cell architecture, defined by parameters 3 to 18, can be explored by modulating the concentrations of stimulatory and inhibitory ligands. By fixing the concentration of one ligand and varying the other, we predicted phosphorylation levels for different ligand ratios. Plotting phosphorylation levels against the ligand ratio yielded dose-response curves, clearly displaying the cell's switching behavior for different possible circuit architectures. Furthermore, we observed that holding the inhibitory ligand concentration constant while varying the stimulatory ligand produced a bell-shaped curve where the phosphorylation level plateaued at an optimal ligand ratio and dipped for increaisng stimulatory ligand concentration (Figure 10). This is in line with the trend observed with the simple receptor model, where excessive ligand concentrations lead to a decrease in dimerization due to saturation of receptor binding sites (Han, 2020). The model's reliable predictive capabilites were additionally confirmed when observing a reflected curve when holding the stimulatory ligand constant and varying the inhibitory ligand (Figure 10).

Description of your image

Figure 10: Predicted dose-response curves from complete, integrated model. The predicted relative phosphorylated substrate amount is shown for increasing stimulatory-to-inhibitory ligand ratios \( \frac{[L_{s}]}{[L_{i}]} \), where either \( L_i = 100nM\) (purple) or \( L_s = 100nM\) (orange) is held constant. In both cases, \( [R_{1,s}]_T\ =\ [R_{2,s}]_T\ =\ 10000nM \) and \( [R_{1,i}]_T\ =\ [R_{2,i}]_T\ =\ 100nM \). Other model parameters were set constant to \( \tilde{k}_K\ =\ 0.41 \), \( \tilde{k}_P\ =\ 8.0 \), \( K_{LZ,s}\ =\ K_{LZ,i}\ =\ 0.0028 nM \), \( \tilde{k}_{bg}\ =\ 0.0016 \) and \( [S]_T\ =\ 100nM \). All dissociaton constants for the receptor model (Parameters 7-10) were held constant at 100nM.

Results: Leveraging SPARC for Optimal Dual-Ligand Sensing

Next, we sought out to implement SPARC's complete, integrated mathematical framework to assist in designing the dual-ligand wet lab experiment with our MESA receptors. For this, different ratios of the stimulatory and inhibitory receptors were tested as input values to predict the phosphorylation output of the entire dual-ligand system, while keeping all other intracellular model parameters constant. The binding affinities for the TNF-α based homodimerization (stimulatory ligand) and rapalog based heterodimerization (inhibitory ligand) were taken from literature (Abdolalizadeh et al., 2013; Lu & Wang, 2017).

Based on these configurations, SPARC predicted the best fold change for TNF-α induced phosphorylation for the 4:1 ratio, which aligns with our expectations, as this condition contains the highest amount of stimulatory receptor (Figure 11, B). Interestingly, the 1:4 ratio also depicted a satisfactory fold change of 3.7. However, when computing the fold change for the rapalog induced dephosphorylation in the presence of TNF-α, a ratio of 1:4 showed the best signal quenching behavior with a 6.3x fold change (Figure 11, B). This hints towards the fact that a higher concentration of inhibitory receptor halves compared to stimulatory receptor halves leads to the best switch upon induction with both ligands, while still retaining a sufficiently high activation fold change.

Description of your image

Figure 11: Comparison of model-predicted and wet lab generated data for dual-ligand MESA activation (A) Experimentally measured NanoLuc luminescence (RLU) upon ligand-induced expression normalized by Firefly luminescence (RLU) for 1:4, 1:1 anf 4:1 transfected ratios of stimulatory and inhibitory receptor halves, respectively. Luminescence was analyzed by two-way ANOVA. Adjusted p-values (ns > 0.05; * ≤ 0.05; ** ≤ 0.01; *** ≤ 0.001; **** ≤ 0.0001) and fold changes are shown for each condition. (B) Predicted data from SPARC's mathematical model showing total substrate phosphorylation levels for 1:4, 1:1 and 4:1 ratios of stimulatory (Rapalog) to inhibitory (TNF-\( \alpha \)) receptor halves, respectively. Parameters for the inhibitory Rapalog ligand (\( K_{D1,i}\ =\ 0.3nM \), \( K_{D2,i}\ =\ 26\mu M \), \( \alpha_i\ =\ 117 \)) and stimulatory ligand TNF-\( \alpha \) (\( K_{D1,s}\ =\ K_{D2,s}\ =\ 15nM \), \( \alpha_s\ \approx\ 1 \)) were taken from literature (Abdolalizadeh et al., 2013; Lu & Wang, 2017), whereas all other parameters were held constant to \( \tilde{k}_K\ =\ 2.5 \), \( \tilde{k}_P\ =\ 8.0 \), \( K_{LZ,s}\ =\ K_{LZ,i}\ =\ 0.0028 nM \), \( \tilde{k}_{bg}\ =\ 0.0016 \) and \( [S]_T\ =\ 300nM \) to match experimental conditions.

To achieve the optimal switch with our PHOENICS cell, the aforementioned conditions were replicated in the wet lab, by transfecting the ratios of stimulatory and inhibitory receptors as mentioned above. As predicted by SPARC, the 1:4 ratio performed best in the dual ligand experiment, showing the most efficient switch with rapalog-mediated dephosphorylation in the presence of the stimulatory ligand TNF-α (2.0x and 1.9x fold change) (Figure 11, A). In contrast, the other conditions did not show the desired quenching behavior, which can also be seen in the trend predicted by the model. This experiment not only proves the validity of the complete, integrated mathematical model, but also shows SPARC's predictive capabilities for designing optimal PHOENICS cells in-silico.

Outlook

With our efforts, we successfully developed a generalized mathematical framework that captures the entire process from ligand sensing to phosphorylation output. Our model enables rapid calculation of phosphorylation levels at steady-state, providing accurate predictions of circuit behavior. Although we could not directly translate transfection concentrations into intracellular protein concentrations, wet lab validation confirmed that the model accurately predicts switching behavior. This provides valuable insights for tuning receptor and intracellular component ratios. Ultimately, this enables SPARC to serve as a tool for circuit analysis and optimization, guiding the development of PHOENICS cells tailored to specific therapeutic needs.

Model-guided PHOENICS Cell Design

Having established robust mathematical modeling of our entire circuit within SPARC we can efficiently characterize cellular switching behavior, a core feature of the PHOENICS cell. This behavior allows the cell to transition from a dormant to an active state once a specific ratio of stimulatory to inhibitory ligands is reached. The range and the exact ligand ratio at which this transition occurs can be finetuned by modifying the model parameters. Our goal was to quantify this switching behavior in terms of speed and concentration threshold. By integrating this functionality into SPARC, we aim to streamline the filtering of suitable circuit architectures for specific use cases for more efficient, model-guided experimental testing. This would enable anyone to design a PHOENICS cell and adapt our toolkit for any desired purpose.

Ultrasensitivity Analysis for Circuit Characterization

Switching behavior can be quantified through ultrasensitivity analysis, which assesses how sharply a system responds to input changes. A system is considered ultrasensitive if its response is more pronounced than the classical Michaelis-Menten curve, meaning switching happens within a narrower, more defined window. In synthetic circuit design, this steepness is critical, as it ensures tight control over the activation of responses. A steeper switch indicates a rapid and precise transition from inactive to active states, minimizing background activity in dormant states and ensuring swift activation once the activation threshold is crossed (Ferrell & Ha, 2014). In our case the precise switch we have built can be described by simple Goldbeter-Koshland kinetics, which we learned in the interview with Dr. Stefan Kallenberger (Goldbeter & Koshland, 1984). He especially advised us to implement this model in our ultrasensitivity analysis, which makes the calculation of characteristic values, like the Hill coefficient and \( EC_{50} \)-value straightforward.

In the context of PHOENICS cells, high sensitivity is especially important to avoid activation in healthy tissues and ensure TME specific targeting. By filtering out circuit designs with lower sensitivity early in the development process, we can focus on those most likely to exhibit the desired switching behavior, enabling an efficient allocation of resources.

Hill Coefficient and \( EC_{50} \)

The sigmoidal dose-response curve generated by varying the ligand ratio can be approximated by the Hill equation:

\begin{equation} \frac{[S_p]}{[S]_T} = \frac{1}{1+\bigr(\frac{EC_{50}}{[L]_T}\bigr)^{n_H}} \label{eq:hill} \end{equation}

This equation is characterized by the Hill coefficient (\( n_H \)) and the half-maximal effective concentration (\( EC_{50} \)). The Hill coefficient quantifies the steepness of the switch: higher values indicate a faster, more abrupt switch, while lower values represent a more gradual response. The Hill coefficient can be calculated using \( EC_{10} \) and \( EC_{90} \) values derived from the model-generated curve:

\begin{equation} n_H = \frac{\log_{10}(81)}{\log_{10}\bigl(\frac{EC_{90}}{EC_{10}}\bigr)} \label{eq:nH} \end{equation}

\( EC_{50} \) represents the ligand ratio at which the phosphorylated substrate reaches half of its maximal value, marking the point where the system is halfway through its transition. This concentration is considered the switching point of the circuit.

For a swift determination of switching characteristics of a given circuit architecture, we utilized SPARC's integrated mathematical modeling, which spans from receptor input to phosphorylation output. Without prior knowledge of the precise location of the switch, we begin with an initial rough sampling of input signal ratios to identify the general range where the switch occurs. From this initial sampling, the borders of the switching window for every composition are inferred, ensuring that both the minimal and maximal output signals are captured within the interval. Once the region of interest has been identified, a second fine-grained sampling of ratios is performed to generate a high-resolution dose-response curve around the switching point. Since the Hill equation is designed to produce values between 0 and 1, the model-generated data requires baseline correction and scaling. This results in a curve that spans from the lower plateau at value 0 to the upper plateau at value 1. Only after this adjustment can the \( EC_{10} \) , \( EC_{90} \), and \( EC_{50} \) values be extracted from the ratios at which the output reaches these specific points. These values are then used to calculate the Hill coefficient. In summary, this approach assigns each circuit architecture a Hill coefficient and its corresponding \( EC_{50} \) value, which define its switching behavior.

Results: Wet Lab Validation of Circuit Switch Behavior

Since the Hill coefficient and \( EC_{50} \) are characteristic of each circuit architecture, we aimed to validate whether our established workflow can be aligned with experimentally observed behavior. For experimental feasibility, we first decided to characterize these circuit attributes only for the intracellular processing module of the PHOENICS circuit, for now excluding the receptors. To achieve this, the phosphorylation output was measured in vitro for a set of cytosolic, LZ-fused kinase and phosphatase ratios. We then fitted a modified Hill equation to the data points, accounting for the offset from zero (\( b \)) and scaling of the values with \( s \):

\begin{equation} \frac{[S_p]}{[S]_T} = b + \frac{s}{1+\bigr(\frac{EC_{50}}{[L]_T}\bigr)^{n_H}} \label{eq:hill_scaled} \end{equation}

The fit closely matched the data points, resulting in a nearly perfect fit, with an \( R^2\ \approx\ 0.98 \) (Figure 12). The optimal fit was obtained for a \( n_H\ =\ 1.373 \). To compare these results to the model, we introduced the parameters of the circuit architecture into SPARC and generated a dose-response curve. The Hill coefficient was subsequently calculated from the curve, yielding a value of \( n_H\ =\ 1.002 \). While not a perfect match, the values for both the predicted and fitted Hill coefficients fall within the low-cooperativity regime, indicating a gradual rather than an ultrasensitive transition (Figure 12). This is further supported by the minimal visible change in the estimated half-maximal effective concentration (\( EC_{50} \)), which suggests that the dose-response range remains similar (Figure 12). The slight upward deviation in the fitted \( n_H \) value could be due to experimental noise, minor cooperative effects not accounted for in the model, or simplifications in the model itself. However, these small discrepancies do not change the core conclusion: the system's dose-response curve is slow-switching. This level of agreement is sufficient for early-stage screening to identify and exclude designs with clearly undesirable characteristics and to prioritize architectures for more detailed in-vitro characterization.

Description of your image

Figure 12: Comparison of model-predicted and experimentally measured switching characteristic from dose-response curves. The experimentally measured data points from the flow cytometry expriment (Figure 9) are fitted to the scaled Hill equation, yielding an \( n_H\ =\ 1.373 \) (orange). The model-predicted dose-response curve (purple) is overlayed and the Hill coefficent (\( n_H\ =\ 1.002 \)) is inferred according to Equation (\ref{eq:nH}).

Database Construction for Optimizing the PHOENICS Cell

Up to this point, SPARC operated in a forward mode: given a circuit architecture and input conditions, it predicted the resulting switching response using typical circuit properties. To enable design-by-specification, we inverted this workflow so that the system proposes matching architectures for a desired switching profile. Because the mathematical model spans 18 coupled parameters, an analytical inversion is infeasible. Instead, we approached this task by creating a precomputed database of all possible circuit compositions and their respective behaviors described by their \( EC_{50} \) and Hill coefficient.

For this, we systematically enumerate all possible combinations of components and concentrations, run each through the mathematical model, to estimate these characteristic properties. This maps switching characteristics to circuit composition, producing a behavioral design space that can be leveraged for straight forward application of the PHOENICS toolkit. A user can specify desired switch behavior and target ligands, while SPARC determines the most suitable circuit composition by finding the nearest point in the behavior space. For this, the standardized euclidean distance between the desired and available sets of properties is minimized. The closest match represents the best available architecture from our current parts collection, providing the most fitting design to the desired switching characteristics. By returning the top-k closest matches, we generate a shortlist of candidate architectures that can be experimentally validated in cells, allowing for further refinement and testing.

Try out our PHOENICS Builder using the pre-computed database below.

Outlook

The current database is limited to a selection of components with measured or well-characterized parameter values, but the framework is designed to be easily extendable. The process of database generation is fully automated. Users provide a structured table with allowed parameter values, and the pipeline systematically generates all possible permutations. These combinations are then evaluated using the model, and the resulting descriptors are recorded for each configuration. To expand the database, users can simply add new components with their respective kinetic parameters, including protein binding domains for additional ligands, additional LZs for fine tuned substrate recruitment dynamics and enzymes for tailored catalytic activity. The pipeline will automatically characterize all newly generated combinations, which could uncover previously inaccessible switching behaviors. This flexibility allows for a continuous expansion of targetable tumor contexts using the PHOENICS platform, as the library of components are extended by us and others.

PHOENICS Builder

Try out the PHOENICS Builder to find the optimal cell architecture for your desired cellular switch and activation threshold! Additionally you can choose from a set of inhibitory and stimulatory tumor ligands and small moelcules that you would like your PHOENICS cell to sense. Based on your input, SPARC will search through our pre-computed database of circuit architectures and return the 100 closest matches that you can experimentally test in the wet lab. Below is a summarizing table of all available components and their respective parameters that are used in the database generation.

Please choose parameters and press "Find!"

Outlook

Limitations and The Road Ahead

While our work on SPARC has laid a solid foundation for designing and optimizing personalized cancer therapies with PHOENICS, there are still some minor limitations that we will address as we move forward.

The binder generation process is currently limited to generating binders for single epitopes, which restricts us to homomultimeric ligands. However, Dr. Martin Pacesa, the main publisher of BindCraft, has emphasized the possibility to extend this method to heterodimers with a few minor tweaks, such as hotspot definition and further target structure trimming. This improvement would significantly expand the scope of binders we can generate and test, thus enhancing the versatility of SPARC's binder design capabilities.

Another limitation lies in our use of all-atom molecular dynamics simulations. While MD simulations provide valuable insights into binder-target interactions, the quantitative determination of \( K_D \) values remains a significant challenge in the field, due to inconsistencies in the umbrella sampling method (Roux, 1995). Despite this, we were successful in developing an internal ranking system using MD simulations that has proven to be more reliable than BindCraft's i_pTM metric. Moreover, machine learning-based estimators for \( K_D \) calculation have shown great potential (Piao et al., 2025). These methods could be incorporated into SPARC's pipeline to enhance binder ranking, reduce computational costs, and improve the efficiency of binder screening.

The mathematical framework we have developed for SPARC is based on the assumption that the system operates at steady-state, which simplifies the complexity of dynamic activation and system behavior. While this assumption holds true in many conditions, it does not capture the full temporal dynamics of the system, such as the speed and activation timeline. To address this, we plan to extend the mathematical framework to incorporate dynamic behaviors. Time-resolved wet lab experiments are already underway to provide the necessary data for building more detailed dynamic models. This addition will further refine SPARC's ability to predict effector output over time.

In addition, the current model does not account for background receptor dimerization rates, which have been observed in wet lab experiments. However, the promising results we see from coarse-grained Martini simulations offer a potential solution. These simulations are already providing reliable, wet lab-validated results, and by combining them with techniques like umbrella sampling, we can estimate background dimerization kinetic rates and binding affinities in the membrane. This approach will not only improve the model's accuracy but also provide deeper insights into the interplay of the PHOENICS components. Simulations spanning across longer time intervals could also be used to capture a broader range of dynamic interactions within the membrane, which would further enhance the system's predictive capabilities.

Another challenge has been the integration of our synthetic GPCR sensing architecture into the existing SPARC framework. Currently, the mathematical model is based solely on our MESA-receptor designs. However, we have already taken the first steps to incorporate the GPRC-based PAGER system into SPARC by adapting BindCraft to generate binders compatible with their architecture. The next steps involve further MD-based filtering and wet lab validation of these arodyn-fused binders. Additionally, the extensive literature describing various mathematical modeling approaches for native GPCR reactions provides a solid foundation for adapting SPARC for the PAGER platform, providing further flexibility and scalability of our model (Carvalho et al., 2021; Culhane et al., 2022).

Our Vision

Despite some limitations, SPARC enables the design and optimization of PHOENICS cells for personalized synthetic immune cell therapies. We envision this process as follows: By utilizing patient tumor biopsy data and subsequent TME profiling, researchers could use SPARC's binder generation and characterization workflow to rapidly design protein binders for biomarkers that are differentially expressed in the TME compared to healthy tissue. SPARC could then generate an optimized PHOENICS cell candidate in silico with the most suitable circuit architecture, fine-tuned to patient-specific biomarker concentrations by leveraging the ultrasensitivity database and our synthetic part collection. In an ideal scenario, once target biomarkers are identified, this patient-specific circuit design process could be completed within days, significantly accelerating personalized cancer therapy development. Beyond patient-specific designs, SPARC can also be leveraged to create off-the-shelf PHOENICS cells precisely engineered for common tumor types. This would drastically reduce cell therapy costs and significantly improve accessibility of effective cancer treatment.

Looking ahead, SPARC's modular architecture facilitates continuous optimization and extension of individual components over time. By addressing current limitations and implementing our proposed improvements, SPARC can evolve into an increasingly powerful tool for therapeutic cell engineering. This progress will be accelerated through continued collaboration with the iGEM and broader scientific communities, building upon the robust foundation we have established.