To efficiently construct our safeTEA plasmid, we established four concurrent engineering cycles to investigate four key components: aptamers binding to target molecules, anti-aptamers inhibit the binding of aptamers, lambda exonuclease processivity, and a reproducible plasmid insert. Due to the carcinogenic nature of Aflatoxin B1, progesterone was selected as our target molecule, to create a qualitative fluorescent binding assay to determine whether or not our culminated aptamer library binds to, or is inhibited by, its respective anti-aptamer counterpart. To generate single-stranded aptamers from nicked double-stranded DNA, we needed to selectively digest the complement strand of our aptamer encoding sequence—the anti-aptamer. For this, we utilize the 5’-targeting Lambda exonuclease. Lambda exonuclease is able to be heat-inactivated and can exhibit pausing behavior, making it even more desirable for our application. Using Geneious Prime, a dual-aptamer insert was designed to be cloned into the lactose-sensitive plasmid pUC19 using Golden Gate Assembly. The construct is to be tested for its ability to transform into Seagull. The customizable insert allows for the safeTEA system to be adapted to other toxins by replacing the aptamer sequence. When all parts align we envision the plasmid being cut open, releasing its aptamer wings, allowing our seagulls to fly.
Engineering
In an effort to reduce large group decision-making for detail-oriented tasks, we created Helix: a system of interconnected “cycle” groups that simultaneously focused on different aspects of our overall project, but in much more depth. Cycles were created based on individual goals and milestones that needed to be proven before moving on to the next goal. The groups worked at the same time, but their progress was linked: no cycle could truly move ahead without the one before it. Inspired by the “follow-the-sun” approach studied in a required leadership course, Helix transformed our team into a continuous and interlinked structure where progress in one cycle fuels innovation in another. This model echoed the coordination strategies outlined by Treinen and Miller-Frost [1], which allowed us to maintain continuous momentum and ensure that no idea was lost between iterations. Helix proved that intentionally designed interdependence catalyzes rather than constrains creativity. We were able to convert overwhelming complex challenges into smaller structured, achievable milestones while preserving collaboration and forward momentum.
Citations
- [1] J. J. Treinen and S. L. Miller-Frost, “Following the sun: Case studies in global software development,” IBM Systems Journal, vol. 45, no. 4, pp. 773–783, 2006, doi: 10.1147/sj.454.0773.
Cycle 1 & 2 full PDF
Cycle 1 & 2 — Full Document (PDF)
Cycle 3 5 iterations + citations
Design
GOAL: Design a double-stranded gene block and establish a streamlined digestion protocol that will allow us to measure the digestion capabilities and processivity of Lambda exonuclease.
Design:
Lambda exonuclease strictly targets phosphorylated 5′ ends. Its intended function is to digest one strand of double-stranded DNA (dsDNA), leaving the other strand intact. We will employ Lambda exonuclease (exo) to digest our gene blocks (gBlocks).
In order to evaluate the digestion capabilities and processivity of Lambda exo, we designed a dsDNA construct, TriCycleV1 (T), containing putative pause sequences. The initial design of TriCycleV1 was 830 nucleotides in length and holds four 5′-GGGGATTC T-3′ pause sequences | separated by 9-nt nonsense spacers [1]. This “rumble zone” flanks a 210-bp region intended to remain undigested to “hold” the undigested aptamer arms (Fig. 1). We also designed a control sequence with no pause sequences, NonStop (N), to better characterize the influence of the pause sequences within the context of digestion. The greatest challenges we faced were determining how to pause the exonuclease, and how to measure where in the sequence it stopped digesting.
There are two sets of primer binding sequences encoded within both of the experimental gBlocks. The T7 forward and T7 reverse primer binding sites are located at the 3′ ends to enable PCR amplification of the entire gene block and Sanger sequencing (Fig. 1). The M13 forward and M13 reverse primer binding sites are positioned downstream of their T7 counterparts. The M13 primers are to be used as a second measure of digestion efficiency and measuring where the exonuclease halted digestion when used to perform Sanger sequencing. The primer sequences were sourced from the University of California, Berkeley DNA Sequencing Facility stock primer sequences [2].
To amplify the gBlock, we use Touchdown PCR (TD-PCR) to avoid potential hairpin formation on one of our primer sites (Fig. 2). We use OneTaq DNA polymerase because it does not exhibit exonuclease activity. The aforementioned primers and primer binding sites are employed during Sanger sequencing as performed by the University of California, Berkeley DNA Sequencing Facilities.


Build
For the purpose of experimenting with the “rumble zone”, we ordered our TriCycleV1 and NonStop sequences as gBlocks from Integrated DNA Technologies (IDT) [4]. We similarly ordered T7 forward and T7 reverse primers from IDT. We amplified the gBlocks using Touchdown PCR with 2X OneTaq Master Mix. We started the annealing temperature at 64 °C and went down −1 °C per cycle for 14 cycles to reach a final annealing temperature of 50 °C.
After sequencing, raw .ab1 files were aligned using Geneious Prime’s De Novo assembler for each sample, combining the four reads from each primer and to generate assembled contigs [5]. These assembled contigs were mapped back to their respective template sequence to assess digestion. The mapped contigs were subsequently used to visualize the results of our experiments.
Test
We amplified our gBlocks using 2X OneTaq Master Mix in a 50 µL reaction format with 1 µL of our 10 ng/µL template DNA, 10 µM forward, and 10 µM reverse primer. We verified the amplicons on a 1% agarose gel with SYBR Safe to ensure our primers were viable. Then, we digested approximately 200 ng of each experimental gBlock at 37 °C with Lambda exonuclease following the manufacturer specifications for reaction conditions [6]. We then cleaned up the DNA using the “Monarch Spin PCR & DNA Cleanup Kit (5 µg)” following manufacturer instructions [7]. We used a NanoDrop spectrophotometer to verify the concentration and purity of our DNA.
We tested different methods of DNA isolation and exonuclease inactivation such as: EDTA, chloroform extraction, ethanol precipitation, guanidinium isothiocyanate (GITC) extraction, ammonium acetate extraction with both glass beads and a silica column. Protocols can be referenced on our protocol page.
The digestion products were divided into four 10 µL aliquots, each sequenced using a different primer (M13 forward, M13 reverse, T7 forward, T7 reverse). We then sent the aliquots to UC Berkeley for Sanger sequencing.
Learn
We confirmed that our gBlocks were the correct size and our primers had successfully bound and amplified our target gene blocks (Fig. 3). The resulting amplicons were at the expected lengths—approximately 830 bp for TriCycleV1 and 786 bp for NonStop, respectively.
For each inactivation method tested, the DNA yield after purification was consistently poor. However, chloroform inactivation produced the highest yield compared to the other methods tested. Therefore, we believed that all subsequent digestions were to be performed with chloroform inactivation.
Following the first round of Sanger sequencing, we observed that the gBlocks were hardly being digested by Lambda exonuclease. We observed digestion by analyzing peak height on the chromatograms after De Novo assembly. A sharp drop in peak height can be associated with lower confidence due to lower quantity of fragments at that position. Based on the consensus results, we estimated that only 20 base pairs were being eaten in 15 minutes, which was significantly slower than expected (Fig. 4). The difference between digestion on the TriCycleV1 and NonStop gBlocks was indistinguishable.
A second round of Sanger sequencing was performed on samples that were subjected to longer digestion times, but unfortunately, the results were similarly inconclusive. The read quality was not interpretable, with some of the reads going up to approximately 2000 base pairs—twice the length of the gene block (Fig. 5). There was no consistency between the amount of digestion seen on the 5′ ends for each sample.
The low read quality was likely due to low concentrations of DNA within the digested samples. In addition to the low DNA quantity, we had means to verify each aliquot contained equivalent concentrations and a proportional number of digested products. This issue is corroborated by multiple reads from one sample lacking consistent digestion from primers on the same strand.
Further analysis revealed that the gene blocks were not ordered with 5′-phosphorylated ends. The lack of phosphorylation significantly slows down and impairs the ability of Lambda exo to bind and digest the DNA [8]. Thus, the absence of these modifications may explain the limited digestion we observed. Therefore, in our next experiments we will test lambda’s digestion of 5′-phosphorylated DNA substrates.



Design
GOAL: Assay our digestion samples using loading dye with a denaturing agent on a 1% agarose gel and a 2% agarose E-Gel.
To investigate how the exonuclease behaves on the individual strands of the gBlocks, we aimed to denature our digested samples. This required running the products on a denaturing gel or using a denaturing loading dye. We chose 2X RNA Loading Dye containing formamide [12]. The formamide denatures the DNA, allowing us to observe any differences in migration distance and intensity of bands, which could indicate variability of Lambda exo binding and processing of each strand.
Build
We ran digestions for 4 hours, 2 hours, and 1 hour, reverting back to standard protocol amounts (1×) for Lambda exonuclease and Lambda exonuclease buffer. We used a 10 minute inactivation step at 80 °C.
We employed both an E-Gel and traditional gel electrophoresis to analyze our digestion samples. The SizeSelect II E-Gel contains SYBR Gold stain, a highly sensitive dye capable of detecting very low concentrations of nucleic acids. The E-Gel also allows for easy extraction; therefore bands of interest can be readily selected and used for subsequent experimentation, such as Sanger sequencing. In contrast, the traditional 1% agarose gel is stained with SYBR Safe to compare band separation and fluorescence. Both gels were tested to determine which method produced more interpretable results.
Test
We used the RNA loading dye on an Invitrogen E-Gel SizeSelect II, 2% agarose gel. We loaded the gel with our digested samples and an undigested TriCycleV2 gBlock to use as a reference. Each well contained approximately 20–30 ng of DNA diluted in 12.5 µL of nuclease-free H2O with 12.5 µL of 2X RNA Loading Dye for a total 25 µL sample. We ran the gel for 12 minutes in an E-gel Power Snap electrophoresis chamber.
We then ran the same protocol using the formamide loading dye on a 1% agarose gel. Each sample contained approximately 100 ng of DNA diluted in nuclease-free water with 10 µL 2X RNA Loading Dye for a total 20 µL sample. We ran the gel for approximately 45 minutes in a 0.5× TBE buffer.
Learn
The E-Gel exhibited decent resolution with individual bands distinguishable from the general lane smear (Fig. 14). There was no visible band in the template lane, likely due to the template DNA being left out from that sample, which made direct size comparison more challenging. However, each visible band was relatively similar in size, as expected. The four-hour digestions run on the E-Gel produced fewer and fainter bands, suggesting more complete digestion of the DNA when compared to the shorter digestion times. Bands seen for the shorter digestions seemed to migrate less far down the gel as well.
The 1% agarose gel showed several bands for each sample but there was less correlation between the brightness of the bands and digestion time. The presence of multiple bands of varying sizes in each sample led us to believe that digestion certainly occurred. Further, the most intense bands for each sample increase in size as digestion time decreases, further corroborating our findings from the previous E-Gel.
In future experiments, we would like to test the addition of formamide to a higher-percentage agarose gel to improve band resolution and intensity. We would also like to design smaller testing sequences to run on such a gel to more accurately analyze subtle differences in digestion.

Cycle 4 template — iterations + citations
Project Summary
Through the utilization of the pUC19 plasmid as a cloning vector, we aimed to design a Golden Gate compatible insert that would generate a final assembled "safeTEA" plasmid.
This final plasmid design, when in the presence of lactose, would be de-repressed to form a pUC19 backbone, which functions as an interior double-stranded segment between two aptamer arms capable of selectively binding to target molecules in aqueous solutions.
Our insert was modeled to be under the control of the LacZ promoter, so that lactose regulation may occur. Due to native pUC19 not having an active LacI gene, it would need to be added to the system to allow for its binding to the DNA of the lac operon, preventing spontaneous derepression and unintended maturation of the backbone-aptamer arm system.
When lactose is introduced to this designed system, it can then bind to LacI, splitting it and inhibiting its ability to bind to the lac operon promoter. This then allows for our restriction enzyme to cleave the double-stranded DNA located between the two aptamer components of the insert, and subsequently for the lambda exonuclease to cut away the anti-aptamer strands from their exposed 5′ ends. The lambda exonuclease is slowed down at rumble zones included in our insert, in order to protect the double-stranded backbone interior between the two aptamer arms.
This ultimately linearizes the plasmid, freeing the aptamers from the anti-aptamers, making them available for target molecule binding.
While lactose presence allows for transcription, under the presence of glucose, either alone or in addition to lactose, this transcription is suppressed, allowing for glucose to be utilized as another method for suppressing plasmid maturation until its final form is desired.
Therefore, both in the absence of lactose as well as in the presence of glucose, transcription, and ultimately plasmid maturation, is suppressed, allowing for the plasmid to remain intact for reproducibility, transportation, and longevity purposes.
Software 2 iterations + citations
Design
Our computational team set out to design single-stranded DNA spacer sequences that do not interact with neighboring coding regions of DNA. A common biological linker is a polyA tail which consists of many adenine bases strung together. This is not optimal when interacting with aptamers because any of the bases in the polyA tail could interact with any thymine bases in the aptamer/flanking sequence. Spacer sequences are influenced by user given parameters, such as target A/T content and specific crossing over methods, with the goal of avoiding base pair complements to neighboring sequences and minimizing specific motifs (such as palindromes, repeats, and folding). The key tool that NOODL implements is a genetic algorithm (GA) which is a search based optimization technique relying on principles of cell biology, genetics, and natural selection [1]. Having a GA at the core of our program allows each spacer to be evaluated in kmers (DNA substrings of length k), with each kmer being evaluated by itself to compound to a total score. The lower the score, the better the spacer.
An important requirement for our application, driven by our experimental procedures, was to strictly avoid interaction between a spacer sequence and its flanking regions. Mutation does not occur in these fixed regions and therefore the crossover and mutation functions act only on the variable portion of the spacer, thus preserving the shape of the aptamer. When brainstorming how to write our scoring function, we decided that it would penalize any predicted hybridization. Since this folding can inhibit target molecule/aptamer binding, it was essential to build a tool to prevent it.

The “seagull” structure contains a double-stranded plasmid backbone, along with spacers and aptamers with overhangs that come off of each backbone strand. This required us to consider the possibility of spacers on the top strand of our sequence potentially interacting with spacers on the bottom strand. This gave rise to an initial limitation of NOODL’s design, where spacers were incorrectly made under the assumption that aptamers were evaluated moving from the 5’ to 3’ end.
Build
NOODL modules include:
- noodl.jl: Contains the main genetic algorithm functions, parsing, and imports modules from the other Julia files.
- Crossover: Implements multiple types of crossing over methods (Multipoint, Single, Uniform). Initially chose a random crossover method as default if the user did not input a selection.
- RCScore: Computes total penalties; facilitates reverse complement search.
- Bias Selection: Implements multiple types of bias for parent selection; this drives convergence rate of the GA. Good parents allow for fitter offspring (Stochastic Universal Sampling, Roulette, and Tournament). Selects random bias as the default if the user did not input a selection.
To combat the earliest limitation of NOODL, we modeled the right flank sequence as its reverse complement to mimic the DNA’s behavior on the opposite strand.
Test
Run Sequence:
- Parse user input for parameters such as spacer length, target AT content, populations, generations, crossover type, bias selection type, mutation rate, flanking sequences, etc.
- Generate a pool of random initial sequences with inputted length and rough A+T percentage.
- Create a kmer dictionary for randomly generated sequence and inputted flanking regions; create a second dictionary of reverse complement kmers of the initial dictionary.
- Assign penalties based on number of kmer matches between the two dictionaries. Also penalize based on proximity to user-inputted AT content.
- Select parents using a bias method; apply crossover and mutation functions.
- Re-score and repeat across the desired amount of generations to determine single best spacer sequence.
We took multiple sequences produced by NOODL and ran them through UNAFold’s DNA Folding Form [2] to observe folding differences and folding under different temperatures. Based on the differences observed, we refined NOODL’s input parameters. The goal was to identify the parameters that consistently produced 1) a low internal score and 2) desirable folding.
Our main changes in testing included:
- Bias Selection: experimented with different kinds of bias to see which types of bias gave us the lowest scoring sequences
- Crossover Functions: experimented with different crossing over points in the sequence to see which point gave us the lowest scoring sequences
- Mutation Rate: experimented with values between 0.05 and 0.2
- Population Size: experimented with values between 100-300
- Number of Generations: experimented with values between 0-100
- Kmer Count: the size of the kmers we are counting, which ranged from 5-8
We tested multiple values of the listed parameters until finding the lowest scoring combination.
Learn
We learned that Tournament Selection bias increased population diversity and reduced repeat parent picks within a selection pass. This led to steadier convergence than Roulette Wheel bias or Stochastic Universal Sampling bias. It also reduced the program’s runtime by more than half. Additionally, the Multipoint Crossover method consistently produced lower scoring sequences than other crossover methods, validating our decision to set it as the default. We also deduced that the reverse complement penalty system worked well as a scoring method because after plotting the structure of final sequences, we observed hairpins of length k-1. When any hairpins had a length greater than k-1, they were proven to be thermodynamically impossible.
Ideal input parameters (learned after testing):
- Bias Selection: Tournament
- Crossover type: Multipoint
- Mutation Size: 0.2
- kmer Length: 5