1 Supplementary Materials and Methods Design of the automata components A computer program was developed to design the symbols of the diagnostic molecules. It generates a random sequence of 6 nucleotides for each disease marker and improves this random set using a genetic algorithm. We constrained the sequences to contain 75% CG content in each 4-nucleotides sticky end. All sticky ends derived from the symbols were checked for complete or partial complementarity. The algorithm renders sequences with minimal partial complementarity between non-related sticky ends. Several runs were performed and a set of symbols with best non-overlapping properties was chosen for diagnostic molecules construction. In the actual diagnostic molecules the 6-bp symbols were separated by 1-bp spacers to obtain symbols of 7-bp total length. A computer program was developed to select mRNA activation and deactivation tags, which were then realized using ssDNA molecules in most of our experiments. It accepts a set of mRNA sequences of the disease markers for a particular disease and provides the two most unique short subsequences for each of these markers which also contained a partial FokI recognition site (preferentially, first three nucleotides: 5'-TCC) to facilitate the strand exchange. We use Hamming distance1, which is a number of nucleotides that need to be changed to obtain one sequence from another, as the uniqueness criterion and assume that specific interaction of each transition molecule with its regulatory tag depends only on the uniqueness of its regulatory sticky end. The lengths of the tags were adjusted to have a melting temperature of ~25 oC, using a simplified assumption to determine Tm of a sequence. In a model ssDNA regulatory tags are separated by a linker ~40 nt long, designed to have minimum interaction with other ssDNA sequences in the system. Each tag sequence was used as a template for the design of the transition molecules. The complete set of oligonucleotides, comprising the 2 automaton and the model disease markers, was tested for cross-interactions using the Oligonucleotide Modeling Platform (DNA Software, Inc.) software tool for possible flaws in the design. Construction of the automata components All deoxyribonucleotides employed for automata construction were ordered from Sigma-Genosys or from the Weizmann Institute DNA synthesis unit, PAGE-purified to homogeneity and quantified by GeneQuant instrument (Pharmacia). Non-labeled double-stranded components were prepared by annealing 1000 pmol of each single strand in 10 l of 50 mM NaCl, by heating to 94 oC and slow cooling down in a PCR machine block. Diagnostic molecules employed for the experiments in Fig. 3b and Supplementary Fig. S4 were prepared by annealing of 1000 pmol of their singlestranded components, with 3 pmol of an antisense oligonucleotide phosphorylated by Redivue [-32P] ATP (~3000 mCi/mmol, 3.33 pmol/l, Amersham-Pharmacia). Fluorescently labelled diagnostic inputs employed for parallel diagnosis experiment (Fig. 3c) were prepared by annealing non-labelled sense strand of the input and either FAM- or Cy5 5'-labelled antisense strands. Diagnostic molecules with drug release and drug suppressor release moieties were prepared by ligation, employing 32P-labeled 5' terminus of one of the oligonucleotides to introduce internal label in the single-stranded loop. Preparation of PPAP2B↓GSTP5↓PC: The oligonucleotides for the construction of the drug-release diagnostic molecule were RL.21 (CCGAGGCGGTGCGCGACGCTCGAGCCTCGACGCTCGTTGGTATTG) and RL.22 (32PCACATCCAACGAGCGTCGAGCGTCGAGCGTCGCGCACCGCC). The ligation was afforded by the bridging oligonucleotides RL.25 (CTCGACGCTCGTTGGATGTGCAATACCAACGAGCGTCGAGCGTCGAGCGTCGCGCACCGCCTCGG) Twenty pmol of RL.22 oligonucleotide (out of 1000 pmol) were 32P-labelled with 5 l 3 of [-32P] ATP (~3000 mCi/mmol, 3.33 pmol/l, Amersham) in 50 l reaction containing T4 Polynucleotide Kinase Buffer and 20 u of T4 Polynucleotide Kinase (New England Biolabs). After 1 hour at 37 oC, 20 u of T4 Polunucleotide Kinase in T4 Ligase Buffer were added, the volume was increased to 165 l and the reaction continued for additional hour at 37 oC. Double stranded block was prepared by annealing of 1000 pmol of RL.21 and 1200 pmol of RL.25. For ligation, 1000 pmol of the labeled RL.22 oligonucleotide was mixed with the annealed block and ligated using 1,600 u of Taq Ligase (New England Biolabs) in 1 ml of Taq Ligase buffer at 55 oC for 18 hours. The ligation products were ethanol-precipitated, resuspended in TE buffer, pH 8.0 and separated using 12% denaturing PAGE (40 cm x 1.5 mm). The correct-length ligation product was excised from the gel and extracted using standard methods. The product was refolded prior to use. Drug suppressor-release molecule was constructed by the identical protocol using the oligonucleotides RL.23 (CCGAGGCGGTGCGCGCGAGGCGCGAGGCGCGAGGCCCATGTGCAATAC), RL.24 (32PCAACGCACATGGGCCTCGCGCCTCGCGCCTCGCGCGCACCGCC) and the auxiliary oligonucleotide RL.27 (CGCGAGGCCCATGTGCGTTGGTATTGCACATGGGCCTCGCGCCTCGCGCCTCGCGCGCACCGCCTCGG). Preparation of PPAP2BGSTP1PIM1HEPSINPC: The oligonucleotides for the construction of the inputs were: RL.5-50 (CCGAGGCGGTGCGCGCAGGGCGGGTGGCGACGCTCGACGCTCGACGCTCG) and RL.3-51 (32P- TTGGTATTGCACATCCAACGAGCGTCGAGCGTCGAGCGTCGCCACCCGCCCTGCGCGCACCGCC). They were ligated with the help of a bridging oligonucleotide RL.25n (GGATGTGCAATACCAACGAGCGTCGAGCGTCGAGCGTCGCCACCCGCCCTGCGCGC). Twenty pmol of the RL.3-51 oligonucleotide were 32P-labeled; 1000 pmol of the same substrate were phosphorylated with PNK in T4 DNA Ligase buffer with 1 mM ATP. For ligation, 1000 pmol of the RL.3-51 (mixture of 32P-labeled and phosphorylated substrates), R.5-50 and RL.25n (bridge) oligonucleotides were mixed and ligated by 2,000 u of Taq Ligase 4 (New England Biolabs) in 1 ml of Taq Ligase buffer at 60 oC for 2 hours. The ligation products were ethanol-precipitated, resuspended in TE buffer, pH 8.0 and separated using 8% denaturing PAGE (40 cm x 1.5 mm). The correct-length ligation product was excised from the gel and extracted using standard methods. The product was refolded prior to use. Note that according to our observation, ligation product migrates much faster than could be expected from its length, probably due to its stem-loop structure. Regulation by mRNA We used an mRNA transcribed from a pTRI-Xef 1 ~1900 bp DNA template provided with the MEGAScript T7 kit (Ambion) as a generic mRNA disease marker. mRNA sequence was folded using mFold server v 3.0 (URL: http://www.bioinfo.rpi.edu/applications/mfold/old/rna/) and visually examined to find sequences of low secondary structure. mRNA was synthesized using MEGAScript T7 kit and quantified by GeneQuant (Pharmacia). mRNA was refolded in solution by heating to 70 oC and slow cooling down prior to regulation experiments. Transition molecules were designed to match these sequences and screened to determine the most effective activation and inactivation tags of the mRNA. These were identified at the locations around 600 nt and 1500 nt. Transition molecules were built from fluorescently labeled oligonucleotides to facilitate their identification. A mixture of 0.25 M active negative and 0.25 M inactive positive transition molecules and 0.25 M of the 5'→3' oligonucleotide for positive transition were incubated in 10 l of NEB4 (New England Biolabs) buffer at 37 oC for 20 min with varying amounts of mRNA and analyzed by native acrylamide gel (15%). Transition molecules involved in the experiment described in Supplementary Fig. S3a were similar but not identical to the fluorescently labeled molecules used in direct visualization of the regulation process presented in Fig. 3a. The only difference was converting the negative transition to positive transition and vice versa, by introduction and removal of spacers between the FokI sites and the state- 5 symbol recognition sticky ends, respectively, for technical reasons (Table S1). To improve the regulation pattern, negative transition molecule was taken at 0.5 M while positive transition molecule was at 1 M concentration. Table S1. Molecules involved in single-step computation with pTRI-Xef mRMA Symbol Positive (Yes → Yes) transition Negative (Yes → No) transition CAGGGCC TTCCTCATATCTTTTCTGACTGTATGGGGGATGCC ACATACCCCCTACGGGTCC CTGAGGATG GCGACGAAAGACTCCTACGTCC + GGAAATATTCATTTGGTTTTCGCTGCTTTCTGAGCAG ACTCA Diagnostic computations Diagnostic computations consist of three steps: 1) mixing the active negative and inactive positive transition molecules for each diagnosed marker and the diagnostic molecule(s); 2) equilibrating the software component with the mixture of ssDNA oligonucleotides or mRNA molecules representing the molecular disease markers and 3) processing of the diagnostic string by the hardware enzyme. For each symbol of diagnostic string, we combine transitions in the following manner: if its marker is under expressed in a disease, we mix 1 M of active positive transition molecule and 1 M of inactive negative transition molecule. For a marker over-expressed in a disease, we mix 1 M of active negative transition molecule and 1 M of inactive positive transition molecule. For some transition molecules, inactivated only by high marker concentrations, we add 1 M of the protecting oligonucleotide (see Fig. 2 and Supplementary Fig. S2 and their legends) to improve regulation (namely, for each pair of transitions in the SCLC diagnosis and for PPAP2B and GSTP5-related transitions in the PC diagnosis). All other components except FokI, including the diagnostic string 6 molecules (1 M), neutral transition molecules (1 M each), Yes- and No-verification transition molecules (2 M each) and NEB4x10 buffer are admixed at this stage. A mixture of model ssDNA or mRNA molecular markers is prepared in parallel, with each marker at either zero (normal state for over-expressed gene and disease state for underexpressed gene) or 3 M concentration (normal state for under-expressed gene and disease state for over-expressed gene). Both mixtures are thoroughly mixed to a total volume of 9 l and incubated at 15 oC for ssDNA markers or at 37 oC for mRNA markers for 20 min. Following equilibration, the computation is initiated by adding 1 l of FokI enzyme (New England Biolabs) solution, either at concentration equal to the total concentration of active transition molecules or at 5.4 M. Typical reaction proceeded for 30 min at 15 oC, but for shorter diagnostic strings (2 symbols) incubation times were shortened to 15 min. The reaction was quenched by addition of 1 volume of formamide loading buffer. Samples were analyzed by denaturing PAGE (15%) following denaturation at 94 oC for 5 min. In this assay, Yes and No outputs are represented by 17-nt and 15-nt long bands, respectively. Radioactive gels were exposed to Imaging Plates (Fuji) and scanned on PhosphorImager (Fuji). In the parallel diagnoses experiment (Fig. 3c), the diagnostic molecules were labeled with FAM and Cy5 at the 5' of their antisense strands. The gels were scanned by Typhoon 9400 instrument (Amersham Pharmacia). Controlled drug production Drug production (Fig. 4a) was tested with PPAP2BGSTP1PIM1HEPSIN drug release diagnostic string (0.5 M). Different diagnostic outputs were obtained by transition regulation with a desease marker in one of the symbols, while the rest of the 7 symbols were processed by preformed positive transition molecules (1 M each). Absolute transition molecules concentration of 1 M was taken for HEPSIN ssDNA model regulation and 0.25 M for pTRI-Xef mRNA regulation. The samples were analyzed by denaturing PAGE (15%). For regulated drug release experiments (Fig. 4b) equal amounts of diagnostic string molecules PPAP2BGSTP1 with drug-release and drug-suppressor release moieties (0.5 M each) were mixed with 1 M of B GSTP1 Yes transition molecule and varying ratios of Yes Yes and Yes PPAP2 1 No at 1 M total concentration to model different diagnostic outcomes. In Yes GSTP all experiments Yes- and No-verifying transition molecules were added at 2 M each and FokI enzyme at ~ 4.3 M in 10 l final volume. The mixtures were incubated at 15 o C for 30 min, quenched with EDTA, mixed with loading buffer and analyzed by native PAGE (20%). To control drug:drug suppressor ratio at a constant confidence level in a positive diagnosis (Fig. 4c), varying amounts of diagnostic string molecules PPAP2BGSTP1 with drug-release and drug-suppressor release moieties (1 M total B Yes transition molecule and 0.6 concentration) were mixed with 1 M of Yes PPAP2 1 GSTP1 Yes and 0.4 M of Yes No transition molecules to obtain 50% M of Yes GSTP confidence in the positive diagnosis. Yes- and No-verifying transition molecules were added at 2 M each and FokI enzyme at ~ 4.3 M in 10 l final volume. The mixtures were incubated at 15 oC for 30 min, quenched with EDTA, mixed with loading buffer and analyzed by native PAGE (20%). Molecular components of the computer and the disease markers DNA sequences of the oligonucleotides used for the construction of the molecular computer components are shown in Tables S2-S7. The coloring of the nucleotides reflects their function, as described in the main text. X stands for AAGAGCTAGAGTC in the sense strand and for its complementary sequence in the antisense strand. 8 Table S2. Transition molecules for SCLC diagnostic moiety Yes → Yes transition Symbol ASCL1↑ GCAGGGC GRIA2↑ CCGAGGC INSM1↑ GGTGCGC PTTG1↑ CGGAGGC CDKN2A↑ CGAGGGC CCATGGATGTC GCTCGGCCGGTACCTACAGCGTC + GAGACCGGCGAGCCGGCCATGGAGTTCGC CCAAGGATGAC TTTCCGTGGGTTCCTACTGGGCT + TGCCGCATAAAGGCACCCAAGGAAAACCCGA AGCGGATGCG TGGCTCTCTTCGCCTACGCCCAC + ACGCCACACCGAGAGAAGCGGAGACGGG GGTGGATGGG AAGGGTACCACCTACCCGCCT + CAGATTGGATTCCCATGGTGGAGAGGCG GAGGGATGGC GAGACCTCCCTACCGGCTC + CCTCAAATCCTCTGGAGGGACCGCCG Yes → No transition No → No transition GGCGGGGAGGTGAAGGGATG CCACTTCCCTACCGTC - TGCATTCCCCCTTCGGGATG GGGAAGCCCTACGGCT XGGATGCC XCCTACGGCTCC ACCTTGGCGCACTCGGGATG CGTGAGCCCTACCCAC XGGATGCC XCCTACGGACGC ATAGGCATCATCTGAGGCAGGATG TAGACTCCGTCCTACGCCT XGGATGCC XCCTACGGCTCC GAGCACTTAGCGAATGTGGATG CGCTTACACCTACGCTC XGGATGCC XCCTACGGTCCC Table S3. ssDNA models for SCLC markers Marker ASCL1 GRIA2 INSM1 PTTG1 CDKN2A Sequence (5'→3') GAACTCCATGGCCGGCTCGCCGGTCTCATTACAATGCTGCAAACTAAGAATCTCAACGGTCCCTTCACCTCCCCGCC TCTGGTTTTCCTTGGGTGCCTTTATGCGGCAAACAATGCTGCAAACTAAGAATCTCAACTCTCCCGAAGGGGGAATG CA CGTCTCCGCTTCTCTCGGTGTGGCGTGCCAGCAGATTACAATGCTGCAAACTAAGAATCTCAACTCCCGAGTGCGCC AAGGT CCTCTCCACCATGGGAATCCAATCTGCCAGCAGATTACAATGCTGCAAACTAAGAATCTCAACGTTCCTGCCTCAGA TGATGCCTAT GCGGTCCCTCCAGAGGATTTGAGGAAAAAAAATTACAATGCTGCAAACTAAGAATCTCAACAAAAAAAGTGCCACAT TCGCTAAGTGCTC 9 Table S4. Transition molecules for PC diagnostic moiety Symbol PPAP2B↓ CCGAGG GSTP1↓ GGTGCGC PIM1↑ GCAGGGC HEPSIN↑ GGGTGGC Yes → Yes transition TACTGTCTGATGAGATTGGATGGC TACTCTAACCTACCGGGCT ATATGCTGAGAGCAGGGGGATGGC GTCCCCCTACCGCCAC GCCGGATGCC GTCGTGTCGGCCTACGGCGTC + TATTTCTCCCAGCACAGCCGGAGTCCGC GTTGGATGGC GGACCGCAACCTACCGCCCA + CGGCTACCCTGGCGTTGGAGCGCGG Yes → No transition CACCAGGATG GTCCGGTGGTCCTACGGCT + CTATGCAGCAGGCCACCAGGGCTCCGA No → No transition - GATCTGGATG TCCTCTAGACCTACCCAC + TCAGCGAAGGAGATCTGGTCTGGTG XGGATGCC XCCTACGGACGC CTAAAGGAGGCAGAAAAAAGGATG GTCTTTTTTCCTACCGTC XGGATGCC XCCTACGGTCCC TTCGCACGTCCAGCTCGGATG AGGTCGAGCCTACCCCA XGGATGCC XCCTACGGCACC Table S5. ssDNA models for PC markers. Marker PPAP2B GSTP1 PIM1 HEPSIN Sequence (5'→3') GCTCTCCAATCTCATCAGACAGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCCCTGGTGGCCTGC TGCATAG GCGTTCCCCCTGCTCTCAGCATATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGACCAGATC TCCTTCGCTGA TGCCCTTTTTTCTGCCTCCTTTAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGACTCCGGCTGT GCTGGGAGAAATA ACTCCGAGCTGGACGTGCGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCGCTCCAACGCCA GGGTAGCCG Table S6. Diagnostic strings for SCLC and PC Diagnostic string Structure ASCLGRIAINSMPTTGSCLC GCAGGGCCCGAGGCGGTGCGCCGGAGGCAAAATTTACCGATTAAGTTGGA CCGGGCTCCGCCACGCGGCCTCCGTTTTAAATGGCTAATTCAACC PPAP2BGSTP1PIMHEPSINPC CCGAGGCGGTGCGCGCAGGGCGGGTGGCAAAATTTACCGATTAAGTTGGA CCGCCACGCGCGTCCCGCCCACCGTTTTAAATGGCTAATTCAACC PTTG1CDKN2ASCLC CGGAGGCCGAGGGCAAAATTTACCGATTAAGTTGGA CCGGCTCCCGTTTTAAATGGCTAATTCAACC PIM1HEPSINPC GCAGGGCGGGTGGCAAAATTTACCGATTAAGTTGGA CCGCCCACCGTTTTAAATGGCTAATTCAACC 10 Table S7. Molecules related to drug administration Molecule Yes-verifying transition molecule No-verifying transition molecule Drug-release moiety Drug-suppressor release moiety Structure GGTCAGCAGCTGAGGATGCC CCAGTCGTCGACTCCTACGGCTGC GGTCAGCAGCTGAGGATGCC CCAGTCGTCGACTCCTACGGCTCC TATT GACGCTCGACGCTCGACGCTCGTTGG G CTGCGAGCTGCGAGCTGCGAGCAACC G TACA AATA GCGAGGCGCGAGGCGCGAGGCCCATGTGC C CGCTCCGCGCTCCGCGCTCCGGGTACACG C CAA Supplementary references 1. Hamming, R. W. Error Detecting and Error Correcting Codes. Bell Syst. Tech. J. 29, 147-160 (1950).