1475-2859-9-78-S1

advertisement
Supplementary Information
Quantification of the physiochemical constraints on the export of spider silk
proteins by Salmonella type III secretion
Daniel M Widmaier and Christopher A Voigt
Table of Contents
Page
1. Construction of Hybrid psicA promoters
Figure S1: Sequences of hybrid psicA promoters with features
2. Measurement of Hybrid Promoter Activity
Figure S2: Fluorescent screening of hybrid promoters
3. Secretion and Expression Titer Quantitation
Figure S3: ADF2 secretion and expression titration with IPTG
4. LacI Expressing Plasmids
2
3
3
4
4
5
5
Figure S4: Maps for LacI expressing plasmids
5. Quantitative Western Blots
6
6
Figure S5: Blots for secreted protein IPTG titration
7
Figure S6: Blots for secreted protein timecourse
8
Figure S7: Blots for silk variant secretion and expression testing
9-10
6. Silk DNA and Amino Acid Sequences
7. Supplementary References
11
42
1
1.
Construction of Hybrid psicA Promoters
The natural psicA promoter (Figure S1) was cloned by PCR from Salmonella typhimurium
SL1344 genomic DNA and is the basis for LacO looping engineering. The lac operator DNA
sequence was introduced by site directed mutagenesis (materials and methods) and
replaced 20bp of DNA from the wild-type psicA promoter. The first variant constructed was
psicA01 which was further modified to create 71bp, 82bp and 93bp variants based upon
optimal spacing found in [1]. The optimal spacing found in this publication was 70.5bp,
81.5bp and 92.5 bp, so a series of single bp spacing mutants were constructed by site
directed mutagenesis. Additional bases (up to 3bp) were added or deleted immediately 5’
to the invF:sicA operator site (Figure S1). Interestingly a different optimum spacing for the
psicA promoter was found compared to Müller et al (Figure S2).
2
Figure S1: The sequence of the natural psicA promoter with critical features highlighted;
+1, -10, -35 nucleotides (yellow boxes), invF:sicA operator binding site (orange box) [2].
Additional colored boxes (green with blue base) show the placement of the O1 operator
DNA binding sites in the sequence to obtain additional system control. Various lac operator
spacings were constructed as measured from center to center (blue guanine nucleotide) and
are consistent with the literature for enhanced promoter control [1]. Small changes in
spacing were achieved by adding or deleting single bases immediately 5’ of the invF:sicA
minimal binding box. An example of the spacing changes is shown by comparing psicA02, a
three base deletion, with 71bp LacO psicA.
2. Measurement of All Hybrid Promoter Function
The library of LacO spacing mutants were assayed for activity by expressing green
fluorescent protein (GFP) in Salmonella and measured using flow cytometry (materials and
methods). The library was measured twice, once with no IPTG in the culture media and a
second time with 1000µM IPTG. The mean fluorescence data is shown in Figure S2A with
3
and without IPTG. The ratio of induced (+ IPTG) and uninduced (-IPTG) is shown in Figure
S2B. The sharp repression maxima as a function of operator spacing resemble the pattern
found by Müller et al and are an indicator of DNA looping.
Figure S2: A measure of dynamic range for a series of psicA promoter hybrids. Numerous
spacing variants of the psicA promoter were created (Figure S1) and tested in the presence
or absence of 1000µM IPTG. A. All promoters in this study are measured with (green bars)
and without IPTG (blue bars) using a GFP reporter and flow cytometry measurements
(materials and methods). The mean fluorescence of 100,000 events is shown here. The red
line is the average fluorescent value for non-GFP expressing cells (white cells) in the assay.
B. A plot of the ratio of +/- IPTG vs lac operator spacing.
3. Secretion and Expression Titer Quantitation
The data to generate Figure 4 is a combination of the ADF2 expression and secretion data
shown in Figure S3. This data was collected by densitometry analysis of quantitative
4
western blots.
Figure S3: Expression (A) and secretion (B) of ADF2 protein under the control of the
psicA01 (blue circles) and psicA02 (red diamonds) promoters in the presence of various
concentrations of inducer (IPTG). Protein titers are measured by quantitative western
blotting (materials and methods). Individual gels this data was process from are found in
figure S5.
4. LacI Expressing Plamsids
Several LacI expressing plasmids were used in this study to titrate LacI tetramer
concentration in the cell. All plasmids are designed around a core of the LacI open reading
frame, Kanamycin resistance marker, and p15a (pACYC) origin of replication. Maps of the
specific plasmids with size in kb are provided in figure S4.
5
Figure S4: The plasmid maps for the lacI repressor expressing plasmids used in this study.
5. Quantitative Western Blots
All quantitative western blots used in this study are copied below. Images are the result of
scans for the Cy-5 dye on a Typhoon variable mode imager. The photomultiplier tube
sensitivity was adjusted to prevent detector oversaturation for each image.
6
Figure S5: All gels used for quantitative western blotting analysis to determine the curves
in Figure 4 and Figure S3. Each blot is labeled for the internal control bands (ADF2), the
sample band with colony number and IPTG inducer concentration. The concentration of
sample loaded is labeled above the lanes. Each gel has a 1x ADF2 supernatant or 0.1x ADF2
Lysate sample from to act as an internal standard [3]. Gels were processed according to
materials and methods to determine signal intensity.
7
Figure S6: The blots to generate Figure 3 are shown. Each blot is labeled for the internal
control bands (ADF2), the sample band with colony number and timepoint of harvest (in
hours). The concentration of sample loaded is labeled above the lanes. Each gel has a 1x
ADF2 supernatant or 0.1x ADF2 Lysate sample from to act as an internal standard [3]. Gels
were processed according to materials and methods to determine signal intensity.
8
9
Figure S7: The blots used to generate Figure 5C and Figure 6 are shown. Each blot is
labeled for the internal control bands (ADF2), the sample band with colony number and
timepoint of harvest (in hours). The concentration of sample loaded is labeled above the
lanes. Each gel has a 1x ADF2 supernatant or 0.1x ADF2 Lysate sample from to act as an
10
internal standard [3]. Gels were processed according to materials and methods to
determine signal intensity. The blots for ADF4 were run in the previously published
method with an external bacterial alkaline phosphatase standard. Calculations were
performed as published for the ADF4 bands [3].
6. Silk DNA and Amino Acid Sequences
All silk amino acid and synthetic DNA sequences used in this study. The originating
organism, silk type, and ascension number (Genbank) are listed where appropriate.
Latrodectus hesperus Major Ampullate Spidroin 1 (LHF1), Genbank ascension
EF595246 [4]
Synthetic DNA Sequence:
ATGACGTGGTCTACGCGCCTGGCACTGTCTTTCCTGTTTGTACTGTGCACTCAGTCCCTGTACGCA
CTCGCTCAGGCTAATACTCCGTGGTCTAGCAAGGCAAACGCAGATGCATTCATTAATTCCTTTAT
CTCCGCAGCCTCTAACACGGGCTCTTTTAGCCAAGACCAAATGGAGGACATGAGCCTGATCGGCA
ACACGCTGATGGCTGCCATGGATAACATGGGCGGTCGCATCACGCCGTCTAAGCTGCAAGCGCTG
GACATGGCATTTGCATCCTCTGTGGCGGAGATCGCTGCCTCCGAGGGTGGTGATCTGGGTGTGAC
GACTAATGCAATCGCAGACGCTCTGACTTCCGCGTTCTACCAAACTACTGGTGTGGTGAACAGCC
GTTTTATTTCCGAGATCCGCTCTCTGATCGGTATGTTCGCACAAGCCTCCGCCAATGATGTCTACG
CTTCCGCAGGTTCTAGCGGTGGCGGCGGGTACGGGGCGTCTAGTGCTAGCGCAGCCTCAGCATCGG
CCGCAGCCCCGTCGGGAGTTGCTTATCAAGCTCCGGCACAGGCTCAAATTAGCTTTACATTGCGTG
GACAACAACCGGTATCTTACGGGCAAGGAGGCGCTGGGCCAGGCGGTGCCGGGGCAGCTGCAGCC
GCAGCGGCTGCTGCGGGCGGGGCAGGACAGGGCGGGCAGGGAGGCTATGGGCAAGGCGGATACGG
CCAGGGAGGTGCAGGCCAAGGAGGGTCAGGAGCTGCCGCCGCGGCCGCTGCTGCAGCAGGTGGAA
CGGGACAGGGTGGAGCAGGTCAGGGTGGCGCAGGTGCTGCCGCGGCTGCCGCTGCCGCAGCTGGA
GGAGCCGGACAAGGAGGTCAGGGTGGTTACGGGCAGGGTGGTTATGGGCAGGGCGGAACAGGGCA
GGGAGGTGCTGGCGCGGCCGCCGCAGCCGCAGCAGCAGGTGGAGCGGGTCAGGGAGGGCAAGGAG
GCTACGGTCAGGGCGGTTATGGCCAGGGAGGGTATGGCCAGGGTGGATCCGGAGCTGCCGCAGCT
GCAGCAGCGGCAGCCGGAGGCGCGGGCCAGGGCGGGCAGGGTGGGTATGGACAGGGAGGTTACGG
CCAGGGAGGCGCCGGACAAGGCGGTGCAGGCGCTGCAGCCGCCGCTGCGGCCGCTGCAGGAGGAG
CTGGACAAGGAGGCTACGGCAGGGGCGGCGCGGGACAGGGTGGCGCTGCCGCGGCGGCAGCAGCC
GCTGCCGGTGCGGGCCAAGGCGGCTATGGCGGCCAGGGTGCAGGACAGGGTGGCAGTGGGGCAGC
TGCGGCGGCCGCGGCAGCTGGTGGGGCAGGCCAAGGAGGGCAAGGCGGATACGGGCAAGGCGGGT
11
ATGGTCAAGGTGGCTCTGGTGCAGCGGCCGCGGCTGCTGCTGCAGGAGGGGCAGGCCAGGGAGGG
CAGGGTGGGTATGGGCAGGGAGGATACGGCCAAGGAGGCGCGGGCCAGGGCGGAGCAGGGGCAGC
TGCTGCGGCAGCGGCTGCAGGCGGCGCCGGCCAGGGAGGTCAGGGAGGATATGGACAGGGCGGGT
ACGGACAGGGTGGTGCGGGACAGGGCGGGGCGGGCGCGGCGGCAGCGGCGGCGGCTGCCGGAGGG
GCTGGGCAGGGTGGACAAGGAGGTTATGGGCAGGGAGGCTATGGACAGGGAGGTGCTGGCCAGGG
TGGCGCAGGCGCTGCTGCCGCTGCGGCCGCAGCAGGCGGAGCCGGACAGGGTGGTCAGGGTGGTT
ATGGACAAGGAGGCTACGGGCAGGGTGGAGCTGGCCAGGGCGGGGCGGCTGCTGCGGCCGCGGCA
GCAGCAGGAGGAGCGGGCCAAGGAGGATACGGGAGGGGAGGGGCGGGACAGGGTGGTGCGGCGG
CCGCTGCTGGTGCAGGACAAGGCGGGTACGGAGGCCAGGGTGCGGGTCAGGGCGGAGCAGGGGCT
GCGGCTGCAGCAGCTGCGGCGGGAGGTGCAGGCCAAGGCGGACAAGGAGGGTATGGACGCGGAGG
TTATGGGCAGGGCGGTGCTGGGCAAGGAGGGGCGGGCGCGGCTGCCGCCGCCGCTGCCGCAGGAG
GCGCAGGGCAGGGTGGACAGGGCGGCTACGGTCAGGGCGGATATGGCCAAGGAGGGGCTGGACAA
GGAGGAGCGGCAGCAGCCGCAGCAGCAGCGGGAGGTGCGGGGCAAGGAGGGTACGGCAGGGGCGG
CGCAGGCCAGGGTGGAGCGGCGGCTGCCGCAGCCGCTGCGGCCGGAGCGGGTCAGGGTGGCTATG
GAGGTCAGGGTGCCGGTCAGGGAGGGGCTGGTGCCGCGGCGGCAGCTGCAGCAGCTGGTGGCGCC
GGGCAGGGCGGCCAAGGGGATTACGGTAGAGGCGGTTACGGGCAAGGCGGAGCCGGCCAAGGCGG
TGCCGGAGCAGCGGCAGCTGCAGCCGCGGCCGGCGGAGCAGGCCAGGGCGGACAAGGCGGTTATG
GCCAAGGAGGATACGGACAAGGAGGCGCAGGACAGGGTGGTGCGGCGGCGGCCGCAAGCGCAGCG
GCAGCTGGCGGGGCCGGGCAGGGTGGGTATGGACGCGGAGGCGCGGGTCAGGGTGGCGCGGCGGC
TGCGGCCGGTGCGGGGCAAGGAGGTTATGGAGGTCAGGGAGCGGGGCAAGGCGGAGCAGGAGCAG
CAGCCGCAGCTGCTGCGGCGGGAGGAGCCGGCCAAGGAGGGCAGGGCGGATATGGTCGAGGAGGG
TACGGTCAAGGAGGCGCAGGACAGGGAGGGGCGGGTGCTGCAGCCGCTGCAACCGCTGCGGGCGG
GGCAGGCCAAGGCGGACAGGGAGGTTACGGACAGGGCGGGTATGGCCAGGGTGGTGCTGGGCAAG
GCGGGGCGGCGGCAGCGGCCGCGGCTGCCGCAGGAGGTGCCGGCCAGGGCGGCTATGGACGGGGC
GGTGCCGGACAGGGAGGAGCAGCGGCCGCGGCCGCTGCAGCAGCGGGTGCCGGCCAAGGAGGGTA
TGGCGGACAGGGAGCCGGTCAGGGTGGCGCAGGAGCCGCAGCCGCTGCCGCTGGTGGAGCTGGAC
AGGGTGGGCAGGGCGGATACGGCCGTGGCGGGTATGGTCAGGGAGGTGCTGGGCAAGGAGGAGCT
GGAGCAGCCGCCGCCGCAGCAGCTGCAGGCGGCGCCGGCCAGGGTGGTCAAGGAGGTTACGGTCA
AGGAGGATATGGCCAGGGTGGGGCAGGACAGGGCGGAGCTGCCGCGGCTGCCGCCGCAGCTGCCG
GTGGTGCGGGTCAAGGAGGTTATGGACGAGGTGGCGCGGGGCAGGGAGGAGCAGCGGCGGCCGCA
GGCGCGGGCCAGGGCGGGTACGGCGGGCAGGGCGCAGGCCAGGGCGGCGCGGGAGCTGCTGCTGC
CGCTTCAAGAGGTGCAGGTCAAGGTGGTCAAGGAGGGTATGGCCGCGGAGGATATGGACAGGGTG
GAGCTGGACAGGGAGGTGCGGGCGCGGCAGCCGCCGCGGCTGCGGCTGGTGGAGCTGGCCAAGGC
12
GGTCAAGGCGGCTATGGTCAGGGCGGTTATGGGCAAGGAGGAGCCGGCCAGGGCGGGGCGGCGGC
AGCGGCTGCTGCGGCGGGCGGCGCAGGCCAAGGAGGATACGGTCGTGGCGGTGCCGGCCAAGGAG
GCGCAGCTGCCGCAGCGGGTGCGGGGCAGGGTGGCTACGGTGGGCAGGGTGCCGGTCAGGGTGGT
GCCGGAGCGGCCGCCGCTGCTGCTGCAGCTGGTGGCGCGGGCCAGGGTGGACAGGGCGGATATGG
CCGAGGAGGGTACGGACAGGGCGGCGCAGGGCAAGGCGGGGCAGGAGCCGCCGCCGCTGCTGCGG
CGGCCGGAGGGGCTGGACAGGGTGGGCAGGGAGGCTATGGCCAGGGTGGTTATGGTCAAGGAGGA
GCCGGACAGGGTGGTGCTGCCGCTGCCGCCGCGGCTGCTGCCGGTGGAGCCGGTCAGGGCGGCTAC
GGTCGAGGCGGTGCCGGCCAGGGCGGTGCGGCCGCTGCCGCAGCCGCTGCAGCAGGAAGCGGACA
AGGTGGATATGGCGGTCAAGGCGCTGGGCAAGGAGGGGCCGGGGCGGCTGCAGCAGCAGCAGCTG
CCGGAGGCGCGGGTCAAGGCGGCCAAGGCGGATACGGGCGAGGCGGATACGGTCAGGGAGGTGCG
GGGCAAGGCGGAGCAGGCGCGGCAGCCGCTGCGGCCGCTGCGGGTGGGGCTGGTCAAGGAGGCCA
GGGCGGATACGGCCAGGGTGGTTACGGACAGGGAGGAGCAGGGCAAGGCGGAGCTGCAGCCGCGG
CAGCCGCCGCGGCCGCCGGAGGAGCGGGGCAGGGAGGGTACGGACGAGGAGGTGCTGGGCAGGGC
GGCGCTGCCGCCGCCGCTGGCGCCGGCCAAGGAGGATACGGTGGTCAGGGTGCCGGACAGGGCGG
AGCTGGAGCTGCCGCTGCAGCAGCCGCAGCAGGTGGGGCGGGACAGGGAGGTCAAGGCGGCTATG
GACGGGGCGGATACGGGCAAGGAGGGGCTGGCCAGGGAGGGGCAGGTACTGCAGCTGCGGCAGCA
GCGGCTGGCGGAGCAGGCCAAGGAGGACAGGGTGGGTACGGTCAAGGCGGGTATGGGCAGGGTGG
GGCTGGACAGGGCGGTGCAGCCGCGGCAGCCGCTGCAGCCGCGGGCGGTGCGGGCCAAGGAGGTT
ACGGGCGAGGTGGTGCGGGTCAAGGCGGGGCCGCTGCCGCAGCTGCCGCAGCGGCAGGAGCTGGG
CAGGGAGGATACGGAGGGCAAGGGGCGGGTCAAGGCGGGGCTGGCGCGGCAGCCGCTGCTGCAGC
TGCTGGAGGAGCGGGGCAGGGTGGCCAAGGTGGTTATGGTCGCGGAGGTTACGGACAAGGAGGGG
CCGGCCAGGGCGGCGCAGGAGCGGCGGCGGCTGCGGCTGCAGCGGGTGGCGCGTCCCAGGGTGGA
CAAGGTGGGTACGGTCAAGGGGATTATGGGCAAGGAGGAGCAGGTCAGGGTGGGGCTGCTGCTGC
GGCCGCGGCGGCCGGTGGAGCAGGACAGGGAGGGTATGGACGTGGTGGCGCAGGCCAAGGCGGAG
CTGCTGCCGCTGCCGGAGCAGGACAAGGAGGTTACGGAGGGCAGGGTGCTGGGCAGGGAGGAGCG
GGTGCAGCGGCAGCTGCGGCGGCGGCTGGTGGTGCCGGAAGGGGTGGGCAAGGCGGGTATGGACG
TGGCGGATACGGGCAGGGAGGGGCAGGCCAAGGTGGTGCGGGTGCTGCGGCCGCGGCGGCAGCGG
CCGGTGGTGCTGGCCAAGGTGGCCAGGGTGGCTACGGACAAGGTGGGTATGGTCAAGGTGGAACT
GGCCAAGGCGGTGCAGCTGCTGCTGCCGCGGCTGCTGCGGGCGGAGCTGGACAAGGAGGCTATGG
TCGGGGTGGAGCAGGGCAAGGAGGGGCTGCAGCCGCTGCGGCTGCGGCAGCCGGGGCCGGTCAGG
GTGGTTATGGAGGGCAGGGCGCGGGTCAAGGCGGGGCTGGGGCGGCTGCTGCTGCAGCAGCGGCC
GGCGGCGCCGGGCAGGGTGGACAGGGTGGCTACGGAAGAGGAGGCTACGGTCAAGGAGGCGCAGG
CCAAGGCGGGGCAGGAGCTGCTGCTGCCGCGGCGGCCGCGGGTGGTGCAGGGCAAGGAGGACAGG
13
GAGGGTACGGCCAAGGAGGATATGGGCAAGGCGGTTACGGTCAGGGTGGAGCTGGACAAGGCGGA
GCGGCTGCCGCTGCTGCAGCTGCTGGCGGCGCGGGACAAGGTGGTTACGGGAGGGGAGGTGCCGG
ACAGGGTGGCGCTGCAGCCGCCGCTGGTGCTGGCCAAGGAGGTTACGGTGGCCAAGGTGCTGGGC
AGGGTGGGGCTGGTGCGGCTGCTGCGGCGGCAGCTGCGGGAGGAGCAGGGCAGGGCGGACAAGGT
GGCTATGGCAGGGGCGGTTATGGACAGGGCGGCGCAGGACAAGGAGGAGCGGGCGCGGCGGCTGC
AGCAGCCGCTGCCGGAGGCGCCGGTCAGGGTGGCCAGGGCGGGTACGGCCAAGGCGGATACGGTC
AAGGCGGGGCAGGACAGGGTGGCGCTGCGGCTGCGGCAGCAGCAGCGGCAGGAGGGGCGGGCCAA
GGCGGATATGGGCGGGGCGGAGCGGGACAAGGCGGAGCGGCAGCTGCAGCGGCCGCAGCGGCGGG
TTCAGGCCAGGGAGGTTACGGCGGCCAAGGTGCCGGACAGGGAGGAGCAGGCGCCGCGGCCGCCG
CAGCAGCTGCGGGAGGCGCCGGGCAGGGAGGGCAAGGAGGTTACGGGCGTGGCGGCTACGGACAA
GGTGGCGCGGGCCAGGGTGGGGCTGGCGCAGCGGCGGCGGCTGCCGCCGCAGGCGGCGCTGGGCA
AGGCGGGCAGGGTGGATACGGGCAGGGCGGATATGGCCAGGGTGGGTACGGTCAAGGAGGTGCCG
GACAGGGAGGAGCAGCGGCAGCAGCTGCGGCGGCGGCAGCTGGAGGGGCAGGGCAGGGCGGTTAT
GGTCGTGGAGGGGCCGGACAAGGAGGTGCAGCTGCAGCAGCGGGAGCTGGGCAGGGTGGATACGG
AGGTCAGGGCGCAGGACAGGGTGGTGCTGGCGCTGCGGCTGCGGCAGCCGCGGCGGGAGGCGCGG
GGCAGGGAGGCCAAGGCGGGTATGGAAGAGGAGGCTATGGGCAAGGCGGTGCGGGACAAGGTGG
GGCCGGTGCTGCGGCGGCTGCAGCGGCTGCTGGAGGTGCTGGCCAAGGAGGGCAAGGCGGATACG
GTCAGGGTGGAAATGGGCAAGGCGGAGCGGGCCAGGGCGGCGCGGCCGCTGCCGCGGCCGCAGCA
GGCGGCGCGGGCCAAGGCGGGTACGGTCGTGGCGGGGCCGGACAAGGCGGAGCTGCCGCCGCCGC
TGCGGCCGCGGCTGGGGCTGGGCAGGGTGGATATGGAGGGCAAGGCGCGGGACAGGGTGGTGCAG
GCGCGGCTGCCGCTGCTGCTGCGGCGGGCGGAGCAGGGCAGGGCGGCCAGGGTGGATACGGGAGA
GGTGGCTACGGTCAAGGAGGGGCCGGCCAAGGTGGTGCGGGAGCAGCCGCTGCAGCTGCGGCCGC
TGGTGGGGCTTCGCAGGGTGGCCAGGGTGGTTACGGGCAAGGTGATTACGGGCAGGGAGGAGCGG
GACAAGGTGGGGCCGCCGCTGCGGCTGCGGCAGCAGGTGGCGCTGGACAAGGAGGGTATGGACGA
GGAGGGGCAGGCCAAGGCGGGGCTGCCGCTGCTGCAGGGGCAGGTCAAGGCGGGTACGGCGGACA
AGGTGCAGGGCAAGGAGGTGCTGGAGCTGCGGCTGCAGCAGCAGCTGCGGGTGGGGCAGGTAGAG
GAGGCCAAGGAGGCTACGGAAGAGGCGGATACGGTCAGGGCGGTGCGGGCCAGGGTGGTGCTGGT
GCTGCCGCAGCCGCCGCTGCTGCCGGTGGGGCTGGACAGGGTGGACAAGGTGGGTACGGACAAGG
TGGCTACGGGCAGGGAGGCGCGGGTCAAGGAGGGGCGGCGGCAGCCGCTGCAGCTGCCGCTGGTG
GCGCCGGACAAGGAGGCTATGGCCGTGGTGGGGCGGGGCAAGGCGGCGCGGCGGCCGCGGCGGGT
GCGGGGCAGGGCGGCTATGGAGGTCAAGGAGCCGGCCAAGGTGGGGCAGGCGCAGCAGCCGCTGC
GGCCGCTGCCGGAGGTGCTGGGCGAGGAGGGCAGGGTGGGTATGGTAGAGGCGGGTACGGGCAGG
GAGGCGCCGGACAGGGAGGGGCTGGTGCCGCTGCGGCTGCGGCAGCCGCCGGAGGAGCGGGTCAG
14
GGTGGCCAGGGCGGCTACGGTCAGGGCGGGTATGGGCAGGGCGGGGCTGGACAAGGTGGCGCCGC
GGCAGCTGCAGCAGCTGCGGTGGGAGGGGCGGGTCAGGGTGGATACGGCCGAGGTGGGGCAGGTC
AAGGCGGTGCTGCGGCCGCGGCGGCCGCTGCAGCCGCTGGTTCTGGTCAGGGAGGTTATGGAGGC
CAAGGAGCAGGCCAAGGCGGCGCCGGGGCCGCTGCCGCCGCCGCTGCTGCTGGCGGAGCAGGGCAG
GGTGGACAGGGCGGTTACGGTGGTGGAGGATACGGACAAGGAGGTGCCGGCCAAGGCGGAGCTGG
AGCTGCTGCGGCAGCAGCAGCCGCGGGAGGCGCTGGACAGGGTGGGCAGGGCGGCTACGGTCAGG
GTGGATACGGTCAGGGCGGGGCGGGGCAGGGTGGAGCCGCAGCGGCTGCAGCTGCGGCCGCGGGC
GGAGCAGGGCAGGGAGGCTATGGTCGGGGTGGAGCCGGACAAGGAGGCGCGGCGGCCGCTACCGG
TGCTGGACAAGGTGGCTATGGCGGTCAAGGAGCAGGACAAGGCGGAGCGGGCGCCGCTGCTGCAG
CGGCAGCTGCTGGTGGGGCCGGTCAGGGTGGCCAAGGAGGATACGGCAGGGGTGGCTACGGCCAA
GGCGGTGCAGGTCAAGGAGGTGCTGGGGCCGCTGCCGCCGCCGCAGCCGCCGGAGGAGCTGGGCA
AGGCGGACAAGGAGGGTATGGCCAAGGTGGGTACGGGCAGGGGGGTGCCGGACAAGGCGGAGCGG
CCGCAGCGGCTGCAGCCGCCGGTGGTGCTGGCCAGGGTGGCTATGGGAGAGGCGGGGCCGGTCAA
GGAGGCGCCGCGGCAGCTGCGGCGGCCGCGGCAGGGGCGGGGCAGGGCGGTTATGGAGGACAAGG
CGCGGGTCAGGGTGGAGCGGGTGCGGCGGCGGCAGCGGCAGCTGCGGGCGGCGCTGGGCAAGGAG
GTCAAGGCGGGTACGGTCGAGGTGGCTACGGACAGGGAGGGGCTGGACAAGGTGGGGCGGGTGCC
GCAGCGGCGGGAGGAGCCGGTCAGGGTGGGCAAGGCGGCTACGGTCAGGGTGGATATGGCCAAGG
TGGTGCAGGGCAGGGTGGGGCTGCAGCTGCTGCCGCAGCCGCAGCCGCAGGCGGCGCGGGCCAAG
GAGGATACGGCGGATATGGTCAGCAAGGCGGAGCTGGAGCGGCTGCTGCAGCGGCTTCAGGACCG
GGCCAGATCTACTACGGTCCGCAGTCCGTTGCCGCCCCAGCTGCTGCTGCGGCGTCTGCGTTGGCA
GCACCGGCCACTTCTGCACGCATTTCGTCGCACGCGTCCGCATTGCTTAGTAATGGGCCGACAAAT
CCTGCGTCCATCTCCAACGTAATCAGTAACGCAGTAAGCCAAATTTCTAGCTCTAACCCAGGCGCT
TCTGCCTGTGATGTGCTAGTTCAGGCCCTACTTGAGCTCGTAACGGCGCTATTGACTATTATCGGC
TCATCAAATATTGGTAGTGTGAACTATGACAGTTCGGGCCAGTACGCACAAGTCGTTACTCAGTC
AGTTCAAAACGCCTTCGCC
Amino Acid Sequence:
MTWSTRLALSFLFVLCTQSLYALAQANTPWSSKANADAFINSFISAASNTGSFSQDQMEDMSLIGN
TLMAAMDNMGGRITPSKLQALDMAFASSVAEIAASEGGDLGVTTNAIADALTSAFYQTTGVVNSR
FISEIRSLIGMFAQASANDVYASAGSSGGGGYGASSASAASASAAAPSGVAYQAPAQAQISFTLRGQQ
PVSYGQGGAGPGGAGAAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGSGAAAAAAAAAGGTG
QGGAGQGGAGAAAAAAAAAGGAGQGGQGGYGQGGYGQGGTGQGGAGAAAAAAAAGGAGQGGQG
GYGQGGYGQGGYGQGGSGAAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAGAAAAAAAAA
15
GGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGSGAAAAAAAAGGAGQGGQGGY
GQGGYGQGGSGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQG
GYGQGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAGAAAAAAAAGG
AGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGG
QGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGY
GQGGYGQGGAGQGGAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQG
GAGAAAAAAAAGGAGQGGQGDYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYG
QGGAGQGGAAAAASAAAAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAA
AAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAATAAGGAGQGGQGGYGQGGYGQGGAGQGG
AAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAAAAAAGGAG
QGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAA
AAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAASRGAGQGGQGGYGRGG
YGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAGGAGQGGYGR
GGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGG
AGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGAA
AAAAAAAGSGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAA
AAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAG
AGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGTAAAAAAAGGA
GQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGG
YGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGASQGGQ
GGYGQGDYGQGGAGQGGAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGG
AGAAAAAAAAGGAGRGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQ
GGTGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAAA
AAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGYGQ
GGAGQGGAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAA
GGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAA
AAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGSGQGGYGGQGAGQGGAGAAAAAAAAGGAGQ
GGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGYGQGGAGQGGAAA
AAAAAAAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQ
GGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGNGQGGAGQGGAAAAAAAAGGA
GQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRG
GYGQGGAGQGGAGAAAAAAAAGGASQGGQGGYGQGDYGQGGAGQGGAAAAAAAAGGAGQGGYG
RGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGYGQGGAGQG
16
GAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGA
AAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGYGQGGAGQGGAGAAAAAA
AAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAVGGAGQGGYGRGGAGQGGAAAAAAAAAA
GSGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGGGGYGQGGAGQGGAGAAAAAAAAGG
AGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGAAAATGAGQGGYGG
QGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGY
GQGGYGQGGAGQGGAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQG
GAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAGGAGQGGQGGYGQGGYGQGGA
GQGGAAAAAAAAAAGGAGQGGYGGYGQQGGAGAAAAAASGPGQIYYGPQSVAAPAAAAASALAAP
ATSARISSHASALLSNGPTNPASISNVISNAVSQISSSNPGASACDVLVQALLELVTALLTIIGSSNIGS
VNYDSSGQYAQVVTQSVQNAFA
8B LHF1:
Synthetic DNA Sequence:
GGTGGGGCCGGTCAGGGTGGCCAAGGAGGATACGGCAGGGGTGGCTACGGCCAAGGCGGTGCAGG
TCAAGGAGGTGCTGGGGCCGCTGCCGCCGCCGCAGCCGCCGGAGGAGCTGGGCAAGGCGGACAAG
GAGGGTATGGCCAAGGTGGGTACGGGCAGGGGGGTGCCGGACAAGGCGGAGCGGCCGCAGCGGCT
GCAGCCGCCGGTGGTGCTGGCCAGGGTGGCTATGGGAGAGGCGGGGCCGGTCAAGGAGGCGCCGC
GGCAGCTGCGGCGGCCGCGGCAGGGGCGGGGCAGGGCGGTTATGGAGGACAAGGCGCGGGTCAGG
GTGGAGCGGGTGCGGCGGCGGCAGCGGCAGCTGCGGGCGGCGCTGGGCAAGGAGGTCAAGGCGGG
TACGGTCGAGGTGGCTACGGACAGGGAGGGGCTGGACAAGGTGGGGCGGGTGCCGCAGCGGCGGG
AGGAGCCGGTCAGGGTGGGCAAGGCGGCTACGGTCAGGGTGGATATGGCCAAGGTGGTGCAGGGC
AGGGTGGGGCTGCAGCTGCTGCCGCAGCCGCAGCCGCAGGCGGCGCGGGCCAAGGAGGATACGGC
GGATATGGTCAGCAAGGCGGAGCTGGAGCGGCTGCTGCAGCGGCTTCAGGACCGGGCCAGATCTA
CTACGGTCCGCAGTCCGTTGCCGCCCCAGCTGCTGCTGCGGCGTCTGCGTTGGCAGCACCGGCCAC
TTCTGCACGCATTTCGTCGCACGCGTCCGCATTGCTTAGTAATGGGCCGACAAATCCTGCGTCCAT
CTCCAACGTAATCAGTAACGCAGTAAGCCAAATTTCTAGCTCTAACCCAGGCGCTTCTGCCTGTG
ATGTGCTAGTTCAGGCCCTACTTGAGCTCGTAACGGCGCTATTGACTATTATCGGCTCATCAAAT
ATTGGTAGTGTGAACTATGACAGTTCGGGCCAGTACGCACAAGTCGTTACTCAGTCAGTTCAAAA
CGCCTTCGCC
Amino Acid Sequence:
17
GGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAA
AAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQG
GQGGYGRGGYGQGGAGQGGAGAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAAGGA
GQGGYGGYGQQGGAGAAAAAASGPGQIYYGPQSVAAPAAAAASALAAPATSARISSHASALLSNGP
TNPASISNVISNAVSQISSSNPGASACDVLVQALLELVTALLTIIGSSNIGSVNYDSSGQYAQVVTQSV
QNAFA
12B LHF1:
Synthetic DNA Sequence:
GGCGGAGCAGGGCAGGGTGGACAGGGCGGTTACGGTGGTGGAGGATACGGACAAGGAGGTGCCGG
CCAAGGCGGAGCTGGAGCTGCTGCGGCAGCAGCAGCCGCGGGAGGCGCTGGACAGGGTGGGCAGG
GCGGCTACGGTCAGGGTGGATACGGTCAGGGCGGGGCGGGGCAGGGTGGAGCCGCAGCGGCTGCA
GCTGCGGCCGCGGGCGGAGCAGGGCAGGGAGGCTATGGTCGGGGTGGAGCCGGACAAGGAGGCGC
GGCGGCCGCTACCGGTGCTGGACAAGGTGGCTATGGCGGTCAAGGAGCAGGACAAGGCGGAGCGG
GCGCCGCTGCTGCAGCGGCAGCTGCTGGTGGGGCCGGTCAGGGTGGCCAAGGAGGATACGGCAGG
GGTGGCTACGGCCAAGGCGGTGCAGGTCAAGGAGGTGCTGGGGCCGCTGCCGCCGCCGCAGCCGCC
GGAGGAGCTGGGCAAGGCGGACAAGGAGGGTATGGCCAAGGTGGGTACGGGCAGGGGGGTGCCGG
ACAAGGCGGAGCGGCCGCAGCGGCTGCAGCCGCCGGTGGTGCTGGCCAGGGTGGCTATGGGAGAG
GCGGGGCCGGTCAAGGAGGCGCCGCGGCAGCTGCGGCGGCCGCGGCAGGGGCGGGGCAGGGCGGT
TATGGAGGACAAGGCGCGGGTCAGGGTGGAGCGGGTGCGGCGGCGGCAGCGGCAGCTGCGGGCGG
CGCTGGGCAAGGAGGTCAAGGCGGGTACGGTCGAGGTGGCTACGGACAGGGAGGGGCTGGACAAG
GTGGGGCGGGTGCCGCAGCGGCGGGAGGAGCCGGTCAGGGTGGGCAAGGCGGCTACGGTCAGGGT
GGATATGGCCAAGGTGGTGCAGGGCAGGGTGGGGCTGCAGCTGCTGCCGCAGCCGCAGCCGCAGG
CGGCGCGGGCCAAGGAGGATACGGCGGATATGGTCAGCAAGGCGGAGCTGGAGCGGCTGCTGCAG
CGGCTTCAGGACCGGGCCAGATCTACTACGGTCCGCAGTCCGTTGCCGCCCCAGCTGCTGCTGCGG
CGTCTGCGTTGGCAGCACCGGCCACTTCTGCACGCATTTCGTCGCACGCGTCCGCATTGCTTAGTA
ATGGGCCGACAAATCCTGCGTCCATCTCCAACGTAATCAGTAACGCAGTAAGCCAAATTTCTAGC
TCTAACCCAGGCGCTTCTGCCTGTGATGTGCTAGTTCAGGCCCTACTTGAGCTCGTAACGGCGCTA
TTGACTATTATCGGCTCATCAAATATTGGTAGTGTGAACTATGACAGTTCGGGCCAGTACGCACA
AGTCGTTACTCAGTCAGTTCAAAACGCCTTCGCC
Amino Acid Sequence:
18
GGAGQGGQGGYGGGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAA
AAAAAAGGAGQGGYGRGGAGQGGAAAATGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQG
GYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAGGAG
QGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGG
YGQGGAGQGGAGAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAAGGAGQGGYGGYG
QQGGAGAAAAAASGPGQIYYGPQSVAAPAAAAASALAAPATSARISSHASALLSNGPTNPASISNVIS
NAVSQISSSNPGASACDVLVQALLELVTALLTIIGSSNIGSVNYDSSGQYAQVVTQSVQNAFA
16B LHF1:
Synthetic DNA Sequence:
GGAGGTGCTGGGCGAGGAGGGCAGGGTGGGTATGGTAGAGGCGGGTACGGGCAGGGAGGCGCCGG
ACAGGGAGGGGCTGGTGCCGCTGCGGCTGCGGCAGCCGCCGGAGGAGCGGGTCAGGGTGGCCAGG
GCGGCTACGGTCAGGGCGGGTATGGGCAGGGCGGGGCTGGACAAGGTGGCGCCGCGGCAGCTGCA
GCAGCTGCGGTGGGAGGGGCGGGTCAGGGTGGATACGGCCGAGGTGGGGCAGGTCAAGGCGGTGC
TGCGGCCGCGGCGGCCGCTGCAGCCGCTGGTTCTGGTCAGGGAGGTTATGGAGGCCAAGGAGCAG
GCCAAGGCGGCGCCGGGGCCGCTGCCGCCGCCGCTGCTGCTGGCGGAGCAGGGCAGGGTGGACAGG
GCGGTTACGGTGGTGGAGGATACGGACAAGGAGGTGCCGGCCAAGGCGGAGCTGGAGCTGCTGCG
GCAGCAGCAGCCGCGGGAGGCGCTGGACAGGGTGGGCAGGGCGGCTACGGTCAGGGTGGATACGG
TCAGGGCGGGGCGGGGCAGGGTGGAGCCGCAGCGGCTGCAGCTGCGGCCGCGGGCGGAGCAGGGC
AGGGAGGCTATGGTCGGGGTGGAGCCGGACAAGGAGGCGCGGCGGCCGCTACCGGTGCTGGACAA
GGTGGCTATGGCGGTCAAGGAGCAGGACAAGGCGGAGCGGGCGCCGCTGCTGCAGCGGCAGCTGC
TGGTGGGGCCGGTCAGGGTGGCCAAGGAGGATACGGCAGGGGTGGCTACGGCCAAGGCGGTGCAG
GTCAAGGAGGTGCTGGGGCCGCTGCCGCCGCCGCAGCCGCCGGAGGAGCTGGGCAAGGCGGACAA
GGAGGGTATGGCCAAGGTGGGTACGGGCAGGGGGGTGCCGGACAAGGCGGAGCGGCCGCAGCGGC
TGCAGCCGCCGGTGGTGCTGGCCAGGGTGGCTATGGGAGAGGCGGGGCCGGTCAAGGAGGCGCCG
CGGCAGCTGCGGCGGCCGCGGCAGGGGCGGGGCAGGGCGGTTATGGAGGACAAGGCGCGGGTCAG
GGTGGAGCGGGTGCGGCGGCGGCAGCGGCAGCTGCGGGCGGCGCTGGGCAAGGAGGTCAAGGCGG
GTACGGTCGAGGTGGCTACGGACAGGGAGGGGCTGGACAAGGTGGGGCGGGTGCCGCAGCGGCGG
GAGGAGCCGGTCAGGGTGGGCAAGGCGGCTACGGTCAGGGTGGATATGGCCAAGGTGGTGCAGGG
CAGGGTGGGGCTGCAGCTGCTGCCGCAGCCGCAGCCGCAGGCGGCGCGGGCCAAGGAGGATACGG
CGGATATGGTCAGCAAGGCGGAGCTGGAGCGGCTGCTGCAGCGGCTTCAGGACCGGGCCAGATCT
ACTACGGTCCGCAGTCCGTTGCCGCCCCAGCTGCTGCTGCGGCGTCTGCGTTGGCAGCACCGGCCA
CTTCTGCACGCATTTCGTCGCACGCGTCCGCATTGCTTAGTAATGGGCCGACAAATCCTGCGTCCA
19
TCTCCAACGTAATCAGTAACGCAGTAAGCCAAATTTCTAGCTCTAACCCAGGCGCTTCTGCCTGTG
ATGTGCTAGTTCAGGCCCTACTTGAGCTCGTAACGGCGCTATTGACTATTATCGGCTCATCAAAT
ATTGGTAGTGTGAACTATGACAGTTCGGGCCAGTACGCACAAGTCGTTACTCAGTCAGTTCAAAA
CGCCTTCGCC
Amino Acid Sequence:
GGAGRGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAA
AAAAAVGGAGQGGYGRGGAGQGGAAAAAAAAAAGSGQGGYGGQGAGQGGAGAAAAAAAAGGAG
QGGQGGYGGGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAA
AAGGAGQGGYGRGGAGQGGAAAATGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGR
GGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAGGAGQGGY
GRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQG
GAGQGGAGAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAAGGAGQGGYGGYGQQGG
AGAAAAAASGPGQIYYGPQSVAAPAAAAASALAAPATSARISSHASALLSNGPTNPASISNVISNAVS
QISSSNPGASACDVLVQALLELVTALLTIIGSSNIGSVNYDSSGQYAQVVTQSVQNAFA
20B LHF1:
Synthetic DNA Sequence:
GGTGGGGCAGGTAGAGGAGGCCAAGGAGGCTACGGAAGAGGCGGATACGGTCAGGGCGGTGCGG
GCCAGGGTGGTGCTGGTGCTGCCGCAGCCGCCGCTGCTGCCGGTGGGGCTGGACAGGGTGGACAA
GGTGGGTACGGACAAGGTGGCTACGGGCAGGGAGGCGCGGGTCAAGGAGGGGCGGCGGCAGCCGC
TGCAGCTGCCGCTGGTGGCGCCGGACAAGGAGGCTATGGCCGTGGTGGGGCGGGGCAAGGCGGCG
CGGCGGCCGCGGCGGGTGCGGGGCAGGGCGGCTATGGAGGTCAAGGAGCCGGCCAAGGTGGGGCA
GGCGCAGCAGCCGCTGCGGCCGCTGCCGGAGGTGCTGGGCGAGGAGGGCAGGGTGGGTATGGTAG
AGGCGGGTACGGGCAGGGAGGCGCCGGACAGGGAGGGGCTGGTGCCGCTGCGGCTGCGGCAGCCG
CCGGAGGAGCGGGTCAGGGTGGCCAGGGCGGCTACGGTCAGGGCGGGTATGGGCAGGGCGGGGCT
GGACAAGGTGGCGCCGCGGCAGCTGCAGCAGCTGCGGTGGGAGGGGCGGGTCAGGGTGGATACGG
CCGAGGTGGGGCAGGTCAAGGCGGTGCTGCGGCCGCGGCGGCCGCTGCAGCCGCTGGTTCTGGTCA
GGGAGGTTATGGAGGCCAAGGAGCAGGCCAAGGCGGCGCCGGGGCCGCTGCCGCCGCCGCTGCTG
CTGGCGGAGCAGGGCAGGGTGGACAGGGCGGTTACGGTGGTGGAGGATACGGACAAGGAGGTGCC
GGCCAAGGCGGAGCTGGAGCTGCTGCGGCAGCAGCAGCCGCGGGAGGCGCTGGACAGGGTGGGCA
GGGCGGCTACGGTCAGGGTGGATACGGTCAGGGCGGGGCGGGGCAGGGTGGAGCCGCAGCGGCTG
CAGCTGCGGCCGCGGGCGGAGCAGGGCAGGGAGGCTATGGTCGGGGTGGAGCCGGACAAGGAGGC
20
GCGGCGGCCGCTACCGGTGCTGGACAAGGTGGCTATGGCGGTCAAGGAGCAGGACAAGGCGGAGC
GGGCGCCGCTGCTGCAGCGGCAGCTGCTGGTGGGGCCGGTCAGGGTGGCCAAGGAGGATACGGCA
GGGGTGGCTACGGCCAAGGCGGTGCAGGTCAAGGAGGTGCTGGGGCCGCTGCCGCCGCCGCAGCC
GCCGGAGGAGCTGGGCAAGGCGGACAAGGAGGGTATGGCCAAGGTGGGTACGGGCAGGGGGGTGC
CGGACAAGGCGGAGCGGCCGCAGCGGCTGCAGCCGCCGGTGGTGCTGGCCAGGGTGGCTATGGGA
GAGGCGGGGCCGGTCAAGGAGGCGCCGCGGCAGCTGCGGCGGCCGCGGCAGGGGCGGGGCAGGGC
GGTTATGGAGGACAAGGCGCGGGTCAGGGTGGAGCGGGTGCGGCGGCGGCAGCGGCAGCTGCGGG
CGGCGCTGGGCAAGGAGGTCAAGGCGGGTACGGTCGAGGTGGCTACGGACAGGGAGGGGCTGGAC
AAGGTGGGGCGGGTGCCGCAGCGGCGGGAGGAGCCGGTCAGGGTGGGCAAGGCGGCTACGGTCAG
GGTGGATATGGCCAAGGTGGTGCAGGGCAGGGTGGGGCTGCAGCTGCTGCCGCAGCCGCAGCCGC
AGGCGGCGCGGGCCAAGGAGGATACGGCGGATATGGTCAGCAAGGCGGAGCTGGAGCGGCTGCTG
CAGCGGCTTCAGGACCGGGCCAGATCTACTACGGTCCGCAGTCCGTTGCCGCCCCAGCTGCTGCTG
CGGCGTCTGCGTTGGCAGCACCGGCCACTTCTGCACGCATTTCGTCGCACGCGTCCGCATTGCTTA
GTAATGGGCCGACAAATCCTGCGTCCATCTCCAACGTAATCAGTAACGCAGTAAGCCAAATTTCT
AGCTCTAACCCAGGCGCTTCTGCCTGTGATGTGCTAGTTCAGGCCCTACTTGAGCTCGTAACGGCG
CTATTGACTATTATCGGCTCATCAAATATTGGTAGTGTGAACTATGACAGTTCGGGCCAGTACGC
ACAAGTCGTTACTCAGTCAGTTCAAAACGCCTTCGCC
Amino Acid Sequence:
GGAGRGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAA
AAAAAAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQG
GYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAVGGA
GQGGYGRGGAGQGGAAAAAAAAAAGSGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGG
GGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGG
YGRGGAGQGGAAAATGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAG
QGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAGGAGQGGYGRGGAGQGG
AAAAAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGA
AAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAAGGAGQGGYGGYGQQGGAGAAAAAAS
GPGQIYYGPQSVAAPAAAAASALAAPATSARISSHASALLSNGPTNPASISNVISNAVSQISSSNPGAS
ACDVLVQALLELVTALLTIIGSSNIGSVNYDSSGQYAQVVTQSVQNAFA
24B LHF1:
Synthetic DNA Sequence:
21
GCGGGCGGAGCAGGGCAGGGCGGCCAGGGTGGATACGGGAGAGGTGGCTACGGTCAAGGAGGGGC
CGGCCAAGGTGGTGCGGGAGCAGCCGCTGCAGCTGCGGCCGCTGGTGGGGCTTCGCAGGGTGGCC
AGGGTGGTTACGGGCAAGGTGATTACGGGCAGGGAGGAGCGGGACAAGGTGGGGCCGCCGCTGCG
GCTGCGGCAGCAGGTGGCGCTGGACAAGGAGGGTATGGACGAGGAGGGGCAGGCCAAGGCGGGGC
TGCCGCTGCTGCAGGGGCAGGTCAAGGCGGGTACGGCGGACAAGGTGCAGGGCAAGGAGGTGCTG
GAGCTGCGGCTGCAGCAGCAGCTGCGGGTGGGGCAGGTAGAGGAGGCCAAGGAGGCTACGGAAGA
GGCGGATACGGTCAGGGCGGTGCGGGCCAGGGTGGTGCTGGTGCTGCCGCAGCCGCCGCTGCTGCC
GGTGGGGCTGGACAGGGTGGACAAGGTGGGTACGGACAAGGTGGCTACGGGCAGGGAGGCGCGGG
TCAAGGAGGGGCGGCGGCAGCCGCTGCAGCTGCCGCTGGTGGCGCCGGACAAGGAGGCTATGGCC
GTGGTGGGGCGGGGCAAGGCGGCGCGGCGGCCGCGGCGGGTGCGGGGCAGGGCGGCTATGGAGGT
CAAGGAGCCGGCCAAGGTGGGGCAGGCGCAGCAGCCGCTGCGGCCGCTGCCGGAGGTGCTGGGCG
AGGAGGGCAGGGTGGGTATGGTAGAGGCGGGTACGGGCAGGGAGGCGCCGGACAGGGAGGGGCT
GGTGCCGCTGCGGCTGCGGCAGCCGCCGGAGGAGCGGGTCAGGGTGGCCAGGGCGGCTACGGTCA
GGGCGGGTATGGGCAGGGCGGGGCTGGACAAGGTGGCGCCGCGGCAGCTGCAGCAGCTGCGGTGG
GAGGGGCGGGTCAGGGTGGATACGGCCGAGGTGGGGCAGGTCAAGGCGGTGCTGCGGCCGCGGCG
GCCGCTGCAGCCGCTGGTTCTGGTCAGGGAGGTTATGGAGGCCAAGGAGCAGGCCAAGGCGGCGC
CGGGGCCGCTGCCGCCGCCGCTGCTGCTGGCGGAGCAGGGCAGGGTGGACAGGGCGGTTACGGTG
GTGGAGGATACGGACAAGGAGGTGCCGGCCAAGGCGGAGCTGGAGCTGCTGCGGCAGCAGCAGCC
GCGGGAGGCGCTGGACAGGGTGGGCAGGGCGGCTACGGTCAGGGTGGATACGGTCAGGGCGGGGC
GGGGCAGGGTGGAGCCGCAGCGGCTGCAGCTGCGGCCGCGGGCGGAGCAGGGCAGGGAGGCTATG
GTCGGGGTGGAGCCGGACAAGGAGGCGCGGCGGCCGCTACCGGTGCTGGACAAGGTGGCTATGGC
GGTCAAGGAGCAGGACAAGGCGGAGCGGGCGCCGCTGCTGCAGCGGCAGCTGCTGGTGGGGCCGG
TCAGGGTGGCCAAGGAGGATACGGCAGGGGTGGCTACGGCCAAGGCGGTGCAGGTCAAGGAGGTG
CTGGGGCCGCTGCCGCCGCCGCAGCCGCCGGAGGAGCTGGGCAAGGCGGACAAGGAGGGTATGGC
CAAGGTGGGTACGGGCAGGGGGGTGCCGGACAAGGCGGAGCGGCCGCAGCGGCTGCAGCCGCCGG
TGGTGCTGGCCAGGGTGGCTATGGGAGAGGCGGGGCCGGTCAAGGAGGCGCCGCGGCAGCTGCGG
CGGCCGCGGCAGGGGCGGGGCAGGGCGGTTATGGAGGACAAGGCGCGGGTCAGGGTGGAGCGGGT
GCGGCGGCGGCAGCGGCAGCTGCGGGCGGCGCTGGGCAAGGAGGTCAAGGCGGGTACGGTCGAGG
TGGCTACGGACAGGGAGGGGCTGGACAAGGTGGGGCGGGTGCCGCAGCGGCGGGAGGAGCCGGTC
AGGGTGGGCAAGGCGGCTACGGTCAGGGTGGATATGGCCAAGGTGGTGCAGGGCAGGGTGGGGCT
GCAGCTGCTGCCGCAGCCGCAGCCGCAGGCGGCGCGGGCCAAGGAGGATACGGCGGATATGGTCA
GCAAGGCGGAGCTGGAGCGGCTGCTGCAGCGGCTTCAGGACCGGGCCAGATCTACTACGGTCCGC
AGTCCGTTGCCGCCCCAGCTGCTGCTGCGGCGTCTGCGTTGGCAGCACCGGCCACTTCTGCACGCA
22
TTTCGTCGCACGCGTCCGCATTGCTTAGTAATGGGCCGACAAATCCTGCGTCCATCTCCAACGTAA
TCAGTAACGCAGTAAGCCAAATTTCTAGCTCTAACCCAGGCGCTTCTGCCTGTGATGTGCTAGTT
CAGGCCCTACTTGAGCTCGTAACGGCGCTATTGACTATTATCGGCTCATCAAATATTGGTAGTGT
GAACTATGACAGTTCGGGCCAGTACGCACAAGTCGTTACTCAGTCAGTTCAAAACGCCTTCGCC
Amino Acid Sequence:
AGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGASQGGQGGYGQGDYGQGGAGQGGAA
AAAAAAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQG
GYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGA
GQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGYGQ
GGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAVGGAGQGGYGRGG
AGQGGAAAAAAAAAAGSGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGGGGYGQGGAG
QGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQG
GAAAATGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAA
AAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAA
GAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAGGAGQG
GQGGYGQGGYGQGGAGQGGAAAAAAAAAAGGAGQGGYGGYGQQGGAGAAAAAASGPGQIYYGPQ
SVAAPAAAAASALAAPATSARISSHASALLSNGPTNPASISNVISNAVSQISSSNPGASACDVLVQAL
LELVTALLTIIGSSNIGSVNYDSSGQYAQVVTQSVQNAFA
28B LHF1:
Synthetic DNA Sequence:
GGAGGCGCGGGGCAGGGAGGCCAAGGCGGGTATGGAAGAGGAGGCTATGGGCAAGGCGGTGCGG
GACAAGGTGGGGCCGGTGCTGCGGCGGCTGCAGCGGCTGCTGGAGGTGCTGGCCAAGGAGGGCAA
GGCGGATACGGTCAGGGTGGAAATGGGCAAGGCGGAGCGGGCCAGGGCGGCGCGGCCGCTGCCGC
GGCCGCAGCAGGCGGCGCGGGCCAAGGCGGGTACGGTCGTGGCGGGGCCGGACAAGGCGGAGCTG
CCGCCGCCGCTGCGGCCGCGGCTGGGGCTGGGCAGGGTGGATATGGAGGGCAAGGCGCGGGACAG
GGTGGTGCAGGCGCGGCTGCCGCTGCTGCTGCGGCGGGCGGAGCAGGGCAGGGCGGCCAGGGTGG
ATACGGGAGAGGTGGCTACGGTCAAGGAGGGGCCGGCCAAGGTGGTGCGGGAGCAGCCGCTGCAG
CTGCGGCCGCTGGTGGGGCTTCGCAGGGTGGCCAGGGTGGTTACGGGCAAGGTGATTACGGGCAG
GGAGGAGCGGGACAAGGTGGGGCCGCCGCTGCGGCTGCGGCAGCAGGTGGCGCTGGACAAGGAGG
GTATGGACGAGGAGGGGCAGGCCAAGGCGGGGCTGCCGCTGCTGCAGGGGCAGGTCAAGGCGGGT
ACGGCGGACAAGGTGCAGGGCAAGGAGGTGCTGGAGCTGCGGCTGCAGCAGCAGCTGCGGGTGGG
23
GCAGGTAGAGGAGGCCAAGGAGGCTACGGAAGAGGCGGATACGGTCAGGGCGGTGCGGGCCAGGG
TGGTGCTGGTGCTGCCGCAGCCGCCGCTGCTGCCGGTGGGGCTGGACAGGGTGGACAAGGTGGGT
ACGGACAAGGTGGCTACGGGCAGGGAGGCGCGGGTCAAGGAGGGGCGGCGGCAGCCGCTGCAGCT
GCCGCTGGTGGCGCCGGACAAGGAGGCTATGGCCGTGGTGGGGCGGGGCAAGGCGGCGCGGCGGC
CGCGGCGGGTGCGGGGCAGGGCGGCTATGGAGGTCAAGGAGCCGGCCAAGGTGGGGCAGGCGCAG
CAGCCGCTGCGGCCGCTGCCGGAGGTGCTGGGCGAGGAGGGCAGGGTGGGTATGGTAGAGGCGGG
TACGGGCAGGGAGGCGCCGGACAGGGAGGGGCTGGTGCCGCTGCGGCTGCGGCAGCCGCCGGAGG
AGCGGGTCAGGGTGGCCAGGGCGGCTACGGTCAGGGCGGGTATGGGCAGGGCGGGGCTGGACAAG
GTGGCGCCGCGGCAGCTGCAGCAGCTGCGGTGGGAGGGGCGGGTCAGGGTGGATACGGCCGAGGT
GGGGCAGGTCAAGGCGGTGCTGCGGCCGCGGCGGCCGCTGCAGCCGCTGGTTCTGGTCAGGGAGG
TTATGGAGGCCAAGGAGCAGGCCAAGGCGGCGCCGGGGCCGCTGCCGCCGCCGCTGCTGCTGGCGG
AGCAGGGCAGGGTGGACAGGGCGGTTACGGTGGTGGAGGATACGGACAAGGAGGTGCCGGCCAAG
GCGGAGCTGGAGCTGCTGCGGCAGCAGCAGCCGCGGGAGGCGCTGGACAGGGTGGGCAGGGCGGC
TACGGTCAGGGTGGATACGGTCAGGGCGGGGCGGGGCAGGGTGGAGCCGCAGCGGCTGCAGCTGC
GGCCGCGGGCGGAGCAGGGCAGGGAGGCTATGGTCGGGGTGGAGCCGGACAAGGAGGCGCGGCGG
CCGCTACCGGTGCTGGACAAGGTGGCTATGGCGGTCAAGGAGCAGGACAAGGCGGAGCGGGCGCC
GCTGCTGCAGCGGCAGCTGCTGGTGGGGCCGGTCAGGGTGGCCAAGGAGGATACGGCAGGGGTGG
CTACGGCCAAGGCGGTGCAGGTCAAGGAGGTGCTGGGGCCGCTGCCGCCGCCGCAGCCGCCGGAGG
AGCTGGGCAAGGCGGACAAGGAGGGTATGGCCAAGGTGGGTACGGGCAGGGGGGTGCCGGACAAG
GCGGAGCGGCCGCAGCGGCTGCAGCCGCCGGTGGTGCTGGCCAGGGTGGCTATGGGAGAGGCGGG
GCCGGTCAAGGAGGCGCCGCGGCAGCTGCGGCGGCCGCGGCAGGGGCGGGGCAGGGCGGTTATGG
AGGACAAGGCGCGGGTCAGGGTGGAGCGGGTGCGGCGGCGGCAGCGGCAGCTGCGGGCGGCGCTG
GGCAAGGAGGTCAAGGCGGGTACGGTCGAGGTGGCTACGGACAGGGAGGGGCTGGACAAGGTGGG
GCGGGTGCCGCAGCGGCGGGAGGAGCCGGTCAGGGTGGGCAAGGCGGCTACGGTCAGGGTGGATA
TGGCCAAGGTGGTGCAGGGCAGGGTGGGGCTGCAGCTGCTGCCGCAGCCGCAGCCGCAGGCGGCG
CGGGCCAAGGAGGATACGGCGGATATGGTCAGCAAGGCGGAGCTGGAGCGGCTGCTGCAGCGGCT
TCAGGACCGGGCCAGATCTACTACGGTCCGCAGTCCGTTGCCGCCCCAGCTGCTGCTGCGGCGTCT
GCGTTGGCAGCACCGGCCACTTCTGCACGCATTTCGTCGCACGCGTCCGCATTGCTTAGTAATGGG
CCGACAAATCCTGCGTCCATCTCCAACGTAATCAGTAACGCAGTAAGCCAAATTTCTAGCTCTAA
CCCAGGCGCTTCTGCCTGTGATGTGCTAGTTCAGGCCCTACTTGAGCTCGTAACGGCGCTATTGAC
TATTATCGGCTCATCAAATATTGGTAGTGTGAACTATGACAGTTCGGGCCAGTACGCACAAGTCG
TTACTCAGTCAGTTCAAAACGCCTTCGCC
Amino Acid Sequence:
24
GGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGNGQGGAGQGGAAA
AAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQG
GQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGASQGGQGGYGQGDYGQGGAGQGGAAAAAAAAG
GAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGY
GQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGR
GGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGYGQGGAGQGG
AGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAVGGAGQGGYGRGGAGQGGAA
AAAAAAAAGSGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGGGGYGQGGAGQGGAGAA
AAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGAAAATG
AGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGA
GQGGQGGYGQGGYGQGGAGQGGAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGY
GGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAGGAGQGGQGGYGQ
GGYGQGGAGQGGAAAAAAAAAAGGAGQGGYGGYGQQGGAGAAAAAASGPGQIYYGPQSVAAPAA
AAASALAAPATSARISSHASALLSNGPTNPASISNVISNAVSQISSSNPGASACDVLVQALLELVTALL
TIIGSSNIGSVNYDSSGQYAQVVTQSVQNAFA
32B LHF1:
Synthetic DNA Sequence:
GGAGGCGCCGGGCAGGGAGGGCAAGGAGGTTACGGGCGTGGCGGCTACGGACAAGGTGGCGCGGG
CCAGGGTGGGGCTGGCGCAGCGGCGGCGGCTGCCGCCGCAGGCGGCGCTGGGCAAGGCGGGCAGG
GTGGATACGGGCAGGGCGGATATGGCCAGGGTGGGTACGGTCAAGGAGGTGCCGGACAGGGAGGA
GCAGCGGCAGCAGCTGCGGCGGCGGCAGCTGGAGGGGCAGGGCAGGGCGGTTATGGTCGTGGAGG
GGCCGGACAAGGAGGTGCAGCTGCAGCAGCGGGAGCTGGGCAGGGTGGATACGGAGGTCAGGGCG
CAGGACAGGGTGGTGCTGGCGCTGCGGCTGCGGCAGCCGCGGCGGGAGGCGCGGGGCAGGGAGGC
CAAGGCGGGTATGGAAGAGGAGGCTATGGGCAAGGCGGTGCGGGACAAGGTGGGGCCGGTGCTGC
GGCGGCTGCAGCGGCTGCTGGAGGTGCTGGCCAAGGAGGGCAAGGCGGATACGGTCAGGGTGGAA
ATGGGCAAGGCGGAGCGGGCCAGGGCGGCGCGGCCGCTGCCGCGGCCGCAGCAGGCGGCGCGGGCC
AAGGCGGGTACGGTCGTGGCGGGGCCGGACAAGGCGGAGCTGCCGCCGCCGCTGCGGCCGCGGCT
GGGGCTGGGCAGGGTGGATATGGAGGGCAAGGCGCGGGACAGGGTGGTGCAGGCGCGGCTGCCGC
TGCTGCTGCGGCGGGCGGAGCAGGGCAGGGCGGCCAGGGTGGATACGGGAGAGGTGGCTACGGTC
AAGGAGGGGCCGGCCAAGGTGGTGCGGGAGCAGCCGCTGCAGCTGCGGCCGCTGGTGGGGCTTCG
CAGGGTGGCCAGGGTGGTTACGGGCAAGGTGATTACGGGCAGGGAGGAGCGGGACAAGGTGGGGC
CGCCGCTGCGGCTGCGGCAGCAGGTGGCGCTGGACAAGGAGGGTATGGACGAGGAGGGGCAGGCC
25
AAGGCGGGGCTGCCGCTGCTGCAGGGGCAGGTCAAGGCGGGTACGGCGGACAAGGTGCAGGGCAA
GGAGGTGCTGGAGCTGCGGCTGCAGCAGCAGCTGCGGGTGGGGCAGGTAGAGGAGGCCAAGGAGG
CTACGGAAGAGGCGGATACGGTCAGGGCGGTGCGGGCCAGGGTGGTGCTGGTGCTGCCGCAGCCG
CCGCTGCTGCCGGTGGGGCTGGACAGGGTGGACAAGGTGGGTACGGACAAGGTGGCTACGGGCAG
GGAGGCGCGGGTCAAGGAGGGGCGGCGGCAGCCGCTGCAGCTGCCGCTGGTGGCGCCGGACAAGG
AGGCTATGGCCGTGGTGGGGCGGGGCAAGGCGGCGCGGCGGCCGCGGCGGGTGCGGGGCAGGGCG
GCTATGGAGGTCAAGGAGCCGGCCAAGGTGGGGCAGGCGCAGCAGCCGCTGCGGCCGCTGCCGGA
GGTGCTGGGCGAGGAGGGCAGGGTGGGTATGGTAGAGGCGGGTACGGGCAGGGAGGCGCCGGACA
GGGAGGGGCTGGTGCCGCTGCGGCTGCGGCAGCCGCCGGAGGAGCGGGTCAGGGTGGCCAGGGCG
GCTACGGTCAGGGCGGGTATGGGCAGGGCGGGGCTGGACAAGGTGGCGCCGCGGCAGCTGCAGCA
GCTGCGGTGGGAGGGGCGGGTCAGGGTGGATACGGCCGAGGTGGGGCAGGTCAAGGCGGTGCTGC
GGCCGCGGCGGCCGCTGCAGCCGCTGGTTCTGGTCAGGGAGGTTATGGAGGCCAAGGAGCAGGCC
AAGGCGGCGCCGGGGCCGCTGCCGCCGCCGCTGCTGCTGGCGGAGCAGGGCAGGGTGGACAGGGC
GGTTACGGTGGTGGAGGATACGGACAAGGAGGTGCCGGCCAAGGCGGAGCTGGAGCTGCTGCGGC
AGCAGCAGCCGCGGGAGGCGCTGGACAGGGTGGGCAGGGCGGCTACGGTCAGGGTGGATACGGTC
AGGGCGGGGCGGGGCAGGGTGGAGCCGCAGCGGCTGCAGCTGCGGCCGCGGGCGGAGCAGGGCAG
GGAGGCTATGGTCGGGGTGGAGCCGGACAAGGAGGCGCGGCGGCCGCTACCGGTGCTGGACAAGG
TGGCTATGGCGGTCAAGGAGCAGGACAAGGCGGAGCGGGCGCCGCTGCTGCAGCGGCAGCTGCTG
GTGGGGCCGGTCAGGGTGGCCAAGGAGGATACGGCAGGGGTGGCTACGGCCAAGGCGGTGCAGGT
CAAGGAGGTGCTGGGGCCGCTGCCGCCGCCGCAGCCGCCGGAGGAGCTGGGCAAGGCGGACAAGG
AGGGTATGGCCAAGGTGGGTACGGGCAGGGGGGTGCCGGACAAGGCGGAGCGGCCGCAGCGGCTG
CAGCCGCCGGTGGTGCTGGCCAGGGTGGCTATGGGAGAGGCGGGGCCGGTCAAGGAGGCGCCGCG
GCAGCTGCGGCGGCCGCGGCAGGGGCGGGGCAGGGCGGTTATGGAGGACAAGGCGCGGGTCAGGG
TGGAGCGGGTGCGGCGGCGGCAGCGGCAGCTGCGGGCGGCGCTGGGCAAGGAGGTCAAGGCGGGT
ACGGTCGAGGTGGCTACGGACAGGGAGGGGCTGGACAAGGTGGGGCGGGTGCCGCAGCGGCGGGA
GGAGCCGGTCAGGGTGGGCAAGGCGGCTACGGTCAGGGTGGATATGGCCAAGGTGGTGCAGGGCA
GGGTGGGGCTGCAGCTGCTGCCGCAGCCGCAGCCGCAGGCGGCGCGGGCCAAGGAGGATACGGCG
GATATGGTCAGCAAGGCGGAGCTGGAGCGGCTGCTGCAGCGGCTTCAGGACCGGGCCAGATCTAC
TACGGTCCGCAGTCCGTTGCCGCCCCAGCTGCTGCTGCGGCGTCTGCGTTGGCAGCACCGGCCACT
TCTGCACGCATTTCGTCGCACGCGTCCGCATTGCTTAGTAATGGGCCGACAAATCCTGCGTCCATC
TCCAACGTAATCAGTAACGCAGTAAGCCAAATTTCTAGCTCTAACCCAGGCGCTTCTGCCTGTGA
TGTGCTAGTTCAGGCCCTACTTGAGCTCGTAACGGCGCTATTGACTATTATCGGCTCATCAAATA
26
TTGGTAGTGTGAACTATGACAGTTCGGGCCAGTACGCACAAGTCGTTACTCAGTCAGTTCAAAAC
GCCTTCGCC
Amino Acid Sequence:
GGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGYGQGGAGQ
GGAAAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGA
GQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGNGQGGAGQGGAAAAAA
AAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQG
GYGRGGYGQGGAGQGGAGAAAAAAAAGGASQGGQGGYGQGDYGQGGAGQGGAAAAAAAAGGAG
QGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGYGQG
GAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGA
GQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGYGQGGAGQGGAGA
AAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAVGGAGQGGYGRGGAGQGGAAAAA
AAAAAGSGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGGGGYGQGGAGQGGAGAAAAA
AAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGAAAATGAGQ
GGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQG
GQGGYGQGGYGQGGAGQGGAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQ
GAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAGGAGQGGQGGYGQGGY
GQGGAGQGGAAAAAAAAAAGGAGQGGYGGYGQQGGAGAAAAAASGPGQIYYGPQSVAAPAAAAA
SALAAPATSARISSHASALLSNGPTNPASISNVISNAVSQISSSNPGASACDVLVQALLELVTALLTIIG
SSNIGSVNYDSSGQYAQVVTQSVQNAFA
36B LHF1:
Synthetic DNA Sequence:
GGAGGAGCAGGGCAGGGCGGACAAGGTGGCTATGGCAGGGGCGGTTATGGACAGGGCGGCGCAGG
ACAAGGAGGAGCGGGCGCGGCGGCTGCAGCAGCCGCTGCCGGAGGCGCCGGTCAGGGTGGCCAGG
GCGGGTACGGCCAAGGCGGATACGGTCAAGGCGGGGCAGGACAGGGTGGCGCTGCGGCTGCGGCA
GCAGCAGCGGCAGGAGGGGCGGGCCAAGGCGGATATGGGCGGGGCGGAGCGGGACAAGGCGGAGC
GGCAGCTGCAGCGGCCGCAGCGGCGGGTTCAGGCCAGGGAGGTTACGGCGGCCAAGGTGCCGGAC
AGGGAGGAGCAGGCGCCGCGGCCGCCGCAGCAGCTGCGGGAGGCGCCGGGCAGGGAGGGCAAGGA
GGTTACGGGCGTGGCGGCTACGGACAAGGTGGCGCGGGCCAGGGTGGGGCTGGCGCAGCGGCGGC
GGCTGCCGCCGCAGGCGGCGCTGGGCAAGGCGGGCAGGGTGGATACGGGCAGGGCGGATATGGCC
AGGGTGGGTACGGTCAAGGAGGTGCCGGACAGGGAGGAGCAGCGGCAGCAGCTGCGGCGGCGGCA
27
GCTGGAGGGGCAGGGCAGGGCGGTTATGGTCGTGGAGGGGCCGGACAAGGAGGTGCAGCTGCAGC
AGCGGGAGCTGGGCAGGGTGGATACGGAGGTCAGGGCGCAGGACAGGGTGGTGCTGGCGCTGCGG
CTGCGGCAGCCGCGGCGGGAGGCGCGGGGCAGGGAGGCCAAGGCGGGTATGGAAGAGGAGGCTAT
GGGCAAGGCGGTGCGGGACAAGGTGGGGCCGGTGCTGCGGCGGCTGCAGCGGCTGCTGGAGGTGC
TGGCCAAGGAGGGCAAGGCGGATACGGTCAGGGTGGAAATGGGCAAGGCGGAGCGGGCCAGGGCG
GCGCGGCCGCTGCCGCGGCCGCAGCAGGCGGCGCGGGCCAAGGCGGGTACGGTCGTGGCGGGGCCG
GACAAGGCGGAGCTGCCGCCGCCGCTGCGGCCGCGGCTGGGGCTGGGCAGGGTGGATATGGAGGG
CAAGGCGCGGGACAGGGTGGTGCAGGCGCGGCTGCCGCTGCTGCTGCGGCGGGCGGAGCAGGGCA
GGGCGGCCAGGGTGGATACGGGAGAGGTGGCTACGGTCAAGGAGGGGCCGGCCAAGGTGGTGCGG
GAGCAGCCGCTGCAGCTGCGGCCGCTGGTGGGGCTTCGCAGGGTGGCCAGGGTGGTTACGGGCAA
GGTGATTACGGGCAGGGAGGAGCGGGACAAGGTGGGGCCGCCGCTGCGGCTGCGGCAGCAGGTGG
CGCTGGACAAGGAGGGTATGGACGAGGAGGGGCAGGCCAAGGCGGGGCTGCCGCTGCTGCAGGGG
CAGGTCAAGGCGGGTACGGCGGACAAGGTGCAGGGCAAGGAGGTGCTGGAGCTGCGGCTGCAGCA
GCAGCTGCGGGTGGGGCAGGTAGAGGAGGCCAAGGAGGCTACGGAAGAGGCGGATACGGTCAGGG
CGGTGCGGGCCAGGGTGGTGCTGGTGCTGCCGCAGCCGCCGCTGCTGCCGGTGGGGCTGGACAGGG
TGGACAAGGTGGGTACGGACAAGGTGGCTACGGGCAGGGAGGCGCGGGTCAAGGAGGGGCGGCGG
CAGCCGCTGCAGCTGCCGCTGGTGGCGCCGGACAAGGAGGCTATGGCCGTGGTGGGGCGGGGCAA
GGCGGCGCGGCGGCCGCGGCGGGTGCGGGGCAGGGCGGCTATGGAGGTCAAGGAGCCGGCCAAGG
TGGGGCAGGCGCAGCAGCCGCTGCGGCCGCTGCCGGAGGTGCTGGGCGAGGAGGGCAGGGTGGGT
ATGGTAGAGGCGGGTACGGGCAGGGAGGCGCCGGACAGGGAGGGGCTGGTGCCGCTGCGGCTGCG
GCAGCCGCCGGAGGAGCGGGTCAGGGTGGCCAGGGCGGCTACGGTCAGGGCGGGTATGGGCAGGG
CGGGGCTGGACAAGGTGGCGCCGCGGCAGCTGCAGCAGCTGCGGTGGGAGGGGCGGGTCAGGGTG
GATACGGCCGAGGTGGGGCAGGTCAAGGCGGTGCTGCGGCCGCGGCGGCCGCTGCAGCCGCTGGT
TCTGGTCAGGGAGGTTATGGAGGCCAAGGAGCAGGCCAAGGCGGCGCCGGGGCCGCTGCCGCCGC
CGCTGCTGCTGGCGGAGCAGGGCAGGGTGGACAGGGCGGTTACGGTGGTGGAGGATACGGACAAG
GAGGTGCCGGCCAAGGCGGAGCTGGAGCTGCTGCGGCAGCAGCAGCCGCGGGAGGCGCTGGACAG
GGTGGGCAGGGCGGCTACGGTCAGGGTGGATACGGTCAGGGCGGGGCGGGGCAGGGTGGAGCCGC
AGCGGCTGCAGCTGCGGCCGCGGGCGGAGCAGGGCAGGGAGGCTATGGTCGGGGTGGAGCCGGAC
AAGGAGGCGCGGCGGCCGCTACCGGTGCTGGACAAGGTGGCTATGGCGGTCAAGGAGCAGGACAA
GGCGGAGCGGGCGCCGCTGCTGCAGCGGCAGCTGCTGGTGGGGCCGGTCAGGGTGGCCAAGGAGG
ATACGGCAGGGGTGGCTACGGCCAAGGCGGTGCAGGTCAAGGAGGTGCTGGGGCCGCTGCCGCCG
CCGCAGCCGCCGGAGGAGCTGGGCAAGGCGGACAAGGAGGGTATGGCCAAGGTGGGTACGGGCAG
GGGGGTGCCGGACAAGGCGGAGCGGCCGCAGCGGCTGCAGCCGCCGGTGGTGCTGGCCAGGGTGG
28
CTATGGGAGAGGCGGGGCCGGTCAAGGAGGCGCCGCGGCAGCTGCGGCGGCCGCGGCAGGGGCGG
GGCAGGGCGGTTATGGAGGACAAGGCGCGGGTCAGGGTGGAGCGGGTGCGGCGGCGGCAGCGGCA
GCTGCGGGCGGCGCTGGGCAAGGAGGTCAAGGCGGGTACGGTCGAGGTGGCTACGGACAGGGAGG
GGCTGGACAAGGTGGGGCGGGTGCCGCAGCGGCGGGAGGAGCCGGTCAGGGTGGGCAAGGCGGCT
ACGGTCAGGGTGGATATGGCCAAGGTGGTGCAGGGCAGGGTGGGGCTGCAGCTGCTGCCGCAGCC
GCAGCCGCAGGCGGCGCGGGCCAAGGAGGATACGGCGGATATGGTCAGCAAGGCGGAGCTGGAGC
GGCTGCTGCAGCGGCTTCAGGACCGGGCCAGATCTACTACGGTCCGCAGTCCGTTGCCGCCCCAGC
TGCTGCTGCGGCGTCTGCGTTGGCAGCACCGGCCACTTCTGCACGCATTTCGTCGCACGCGTCCGC
ATTGCTTAGTAATGGGCCGACAAATCCTGCGTCCATCTCCAACGTAATCAGTAACGCAGTAAGCC
AAATTTCTAGCTCTAACCCAGGCGCTTCTGCCTGTGATGTGCTAGTTCAGGCCCTACTTGAGCTCG
TAACGGCGCTATTGACTATTATCGGCTCATCAAATATTGGTAGTGTGAACTATGACAGTTCGGGC
CAGTACGCACAAGTCGTTACTCAGTCAGTTCAAAACGCCTTCGCC
Amino Acid Sequence:
GGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAA
AAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGSGQGGYGGQGAGQGGAGAAAAAAAAGGAGQ
GGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGYGQGGAGQGGAAA
AAAAAAAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQ
GGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGNGQGGAGQGGAAAAAAAAGGA
GQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRG
GYGQGGAGQGGAGAAAAAAAAGGASQGGQGGYGQGDYGQGGAGQGGAAAAAAAAGGAGQGGYG
RGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGYGQGGAGQG
GAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGA
AAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGYGQGGAGQGGAGAAAAAA
AAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAVGGAGQGGYGRGGAGQGGAAAAAAAAAA
GSGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGGGGYGQGGAGQGGAGAAAAAAAAGG
AGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGAAAATGAGQGGYGG
QGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGY
GQGGYGQGGAGQGGAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQG
GAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAGGAGQGGQGGYGQGGYGQGGA
GQGGAAAAAAAAAAGGAGQGGYGGYGQQGGAGAAAAAASGPGQIYYGPQSVAAPAAAAASALAAP
ATSARISSHASALLSNGPTNPASISNVISNAVSQISSSNPGASACDVLVQALLELVTALLTIIGSSNIGS
VNYDSSGQYAQVVTQSVQNAFA
29
40B LHF1:
Synthetic DNA Sequence:
GGCGGCGCCGGGCAGGGTGGACAGGGTGGCTACGGAAGAGGAGGCTACGGTCAAGGAGGCGCAGG
CCAAGGCGGGGCAGGAGCTGCTGCTGCCGCGGCGGCCGCGGGTGGTGCAGGGCAAGGAGGACAGG
GAGGGTACGGCCAAGGAGGATATGGGCAAGGCGGTTACGGTCAGGGTGGAGCTGGACAAGGCGGA
GCGGCTGCCGCTGCTGCAGCTGCTGGCGGCGCGGGACAAGGTGGTTACGGGAGGGGAGGTGCCGG
ACAGGGTGGCGCTGCAGCCGCCGCTGGTGCTGGCCAAGGAGGTTACGGTGGCCAAGGTGCTGGGC
AGGGTGGGGCTGGTGCGGCTGCTGCGGCGGCAGCTGCGGGAGGAGCAGGGCAGGGCGGACAAGGT
GGCTATGGCAGGGGCGGTTATGGACAGGGCGGCGCAGGACAAGGAGGAGCGGGCGCGGCGGCTGC
AGCAGCCGCTGCCGGAGGCGCCGGTCAGGGTGGCCAGGGCGGGTACGGCCAAGGCGGATACGGTC
AAGGCGGGGCAGGACAGGGTGGCGCTGCGGCTGCGGCAGCAGCAGCGGCAGGAGGGGCGGGCCAA
GGCGGATATGGGCGGGGCGGAGCGGGACAAGGCGGAGCGGCAGCTGCAGCGGCCGCAGCGGCGGG
TTCAGGCCAGGGAGGTTACGGCGGCCAAGGTGCCGGACAGGGAGGAGCAGGCGCCGCGGCCGCCG
CAGCAGCTGCGGGAGGCGCCGGGCAGGGAGGGCAAGGAGGTTACGGGCGTGGCGGCTACGGACAA
GGTGGCGCGGGCCAGGGTGGGGCTGGCGCAGCGGCGGCGGCTGCCGCCGCAGGCGGCGCTGGGCA
AGGCGGGCAGGGTGGATACGGGCAGGGCGGATATGGCCAGGGTGGGTACGGTCAAGGAGGTGCCG
GACAGGGAGGAGCAGCGGCAGCAGCTGCGGCGGCGGCAGCTGGAGGGGCAGGGCAGGGCGGTTAT
GGTCGTGGAGGGGCCGGACAAGGAGGTGCAGCTGCAGCAGCGGGAGCTGGGCAGGGTGGATACGG
AGGTCAGGGCGCAGGACAGGGTGGTGCTGGCGCTGCGGCTGCGGCAGCCGCGGCGGGAGGCGCGG
GGCAGGGAGGCCAAGGCGGGTATGGAAGAGGAGGCTATGGGCAAGGCGGTGCGGGACAAGGTGG
GGCCGGTGCTGCGGCGGCTGCAGCGGCTGCTGGAGGTGCTGGCCAAGGAGGGCAAGGCGGATACG
GTCAGGGTGGAAATGGGCAAGGCGGAGCGGGCCAGGGCGGCGCGGCCGCTGCCGCGGCCGCAGCA
GGCGGCGCGGGCCAAGGCGGGTACGGTCGTGGCGGGGCCGGACAAGGCGGAGCTGCCGCCGCCGC
TGCGGCCGCGGCTGGGGCTGGGCAGGGTGGATATGGAGGGCAAGGCGCGGGACAGGGTGGTGCAG
GCGCGGCTGCCGCTGCTGCTGCGGCGGGCGGAGCAGGGCAGGGCGGCCAGGGTGGATACGGGAGA
GGTGGCTACGGTCAAGGAGGGGCCGGCCAAGGTGGTGCGGGAGCAGCCGCTGCAGCTGCGGCCGC
TGGTGGGGCTTCGCAGGGTGGCCAGGGTGGTTACGGGCAAGGTGATTACGGGCAGGGAGGAGCGG
GACAAGGTGGGGCCGCCGCTGCGGCTGCGGCAGCAGGTGGCGCTGGACAAGGAGGGTATGGACGA
GGAGGGGCAGGCCAAGGCGGGGCTGCCGCTGCTGCAGGGGCAGGTCAAGGCGGGTACGGCGGACA
AGGTGCAGGGCAAGGAGGTGCTGGAGCTGCGGCTGCAGCAGCAGCTGCGGGTGGGGCAGGTAGAG
GAGGCCAAGGAGGCTACGGAAGAGGCGGATACGGTCAGGGCGGTGCGGGCCAGGGTGGTGCTGGT
GCTGCCGCAGCCGCCGCTGCTGCCGGTGGGGCTGGACAGGGTGGACAAGGTGGGTACGGACAAGG
30
TGGCTACGGGCAGGGAGGCGCGGGTCAAGGAGGGGCGGCGGCAGCCGCTGCAGCTGCCGCTGGTG
GCGCCGGACAAGGAGGCTATGGCCGTGGTGGGGCGGGGCAAGGCGGCGCGGCGGCCGCGGCGGGT
GCGGGGCAGGGCGGCTATGGAGGTCAAGGAGCCGGCCAAGGTGGGGCAGGCGCAGCAGCCGCTGC
GGCCGCTGCCGGAGGTGCTGGGCGAGGAGGGCAGGGTGGGTATGGTAGAGGCGGGTACGGGCAGG
GAGGCGCCGGACAGGGAGGGGCTGGTGCCGCTGCGGCTGCGGCAGCCGCCGGAGGAGCGGGTCAG
GGTGGCCAGGGCGGCTACGGTCAGGGCGGGTATGGGCAGGGCGGGGCTGGACAAGGTGGCGCCGC
GGCAGCTGCAGCAGCTGCGGTGGGAGGGGCGGGTCAGGGTGGATACGGCCGAGGTGGGGCAGGTC
AAGGCGGTGCTGCGGCCGCGGCGGCCGCTGCAGCCGCTGGTTCTGGTCAGGGAGGTTATGGAGGC
CAAGGAGCAGGCCAAGGCGGCGCCGGGGCCGCTGCCGCCGCCGCTGCTGCTGGCGGAGCAGGGCAG
GGTGGACAGGGCGGTTACGGTGGTGGAGGATACGGACAAGGAGGTGCCGGCCAAGGCGGAGCTGG
AGCTGCTGCGGCAGCAGCAGCCGCGGGAGGCGCTGGACAGGGTGGGCAGGGCGGCTACGGTCAGG
GTGGATACGGTCAGGGCGGGGCGGGGCAGGGTGGAGCCGCAGCGGCTGCAGCTGCGGCCGCGGGC
GGAGCAGGGCAGGGAGGCTATGGTCGGGGTGGAGCCGGACAAGGAGGCGCGGCGGCCGCTACCGG
TGCTGGACAAGGTGGCTATGGCGGTCAAGGAGCAGGACAAGGCGGAGCGGGCGCCGCTGCTGCAG
CGGCAGCTGCTGGTGGGGCCGGTCAGGGTGGCCAAGGAGGATACGGCAGGGGTGGCTACGGCCAA
GGCGGTGCAGGTCAAGGAGGTGCTGGGGCCGCTGCCGCCGCCGCAGCCGCCGGAGGAGCTGGGCA
AGGCGGACAAGGAGGGTATGGCCAAGGTGGGTACGGGCAGGGGGGTGCCGGACAAGGCGGAGCGG
CCGCAGCGGCTGCAGCCGCCGGTGGTGCTGGCCAGGGTGGCTATGGGAGAGGCGGGGCCGGTCAA
GGAGGCGCCGCGGCAGCTGCGGCGGCCGCGGCAGGGGCGGGGCAGGGCGGTTATGGAGGACAAGG
CGCGGGTCAGGGTGGAGCGGGTGCGGCGGCGGCAGCGGCAGCTGCGGGCGGCGCTGGGCAAGGAG
GTCAAGGCGGGTACGGTCGAGGTGGCTACGGACAGGGAGGGGCTGGACAAGGTGGGGCGGGTGCC
GCAGCGGCGGGAGGAGCCGGTCAGGGTGGGCAAGGCGGCTACGGTCAGGGTGGATATGGCCAAGG
TGGTGCAGGGCAGGGTGGGGCTGCAGCTGCTGCCGCAGCCGCAGCCGCAGGCGGCGCGGGCCAAG
GAGGATACGGCGGATATGGTCAGCAAGGCGGAGCTGGAGCGGCTGCTGCAGCGGCTTCAGGACCG
GGCCAGATCTACTACGGTCCGCAGTCCGTTGCCGCCCCAGCTGCTGCTGCGGCGTCTGCGTTGGCA
GCACCGGCCACTTCTGCACGCATTTCGTCGCACGCGTCCGCATTGCTTAGTAATGGGCCGACAAAT
CCTGCGTCCATCTCCAACGTAATCAGTAACGCAGTAAGCCAAATTTCTAGCTCTAACCCAGGCGCT
TCTGCCTGTGATGTGCTAGTTCAGGCCCTACTTGAGCTCGTAACGGCGCTATTGACTATTATCGGC
TCATCAAATATTGGTAGTGTGAACTATGACAGTTCGGGCCAGTACGCACAAGTCGTTACTCAGTC
AGTTCAAAACGCCTTCGCC
Amino Acid Sequence:
31
GGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGYGQGGAGQ
GGAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQ
GGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAA
AGGAGQGGYGRGGAGQGGAAAAAAAAAGSGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGG
YGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGYGQGGYGQGGAGQGGAAAAAAAA
AAGGAGQGGYGRGGAGQGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGR
GGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGNGQGGAGQGGAAAAAAAAGGAGQGGY
GRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGRGGYGQG
GAGQGGAGAAAAAAAAGGASQGGQGGYGQGDYGQGGAGQGGAAAAAAAAGGAGQGGYGRGGAG
QGGAAAAAGAGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGYGQGGAGQGGAGAA
AAAAAAGGAGQGGQGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAG
AGQGGYGGQGAGQGGAGAAAAAAAAGGAGRGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGA
GQGGQGGYGQGGYGQGGAGQGGAAAAAAAAVGGAGQGGYGRGGAGQGGAAAAAAAAAAGSGQG
GYGGQGAGQGGAGAAAAAAAAGGAGQGGQGGYGGGGYGQGGAGQGGAGAAAAAAAAGGAGQGG
QGGYGQGGYGQGGAGQGGAAAAAAAAAGGAGQGGYGRGGAGQGGAAAATGAGQGGYGGQGAGQ
GGAGAAAAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAAAAAGGAGQGGQGGYGQGGY
GQGGAGQGGAAAAAAAAGGAGQGGYGRGGAGQGGAAAAAAAAAGAGQGGYGGQGAGQGGAGAA
AAAAAAGGAGQGGQGGYGRGGYGQGGAGQGGAGAAAAGGAGQGGQGGYGQGGYGQGGAGQGGA
AAAAAAAAAGGAGQGGYGGYGQQGGAGAAAAAASGPGQIYYGPQSVAAPAAAAASALAAPATSAR
ISSHASALLSNGPTNPASISNVISNAVSQISSSNPGASACDVLVQALLELVTALLTIIGSSNIGSVNYDS
SGQYAQVVTQSVQNAFA
Nephila clavipes Flagelliform Silk (NCF1), Genbank Ascension AF027973
Synthetic DNA Sequence:
GGCCCTGGTGGGGTTGGCCCAGGAGGCAGCGGACCAGGAGGGTATGGGCCAGGTGGTGCCGGTCC
AGGCGGCTATGGTCCAGGAGGTTCGGGACCTGGAGGGTACGGTCCTGGAGGTAGCGGGCCGGGAG
GATATGGACCTGGAGGCTCTGGACCCGGTGGGTACGGACCGGGAGGTTCAGGCCCCGGTGGCTAT
GGGCCGGGTGGGAGCGGTCCCGGAGGTTATGGTCCGGGTGGATACGGTCCGGGAGGGTCAGGCCC
AGGTGGATATGGGCCCGGCGGCACGGGCCCCGGAGGGTCGGGACCAGGTGGGTATGGCCCCGGCG
GATCGGGTCCTGGCGGCTCGGGACCCGGCGGGTACGGCCCGGGCGGAAGTGGCCCTGGAGGGTTC
GGCCCTGGTGGCTCGGGCCCTGGCGGATACGGTCCAGGCGGCTCAGGACCAGGAGGAGCCGGTCCT
32
GGTGGAGTGGGACCTGGAGGCTTTGGACCGGGTGGCGCAGGACCTGGTGGTGCTGGCCCCGGTGG
CGCTGGGCCTGGCGGGGCGGGTCCTGGTGGGGCAGGCCCCGGCGGGGCCGGACCGGGCGGAGCAG
GTCCGGGTGGGGCTGGTCCAGGTGGTGCGGGACCAGGCGGAGCGGGACCCGGAGGGGCCGGCGGC
GCAGGCGGTGCGGGAGGCGCTGGAGGATCGGGTGGTGCAGGAGGGTCTGGAGGGACTACTATTAT
TGAAGATCTGGACATTACTATTGATGGGGCGGATGGCCCAATCACTATCTCAGAAGAGCTTACCA
TCTCCGGCGCGGGTGGCTCGGGGCCAGGAGGTGCAGGTCCGGGAGGGGTCGGTCCAGGAGGTAGT
GGACCTGGCGGGGTAGGGCCGGGAGGTAGCGGCCCAGGAGGAGTGGGTCCTGGAGGTTCTGGTCC
CGGCGGAGTCGGTCCCGGAGGGGCCGGTGGGCCCTATGGGCCCGGAGGTTCGGGGCCCGGCGGGGC
GGGAGGTGCCGGCGGTCCGGGTGGAGCTTATGGGCCAGGCGGTAGCTACGGGCCAGGTGGTTCGG
GCGGCCCTGGAGGCGCCGGTGGTCCATACGGGCCTGGCGGCGAAGGCCCCGGTGGCGCAGGCGGTC
CATATGGCCCTGGTGGCGCTGGTGGTCCCTACGGACCCGGTGGTGCAGGTGGGCCATACGGCCCGG
GTGGAGAAGGAGGTCCTTACGGTCCAGGAGGTTCCTATGGTCCAGGTGGAGCCGGCGGGCCTTAT
GGCCCCGGAGGCCCCTATGGCCCGGGCGGGGAAGGTCCTGGCGGGGCTGGTGGACCATATGGACCC
GGCGGTGTAGGCCCTGGAGGTTCCGGACCAGGTGGTTATGGGCCTGGAGGGTCGGGTCCTGGAGG
ATACGGACCGGGCGGTGCGGGCCCAGGTGGGTATGGTCCGGGTGGGAGCGGTCCCGGCGGGTACG
GACCTGGTGGCTCGGGGCCTGGCGGTTACGGCCCTGGCGGAAGCGGTCCGGGAGGGTACGGGCCC
GGCGGATCAGGTCCAGGCGGCTATGGCAGTGGCGGAGCGGGTCCGGGCGGATATGGGCCAGGAGG
ATCGGGACCTGGAGGGTACGGTCCTGGTGGCTCAGGCCCTGGCGGCTATGGGCCAGGAGGCACGG
GGCCTGGAGGGACCGGCCCGGGTGGTAGTGGACCTGGAGGGTATGGTCCAGGAGGAAGTGGCCCA
GGAGGCTCGGGCCCCGGCGGCTCGGGACCTGGCGGATACGGACCCTCAGGCTCTGGTCCTGGAGGC
TACGGTCCGAGTGGATCGGGCCCTGGAGGTTACGGGCCTGGTGGATCAGGACCTGGTGGTTACGG
ACCGGGAGGAAGCGGTGCGGGAGGGACGGGGCCGGGTGGAGCAGGAGGCGCTGGTGGTGCCGGCG
GCAGCGGAGGGGCAGGTGGTTCTGGTGGCGCAGGCGGTTCGGGTGGCGCCGGAGGATCAGGCGGT
GTTGGTGGGTCCGGTGGAACGACAATCACGGAGGACTTGGATATCACGATCGATGGTGCCGACGG
TCCCATTACCATCAGTGAGGAACTGACAATCAGTGGTGCTGGAGGGTCTGGGCCGGGCGGCGCAG
GACCCGGCGGCGTCGGACCTGGAGGGAGTGGACCGGGCGGAGTAGGCCCGGGCGTCAGTGGACCT
GGCGGTGTAGGGCCTGGTGGGTCGGGGCCGGGAGGGGTCGGTTCTGGCGGAAGCGGGCCAGGAGG
CGTGGGACCGGGAGGCTATGGGCCAGGTGGCTCCGGTAGCGGTGGCGTTGGACCCGGAGGTTACG
GTCCGGGCGGGTCGGGTGGGTTTTACGGGCCCGGCGGTTCTGAAGGGCCATACGGACCCAGCGGA
ACGTACGGCTCTGGAGGCGGCTATGGTCCCGGTGGGGCAGGAGGTCCCTACGGTCCCGGGAGCCCT
GGCGGGGCATACGGTCCTGGTTCTCCCGGCGGTGCTTATTATCCCAGTTCTCGGGTGCCGGACATG
GTGAATGGTATCATGTCCGCTATGCAAGGTAGTGGTTTTAACTACCAGATGTTCGGCAACATGCT
ATCGCAGTATAGCTCCGGGAGCGGCACATGCAATCCTAATAATGTCAATGTGCTGATGGACGCTC
33
TACTCGCCGCCTTACATTGCCTATCAAATCATGGCTCATCCTCATTTGCGCCTAGCCCCACTCCCG
CTGCGATGTCGGCCTACAGCAATAGCGTAGGACGGATGTTCGCCTAT
Amino Acid Sequence:
GPGGVGPGGSGPGGYGPGGAGPGGYGPGGSGPGGYGPGGSGPGGYGPGGSGPGGYGPGGSGPGGYG
PGGSGPGGYGPGGYGPGGSGPGGYGPGGTGPGGSGPGGYGPGGSGPGGSGPGGYGPGGSGPGGFGP
GGSGPGGYGPGGSGPGGAGPGGVGPGGFGPGGAGPGGAGPGGAGPGGAGPGGAGPGGAGPGGAGP
GGAGPGGAGPGGAGPGGAGGAGGAGGAGGSGGAGGSGGTTIIEDLDITIDGADGPITISEELTISGAG
GSGPGGAGPGGVGPGGSGPGGVGPGGSGPGGVGPGGSGPGGVGPGGAGGPYGPGGSGPGGAGGAGG
PGGAYGPGGSYGPGGSGGPGGAGGPYGPGGEGPGGAGGPYGPGGAGGPYGPGGAGGPYGPGGEGGP
YGPGGSYGPGGAGGPYGPGGPYGPGGEGPGGAGGPYGPGGVGPGGSGPGGYGPGGSGPGGYGPGGA
GPGGYGPGGSGPGGYGPGGSGPGGYGPGGSGPGGYGPGGSGPGGYGSGGAGPGGYGPGGSGPGGYG
PGGSGPGGYGPGGTGPGGTGPGGSGPGGYGPGGSGPGGSGPGGSGPGGYGPSGSGPGGYGPSGSGPG
GYGPGGSGPGGYGPGGSGAGGTGPGGAGGAGGAGGSGGAGGSGGAGGSGGAGGSGGVGGSGGTTIT
EDLDITIDGADGPITISEELTISGAGGSGPGGAGPGGVGPGGSGPGGVGPGVSGPGGVGPGGSGPGGV
GSGGSGPGGVGPGGYGPGGSGSGGVGPGGYGPGGSGGFYGPGGSEGPYGPSGTYGSGGGYGPGGAGG
PYGPGSPGGAYGPGSPGGAYYPSSRVPDMVNGIMSAMQGSGFNYQMFGNMLSQYSSGSGTCNPNN
VNVLMDALLAALHCLSNHGSSSFAPSPTPAAMSAYSNSVGRMFAY
1B NCF1:
Synthetic DNA Sequence:
TCCGGTGGAACGACAATCACGGAGGACTTGGATATCACGATCGATGGTGCCGACGGTCCCATTAC
CATCAGTGAGGAACTGACAATCAGTGGTGCTGGAGGGTCTGGGCCGGGCGGCGCAGGACCCGGCG
GCGTCGGACCTGGAGGGAGTGGACCGGGCGGAGTAGGCCCGGGCGTCAGTGGACCTGGCGGTGTA
GGGCCTGGTGGGTCGGGGCCGGGAGGGGTCGGTTCTGGCGGAAGCGGGCCAGGAGGCGTGGGACC
GGGAGGCTATGGGCCAGGTGGCTCCGGTAGCGGTGGCGTTGGACCCGGAGGTTACGGTCCGGGCG
GGTCGGGTGGGTTTTACGGGCCCGGCGGTTCTGAAGGGCCATACGGACCCAGCGGAACGTACGGC
TCTGGAGGCGGCTATGGTCCCGGTGGGGCAGGAGGTCCCTACGGTCCCGGGAGCCCTGGCGGGGC
ATACGGTCCTGGTTCTCCCGGCGGTGCTTATTATCCCAGTTCTCGGGTGCCGGACATGGTGAATGG
TATCATGTCCGCTATGCAAGGTAGTGGTTTTAACTACCAGATGTTCGGCAACATGCTATCGCAGT
ATAGCTCCGGGAGCGGCACATGCAATCCTAATAATGTCAATGTGCTGATGGACGCTCTACTCGCC
34
GCCTTACATTGCCTATCAAATCATGGCTCATCCTCATTTGCGCCTAGCCCCACTCCCGCTGCGATG
TCGGCCTACAGCAATAGCGTAGGACGGATGTTCGCCTAT
Amino Acid Sequence:
SGGTTITEDLDITIDGADGPITISEELTISGAGGSGPGGAGPGGVGPGGSGPGGVGPGVSGPGGVGPGG
SGPGGVGSGGSGPGGVGPGGYGPGGSGSGGVGPGGYGPGGSGGFYGPGGSEGPYGPSGTYGSGGGYG
PGGAGGPYGPGSPGGAYGPGSPGGAYYPSSRVPDMVNGIMSAMQGSGFNYQMFGNMLSQYSSGSG
TCNPNNVNVLMDALLAALHCLSNHGSSSFAPSPTPAAMSAYSNSVGRMFAY
2B NCF1:
Synthetic DNA Sequence:
ACTACTATTATTGAAGATCTGGACATTACTATTGATGGGGCGGATGGCCCAATCACTATCTCAGA
AGAGCTTACCATCTCCGGCGCGGGTGGCTCGGGGCCAGGAGGTGCAGGTCCGGGAGGGGTCGGTC
CAGGAGGTAGTGGACCTGGCGGGGTAGGGCCGGGAGGTAGCGGCCCAGGAGGAGTGGGTCCTGGA
GGTTCTGGTCCCGGCGGAGTCGGTCCCGGAGGGGCCGGTGGGCCCTATGGGCCCGGAGGTTCGGG
GCCCGGCGGGGCGGGAGGTGCCGGCGGTCCGGGTGGAGCTTATGGGCCAGGCGGTAGCTACGGGC
CAGGTGGTTCGGGCGGCCCTGGAGGCGCCGGTGGTCCATACGGGCCTGGCGGCGAAGGCCCCGGTG
GCGCAGGCGGTCCATATGGCCCTGGTGGCGCTGGTGGTCCCTACGGACCCGGTGGTGCAGGTGGGC
CATACGGCCCGGGTGGAGAAGGAGGTCCTTACGGTCCAGGAGGTTCCTATGGTCCAGGTGGAGCC
GGCGGGCCTTATGGCCCCGGAGGCCCCTATGGCCCGGGCGGGGAAGGTCCTGGCGGGGCTGGTGG
ACCATATGGACCCGGCGGTGTAGGCCCTGGAGGTTCCGGACCAGGTGGTTATGGGCCTGGAGGGT
CGGGTCCTGGAGGATACGGACCGGGCGGTGCGGGCCCAGGTGGGTATGGTCCGGGTGGGAGCGGT
CCCGGCGGGTACGGACCTGGTGGCTCGGGGCCTGGCGGTTACGGCCCTGGCGGAAGCGGTCCGGGA
GGGTACGGGCCCGGCGGATCAGGTCCAGGCGGCTATGGCAGTGGCGGAGCGGGTCCGGGCGGATA
TGGGCCAGGAGGATCGGGACCTGGAGGGTACGGTCCTGGTGGCTCAGGCCCTGGCGGCTATGGGC
CAGGAGGCACGGGGCCTGGAGGGACCGGCCCGGGTGGTAGTGGACCTGGAGGGTATGGTCCAGGA
GGAAGTGGCCCAGGAGGCTCGGGCCCCGGCGGCTCGGGACCTGGCGGATACGGACCCTCAGGCTCT
GGTCCTGGAGGCTACGGTCCGAGTGGATCGGGCCCTGGAGGTTACGGGCCTGGTGGATCAGGACC
TGGTGGTTACGGACCGGGAGGAAGCGGTGCGGGAGGGACGGGGCCGGGTGGAGCAGGAGGCGCTG
GTGGTGCCGGCGGCAGCGGAGGGGCAGGTGGTTCTGGTGGCGCAGGCGGTTCGGGTGGCGCCGGA
GGATCAGGCGGTGTTGGTGGGTCCGGTGGAACGACAATCACGGAGGACTTGGATATCACGATCGA
35
TGGTGCCGACGGTCCCATTACCATCAGTGAGGAACTGACAATCAGTGGTGCTGGAGGGTCTGGGC
CGGGCGGCGCAGGACCCGGCGGCGTCGGACCTGGAGGGAGTGGACCGGGCGGAGTAGGCCCGGGC
GTCAGTGGACCTGGCGGTGTAGGGCCTGGTGGGTCGGGGCCGGGAGGGGTCGGTTCTGGCGGAAG
CGGGCCAGGAGGCGTGGGACCGGGAGGCTATGGGCCAGGTGGCTCCGGTAGCGGTGGCGTTGGAC
CCGGAGGTTACGGTCCGGGCGGGTCGGGTGGGTTTTACGGGCCCGGCGGTTCTGAAGGGCCATAC
GGACCCAGCGGAACGTACGGCTCTGGAGGCGGCTATGGTCCCGGTGGGGCAGGAGGTCCCTACGG
TCCCGGGAGCCCTGGCGGGGCATACGGTCCTGGTTCTCCCGGCGGTGCTTATTATCCCAGTTCTCG
GGTGCCGGACATGGTGAATGGTATCATGTCCGCTATGCAAGGTAGTGGTTTTAACTACCAGATGT
TCGGCAACATGCTATCGCAGTATAGCTCCGGGAGCGGCACATGCAATCCTAATAATGTCAATGTG
CTGATGGACGCTCTACTCGCCGCCTTACATTGCCTATCAAATCATGGCTCATCCTCATTTGCGCCT
AGCCCCACTCCCGCTGCGATGTCGGCCTACAGCAATAGCGTAGGACGGATGTTCGCCTAT
Amino Acid Sequence:
TTIIEDLDITIDGADGPITISEELTISGAGGSGPGGAGPGGVGPGGSGPGGVGPGGSGPGGVGPGGSGP
GGVGPGGAGGPYGPGGSGPGGAGGAGGPGGAYGPGGSYGPGGSGGPGGAGGPYGPGGEGPGGAGG
PYGPGGAGGPYGPGGAGGPYGPGGEGGPYGPGGSYGPGGAGGPYGPGGPYGPGGEGPGGAGGPYGP
GGVGPGGSGPGGYGPGGSGPGGYGPGGAGPGGYGPGGSGPGGYGPGGSGPGGYGPGGSGPGGYGPG
GSGPGGYGSGGAGPGGYGPGGSGPGGYGPGGSGPGGYGPGGTGPGGTGPGGSGPGGYGPGGSGPGG
SGPGGSGPGGYGPSGSGPGGYGPSGSGPGGYGPGGSGPGGYGPGGSGAGGTGPGGAGGAGGAGGSG
GAGGSGGAGGSGGAGGSGGVGGSGGTTITEDLDITIDGADGPITISEELTISGAGGSGPGGAGPGGVG
PGGSGPGGVGPGVSGPGGVGPGGSGPGGVGSGGSGPGGVGPGGYGPGGSGSGGVGPGGYGPGGSGG
FYGPGGSEGPYGPSGTYGSGGGYGPGGAGGPYGPGSPGGAYGPGSPGGAYYPSSRVPDMVNGIMSA
MQGSGFNYQMFGNMLSQYSSGSGTCNPNNVNVLMDALLAALHCLSNHGSSSFAPSPTPAAMSAYS
NSVGRMFAY
Bombyx mori Heavy Chain Fibroin 3’ partial (BMF1), Genbank Ascension S74439
Synthetic DNA Sequence:
GCGGGAACGGGGTCATCAGGTTTTGGCCCTTATGTAGCGAACGGTGGCTATTCTGGCTATGAGTA
CGCTTGGTCTAGTGAAAGTGATTTTGGTACCGGTAGTGGGGCAGGCGCAGGCAGCGGCGCTGGGG
CCGGTAGTGGGGCGGGAGCTGGATATGGTGCCGGTGTTGGCGCAGGCTATGGGGCCGGATATGGT
GCAGGTGCGGGAGCAGGGTACGGGGCAGGCGCTGGAAGTGGGGTCGCTTCGGGAGCCGGTGCCGG
36
CGCGGGGTCTGGAGCCGGCGCGGGAAGCGGGGCGGGTGCCGGGTCAGGAGCGGGTGCTGGTTCAG
GCGCAGGAGCTGGCTCCGGGGCGGGTGCAGGGTCCGGCGCAGGGGCTGGTTATGGCGCTGGCGCT
GGTTATGGCGCTGGCGCAGGTTATGGGGCGGGTGCTGGAGTCGGGTATGGTGCTGGCGCAGGTGT
AGGATACGGTGCGGGTGCGGGATATGGGGCGGGAGCCGGTGTGGGGTATGGTGCCGGGGCAGGGA
GTGGCGCGGCCTCAGGCGCCGGGGCCGGCTCAGGAGCAGGCGCGGGAAGCGGGGCTGGCGCAGGT
TCGGGTGCAGGAGCAGGCTCGGGAGCTGGTGCAGGATCGGGAGCTGGGGCAGGCAGTGGGGCCGG
AGCAGGTTCCGGTGCTGGCGCAGGTAGCGGAGCCGGTGCGGGTTATGGCGCGGGAGCTGGAGTAG
GCTATGGTGCAGGTGCGGGTTCAGGAGCTGCTAGTGGCGCGGGCGCTGGTTCCGGAGCAGGTGCT
GGCTCGGGCGCCGGAGCCGGGAGTGGCGCAGGCGCTGGATCAGGTGCCGGAGCAGGATCTGGGGC
TGGTGCCGGATCGGGAGCCGGTGCCGGGTCAGGGGCCGGCGCTGGGTCGGGCGCTGGAAGTGGTG
CAGGGGCAGGTTCAGGCGCAGGAGCGGGTTATGGTGCGGGTGCCGGAGCAGGCGTCGGGTACGGA
GCTGGGGCCGGTGCAGGTTATGGAGCGGGCTACGGTTATGGCGCTGGTGCCGGGGTGGGTTACGG
TGCGGGCGCAGGTTCTGGCGCCGCCAGTGGGGCCGGCGCAGGCTCAGGTGCAGGAGCCGGTAGTG
GGGCGGGCGCCGGGAGTGGGGCCGGCGCAGGCAGCGGAGCCGGGGCTGGTTCAGGAGCGGGGTCA
GGCGCGGGAGCTGGAAGTGGGGCCGGGGCGGGTTACGGTGCCGGATACGGAGCCGGTGTTGGGGC
AGGGTACGGAGCCGGCGCAGGCGTAGGGTATGGGGCAGGATATGGTGTAGGAGCTGGAGCAGGTT
ACGGCGCAGGTGCTGGGTCTGGCGCCGCATCAGGAGCCGGGGCCGGCAGTGGTGCGGGCGCTGGG
TCTGGGGCCGGTGCCGGTTCCGGGGCGGGAGCAGGCAGCGGTGCAGGGGCGGGATCGGGCGCTGG
CTCAGGCGCCGGAGCTGGGTATGGGGCGGGTGCTGGAAGTGGAGCGGCGTCAGGAGCCGGAGCAG
GTGCCGGGGCTGGTACAGGTAGCAGCGGATTCGGACCATACGTTGCCAATGGTGGATATTCCCGA
CGCGAAGGGTACGAATATGCCTGGTCTAGTAAATCGGATTTTGAGACAGGATCAGGTGCGGCTTC
GGGAGCTGGGGCAGGCGCCGGGTCAGGGGCCGGTGCAGGTTCGGGCGCGGGTGCTGGATCTGGGG
CTGGCGCAGGCAGTGGTGCGGGGGCGGGAGGTAGCGTTTCTTACGGTGCCGGTAGAGGGTATGGG
CAGGGCGCTGGCTCGGCCGCGAGCTCCGTGTCCAGTGCTTCGAGTAGAAGTTACGATTACTCGCGC
CGTAATGTCAGGAAGAATTGTGGTATCCCACGACGCCAGCTCGTAGTAAAATTTCGGGCCCTCCC
GTGCGTAAACTGC
Amino Acid Sequence:
AGTGSSGFGPYVANGGYSGYEYAWSSESDFGTGSGAGAGSGAGAGSGAGAGYGAGVGAGYGAGYG
AGAGAGYGAGAGSGVASGAGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGYGAGA
GYGAGAGYGAGAGVGYGAGAGVGYGAGAGYGAGAGVGYGAGAGSGAASGAGAGSGAGAGSGAGAG
SGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGYGAGAGVGYGAGAGSGAASGAGAGSGAGA
GSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGSGAGAGSGAGAGYGAGAGAGVGYG
37
AGAGAGYGAGYGYGAGAGVGYGAGAGSGAASGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGS
GAGAGSGAGAGYGAGYGAGVGAGYGAGAGVGYGAGYGVGAGAGYGAGAGSGAASGAGAGSGAGAG
SGAGAGSGAGAGSGAGAGSGAGSGAGAGYGAGAGSGAASGAGAGAGAGTGSSGFGPYVANGGYSRR
EGYEYAWSSKSDFETGSGAASGAGAGAGSGAGAGSGAGAGSGAGAGSGAGAGGSVSYGAGRGYGQ
GAGSAASSVSSASSRSYDYSRRNVRKNCGIPRRQLVVKFRALPCVNC
Araneus diadematus ADF1, Genbank Ascension U47853
Synthetic DNA Sequence:
CATGAATCTTCCTATGCTGCTGCAATGGCTGCTTCTACTCGTAATTCTGATTTTATCCGTAACATG
AGCTACCAGATGGGTCGTCTGCTGAGCAACGCCGGTGCCATTACCGAATCTACTGCAAGCAGCGC
GGCTTCCAGCGCGTCCTCCACCGTTACCGAGTCTATTCGCACGTATGGCCCGGCTGCGATCTTTTC
TGGTGCGGGCGCTGGCGCAGGCGTGGGTGTAGGTGGTGCCGGTGGTTACGGCCAGGGCTACGGTG
CAGGCGCAGGTGCTGGTGCGGGCGCCGGTGCGGGTGCTGGTGGCGCGGGTGGCTACGGTCAGGGC
TACGGTGCGGGTGCTGCCGCGGCGGCTGGCGCAGGCGCAGGTGCAGCGGGCGGTTACGGCGGTGG
TTCTGGTGCTGGCGCTGGTGGTGCTGGTGGTTATGGCCAGGGTTACGGTGCAGGTTCTGGCGCGG
GTGCGGGCGCTGCTGCGGCAGCTGGCGCATCCGCTGGTGCTGCTGGCGGCTATGGCGGTGGCGCAG
GTGTTGGTGCAGGTGCGGGCGCGGGTGCGGCTGGTGGCTATGGCCAGAGCTATGGCAGCGGTGCT
GGCGCAGGTGCGGGTGCTGGTGCGGCGGCTGCAGCTGGCGCTGGCGCACGTGCAGCGGGTGGCTA
CGGTGGTGGTTACGGCGCAGGCGCGGGCGCCGGTGCTGGCGCCGCTGCTTCCGCTGGTGCCTCCGG
TGGCTACGGTGGCGGTTACGGCGGTGGCGCGGGTGCAGGCGCCGTAGCTGGTGCGTCCGCGGGTT
CTTACGGCGGTGCGGTTAACCGTCTGTCTAGCGCAGGCGCGGCATCTCGTGTTTCCAGCAACGTGG
CTGCCATCGCGTCTGCGGGTGCGGCTGCGCTGCCGAACGTAATCTCTAACATTTATTCTGGTGTGC
TGTCCTCTGGTGTGTCTTCTTCTGAGGCGCTGATCCAGGCTCTGCTGGAAGTCATCTCTGCACTGA
TCCACGTGCTGGGTTCTGCCTCCATCGGTAACGTGTCTTCCGTTGGCGTTAACAGCGCACTGAATG
CAGTGCAGAACGCCGTCGGCGCGTACGCTGGT
Amino Acid Sequence:
HESSYAAAMAASTRNSDFIRNMSYQMGRLLSNAGAITESTASSAASSASSTVTESIRTYGPAAIFSGA
GAGAGVGVGGAGGYGQGYGAGAGAGAGAGAGAGGAGGYGQGYGAGAAAAAGAGAGAAGGYGGGS
GAGAGGAGGYGQGYGAGSGAGAGAAAAAGASAGAAGGYGGGAGVGAGAGAGAAGGYGQSYGSGA
GAGAGAGAAAAAGAGARAAGGYGGGYGAGAGAGAGAAASAGASGGYGGGYGGGAGAGAVAGASA
GSYGGAVNRLSSAGAASRVSSNVAAIASAGAAALPNVISNIYSGVLSSGVSSSEALIQALLEVISALIHV
LGSASIGNVSSVGVNSALNAVQNAVGAYAG
38
Araneus diadematus ADF2, Genbank Ascension U47854
Synthetic DNA Sequence:
GGTAGCCAAGGCGCAGGTGGTGCAGGTCAAGGTGGTTATGGTGCAGGCGGCGGTGGCGCTGCGGC
AGCTGCTGCTGCAGCGGTAGGCGCGGGTGGCGGTGGTCAGGGCGGCCTGGGTTCCGGCGGTGCGG
GCCAGGGTTACGGCGCAGGCCTGGGCGGTCAAGGTGGCGCATCTGCGGCGGCTGCTGCGGCTGGT
GGCCAGGGCGGTCAGGGTGGCCAAGGTGGCTATGGCGGTCTGGGTTCTCAGGGCGCAGGCGGTGC
TGGTCAGCTGGGCTATGGTGCAGGTCAGGAATCTGCAGCGGCTGCCGCTGCCGCAGCGGGCGGCG
CTGGTGGCGGTGGTCAGGGCGGCCTGGGTGCGGGTGGCGCTGGCCAAGGTTACGGTGCCGCTGGC
CTGGGCGGTCAGGGTGGTGCGGGCCAGGGCGGCGGCTCTGGCGCGGCGGCTGCGGCCGGTGGTCA
AGGTGGTCAGGGCGGCTATGGTGGCCTGGGCCCGCAAGGCGCGGGTGGTGCGGGCCAGGGTGGCT
ACGGTGGTGGTTCCCTGCAATACGGCGGTCAGGGTCAGGCTCAGGCAGCTGCGGCATCCGCGGCG
GCATCCCGCCTGTCTTCCCCATCCGCAGCGGCACGTGTGTCTTCCGCTGTATCTCTGGTATCCAAC
GGCGGTCCGACCAGCCCGGCGGCACTGAGCTCTAGCATTTCCAACGTGGTATCTCAGATCTCTGCA
AGCAACCCAGGCCTGTCTGGTTGCGATATCCTGGTTCAAGCCCTGCTGGAAATTATCTCTGCGCTG
GTTCACATCCTGGGTTCTGCCAACATCGGCCCGGTTAACTCTAGCTCCGCCGGTCAGTCCGCATCC
ATTGTAGGTCAATCCGTATACCGCGCTCTGTCT
Amino Acid Sequence:
GSQGAGGAGQGGYGAGGGGAAAAAAAAVGAGGGGQGGLGSGGAGQGYGAGLGGQGGASAAAAAA
GGQGGQGGQGGYGGLGSQGAGGAGQLGYGAGQESAAAAAAAAGGAGGGGQGGLGAGGAGQGYGA
AGLGGQGGAGQGGGSGAAAAAGGQGGQGGYGGLGPQGAGGAGQGGYGGGSLQYGGQGQAQAAAA
SAAASRLSSPSAAARVSSAVSLVSNGGPTSPAALSSSISNVVSQISASNPGLSGCDILVQALLEIISALV
HILGSANIGPVNSSSAGQSASIVGQSVYRALS
Araneus diadematus ADF3, Genbank Ascension U47855
Synthetic DNA Sequence:
GCCCGCGCGGGGTCAGGCCAGCAGGGACCAGGTCAACAGGGCCCGGGCCAACAAGGCCCGGGTCA
ACAGGGTCCGTACGGTCCGGGTGCCAGCGCGGCGGCCGCGGCCGCAGGAGGGTATGGCCCTGGTA
GCGGCCAACAGGGTCCGAGCCAGCAAGGCCCGGGCCAGCAAGGGCCGGGGGGCCAGGGGCCCTAC
GGCCCTGGTGCGTCAGCTGCCGCAGCCGCAGCTGGCGGTTATGGCCCGGGGTCAGGTCAGCAAGGG
CCAGGCGGTCAAGGTCCTTACGGGCCAGGCAGTAGTGCGGCAGCGGCTGCTGCCGGTGGTAACGG
CCCGGGGTCGGGCCAGCAAGGGGCGGGACAGCAGGGTCCAGGCCAACAAGGCCCCGGTGCGTCCGC
39
AGCGGCGGCGGCCGCTGGTGGCTATGGCCCGGGTTCAGGCCAGCAGGGCCCGGGGCAGCAGGGCCC
GGGTGGACAGGGTCCGTATGGCCCGGGGGCCAGTGCAGCGGCCGCGGCTGCTGGGGGCTATGGCC
CTGGCTCAGGTCAGGGTCCGGGTCAACAAGGACCCGGCGGTCAAGGACCGTATGGCCCGGGTGCG
TCCGCGGCGGCTGCGGCGGCTGGAGGCTATGGTCCGGGAAGTGGCCAACAGGGCCCTGGACAGCA
GGGTCCGGGTCAGCAGGGACCCGGTGGACAGGGCCCGTATGGGCCAGGCGCCTCTGCCGCAGCGGC
GGCCGCAGGTGGGTATGGACCGGGGTACGGCCAGCAGGGTCCTGGTCAGCAGGGACCGGGCGGCC
AGGGCCCTTACGGCCCCGGCGCGTCAGCTGCAAGCGCTGCCTCGGGTGGCTACGGCCCGGGTTCCG
GTCAGCAGGGCCCGGGACAGCAGGGTCCGGGTCAGCAGGGACCGTATGGTCCGGGAGCTTCTGCT
GCTGCCGCCGCGGCGGGTGGTTATGGACCCGGCAGTGGCCAACAAGGTCCGGGGCAGCAGGGTCC
AGGTCAGCAGGGCCCAGGACAGCAGGGCCCTGGTGGCCAAGGACCGTACGGTCCCGGCGCAAGTG
CGGCCGCTGCAGCTGCCGGAGGCTACGGTCCAGGTAGTGGACAGCAAGGACCGGGTCAGCAGGGC
CCCGGTCAACAGGGGCCGGGCCAGCAAGGCCCCGGGCAGCAGGGACCTGGGCAGCAGGGTCCCGG
GCAGCAAGGTCCTGGGCAACAGGGTCCGGGACAGCAAGGCCCTGGCGGCCAGGGTGCGTATGGGC
CTGGTGCATCTGCCGCGGCGGGCGCCGCGGGTGGGTACGGGCCGGGGAGCGGCCAGCAAGGTCCG
GGCCAACAGGGCCCCGGACAACAGGGTCCTGGCCAGCAAGGACCTGGCCAGCAGGGGCCGGGACA
ACAAGGGCCCGGCCAACAAGGCCCAGGGCAACAAGGCCCGTACGGCCCTGGGGCCTCGGCAGCCGC
GGCAGCGGCCGGCGGCTATGGCCCGGGCAGTGGTCAACAAGGTCCAGGCCAACAGGGCCCAGGGC
AGCAGGGTCCGGGGGGTCAAGGTCCGTACGGACCGGGTGCCGCCTCGGCAGCGGTGAGTGTAGGC
GGCTACGGACCTCAAAGCTCCTCTGTGCCAGTCGCCAGTGCGGTGGCTAGCCGTCTGTCTAGCCCC
GCCGCCAGCAGTCGTGTCAGCTCAGCCGTGTCGTCTTTAGTATCATCAGGACCGACTAAACACGCA
GCCTTGTCAAACACCATTAGCAGCGTTGTCTCTCAGGTGTCAGCGAGTAACCCGGGGCTGTCGGG
TTGCGACGTCCTGGTACAGGCCCTGCTGGAAGTGGTGAGCGCCCTCGTGTCTATTCTGGGTTCTAG
TTCCATTGGCCAGATTAACTATGGGGCGAGTGCGCAATACACCCAAATGGTCGGACAATCTGTTG
CGCAGGCACTGGCG
Amino Acid Sequence:
ARAGSGQQGPGQQGPGQQGPGQQGPYGPGASAAAAAAGGYGPGSGQQGPSQQGPGQQGPGGQGP
YGPGASAAAAAAGGYGPGSGQQGPGGQGPYGPGSSAAAAAAGGNGPGSGQQGAGQQGPGQQGPG
ASAAAAAAGGYGPGSGQQGPGQQGPGGQGPYGPGASAAAAAAGGYGPGSGQGPGQQGPGGQGPYG
PGASAAAAAAGGYGPGSGQQGPGQQGPGQQGPGGQGPYGPGASAAAAAAGGYGPGYGQQGPGQQ
GPGGQGPYGPGASAASAASGGYGPGSGQQGPGQQGPGQQGPYGPGASAAAAAAGGYGPGSGQQGP
GQQGPGQQGPGQQGPGGQGPYGPGASAAAAAAGGYGPGSGQQGPGQQGPGQQGPGQQGPGQQGP
GQQGPGQQGPGQQGPGQQGPGGQGAYGPGASAAAGAAGGYGPGSGQQGPGQQGPGQQGPGQQGP
40
GQQGPGQQGPGQQGPGQQGPYGPGASAAAAAAGGYGPGSGQQGPGQQGPGQQGPGGQGPYGPGA
ASAAVSVGGYGPQSSSVPVASAVASRLSSPAASSRVSSAVSSLVSSGPTKHAALSNTISSVVSQVSASN
PGLSGCDVLVQALLEVVSALVSILGSSSIGQINYGASAQYTQMVGQSVAQALA
Araneus diadematus ADF4, Genbank Ascension U47856
Synthetic DNA Sequence:
GCAGGCTCTAGCGCCGCAGCTGCCGCTGCTGCAAGCGGTAGCGGTGGTTACGGTCCAGAGAACCA
GGGTCCGTCCGGCCCAGTAGCATATGGCCCTGGTGGTCCAGTCTCTTCCGCTGCTGCCGCAGCTGC
TGCGGGCTCCGGTCCAGGTGGCTACGGTCCGGAAAACCAGGGCCCGTCTGGTCCGGGCGGTTATG
GCCCGGGTGGCTCTGGTAGCTCTGCAGCGGCGGCAGCCGCGGCAGCGTCTGGCCCAGGTGGTTACG
GCCCAGGCTCCCAGGGCCCGTCCGGTCCGGGCGGTAGCGGCGGTTATGGTCCTGGTTCCCAGGGTG
CAAGCGGCCCTGGTGGTCCGGGCGCATCTGCGGCAGCCGCCGCAGCAGCGGCTGCGGCAAGCGGTC
CGGGTGGCTACGGTCCGGGCAGCCAGGGTCCGTCTGGTCCTGGCGCCTACGGTCCAGGTGGCCCGG
GTTCCTCCGCTGCAGCCGCGGCTGCGGCTGCGAGCGGTCCTGGTGGCTACGGTCCGGGTAGCCAGG
GTCCTTCCGGTCCAGGTGTGTACGGCCCTGGTGGCCCGGGTTCTTCTGCTGCTGCAGCGGCGGCTG
CTGGCTCTGGTCCGGGCGGTTATGGCCCGGAAAACCAGGGTCCGTCTGGCCCTGGTGGTTACGGCC
CAGGTGGTTCCGGTTCCTCCGCTGCTGCGGCGGCAGCAGCTGCCAGCGGTCCAGGCGGTTACGGTC
CTGGCTCTCAAGGCCCGTCCGGCCCTGGCGGTTCCGGTGGCTATGGTCCGGGTTCTCAGGGCGGTT
CTGGTCCGGGCGCGAGCGCAGCTGCAGCGGCAGCCGCTGCATCTGGTCCTGGCGGTTACGGTCCGG
GTAGCCAGGGTCCATCCGGTCCGGGCTATCAGGGCCCGTCTGGCCCGGGTGCTTATGGTCCATCCC
CGAGCGCATCTGCGTCCGTGGCCGCTTCCGTCTATCTGCGTCTGCAACCGCGTCTGGAAGTTTCCT
CTGCTGTTAGCAGCCTGGTTTCCAGCGGTCCGACTAACGGCGCTGCTGTCTCTGGCGCCCTGAACT
CTCTGGTTTCCCAGATTTCTGCAAGCAACCCTGGTCTGTCTGGTTGCGACGCGCTGGTGCAGGCTC
TGCTGGAACTGGTTTCTGCGCTGGTTGCAATCCTGAGCAGCGCAAGCATCGGTCAGGTTAACGTT
TCTTCTGTCAGCCAGAGCACCCAAATGATTTCTCAGGCACTGAGC
Amino Acid Sequence:
AGSSAAAAAAASGSGGYGPENQGPSGPVAYGPGGPVSSAAAAAAAGSGPGGYGPENQGPSGPGGYG
PGGSGSSAAAAAAAASGPGGYGPGSQGPSGPGGSGGYGPGSQGASGPGGPGASAAAAAAAAAASGP
GGYGPGSQGPSGPGAYGPGGPGSSAAAAAAAASGPGGYGPGSQGPSGPGVYGPGGPGSSAAAAAAA
GSGPGGYGPENQGPSGPGGYGPGGSGSSAAAAAAAASGPGGYGPGSQGPSGPGGSGGYGPGSQGGS
GPGASAAAAAAAASGPGGYGPGSQGPSGPGYQGPSGPGAYGPSPSASASVAASVYLRLQPRLEVSSA
41
VSSLVSSGPTNGAAVSGALNSLVSQISASNPGLSGCDALVQALLELVSALVAILSSASIGQVNVSSVSQ
STQMISQALS
7. Supplementary References
1.
2.
3.
4.
Muller, J., S. Oehler, and B. Muller-Hill, Repression of lac promoter as a function of
distance, phase and quality of an auxiliary lac operator. J Mol Biol, 1996. 257(1): p.
21-9.
Darwin, K.H. and V.L. Miller, Type III secretion chaperone-dependent regulation:
activation of virulence genes by SicA and InvF in Salmonella typhimurium. Embo J,
2001. 20(8): p. 1850-62.
Widmaier, D.M., et al., Engineering the Salmonella type III secretion system to export
spider silk monomers. Mol Syst Biol, 2009. 5: p. 309.
Ayoub, N.A., et al., Blueprint for a high-performance biomaterial: full-length spider
dragline silk genes. PLoS One, 2007. 2(6): p. e514.
42
Download