SOLEXA aka: Sequencing by Synthesis 1) Randomly cut DNA fragments and ligate adaptors to both ends 2) Attach each adaptor to a flow cell to create ssDNA templates. After each round of amplification, the strands are denatured to create more templates. 3) Add four labeled reversible terminators, primers, and DNA polymerase. 4) Wash off all of the unbound components, excite the laser, and read the emitted fluorescence of the first base for each cluster. 5) The blocked 3’ terminus and fluorophore has to be removed before repeating to determine the next base 434 Sequencing • Capable of sequencing 400-600 megabases of DNA in a 10 hour run • Applications: 1.Whole genome sequencing 2.Amplicon sequencing 3.Transcriptome sequencing 4.Metagenomics • Advantages: low cost, longer reads and higher accuracy than the Sanger chaintermination method 434 Sequencing Concept • Large-scale parallel pyrosequencing, “sequencing by synthesis”: 1.Each nucleotide is added in turn, producing PPi 2.Sulfurylase uses PPi to produce ATP, that is used by luciferase to produce light 3.Apyrase denatures the remaining dNTP at each step 4.The light produced with the addition of each nucleotide can be plotted to form a sequence 434 Sequencing Technology • DNA is fractionated, 5’ biotin labeled and attached to streptavidin coated beads. The non-labeled strand is released. • The DNA on each bead is amplified onto the bead • One bead is deposited in each well • Pyrosequencing takes place in the Genome Sequencer FLX instrument Applied Biosystems SOLiD System © Copyright 2008 Applied Biosystems. All Rights Reserved. © Copyright 2008 Applied Biosystems. All Rights Reserved. Characteristics used to define Origins of Replication * The position on the DNA at which replication start points are found. (4 methods discussed) * A DNA sequence that when added to a non-replicating DNA causes it to replicate. * A DNA sequence whose mutation abolishes replication. * A DNA sequence that in vitro is the binding target for enzyme complexes known to function in initiation of DNA replication. 1) In synchronously replicating cells, density transfer experiments can reveal which sequences replicate earliest. What is the direction of fork movement here? % of probe in HL DNA Genes on a chromosome A B C 100 C B 0 Time A TR50 Calculation for human chromosomal probes on genome tiling arrays (ENCODE project) TR50 - Time at which 50% of the locus is replicated In the example below, ‘probe A’ has a TR50 of 1.25hr (80% at 2hr, 0% at 0hr) ‘probe B’ has a TR50 of 6.33hr (100% at 8hr, 40% at 6hr) Example: TR50 100 Time Point Probe A Probe B 0-2 .80 .10 2-4 .10 .10 4-6 .10 .20 6-8 0 .60 % cumulative replication 90 80 Gene Probe A Gene Probe B 70 60 50 40 30 20 10 0 0 2 4 Hr of S phase 6 8 Predicted origins, UCSC genome browser, ENCODE: UVa DNA replication tracks Neerja Karnani Chris Taylor Ankit Malhotra 2) Electron microscopy of bubbles after restriction enzyme digestion BamHI BamHI EcoRI EcoRI 3) Hybridizing lagging strand nascent DNA to strand-specific fragments shows site of switch from leading to lagging strand synthesis Upper strand A fragments Lower strand fragments E B F C D G H Nascent DNA used to probe DNA fragments: A B C D E F G H Autorad after hybridizing nascent strands A B C D E F G H DNA spotted on membrane 3b) Hybridizing of leading strand nascent DNA shows sites near origins that are enriched in nascent strand Nascent strand abundance E F Probes A B C G D H Okazaki fragments: <500 bases long Leading strand nascent strands: > 500 bases long Leading strand nascent strands near origins < 2000 bases long Nascent DNA 500-2000 base long used to probe fragments along chromosome E F A E F A B C D B C D G H G H Nascent strand peak at origin DNA spotted on array 4) 2D Gel electrophoresis of replication intermediates followed by hybridization with a DNA fragment reveals whether a replication bubble originates in the fragment Dimension 1: separates by size; dimension 2: separates by shape Sequence that when added to a non-replicating DNA causes it to replicate; this approach was used to identify Autonomously Replicating Sequences (ARS) in yeast. Sequence that when mutated causes a replicating DNA to fail to replicate; this approach was used to identify essential elements in an (ARS) in yeast Yeast containing Leu2 on a plasmid Grow under non-selective conditions Plate yeast on Leucine minus plates to estimate rate of plasmid loss Linker scanning mutations in yeast ARS1 Examples of mutants % URA+ colonies WT-ARS1 B3 B2 B1 A OriC in E. coli chromosomal DNA Initiation of DNA replication in E. coli Nature Structural & Molecular Biology – 13, 676 - 683 (2006) Berger JM DnaA and oriC independent replication initiation in E. coli from hairpins, D-loops and R loops Two primosomes for E. coli chromosomal replication Site of primosome assembly oriC, A-site n'-pas, D-loop, R-loop Recognition DnaA protein PriA protein Auxilliary proteins for loading helicase (HU for oriC) PriB PriC DnaT Helicase DnaB/DnaC Priming Primase SV40: a polyomavirus that is very useful for studying eukaryotic replication enzymes Early genes : proteins essential for viral DNA replication T antigen binding sites cluster around the ori Properties of SV40 T antigen * * * * * * 98% nuclear: NLS origin-specific DNA binding DNA independent ATPase ATP dependent DNA helicase Binds to DNA polymerases, AP2, p53, Rb Undergoes phosphorylation etc. SV40 ori Li & Kelly SV40 DNA replication In vitro. PNAS (1984) 81:6973 COS cells (infected with SV40) (expresses T Ag) + plasmid + dNTP + rNTP + 32PdATP 37o C, 60 min Analyze products by electrophoresis and autoradiography SV40 + ori + What else could they do to validate their system? Fractionation to homogeneity COS cells (infected with SV40) (expresses T Ag) Human HeLa or 293 cells T Antigen from cell lysate + baculovirus expression system PC 0.2 M KCl 0.66 M KCl (RF-C, pol + pol ) QS 0.2 M KCl 0.4 M KCl (PCNA) ssDNA 0.6 M KCl 1 M KCl RPA ELONGATION FACTORS ARE CONSERVED Prokaryotic Function Eukaryotic SSB ss DNA coating RPA PRIMASE RNA primer synth. Primase subunit of DNA POL , 5'-3' polymerase POL III CORE 3'-5' exonuclease 5'-3' exonuclease DNA POL , DNA POL DNA POL Fen1 COMPLEX ATP dependent clamploader RF-C CLAMP processivity factor PCNA LIGASE seal nicks Lig1 DnaB Helicase ? Sub Gene unit Bacterial Function Eukaryotic dnaE | DNA POL dnaQ (mutD) | POL III CORE | 5'-3' polymerase 3'-5' exonuclease 5'-3' exonuclease | | | COMPLEX | | ATP dependent clamploader dnaX ' dnaX dnaN CLAMP processivity factor DNA POL Fen1 RF-C PCNA CONSERVATION FROM PROKARYOTES TO EUKARYOTES RF-C RF-C RF-C is a five-subunit complex All subunits are related in sequence and have ATP binding motifs ATP hydrolysis by RF-C is associated with the loading of PCNA RF-C is the functional homolog of the clamp-loader complex Polymerase switching occurs even on lagging strands; pol does most of DNA synthesis Dna2 endonuclease is also necessary for this step How do you think the PCNA is removed after ligation? PCNA interacts with RF-C, pol , Fen1, DNA ligase, CAF1 and MCMT Several of these have a common motif used in the interaction: Q-X-X-L/I/M-X-X-FF/Y p21/CIP1/WAF1, a protein induced by the tumor suppressor p53 uses the same motif to interact with PCNA What effect is p21 expected to have on DNA replication/repair? FEN1 interacts with PCNA DNA Helicase 15mer 3‘ 5‘ P 5‘ * Helicase binds ss DNA * Hydrolyzes ATP to move along ssDNA and peel of substrate DNA * Can move 5‘ to 3’ or 3‘to 5’ or in both directions on ssDNA and is classified as such 30mer 3‘ P 5‘ 3‘ te a TP r t A s + b e e te u s s a s a tr ca d ic i s l l e l b e i e u h h o S + + B * Dimers or hexamers 30mer 15mer Helicases are ATP driven molecular motors Conserved helicase motifs: Includes the following for ATP binding and hydrolysis I = Walker A motif GxGxGKT II = Walker B motif FFFDEad Binds ATP Tight conformation Hydrolyzes ATP Relaxed conformation Mutation in a yeast gene that causes a failure in MiniChromosome Maintenance (MCM genes) Yeast containing Leu2 on a plasmid Grow under non-selective conditions Plate yeast on Leucine minus plates to estimate rate of plasmid loss Form hexamer Associates with MCM2-7 Newbies of unknown function MCM8 MCM9 MCM homolog from M. thermoautotrophicum (an archaebacterium) has helicase activity. What direction? Mt MCM forms double hexamers T antigen helicase domain: XS Chen, Nature, 2003;423:512-8 MtMCM-N terminal (non-helicase) domain: XS Chen, Nature Str. Biol. 2003;10:160-7 Model of replicative double-hexameric helicase action : XS Chen, Nature, 2003;423:512-8