Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Expression systems are based on the insertion of a gene into a host cell for its translation and expression into protein . • Many recombinant proteins can be expressed to high levels in E. coli systems. most common choice for expressing labeled proteins for NMR • Yeast (Pichia pastoris, Saccaromyces cerevisiae) is an alternative choice for NMR protein samples issues with glycosolyation of protein, which is not a problem with E. coli. choice between E. coli and yeast generally depend on personal experience. • Insect cells (Baculovirus) and mammalian cell lines (CHO) are very popular expression systems that are currently not amenable for NMR samples no mechanism to incorporate isotope labeling or the process is cost prohibitive 15N labeling in CHO cells can cost $150-250K! Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • First step of the process involves the insertion of the DNA coding region of the protein of interest into a plasmid. plasmid - small, circular pieces of DNA that are found in E. coli and many other bacteria generally remain separate from the bacterial chromosome carry genes that can be expressed in the bacterium plasmids generally replicate and are passed on to daughter cells along with the chromosome Plasmids are highly infective, so many of the bacteria will take up the particles from simple exposure. – Treating with calcium salts make membranes permeable and increase uptake of plasmids Plasmids used for cloning and expressing proteins are modified natural vectors - more compact and efficient - unnecessary elements removed Some Common plasmids - pBR322 - pUC19 - pBAD large collections of plasmids with unique features and functions - see: http://www.the-scientist.com/yr1997/sept/profile2_970901.html Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Basic Features of a Plasmid Defined region with restriction sites for inserting the DNA Gene that provides antibiotic resistance (ampicillin resistance in this case) replication is initiated Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Restriction Enzymes Recognizes and cuts DNA only at particular sequence of nucleotides blunt end – cleaves both ends sticky ends – cleaves only one strand Complimentary strand from DNA insert will “match” sticky end and insert in plasmid followed by ligation of the strands (T4 DNA Ligase) Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Restriction Enzymes Very large collection of restriction enzymes that target different DNA sequences Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Restriction Enzymes Restriction Map of plasmid showing the location where all restriction enzymes will cleave. allows determination of where & how to insert a particular DNA sequence – want a clean insertion point, don’t want to cleave plasmid multiple times Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Next step of the process involves getting E. coli to express the protein from the plasmid. this occurs by the position of a promoter next to the inserted gene two common promoters are lac complex promoter T7 promoter lac complex promoter: Transcription is simply switched on by the addition of IPTG (isopropyl βD-thiogalactoside) to remove LacI repressor protein. IPTG binds LacI which no longer binds the promoter region allowing transcription to occur Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression T7 promoter: Again, transcription is switched on by the addition of IPTG to remove LacI repressor protein. IPTG binds LacI which no longer binds the promoter region allowing transcription/production of T7 RNA polymerase to occur. T7 RNA polymerase binds the T7 promoter in the plasmid to initiate expression of the protein two-step process leads to an amplification of the amount of gene product - produce very high quantities of protein. Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Next step of the process involves growing the E. coli cells Shake Flask cells are place in a “growth media” that provides the required nutrients to the cell -amino acids, vitamins, growth factors, etc LB Broth Recipe (Luria-Bertani) 10 g tryptone 5 g of yeast extract 10 g of NaCl shake the flask at a constant temperature of 37O – keeps homogenous mixture – increases oxygen uptake grow cells to proper density (OD ~ 0.7 at 600nm) Cell growth in a Shake flask Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Next step of the process involves growing the E. coli cells Bioreactors more efficient higher production volumes – can be 100s of liters in size Can grow cells to a higher density – better control of pH – better control of oxygen levels – better control of temperature – better control of mixing – sterile conditions 14 liter bioreactor Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Next step is to harvest and lysis the cells and purify the protein Now that E. coli is producing the desired protein, need to extract the protein from the cell and purify it. the amount of protein that can be obtained from an expression system is highly variable and can range from mg to mg to even g quantities. it depends on the behavior of the protein, expression level, method of fermentation and the amount of cells grown over-expressed protein Biotechnology Letters (1999) 12,1131 Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Cell Lysis A number of ways to lysis or “break” open a cell Gentle Methods Osmotic – suspend cells in high salt Freeze-thaw – rapidly freeze cells in liquid nitrogen and thaw Detergent – detergents (DSD) solubilize cellular membranes Enzymatic – enzymatic removal of the cell wall with lysozyme Vigorous Methods Sonication – sonicator lyse cells through shear forces French press – cells are lysed by shear forces resulting from forcing cell suspension through a small orifice under high pressure. Grinding – hand grinding with a mortar and pestal Mechanical homogenization - Blenders or other motorized devices to grind cells Glass bead homogenization - abrasive actions of the vortexed beads break cell walls French Press Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Protein Purification - A large number of ways to purify a protein protocols are dependent on the protein chromatography is a common component of the purification protocol where typically multiple columns are used: a) size-exclusion b) ion exchange c) Ni column d) heparin e) reverse-phase f) affinity column dialysis for buffer exchange and removal of low-molecular weigh impurities To increase the ease of purifying a protein generally include a unique tag sequence at the N- or C-terminus HIS tag – add 6 histidines to the N- or C- terminus - preferentially binds Ni column FLAG tag – DYKDDDDK added to terminus - preferentially binds M1 monoclonal antibody affinity column glutathione S-transferase (GST) tags – fusion protein - readily purified with glutathione-coupled column Introduction to Isotope Labeling of Proteins For NMR Overview of Protein Expression • Some Common Problems protein is not soluble and included in inclusion bodies insoluble aggregates of mis-folded proteins inclusion bodies are easily purified and can be solubilized using denaturing conditions How to re-fold the Protein? Finding a re-folding protocol may take significant effort (months-years?) and involve numerous steps something to be avoided if possible protein is toxic to cell find a different expression vector or use a similar protein from a different organisim proper protein fold proper disulphide bond formation – may need to re-fold the protein presence of tag may inhibit proper folding – may need to remove the tag low expression levels try different plasmid constructs try different protein sequences Introduction to Isotope Labeling of Proteins For NMR 13C and 15N Isotope Labeling of the protein • cells need to be grown in “minimal media” • use 13C glucose to achieve ~ 100% uniformed 13C labeling of protein • use 15N NH4Cl to achieve ~ 100% uniformed 15N labeling of protein glucose and NH4Cl are sole source of carbon and nitrogen in “minimal media” E. coli uses glucose and NH4Cl to synthesize all amino-acids protein added prior to expressing protein of interest 13C glucose and 15N NH4Cl can be added simultaneously both Journal of Biomolecular NMR, 20: 71–75, 2001. Introduction to Isotope Labeling of Proteins For NMR 13C and 15N Isotope Labeling of the protein • Usually isotope labeling does not negatively impact protein expression • Some Common Problems with Isotope Labeling Problems “minimal media” stresses cells slower growth typically lower expression levels isotope labeling of All proteins minimal isotope affect may affect enzyme activities isotope labeling of expressed protein may affect protein’ properties solubility? proper folding? 1H-15N HSQC spectra of 13C,15N labeled protein Introduction to Isotope Labeling of Proteins For NMR 13C and 15N Isotope Labeling of the protein • Can introduce specific amino acid labels • A variety of 13C and 15N labeled amino acids are commercial available Add saturating amounts of 19 of 20 amino acids to minimal growth media 13C and 15N labeled amino acid prior to protein expression Add • media is actually very rich and the cells grow very well cells exclusively use the supplied amino-acids to synthesize proteins • all of the occurrences of the amino-acid are labeled in the protein may be some additional labeled residues if the labeled amino acid is a precursor in the synthesis of other amino acids. 1H-15N-HSQC of His, Tyr & Gly labeled SH2-Domain no mechanism to label one specific amino acid i.e Gly-87 Introduction to Isotope Labeling of Proteins For NMR 13C and 15N Isotope Labeling of the protein • Can label specific segment in protein use peptide splicing element intein (Protozyme) inteins are insertion sequences which are cleaved off after translations preceding and succeeding fragments are ligated extein Protein of Interest 15N-labeled J. Am. Chem. Soc. 1998, 120, 5591-5592 Introduction to Isotope Labeling of Proteins For NMR 13C and 15N Isotope Labeling of the protein • Can also label only one component of a complex simply mix unlabeled and labeled components to form the complex greatly simplifies the NMR spectra 13C, 15N NMR resonances for labeled component of complex only “see” can see interactions (NOEs) between labeled and unlabeled compoents J. OF BIOL. CHEM. (2003) 278(27), 25191–25206 Introduction to Isotope Labeling of Proteins For NMR 2H Labeling of the protein • simply requires growing the cells in D2O severe isotope effect for 1H2H stresses the cell E. coli needs to be acclimated to D2O pass cells into increasing percentage of D2O cell growth slows significantly in D2O (18-60 hrs) 2 level of H labeling depends on the percent D2O the cells are grown in aromatic side-chains will be highly protonated if 1H-glucose is used 2 1 exchange labile N H to N H by temperature increase or chemical denaturation of the protein Introduction to Isotope Labeling of Proteins For NMR 2H Labeling of the protein • As we have seen, deuterium labeling a protein removes a majority of protons necessary for protein structure calculation can introduce site specific protonation to regain some proton based distance constraints label the methyl groups of Leu, Ile, and Val by adding [3,3-2H2]-13C 2-ketobutyrate. [2,3-2H2]-15N, 13C Val to the growth media. 1 1 use H-glucose to generate H-aromatic side-chains Metabolic pathway for generating 1H-methyl-Ile