• Define the terms genome, genotype, phenotype, chromosome and nucleoid in the context of the cell structure and characteristics of prokaryotes. Phenotype – physical features and functional traits of an organism, a reflection of the proteins possessed at any point in time Gene – unit of genetic information, segment of DNA which possesses info specifying the sequence of AA in a protein or the sequence of nt in a stable RNA molecule Genotype – particular set of genes in the organism’s genome Genome – total physical DNA that specifies an organism’s characteristics • Explain the importance of proteins and diversity of protein function to life. Structure, catalysis, regulation, transport, sensing/signalling, force-generation • Describe how genes and proteins are named. Gene: rpsL or rpsL Protein: RpsL • Explain why and how chromosomal DNA is folded and compacted in Bacteria and Archaea. Different packing in Bac and Archaea. In Bacteria, small positively charged polyamines used to neutralize negatively charged phosphates of DNA reducing repulsive forces and helping to compact the DNA. Also, supercoiling (like a telephone cord twisting). Bacteria produce enzymes that can twice DNA into supercoils. The boundaries of each loop of a folded chromosome has anchoring proteins called histone-like proteins. Histones are also positively charged. In Archaea, DNA is also supercoiled and some Archaea use histone proteins (similar to Euk) to compact their DNA. Nucleosome. • Describe the nature of the prokaryotic nucleoid highlighting the differences between Bacteria and Archaea as well as the unique features of the nucleoid in some Planctomyces In Bacteria and Archaea, chromosome refers to the DNA component of the nucleoid. Most prok possess singular circular chromosome, some possess single linear chromosome, some possess segmented genome. Difference not understood why. Chromosome refers to chromosome and plasmids and prophage. • Explain the structural and functional relationships between the linear structure of a prokaryotic protein encoding gene, a mRNA and a protein. • Explain what is meant by gene expression and why genes need to be expressed to have an effect on an organism’s characteristics Genes need to be expressed mRNA protein (or tRNA or rRNA). Genotype determines phenotype but to affect the phenotype, the proteins specified by the genes need to be synthesized (from the genes). When the protein is made, the gene is said to be expressed. Gene expression is the process of transcription (and then translation if it was mRNA). Central dogma. Use RNAP and ribosomes. Gene expression depends on many proteins which have the ability to bind to DNA or RNA. Nucleic acid binding proteins tend to be specific. Some only bind to ssDNA, or dsDNA or RNA, etc. • Distinguish between transcription and translation. • Explain the process of bacterial gene expression from the standpoints of initiation, elongation and termination during the processes of transcription and translation. Prokaryotic mRNA is made out of a 5’ UTR containing the ribosome binding site (5 nt), 5 – 9 spacer nt, the initiation/start codon AUG (5’ to 3’), the coding region, the translation termination or stop codon (UGA, UAA or UAG) and 3’ UTR containing the transcription terminator, a region of dsDNA. Region b/t 5’ and 3’ UTR called coding region b/c consists of codons. Nt sequence of mRNA is used by ribosomes as guide to link AA together in a particular sequence resulting in a unique protein. Ribosomes specifically bind to the mRNA at the ribosome binding site to begin translation. Rbs are about 5 bases in size. There are different rbs. After binding at the rbs, with tRNA, ribosomes move along mRNA from 5’ to 3’ joining AA via peptide linkages. Translation can occur at the same time as transcription. Nucleotides are read by the ribosomes and tRNA. The vast majority of prokaryotic proteins begin with AA methionine specified by AUG. Proteins are synthesized from N terminus to C terminus. That means the 5’ end of the mRNA is for the N terminus. Proteins can synthesize methionine at other positions, but translation only starts at the AUG suitably spaced (5-9 nt) from the rbs. Transcription promoter is the dsDNA region where RNAP initially binds to start transcription. It therefore defines the beginning of a gene and is about 50 bp. Bacteria have a single RNAP with 4 core subunits – 2 alpha and 1 beta and 1 beta prime. RNAP exists as a holoenzyme or core enzyme (different). The core enzyme must associate with a 5th protein called the sigma subunit to form the holoenzyme. Sigma binds to the promoter first, then RNAP binds and slides along to find the promoter sequence. When detected, it stops and transcription is initiated as the sigma falls off. There are many sigma subunits. Control transcription of entire categories of genes by controlling which sigmas are made. In Archaea and Euk, RNAP have multiple subunits and none are capable of recognizing promoters. Instead, transcription factors recognize and bind to sites adjacent to promoters. When the RNAP slides along the DNA, it stops and initiates transcription when it encounters a bound transcription factor. Following promoter is the coding region, largest part of gene that specifies order of AA. 1 kb gene has an average coding region of 900 bp 299 AA. It takes 3 nt pairs (6 nucleotides) in DNA to specify 1 amino acid in a protein. Following RNAP binding to the promoter, RNAP separates the DNA strands to gain access to the template strand and moves 3’ to 5’ synthesizing mRNA 5’ to 3’. Enzyme uses nt sequence of template as a guide for choosing the complementary nt to link together. Termination. Transcription termination is poorly understood in Euk and Arc. Only look at Bac now. Two mechanisms – Rho independent termination and Rho dependent termination. RNA binding protein called Rho which binds to C-rich sequences in the RNA that is not being translated, then slides along RNA in 5’ to 3’ direction, towards the RNAP. When it catches up to the RNAP, it pulls the transcript free of the RNAP and template, releasing it. This happens at sequences that cause RNAP to pause briefly, thus allowing Rho to catch up. Rho-dependent termination can stop transcription when the RNAP has moved beyond the end of the last gene in an operon. Rho-independent termination. RNAP falls off teh template when it is paused over a series of A-U base pairs (which are relatively weak). Pausing is caused by a stem-loop structure in the RNA. The DNA sequences that encode rho-independent transcription terminators can be recognized as palindromes. When the terminator sequence is transcribed into RNA, forms loop – double stranded hairpin like structure called the transcription terminator. Should be after the last codon. • State the general differences between the processes of bacterial and archaeal gene expression. The differences mainly lie in the process of transcription initiation. For bacteria, there is a sigma and the RNAP core enzyme is made out of 2 alpha, 1 beta and 1 beta prime subunit which all join to form one enzyme. It joins with the sigma to form the holoenzyme. The sigma is the one responsible for identifying the promoter region on the dsDNA. The sigma subunit falls off once transcription has started. In the Archaea, the RNAP is different. Archaea and Euk have similar transcription initiation methods (compared to Arc and Bac). In Archaea, the RNAP is made out of several protein subunits and none of them are similar to sigma. There are transcription factors that bind to the sites adjacent to the promoters. When RNAP slides along the DNA, it stops and initiates transcription when it encounters the bound transcription factor. • Explain why transcription and translation of a gene can occur at the same time in prokaryotes but not in eukaryotes and why many copies of the same protein can be simultaneously translated from a single prokaryotic mRNA. Nucleus. • Describe the similarities and differences between prokaryotic protein-encoding genes and those genes that encode stable rRNA molecules. Prokaryotes have a substantial number of genes that encode stable RNA. In Bacteria and Archaea, these genes typically occur in operons. This makes sense because both rRNA and tRNA needed to do protein synthesis, makes sense to have genes for them in the same operon. Initially RNA products are made together as one long precursor RNA. Nuclease enzymes cleave the precursor molecule yielding individual RNA products. • Explain the structure of operons and their significance. In Bacteria and Archaea, common to find genes clustered together in the genome if they specify proteins involved in the same cellular process. These gene clusters are called operons. Genes of operons are transcribed together as a single mRNA initiating at one promoter, but two or more different proteins are translated from the single mRNA. Thus on the single mRNA there would be two rbs, two AUG and two stop codons. But only one transcription terminator after the 2nd stop codon. • Compare and contrast the expression of genes in an operon composed of protein-encoding genes and and one composed of stable RNA genes. • Distinguish between the number of unique protein molecules a prokaryote can synthesize and the total number of protein molecules that may be present in a prokaryotic cell at any point in time. Not all proteins are produced due to gene expression – depends on environment and need. • Using examples explain why different proteins are normally present in a prokaryotic cell in in a different number of copies. Proteins needed in difference amounts. RNAP always needed but DNAP not. • Explain why prokaryotes regulate gene expression. Different proteins needed in different amounts. Don’t want to waste resources. Don’t want to produce something that isn’t required. Waste of energy. If waste energy, growth rate will be slowed. • Distinguish between constitutive and environmentally-regulated genes and gene expression. • Explain two ways the expression of constitutively-expressed genes is regulated in prokaryotes. One way of controlling this is through promoter strength. A strong promoter has a nucleotide sequence that RNAP binds to more strongly than a weak promoter. If it binds more strongly, it is more likely initiate transcription. A weak promoter tends not to be bound as much, that is – most of the time, RNAP will not be found bound there so it won’t be transcribing as much. Isn’t all or nothing event – dynamic equilibrium. Another way is mRNA stability. If mRNA is less stable, it will tend to be degraded at a higher rate. If it’s not around for that long after transcription, the protein may not be able to be made as much. As soon as they are made, mRNAs subject to degradation by ribonucleases (RNases). If more specific protein needed, more of the mRNA must be transcribed. RNase E is important in the first step of mRNA degradation. Different mRNAs have different half lives. Some have only a minute some have an hour. Longer it lasts, more protein can be made. One factor that affects mRNA half life is the nucleotide sequence of the 5’ UTR. This affects RNase E binding presumably, which initiates the process of degradation. rRNA is protected by proteins making up the ribosome. Double stranded RNA is usually not subject to degradation as much. RNase binds to the 5’ UTR and moves 5’ to 3’ making nicks and therefore equally sized mRNA segments. These segments are further processes by other RNase enzymes. • Describe the protein components of the iron acquisition system in E. coli and their function. Iron is important for nearly all life. Usually insoluble as Iron oxides or hydroxides which are more difficult to obtain. Pathogenic bacteria face this problem when growing in the body. For example, Pseudomonas aeruginosa. E. coli is a gram negative bacteria. There is a protein called FepA in the outer membrane that serves to transport siderophore-Fe(III) complex. There is a protein called FepB which is a periplasmic protein that carries siderophore-Fe(III) complex from the outer membrane protein FepA to FepC in the CM. Actually it’s FepD. FepD is a protein in the CM and underneath it is FepC. The FepC and D transport the siderophore-Fe(III) complex from the periplasm to the cytoplasm where the Fe(III) is reduced to Fe(II) and then release d from the siderophore – the siderophore can be recycled. ATP is required for the CM transporter to do its job. • Explain how the expression of the genes encoding the iron acquisition system in E. coli is regulated in response to environmental iron and why the regulatory response makes sense. The genes encoding proteins for iron acquisition are present in several operons in the E. coli chromosome. For example, FepC, FepD and FepG are in the same operon – they have the same promoter, and transcription terminator (but the mRNA has several rbs and start codons and stop codons). The three gene operon is called fepCDG. They are transcribed together as one mRNA from which the 3 proteins are then translated. Expression of the fepBDG genes is regulated such that the fepCDG operon is transcribed at high levels only when Fe is limiting in the environment. When sufficient iron is present in the environment, the fepCDG operon is transcribed at very low levels. Overlapping the promoter sequence is an operator which is a sequence within the promoter sequence. The operator is called fur box. fur is a gene. Present in the chromosome of E. coli is regulatory gene fur which is CONSTITUTIVELY expressed. This is so that the bacterium can ALWAYS be sensing whether there is iron in the cytoplasm. If there isn’t then it does stuff. fur specifies the protein Fur which is a regulatory protein that binds to the fur box operator in the promoter region of the fepCDG operon. The Fur protein therefore recognizes the nt sequence of the fur box operator and can bind there. However, whether or not it binds actually depends on a signal molecule which is iron which makes sense because we are talking about acquiring iron and whether we need to make more proteins to get more iron or not. The iron is the signal molecule that can bind to the specific protein Fur. When iron binds to the Fur protein, the Fur protein undergoes an comformational change that allows it to bind strongly to the operator in the promoter region of the fepCDG operon. When it does this, RNAP cannot bind and transcribe the operon. This makes sense because since there’s iron you don’t need to make more of the transport proteins to transport more iron in – that would be a waste of resources. However, when there is no iron present (or little), then Fur protein has a LOWER AFFINITY for the operator called fur box in the operator of the promoter of fepCDG so RNAP can bind more often (sigma first) and initiate transcription which makes sense because you would need the proteins to bring in more iron into the cell transporting the siderophore-Fe(III) complex. The reason why RNAP cannot bind is because it usually binds to the promoter but the operator overlaps with the promoter so it’s harder to bind. RNAP actually still transcribes but less when Fur is bound. It seems like if Fur is not bound to Fe, it cannot bind to the promoter (operator) at all. A regulon is a set of genes whose expression is regulated in the same way regardless of their location in the genome. For example, genes for siderophore synthesis and genes for siderophore-Fe transport constitute a regulon. • Explain the ways the expression of environmentally-regulated genes is regulated at the level of transcription initiation in prokaryotes and why a particular regulatory response makes sense. There are constitutively expressed genes and environmentally expressed genes. Constitutively expressed genes are always expressed because their protein products are always required. For example, RNA polymerase. For example the proteins in the ribosome. Transport proteins. Constitutively expressed genes are still regulated because their degree of expression is fixed at different levels. This is regulated through mechanisms like promoter strength and mRNA stability. These are fixed, obviously. Environmentally regulated genes are expressed at different levels depending on the cell’s environment. The cell needs to be able to sense the environmental conditions and respond accordingly. They use numerous mechanisms to sense the environment and translate this information into action. The surface of the cell has many sensing proteins that monitor osmolarity, pH, etc. They are also sensors for telling whether other organisms are nearby. Some changes trigger an immediate change in behaviour WITHOUT the synthesis of any new proteins – chemotaxis for example. Others result in immediate change in gene expression because new proteins are needed under the new environmental conditions. This change in gene expression may be sequential – one gene expression changes then another. This is cell differentiation and it is important that a certain protein is made first before another. In order to regulate gene expression based on environmental conditions, need to sense environmental conditions and respond by altering level of gene transcription. There are four ways that gene expression is controlled at the transcription initiation level when the gene(s) in question is/are environmentally regulated. Regulatory proteins can exert positive or negative control. If a regulatory protein exerts positive control, the gene is said to be positively controlled. This means that when the regulatory protein binds to the operator for the gene/operon, the gene will be transcribed. On the other hand, if a regulatory protein that exerts negative control binds to the operator of the gene/operon, the gene will not be transcribed. Regulatory proteins that exert positive control are called activator proteins. Ones that exert negative control are called repressor proteins. The ‘operator’ for activator proteins is actually not part of the promoter but upstream of it (on the template strand). The operator for repressor proteins is generally part of the promoter or downstream of the promoter wrt the non-template strand. Sometimes, operators for the activator proteins are called activator-binding sites. For most operons, genes specifying regulatory proteins are not part of the operon whose expression they control wtf that’s obvious. Because if they were they would control themselves. The gene for the regulatory protein is usually constitutively expressed at low levels. Whether a regulatory protein binds to the operator (or activator-binding site) depends on another molecule, an effector molecule aka signal molecule/chemical. In the case of iron acquisition in E.coli, it was iron. Regulatory proteins are allosteric, meaning they can adopt different shapes and therefore some shape(s) allow them to bind to the operator or activator-binding site, others don’t. Their shapes depends on whether they are bound to the effector molecule aka signal molecule or chemical. There are two different types of effector molecules. One is an inducer. When the inducer binds to the regulatory protein, then transcription occurs. For an activator protein, the inducer would cause the protein to bind to the activator-binding site and transcription would occur at a higher level. For a repressor protein, the inducer would cause it not to bind to the operator so that transcription can occur. The other protein is called a corepressor. The corepressor binds to the repressor protein and they bind to the operator and prevent transcription initiation. For activator proteins, the corepressor is in fact called an inhibitor because it inhibits transcription by binding to the activator protein and preventing it from binding to the activator binding site and starting transcription. Usually activator proteins do not bind strongly to the activator-binding site, or else that would waste lots of resources. • Using the Aliivibrio fischeri-Eupryma scolopes example, explain how quorum sensing works and its importance in the life of prokaryotic organisms. Aliivibrio fischeri is a bacteria that lives in the gut of the squid Eupryma scolopes and they have a symbiotic relationship. The squid Eupryma scolopes provides the Aliivibrio fischeri with nutrients, a warm place to live, protection from stuff like protozoa and other things. On the other hand, Aliivibrio fischeri creates light at night. This makes them look like stars so predators don’t go and eat them, and they can come out to eat. In the morning the Eupryma scolopes craps so that there’s not too many bacteria. Quorum sensing is important because Aliivibrio fischeri will not make light in the absence of other bacteria. That is, there needs to be a sufficient amount of bacteria first. There’s not much point for only a single bacterium to make light because that would make very little light. They sense each other by quorum sensing. It is a type of environmentally regulated gene expression in which gene expression is regulated by population density. Quorum sensing involves the production of a specific signal molecule called the autoinducer. The autoinducer can freely diffuse across the cytoplasmic membrane and outside of the cell and it can enter other cells and induce the expression of certain genes if its concentration is high enough. If the concentration is high enough then that means that there are many cells in the area. Each is making the same autoinducer, obviously. Quorum sensing is used to make sure that there are enough members of the bacteria before it does anything. Otherwise, no effect. Basically the autoinducer acts as an inducer for an activator protein. So it binds to it, changes its 3D conformation making it have higher infinity for the activator-binding site on the DNA so that more mRNA will be transcribed more proteins produced. • Using the current information derived from the nucleotide sequencing of prokaryotic genomes describe prokaryotic genome diversity in terms of size, gene number, average gene number and gene content (function, distribution) About 900 prokaryotic genomes have been sequenced. About 900 are in the progress of being sequenced. Size is an indicator of the amount of genetic information. Size ranges from 500 to 15,000 kb. Average is 3,700 kb (3,700,000 bases) and median is 3,600 kb. Parasitic genomes are smallest. Free-living prokaryotic genomes are largest. The average prokaryotic gene is about 1 kb in size. 3’ UTR and 5’ UTR are about 100 bp so the average coding region is about 900 bp and the average protein length is 299 AA. Metabolic and transport genes are typically the most abundant class of genes. • Using examples explain the correlation between prokaryote genome size and prokaryotic life styles (eg. parasitic, free-living etc) Parasitic genomes like those of mycoplasmas are smallest. Free-living like Sorangium cellulosum are largest. • Explain the general principles underlying the use genomic analysis as a way to understand prokaryotes as well some limitations of this type of analysis. When we try to identify the function of proteins, we look at the gene that codes for it and compare the gene sequence with ones we already know of the function that it codes for. Then we say that they are used for the same thing. This can be inaccurate, and furthermore, not all kinds of proteins are known. • Solve problems similar to those in the Chapter 3 Questions and Problems handout (VISTA).