Midterm Exam #1 MB 451 Microbial Diversity

advertisement
Midterm Exam #1
MB 451
Microbial Diversity
Honor pledge: “I have neither given nor received unauthorized aid on this test.”
Name : ___KEY___________________________________________________________
Date : _______________________________
Signed : _________________________________________________________________
PIN * : _______________________________
* Please give me a “PIN” (a short string of numbers, letters, and symbols, that are not personally identifiable) that I can use to
anonymously post your scores. Try to choose a PIN unlikely to be chosen by another student (e.g. not “666” or “123456”) or rejected by me (I’ll leave this to your imagination). If you do not wish your scores to be posted, leave this blank.
1. What are the 3 primary evolutionary branches of life? (5 points)
Archaea, Bacteria & Eukarya
__A__ 2. Many purple non-sulfur Bacteria are photoheterotrophic, meaning that they ... (2 points)
___E__ 5. The ssu-rRNA is a good molecule for generating phylogenetic trees because ... (2 points)
A. get energy from light and carbon from organics
A. It is composed of many independently-evolving parts
B. get energy from light and carbon from CO2
B. it is present in and has the same function in all cells
C. get energy from organics and carbon from light
C. it contains both rapidly and slowly evolving regions
D. get energy from organics and carbon from CO2
D. horizontal transfer (almost) never occurs
E. none of the above
E. all of the above
__C__ 3. Which if the following is not a part of a biological
taxonomy? (2 points)
__B__ 6. RNAs that are too different to align on the basis of
sequence might be alignable based on ... (2 points)
A. A naming method
A. three-dimensional structure determination
B. A grouping system
B. secondary structure superimposition
C. A phylogenetic tree
C. evolutionary history
D. An identification scheme
D. BLAST or CLUSTAL
E. All of the above are included in a taxonomy
E. there would be no way to align these sequences
___D__ 4. The term prokaryote means ... (2 points)
A. bacterial
B. pre-eukaryotic
C. lacking a nucleus or nucleoid
D. not a eukaryote
E. all of the above
__E__ 7. Which of the following is not a method for generating
phylogenetic trees? (2 points)
A. maximum likelihood
B. neighbor-joining
C. parsimony
D. Fitch
E. all of these methods are used to generate trees
Page 1 of 8
8. Charles Darwin described how species evolve by ___diversification_________________, rather than by progression. (2 points)
9. Molecular phylogenetic trees are subjective/objective (circle one) and qualitative/quantitative (circle one). (2 points)
10. Describe in one or two sentences what an outgroup is, and what the purpose of including one in a tree is. (2 points)
An outgroup is a sequence/organism that is known not to be related to the phylogenetic group being treed. The purpose of
including an outgroup in a tree is to place the location of the root in the rest of the tree.
11. What is clock-like behavior of a molecular sequence? What are some of the properties of a sequence required for it to have
clock-like behavior? (6 points)
Clock-like behavior in a gene means the the extent of sequence divergence between sequences from different sources (species) is approximately proportional to how long since they diverged from one another.
Some of the properties required for clock-like behavior are:
- Functional constancy
- a long enough sequence to be statistically meaningful
- the gene product must be composed of many independently-evolving domains
Page 2 of 8
12. What is a substitution model? (4 points)
A substitution model is an attempt to describe mathematically how sequences change, and is used to estimate evolutionary
distance in a distance matrix from distance matrices or alignments.
13. How was the root of the “universal” phylogenetic tree determined? Include a drawing to show how this works. (4 points)
By generating trees from pairs of genes that duplicated before the last common ancestor and are still similar enough to align,
e.g. methionyl-tRNA init and elong, ATPase alpha and beta subunits, or EF-Tu and EF-G. Each half of the tree serves as the
outgroup for the other half - e.g. the EF-Tu part of the tree as outgroup for the EF-G sequences.
Page 3 of 8
14. What is the purpose of bootstrapping. How is it done? (6 points)
Bootstrapping is a method for evaluating the reliability of the branches of a tree.
In a bootstrap analysis, an alignment of length X is randomly sampled X times (one column at a time). Some columns of the
original alignment will be used more than once, some not at all. This randomly sampled alignment is then used to generate a
tree. A large number of these random-sample trees are generated, and the percentage of these trees that contain each
branch are tabulated. These values are usually shown on the tree, and values above 50% are considered generally reliable.
Page 4 of 8
15. Describe in detail one method used to determine phylogenetic relationships, or distinguish or identify organism, other than ssurRNA molecular phylogenetic analysis. (10 points)
We talked about several methods, and any of these would be appropriate:
•DNA:DNA hybridization
•DNA base composition
•serology
•lipid profiling
•rRNA spacer sequence analysis
•other gene phylogenetic analysis
•ribotyping
•RFLP
•phenotype
e.g for DNA:DNA hybridization:
DNA:DNA hybridization is a method that was commonly used in the past. The extent that the genomic DNAs of 2 species will
hybridize is a general measure of how much sequence similarity there is between the genomes, and therefore how closely
related they are. This method is widely used to define bacterial species - in general, two organisms are considered to be
the same species if the DNA:DNA hybridization is 70% or greater, or different species of the same genus if they have measurable hybridization less than 70%.
Page 5 of 8
16. Align the following sequences: (5 points)
Sequence A A C A G A C U U U G U G U A G A C A G A C U U U G U G U A G
Sequence B G C G A C U U C G G U A C G C ‐ G A C U U C G ‐ G U A C
Sequence C G C A G A U U U C G U G U A U G C A G A U U U C G U G U A U
Sequence D A C G A U U U C G G U A A C ‐ G A U U U C G ‐ G U A ‐
17. Create a similarity matrix from the following alignment, using a 3-parameter substitution model in which transversions count as
a full “difference”, a transitions count as a half a difference, and any base mis-matched with a gap counts as 2 differences. In
other words, a match is +1, a transversion is -1, a transition is -0.5, and any base:gap comparison is -2. (6 points)
A B C D
Sequence A A A G C U U G A U C A X X X X Sequence B C A G C U U G A U C B 0.9 X X X Sequence C U A G C U ‐ G A G C C 0.6 0.65 X X Sequence D C A G C U G G A U C D 0.8 0.9 0.65 X 18. Convert the following similarity matrix to a distance matrix using the Jukes & Cantor curve: (6 points)
10) (10 points) Convert the similarity matrix to a distance matrix using the Jukes & Cantor curve.
Similarity
Similarity
matrix matrix
Distance
matrix
Distance
matrix
A B C D A B C D
A
B
C
D
A
X
X
X
X
B
0.95
X
X
X
A
B
C
D
A
X
X
X
X
B
0.05
X
X
X
A X X X X A _X_ _X_ _X_ _X_
B 0.9 X X X B 0.12 _X_ _X_ _X_
C 0.8 0.3 X X C 0.23 1.65 _X_ _X_
C
0.75 0.80
X
X
C
0.30 0.25
X
X
D
0.50 0.60 0.45
X
D
0.75 0.55 0.90
X
D 0.6 0.5 0.4 X D 0.55 0.77 1.05 _X_
JUKES & CANTOR CURVE
1.0
0.9
Similarity
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0.0
0.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
1.1
1.2
1.3
1.4
1.5
1.6
1.7
1.8
1.9
2.0
Evolutionary distance
11) (10 points) Redraw the dendrogram below, to the same approximate scale, as a phylogram.
Page 6 of 8
C
B
D
D
19. Use the distance matrix to create a tree using the neighbor-joining method. SHOW YOUR WORK! Intermediate stages will be
graded. Be sure to label the lengths of all the branches. (15 points)
This was supposed to be 0.9! A
Distance matrix
A B C D E
A _X_ _X_ _X_ _X_ _X_ E B
B 0.8 _X_ _X_ _X_ _X_
C 0.5 0.7 _X_ _X_ _X_
D 0.5 0.7 0.6 _X_ _X_ D C
E 0.7 0.9 0.8 0.4 _X_
A B C D/E A A
A _X_ _X_ _X_ _X_
B 0.8 _X_ _X_ _X_ E B E C
C 0.5 0.7 _X_ _X_
D/E 0.6 0.8 0.7 _X_
D C D B
OK, so there’s a mistake in this table ‐ B/C was supposed to be 0.9 rather than 0.7, a mis‐
take created when I changed the problem at the last minute. No tree can exactly satisfy these distances. A real computer program would adjust branch lengths to minimize the total error over the tree, but of course it wasn’t my intent do give you this. This error only affects this problem of branch lengths, it does not affect solving the tree structure. As a result, the 7 points that went with these branch lengths (1 point for each length) will be given to all students, and only the process of solving the tree structure will be graded, for 8 points.
Page 7 of 8
20. Convert the tree from a dendrogram into a phenogram. Be sure to maintain the same branch lengths! (5 points)
E
B
C
C
E
ROOT
B
A
D
A
D
Use the following tree to answer questions 21 - 25: (2 points each)
Evolutionary distance
0.1
32
100
50
96
50
100
24
53
100 40
100
90
98
60
95
100
Rhodospirillum rubrum
Campylobacter hominis
Comamonas testosteroni
Neisseria gonorrhoeae
Vibrio cholerae
Yersinia pestis
100
Escherichia coli
100
Salmonella enterica
Desulfovibrio africanus
Chondromyces lanuginosus
Chlamydia trachomatis
Anabaena sp. PCC 7108
Streptomyces bikiniensis
Micrococcus luteus
Mycobacterium tuberculosis
Corynebacterium diphtheriae
Clostridium putrefaciens
Bacillus subtilis
Lactobacillus vaginalis
Methanocaldococcus jannaschii
21. What is the closest relative of Bacillus subtilis? ___L. vaginalis_________________________________________________________
22. What are the two closest relatives on this tree? _______E.coli______________ and _________ S.enterica_____________________
23. Circle the last common ancestor of Neisseria gonorrhoeae and Escherichia coli
24. Which sequence(s) is/are presumably the outgroup? _____M. jannaschii_________________________________________________
25. What is the approximate evolutionary distance between Streptomyces bikiniensis and Bacillus subtilis? _____0.2______________
Page 8 of 8
Download