Dry Lab – More Tree Analayses

advertisement
Staffan Bensch 2014-03-18
Dry Lab –Tree Analyses
The file “mammals.meg” contains cytochrome b sequences (1140 bp, almost full length) from
23 species of mammals. The goal of this DryLab is to show that obtained phylogenetic trees
are sensitive to taxon sampling, selection of outgroups and models of molecular evolution.
Neighbour-joining
Open the file mammals.meg in MEGA 5.
[Open a File/Session]



Nucleotide sequences" OK
"Protein-coding DNA" Yes
Select genetic code "Vertebrate mitochondrial" .

Make a neighbour joining tree

Choose the following settings (Kimura 2-parameter model / Complete deletions):

Now you will see a tree with all the names of the taxa. The “true” phylogeny of these
species is pretty much based on full mitochondrial genomes and several nuclear genes (see
Figure 1 at the end of this document). The data contains a clade of carnivores, a clade of
primates, a clade of rodents and two representatives of the African clade “Afrotheria”. In
the true phylogeny of primates, Homo should be a sister taxon to Chimpanzee, followed
1
Staffan Bensch 2014-03-18
by Gorilla, Orang-utan, Baboon, Howler Monkey and with lemurs being basal. Among the
carnivores, we should see the cats forming a separate clade, the seals clustering together
with polar bear and wolf outside. A particular phylogenetic problem has been the
placement of rodents relative primates and carnivores. The present consensus based on
many genes is that rodents are a sister group to primates, with carnivors outside.
Question 1: How does this first K2P tree differ from the expected “true tree”?
Documentation and saving trees. The most convenient way is to copy and paste trees into
PowerPoint. Open a blank document in PowerPoint. In the “Tree Explorer” window of
MEGA, select “Image/Copy to Clipboard”. Jump to PowerPoint and “Edit/Paste”.
Check the quality of the data and select OTUs (Operational Taxonomic Units)
Before continuing with more advanced analyses, it is best to inspect the quality of the data (go
to the window “Sequence Data Explorer” by selecting “Data/Explore Active Data”). Does the
alignment look to be OK?

Rooting. A tree will not show the phylogenetic relationships unless it is rooted. A more
adequate representation of an unrooted tree is by a radiation diagram. In the Tree Explorer
window click on the ikon “Tree/Branch Style/ Radiation”.

Go back to the default representation “Tree/Branch Style / Traditional / Rectangular”. To
root a tree, click on a branch in the Tree Explorer window (now highlighted green) and
then click on the ikon at the left hand side of the window with the green triangle. (Try out
different branches as roots).
Question 2a: How does the tree look like if you root it with homo?
b: Identify the possible outgroups for phylogenetic testing of the relationships
……………..between carnivores, primates and rodents
c: Rerun the trees with one outgroup at the time and compare the results
2
Staffan Bensch 2014-03-18

Bootstrap analysis is a way to evaluate the accuracy of a tree, or rather, how well the
obtained tree is supported by the data. Make a new neighbor-joining tree (as above) but
now change "Test of Phylogeny" to "Bootstrap method". Press “Compute”. Root the tree
with the outgroup.
Question 3. Which clades have high support? Are there any inconsistencies relative the “true
tree” that have good support?

Models of Molecular Evolution. Try for example the Tamura 3-parameter model with
rate variation between sites (see below for the settings). Try different values (high >2, low
<0.2) of the gamma-parameter.
Question 4. How do the trees differ (topology, support values, branch length)?
Maximum parsimony
One of the traditional and still very popular discrete method is Maximum Parsimony.
Go to “Phylogeny/ Test Maximum Parsimony Trees”. Use the default settings [(CNI (level=1)
with initial tree by Random addition (10 reps))] and select bootstrapping. Compare how much
longer it takes to do bootstraps with this method compared to neighbour joining. Root the tree
with the outgroup.
Question 5. How is the tree looking compared to the NJ-trees (branch length)?
Question 6. Which of the two trees look best compared to the true tree? How can this be
explained?
Character mapping. A nice feature with maximum parsimony is that the nucleotides can be
mapped directly on the tree. In the Tree Explorer window, click on the upper-right symbol
and choose “Show all”. Walk through the sequence by changing the “Site Index”
3
Staffan Bensch 2014-03-18
Maximum Likelihood
Go to “Phylogeny/ Test Maximum Likelihood Tree using the settings as below (DO NOT
BOOTSTRAP)
Question 7. Compare this tree with the best NJ tree and the true phylogeny.
Find the most appropriate model of molecular evolution for your data. Go to Model/
Find the best DNA / Protein Model (ML)
Question 8. Study the output and try to interpret the parameters
Go to “Phylogeny/ Test Maximum Likelihood Tree. Using the settings as were found to be
the most optimal model (DO NOT BOOTSTRAP)
4
Staffan Bensch 2014-03-18
Question 9. Are there any differences between this tree and the previous ML tree?
Now, make a final ML tree with bootstrapping (select 100 replications).
Question 10. Compare the bootstrap values with the NJ tree?
Amino Acids
Neighbour Joining, Maximum Parsimony and Maximum Likelihood trees can all be done
based on amino acid sequences (given the sequences are from a protein coding gene). You
can do this by changing the “Substitution type” to amino acids
Make amino acid trees using NJ and ML.
Question 11. Compare these trees with those done from DNA sequences
5
Staffan Bensch 2014-03-18
Figure 1. Consensus phylogenetic tree of mammalian orders. Species included in this exercise
are indicated on the right side of the figure
6
Staffan Bensch 2014-03-18
RAxML is a popular program for maximum likelihood tree constructions. It can be run on a
remote server from the webpage:
http://phylobench.vital-it.ch/raxml-bb/index.php
Open your data in MEGA / Explore Active Data and go to “Exporting Sequence data” and
change format to PHYLIP 3.0 (you may need to do some manual adjustments in the text file).
Save the text file, open it and copy and paste into the RAxML sequence box.
7
Download