Tree Thinking In the age of genomics, groups of organisms can be

advertisement
Tree Thinking
In the age of genomics, groups of organisms can be compared by their DNA and evolutionary relationships can be proposed based on the
similarity of these sequences.
Define the following in your own words:
1. Phylogeny:
2. Homoplasy:
3. Cladogram:
4. Parsimony:
A molecular clock uses changes in DNA sequences to infer the time since two species diverged. To set, or calibrate, a molecular clock, we
need to calculate how much DNA sequences from a common but neutral genomic region have diverged over a known period of time. Below
are sequences from an imaginary gene for three closely related organisms. Most genes consist of several thousand nucleotides, but for
demonstration purposes, we are only going to use some of the bases.
Calculate the percentage sequence divergence between all three sequences. First examine the sequences in question and count the
differences between them. Divide the difference by the total number of nucleotides, and then multiply by 100 to convert to a percent.
Human
Chimp
Baboon
DNA Sequence of Cyt-b (Mitochondrial DNA)
ACTAGCAACGGATACCATAGGTATATCTAGGCTACATTGT
ACTAGAAACGGTTACCACAGGTATATCAAGGCAACATTGT
ACTGGATGCGGATGCCTCAGATTTATCAAGGCTACAATGT
Human
Chimp
Baboon
DNA Sequence of Cyt-b (Mitochondrial DNA) continued
TAGCTTACCGCTAGTACTGGTGACTCTAGAATGCCTAGTC
TACCTTACCGCTAGTACTGGTGACTCTCGAATGCCTTGTC
TACCATAGCGATACTTGTGGTGACTGTCGATTGCCTAGTG
5. Human and Chimp:
6. Chimp and Baboon:
7. Baboon and Human:
The major assumption of a molecular clock is that sequence divergence increases over time in a linear manner as species diverge. This
means that species that diverged 4 million years ago should have twice the sequence divergence as species that diverged 2 million years
ago. If one knows when two species diverged (by using the dates of fossils, geological events, etc.), one can use this information to
calibrate a molecular clock for that particular region of the genome. That clock can then be applied to other species and used to
determine when other species diverged.
8.
Calculate the ratio of the sequence divergence to the known age for each species pair below. What is the % divergence per million years for
this particular gene?
Species-Pair
Sequence Divergence
Known Age
Human-Chimp
9.8%
5.5 million
Human-Gorilla
12.0%
7.0 million
Chimp-Gorilla
12.8%
7.0 million
9. If the sequence divergence between chimpanzees and bonobos at this gene is 4.2%, calculate the estimated age of the divergence between
chimps and bonobos.
10. The sequence divergence between modern humans and Neanderthals (based on DNA obtained from fossils) is 1.2%. How long ago did
modern humans and Neanderthals share a common ancestor?
11. Suppose that other data showed that the Neanderthal fossil was .35-.4 million years old. What are other possible reasons for the discrepancy
between this age and the age obtained by the molecular clock method?
12. Construct a cladogram showing the relationship between humans, chimps, bonobos, and gorillas using this sequence data.
13. Now construct a phylogram showing the relationship between humans, chimps, bonobos, and gorillas using this sequence data. How is it
different from your cladogram?
14. Do you think the common ancestor to humans, chimps, bonobos and gorillas walked upright on two feet (biped)? Use parsimony to explain.
15. Do you think the common ancestor to humans, chimps, bonobos and gorillas had body covered with hair? Use parsimony to explain.
16. The amino acid sequence for cyt-c gene is given for several species at the end of this handout.
How many differences are there:

Between humans and rhesus monkeys?

Between humans and kangaroos?

Between ducks and chickens?

Between chickens and turkeys?

Between Neurospora and baker’s yeast?
17. What do the molecular data suggest about the relationship between a screwworm fly and a silkworm moth compared to the relationship
between a human and a rhesus monkey?
18. Explain how the pig-cow-sheep relationship is possible given that they are classified as different species.
19. Neurospora and Baker’s Yeast are both fungi and humans and monkeys are both primates. What can you say about the divergence time
between Neurospora and Baker’s Yeast compared to humans and monkeys?
AMINO ACID SEQUENCES IN CYTOCHROME-C PROTEINS FROM 20 DIFFERENT SPECIES
Amino Acid Number -----> 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9
Human ............
- - - - - - - - G D V E K G K K I F I M K C S Q C H T V E K G G K H K T G P N L H G L F G R K T G Q A P G Y S Y T A A
Rhesus monkey.....
- - - - - - - - G D V E K G K K I F I M K C S Q C H T V E K G G K H K T G P N L H G L F G R K T G Q A P G Y S Y T A A
Horse ...........
- - - - - - - - G D V E K G K K I F V Q K C A Q C H T V E K G G K H K T G P N L H G L F G R K T G Q A P G F T Y T D A
Pig, cow, sheep...
- - - - - - - - G D V E K G K K I F V Q K C A Q C H T V E K G G K H K T G P N L H G L F G R K T G Q A P G F S Y T D A
Dog..............
- - - - - - - - G D V E K G K K I F V Q K C A Q C H T V E K G G K H K T G P N L H G L F G R K T G Q A P G F S Y T D A
Gray whale ......
- - - - - - - - G D V E K G K K I F V Q K C A Q C H T V E K G G K H K T G P N L H G L F G R K T G Q A V G F S Y T D A
Rabbit............
- - - - - - - - G D V E K G K K I F V Q K C A Q C H T V E K G G K H K T G P N L H G L F G R K T G Q A V G F S Y T D A
Kangaroo..........
- - - - - - - - G D V E K G K K I F V Q K C A Q C H T V E K G G K H K T G P N L N G I F G R K T G Q A P G F T Y T D A
Chicken, Turkey...
- - - - - - - - G D I E K G K K I F V Q K C S Q C H T V E K G G K H K T G P N L H G L F G R K T G Q A E G F S Y T D A
Penguin...........
- - - - - - - - G D I E K G K K I F V Q K C S O C H T V E K G G K H K T G P N L H G I F G R K T G Q A E G F S Y T D A
Pekin duck........
- - - - - - - - G D V E K G K K I F V Q K C S Q C H T V E K G G K H K T G P N L H G L F G R K T G Q A E G F S Y T D A
Snapping turtle...
- - - - - - - - G D V E K G K K I F V Q K C A Q C H T V E K G G K H K T G P N L N G L I G R K T G Q A E G F S Y T E A
Bullfrog..........
- - - - - - - - G D V E K G K K I F V Q K C A Q C H T C E K G G K H K V G P N L Y G L I G R K T G Q A A G F S Y T D A
Tuna..............
- - - - - - - - G D V A K G K K T F V Q K C A Q C H T V E N G G K H K V G P N L W G L F G R K T G Q A E G Y S Y T D A
Screwworm fly.....
- - - - G V P A G D V E K G K K I F V Q R C A Q C H T V E A G G K H K V G P N L H G L F G R K T G Q A A G F A Y T N A
Silkworm moth.....
- - - - G V P A G N A E N G K K I F V Q R C A Q C H T V E A G G K H K V G P N L H G F Y G R K T G Q A P G F S Y S N A
Wheat ............
A S F S E A P P G N P D A G A K I F K T K C A Q C H T V D A G A G H K Q G P N L H G L F G R Q S G T T A G Y S Y S A A
Fungus (Neurospora)
- - - - G F S A G D S K K G A N L F K T R C A E C H G E G G N L T Q K I G P A L H G L F G R K T G S V D G Y A Y T D A
Fungus(baker's yeast)
- - - T E F K A G S A K K G A T L F K T R C E L C H T V E K G G P H K V G P N L H G I F G R H S G Q A Q G Y S Y T D A
Fungus (Candida)
- - P A P F E 0 G S A K K G A T L F K T R C A E C H T I E A G G P H K V G P N L H G I F S R H S G Q A Q G Y S Y T D A
[CONTINUED FROM ABOVE]
Amino Acid Number
0
Human ............
N
Rhesus monkey.....
N
Horse ............
N
Pig, cow, sheep...
N
Dog...............
N
Gray whale........
N
Rabbit............
N
Kangaroo..........
N
Chicken, Turkey...
N
Penguin...........
N
Pekin duck........
N
Snapping turtle...
N
Bullfrog..........
N
Tuna..............
N
Screwworm fly.....
N
Silkworm moth.....
N
Wheat ............
N
Fungus 1 (Neurospora) N
Fungus 2 (bkrs yeast) N
Fungus 3 (Candida)
N
AMINO ACID SYMBOLS
A = Alanine
C = Cysteine
D = Aspartic acid
E = Glutamic acid
F = Phenylalanine
G = Glycine
1
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
I
K
2
N
N
N
N
N
N
N
N
N
N
N
N
N
S
A
A
N
Q
K
R
3
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
A
4
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
A
G
N
G
5
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
V
I
V
V
6
I
T
T
T
T
T
T
I
T
T
T
T
T
V
T
T
E
T
L
E
7
W
W
W
W
W
W
W
W
W
W
W
W
W
W
W
W
W
W
W
W
8
G
G
K
G
G
G
G
G
G
G
G
G
G
N
Q
G
E
D
D
A
9
E
E
E
E
E
E
E
E
E
E
E
E
E
N
D
D
E
E
E
E
0
D
D
E
E
E
E
D
D
D
D
D
E
D
D
D
D
N
N
N
P
1
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
N
T
2
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
M
M
3
M
M
M
M
M
M
M
M
M
M
M
M
M
M
F
F
Y
F
S
S
4
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
D
E
E
D
5
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
6
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
7
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
L
E
T
E
8
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
9
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
0
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
1
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
2
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
H
I
K
L
M
N
P
=
=
=
=
=
=
=
Histidine
Isoleucine
Lysine
Leucine
Methionine
Asparagine
Proline
3
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
4
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
5
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
6
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
T
7
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
8
M
M
M
M
M
M
M
M
M
M
M
M
M
M
M
M
M
M
M
M
9
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
V
V
A
A
A
0
F
F
F
F
F
F
F
F
F
F
F
F
F
F
F
F
F
F
F
F
1
V
V
A
A
A
A
A
A
A
A
A
A
A
A
A
A
P
G
G
G
2
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
G
3
I
I
I
I
I
I
I
I
I
I
I
I
I
I
L
L
L
L
L
L
4
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
5
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
6
K
K
K
K
T
K
K
K
K
K
K
K
K
K
P
A
P
D
E
A
7
E
E
T
G
G
G
D
G
S
S
S
A
G
G
N
N
Q
K
K
K
8
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
D
D
D
D
9
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
0
A
A
E
E
A
A
A
A
V
A
A
A
Q
Q
G
A
A
N
N
N
1
D
D
D
D
D
D
D
D
D
D
D
D
D
D
D
D
D
D
D
D
2
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
I
L
L
Q
R
S
T
V
W
Y
=
=
=
=
=
=
=
3
I
I
I
I
I
I
I
I
I
I
I
I
I
V
I
I
I
I
I
V
4
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
T
T
T
5
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
F
Y
Y
6
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
L
M
L
M
7
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
K
L
8
K
K
K
K
K
K
K
K
D
D
D
D
S
S
S
E
K
E
K
E
9
A
A
A
A
A
A
A
A
A
A
A
A
A
A
A
S
A
A
A
A
0
T
T
T
T
T
T
T
T
T
T
T
T
C
T
T
T
T
T
C
S
1
N
N
N
N
K
N
N
N
S
S
A
S
S
S
K
K
S
A
E
K
2
E
E
E
E
E
E
E
E
K
K
K
K
K
S
-
Glutamine
Arginine
Serine
Threonine
Valine
Tryptohpan
Tyrosine
Symbols in light blue or gray represent conserved amino acids (they show NO differences in any organism on the list), so you can ignore them.
(adapted from Strahler, Arthur, Science & Earth History, 1987, p. 348)
Download