Molecular Clock: Lineage-specific evolutionary rate

Slides:



Advertisements
Similar presentations
Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,
Advertisements

D3.7 Evidence for evol part III jackie. Biochemical evidence provided by the universality of DNA and protein structures for the common ancestry of living.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
An Introduction to Phylogenetics > Sequence 1 GAGGTAGTAATTAGATCCGAAA… > Sequence 2 GAGGTAGTAATTAGATCTGAAA… > Sequence 3 GAGGTAGTAATTAGATCTGTCA… Anton E.
The Evolutionary Basis of Bioinformatics: An Introduction to Phylogenetics > Sequence 1 GAGGTAGTAATTAGATCCGAAA… > Sequence.
Classification of Living Things. 2 Taxonomy: Distinguishing Species Distinguishing species on the basis of structure can be difficult  Members of the.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
Molecular Clock I. Evolutionary rate Xuhua Xia
Molecular Evolution Revised 29/12/06
Review of cladistic technique Shared derived (apomorphic) traits are useful in understanding evolutionary relationships Shared primitive (plesiomorphic)
HIV/AIDS as a Microcosm for the Study of Evolution.
Molecular Clocks, Base Substitutions, & Phylogenetic Distances.
Chapter 2 Opener How do we classify organisms?. Figure 2.1 Tracing the path of evolution to Homo sapiens from the universal ancestor of all life.
Classification and Phylogenies Taxonomic categories and taxa Inferring phylogenies –The similarity vs. shared derived character states –Homoplasy –Maximum.
Topic : Phylogenetic Reconstruction I. Systematics = Science of biological diversity. Systematics uses taxonomy to reflect phylogeny (evolutionary history).
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
Molecular Clock. Rate of evolution of DNA is constant over time and across lineages Resolve history of species –Timing of events –Relationship of species.
Generating Diversity: how genes and genomes evolve Erin “They call me Dr. Worm” Friedman 29 September 2005.
Chapter 26: Phylogeny and the Tree of Life Objectives 1.Identify how phylogenies show evolutionary relationships. 2.Phylogenies are inferred based homologies.
BINF6201/8201 Molecular phylogenetic methods
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
The Molecular Clock? By: T. Michael Dodson. Hypothesis For any given macromolecule (a protein or DNA sequence) the rate of evolution is approximately.
PHYLOGENETICS CONTINUED TESTS BY TUESDAY BECAUSE SOME PROBLEMS WITH SCANTRONS.
Molecular phylogenetics 4 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
Phylogeny GENE why is coalescent theory important for understanding phylogenetics (species trees)? coalescent theory lets us test our assumptions.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
Calculating branch lengths from distances. ABC A B C----- a b c.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
Molecular and Genomic Evolution Getting at the Gene Pool.
CHARACTERS USED IN RECONSTRUCTING PHYLOGENETIC TREES 1. Morphological “ Tiktaalik is the sister group of Acanthostega + Ichthyostega in one of the two.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Table 8.3 & Alberts Fig.1.38 EVOLUTION OF GENOMES C-value paradox: - in certain cases, lack of correlation between morphological complexity and genome.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
NEW TOPIC: MOLECULAR EVOLUTION.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
5.4 Cladistics The images above are both cladograms. They show the statistical similarities between species based on their DNA/RNA. The cladogram on the.
Molecular Clocks and Continued Research
Reconstructing and Using Phylogenies 16. Concept 16.1 All of Life Is Connected through Its Evolutionary History All of life is related through a common.
Section 2: Modern Systematics
Phylogeny and the Tree of Life
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
Evolutionary genomics can now be applied beyond ‘model’ organisms
Title: Different Types of Evolution
The neutral theory of molecular evolution
17.2 Classification based on evolutionary relationships
5.4 Cladistics Essential idea: The ancestry of groups of species can be deduced by comparing their base or amino acid sequences. The images above are both.
Pipelines for Computational Analysis (Bioinformatics)
In-Text Art, Ch. 16, p. 316 (1).
5.4 Cladistics.
Agenda 10/8 Seashell Sort Phylogeny Lecture Phylogenetics Pracice
Genome Projects Maps Human Genome Mapping Human Genome Sequencing
The process of speciation
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
Patterns in Evolution I. Phylogenetic
Molecular Clocks Rose Hoberman.
Molecular Evolution.
Chapter 26 Phylogeny and the Tree of Life
26.5 Molecular Clocks Help Track Evolutionary Time
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
Gene Density and Noncoding DNA
Evolutionary genetics
Chapter 19 Molecular Phylogenetics
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
Chapter 6 Clusters and Repeats.
Unit Genomic sequencing
But what if there is a large amount of homoplasy in the data?
Phylogeny and the Tree of Life
Presentation transcript:

Molecular Clock: Lineage-specific evolutionary rate Xuhua Xia xxia@uottawa.ca http://dambe.bio.uottawa.ca

Molecular clock hypothesis Within given gene (or DNA region), mutations (nt or aa sub) accumulate at an approximately equal rate in all evolutionary lineages Rate constancy concept Originally based on comparisons of protein sequences for hemoglobin, cytochrome c… from different organisms Information can be used to estimate divergence times, reconstruct phylogenies… BUT… Does it hold for all genes, all genomes… ? How to reconcile with irregular rate of morphological evolution? Xuhua Xia

Clock-like substitution rate Fig. 4.15 Linear relationship What does slope represent? # subs/site per unit time = rate Combined data for hemoglobins, cytochrome c & fibrinopeptide several apparent deviations: slowdown in rate after divergence of ape/monkey lineages & acceleration after split of horse/donkey lineages What would figure look like without combining genes

Different rates among genes Linear relationship between # substitutions and geological time Rates of aa substitution vary among proteins … so slopes will differ Alberts Fig. 5.1 Xuhua Xia

Relative-rate test To compare rates in lineages A and B, use C as reference species If constant rate, then “distance” from outgroup to each member within group should be equal KAC = KOA + KOC (1) KBC = KOB + KOC (2) Fig. 4.16 KAB = KOA + KOB (3) So KOA = (KAC + KAB - KBC ) / 2 KOB = (KAB + KBC - KAC ) / 2 KOC = (KAC + KBC - KAB ) / 2 Xuhua Xia

Relative-rate test Then according to molecular clock hypothesis: KOA = KOB so KOA – KOB = 0 and from equations (1) and (2) KOA – KOB = KAC – KBC Fig. 4.16 Can compare rates of substitution in lineages A and B directly from KAC and KBC The importance of chooing a good outgrout: a long OC could obscure the difference between OA and OB Xuhua Xia

Rate difference Equal rates in lineages Slower rate in B lineage leading to A and B Slower rate in B lineage KAC > KBC KAC = KBC Xuhua Xia

Relative-rate test K’AC = KOA + KOC + AC K’BC = KOB + KOC + BC O Critical assumption: KAC, KBC and KAB are estimated without error. K’AC = KOA + KOC + AC K’BC = KOB + KOC + BC K’AB = KOA + KOB + AB O A C KOA – KOB = K’AC – K’BC + AC - BC B The longer the branch, the more substitution saturation and the more difficult to arrive at an accurate estimate. Another reason for chooing a good outgrout wisely: a long OC could obscure the difference between OA and OB Xuhua Xia

How do you interpret the data shown in this table? Tip: Title of this section in text: “Nearly equal rates in mice and rats” Xuhua Xia

Sub. rates between rodent and human P. 149: "The number of gaps was also higher in humans (44 gaps) than in rodents (31 gaps)" Switch "humans" and "rodents" Nr = number aa positions where human vs. rat different but human vs. chicken identical so replacement in rodent lineage Nr = 600 Nh = number of aa positions where human vs. rat different but rat vs. chicken identical Growth hormone is one of the exceptions. It evolves faster in human than in rodents. So replacement in human lineage Nh = 416 Tip: p.148-150 section is called “Higher rates in rodents than in primates” The reason for using AA: lower mutiple hits than nucleotide sequence, essential for inferring Nr and Nh

Beta hemoglobin gene cluster Adult: 22(HbA) 22(HbA-2) Fatal: A136 22(HbF1) G136 22 (HbF2) Embryonic: 22 (Hb Gower I) 22 (Hb Gower II) Xuhua Xia

Can use duplicated genes to test if rates are constant (Table 4.13) Human b-globin genes Mouse b-globin genes embryonic fetal adult embryonic fetal adult Score numbers of nt subs per syn site (KS) and per non-syn site (KA) between duplicated genes in humans & rodents How do you interpret these data? “Duplicate copies in mouse are all more divergent than [counterpart copies] in humans” Cautionary notes: (1) there may be gene conversion events (“copy correction”) between sequences in multi-gene families (Topic 11), (2) the mouse genes may be in genome regions with high mutation rate “Higher rates in rodents than in primates” p.148-50 Xuhua Xia

Causes of rate differences (p.152) Mutation rates Generation time Metabolic rate (e.g., high aerobic respiration leads to mutagenic effects of oxygen free radicals) DNA repair Gene conversion in multi-gene families Substitution rates: population size genetic variation purifying selection or positive selection Different genomic background (known different rates at different genomic regions) Xuhua Xia

Sub. rate, generation time, metabolic rate Martin PNAS 1993 Xuhua Xia

Rate difference between nuc and mt DNA For mammalian mitochondrial genes, Ks ~ 5.7 x 10 -8 sub/ site/ year ~ 10 x higher than for mammalian nuclear genes Mitochondrial DNA used extensively in taxonomic, forensic, conservation biology,… studies But.. in plants, mitochondrial nt sub rate very slow… This explains why the maximum parsimony method is used widely by plant phylogeneticists but rarely by animal phylogeneticists

Relative rates for nuclear, chloroplast and mitochondrial genes in plant cells? In plants, mitochondrial rates of nt sub are much slower than nuclear (or chloroplast) … whereas mammalian mito rate ~ 10x faster than nuclear L = # sites

Positive selection? Fig. 4.19 Tree based on growth hormone genes, with branch length proportional to the number of nucleotide substitutions, and number of AA replacement shown under branch Didn't we just conclude that rodents evolve faster than humans? Phase Rate of AA replacement KA/Ks Slow phase 0.3±0.1 0.03 Ruminant rapid phase 5.6±1.4 0.30 Primate rapid phase 10.8±1.3 0.49 Fig. 4.19 Xuhua Xia

RNA viruses and retroviruses - very rapid rate of evolution (Table 4.17) HIV retrovirus ~ 10 6 x higher than mammalian nuclear genes - error prone reverse transcription (RT) - sequences may be useful in retracing spread through population Xuhua Xia

Evolution of HIV population within an individual patient The rapid evolution makes it possible to do tip-dating. - HIV virions harvested (blue vertical lines) at various times & sequenced “Each blue tick represents a virion sampled from the patient during the course of the infection; its horizontal position indicates when it was sampled and its vertical position indicates how genetically different it was from the first sample”. Freeman & Herron Fig. 1.10 Xuhua Xia

Who brought HIV-1 to America? Gilbert, M. T. et al. The emergence of HIV/AIDS in the Americas and beyond. Proc Natl Acad Sci U S A 104, 18566-70 (2007). Xuhua Xia

Who brought HIV-1 to America? Gilbert, M. T. et al. The emergence of HIV/AIDS in the Americas and beyond. Proc Natl Acad Sci U S A 104, 18566-70 (2007). Xuhua Xia