1) Taxonomy: Classification and naming of organisms a.Hierarchical nomenclature with taxonomic categories (kingdom, phylum, class, order, family, genus,

Slides:



Advertisements
Similar presentations
Estimating and using phylogenies
Advertisements

LG 4 Outline Evolutionary Relationships and Classification
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
LEQ: How do biologist organize living things?
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
Phylogenetic Trees Systematics, the scientific study of the diversity of organisms, reveals the evolutionary relationships between organisms. Taxonomy,
Phylogeny and Systematics
BIO2093 – Phylogenetics Darren Soanes Phylogeny I.
Reconstructing and Using Phylogenies
Fig Copyright © 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings Overview: Investigating the Tree of Life Phylogeny is the.
Classification of Living Things. 2 Taxonomy: Distinguishing Species Distinguishing species on the basis of structure can be difficult  Members of the.
PHYLOGENY AND SYSTEMATICS
Chapter 26 – Phylogeny & the Tree of Life
Maximum Likelihood. Likelihood The likelihood is the probability of the data given the model.
Molecular Evolution Revised 29/12/06
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
Review of cladistic technique Shared derived (apomorphic) traits are useful in understanding evolutionary relationships Shared primitive (plesiomorphic)
Classification and Phylogenies Taxonomic categories and taxa Inferring phylogenies –The similarity vs. shared derived character states –Homoplasy –Maximum.
Topic : Phylogenetic Reconstruction I. Systematics = Science of biological diversity. Systematics uses taxonomy to reflect phylogeny (evolutionary history).
Phylogeny & The Tree of Life. Phylogeny  The evolutionary history of a species or group of species.
What Is Phylogeny? The evolutionary history of a group.
Terminology of phylogenetic trees
Molecular phylogenetics
Classification and Systematics Tracing phylogeny is one of the main goals of systematics, the study of biological diversity in an evolutionary context.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Underlying Principles of Zoology Laws of physics and chemistry apply. Principles of genetics and evolution important. What is learned from one animal group.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Warm-Up 1.Contrast adaptive radiation vs. convergent evolution? Give an example of each. 2.What is the correct sequence from the most comprehensive to.
Systematics and the Phylogenetic Revolution Chapter 23.
Lecture 2: Principles of Phylogenetics
GENE 3000 Fall 2013 slides wiki. wiki. wiki.
Parsimony is Computationally Intensive
Phylogenies Reconstructing the Past. The field of systematics Studies –the mechanisms of evolution evolutionary agents –the process of evolution speciation.
PHYLOGENY AND THE TREE OF LIFE CH 26. I. Phylogenies show evolutionary relationships A. Binomial nomenclature: – Genus + species name Homo sapiens.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
{ Early Earth and the Origin of Life Chapter 15.  The Earth formed 4.6 billion years ago  Earliest evidence for life on Earth  Comes from 3.5 billion-year-old.
Systematics and Phylogenetics Ch. 23.1, 23.2, 23.4, 23.5, and 23.7.
PHYOGENY & THE Tree of life Represent traits that are either derived or lost due to evolution.
Ch. 26 Phylogeny and the Tree of Life. Opening Discussion: Is this basic “tree of life” a fact? If so, why? If not, what is it?
Phylogeny and Taxonomy. Phylogeny and Systematics The evolutionary history of a species or related species Reconstructing phylogeny is done using evidence.
Taxonomy & Phylogeny. B-5.6 Summarize ways that scientists use data from a variety of sources to investigate and critically analyze aspects of evolutionary.
Phylogeny & Systematics The study of the diversity and relationships among organisms.
Section 2: Modern Systematics
Phylogeny and the Tree of Life
Systematics: Tree of Life
Section 2: Modern Systematics
5.4 Cladistics.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Phylogeny and the Tree of Life
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Systematics: Tree of Life
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Parsimony is Computationally Intensive
Chapter 25 Phylogeny and the Tree of Life
Systematics: Tree of Life
Time morphospace. time morphospace time morphospace.
Phylogeny and the Tree of Life
Phylogeny and the Tree of Life
Phylogeny and the Tree of Life
Phylogeny and Systematics
18.2 Modern Systematics I. Traditional Systematics
Chapter 20 Phylogenetic Trees. Chapter 20 Phylogenetic Trees.
Chapter 19 Molecular Phylogenetics
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Molecular data assisted morphological analyses
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
But what if there is a large amount of homoplasy in the data?
Chapter 18: Evolution and Origin of Species
Presentation transcript:

1) Taxonomy: Classification and naming of organisms a.Hierarchical nomenclature with taxonomic categories (kingdom, phylum, class, order, family, genus, and species) 2) Phylogenetic analysis: The study of evolutionary relationships among species a.Under common decent, hierarchical classification reflects true genealogical relationships Tree of Life Systematics:

Phylogenetic Terms Species 4 Ancestor 1 Ancestor 2 Ancestor 3 e1e1 d1d1 c1c1 f1f1 g1g1 i1i1 h1h1 j1j1 a1a1 b1b1 monophyletic group: set of species that share a common ancestor. synapomorphy: shared derived character state. autapomorphy: uniquely derived character state. Species 3Species 2 Species 1

Phylogenetic Analysis Species 1 a b c d e f g h i j Species 2 a b c d e f g h i j Species 3 a b c d e f g h i j Species 4 a b c d e f g h i j Ancestor 1 a b c d e f g h i j Ancestor 2 (node) Ancestor 3 (node) e1e1 d1d1 c1c1 f1f1 g1g1 i1i1 h1h1 j1j1 a1a1 b1b shared characters shared derived characters

The identification of synapomorphies help define nested series of monophyletic groups.

Phylogenetic Analysis Species 1 a b c d e f g h i j Species 2 a b c d e f g h i j Species 3 a b c d e f g h i j Species 4 a b c d e f g h i j Ancestor 1 Ancestor 2 Ancestor 3 h1h1 g1g1 c1c1 f1f1 g1g1 i1i1 h1h1 j1j1 a1a1 b1b1 j0j0 e1e1 d1d shared characters shared derived characters

Phylogenetic Terms Species 1 Species 2 Species 3 Species 4 Ancestor 1 Ancestor 2 Ancestor 3 h1h1 g1g1 c1c1 f1f1 g1g1 i1i1 h1h1 j1j1 a1a1 b1b1 j0j0 e1e1 d1d1 homoplasy: when two species share a derived character state because of convergent evolution or evolutionary reversal, but not because of common descent. convergent evolution: independent evolution of a derived character state in two or more taxa.

Convergence: Shared derived similarities, that are not based on common origin (i.e. homology ), but on an independent origin in different taxa. Types of Homoplasy Reversal : The secondary presence of an apparently ”ancestral” character state. Example: Wings in insects, birds, and bats Example: Aquatic mode of life for fish, terrestriality for tetrapods, reversal to aquatic life in whales

Homoplasy: Common in DNA sequence data. Each nucleotide position defines a separate character

Homoplasy - independent evolution Human Lizard Frog Dog TAIL (adult) absent present Loss of tails evolved independently in humans and frogs - there are two steps on the true tree

Homoplasy: Misleading evidence of phylogeny If misinterpreted as a synapomorphy, the absence of tails would be evidence for a wrong tree: grouping humans with frogs and lizards with dogs Human Frog Lizard Dog TAIL absent present

Homoplasy: Reversal Reversals are evolutionary changes back to an ancestral condition As with any homoplasy, reversals can provide misleading evidence of relationships True tree Wrong tree

How do we sort out phylogeny from a mixture of signal (synapomorphies) and noise (homoplasy). Cladistic methodology (Willi Hennig) utilizes the principle of parsimony. Parsimony= The tree that requires the fewest number of evolutionary changes or steps to explain the data is preferred. So how do we construct trees with a sample of homologous characters?

Tree Reconstruction with Parsimony

Tree 1Tree 2Tree 3 Nucleotide substitution = evolutionary step

Tree Reconstruction with Parsimony

Tree 1Tree 2Tree 3 Character 2 I(A) II(G) III(A) IV(G) I(A)II(G) III(A)IV(G) I(A)II(G) III(A)IV(G) I(A) II(G) III(A) IV(G) or I(A)II(G) III(A)IV(G)

What to do when some characters tell you one thing and others tell you something else (Homoplasy)? The most parsimonious pattern of character change is noted for each character separately, for each tree. The number of changes is summed across characters for each tree. The preferred tree is the one that implies the fewest overall character changes. Parsimony with Multiple Characters 1 2 3

Tree 1 is favored under the criterion of Parsimony Tree 1Tree 2 Tree 3

Parsimony Advantages: –Simple method - easily understood operation –Does not seem to depend on an explicit model of evolution –Should give reliable results if the data are well structured and homoplasy is either rare or widely (randomly) distributed on the tree Disadvantages: –Doesn’t always provide the best estimate of phylogeny Maximum likelihood Bayesian analysis

Parsimony is Computationally Intensive The number of possible trees increases exponentially with the number of species, making exhaustive searches impractical for many data sets. Need to utilize a way to search for the best tree without evaluating all possible trees. Tree bisection and reconnection

What if there is a large amount of homoplasy in the data? Sequence data may have multiple, “hidden” substitutions. Use a model of evolution to correct for different rates of substitutions or unequal base frequencies or other parameters. Maximum-likelihood phylogenetic analysis L = P (D  T, M) Plot of base pair differences between pairs of mammalian species for a representative gene. C A A Seq 1 AGCGAG Seq 2 GCGGAC Seq 1C A Seq 2C T C A

Example: Model of sequence evolution Simplest Model = Jukes- Cantor - Assumes all substitutions are equally likely (        AG C T K = ln ( ) p = 0.27 Total expected = 0.27 x 30 = 8.24 Example: What is the total number of substitutions? AGGTCG CATTGC CCCGAT CTCTTG ATCGGG AGATCG CAACGC CCGGAC TTCTTA ATCGGG total observed = 7 ; p = 7/30 = 0.23 Expected Difference Observed Difference Sequence Difference Time Correction

Phylogenetic Inference Using Maximum Likelihood Model of sequence evolution and the estimation of its parameters allows the placement of probabilities on different types of substitutional change. Likelihood analysis focuses on the data, not the tree. It is the Probability of the Data given a Tree and a Model of evolution. Seq 1ATATC Seq 2CTAGC The Likelihood (i.e. the probability of observing the data) is a sum over all possible assignments of nucleotides to the internal nodes L = P (D  T, M)

Calculate the Likelihood for each base position in the sequence and summarizes across all base positions. The ML tree is the tree that produces the highest likelihood. Evaluates the branching structure of the tree, and also the branch length, using similar tree-searching strategies as used in parsimony analysis. –This is important, because by using a model-based approach, mutational change is more probable along longer branches than on shorter branches. Can be extremely computationally intensive. Phylogenetic Inference Using Maximum Likelihood

Important point about ML: The model you choose to use can have a large impact on the resulting ML tree. If you flip a coin and get a head, what is its likelihood? –If it’s a 2 sided and fair coin (your model), the likelihood is 0.5 –If it’s a two-headed coin (your model), the likelihood is 1.0

Assessing the Robustness Of Trees We can use a number of methods to assess the robustness of particular branches in our trees –Bootstrapping (Jackknifing, Decay-Index) Bootstrapping: Multiple new data sets are made by re-sampling from the original data set. –Bootstrapping: Sampling done with replacement The resampled data sets are subjected to phylogenetic analysis. The proportion of times a clade appears in the trees across all replicate data sets is called its bootstrap proportion.

Taken from Baldauf, S. L. Phylogeny for the faint of heart: a tutorial. Trends in Genetics 19:

Bootstrapping Clades that receive a high bootstrap are considered to be more supported by the data than clades with a lower bootstrap. –70% or greater is good, but many phylogeneticists will only consider branches with ≥90% as being strongly supported. Bootstrap Can perform with any type of phylogenetic analysis: parsimony, ML, distance-based Important to emphasize that a bootstrap does not reveal the probability that a particular clade is true, but only how well it is supported by the particular dataset.

Molecular Clocks The mutation rate for some genes may be relatively constant across species. This idea is based on neutral theory (this will be introduced later in the course) - nucleotide or amino acid substitutions occur at a rate equal to the mutation rate. Generally in applying a molecular clock, you assume that the mutation rate for a gene does not differ among species.

Molecular Clocks Outgroup Species 1 Species 2 Species 3 Species 4 1) Construct A Tree Fossil for Species 4 ~1 MY Outgroup Species 1 Species 2 Species 3 Species 4 You know that the most recent possible divergence between 3 and 4 is at least 1 MY Species 3 Species 4 2) Date a Node in the Tree } 2% Sequence Divergence 3) Calculate Divergence 4) Calculate a Rate R= 2%/1MY

Molecular Clocks Outgroup Species 1 Species 2 Species 3 Species 4 5) Apply Rate to Other Nodes in Tree 1MY 2MY 5MY Best applied when dates available for multiple nodes. Can utilize solid geological information as well as fossil information. Must be aware of possible non-clock behavior of genes.

Phylogeny of North American Black Basses Near et al., Evolution 57:1610–1621. Previous hypothesis that speciation within the genus Micropterus occurred during the Pleistocene. Micropterus has a very good fossil record. Calibration of a molecular clock and calculation of divergence times among species reveals that most species diverged well before the Pleistocene

Bayesian Inference of Phylogeny Pr[Tree | Data] = Pr[Data | Tree] x Pr [Tree] Pr[Data] Generates a posterior probability distribution of trees The tree with the highest posterior probability provides the best estimate of phylogeny

Species Delimitation in Rapidly Radiating Systems Accumulation of species diversity over short periods of time. Adaptive radiations Often of very recent origin Difficult to resolve monophyletic species-level lineages. Salzburger, W. and A. Meyer Naturwissenschaften 91:

Gerald and Buff Corsi © California Academy of Sciences Shaffer & McKnight 1996 Evolution 50: A. californiense Ambystoma tigrinum species complex

Species Delimitation in Rapidly Radiating Systems Lineage sorting and the retention of ancestral alleles or allelic lineages (Species trees vs gene trees)

Species Delimitation in Rapidly Radiating Systems Lineage sorting and the retention of ancestral alleles or allelic lineages Darwin’s Finches East African Cichlid Fish Takahashi et al Mol. Biol. Evol. 18: Moran and Kornfield Mol. Biol. Evol. 10: Sato et al PNAS. 96:

Species Delimitation in Rapidly Radiating Systems Limited reproductive isolation leads to hybridization and introgression

Gerald and Buff Corsi © California Academy of Sciences Shaffer & McKnight 1996 Evolution 50: A. californiense Ambystoma tigrinum species complex

An early study found that A. ordinarium was not a monophyletic group (when using mtDNA as the source of characters).

Indeed, more data shows extensive mtDNA non- monophyly with respect to A. ordinarium.

Polyphyly: A. ordinarium does not form a monophyletic group. Paraphyly: A. ordinarium does form a monophyletic group but other species should also be included in this group, based on the character that is used to reconstruct relationships.

4 genes yield A. ordinarium monophyly. 3 genes yield A. ordinarium paraphyly. (2 are nearly monophyletic.) 1 gene yields A. ordinarium polyphyly. Nuclear Genes Summary Nuclear data strongly suggests that A. ordinarium is a monophyletic lineage.

Short Internal BranchesPhylogenetic Discordance Among Loci Poe, S., and A. L. Chubb Syst. Biol. 58: Signatures of Rapid Lineage Diversification

Shared and minimally divergent mtDNA haplotypes strongly indicate recent hybrid introgression. A. dumerilii