1 Dan Graur Molecular Phylogenetics. 2 Objectives of molecular phylogenetics Reconstruct the correct evolutionary relationships among biological entities.

Slides:



Advertisements
Similar presentations
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Advertisements

Jean-Baptiste Lamarck sketchy diagram for animals in 1809.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
Terminology of Phylogenetic Trees
Introduction Classification Phylogeny Cladograms Quiz
Reading Phylogenetic Trees Gloria Rendon NCSA November, 2008.
Introduction to Phylogenies
Lecture 4: Phylogeny and the Tree of Life Campbell: Chapter 26
GENE TREES Abhita Chugh. Phylogenetic tree Evolutionary tree showing the relationship among various entities that are believed to have a common ancestor.
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
Phylogenetics - Distance-Based Methods CIS 667 March 11, 2204.
Phylogenetic Trees - I.
BIO2093 – Phylogenetics Darren Soanes Phylogeny I.
Nomenclature is the science of naming organisms Evolution has created an enormous diversity, so how do we deal with it? Names allow us to talk about groups.
Phylogenetic reconstruction
Reconstructing and Using Phylogenies
Reading Phylogenetic Trees
Phylogeny and Systematics
Molecular Evolution Revised 29/12/06
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
Phylogeny Reconstruction II. The edges of tree can be freely rotated without changing the relationships among the terminal nodes. Trees are like mobiles.
Phylogenetic Concepts. Phylogenetic Relationships Phylogenetic relationships exist between lineages (e.g. species, genes) These include ancestor-descendent.
Branches, splits, bipartitions In a rooted tree: clades (for urooted trees sometimes the term clann is used) Mono-, Para-, polyphyletic groups, cladists.
Cenancestor (aka LUCA or MRCA) can be placed using the echo remaining from the early expansion of the genetic code. reflects only a single cellular component.
Phylogenetic trees Sushmita Roy BMI/CS 576
CS 177 Phylogenetics I Taxonomy and phylogenetics Phylogenetic trees Cladistic versus phenetic analyses Model of sequence evolution Phylogenetic trees.
Phylogenetics Phylogenetic trees illustrate the evolutionary relationships among groups of organisms, or among a family of related nucleic acid or protein.
Terminology of phylogenetic trees
Molecular phylogenetics
Molecular Systematics
1 Dan Graur Molecular Phylogenetics Molecular phylogenetic approaches: 1. distance-matrix (based on distance measures) 2. character-state.
BINF6201/8201 Molecular phylogenetic methods
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Jargon Brian O’Meara EEB464 Fall From BBC Life of Birds Channel.
Systematics and the Phylogenetic Revolution Chapter 23.
Evolutionary Biology Concepts Molecular Evolution Phylogenetic Inference BIO520 BioinformaticsJim Lund Reading: Ch7.
Introduction to Phylogenetics
Reading Phylogenetic Trees
What is a synapomorphy?. Terms systematics [taxonomy, phylogenetics] phylogeny/phylogenetic tree cladogram tips, branches, nodes homology apomorphy synapomorhy.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Phylogenies Reconstructing the Past. The field of systematics Studies –the mechanisms of evolution evolutionary agents –the process of evolution speciation.
Phylogeny & the Tree of Life
Classification. Cell Types Cells come in all types of shapes and sizes. Cell Membrane – cells are surrounded by a thin flexible layer Also known as a.
Classification and Phylogenetic Relationships
Cenancestor (aka LUCA or MRCA) can be placed using the echo remaining from the early expansion of the genetic code. reflects only a single cellular component.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Systematics and Phylogenetics Ch. 23.1, 23.2, 23.4, 23.5, and 23.7.
5.4 Cladistics The images above are both cladograms. They show the statistical similarities between species based on their DNA/RNA. The cladogram on the.
Chapter 26 Phylogeny and the Tree of Life
Lesson Overview Lesson Overview Modern Evolutionary Classification 18.2.
Tree Terminologies. Phylogenetic Tree - phylogenetic relationships are normally displayed in a tree-like diagram (phylogenetic tree/cladogram) - a cladogram.
Phylogeny & the Tree of Life
Phylogenic trees..
Phylogenetics
Cladistics (Ch. 22) Based on phylogenetics – an inferred reconstruction of evolutionary history.
The Tree of Life Phylogeny.
IB290 SEM 465 Topics in Phylogenetics
The Ribosomal “Tree of Life”
The Tree of Life Phylogeny.
Phylogeny and the Tree of Life
Reading Phylogenetic Trees
Phylogenetics Chapter 26.
Phylogenetic Trees Jasmin sutkovic.
Chapter 26 Phylogeny and the Tree of Life
Chapter 20 Phylogeny and the Tree of Life
The Ribosomal “Tree of Life”
1 2 Biology Warm Up Day 6 Turn phones in the baskets
Evolution Biology Mrs. Johnson.
Presentation transcript:

1 Dan Graur Molecular Phylogenetics

2 Objectives of molecular phylogenetics Reconstruct the correct evolutionary relationships among biological entities Estimate the time of divergence between biological entities Chronicle the sequence of events along evolutionary lineages

3 Evolutionary relationships are illustrated by means of a phylogenetic tree or a dendogram.

4 Ernst Heinrich Haeckel

5 July 1837 July 2007

6 November 1859

7 The routes of inheritance represent the passage of genes from parents to offspring, and the branching pattern depicts a gene tree.

8 Different genes, however, may have different evolutionary histories, i.e., different routes of inheritance.

9 The routes of inheritance are confined by reproductive barriers, i.e., gene flow occurs only within a species. A species tree is a representation of splitting of species lineages.

10 Terminology

11 A phylogenetic tree or dendrogram is a graph composed of nodes and branches, in which only one branch connects any two adjacent nodes.

12 Internal External or Peripheral Branch

13

14 Assumptions: Bifurcation = Real speciation event Multifurcation = Lack of resolution

15 Binary tree

16 Rooted and unrooted trees

17 How many unrooted topologies are here? a b c d e a e c d b a b c e d b a c d e 43 21

18 central branch In an unrooted tree with four external nodes, the internal branch is referred to as the central branch.

19 Cladograms & Phylograms (collectively Dendograms) Bacterium 1 Bacterium 3 Bacterium 2 Eukaryote 1 Eukaryote 4 Eukaryote 3 Eukaryote 2 Bacterium 1 Bacterium 3 Bacterium 2 Eukaryote 1 Eukaryote 4 Eukaryote 3 Eukaryote 2 Phylograms show branch order and branch lengths Cladograms show branching order - branch lengths are meaningless

20 Unscaled phylogram Scaled phylogram

21

23 The Newick format In computer programs, trees are represented in a linear form by a string of nested parentheses, enclosing taxon names (and possibly also branch lengths and bootstrap values), and separated by commas. This type of representation is called the Newick format. The originator of this format in mathematics was Arthur Cayley.

24 The Newick format The Newick format for phylogenetic trees was adopted on June 26, 1986 at an informal meeting at Newick's Lobster House in Dover, New Hampshire. The Newick format currently serves as the de facto standard for representing phylogenetic tree and is employed by almost all phylogenetic software tools. Unfortunately, it has never been described in a formal publication; the first time it is mentioned in a publication is in 1992.

25 The Newick format In the Newick format, the pattern of the parentheses indicates the topology of the tree by having each pair of parentheses enclose all members of a monophyletic group. A phylogenetic tree in the Newick format always ends in a semicolon (;). ;

26 The Newick format One can use the Newick format to write down rooted trees, unrooted trees, multifurcations, branch lengths, and bootstrap values.

27 3 OTUs 1 unrooted tree = 3 rooted trees

28 4 OTUs 3 unrooted trees = 15 rooted trees

29 The number of possible bifurcating rooted trees (N R ) for n  2  OTUs The number of possible bifurcating unrooted trees (N U ) for n  3  OTUs

30  Number of OTUs Number of possible rooted tree  , ,135 92,027, ,459, ,458,046,676, ,200,794,532,637,891,559,375 

31 Evolution is an historical process. Only one historical narrative is true. From 8,200,794,532,637,891,559,375 possibilities, 1 possibility is true and 8,200,794,532,637,891,559,374 are false. Truth is one, falsehoods are many.

32 8,200,794,532,637,891,559,375 How do we know which of the 8,200,794,532,637,891,559,375 trees is true?

33 We don’t, we infer by using decision criteria.

34 True and inferred trees The sequence of speciation events that has led to the formation of a group of OTUs is historically unique. A tree representing the true evolutionary history is called the true tree. A tree that is obtained by using a certain set of data and a certain method of tree reconstruction is called an inferred tree. An inferred tree may or may NOT be the true tree.

35 ancestor descendant 1descendant 2 Cladogenesis Cladogenesis = the splitting of an evolutionary lineage into two genetically independent lineages.

36 ancestor descendant 1descendant 2 Anagenesis Anagenesis = changes occurring along an evolutionary lineage.

37 In molecular phylogenetics, we assume that species are only created by cladogenesis.

38 A gene tree may differ from a species tree

39 Gene trees and species trees It is often assumed that gene trees always equal species trees. This may be not be true. a b c A B D Gene tree Species tree

40 Orthologs and paralogs a A* b* cBC* Ancestral gene Duplication yields 2 copies (paralogs) on the same genome orthologous paralogous A*C*b* A mixture of orthologs and paralogs is sampled

41

42 Homo sapiensLepidoptera herbs A taxon is a species or a group of species that has been given a name, e.g., Homo sapiens (modern humans), or Lepidoptera (butterflies), or herbs. There are codes of biological nomenclature which seek to ensure that every taxon has a single and stable name, and that every name is used for only one taxon. Taxon (singular); Taxa (plural)

43 Strictly: A clade is a group of all the taxa that have been derived from a common ancestor plus the common ancestor itself. In molecular phylogenetics: A clade is a group of taxa under study that share a common ancestor, which is not shared by any other species outside the group. Clades* *also: monophyletic groups, natural clades

44 A taxon whose common ancestor is shared by any other taxon is called a paraphyletic taxon or an invalid taxon. Paraphyletic Taxa Reptiles are paraphyletic. 44

45 A named taxon that lacks phylogenetic validity, but is nonetheless used, is called a convenience taxon. “a convenience fish” Fish (Pisces)

46 If a clade is composed of two taxa, these are referred to as sister taxa. Sister Taxa Birds and crocodiles are sister taxa.

47 Phenotypic distance = clades

48 Which of the following groups are not monophyletic? E. coli rat mouse baboon chimp human a. human, chimpanzee, baboon b. mouse, chimpanzee, baboon c. rat, mouse d. human, chimpanzee, baboon, rat, mouse e. E. coli, human, chimpanzee, baboon, rat, mouse

49 Which of the following groups are not monophyletic? E. coli rat mouse baboon chimp human a. human, chimpanzee, baboon b. mouse, chimpanzee, baboon c. rat, mouse d. human, chimpanzee, baboon, rat, mouse e. E. coli, human, chimpanzee, baboon, rat, mouse

50

51 A character provides information about an individual OTU. A distance represents a quantitative statement concerning the dissimilarity between two OTUs.

52 A character is a well-defined feature that in a taxonomic unit can assume one out of two or more mutually exclusive character states. Mutually exclusive: If David is tall, David cannot be short.

53

54

55 ContinuousDiscrete BinaryMultistate Unordered UnpolarPolar UnpolarPolar Character Ordered

56 A character is unordered if a change from one character state to any other character state can occur in one step.

57 A character is ordered if there exists a unique symmetrical path of change from one character state to another.

58 Polar A character is polar if there exists a unique asymmetrical (irreversible) path of change from one character state to another.

59 In partially ordered characters the number of steps varies for the different pairwise combinations of character states, but no definite relationship exists between the number of steps and the character-state. Amino-acid sites are partially ordered characters. An amino acid cannot change into all other amino acids in a singe step, as sometimes 2 or 3 steps are required. For example, a tyrosine may only change into a leucine through an intermediate state, i.e., phenylalanine or histidine.

60 The number of steps in partially ordered characters is specified by a step matrix, the elements of which indicate the number of steps required between any two character states

61

62 Assumptions about character evolution Methods of phylogenetic reconstruction require that we make explicit assumptions about: (1) the number of discrete steps required for one character state to change into another. (2) the probability with which such a change may occur.

63 Temporal Polarity of Character States Character states may be ranked by relative antiquity into: (1) primitive or ancestral (plesiomorphy) (2) derived or novel (apomorphy)

64 Taxonomic Distribution of Character States A primitive state that is shared by several taxa is a symplesiomorphy. A derived state that is shared by several taxa is a synapomorphy. A derived character state unique to a particular taxon is an autapomorphy. A character state that is shared by several taxa due to convergence, parallelism and reversals, rather than due to common descent, is a homoplasy. sympathy synapse syllable system

65 CC C A A A BAA A B plesiomorphy apomorphy (autapomorphy ) synapomorphy symplesiomorphy homoplasy A D

66

67 Distance Data

68

69 Most molecular data yield character states that are subsequently converted into distances.

70 Some molecular data can only be expressed as distances.

71

72

73

74 +