Molecular phylogenetics

Slides:



Advertisements
Similar presentations
LG 4 Outline Evolutionary Relationships and Classification
Advertisements

Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
An Introduction to Phylogenetic Methods
Introduction to Phylogenies
Slide 1 of Modern Evolutionary Classification.
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
Phylogenetic reconstruction
18-2 Modern Evolutionary Classification
Molecular Evolution Revised 29/12/06
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
UPGMA and FM are distance based methods. UPGMA enforces the Molecular Clock Assumption. FM (Fitch-Margoliash) relieves that restriction, but still enforces.
Review of cladistic technique Shared derived (apomorphic) traits are useful in understanding evolutionary relationships Shared primitive (plesiomorphic)
Branch lengths Branch lengths (3 characters): A C A A C C A A C A C C Sum of branch lengths = total number of changes.
Dispersal models Continuous populations Isolation-by-distance Discrete populations Stepping-stone Island model.
Bell Work Dogs of a certain breed can have black fur or white fur. Black fur is dominant, but the breeder only wants puppies with white fur. Cross two.
Lecture 13 – Performance of Methods Folks often use the term “reliability” without a very clear definition of what it is. Methods of assessing performance.
Chapter 2 Opener How do we classify organisms?. Figure 2.1 Tracing the path of evolution to Homo sapiens from the universal ancestor of all life.
Classification and Phylogenies Taxonomic categories and taxa Inferring phylogenies –The similarity vs. shared derived character states –Homoplasy –Maximum.
Phylogeny & The Tree of Life. Phylogeny  The evolutionary history of a species or group of species.
What Is Phylogeny? The evolutionary history of a group.
Phylogenetic analyses Kirsi Kostamo. The aim: To construct a visual representation (a tree) to describe the assumed evolution occurring between and among.
Phylogeny Estimation: Traditional and Bayesian Approaches Molecular Evolution, 2003
Terminology of phylogenetic trees
Phylogenetics Alexei Drummond. CS Friday quiz: How many rooted binary trees having 20 labeled terminal nodes are there? (A) (B)
COMPUTATIONAL MODELS FOR PHYLOGENETIC ANALYSIS K. R. PARDASANI DEPTT OF APPLIED MATHEMATICS MAULANA AZAD NATIONAL INSTITUTE OF TECHNOLOGY (MANIT) BHOPAL.
Models of sequence evolution GTR HKY Jukes-Cantor Felsenstein K2P Tree building methods: some examples Assessing phylogenetic data Popular phylogenetic.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
Phylogenetic trees School B&I TCD Bioinformatics May 2010.
BINF6201/8201 Molecular phylogenetic methods
Phylogenetics and Coalescence Lab 9 October 24, 2012.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Underlying Principles of Zoology Laws of physics and chemistry apply. Principles of genetics and evolution important. What is learned from one animal group.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Warm-Up 1.Contrast adaptive radiation vs. convergent evolution? Give an example of each. 2.What is the correct sequence from the most comprehensive to.
Molecular phylogenetics 4 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
Systematics and the Phylogenetic Revolution Chapter 23.
Phylogenetic Trees  Importance of phylogenetic trees  What is the phylogenetic analysis  Example of cladistics  Assumptions in cladistics  Frequently.
Introduction to Phylogenetics
Calculating branch lengths from distances. ABC A B C----- a b c.
Phylogeny and Genome Biology Andrew Jackson Wellcome Trust Sanger Institute Changes: Type program name to start Always Cd to phyml directory before starting.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Phylogeny Ch. 7 & 8.
PHYLOGENY AND THE TREE OF LIFE CH 26. I. Phylogenies show evolutionary relationships A. Binomial nomenclature: – Genus + species name Homo sapiens.
Classification and Phylogenetic Relationships
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Lecture 15: Reconstruction of Phylogeny Adaptive characters: 1.May indicate derived character (special adaptation) e.g. Raptorial forelegs in mantids 2.May.
Maximum Parsimony Phenetic (distance based) methods are fast and often accurate but discard data and are not based on explicit character states at each.
Bioinformatics Lecture 3 Molecular Phylogenetic By: Dr. Mehdi Mansouri Mehr 1395.
Section 2: Modern Systematics
Introduction to Bioinformatics Resources for DNA Barcoding
Evolutionary genomics can now be applied beyond ‘model’ organisms
Phylogenetic basis of systematics
Section 2: Modern Systematics
Multiple Alignment and Phylogenetic Trees
Patterns in Evolution I. Phylogenetic
Systematics: Tree of Life
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Systematics: Tree of Life
Chapter 20 Phylogenetic Trees. Chapter 20 Phylogenetic Trees.
Chapter 19 Molecular Phylogenetics
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Phylogeny and Systematics (Part 6)
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Molecular data assisted morphological analyses
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
But what if there is a large amount of homoplasy in the data?
Chapter 18: Evolution and Origin of Species
Presentation transcript:

Molecular phylogenetics Lecture 8 Molecular phylogenetics

NB: Phylogenetic trees are hypotheses The goal of phylogeny is simply to reconstruct the historical relationships between a group of taxa NB: Phylogenetic trees are hypotheses A tree may have very strong support, or it may have very little support - a large number of characters supporting a specific topology Weak support for a topology may arise due to – many possible alternatives which cannot be excluded

NB Gene trees are not the same as species trees species trees illustrate the evolutionary histories of a group of related species - i.e., species trees record the details of speciation for the group gene trees show the evolutionary relationships among DNA sequences for a locus gene trees may not be the same as species trees for one main reason – the existence of ancestral polymorphism if this ancestral polymorphism is lost in some taxa but not in others, then one sequence isolated in species A may be more closely related to one in species B than to any other conspecific sequence the gene tree will thus be different from the true species tree the best way to guarantee that this will not occur is to use information provided by multiple independent loci!

Terminology Trees may be Rooted and Unrooted By an Outgroup – Which is a taxon assumed to have diverged before the taxa studied Topology – refers to the branching pattern You can have internal and external branches Any type of data can be used to reconstruct phylogenetic trees Historically morphological data was used Molecular data sources are: Allozymes, immunological distance, DNA-DNA hybridization, Restriction site data, Amino acid sequences, DNA sequence data

Characters Characters may be binary (i.e., presence or absence of an isozyme allele) or multistate (i.e., ACGT) characters may also be ordered or unordered when characters are ordered a certain directionality is implied among changes

The important stuff Two important assumptions about the characters used to build trees: 1. the characters are independent 2. the characters are homologous a homologous character is one shared by two species because it was inherited from a common ancestor homoplasy character or trait is possessed by two species but was not possessed by all the ancestors intervening between them, it is said to exhibit homoplasy can result from convergent or parallel evolution, or from evolutionary reversals. reversals are very common at the DNA sequence level because there are only four nucleotide bases. the molecular characters used must also minimize “homoplasy”. - homoplasy occurs when two taxa possess a certain character but that character was not present in all ancestors of the two taxa. - homoplasy can occur by convergent or parallel evolution, or by reversals. - for molecular data, evolutionary reversals are the main source of homoplasy. - for example, consider the following series of substitutions:

How do we construct trees? There are four major types of phylogenetic methods: 1. Distance methods (e.g. UPGMA, NJ) 2. Maximum parsimony methods (MP) 3. Maximum likelihood methods (ML) 4. Bayesian tree-building methods (such as that used by the computer program MrBayes) MP and ML are both called “cladistic” methods All these methods can use an Outgroup or Root

Distance Methods The general strategy behind distance methods is to cluster taxa (or OTUs) so that the most similar ones are found close together in the tree This strategy is called a phenetic approach The best tree, according to this approach, is to minimize the total distance among all taxa The branch lengths in phenograms carry important information The closer two taxa resemble one another the higher they are positioned in the phenogram phenograms may not represent the true phylogeny - in fact construction of the true phylogeny is not one of their goals. - the branch lengths in phenograms carry information about the degree of similarity between any pair of taxa. - an important assumption of UPGMA is that an equal rate of evolution occurs along all branches. - although this assumption may be violated frequently, the UPGMA methods does quite well in simulation studies recovering true tree topographies. - the principle behind NJ is to find neighbors sequentially that may minimize the total length of the tree. - the total length of the tree is simply the sum of all branch lengths. an important advantage of the NJ approach over UPGMA is that it allows for unequal rates of evolution along different branches distance methods are fast and efficient but all suffer from the same problem. - this problem is that they are all based on an estimate derived from the data – they do not use the data itself to produce the tree. - because information is lost in coverting the data into a distance matrix, distance methods fail to use all of the information present in the data.

Maximum parsimony (MP) according to the MP approach, the best tree minimizes the number of evolutionary steps (i.e., changes among characters) this is the principle of parsimony - the least number of changes, required the better the tree evolutionary change does not always obey laws of parsimony but it is a reasonable starting point MP trees are based exclusively on synapomorphies shared by two or more taxa that are derived from some ancestral state essential to use one or more outgroups

Maximum Likelihood (ML) given a certain model of base substitution and a specific tree, what is the probability of obtaining this set of DNA sequences? this probability is estimated by a tree’s likelihood score the best tree is that which has the highest likelihood, or probability of being produced ML methods are computationally intensive and thus have not been easy to use until recently

How do you select the correct tree Correlation between distance, MP, and ML topologies is found – Reconstruction is GOOD Common approach is to use a statistical technique called bootstrapping Bootstrapping procedure works by randomly re-sampling the nucleotide sequence data (with replacement), constructing a tree from this data and counting the number of times a particular branch is found out of say, 100, 500, or 1,000 replicate pseudosamples

So? Trees based on genetic markers will accurately reflect evolutionary relationships only if the rates of evolution are constant markers are neutral foundation population is monomorphic (or there are many independently inherited markers) If starting population is polymorphic, fixation of different initial sequences in different lineages (termed lineage sorting) may lead to incorrect inferred phylogenies

Outbreeding depression Outbreeding depression is the reduction in reproductive fitness resulting from crossing of populations Occurs when: adaptation to a certain area occurred; crossing different sub-species

Diagnosing genetic problems 1. How large is the population (Ne)? 2. Has it experienced significant bottlenecks in the past? 3. Has it lost genetic diversity? 4. Is it suffering from inbreeding depression? 5. Is it genetically fragmented? How do you recover a small population? outbreed (if available), or inbred but genetically differentiated from the population to which they are being introduced DL - Eldridge et al. 1999 – Black-footed Rock wallaby

Genetically viable populations How large must populations be, to be genetically viable in the long term? minimum viable population size (MVP) the minimum size required to retain reproductive fitness and evolutionary potential over thousands of years

How large? Three genetic components must be considered in answering this question: Is the population size large enough to avoid inbreeding depression? Is there sufficient genetic diversity for evolution to occur in response to environmental change? Is the population large enough to avoid accumulating new deleterious mutations?