For immunologists 2013 Introduction to Phylogenies Dr Laura Emery

Presentation transcript:

Objectives: After this tutorial you should be able to: use essential phylogenetic terminology effectively; discuss aspects of phylogenies and their implications for phylogenetic interpretation; apply phylogenetic principles to interpret simple trees. This course will not: provide you with an overview of phylogenetic methods; enable you to use tools to construct your own phylogenies; enable you to evaluate whether a sensible phylogenetic model or method was selected to construct a phylogeny.

Outline: Introduction. Aspects of a tree: 1. Topology; 2. Branch lengths; 3. Nodes; 4. Confidence. Simple phylogenetic interpretation, including homology, gene duplication, and co-evolution.

What can I do with phylogenetics? Deduce relationships among species, genes, or cells. Deduce the origin of pathogens. Identify biological processes that affect how your sequence has evolved, e.g. identify genes or residues undergoing positive selection. Explore the evolution of traits through history. Estimate the timing of major historical events. Explore the impact of geography on species diversification.

What is a phylogenetic tree? A tree is an explanation of how sequences evolved, their genealogical relationships and thus how they came to be the way they are today (or at the time of sampling). Darwin 1837

Phylogenies explain genealogical relationships. Figure: family tree.

Aspects of a tree: 1. Topology (branching order); 2. Branch lengths (indication of genetic change); 3. Nodes: i. tips (sampled sequences, known as taxa), ii. internal nodes (hypothetical ancestors), iii. root (oldest point on the tree); 4. Confidence (bootstraps/probabilities).

1. Topology: The topology describes the branching structure of the tree, which indicates patterns of relatedness. Figure: trees with tips ordered A B C, A B C, and B A C; these trees display the same topology. Figure: trees with tips ordered A B C, C B A, and C A B; these trees display different topologies.
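
One way to make "same topology" concrete is to compare the sets of tips that each internal node groups together. The sketch below assumes rooted trees written as nested Python tuples, which is an illustrative representation and not something taken from the slides; the function names are hypothetical.

```python
def clades(tree):
    """Return every clade of a rooted tree as a frozenset of tip labels.

    A tree is either a tip label (str) or a tuple of subtrees.
    """
    found = set()

    def tips(node):
        if isinstance(node, str):
            return frozenset([node])
        # The tips below an internal node are the union of its children's tips.
        members = frozenset().union(*(tips(child) for child in node))
        found.add(members)
        return members

    tips(tree)
    return found


def same_topology(tree_a, tree_b):
    """Two rooted trees share a topology if they contain the same clades."""
    return clades(tree_a) == clades(tree_b)


# Rotating branches around a node changes the drawing, not the topology.
print(same_topology((("A", "B"), "C"), ("C", ("B", "A"))))  # True
# Regrouping the tips changes the branching order.
print(same_topology((("A", "B"), "C"), (("A", "C"), "B")))  # False
```

The order in which tips are drawn never enters the comparison, only which tips are grouped together.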

Topology Question Are these topologies the same? Answer = yes

Topology Question II: Which of these trees has a different topology from the others? Figure: five trees with tips ordered A B C F D E; A E D F B C; B A C F D E; C A B F E D; E D F C A B.

2. Branch lengths indicate genetic change. Longer branches indicate greater change. Change is typically measured in substitutions per site (but check the legend).

A scale bar can represent branch lengths. Figure: alternative representations of the same phylogeny, with a scale bar of 0.5.

Alternative representations of phylogenies: All of these representations depict the same topology. Branch lengths are indicated in blue; red lengths are meaningless. One of the representations shown is the Newick format.
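
As a rough illustration of the Newick format, the sketch below writes a small rooted tree as a Newick string. The tree, its branch lengths, and the `to_newick` helper are all hypothetical and chosen only for the example.

```python
def to_newick(node):
    """Serialise a node as Newick text (without the closing semicolon).

    A node is a pair (content, branch_length): content is a tip label (str)
    or a list of child nodes, and branch_length is a number or None.
    """
    content, length = node
    if isinstance(content, str):
        text = content
    else:
        text = "(" + ",".join(to_newick(child) for child in content) + ")"
    return text if length is None else f"{text}:{length}"


# Hypothetical tree: A and B are sisters, C joins them at the root.
tree = ([([("A", 0.1), ("B", 0.2)], 0.3), ("C", 0.5)], None)
print(to_newick(tree) + ";")  # ((A:0.1,B:0.2):0.3,C:0.5);
```

Parentheses encode the topology and the numbers after each colon are branch lengths, so the one string carries both aspects of the tree.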

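Not all trees include branch length data. Figure: a cladogram (branch lengths do not indicate genetic change) beside a phylogram (branch lengths indicate genetic change).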

Distance and substitution rate are confounded. Branch lengths indicate the genetic change that has occurred. We often don’t know whether long branch lengths reflect a rapid evolutionary rate, an ancient divergence time, or a combination of both. Genetic change (substitutions/site) = Evolutionary rate (substitutions/site/year) × Divergence time (years).
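
A small numerical sketch of the confounding, using made-up rates and times (the numbers are assumptions for illustration, not values from the slides): a fast-evolving, recently diverged lineage and a slow-evolving, anciently diverged lineage can produce the same branch length.

```python
import math


def genetic_change(rate, years):
    """Genetic change (substitutions/site) = rate (substitutions/site/year) x time (years)."""
    return rate * years


# Hypothetical lineages: fast and recent versus slow and ancient.
fast_recent = genetic_change(rate=1e-3, years=100)
slow_ancient = genetic_change(rate=1e-6, years=100_000)

# Both branches come out at about 0.1 substitutions per site.
print(math.isclose(fast_recent, slow_ancient))  # True
```

From the branch length alone there is no way to tell these two scenarios apart; extra information such as sampling dates or fossil calibrations is needed to separate rate from time.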

3. Nodes: Nodes occur at the ends of branches. There are three types of nodes: i. tips (sampled sequences, known as taxa); ii. internal nodes (hypothetical ancestors); iii. root (oldest point on the tree). Figures: Andrew Rambaut.

The root is the oldest point on the tree. The root indicates the direction of evolution. It is also the (hypothesised) most recent common ancestor (MRCA) of all of the samples in the tree. Figure axis: past to present. Figures: Andrew Rambaut.

Trees can be drawn in an unrooted form. Figure: rooted and unrooted drawings of a tree with tips A, B, C, D, and E. These are alternative representations of the same topology.

There are multiple rooted tree topologies for any given unrooted tree. Most tree-building methods produce unrooted trees. Identifying the correct root is often critical for interpretation! Figure: Aiden Budd.

How to root a tree. Midpoint rooting: assumes a constant evolutionary rate, which is often not the case! Outgroup rooting (recommended): the outgroup is one or more taxa that are known to have diverged prior to the group being studied; the node where the outgroup lineage joins the other taxa is the root. Figure: unrooted, midpoint-rooted, and outgroup-rooted trees.

Root Question: This tree is a cladogram, i.e. the branch lengths do not indicate genetic change. Indicate any root positions where bird and crocodile are not sister taxa (each other's closest relatives).

Alternative Representations Question

4. Confidence: How good is a tree? A tree is a collection of hypotheses, so we assess our confidence in each of its parts or branches independently. There are three main approaches: bootstraps, Bayesian methods, and approximate likelihood ratio test (aLRT) methods; the last two are probabilistic.
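
The slides do not describe how bootstrapping works, so the following is only a sketch of the standard nonparametric bootstrap: alignment columns are resampled with replacement, a tree is rebuilt from each resampled alignment, and the support for a branch is the percentage of replicate trees that contain it. The code covers the resampling step only; the toy alignment and the function name are hypothetical.

```python
import random


def bootstrap_alignment(alignment, rng):
    """Resample alignment columns with replacement.

    alignment maps taxon names to aligned sequences of equal length.
    The same column indices are used for every taxon, so each sampled
    column stays intact across sequences.
    """
    length = len(next(iter(alignment.values())))
    columns = [rng.randrange(length) for _ in range(length)]
    return {name: "".join(seq[i] for i in columns)
            for name, seq in alignment.items()}


# Toy alignment with made-up sequences.
alignment = {"A": "ACGTACGT", "B": "ACGTACGA", "C": "ACCTACGA"}
replicate = bootstrap_alignment(alignment, random.Random(1))
print(replicate)
```

In a full analysis this step would be repeated many times (often hundreds), with a tree inferred from each replicate by whichever method was used for the original tree.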

What is a monophyletic group? A monophyletic group (also described as a clade) is a group of taxa that share a more recent common ancestor with each other than with any other taxa.
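
On a rooted tree this definition amounts to asking whether some node has exactly the group in question as its descendants. A minimal sketch, assuming a rooted tree written as nested tuples (the example tree and the function name are hypothetical):

```python
def is_monophyletic(tree, group):
    """Return True if some node of the rooted tree has exactly `group` as its tips.

    A tree is either a tip label (str) or a tuple of subtrees.
    """
    group = frozenset(group)
    found = False

    def tips(node):
        nonlocal found
        if isinstance(node, str):
            members = frozenset([node])
        else:
            members = frozenset().union(*(tips(child) for child in node))
        if members == group:
            found = True
        return members

    tips(tree)
    return found


# Hypothetical rooted tree with six tips.
tree = ((("A", "B"), ("C", "D")), ("E", "F"))
print(is_monophyletic(tree, {"A", "B", "C", "D"}))  # True
print(is_monophyletic(tree, {"B", "C"}))            # False
```

The answer depends on where the tree is rooted, which is one reason the position of the root matters for interpretation.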

Confidence Question: Which of the bootstrap values indicates our confidence in the grouping of A, B, C, and D together as a monophyletic group? Do you think we can be confident in this grouping? Figure: two trees with tips A, B, C, D, E, and F. Note: high bootstrap values do not always mean that we can have confidence in a branch; false confidence can be generated under some phylogenetic methods.

for immunologists 2013 Part two: Phylogenetic interpretation Dr Laura Emery

Phylogenetic interpretation skill set: 1. Tree-thinking skills: relatedness, confidence, homology. 2. Knowledge of phylogenetic methods and their limitations. 3. Knowledge of biological processes affecting sequence evolution: gene duplication, recombination, horizontal gene transfer, population genetic processes, and many more! 4. Knowledge of the data you wish to interpret. Covered in Introduction to Phylogenies.

Simple phylogenetic interpretation question: Which is true? A) Mouse is more closely related to fish than frog is to fish. B) Lizard is more closely related to fish than mouse is to fish. C) Human and frog are equally related to fish.

Homology is similarity due to shared ancestry. Example: limbs and wings. Limbs are homologous: they share a common ancestor. Wings are not homologous: they are analogous, as they have evolved similarity independently.

Gene duplication: Gene duplication and subsequent divergence can result in novel gene functions (it can also result in pseudogenes). Genes that are homologous due to gene duplication are paralogous. Genes that are homologous due to speciation are orthologous.

Can you spot any MHC class II gene duplication events? Figure: teleost MHC class II phylogeny (Harstad et al., BMC Genomics 2008).

Immunology-related genes have atypical patterns of molecular evolution. Immunology genes have a high dN/dS ratio, indicative of positive selection (Park et al., Scientific Reports). They have a rapid evolutionary rate, are difficult to align, and violate the assumptions of many phylogenetic models.

Positive selection can lead to ladder-like phylogenies

Example: influenza haemagglutination phylogeny and immunological mapping (Smith et al., Science).

Phylogenetics can inform us of host-pathogen interactions and co-evolution. "Mirror" phylogenies are indicative of host-parasite vertical inheritance. Jiggins web page:

What does this phylogeny tell us about Human Cytomegalovirus (HCMV)? Figure: phylogeny with tips Human, Chimp, Rhesus, Simian, Baboon, Rat, and Murine (Nicholson et al., Virol J).

T-cell receptors and immunoglobulin chains are homologous Richards et al 2000

An extremely brief introduction to methods, analyses, & pitfalls

There is only one true tree. The true tree refers to what actually happened in the evolutionary past. All methods attempt to reconstruct the true phylogeny. Even the best method may not give you the true tree.

Phylogenetic Methods: The general approach. We want to find the tree that best explains our aligned sequences. We need to be able to define “best explains”: we need a model of sequence evolution, and we need a criterion (or set of criteria) to use to choose between alternative trees. Then evaluate all possible trees (NB: if N = 20, there are about 2 × 10^20 possible unrooted trees!) or take a short cut. Credit: Paul Sharp.
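
The figure for N = 20 can be checked with the standard counting formulas for fully bifurcating trees: (2N - 5)!! unrooted topologies and (2N - 3)!! rooted topologies for N labelled tips. The formulas are not stated on the slide, and the function names below are hypothetical.

```python
def odd_double_factorial(k):
    """Product k * (k - 2) * (k - 4) * ... * 1 for odd k (1 if k < 1)."""
    result = 1
    while k > 1:
        result *= k
        k -= 2
    return result


def unrooted_trees(n_tips):
    """Number of unrooted bifurcating topologies for n_tips >= 3 labelled tips."""
    return odd_double_factorial(2 * n_tips - 5)


def rooted_trees(n_tips):
    """Number of rooted bifurcating topologies for n_tips >= 2 labelled tips."""
    return odd_double_factorial(2 * n_tips - 3)


print(unrooted_trees(4))             # 3
print(f"{unrooted_trees(20):.2e}")   # 2.22e+20
# An unrooted tree has 2N - 3 branches, and each one is a possible root position.
print(rooted_trees(20) == unrooted_trees(20) * (2 * 20 - 3))  # True
```

The count grows so quickly that exhaustive evaluation stops being practical beyond a handful of taxa, which is why tree-building programs take the "short cut" of heuristic search.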

The problem of multiple substitutions. Multiple substitutions at the same site are more likely to have occurred between distantly related species, so we need an explicit model of evolution to account for them. Figure: hidden mutations, marked with asterisks.
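
The slide does not name a particular model. As one illustration, the Jukes-Cantor (JC69) model, the simplest substitution model, corrects an observed proportion of differing sites p to an estimated distance d = -(3/4) ln(1 - 4p/3). The sequences and function names below are made up for the example.

```python
import math


def p_distance(seq_a, seq_b):
    """Observed proportion of sites that differ between two aligned sequences."""
    differences = sum(1 for a, b in zip(seq_a, seq_b) if a != b)
    return differences / len(seq_a)


def jukes_cantor_distance(p):
    """JC69 estimate of substitutions per site; only defined for p < 0.75."""
    return -0.75 * math.log(1 - 4 * p / 3)


p = p_distance("ACGTACGTAC", "ACGTTCGAAC")   # 2 of 10 sites differ
d = jukes_cantor_distance(p)
print(p, d)  # the corrected distance d is larger than the observed p
```

The gap between d and p widens as sequences become more divergent, reflecting the growing number of substitutions hidden by later changes at the same site.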

Methodological approaches: 1. Distance matrix methods (pre-computed distances): UPGMA, which assumes a perfect molecular clock (Sokal & Michener 1958); minimum evolution, e.g. neighbor-joining, NJ (Saitou & Nei 1987). 2. Maximum parsimony (Fitch 1971): minimises the number of mutational steps. 3. Maximum likelihood (ML): evaluates the statistical likelihood of alternative trees, based on an explicit model of substitution. 4. Bayesian methods: like ML but can incorporate prior knowledge.
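
To give a feel for a distance matrix method, here is a minimal UPGMA sketch. It returns the topology only (node heights are omitted for brevity), and the distance values, the input format, and the function name are assumptions made for the example rather than anything from the slides.

```python
from itertools import combinations


def upgma(names, matrix):
    """Cluster taxa by UPGMA and return the topology as nested tuples.

    names  : list of tip labels
    matrix : dict mapping (name_a, name_b) to a pairwise distance
    """
    # Each cluster is stored as (subtree, number_of_tips), keyed by an integer id.
    clusters = {i: (name, 1) for i, name in enumerate(names)}
    dist = {}
    for (i, a), (j, b) in combinations(enumerate(names), 2):
        dist[(i, j)] = matrix[(a, b)] if (a, b) in matrix else matrix[(b, a)]
    next_id = len(names)

    while len(clusters) > 1:
        i, j = min(dist, key=dist.get)  # closest pair of clusters
        (tree_i, n_i), (tree_j, n_j) = clusters.pop(i), clusters.pop(j)
        # Distance from the merged cluster to each remaining cluster is the
        # size-weighted average of the two old distances.
        for k in clusters:
            d_ik = dist.pop((min(i, k), max(i, k)))
            d_jk = dist.pop((min(j, k), max(j, k)))
            dist[(k, next_id)] = (n_i * d_ik + n_j * d_jk) / (n_i + n_j)
        del dist[(i, j)]
        clusters[next_id] = ((tree_i, tree_j), n_i + n_j)
        next_id += 1

    (tree, _), = clusters.values()
    return tree


# Hypothetical clock-like distances: A and B are closest, then C, then D.
distances = {("A", "B"): 2, ("A", "C"): 6, ("B", "C"): 6,
             ("A", "D"): 10, ("B", "D"): 10, ("C", "D"): 10}
print(upgma(["A", "B", "C", "D"], distances))
```

With these distances A and B are joined first, then C, then D. Because UPGMA assumes a perfect molecular clock, it can group taxa incorrectly when evolutionary rates differ between lineages.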

Phylogenetic analyses are not straightforward. Flowchart boxes, as labelled on the slide: Data assessment (known biology; additional data, e.g. geography). Decide upon and implement method. Phylogenetic result(s). Formulate hypotheses. Answered your question? Investigate unexpected and unresolved aspects further (consider including more data). Final phylogeny and analysis. Can you validate this? The decision branches are labelled Yes and No.

Further Reading: Molecular Evolution: A Phylogenetic Approach (1998), Roderic D M Page & Edward C Holmes, Blackwell Science, Oxford. The Phylogenetic Handbook (2003), Marco Salemi & Anne-Mieke Vandamme (Eds), Cambridge University Press, Cambridge. Inferring Phylogenies (2003), Joseph Felsenstein, Sinauer. Molecular Evolution (1997), Wen-Hsiung Li, Sinauer.

Phylogenetics at the EBI: Clustal phylogeny currently available; RAxML coming soon…

Acknowledgements. People: Andrew Rambaut (University of Edinburgh), Paul Sharp (University of Edinburgh), Nick Goldman (EMBL-EBI), Benjamin Redelings (Duke University), Brian Moore (University of California, Davis), Olivier Gascuel (University of Montpellier), Aiden Budd (EMBL-Heidelberg) …and the EBI training team. Funding: EMBL member states and…

Thank you! Facebook: EMBLEBI