Evolutionary genomics can now be applied beyond ‘model’ organisms

Slides:



Advertisements
Similar presentations
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Advertisements

. Class 9: Phylogenetic Trees. The Tree of Life Evolution u Many theories of evolution u Basic idea: l speciation events lead to creation of different.
Chapter 19 Evolutionary Genetics 18 and 20 April, 2004
THE EVOLUTIONARY HISTORY OF BIODIVERSITY
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
Phylogenetics - Distance-Based Methods CIS 667 March 11, 2204.
Orthology, paralogy and GO annotation Paul D. Thomas SRI International.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
Phylogenetic reconstruction
Molecular Evolution Revised 29/12/06
Tree Reconstruction.
© Wiley Publishing All Rights Reserved. Phylogeny.
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
Review of cladistic technique Shared derived (apomorphic) traits are useful in understanding evolutionary relationships Shared primitive (plesiomorphic)
. Class 9: Phylogenetic Trees. The Tree of Life D’après Ernst Haeckel, 1891.
Building Phylogenies Parsimony 1. Methods Distance-based Parsimony Maximum likelihood.
Phylogenetic trees Sushmita Roy BMI/CS 576
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Phylogenetic analyses Kirsi Kostamo. The aim: To construct a visual representation (a tree) to describe the assumed evolution occurring between and among.
Terminology of phylogenetic trees
Molecular phylogenetics
P HYLOGENETIC T REE. OVERVIEW Phylogenetic Tree Phylogeny Applications Types of phylogenetic tree Terminology Data used to build a tree Building phylogenetic.
Chapter 26: Phylogeny and the Tree of Life Objectives 1.Identify how phylogenies show evolutionary relationships. 2.Phylogenies are inferred based homologies.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
BINF6201/8201 Molecular phylogenetic methods
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Warm-Up 1.Contrast adaptive radiation vs. convergent evolution? Give an example of each. 2.What is the correct sequence from the most comprehensive to.
PHYLOGENETICS CONTINUED TESTS BY TUESDAY BECAUSE SOME PROBLEMS WITH SCANTRONS.
Phylogeny GENE why is coalescent theory important for understanding phylogenetics (species trees)? coalescent theory lets us test our assumptions.
Evolutionary Biology Concepts Molecular Evolution Phylogenetic Inference BIO520 BioinformaticsJim Lund Reading: Ch7.
Introduction to Phylogenetics
Calculating branch lengths from distances. ABC A B C----- a b c.
26.1 Organisms Evolve Through Genetic Change Occurring Within Populations. “Nothing in Biology makes sense except in the light of Evolution” –Theodosius.
Lecture 17: Phylogenetics and Phylogeography
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Introduction to Phylogenetic trees Colin Dewey BMI/CS 576 Fall 2015.
Phylogeny Ch. 7 & 8.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Molecular Evolution. Study of how genes and proteins evolve and how are organisms related based on their DNA sequence Molecular evolution therefore is.
Section 2: Modern Systematics
Phylogeny and the Tree of Life
Introduction to Bioinformatics Resources for DNA Barcoding
Phylogenetic basis of systematics
Evolution of gene function
The neutral theory of molecular evolution
Very important to know the difference between the trees!
Section 2: Modern Systematics
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Goals of Phylogenetic Analysis
Methods of molecular phylogeny
Endeavour to reconstruct the characters of each hypothetical ancestor.
Patterns in Evolution I. Phylogenetic
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Molecular Evolution.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Reading Phylogenetic Trees
Chapter 25 – Phylogeny & Systematics
Chapter 19 Molecular Phylogenetics
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Phylogeny and Systematics (Part 6)
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Molecular data assisted morphological analyses
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Unit Genomic sequencing
Phylogenetic Trees Jasmin sutkovic.
But what if there is a large amount of homoplasy in the data?
Phylogeny and the Tree of Life
Presentation transcript:

Evolutionary genomics can now be applied beyond ‘model’ organisms Technological advances brings genomics to the study of ecology & evolution But genomics has also made apparent the need for incorporating evolution into basic biological study Knowing how and why characters evolve (e.g. which residues of a protein are under constraint) informs on function (e.g. which residues are important) Brings genomics to study of ecology and evolution, when the organism is chosen NOT because it is easiest to work with (‘model’) but because it is the best organism to address a specific scientific question

Types of questions & comparisons in evolutionary genomics Given several whole-genome sequences, we can compare: * Genome size, organization (chromosomes/plasmids), structure * Gene/ncRNA content: number of genes, duplicates, size of gene families, etc * Sequence differences related to: gene evolution, regulatory evolution * RNA & protein abundance across species, for all RNAs/proteins Ultimately, many of us are interested in genomic features under selection: * Which genomic features are restricted from varying? Why? * Which genomic features are least restricted from varying? Why? * Which genomic features are/were involved in adaptation?

The How and the Why of Evolution We can observe what characters are different within and between species Using phylogeny, we can often reconstruct the common ancestral state Together these can inform on the history of changes The WHY: often much more challenging The goal is to understand the forces that drive changes or restrict change Often this means looking for known signatures of evolution (e.g. selection)

“Nothing in Biology Makes Sense Except in the Light of Evolution” T. Dobzhansky ‘Nothing in [comparative genomics] Makes Sense Except in the Light of the [phylogenetic tree]’ A. Gasch bastardization

Primer on Phylogeny Cladogram Phylogram a a b b c c d d e e Shows structure of the tree only Shows structure AND distance between nodes

4 incarnations of the SAME unrooted tree Primer on Phylogeny 4 incarnations of the SAME unrooted tree a a a b c d e b a b b c e c c d d d e e Unrooted means do not know history (i.e. where the common ancestor is)

Now distance also corresponds to ‘time’ (molecular clock theory) Primer on Phylogeny 3 incarnations of the SAME rooted tree a b c d e z a a b b c c * d d e * e z z known outgroup * Rooted means DO know history (i.e. where the common ancestor * is) Now distance also corresponds to ‘time’ (molecular clock theory) or order of events

Reconstructing the Ancestral State b c d * e z known outgroup

Reconstructing the Ancestral State Speciation event b c d * e z known outgroup In a species tree, each bifurcation represents a species split

Reconstructing the Ancestral State 4 b 3 c 2 d 5 1 * e z known outgroup Time/Distance Given a tree, we can reconstruct the ancestral state at each node. Usually work by Parsimony = smallest number of changes to explain the tree (i.e. the simplest explanation)

Reconstructing the Ancestral State Species can utilize: a Glucose, Galactose, Lactose 4 b 3 Glucose, Galactose, Lactose c 2 Glucose, Galactose, Lactose, Fructose d 5 Glucose, Galactose, Fructose 1 * e Glucose, Galactose, Fructose z Glucose, Lactose, Fructose Time/Distance Given a tree, we can reconstruct the ancestral state at each node. Usually work by Parsimony = smallest number of changes to explain the tree (i.e. the simplest explanation)

Reconstructing the Ancestral State Species can utilize: a Glucose, Galactose, Lactose GGL b GGLF Glucose, Galactose, Lactose c GGLF Glucose, Galactose, Lactose, Fructose d GGF Glucose, Galactose, Fructose GGLF or GLF * e Glucose, Galactose, Fructose z Glucose, Lactose, Fructose Time/Distance Events: Last common ancestor could have been either GGLF or GLF Galactose utilization might have been gained No change in states Fructose utilization was lost Lactose utilization was lost

several excellent courses on campus that cover this Making inferences based on phylogeny: Confidence in your inferences depends on how much you trust your tree Methods of phylogeny construction: * Neighbor joining: organize species based on similarity score - computationally the simplest but can be misleading especially if species are of variable evolutionary distances (“variable branch lengths”) * Parsimony: simplest tree to explain the observed data - simplest to model, can be computationally intensive without ‘heuristics’ * Maximum likelihood: requires specific models of evolution - computationally very intensive, need specific models * Baysian: - can be computationally intensive, need specific models ** Methods of phylogeny construction are beyond this course, but there are several excellent courses on campus that cover this

Making inferences based on phylogeny: Confidence in your inferences depends on how much you trust your tree Most methods have some way of assessing confidence at each node a Bootstrapping: Remake the tree 1,000 times using a subset of the data and see how many times you get the same node. High bootstrap value (>0.6) means in 60% of remade trees you observe that node. 1 b 0.6 c 0.4 d 1 1 * e z bootstrap values

Making inferences based on phylogeny: Confidence in your inferences depends on how much you trust your tree Most methods have some way of assessing confidence at each node Often represent the consensus tree a a 1 b 0.6 b c c 0.4 d d 1 1 * e e z z bootstrap values Collapse nodes without high confidence

Making inferences based on phylogeny: Confidence in your inferences depends on how much you trust your tree Most methods have some way of assessing confidence at each node a Baysian methods use a different approach Posterior Probability: Typically don’t trust nodes with <90% posterior probability. 1 b 0.6 c 0.4 d 1 1 * e z posterior probabilities

Making inferences based on phylogeny: Confidence in your inferences depends on how much you trust your tree Most methods have some way of assessing confidence at each node Consensus tree a a 1 b 0.6 b c c 0.4 d d 1 1 * e e z z posterior probabilities Collapse nodes with <0.9 posterior prob.

Species tree vs. gene/protein tree Trees can be very different, since genes can have their own histories Very important to know the difference between the trees! a. Gene tree is based a set of orthologous genes (i.e. related by a common ancestor) Often (but certainly not always) the gene tree is similar to the species tree b. Species tree is meant to represent the historical relationship between species. Want to build on characters that reflect time since divergence: In the genomic age, often use as many genes as possible (hundreds to thousands) to generate a species tree: Phylogenomics

Phylogenomics: Using Whole-genome information to reconstruct the Tree of Life Several approaches: 1. Concatonate many gene sequences and treat as one Use a ‘super matrix’ of variable sequence characters 2. Construct many separate trees, one for each gene, and then compare Often construct a ‘super tree’ that is built from all single trees 3. Incorporate non-sequence characters like synteny, intron structure, etc. The goal is to use many different # and types of characters to avoid being mislead about the relationship between species. Now recognized that different regions of the genome can have distinct histories.

A few other key basic concepts: Selection acts on phenotypes, based on their fitness cost/advantage, to affect the population frequencies of the underlying genotypes. In the case of DNA sequence: Neutral substitutions = no effect on fitness, no effect on selection Given a ~constant mutation rate, can convert the # of substitutions into time of divergence since speciation = molecular clock theory. Deleterious substitutions = fitness cost * These are removed by purifying (negative) selection Advantageous substitutions = fitness advantage * These alleles are enriched for through adaptive (positive) selection