GSR The Gene Sequence evolution model with iid Rate variation over tree Örjan Åkerborg, KTH Lars Arvestad, KTH Jens Lagergren, KTH Bengt Sennblad.

Slides:



Advertisements
Similar presentations
Topic models Source: Topic models, David Blei, MLSS 09.
Advertisements

Quick Lesson on dN/dS Neutral Selection Codon Degeneracy Synonymous vs. Non-synonymous dN/dS ratios Why Selection? The Problem.
CS 598AGB What simulations can tell us. Questions that simulations cannot answer Simulations are on finite data. Some questions (e.g., whether a method.
Phylogenetic analysis To infer and study evolutionary history of homologous gene families Manuel Ruiz (CIRAD, Data Integration team) Alexis Dereeper (IRD)
Gene an d genome duplication Nadia El-Mabrouk Université de Montréal Canada.
Whole Genome Sequencing &Crop Genetic Breeding Presentation: Wenhui Gao
THE EVOLUTIONARY HISTORY OF BIODIVERSITY
Orthology, paralogy and GO annotation Paul D. Thomas SRI International.
Genomics Chapter 18.
Molecular Clock I. Evolutionary rate Xuhua Xia
Genome evolution There are both proximate and ultimate explanations in molecular biology Mutation continually generates variation in genome content and.
Molecular Evolution Revised 29/12/06
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Evolution at the DNA level …ACGGTGCAGTTACCA… …AC----CAGTCCACCA… Mutation SEQUENCE EDITS REARRANGEMENTS Deletion Inversion Translocation Duplication.
CS273a Lecture 8, Win07, Batzoglou Evolution at the DNA level …ACGGTGCAGTTACCA… …AC----CAGTCCACCA… Mutation SEQUENCE EDITS REARRANGEMENTS Deletion Inversion.
Positional Cloning LOD Sib pairs Chromosome Region Association Study Genetics Genomics Physical Mapping/ Sequencing Candidate Gene Selection/ Polymorphism.
CS273a Lecture 9/10, Aut 10, Batzoglou Multiple Sequence Alignment.
HGM Hybrid networks with Gene Evolution Ali Tofigh, KTH Jens Lagergren, KTH Bengt Sennblad.
Ohnologs and Regulatory Networks Robbie Sedgewick Group Meeting March 2, 2006.
Genomic Rearrangements CS 374 – Algorithms in Biology Fall 2006 Nandhini N S.
SRT Sequence model with Rates and Times Martin Linder, UU Tom Britton, SU Örjan Åkerborg, KTH Jens Lagergren, KTH, Bengt Sennblad.
1 Genome Rearrangements João Meidanis São Paulo, Brazil December, 2004.
Short Primer on Comparative Genomics Today: Special guest lecture 12pm, Alway M108 Comparative genomics of animals and plants Adam Siepel Assistant Professor.
Queensland Parallel Supercomputing Foundation 1. Professor Mark Ragan (Institute for Molecular Bioscience) 2. Dr Thomas Huber (Department of Mathematics)
Geometric Approaches to Reconstructing Time Series Data Project Update 29 March 2007 CSC/Math 870 Computational Discrete Geometry Connie Phong.
Phylogenetic trees Sushmita Roy BMI/CS 576
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Molecular Clock. Rate of evolution of DNA is constant over time and across lineages Resolve history of species –Timing of events –Relationship of species.
Why Models of Sequence Evolution Matter Number of differences between each pair of taxa vs. genetic distance between those two taxa. The x-axis is a proxy.
Computational Biology, Part D Phylogenetic Trees Ramamoorthi Ravi/Robert F. Murphy Copyright  2000, All rights reserved.
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
Genome Alignment. Alignment Methods Needleman-Wunsch (global) and Smith- Waterman (local) use dynamic programming Guaranteed to find an optimal alignment.
3- RIBOSOMAL RNA GENE RECONSTRUCITON  Phenetics Vs. Cladistics  Homology/Homoplasy/Orthology/Paralogy  Evolution Vs. Phylogeny  The relevance of the.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Yeast genome sequencing: the power of comparative genomics MEDG 505, 03/02/04, Han Hao Molecular Microbiology (2004)53(2), 381 – 389.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Genetics Presentation: Sea Squirts Catherine Dong 9/6/12 Bio303 H Dr. Ely.
whole-genome duplications and large segmental duplications… …seem to be a common feature in eukaryotic genome evolution …play a crucial role in the evolution.
Phylogeny and Genome Biology Andrew Jackson Wellcome Trust Sanger Institute Changes: Type program name to start Always Cd to phyml directory before starting.
Molecular and Genomic Evolution Getting at the Gene Pool.
The Tree of Life How do we select a gene sequence for comparison?
Cédric Notredame (08/12/2015) Molecular Evolution Cédric Notredame.
Phylogeography of Leucetta chagosensis (Porifera, Calcarea) Christoph Flucke, Jens Kurz, Rasmus Liedigk, Zdenka Valenzova Fig.4: RAxML Phylogram Fig.5:
Phylogeny & Systematics
Gene Family Size Distributions Brought to You By Your Neighorhood Durand Lab Narayanan Raghupathy Nan Song Rose Hoberman.
Bootstrap ? See herehere. Maximum Likelihood and Model Choice The maximum Likelihood Ratio Test (LRT) allows to compare two nested models given a dataset.Likelihood.
Bioinf.cs.auckland.ac.nz Juin 2008 Uncorrelated and Autocorrelated relaxed phylogenetics Michaël Defoin-Platel and Alexei Drummond.
Full modeling versus summarizing gene- tree uncertainty: Method choice and species-tree accuracy L.L. Knowles et al., Molecular Phylogenetics and Evolution.
The genomic democracy of sex. Genetic variability Mutation Gene flow Sex.
Molecular Evolution. Study of how genes and proteins evolve and how are organisms related based on their DNA sequence Molecular evolution therefore is.
Population sequencing using short reads: HIV as a case study Vladimir Jojic et.al. PSB 13: (2008) Presenter: Yong Li.
PrIME Probabilistic Integrated Models of Evolution
Evolutionary genomics can now be applied beyond ‘model’ organisms
Shin-Han Shiu Department of Plant Biology
Fig. 1. — The life cycle of S. papillosus. (A) The life cycle of S
Genetics and Evolutionary Biology
Pipelines for Computational Analysis (Bioinformatics)
Very important to know the difference between the trees!
Molecular Clock: Lineage-specific evolutionary rate
In-Text Art, Ch. 16, p. 316 (1).
Human Evolution: Turning Back the Clock
Biological Evidence of Evolution
Molecular Evolution.
Why could a gene tree be different from the species tree?
26.5 Molecular Clocks Help Track Evolutionary Time
Evolutionary genetics
Reverend Thomas Bayes ( )
Study phylogeny in the context of species evolution
Qian Cong, Dominika Borek, Zbyszek Otwinowski, Nick V. Grishin 
Presentation transcript:

GSR The Gene Sequence evolution model with iid Rate variation over tree Örjan Åkerborg, KTH Lars Arvestad, KTH Jens Lagergren, KTH Bengt Sennblad

What? Gene evolution through duplication and loss Sequence evolution

Why? Base reconciliation analysis directly on data –Avoid information loss –Addresses uncertainty better Gene tree reconstruction should mirror generation

When? Arvestad et al. 2003, –MrBayes + GEM –Flawed model

60s ribosomal data

Arvestad et al. 2003, –MrBayes + GEM –Flawed model Arvestad et al –Intergrated GEM + Substitution model –Mathematically correct model –Molecular clock –Sampling algorithm - slow When?

MHC revisited

Åkerborg et al. Submitted (GSR) –Integrated GEM + SRT model When? Arvestad et al. 2003, –MrBayes + GEM –Flawed model Arvestad et al –Intergrated GEM + Substitution model –Mathematically correct model –Molecular clock –Sampling algorithm - slow

How? GSR –GEM Reconciled trees – duplication and loss –SRT Relaxed clock model (iid) Substitution model –Fast algorithm Discretized time space

Trees, T Sequence data, F Pr[D,T] Self-consistency

Self-consistency – the X%-test

Application to Yeast data Compare to prev. Results –YGOB –Orthogrups (SYNERGY) Both synteny-based

Synteny -- gene order

Application to Yeast data Compare to prev. Results –YGOB –Orthogrups (SYNERGY) Both synteny-based –Whole genome duplication Challenge! Genome-wide analysis! –4809 gene families (orthogroups)

Comparison YGOB results

Molecular clock?

Comparison SYNERGY results

Sequence vs. synteny data No sequence diff –36% of data sets are >85% similar Strong divergence –25% of data set are <40% similar –Long-branch attraction Conflicting sequence-synteny signal

Sequence vs. synteny data

Orthogroup % Single best SYNERGY tree 21.6 % 31.6 %

Summary primeGSR –Integrated model Reconciliation – gene duplication loss Relaxed clock Sequence evolution –Efficient algorithms –Improved gene tree reconstruction Future prospects –Divergence time estimates (MAP) –Species tree reconstruction –Include synteny