2[ln(6.2*10^-12) - ln(7.9*10^-14)] is χ²-distributed with (n-2) degrees of freedom. Output from Likelihood Method. Likelihood: 6.2*10^-12, ? = 0.34.

Presentation transcript:

Output from Likelihood Method
Molecular Clock: n-1 heights estimated. Likelihood: 7.9*10^-14
No Molecular Clock: 2n-3 lengths estimated. Likelihood: 6.2*10^-12
2[ln(6.2*10^-12) - ln(7.9*10^-14)] is χ²-distributed with (n-2) degrees of freedom, since (2n-3) - (n-1) = n-2.
Figure labels: s1, s2, s3, s4, s5; Now; Duplication Times; Amount of Evolution; node heights and branch lengths given with -/+ ranges.
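The test on this slide can be worked through numerically. The sketch below assumes n = 5 (the five sequences s1 to s5 in the figure) and uses the standard likelihood ratio statistic, twice the log-likelihood difference; the helper names `chi2_sf` and `clock_lrt` are illustrative and not part of the original slides.

```python
import math

def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable with integer df."""
    if x <= 0:
        return 1.0
    half = x / 2.0
    if df % 2 == 0:
        # Even df: exp(-x/2) times a finite sum of (x/2)^i / i!
        term, total = 1.0, 1.0
        for i in range(1, df // 2):
            term *= half / i
            total += term
        return math.exp(-half) * total
    # Odd df: erfc term plus a finite sum over half-integer powers
    total = math.erfc(math.sqrt(half))
    for i in range(1, (df - 1) // 2 + 1):
        total += math.exp(-half) * half ** (i - 0.5) / math.gamma(i + 0.5)
    return total

def clock_lrt(lik_clock, lik_no_clock, n_species):
    """Likelihood ratio test of the molecular clock (nested in the no-clock model)."""
    stat = 2.0 * (math.log(lik_no_clock) - math.log(lik_clock))
    df = (2 * n_species - 3) - (n_species - 1)  # = n - 2
    return stat, df, chi2_sf(stat, df)

# Likelihoods from the slide; n_species = 5 is an assumption read off the figure
stat, df, p = clock_lrt(7.9e-14, 6.2e-12, n_species=5)
print(f"2*delta lnL = {stat:.3f}, df = {df}, p = {p:.4f}")
```

Under these assumptions the statistic is about 8.73 on 3 degrees of freedom, so the clock would be rejected at the 5% level but not at the 1% level.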

Bayesian Approach
Likelihood function L() - the probability of data as function of parameters: L(θ,D)
In Likelihood analysis, θ is not a stochastic variable; θ_max(D) is.
In Bayesian Analysis, θ is a stochastic variable with a prior distribution before data is included in the analysis.
After the observation of Data, there will be a posterior on θ.
Bayesian Analysis has seen a major rise in use as a consequence of numerical/stochastic integration techniques such as Markov Chain Monte Carlo.
Likelihood - L(θ)
Probability of going from θ to θ' - q(θ, θ')
J - Jacobian
Acceptance ratio
Likelihood function L(θ,D) is central to both approaches
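As a rough illustration of how Markov Chain Monte Carlo turns a likelihood and a prior into a posterior sample, here is a minimal Metropolis sketch. It is not the slide's own acceptance ratio: it assumes toy binomial data, a flat prior on (0, 1), and a symmetric proposal, so the q terms cancel and the Jacobian is 1 (no change of dimension). All names are hypothetical.

```python
import math
import random

def log_likelihood(theta, successes, trials):
    """Binomial log-likelihood, up to a constant; -inf outside (0, 1)."""
    if not 0.0 < theta < 1.0:
        return float("-inf")
    return successes * math.log(theta) + (trials - successes) * math.log(1.0 - theta)

def metropolis(successes, trials, n_iter=20000, step=0.1, seed=0):
    """Sample the posterior of theta under a flat prior with a random-walk proposal."""
    rng = random.Random(seed)
    theta = 0.5
    log_post = log_likelihood(theta, successes, trials)  # flat prior adds only a constant
    samples = []
    for _ in range(n_iter):
        proposal = theta + rng.uniform(-step, step)
        log_post_new = log_likelihood(proposal, successes, trials)
        # Symmetric proposal: acceptance ratio reduces to min(1, posterior ratio)
        accept_prob = math.exp(min(0.0, log_post_new - log_post))
        if rng.random() < accept_prob:
            theta, log_post = proposal, log_post_new
        samples.append(theta)
    return samples

samples = metropolis(successes=7, trials=20)
kept = samples[1000:]  # discard burn-in
print("posterior mean ~", sum(kept) / len(kept))
```

With a flat prior the exact posterior here is Beta(8, 14), with mean 8/22, which gives a simple check on the sampler.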

Assignment to internal nodes: The simple way.
Leaves: C A C C A C T G. Internal nodes: ? ? ? ? ? ?
If branch lengths and the evolutionary process are known, what is the probability of the nucleotides at the leaves?
Cctacggccatacca a ccctgaaagcaccccatcccgt
Cttacgaccatatca c cgttgaatgcacgccatcccgt
Cctacggccatagca c ccctgaaagcaccccatcccgt
Cccacggccatagga c ctctgaaagcactgcatcccgt
Tccacggccatagga a ctctgaaagcaccgcatcccgt
Ttccacggccatagg c actgtgaaagcaccgcatcccg
Tggtgcggtcatacc g agcgctaatgcaccggatccca
Ggtgcggtcatacca t gcgttaatgcaccggatcccat

Probability of leaf observations - summing over internal states A, C, G, T
P(C→G) * P_C(left subtree)
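The summation over internal states can be sketched as a recursion from the leaves towards the root. The example below assumes a Jukes-Cantor substitution model and a uniform root distribution, since the slides only say that the evolutionary process is known; the tree encoding (a leaf is a nucleotide string, an internal node is a tuple of (child, branch length) pairs) is a hypothetical choice for illustration.

```python
import math

BASES = "ACGT"

def jc_prob(x, y, t):
    """Jukes-Cantor probability of x -> y over branch length t (expected substitutions per site)."""
    e = math.exp(-4.0 * t / 3.0)
    return 0.25 + 0.75 * e if x == y else 0.25 - 0.25 * e

def partial(node):
    """Return {state: P(leaves below node | node is in that state)}."""
    if isinstance(node, str):  # leaf: observed nucleotide
        return {b: 1.0 if b == node else 0.0 for b in BASES}
    result = {b: 1.0 for b in BASES}
    for child, t in node:
        child_partial = partial(child)
        for x in BASES:
            # Sum over the child's possible states y: P(x -> y) * P_y(subtree)
            result[x] *= sum(jc_prob(x, y, t) * child_partial[y] for y in BASES)
    return result

def site_likelihood(root):
    """Probability of the leaf nucleotides, averaging over a uniform root state."""
    return sum(0.25 * p for p in partial(root).values())

# Three leaves: (C, A) joined first, then T; branch lengths are made up
tree = ((((("C", 0.1), ("A", 0.2))), 0.05), ("T", 0.3))
print(site_likelihood(tree))
```

Each node is visited once, so the cost grows linearly with the number of leaves rather than exponentially with the number of internal nodes.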

The Molecular Clock
First noted by Zuckerkandl & Pauling (1964) as an empirical fact.
How can one detect it?
Known Ancestor, a, at Time t: s1, s2, a
Unknown Ancestors: s1, s2, s3 ??

Rootings
Purpose
1) To give time direction in the phylogeny & most ancient point
2) To be able to define concepts such as monophyletic group.
Rooting techniques
1) Outgroup: Enhance data set with a sequence from a species definitely distant to all of them. It will be joined at the root of the original data.
2) Midpoint: Find midpoint of longest path in tree.
3) Assume Molecular Clock.
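Midpoint rooting can be sketched in a few lines. The example assumes the unrooted tree is stored as a nested dictionary of neighbours and branch lengths (a hypothetical representation), finds the longest path with two traversals, and returns the edge that contains the midpoint.

```python
def distances_from(tree, start):
    """Path lengths and predecessors from start, by depth-first traversal."""
    dist, parent = {start: 0.0}, {start: None}
    stack = [start]
    while stack:
        node = stack.pop()
        for nbr, length in tree[node].items():
            if nbr not in dist:
                dist[nbr] = dist[node] + length
                parent[nbr] = node
                stack.append(nbr)
    return dist, parent

def midpoint_root(tree):
    """Return (a, b, offset): the root lies on edge a-b, at distance offset from a."""
    dist, _ = distances_from(tree, next(iter(tree)))
    u = max(dist, key=dist.get)             # one end of the longest path
    dist, parent = distances_from(tree, u)
    v = max(dist, key=dist.get)             # the other end
    half = dist[v] / 2.0
    node = v
    while dist[parent[node]] > half:        # walk back from v until the midpoint is crossed
        node = parent[node]
    return parent[node], node, half - dist[parent[node]]

# Star tree with made-up branch lengths: the longest path is B - X - C (length 7)
tree = {
    "A": {"X": 1.0},
    "B": {"X": 2.0},
    "C": {"X": 5.0},
    "X": {"A": 1.0, "B": 2.0, "C": 5.0},
}
print(midpoint_root(tree))
```

In this example the root falls on the branch leading to C, 3.5 units from C, which leaves B and C equally far from the root.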

Rooting the 3 kingdoms
3 billion years ago: no reliable clock - no outgroup
Given 2 sets of homologous proteins, i.e. MDH & LDH, can the archaea, prokarya and eukarya be rooted?
Figure labels: E, P, A with Root??; LDH/MDH tree joining an LDH subtree (E, P, A) and an MDH subtree (E, P, A).

The generation/year-time clock (Langley-Fitch, 1973)
Absolute Time Clock / Generation Time Clock: Elephant, Mouse, 100 Myr
Absolute Time Clock, Generation Time: variable, constant
Unrooted tree on s1, s2, s3 with branch lengths l1, l2, l3: {l1 = l2 < l3}
Some rooting technique: rooted tree on s1, s2, s3 with l1 = l2 and l3

The generation/year-time clock (Langley-Fitch, 1973)
Can the generation time clock be tested?
Assume a data set: 3 species, 2 sequences each
Figure labels: Any Tree; Generation Time Clock; trees on s1, s2, s3.

The generation/year-time clock (Langley-Fitch, 1973)
Any tree on s1, s2, s3 with branch lengths l1, l2, l3: k=3: degrees of freedom: 3; k: dg: 2k-3
Clock tree with l1 = l2 and l3: k=3: dg: 2; k: dg: k-1
Generation time clock, one tree with branch lengths l1, l2, l3 and one with c*l1, c*l2, c*l3: k=3, t=2: dg=4; k, t: dg = (2k-3)+(t-1)
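The parameter counts on this slide can be checked with a small sketch. The generation time clock count is written here as (2k-3)+(t-1), the form that reproduces the slide's worked value dg=4 for k=3, t=2. The count for t unconstrained trees, t(2k-3), is an assumption not stated on the slide, as are the function names.

```python
def df_any_tree(k):
    """Branch lengths of one unrooted tree on k species."""
    return 2 * k - 3

def df_clock(k):
    """Node heights of one clock-like rooted tree on k species."""
    return k - 1

def df_generation_clock(k, t):
    """One set of branch lengths plus a scale factor c for each further sequence."""
    return (2 * k - 3) + (t - 1)

def df_unconstrained(k, t):
    """Assumption: each of the t sequences gets its own free branch lengths."""
    return t * (2 * k - 3)

k, t = 3, 2
print(df_any_tree(k), df_clock(k), df_generation_clock(k, t))
# Degrees of freedom for a test of the generation time clock against any tree
print(df_unconstrained(k, t) - df_generation_clock(k, t))
```

For k=3 and t=2 this gives 6 free parameters without constraints against 4 under the generation time clock, so a likelihood ratio test between them would have 2 degrees of freedom.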

α-globin, β-globin, cytochrome c, fibrinopeptide A & generation time clock (Langley-Fitch, 1973)
Relative rates (α-globin, β-globin, cytochrome c, fibrinopeptide A): ... 0.137

Almost Clocks
(M.J. Sanderson (1997) "A Nonparametric Approach to Estimating Divergence Times in the Absence of Rate Constancy" Mol.Biol.Evol.; J.L. Thorne et al. (1998) "Estimating the Rate of Evolution of the Rate of Evolution." Mol.Biol.Evol. 15(12); J.P. Huelsenbeck et al. (2000) "A compound Poisson Process for Relaxing the Molecular Clock" Genetics)
Comment: Makes perfect sense. Testing no clock versus perfect clock is choosing between two unrealistic extremes.
I Smoothing a non-clock tree onto a clock tree (Sanderson)
II Rate of Evolution of the rate of Evolution (Thorne et al.). The rate of evolution can change at each bifurcation
III Relaxed Molecular Clock (Huelsenbeck et al.). At random points in time, the rate changes by multiplying with a random variable (gamma distributed)
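The third model can be illustrated with a short simulation of the rate along a single branch. This is a sketch under stated assumptions, not the procedure from the cited paper: rate-change events arrive as a Poisson process with intensity `event_rate`, each event multiplies the current rate by a gamma variable, and the gamma is given mean 1 here (shape `shape`, scale 1/`shape`) purely for illustration. All names and parameter values are hypothetical.

```python
import random

def simulate_branch(duration, base_rate, event_rate, shape, rng):
    """Return (amount of evolution, final rate) along one branch."""
    time, rate, amount = 0.0, base_rate, 0.0
    while True:
        # Waiting time to the next rate-change event (never, if event_rate is 0)
        wait = rng.expovariate(event_rate) if event_rate > 0 else float("inf")
        if time + wait >= duration:
            amount += rate * (duration - time)
            return amount, rate
        amount += rate * wait
        time += wait
        # Multiply the rate by a gamma variable with mean shape * (1/shape) = 1
        rate *= rng.gammavariate(shape, 1.0 / shape)

rng = random.Random(1)
print(simulate_branch(duration=10.0, base_rate=0.5, event_rate=0.3, shape=2.0, rng=rng))
```

Setting `event_rate` to 0 recovers the perfect clock, where the amount of evolution is simply rate times time.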

Adaptive Evolution (Yang, Swanson, Nielsen, ...)
Models with positive selection.
Positive selection is interesting as it signals functional change and could at times be correlated with change between species.

Summary
Combinatorics of Trees
Principles of Phylogeny Inference: Distance, Parsimony, Probabilistic Methods
Applications: Clocks, Selection