DIVERSIFYING SELECTION AND FUNCTIONAL CONSTRAINT

Slides:



Advertisements
Similar presentations
IMPRS workshop Comparative Genomics 18 th -21 st of February 2013 Lecture 4 Positive selection.
Advertisements

Evolution of genomes.
Quick Lesson on dN/dS Neutral Selection Codon Degeneracy Synonymous vs. Non-synonymous dN/dS ratios Why Selection? The Problem.
Background The demographic events experienced by populations influence their genealogical history and therefore the pattern of neutral polymorphism observable.
Recombination and genetic variation – models and inference
A New Nonparametric Bayesian Model for Genetic Recombination in Open Ancestral Space Presented by Chunping Wang Machine Learning Group, Duke University.
Atelier INSERM – La Londe Les Maures – Mai 2004
Maximum Likelihood. Likelihood The likelihood is the probability of the data given the model.
14 Molecular Evolution and Population Genetics
Association Mapping of Complex Diseases with Ancestral Recombination Graphs: Models and Efficient Algorithms Yufeng Wu UC Davis RECOMB 2007.
Molecular Evolution with an emphasis on substitution rates Gavin JD Smith State Key Laboratory of Emerging Infectious Diseases & Department of Microbiology.
Scott Williamson and Carlos Bustamante
Positive selection A new allele (mutant) confers some increase in the fitness of the organism Selection acts to favour this allele Also called adaptive.
Molecular Clocks, Base Substitutions, & Phylogenetic Distances.
Molecular Evolution, Part 2 Everything you didn’t want to know… and more! Everything you didn’t want to know… and more!
Adaptive Molecular Evolution Nonsynonymous vs Synonymous.
Chapter 3 Substitution Patterns Presented by: Adrian Padilla.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
In the deterministic model, the time till fixation depends on the selective advantage, but fixation is guaranteed.
Molecular basis of evolution. Goal – to reconstruct the evolutionary history of all organisms in the form of phylogenetic trees. Classical approach: phylogenetic.
Gil McVean Department of Statistics, Oxford Approximate genealogical inference.
Phylogenetics and Coalescence Lab 9 October 24, 2012.
Eric C. Rouchka, University of Louisville Sequence Database Searching Eric Rouchka, D.Sc. Bioinformatics Journal Club October.
TOWARDS TESTING THE EPIDEMIC CLONE MODEL OF BACTERIAL PATHOGENS Daniel J. Wilson, Gilean A.T. McVean and Martin C.J. Maiden Peter Medawar Building for.
PAML: Phylogenetic Analysis by Maximum Likelihood Ziheng Yang Depart of Biology University College London
Introduction to Bioinformatics.
Identifying and Modeling Selection Pressure (a review of three papers) Rose Hoberman BioLM seminar Feb 9, 2004.
Estimating evolutionary parameters for Neisseria meningitidis Based on the Czech MLST dataset.
Confidence Intervals Population Mean σ 2 Unknown Confidence Intervals Population Proportion σ 2 Known Copyright © 2013 Pearson Education, Inc. Publishing.
Selectionist view: allele substitution and polymorphism
N=50 s=0.150 replicates s>0 Time till fixation on average: t av = (2/s) ln (2N) generations (also true for mutations with negative “s” ! discuss among.
NEW TOPIC: MOLECULAR EVOLUTION.
By Mireya Diaz Department of Epidemiology and Biostatistics for EECS 458.
Evolution of individual genes in humans
Chapter 17 How to read a table of codons. These are two forms in which you might see a table of codons.
Modelling evolution Gil McVean Department of Statistics TC A G.
LBA ProtPars. LBA Prot Dist no Gamma and no alignment.
Lecture 6 Genetic drift & Mutation Sonja Kujala
Sequence similarity, BLAST alignments & multiple sequence alignments
BAE 6520 Applied Environmental Statistics
ESTIMATION.
IMa2(Isolation with Migration)
DIVERSIFYING SELECTION AND FUNCTIONAL CONSTRAINT: ESTIMATING
BAE 5333 Applied Water Resources Statistics
Molecular mechanism of mutation
Fig. 1. —The potential action of selection on a genealogy
Signatures of Selection
Neutrality Test First suggested by Kimura (1968) and King and Jukes (1969) Shift to using neutrality as a null hypothesis in positive selection and selection.
Linkage and Linkage Disequilibrium
The Neutral Theory M. Kimura, 1968
Very important to know the difference between the trees!
Models of Sequence Evolution
Mutations. Mutations Mutation A permanent, heritable change in the DNA of an organism. One or several nucleotides can be added, deleted, or replaced.
Molecular basis of evolution.
Schematic of Eukaryotic Protein-Coding Locus
What are the Patterns Of Nucleotide Substitution Within Coding and
The genetic code © 2016 Paul Billiet ODWS.
Statistical Modeling of Ancestral Processes
More about Posterior Distributions
Evolution and Spread of Ebola Virus in Liberia, 2014–2015
Methods used to study mutations
Molecular Clocks Rose Hoberman.
Chapter Three Amino Acids and Peptides
Levels of Protein Structure
Pedir alineamiento múltiple
Translation.
DN/dS.
_____ _____ are _____ by _____ ____ sequences.
MULTIPLE SEQUENCE ALIGNMENT
Mutations.
Presentation transcript:

DIVERSIFYING SELECTION AND FUNCTIONAL CONSTRAINT ESTIMATING THE dN/dS RATIO FOR GENE SEQUENCES IN THE PRESENCE OF RECOMBINATION Danny Wilson 12th October 2004

Menu Codon-based models of molecular evolution An new method for estimating omega with recombination Does it work? Simulation studies and example data

Codon-based models of molecular evolution Part one Codon-based models of molecular evolution

Sampling usually occurs at this point i.e. post-selection Ancestral type Neutral mutant Inviable mutant Sampling usually occurs at this point i.e. post-selection Mutation Selection Underlying rates of non-synonymous mutation are usually confounded with selection against inviable mutants. Thus it is convenient to model functional constraint as mutational bias. (Or rather, make no attempt to disentangle the two).

Types of single nucleotide mutation Transitions vs. transversions Purine Transitions Transversions T C Pyramidine Transitions For any base there are always 2 possible transversions and 1 possible transition.

Types of codon mutation Synonymous vs. non-synonymous G A Leucine T G A Leucine Methionine Synonymous Non-synonymous Leucine pH 5.98 6-fold degeneracy in the genetic code Methionine pH 5.74 Single unique codon ATG CH3-S-(CH2)2-CH(NH2)-COOH (CH3)2-CH-CH2-CH(NH2)-COOH

Example: CTT C T T T T T A T T G T T T C T T A T T G T T T C T T A T T Phe Non-synonymous transition wkm Ile Non-synonymous transversion wm Val Ser Tyr Cys Leu Synonymous transversion m km A T T Leucine G T T T C T T A T T G T T T C T T A T T G

Nielsen and Yang (1998) codon-based model of molecular evolution Mutation rate Synonymous transversion m Synonymous transition km Non-synonymous transversion wm Non-synonymous transition wkm Other Interpretation k Transition-transversion ratio w = dN/dS Relative viability of non-synonymous mutations

codeML Pros Viable method for detecting mode of selection on a codon sequence Cons Categorizes possible values for omega into a small number of discrete intervals Results can be misleading in the presence of recombination

An new method for estimating omega with recombination Part two An new method for estimating omega with recombination

Inference with recombination

Li and Stephens (2003) Approximation to the likelihood

Li and Stephens (2003) Approximation to the likelihood TTTGATACTGTTGCCGAAGGTTTGGGCGAAATTCGCGATTTATTGCGCCGTTATCATCAT TTTGATACCGTTGCCGAAGGTTTGGGTGAAATTCGCGATTTATTGCGCCGTTACCACCGC TTTGATACCGTTGCCGAAGGTTTGGGTAAAATTCGCGATTTATTGCGCCGTTACCACCGC TTTGATACCGTTGCCGAAGGTTTGGGCGAAATTCGTGATTTATTGCGCCGTTATCATCAT

Li and Stephens (2003) Approximation to the likelihood TTTGATACTGTTGCCGAAGGTTTGGGCGAAATTCGCGATTTATTGCGCCGTTATCATCAT TTTGATACCGTTGCCGAAGGTTTGGGTGAAATTCGCGATTTATTGCGCCGTTACCACCGC TTTGATACCGTTGCCGAAGGTTTGGGTAAAATTCGCGATTTATTGCGCCGTTACCACCGC

My modification to Li and Stephens(2003)

Estimating variable omega The problem A constant omega model is prone to averaging positive and negative omegas in a gene Allowing every site its own omega leaves little information for inference The solution A change-point model where windows of adjacent sites share the same omega

Estimating variable omega MCMC moves: Change omega for a single block Extend a block 5’ or 3’ Split an existing block Merge adjacent blocks w1 w2 w3 w4 w5

Does it work? Simulation studies and example data Part three Does it work? Simulation studies and example data

Posterior distribution for known and unknown genealogy

Posterior distribution for known and unknown genealogy

Neutral dataset True omega Posterior mean Posterior HPD interval

Non-neutral dataset True omega Posterior mean Posterior HPD interval

HIV envelope gene Slow Non-Progressors vs Rapid Progressors

HIV envelope gene Slow Non-Progressors vs Rapid Progressors

Neisseria meningitidis PorB3

Neisseria meningitidis PorB3 95% HPD Upper 0.0386 95% HPD Lower 0.0187

Work in progress… Variable recombination rate Model indels Falsifiability test Test for sensitivity to rate heterogeneity

Acknowledgements Gil McVean (Supervisor) Martin Maiden (Supervisor) Ziheng Yang Rachel Urwin (meninge data) Charlie Edwards (HIV data)