Statistical Modeling of Ancestral Processes

Slides:



Advertisements
Similar presentations
Exact Computation of Coalescent Likelihood under the Infinite Sites Model Yufeng Wu University of Connecticut DIMACS Workshop on Algorithmics in Human.
Advertisements

Genetic Statistics Lectures (5) Multiple testing correction and population structure correction.
The Coalescent Theory And coalescent- based population genetics programs.
Gene tree analyses of Aboriginal Australians Rosalind Harding University of Oxford.
Background The demographic events experienced by populations influence their genealogical history and therefore the pattern of neutral polymorphism observable.
Amorphophallus titanum Largest unbranched inflorescence in the world Monecious and protogynous Carrion flower (fly/beetle pollinated) Indigenous to the.
Sampling distributions of alleles under models of neutral evolution.
Lecture 23: Introduction to Coalescence April 7, 2014.
Atelier INSERM – La Londe Les Maures – Mai 2004
Forward Genealogical Simulations Assumptions:1) Fixed population size 2) Fixed mating time Step #1:The mating process: For a fixed population size N, there.
Islands in Africa: a study of structure in the source population for modern humans Rosalind Harding Depts of Statistics, Zoology & Anthropology, Oxford.
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Exact Computation of Coalescent Likelihood under the Infinite Sites Model Yufeng Wu University of Connecticut ISBRA
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
From population genetics to variation among species: Computing the rate of fixations.
Association Mapping of Complex Diseases with Ancestral Recombination Graphs: Models and Efficient Algorithms Yufeng Wu UC Davis RECOMB 2007.
Polymorphism Structure of the Human Genome Gabor T. Marth Department of Biology Boston College Chestnut Hill, MA
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
Dispersal models Continuous populations Isolation-by-distance Discrete populations Stepping-stone Island model.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Scott Williamson and Carlos Bustamante
Inferring human demographic history from DNA sequence data Apr. 28, 2009 J. Wall Institute for Human Genetics, UCSF.
Human Migrations Saeed Hassanpour Spring Introduction Population Genetics Co-evolution of genes with language and cultural. Human evolution: genetics,
Monte Carlo methods for estimating population genetic parameters Rasmus Nielsen University of Copenhagen.
Inference of Genealogies for Recombinant SNP Sequences in Populations Yufeng Wu Computer Science and Engineering Department University of Connecticut
Lecture 13 – Performance of Methods Folks often use the term “reliability” without a very clear definition of what it is. Methods of assessing performance.
RECOMB Satellite Workshop, 2007 Algorithms for Association Mapping of Complex Diseases With Ancestral Recombination Graphs Yufeng Wu UC Davis.
Haplotype Blocks An Overview A. Polanski Department of Statistics Rice University.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Gil McVean Department of Statistics, Oxford Approximate genealogical inference.
Trees & Topologies Chapter 3, Part 1. Terminology Equivalence Classes – specific separation of a set of genes into disjoint sets covering the whole set.
Simon Myers, Gil McVean Department of Statistics, Oxford Recombination and genetic variation – models and inference.
Coalescent Models for Genetic Demography
MAT 4830 Mathematical Modeling
Estimating evolutionary parameters for Neisseria meningitidis Based on the Czech MLST dataset.
Population genetics. coalesce 1.To grow together; fuse. 2.To come together so as to form one whole; unite: The rebel units coalesced into one army to.
Figure 5.1 Giant panda (Ailuropoda melanoleuca)
By Mireya Diaz Department of Epidemiology and Biostatistics for EECS 458.
Testing the Neutral Mutation Hypothesis The neutral theory predicts that polymorphism within species is correlated positively with fixed differences between.
Restriction enzyme analysis The new(ish) population genetics Old view New view Allele frequency change looking forward in time; alleles either the same.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College
Bayesian Evolutionary Analysis by Sampling Trees (BEAST) LEE KIM-SUNG Environmental Health Institute National Environment Agency.
Fixed Parameters: Population Structure, Mutation, Selection, Recombination,... Reproductive Structure Genealogies of non-sequenced data Genealogies of.
A Little Intro to Statistics What’s the chance of rolling a 6 on a dice? 1/6 What’s the chance of rolling a 3 on a dice? 1/6 Rolling 11 times and not getting.
Our Current Understanding of Human Demographic History and Migrations NeandertalModern Homo Sapiens.
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Molecular Evolution and Population Genetics A few notes on population genetics of interest in phylogenetics Thomas Mailund.
An Algorithm for Computing the Gene Tree Probability under the Multispecies Coalescent and its Application in the Inference of Population Tree Yufeng Wu.
Fig. 1. —The potential action of selection on a genealogy
Polymorphism Polymorphism: when two or more alleles at a locus exist in a population at the same time. Nucleotide diversity: P = xixjpij considers.
COALESCENCE AND GENE GENEALOGIES
Simple Linear Regression - Introduction
Making Statistical Inferences
Sample vs Population comparing mean and standard deviations
Populations What is a population? population – consists of all the
Testing the Neutral Mutation Hypothesis
Confidence Intervals Chapter 10 Section 1.
BI820 – Seminar in Quantitative and Computational Problems in Genomics
The coalescent with recombination (Chapter 5, Part 1)
Henry R. Johnston, David J. Cutler 
There is a Great Diversity of Organisms
Incorporating changing population size into the coalescent
David H. Spencer, Kerry L. Bubb, Maynard V. Olson 
John Wakeley, Rasmus Nielsen, Shau Neen Liu-Cordero, Kristin Ardlie 
Maternal History of Oceania from Complete mtDNA Genomes: Contrasting Ancient Diversity with Recent Homogenization Due to the Austronesian Expansion  Ana T.
Human Evolution: Thrifty Genes and the Dairy Queen
Notes: Sample Means
Sampling Distributions
Ecological Level of Organization
Presentation transcript:

Statistical Modeling of Ancestral Processes Based on a Review by: N. Rosenberg and M. Nordborg

Method: Phylogenetics or Genetic Analysis Goal: to understand demographic history of humans based on polymorphism data Since Polymorphisms are Random Processes, then they can be studied by their statistical properties Method: Phylogenetics or Genetic Analysis Phylogenetics/Species Tree Genetic Anaylsis/Gene Tree

What do Genetic Methods do? Genealogical methods estimate parameters of random genealogical processes that give rise to each tree Different sites can have different geneologies

Coalescence is used to establish a Model For a Model you need: Coalescence, Mutation & Recombination You tabulate the statistical properties of your data and compare that against known data Simulate Data See which pattern your observed data corresponds to (see which prediction it looks like)

The Model is relative to what you are trying to model Statistical Properties can vary - this is an example of an Average polymorphism rate Polymorphisms are proportionate to population size You may not want to look at the polymorphisms that are average. So, you would expand the parameters of your model to see if you data is expanded, collapsed, or bottlenecked The Model is relative to what you are trying to model