Lab 11 :Test of Neutrality and Evidence for Selection

Slides:



Advertisements
Similar presentations
Lab 9: Linkage Disequilibrium. Goals 1.Estimation of LD in terms of D, D’ and r 2. 2.Determine effect of random and non-random mating on LD. 3.Estimate.
Advertisements

Lab 10: Mutation, Selection and Drift
Lab 10: Mutation, Selection and Drift. Goals 1.Effect of mutation on allele frequency. 2.Effect of mutation and selection on allele frequency. 3.Effect.
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Lab 3 : Exact tests and Measuring of Genetic Variation.
Lab 3 : Exact tests and Measuring Genetic Variation.
Lab 4: Inbreeding and Kinship. Inbreeding Causes departure from Hardy-Weinburg Equilibrium Reduces heterozygosity Changes genotype frequencies Does not.
Sampling distributions of alleles under models of neutral evolution.
Modeling Populations forces that act on allelic frequencies.
Natural Selection on the Olfactory Receptor Gene Family in Humans and Chimpanzee Chloe Lee.
Section 3 Characterizing Genetic Diversity: Single Loci Gene with 2 alleles designated “A” and “a”. Three genotypes: AA, Aa, aa Population of 100 individuals.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
ATG GAG GAA GAA GAT GAA GAG ATC TTA TCG TCT TCC GAT TGC GAC GAT TCC AGC GAT AGT TAC AAG GAT GAT TCT CAA GAT TCT GAA GGA GAA AAC GAT AAC CCT GAG TGC GAA.
Supplementary Fig.1: oligonucleotide primer sequences.
Atelier INSERM – La Londe Les Maures – Mai 2004
Signatures of Selection
Pattern of similarity between Europeans and Neanderthals Green et al. Science 328, 710 (2010)
Detection of domestication genes and other loci under selection.
Forward Genealogical Simulations Assumptions:1) Fixed population size 2) Fixed mating time Step #1:The mating process: For a fixed population size N, there.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Model Selection Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence Analysis Technical.
From population genetics to variation among species: Computing the rate of fixations.
Bruce Walsh, University of Arizona
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Scott Williamson and Carlos Bustamante
Salit Kark Department of Evolution, Systematics and Ecology The Silberman Institute of Life Sciences The Hebrew University of Jerusalem Conservation Biology.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Model Selection Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence Analysis Technical.
IGEM Arsenic Bioremediation Possibly finished biobrick for ArsR by adding a RBS and terminator. Will send for sequencing today or Monday.
Hidenki Innan and Yuseob Kim Pattern of Polymorphism After Strong Artificial Selection in a Domestication Event Hidenki Innan and Yuseob Kim A Summary.
Molecular phylogenetics
Genetic Variations Lakshmi K Matukumalli. Human – Mouse Comparison.
Lecture 21: Tests for Departures from Neutrality November 9, 2012.
Medical Genetics 08 基因变异的群体行为 Population Genetics.
Lab 12. Linkage Disequilibrium November 28, 2012.
HARDY-WEINBERG EQUILIBRIUM
The Evolution of Populations.  Emphasizes the extensive genetic variation within populations and recognizes the importance of quantitative characteristics.
Chapter 7 Population Genetics. Introduction Genes act on individuals and flow through families. The forces that determine gene frequencies act at the.
Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium November 12, 2012.
Lab 11 :Test of Neutrality and Evidence for Selection.
Population assignment likelihoods in a phylogenetic and demographic model. Jody Hey Rutgers University.
Supplemental Table S1 For Site Directed Mutagenesis and cloning of constructs P9GF:5’ GAC GCT ACT TCA CTA TAG ATA GGA AGT TCA TTT C 3’ P9GR:5’ GAA ATG.
Rates and Fitness Effects of Mutations Adam Eyre-Walker (University of Sussex)
1 Evolutionary Change in Nucleotide Sequences Dan Graur.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Estimating evolutionary parameters for Neisseria meningitidis Based on the Czech MLST dataset.
Lab 7. Estimating Population Structure. Goals 1.Estimate and interpret statistics (AMOVA + Bayesian) that characterize population structure. 2.Demonstrate.
Selectionist view: allele substitution and polymorphism
Lab 9: Linkage Disequilibrium. Goals 1.Estimation of LD in terms of D, D’ and r 2. 2.Determine effect of random and non-random mating on LD. 3.Estimate.
Lecture 20 : Tests of Neutrality
Lab 7. Estimating Population Structure
NEW TOPIC: MOLECULAR EVOLUTION.
By Mireya Diaz Department of Epidemiology and Biostatistics for EECS 458.
Genomics of Adaptation
Lab 4: Inbreeding and Kinship. Inbreeding Reduces heterozygosity Does not change allele frequencies.
Testing the Neutral Mutation Hypothesis The neutral theory predicts that polymorphism within species is correlated positively with fixed differences between.
Single Nucleotide Polymorphisms (SNPs) By Amira Jhelum Rahul Shweta.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
Human survivorship Developed Developing Bob May (2007), TREE 22:
Lecture 21: Introduction to Phylogenetics November 9, 2015.
Lecture 3 - Concepts of Marine Ecology and Evolution II 3) Detecting evolution: HW Equilibrium Principle -Calculating allele frequencies, predicting genotypes.
Robert Page Doctoral Student in Dr. Voss’ Lab Population Genetics.
Fig. 1. Genomic structure of the csd gene in A
A Genetic Analysis of the Local Rana sylvatica Population
Why study population genetic structure?
Signatures of Selection
Neutrality Test First suggested by Kimura (1968) and King and Jukes (1969) Shift to using neutrality as a null hypothesis in positive selection and selection.
Linkage and Linkage Disequilibrium
Sequence Alignments—part 2
Diversity and selection of the MHC class II genes in canids
Lecture 4: Testing for Departures from Hardy-Weinberg Equilibrium
Molecular evolution: traditional tests of neutrality
Presentation transcript:

Lab 11 :Test of Neutrality and Evidence for Selection

Goals: Calculate the expected number of different alleles for a given locus in a population. Detect departure from neutrality using: Ewens- Watterson test. Tajima’s D test. HKA test and Synonymous and Nonsynonymous nucleotide substitution test

Infinite Alleles Model (IAM) Each mutation produces a new allele At equilibrium: number of alleles – constant shape of allele frequency distribution – constant frequency of any given allele – not constant lost alleles replaced by new mutations

Ewens -Watterson test Expected homozygosity under mutation-drift equilibrium and assuming IAM: Expected homozygosity under HWE: P-value < 0.025: Too even -> Balancing selection or recent bottleneck P-value > 0.975: Too uneven -> Directional selection or population growth

Problem 1. Estimates of the long-term effective population size of human populations vary widely, ranging from as low as ~3,000 to as high as ~100,000. To estimate allele frequencies for a forensic identification study, you are genotyping individuals selected at random from a population with an estimated Ne = 7,500. You are using one allozyme and one microsatellite marker, with estimated mutation rates  = 0.810-6 and  = 9.210-2, respectively. How many different alleles do you expect to find for each marker in a sample of: 7 people? 12 people? List assumptions that need to be made for these calculations to be valid. Discuss which of these assumptions (if any) seem reasonable.

Pairwise DNA sequence diversity

Tajima’s D Under neutrality, we expect the following: Two different ways of estimating the same parameter Deviation of these two indicates deviation from neutral expectations Assumes neutral alleles and constant population size (Based on nucleotide diversity) (Based on number of polymorphic sites)

Tajima’s D test . Under neutrality: d =  − S = 0 D =

plantsciences.ucdavis.edu

Problem 2. File aspen_phy Problem 2. File aspen_phy.arp (which is already in Arlequin format) contains sequence data from exon 1 of the phytochrome B2 (phyB2) gene of 24 aspen (Populus tremula) trees sampled along a wide latitudinal gradient in Europe. Use Arlequin to: Determine the number of polymorphic sites (S) and calculate the nucleotide diversity () based on these sequences. Perform the tests of neutrality developed by Ewens-Watterson and Tajima and interpret the results. Provide a statistical and a biological interpretation of the results from the two neutrality tests. Is selection the only possible reason for significant discrepancies between  and S? Describe at least one other scenario in which a force other than selection would make D significantly different from 0. Include only a summary of the Arlequin output in your report (i.e., values of S and , as well as test statistics and p-values for the two neutrality tests). Do not include the entire output.

Hudson-Kreitman-Aguade (HKA) test p437 Hedrick 2005   Adh Control locus Polymorphism within species a 0.101 0.022 Divergence between species b 0.056 0.052 Ratio (within/between) 1.80 0.42 χ2 6.09 p-value 0.016

For each of these regions of tb1 and for each subspecies: Problem 3. Files utr_mays.arp, utr_par.arp, exon_mays.arp, and exon_par.arp contain sequence data from the 5’ untranslated region and from an exon of the teosinte branched1(tb1) gene of maize (Zea mays ssp. mays) and its most likely wild progenitor Zea mays ssp. parviglumis. For each of these regions of tb1 and for each subspecies: a) Use Arlequin to determine the number of segregating sites (S) and calculate the nucleotide diversity (). What can you infer by comparing nucleotide diversity between the two species for each region? b) Use Arlequin to perform the tests of neutrality developed by Ewens-Watterson and Tajima. Interpret and discuss the results both statistically and biologically. File Region of tb1 Subspecies utr_mays.arp 5’ untranslated region mays utr_par.arp parviglumis exon_mays.arp exon exon_par.arp

Problem 3 continue. C) Interpret and discuss the results from the following 2 HKA tests: Test A tb1 5’ untranslated region Average of control loci Polymorphism within subspecies 0.00093 0.01996 Divergence between subspecies 0.05255 0.02242 χ2 13.58   p-value 0.001 Test B tb1 translated region Average of control loci Polymorphism within subspecies 0.00243 0.01996 Divergence between subspecies 0.01273 0.02242 χ2 2.70   p-value 0.26

Synonymous and Nonsynonymous Nucleotide Substitutions

Actual synonymous substitutions: 1 seq1 Sj Nj seq2 AGT 0.333 2.667 AGC TTT TTA 0.667 2.333 CCA 1 2 GGG GCG CTA 1.333 1.667 Total 3.999 11.001 4.333 10.667 S F P G L 5’-AGT TTT CCA GGG CTA-3’ 5’-AGC TTA CCA GCG CTA-3’ S L P A L Actual synonymous substitutions: 1 Actual nonsynonymous substitutions: 2 𝑑𝑆= 1 (3.999+4.333)/2 =0.24 𝑑𝑁= 2 (11.001+10.667)/2 =0.1846 ω= 𝑑𝑁 𝑑𝑆 =0.1846/0.24= 0.77

Problem 4. Calculate the ω = dN/dS ratio based on the following 2 DNA sequences:   5’-ATG GTT CAT TTT ACC GGA CGA AGT CGA TTA-3’ 5’-ATG GAT CAC TTG ACC GCA CGA AGT AGA TTA-3’ What does the value of ω indicate?

Problem 5. GRADUATE STUDENTS ONLY: Search the literature for an example of an application of one of the tests for departures from neutrality. Describe the question that the test is addressing, the results, and the authors’ interpretation of the results. Receive two points of extra credit if you can find a case in which the test is inappropriately applied and/or interpreted. Please send the paper to Ran when you submit your report.