Lab 3 : Exact tests and Measuring Genetic Variation.

Slides:



Advertisements
Similar presentations
How do we know if a population is evolving?
Advertisements

1 BI3010H08 Population genetics Halliburton chapter 9 Population subdivision and gene flow If populations are reproductible isolated their genepools tend.
PV92 PCR/Informatics Kit
Lab 10: Mutation, Selection and Drift
Lab 10: Mutation, Selection and Drift. Goals 1.Effect of mutation on allele frequency. 2.Effect of mutation and selection on allele frequency. 3.Effect.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Lab 3 : Exact tests and Measuring of Genetic Variation.
Alleles = A, a Genotypes = AA, Aa, aa
Hypothesis Testing Steps in Hypothesis Testing:
Chi square.  Non-parametric test that’s useful when your sample violates the assumptions about normality required by other tests ◦ All other tests we’ve.
The Proper Conclusion to a Significance Test. Luke Wilcox is an “acorn” at the 2013 AP Statistics reading. After three days of scoring, his table leader.
BIOE 109 Summer 2009 Lecture 5- Part I Hardy- Weinberg Equilibrium.
 Read Chapter 6 of text  Brachydachtyly displays the classic 3:1 pattern of inheritance (for a cross between heterozygotes) that mendel described.
Section 3 Characterizing Genetic Diversity: Single Loci Gene with 2 alleles designated “A” and “a”. Three genotypes: AA, Aa, aa Population of 100 individuals.
Data Analysis Statistics. Inferential statistics.
Inferences About Means of Two Independent Samples Chapter 11 Homework: 1, 2, 3, 4, 6, 7.
The Simple Regression Model
Chapter Topics Types of Regression Models
Data Analysis Statistics. Inferential statistics.
 Read Chapter 6 of text  We saw in chapter 5 that a cross between two individuals heterozygous for a dominant allele produces a 3:1 ratio of individuals.
Introducing the Hardy-Weinberg principle The Hardy-Weinberg principle is a mathematical model used to calculate the allele frequencies of traits with dominant.
11.4 Hardy-Wineberg Equilibrium. Equation - used to predict genotype frequencies in a population Predicted genotype frequencies are compared with Actual.
Hypothesis Testing in Linear Regression Analysis
Animal Breeding and Genetics
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
Biodiversity IV: genetics and conservation
1 Ch6. Sampling distribution Dr. Deshi Ye
Population Genetics Learning Objectives
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
HARDY-WEINBERG EQUILIBRIUM
How to: Hardy - Weinberg
How do we know if a population is evolving?
Chapter 7 Population Genetics. Introduction Genes act on individuals and flow through families. The forces that determine gene frequencies act at the.
Population genetics and Hardy-Weinberg equilibrium.
Course outline HWE: What happens when Hardy- Weinberg assumptions are met Inheritance: Multiple alleles in a population; Transmission of alleles in a family.
Genes within Populations. What is a population? How are populations characterized? What does it mean to be diploid, haploid, polyploid? How can we characterize.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Lecture 5: Genetic Variation and Inbreeding August 31, 2015.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Monday, September 10, 2012.
Lab 7. Estimating Population Structure. Goals 1.Estimate and interpret statistics (AMOVA + Bayesian) that characterize population structure. 2.Demonstrate.
Allele Frequencies: Staying Constant Chapter 14. What is Allele Frequency? How frequent any allele is in a given population: –Within one race –Within.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 12. The Chi-Square Test.
Chi Square & Correlation
Lab 7. Estimating Population Structure
Godfrey Hardy ( ) Wilhelm Weinberg ( ) Hardy-Weinberg Principle p + q = 1 Allele frequencies, assuming 2 alleles, one dominant over the.
Lecture 11. The chi-square test for goodness of fit.
1.Stream A and Stream B are located on two isolated islands with similar characteristics. How do these two stream beds differ? 2.Suppose a fish that varies.
Mammalian Population Genetics
Lab 4: Inbreeding and Kinship. Inbreeding Reduces heterozygosity Does not change allele frequencies.
Chi square and Hardy-Weinberg
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson0-1 Supplement 2: Comparing the two estimators of population variance by simulations.
Hardy-Weinberg Equilibrium When mating is completely random, the zygotic frequencies expected in the next generation may be predicted from the knowledge.
Modern Evolutionary Biology I. Population Genetics A. Overview Sources of VariationAgents of Change MutationN.S. Recombinationmutation - crossing over.
Population Genetics I. Basic Principles. Population Genetics I. Basic Principles A. Definitions: - Population: a group of interbreeding organisms that.
Lecture 5: Genetic Variation and Inbreeding September 7, 2012.
Lecture 3 - Concepts of Marine Ecology and Evolution II 3) Detecting evolution: HW Equilibrium Principle -Calculating allele frequencies, predicting genotypes.
Hardy-Weinberg Theorem
Population Genetics: Selection and mutation as mechanisms of evolution
Since everything is a reflection of our minds,
Allele Frequencies Genotype Frequencies The Hardy-Weinberg Equation
Lecture 4: Testing for Departures from Hardy-Weinberg Equilibrium
Chapter 10 Analyzing the Association Between Categorical Variables
Modern Evolutionary Biology I. Population Genetics
Modern Evolutionary Biology I. Population Genetics
Hardy-Weinberg Lab Data
Presentation transcript:

Lab 3 : Exact tests and Measuring Genetic Variation

χ 2 - test Where,

Chi-square Assumptions : 1.Finite # of observations. 2.Observations are independent. 3.Samples collected randomly. 4.Large sample size (>20; >50)

Example: Suppose you caught 5 Bluegill fish and detected two alleles (A1 and A2) and observed that all 5 fish were A1A2 heterozygotes. Calculate allele frequencies and do a χ 2 – test to determine whether the population is in HWE. GenotypeObservedExpected A1A A1A A2A χ25 Conclusion: Reject H 0 at α = 0.05, because calculated χ 2 -value (=5) is more than critical χ 2 - value with 1 d.f. (≈ 3.84) i.e. Bluegill population is not in HWE.

Why is the previous conclusion not reliable? Because it violates the assumption of large sample size. As a rule of thumb, the Chi-square test should not be used when the expected number for any genotype class is less than 5.

Exact Test 1.Calculate the probability of observing N11=0, N12=5, N22=0 under HWE using the multinomial probability equation. 2. Generate all possible permutations of 5 A1 alleles and 5 A2 alleles into 3 genotypes i.e. 10! =3,628, Calculate probability of observing each of these samples under HWE using multinomial probability equation. 4. Determine proportion of samples, whose probability is ≤ If proportion (p-value) is less than 0.05, then reject Ho at α = 0.05.

Genotype 12345N 11 N 12 N 22 Probability A1 B1 A1 B A1 B1 A1 B A1B1 A1B1A1 B1A A1B1 A1 B1A1B1A1B B1A1 B1A1 B A1 B1A1B1A1B1 A B1A1 B1 A1B1 A B1A1 B1A1B1A1 B A1 B1A1B1 A A1B1 A1B1A1 B B1 A1B1A1 B1A B1 A1 B1 A A1 B1 A1B B1A1B1A1 B1A1 B B1A1B1 A1 B1A1 B A1B1 A1 B1 A B1A1 B1 A1B1A1 B B1A1B1A1 B1 A1B B1A1 B1A1 B A1 B1A1 B A1 B1A1B1 A1B1A1B B1 A1 B1A1B A1B1 A B1 A1B1A1 B B1 A1 B1A1B1 A B1 A1 B1A1 B1A A1 B1A1B1 A1 B A1 B1 A1 B A1B1A1 B1A1B1 A A1B1A1B1A1B1A1B1 A

p – value = [Sample with probability ≤ ] / [Total # of sample] = 3/30 = 0.10 Conclusion: The p-value is more than 0.05, therefore we fail to reject H 0 i.e. The bluegill population is in HWE at α = 0.05

Generation of all possible samples and calculation of probability for each sample is computationally intensive. It will require too much time and is practically impossible for large samples. In practice, exact tests are done by sampling a distribution generated from a Markov Chain (beyond the scope of this course).

Measures of Genetic Variation 1. Heterozygosity (Gene diversity). 2. Number of alleles (Allele diversity). 3. Effective number of alleles. 4. Percentage of polymorphic loci.

1. Heterozygosity (Gene diversity) -Most commonly used measure of genetic variation. -Can be thought of as the probability that a randomly sampled individual will have two different alleles (will be heterozygous) at a given locus -Observed heterozygosity (H O ) = Proportion of heterozygotes in a sample. -Expected heterozygosity(H E ) = Heterozygosity expected under HWE. = Expected homozygosity under HWE = p p p …….+ p n 2 For small sample size(< 50), unbiased H E can be calculated by :

2. Number of Alleles (N a ): - Number of alleles present at a locus in a population. - Also called allele diversity. - Strongly influenced by sample size. 3. Effective number of Alleles (N e ): The number of alleles a population would have if all alleles were at equal frequency 4. Proportion of polymorphic loci (P) : - Not so useful for highly variable loci like Microsatellites. - Locus selection bias

GenAlEx

Problem 1. Use GenAlEx to perform the following analyses based on the human SSR data: a.Calculate the genetic variation measures H O, H E, N a, and N e for all loci in all populations. Include the estimated values of these measures for all loci in a population you will be assigned during the lab. What can you conclude about the allele frequencies of the 10 loci by comparing N a to N e ? b.Calculate the average H O and H E across loci for your assigned population. Can you predict anything about the test of HWE based on these values? c.Perform a Chi-square test of HWE for all loci in all populations. Include a summary of the test for your assigned population in the lab report. How do you interpret the results of this test?

Problem 2: Perform an exact test of HWE for all loci and all populations using Arlequin. Include the results for your assigned population in the lab report. a.How do these results compare to those from the Chi-square test and why? Which test do you trust more? b.Why might some populations have significant departures from Hardy-Weinberg expectations, while others do not? c.GRADUATE STUDENTS ONLY: Find an example from the literature of a human population with genotype frequencies that violate Hardy-Weinberg expectations. What is the main cause of this deviation?