Statistical Genomics Zhiwu Zhang Washington State University Lecture 9: Linkage Disequilibrium.

Slides:



Advertisements
Similar presentations
BIOL EVOLUTION AT MORE THAN ONE GENE SO FAR Evolution at a single locus No interactions between genes One gene - one trait REAL evolution: 10,000.
Advertisements

Two-locus systems. Scheme of genotypes genotype Two-locus genotypes Multilocus genotypes genotype.
Allele Frequencies and the Gene Pool
 Establishes a benchmark from a non- evolving population in which to measure an evolving population.  Investigates the properties of populations that.
The Hardy-Weinberg Equilibrium Allele Frequencies in a Population G.H. Hardy English Mathematician Dr. Wilhelm Weinberg German Physician.
BMI 731- Winter 2005 Chapter1: SNP Analysis Catalin Barbacioru Department of Biomedical Informatics Ohio State University.
Section 3 Characterizing Genetic Diversity: Single Loci Gene with 2 alleles designated “A” and “a”. Three genotypes: AA, Aa, aa Population of 100 individuals.
Chapter 2: Hardy-Weinberg Gene frequency Genotype frequency Gene counting method Square root method Hardy-Weinberg low Sex-linked inheritance Linkage and.
MALD Mapping by Admixture Linkage Disequilibrium.
Microevolution Chapter 18 contined. Microevolution  Generation to generation  Changes in allele frequencies within a population  Causes: Nonrandom.
Essentials of Biology Sylvia S. Mader
THE EVOLUTION OF POPULATIONS
Population Genetics (Ch. 16)
Population genetics and speciation
Population Genetics A.The Hardy-Weinberg principle B.Factors that can change allele frequencies.
Population Genetics.
PROCESS OF EVOLUTION I (Genetic Context). Since the Time of Darwin  Darwin did not explain how variation originates or passed on  The genetic principles.
Brachydactyly and evolutionary change
Lecture 2: Basic Population and Quantitative Genetics.
The Hardy-Weinberg Equilibrium
Population Genetics and Evolution. Darwin’s Observations (review) Galapagos Islands Many similar species had slight differences Favorable variations allow.
POPULATION GENETICS & SPECIATION
Lab 12. Linkage Disequilibrium November 28, 2012.
Genetic Drift Random change in allele frequency –Just by chance or chance events (migrations, natural disasters, etc) Most effect on smaller populations.
The Hardy-Weinberg Principles Changing Populations.
Maintaining Genetic Variation (Population Equilibrium) Populations have TWO competing factors: Remaining stable (not evolving) vs Changing (evolving)
HARDY-WEINBERG CALCULATIONS Evolution & Homeostasis 2012.
Chapter 16 Section 1: Genetic Equilibrium. Variation of Traits In a Population Population Genetics Population Genetics –Microevolution vs. macroevolution.
Unit 11 7F Analyze and evaluate the effects of other evolutionary mechanisms, including genetic drift, gene flow, mutation, and recombination.  
How do we know if a population is evolving?
PBG 650 Advanced Plant Breeding Module 1: Introduction Population Genetics – Hardy Weinberg Equilibrium – Linkage Disequilibrium.
Genetic Linkage. Two pops may have the same allele frequencies but different chromosome frequencies.
Evolution as Genetic Change in Populations. Learning Objectives  Explain how natural selection affects single-gene and polygenic traits.  Describe genetic.
The Evolution of Populations Chapter 21. Microevolution Evolutionary changes within a population  Changes in allele frequencies in a population over.
Mechanisms of Evolution 16.1 Causes of microevolution.
AP Biology Lab 7: Genetics (Fly Lab). AP Biology Lab 7: Genetics (Fly Lab)  Description  given fly of unknown genotype use crosses to determine mode.
INTRODUCTION TO ASSOCIATION MAPPING
1 Population Genetics Definitions of Important Terms Population: group of individuals of one species, living in a prescribed geographical area Subpopulation:
End Show Slide 1 of 40 Copyright Pearson Prentice Hall 16-2 Evolution as Genetic Change.
Lab 9: Linkage Disequilibrium. Goals 1.Estimation of LD in terms of D, D’ and r 2. 2.Determine effect of random and non-random mating on LD. 3.Estimate.
Population Genetics Measuring Evolutionary Change Over Time.
The Evolution of Populations Chapter Weaknesses  He didn’t know how heritable traits pass from one generation to the next  Although variation.
1 1 Population Genetics _aIocyHc Bozeman..7:39min. _aIocyHc
Population Genetics Genetic structure of a population.
POINT > Define Hardy-Weinberg Equilibrium POINT > Use Hardy-Weinberg to determine allele frequencies POINT > Define “heterozygous advantage” POINT > Describe.
Statistical Genomics Zhiwu Zhang Washington State University Lecture 7: Impute.
Population Genetics Measuring Evolutionary Change Over Time.
Statistical Genomics Zhiwu Zhang Washington State University Lecture 5: Linear Algebra.
Statistical Genomics Zhiwu Zhang Washington State University Lecture 4: Statistical inference.
What is the Hardy-Weinberg Theorem? The principle states that allele and genotype frequencies in a population will remain constant from generation to generation.
HS-LS-3 Apply concepts of statistics and probability to support explanations that organisms with an advantageous heritable trait tend to increase in proportion.
Lecture 10: GWAS by correlation
Genetic Linkage.
Lecture 4: Statistical inference
Measuring Evolutionary Change Over Time
Hardy Weinberg Equilibrium, Gene and Genotypic frequencies
Lecture 10: GWAS by correlation
What we know….
Evolution as Genetic Change
Modelling Effects at Multiple Loci
Genetic Linkage.
Washington State University
The ‘V’ in the Tajima D equation is:
Basic concepts on population genetics
Mechanisms of Evolution
Genetic Linkage.
Washington State University
Chapter State Standard: 8a. Students know how natural selection determines the differential survival of groups of organisms. Objectives: How does.
Modern Evolutionary Biology I. Population Genetics
Hardy-Weinberg Lab Data
Presentation transcript:

Statistical Genomics Zhiwu Zhang Washington State University Lecture 9: Linkage Disequilibrium

 Homework 2, due Feb 17, Wednesday, 3:10PM  Add page and line numbers on reports  Midterm exam: February 26, Friday, 50 minutes (3:35- 4:25PM), 25 questions.  Final exam: May 3, 120 minutes (3:10-5:10PM) for 50 questions. Administration

Outline  Trait-marker association  Hardy-Weinberg principle  Linkage an recombination  LD measurements  D  D’  R2  Causes of LD  LD decade

AATTSUM Herbicide Resistant Non herbicide Resistant SUM Observed and expected frequency AATTSUM Herbicide Resistant Non herbicide Resistant SUM

 Poisson distribution: Mean=Var=Expected  (Observed-Expected)/Sqrt(Expected) ~ N(0,1)  SUM(Observed-Expected) 2 / Expected ~ X 2 (df)  df=number of independent cells  df=1 for two marker loci (approximation). Approximate Distributions

AATTSUM Herbicide Resistant Non herbicide Resistant SUM Observed and expected frequency AATTSUM Herbicide Resistant Non herbicide Resistant SUM /28+49/12+49/42+49/18=9.72

P value by using R par(mfrow=c(2,2),mar = c(3,4,1,1)) x=rchisq(10000,1) d=density(x) plot(x) plot(d) hist(x) plot(ecdf(x)) 1-pchisq(9.72,1) index=x>9.72 length(x[index])/10000

Permutation test t=100 s=sample(4,t,replace=T) x=table(s) P(>9.72)= xc=rchisq(10000,1) plot(density(x2),col="blue") lines(density(xc),col="red") index=x2>9.72 length(x2[index])/10000 x2=replicate(10000,{ }) fh=(x[1]+x[3])/t fa=(x[1]+x[2])/t e1=t*fh*fa e2=t*(1-fh)*fa e3=t*fh*(1-fa) e4=t*(1-fh)*(1-fa) e=c(e1,e2,e3,e4) d=(x-e)^2/e sum(d)

AATTSUM Herbicide Resistant Non herbicide Resistant SUM Association scale AATTSUM Herbicide Resistant Non herbicide Resistant SUM Stronger

AATTSUM Herbicide Resistant Non herbicide Resistant SUM Observed and expected frequency AATTSUM Herbicide Resistant Non herbicide Resistant SUM /14+25/6+25/21+25/9=9.92 (similar to weaker association) Observed Expected

 No indication on association scales: LD  Not for continued traits: GWAS Problems with Chi-square association test

The Hardy–Weinberg principle  Allele and genotype frequencies in a population will remain constant from generation to generation in the absence of other evolutionary influences.  These influences include non-random mating, mutation, selection, genetic drift, gene flow and meiotic drive.  f(A)=p, f(a)=q, then f(AA)=p 2, f(aa)=q 2, f(Aa)=2pq

Linkage equilibrium Random join between alleles at two or more loci P AB =P A P B D (ifference)=0

Linkage Disequilibrium (LD) Loci and allele AaBb frequency Gametic type ABAbaBab Observed D=P AB -P A P B =P ab -P a P b =-(P Ab -P A P b ) =-(P aB -P a P B ) Frequency equilibrium Difference

D parameter  Deviation of gamete frequency from the random association  Positive if product of frequencies of coupling gametes minus the product of repulsion gametes  Negative, otherwise

D depends on allele frequency  Vary even with complete LD  P Ab =P aB =0  P AB =1-P ab =P A =P B  D=P A -P A P A

Property of D  Deviation between observed and expected  Extreme values: and 0.25  Non LD: D=0  Dependency on allele frequency

D’  Lewontin (1964) proposed standardizing D to the maximum possible value it can take:  D’=D/D Max =0.08/0.18=0.44  D max : the maximum D for given allele frequency  D max = min(P A P B, P a P b ) if D is negative, or min(P A P b, P a P B ) if D is positive  Range of D’: -1 to 1

R2R2  Hill and Robertson (1968) proposed the following measure of linkage disequilibrium:  r 2 (Δ 2 )=D 2 /(P A P B P a P b )  Square makes positive  The product of allele frequency creates penalty for 50% allele frequency.  Range: 0 to 1

Causes of LD  Mutation  Selection  Inbreeding  Genetic drift  Gene flow/admixture

Mutation and selection A____qA____Q A____q A____Q A____q A____Q A____q A____QA____q Generation 1 Generation 2 Generation 3 mutation A____q Selection

 c: recombination rate  D t =D 0 (1-c) t  t=log(D t /D 0 )/log(1-c)  if c=10%, it takes 6.5 generation for D to be cut in half  if two SNPs 1kb apart  1Mb=1cM,  c=10 -2 /10 6 =10 -8 /bp=10 -5 /kb  It takes 69,319 generations for D to be cut in half Change in D over time

t=seq(1:50) D0=.25 c=.01 Dt=(1-c)^t*D0 plot(t,Dt,type="l",col="red",ylim=c(0,.25)) c=.05 Dt=(1-c)^t*D0 lines(t,Dt,type="l",col="blue") c=.1 Dt=(1-c)^t*D0 lines(t,Dt,type="l",col="green") c=.25 Dt=(1-c)^t*D0 lines(t,Dt,type="l",col="black")

LD decay over distance

Highlight  Trait-marker association  Hardy-Weinberg principle  Linkage an recombination  LD measurements  D  D’  R2  Causes of LD  LD decade