Download presentation
Presentation is loading. Please wait.
1
Correlation for a pair of relatives
2 G 1 P h e i g gi = correlation in genetic values for the ith type of relative hi = correlation in environmental values for the ith type of relative
2
Two Generic Methods: (1) Genetic Epidemiology (2) Molecular approach
Unmeasured genotypes Use correlations between informative relatives (e.g., twins adoptees) (2) Molecular approach Measured genotypes (almost always SNPs) Several approaches, two most common = GCTA analysis and LD score regression
3
Twin Method: 1 Three Structural Equations:
π
ππ =1.0 β 2 + π ππ π 2 π
π·π = πΎ π·π β 2 + π π·π π 2 1= β 2 + π 2 (1) (2) (3) Problem: 3 Equations but 5 unknowns Solution: Make Assumptions Additive gene action, no assortative mating, therefore πΎ π·π =0.5 Equal environments assumption: π ππ = π π·π = π
4
Twin Method: 2 Three Structural Equations Rewritten: Solution:
π
ππ =1.0 β 2 +π π 2 π
π·π =0.5 β 2 +π π 2 1= β 2 + π 2 (1) (2) (3) Solution: (1.A) Subtract Eq (2) from Eq (1) π
ππ β π
π·π = β 2 +π π 2 β0.5 β 2 βπ π 2 =0.5 β 2 (1.B) Multiply both sides by 2 2(π
ππ β π
π·π )= β 2
5
Twin Method: 3 Three Structural Equations Rewritten: Solution:
π
ππ =1.0 β 2 +π π 2 π
π·π =0.5 β 2 +π π 2 1= β 2 + π 2 (1) (2) (3) Solution: (2) Substitute the estimate of h2 into Eq (3) π 2 =1β β 2 (3) Substitite the estimates of h2 and e2 into either Eq (1) or (2) π=( π
ππ β β 2 )/ π 2 π=( π
π·π β 0.5β 2 )/ π 2
6
Adoption Method (1) RBioSibs = .24 = .5h2 + he2
(2) RAdpSibs = .06 = he2 (3) h2 + e2 = 1 Solution: (1) Subtract Equation (2) from Equation (1): RBioSibs = .24 = .5h2 + he2 - RAdpSibs = -.06 = he = .5h2 (2) Multiply this result by 2: 2(.18) = 2(.5h2), so .36 = h2 (3) Substitute this quantity into Equation (3): .36 + e2 = 1, so e2 = .64 (4) Substitute the results from steps (2) and (3) into Equation (1) & solve for h: .06 = h(.64), so h = .06/.64 = .09
7
GCTA Analysis Select random individuals from the general population
Genotype on a large number of loci Compute the genetic similarity for between each pair of individuals Those pairs with high genetic similarity should have more similar phenotypes than those with low genetic similarity πΏ=π¨ π π΄ +π« π ππ =( π π βπ)( π π βπ) π΄ ππ = correlation between additive genetic values for ijth pair D = diagonal matrix of residual effects VA = additive genetic variance
8
Linkage Disequilibrium (LD) Score Regression
Logic = a causal variant in a haplotype block in strong disequilibrium is more more likely to have a high association with each loci than one in a block with weak disequilibrium. Block 3: High Probability Block 2: Medium Probability Block 1: Low Probability Bulaik-Sullivan et al. (2015), Nat. Genetics, 47,
9
Linkage Disequilibrium (LD) Score Regression
So, compute a LD score for each locus. For the ith locus, the LD score equals the sum of the squared correlations with all the loci in the block, or β π = π π ππ 2 The larger the value of β π , the greater the chance of a causal variant, so regress the observed π 2 for each locus on its β π value. Bulaik-Sullivan et al. (2015), Nat. Genetics, 47,
10
Linkage Disequilibrium (LD) Score Regression
πΈ π 2 β π = π β 2 β π π +ππ+1 N = sample size M = number of loci h2 = heritability a = confounding effects (e.g., pop stratification) β π = π π ππ 2 π ππ 2 = squared correlation for all j loci in LD with the ith locus Bulaik-Sullivan et al. (2015), Nat. Genetics, 47,
11
Linkage Disequilibrium (LD) Score Regression
πΈ π 2 β π = π β 2 β π π +ππ+1 Although it may not look like it, this equation is a linear regression equation. The dependent variable is the π 2 , the independent variable is β π , the intercept is Na + 1, and the slope is Nh2/M. Because we know N and M, we can calculate h2. Bulaik-Sullivan et al. (2015), Nat. Genetics, 47,
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.