Presentation is loading. Please wait.

Presentation is loading. Please wait.

Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.

Similar presentations


Presentation on theme: "Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001."— Presentation transcript:

1 Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001

2 Linkage Mapping  Compares inheritance pattern of trait with the inheritance pattern of chromosomal regions  First gene-mapping in 1913 (Sturtevant)  Uses naturally occurring DNA variation (polymorphisms) as genetic markers  >400 Mendelian (single gene) disorders mapped  Current challenge is to map QTLs

3 Linkage = Co-segregation A2A4A2A4 A3A4A3A4 A1A3A1A3 A1A2A1A2 A2A3A2A3 A1A2A1A2 A1A4A1A4 A3A4A3A4 A3A2A3A2 Marker allele A 1 cosegregates with dominant disease

4 Recombination A1A1 A2A2 Q1Q1 Q2Q2 A1A1 A2A2 Q1Q1 Q2Q2 A1A1 A2A2 Q1Q1 Q2Q2 Likely gametes (Non-recombinants) Unlikely gametes (Recombinants) Parental genotypes

5 Recombination of three linked loci (1-  1 )(1-  2 )  1  2 (1-  1 )  2  1 (1-  2 ) 1212

6 Map distance Map distance between two loci (Morgans) = Expected number of crossovers per meiosis Note: Map distances are additive

7 Recombination & map distance Haldane map function

8 Methods of Linkage Analysis  Model-based lod scores  Assumes explicit trait model  Model-free allele sharing methods  Affected sib pairs  Affected pedigree members  Quantitative trait loci  Variance-components models

9 Double Backcross : Fully Informative Gametes AaBb aabb AABB aabb AaBbaabb Aabb aaBb Non-recombinantRecombinant

10 Linkage Analysis : Fully Informative Gametes Count DataRecombinant Gametes: R Non-recombinant Gametes: N ParameterRecombination Fraction:  LikelihoodL(  ) =  R (1-  ) N Parameter Chi-square

11 Phase Unknown Meioses AaBb aabb AaBbaabb Aabb aaBb Non-recombinantRecombinant Non-recombinant Either : Or :

12 Linkage Analysis : Phase-unknown Meioses Count DataRecombinant Gametes: X Non-recombinant Gametes: Y orRecombinant Gametes: Y Non-recombinant Gametes: X LikelihoodL(  ) =  X (1-  ) Y +  Y (1-  ) X An example of incomplete data : Mixture distribution likelihood function

13 Parental genotypes unknown Likelihood will be a function of allele frequencies (population parameters)  (transmission parameter) AaBbaabb Aabb aaBb

14 Trait phenotypes Penetrance parameters Genotype Phenotype f2f2 AA aa Aa Disease Normal f1f1 f0f0 1- f 2 1- f 1 1- f 0 Each phenotype is compatible with multiple genotypes.

15 General Pedigree Likelihood Likelihood is a sum of products (mixture distribution likelihood) number of terms = (m 1, m 2 …..m k ) 2n where m j is number of alleles at locus j

16 Elston-Stewart algorithm Reduces computations by Peeling: Step 1 Condition likelihoods of family 1 on genotype of X. 1 2 X Step 2 Joint likelihood of families 2 and 1

17 Lod Score: Morton (1955) Lod > 3  conclude linkage Prior odds linkage ratioPosterior odds 1:50100020:1 Lod <-2  exclude linkage

18 Linkage Analysis Admixture Test Model Probabilty of linkage in family =  Likelihood L( ,  ) =  L(  ) + (1-  ) L(  =1/2)

19 Allele sharing (non-parametric) methods Penrose (1935): Sib Pair linkage For rare diseaseIBD Concordant affected Concordant normal Discordant Therefore Affected sib pair design Test H 0 : Proportion of alleles IBD =1/2

20 Affected sib pairs: incomplete marker information Parameters: IBD sharing probabilities Z=(z 0, z 1, z 2 ) Marker Genotype Data M: Finite Mixture Likelihood SPLINK, ASPEX

21 Joint distribution of Pedigree IBD  IBD of relative pairs are independent e.g If IBD(1,2) = 2 and IBD (1,3) = 2 then IBD(2,3) = 2  Inheritance vector gives joint IBD distribution Each element indicates whether paternally inherited allele is transmitted (1) ormaternally inherited allele is transmitted (0)  Vector of 2N elements (N = # of non-founders)

22 Pedigree allele-sharing methods Problem APM: Affected family members Uses IBS ERPA: Extended Relative Pairs AnalysisDodgy statistic Genehunter NPL: Non-Parametric LinkageConservative Genehunter-PLUS: Likelihood (“tilting”) All these methods consider affected members only

23 Convergence of parametric and non-parametric methods  Curtis and Sham (1995) MFLINK: Treats penetrance as parameter Terwilliger et al (2000) Complex recombination fractions Parameters with no simple biological interpretation

24 Quantitative Sib Pair Linkage X, Y standardised to mean 0, variance 1 r = sib correlation V A = additive QTL variance (X-Y) 2 = 2(1-r) – 2V A (  -0.5) +  Haseman-Elston Regression (1972) Haseman-Elston Revisited (2000) XY = r + V A (  -0.5) + 

25 Improved Haseman-Elston  Sham and Purcell (2001)  Use as dependent variable Gives equivalent power to variance components model for sib pair data

26 Variance components linkage  Models trait values of pedigree members jointly  Assumes multivariate normality conditional on IBD  Covariance between relative pairs = Vr + V A [  -E(  )] WhereV = trait variance r = correlation (depends on relationship) V A = QTL additive variance E(  ) = expected proportion IBD

27  QTL linkage model for sib-pair data P T1 QS N P T2 QSN 1 [0 / 0.5 / 1] nqsnsq

28 No linkage

29 Under linkage

30 Incomplete Marker Information  IBD sharing cannot be deduced from marker genotypes with certainty  Obtain probabilities of all possible IBD values Finite mixture likelihood Pi-hat likelihood

31  QTL linkage model for sib-pair data P T1 QS N P T2 QSN 1 nqsnsq

32 Conditioning on Trait Values Usual test Conditional test Z i = IBD probability estimated from marker genotypes P i = IBD probability given relationship

33 QTL linkage: some problems  Sensitivity to marker misspecification of marker allele frequencies and positions  Sensitivity to non-normality / phenotypic selection  Heavy computational demand for large pedigrees or many marker loci  Sensitivity to marker genotype and relationship errors  Low power and poor localisation for minor QTL


Download ppt "Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001."

Similar presentations


Ads by Google