Download presentation
Presentation is loading. Please wait.
Published byGodwin Stafford Modified over 9 years ago
1
Tutorial #10 by Ma’ayan Fishelson
2
Classical Method of Linkage Analysis The classical method was parametric linkage analysis the Lod-score method. This method has been very successful in mapping both Mendelian disease genes and DNA markers. In order to use this method the transmission mode of all loci analyzed needs to be specified allele frequencies, penetrance probabilities, # of loci affecting the trait..
3
Mode of Inheritance
4
Why Is Non-Parametric Linkage Analysis Needed ? When the disease model is complex (non-Mendelian disease) it is difficult to estimate penetrance values and allele frequencies. It is possible that there are several loci influencing the disease, some more influential than others, some dominant and some recessive. It is possible that different modes of transmission operate in different families (heterogeneity). The argument is that if the transmission model is specified incorrectly the results produced by the Lod-score method are invalid this method should not be used when analyzing a disease with unknown inheritance mode.
5
Proposed Solution A variety of methods for testing for linkage have been developed. These methods are termed nonparametric or “model-free”. When using these methods, the parameters defining the transmission model don’t need to be specified.
6
Affected Sib-Pair Analysis 12 34 1/3 4/6 1/6 Both siblings inherited the 1 allele from the father (1) and the 6 allele from the mother (2). What is the probability for this according to Mendelian Segregation rules ?
7
Average No. of Shared alleles Between Sib-Pairs Under the Null Hypothesis Assume the parental genotypes are (g 1,g 2 ) and (g 3,g 4 ). Assume sib1 has genotype (g 1,g 3 ). Pr(sib2 has allele g 1 ) = Pr(sib2 has allele g 2 ) = 0.5 Pr(sib2 has allele g 3 ) = Pr(sib2 has allele g 4 ) = 0.5 Pr(sib2 has (g 1,g 3 )) = Pr(IBD=2) = 0.5 2 = 0.25 Pr(sib2 has (g 2,g 4 )) = Pr(IBD=0) = 0.5 2 = 0.25 Pr(sib2 has (g1,g4)) = Pr(sib2 has (g2,g3)) = 0.5 2 = 0.25 Pr(IBD=1) = Pr(sib2 has (g1,g4)) + Pr(sib2 has (g2,g3)) = 0.5
8
Average No. of Shared alleles Between Sib-Pairs According to Mendelian Segregation rules, the probability that a pair of siblings would share: –Both marker alleles 0.25 –One marker allele 0.5 –No marker allele 0.25 The average number of shared alleles is: (0.25 * 2) + (0.5 * 1) + (0.25 * 0) = 1 However, if the marker is linked to the disease locus the overall amount of sharing between affected sibs will be increased !
9
Affected Sib-Pair Analysis Main Idea: if a marker is linked to a disease locus the same marker allele will be inherited by two siblings who are both ill more often than expected by chance (if they were unlinked). Disease Locus Close to disease locus (θ<<0.5): affected sibs share on average more than 1 allele IBD if the genetic component is strong. Far away (θ=0.5): affected sibs share on average 1 marker allele IBD.
10
Identity-by-Descent (IBD) Identity-by-State (IBS) IBD Two alleles are IBD if they have the same ancestral origin. IBS Two alleles are IBS if they are of the same type. 12 34 1/4 2/4 4/14/2 The 2 sibs have 1 allele in common (4), but each one got it from a different parent. IBS-count : 1 IBD-count: 0
11
IBS ↔ IBD 12 34 1/3 1/2 1/1 Both children got 1 from the father (1) and from the mother (2) 2 alleles IBS, 2 alleles IBD IBS = ? IBD = ?
12
IBS ↔ IBD 12 34 1/1 2/3 1/31/2 Both children got 1 from the father (1), but we don’t know if they got the same allele 1 allele IBS, 1 or 0 alleles IBD IBS = ? IBD = ?
13
IBS ↔ IBD 12 34 1/2 Both children got allele 1 and allele 2, but we don’t know from which parent they got each allele 2 allele IBS, 0 or 2 alleles IBD IBS = 2 0, probability 0.5 IBD = 1, probability 0 2, probability 0.5 IBS = ? IBD = ?
14
IBD & IBS Probabilities IBD sharing probabilities: 0.25, 0.5 and 0.25. IBS sharing probabilities: depend on the allele frequencies of the marker, since allele pairs are more likely to be IBS for a common allele than a rare one.
15
Shortcomings of IBS-based Analyses Have less power than IBD-based analyses, because parental information isn’t available. May be biased if incorrect marker allele frequencies are used.
16
IBD sharing Binomial Distribution Assume a sib-pair, and some locus.. The probability is 0.5 that they got 2 alleles IBD from their father. The same is true for the 2 alleles received from the mother. Let N be the total number of alleles shared IBD for the sib-pair. N can be viewed as the number of successes in two experiments, where success means the parent passes on 2 alleles IBD to the sib-pair. The probability of success is 0.5
17
IBD Sharing + ASP We now wish to compute the conditional probability of N given that the two sibs are affected and the locus is the disease locus.. The desired probabilities are: z 0 = P(N=0|ASP) z 1 = P(N=1|ASP) z 2 = P(N=2|ASP) –ASP Affected Sib Pair These probabilities depend on the inheritance model of the disease (disease allele frequency, penetrance value…).
18
P( N=k): k = 0, 1, 2 ? Possible parental disease locus genotypes: The corresponding genotypes under the assumptions of HWE and independence between the parents:
19
P( N=k): k = 0, 1, 2 ? P(C i )Mating Type q4q4 aa,aaC1C1 4pq 3 Aa,aaC2C2 4p 2 q 2 Aa,AaC3C3 2p 2 q 2 AA,aaC4C4 4p 3 qAA,AaC5C5 p4p4 AA,AAC6C6 Assume a genetic model of a recessive disease with full penetrance: f =(0,0,1). Explanation: both affected sibs must have 2 disease alleles and these Must be of different parental origin…
20
P( N=k): k = 0, 1, 2 ? P(C i )Mating Type q4q4 aa,aaC1C1 4pq 3 Aa,aaC2C2 4p 2 q 2 Aa,AaC3C3 2p 2 q 2 AA,aaC4C4 4p 3 qAA,AaC5C5 p4p4 AA,AAC6C6 In a similar way: P = 0.001 P(N=0) = 0.000 P(N=1) = 0.002 P(N=2) = 0.998
21
P( N=k): k = 0, 1, 2 ? Assume now that the locus x is not the disease locus… Now (z 0, z 1, z 2 ) depend on how closely linked the locus x is to the disease locus. If x is unlinked to the disease locus, the fact that the sib-pair is affected gives no extra information..
22
IBD Probabilities as a Function of the Distance from the Disease Locus At the disease locus: z 1 = 0.15 and z 2 = 0.8. Conditional IBD probabilities approach the values under H0 (z 1 =0.5, z 2 =0.25), as the distance from the disease locus increases.
23
ASP Analysis 1.Collect affected sib-pairs. 2.Genotype all 4 members of each sib-ship. 3.Estimate the conditional IBD probabilities Ψ = (z 0, z 1, z 2 ). 4.Compare with the IBD probabilities under the null hypothesis of no linkage: z H0 = (0.25, 0.5, 0.25).
24
ASP Analysis – one approach Idea: compare between the expected and observed number of pairs sharing 0,1,and 2 alleles IBD. Test for linkage by computing the chi-squared statistic: χ 2 = (O 2 -E 2 ) 2 /E 2 + (O 1 -E 1 ) 2 /E 1 + (O 0 -E 0 ) 2 /E 0 O 2,O 1, and O 0 are the observed no. of sib-pairs sharing 2,1, or 0 alleles. E 2, E 1, and E 0 are the expected no. of sib-pairs sharing 2, 1, or 0 alleles. Note: there are 2 degrees of freedom.
25
ASP Example In a study of diabetes mellitus 119 sib-pairs are ascertained. Researches have genotyped a candidate locus (FGF3) and would like to determine if there is linkage between IDDM and FGF3. After genotyping they are able to determine that 20 of the sib pairs share 0 alleles IBD, 59 sib-pairs share1 allele IBD and 40 sib-pairs share 2 alleles IBD. Determine if there is significant evidence for linkage between IDDM and FGF3.
26
ASP Analysis – 2 nd approach Idea: compare the observed average number of shared alleles to the expected number of 50%. Test for linkage by computing the chi-squared statistic: χ 2 = (2O 2 +O 1 -N) 2 /N + (2O 0 +O 1 -N) 2 /N where there are N sib-pairs. Note: there is 1 degree of freedom. # of shared alleles # of unshared alleles
27
Question Siblings share 0,1, or 2 alleles IBD at a marker locus with probabilities p 0 =0.25, p 1 =0.5, and p 2 =0.25, respectively under the null hypothesis of no linkage, but what is the corresponding IBD-distribution for father and son ? a.p 0 = 0, p 1 = 1, and p 2 = 0; b.p 0 = 0, p 1 = 0.5, and p 2 = 0.5; c.p 0 = 0.25, p 1 = 0.5, and p 2 = 0.25; d.p 0 = 0.5, p 1 = 0.5, and p 2 = 0;
28
Extended Sib-Pair Analysis If one of the affected sibs or their parents is untyped at the specific locus the probability of each possible genotype for this person at this locus can be computed, and the IBD value can be computed based on these probabilities. P(i alleles IBD) is computed as a sum of the probabilities of all the configurations of the family where i alleles are IBD for the relative pair examined.
29
What is the IBS status of the 2 indicated individuals ? Affected Pedigree Member (APM) Method Aims to detect increased allele-sharing (IBS) between all affected members of a pedigree. The 2 indicated individuals are IBS for two alleles. What is their IBD status ? The 2 indicated individuals are IBD for neither one of the alleles.
30
APM Method – cont. Z ij – a similarity statistic between a relative pair. Assume the genotype of the 1 st relative is (A 1,A 2 ) and the genotype of the 2 nd relative is (B 1,B 2 ). The general similarity statistic is computed as follows:
31
APM Method – Taking Gene Frequencies into Account It is more meaningful that two relatives share a rare allele than a common one. Therefore, weights (based on the allele frequencies) are added: Possible weight functions:
32
APM Method - Weaknesses 1.The actual null distribution may be slightly skewed, while the asymptotic null distribution is normally distributed. 2.The method is very sensitive to misspecification of allele frequencies. 3.Because IBS status is used and not IBD sharing, genotype information in unaffected individuals is ignored, and therefore its power is low compared to linkage methods that use IBD status.
33
SimAPM Marker genotypes in the affected individuals are simulated conditional on the marker genotypes in the unaffecteds, to determine a conditional empirical null distribution. Solves the first two shortcomings of the APM method. The third shortcoming still exists….
34
SimIBD Uses the same principles as SimIPM, except for measuring IBD sharing (and not IBS sharing). 1 5 2 6 43 7 3/3 1/2 1/3 2/3 1/3 Individuals 5, 6 & 7 are affected with the disease. Individuals 5 & 7 share 2 alleles IBS. What about IBD sharing ? Individuals 5 & 7 share 2 alleles IBS. What about IBD sharing ? IBD=1 What about Individuals 5 & 6 ?
35
Recursive Algorithm for Computing P(i x ≡ j y | G) – Boundary Conditions The algorithm is based on 3 boundary conditions, and 2 recurrence rules. Boundary condition 2: If X 1 and X 2 are the two different alleles of founder X, then: Boundary condition 3: If X i is the single allele of person X, then: Boundary condition 1: If X and Y are two distinct individual founders, then:
36
Recursive Algorithm for Computing P(i x ≡ j y | G) – Recurrence Rules 1 The recurrence rules replace each allele of the child B with her parents’ alleles F 1,F 2,M 1, and M 2. Recurrence Rule 1: Given 2 individuals A and B with A≠B, then: otherwise :
37
Recursive Algorithm for Computing P(i x ≡ j y | G) – Recurrence Rules 2 Recurrence Rule 2: If B1 and B2 are two alleles in the same person, then if allele B1 is not IBS to B2, then: otherwise :
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.