Presentation is loading. Please wait.

Presentation is loading. Please wait.

Molecular & Genetic Epi 217 Association Studies: Indirect John Witte.

Similar presentations


Presentation on theme: "Molecular & Genetic Epi 217 Association Studies: Indirect John Witte."— Presentation transcript:

1 Molecular & Genetic Epi 217 Association Studies: Indirect John Witte

2 Homework, Question 4: Haplotypes IDMTHFR_C677TMTHFR_A1298CHaplotypes? 959CCAAC-A / C-A 1044CCACC-A / C-C 147CTAAC-A / T-A 123CTACC-A / T-C or C-C / T-A Genotypes 677TT and 1298CC never observed together: Suggests most Probable haplotype, and potential selection or chance. Rare variants: not necessarily lethal, especially those that are associated with late onset diseases.

3 3 SNPs in the TAS2R38 Gene P AV AVIAVI P A I A AV P V I P VV A A I A VV

4 TASR: 3 SNPs form Haplotypes PAVPAV AVIAVI Taster Non-taster

5 TAS2R38 Haplotype Function

6 IDTaster rs10246939rs1726866rs713598 HaplotypesAmino Acid 100CTAGCGCGG*/TACPAV/AVI 121CTAGCGCGG*/TACPAV/AVI 141..... 170CCGG CGG/CGGPAV/PAV 191CTAGCGCGG*/TACPAV/AVI 201CTAGCGCGG*/TACPAV/AVI 22.TTAACCTAC/TACAVI/AVI 241CCGG CGG/CGGPAV/PAV 26.CTAGCGCGG*/TACPAV/AVI 281CTAGCGCGG*/TACPAV/AVI 291CCGGCGCGG/CGCPAV/PAI 300TTAACCTAC/TACAVI/AVI 311CCGG CGG/CGGPAV/PAV TASR Genotyping Results

7 Too many MTHFR SNPs Solution: Tag SNP Selection  SNPs are correlated (aka Linkage Disequilibrium) Carlson et al. (2004) AJHG 74:106 high r 2 AAAA TTTT G C C G A CCCCCC G C C G T CCCCCC GGGG AAAA A/T 1 G/A 2 G/C 3 T/C 4 G/C 5 A/C 6 Pairwise Tagging: SNP 1 SNP 3 SNP 6 3 tags in total Test for association: SNP 1 SNP 3 SNP 6

8 Coverage: Measurement Error in TagSNPs

9 Common Measures of Coverage Threshold Measures –e.g., 73% of SNPs in the complete set are in LD with at least one SNP in the genotyping set at r 2 > 0.8 Average Measures –e.g., Average maximum r 2 = 0.84

10 Coverage and Sample Size Sample size required for Direct Association, n Sample size for Indirect Association n* = n/ r 2 For r 2 = 0.8, increase is 25% For r 2 = 0.5, increase is 100%

11 Tag SNPs Database Resources http://www.hapmap.org http://gvs.gs.washington.edu/GVS/index.jsp

12 HapMap Re-sequencing to discover millions of additional SNPs; deposited to dbSNP. SNPs from dbSNP were genotyped Looked for 1 SNP every 5kb SNP Validation –Polymorphic –Frequency Haplotype and Linkage Disequilibrium Estimation –LD tagging SNPs

13 HapMap Phase III Populations ASW African ancestry in Southwest USA CEU Utah residents with Northern and Western European ancestry from the CEPH collection CHB Han Chinese in Beijing, China CHD Chinese in Metropolitan Denver, Colorado GIH Gujarati Indians in Houston, Texas JPT Japanese in Tokyo, Japan LWK Luhya in Webuye, Kenya MEX Mexican ancestry in Los Angeles, California MKK Maasai in Kinyawa, Kenya TSI Toscani in Italia YRI Yoruba in Ibadan, Nigeria

14 Tag SNPs: HapMap

15

16 Tag SNPs: HapMap & Haploview http://www.broad.mit.edu/mpg/haploview/

17 Tag SNPs: HapMap & Haploview

18

19

20

21  Identified 33 common MTHR SNPs (MAF > 5%) among Caucasians  Forced in 3 potentially functional/previously associated SNPs  Identified tag based on pairwise tagging  15 tags SNPs could capture all 33 MTHR SNPs (mean r2 = 97%)  Note: number of SNPs required varies from gene to gene and from population to population Tag SNPs: HapMap Summary

22 1K Genomes Project

23 Genome-wide Assocation Studies (GWAS)

24 1,2,3,………………………,N 1,2,3,……………………………, M SNPs Samples One-Stage Design Stage 1 Stage 2  samples  markers Two-Stage Design 1,2,3,……………………………, M SNPs Samples 1,2,3,………………………,N One- and Two-Stage GWA Designs

25 SNPs Samples Replication-based analysis SNPs Samples Stage 1 Stage 2 One-Stage Design Joint analysis SNPs Samples Stage 1 Stage 2 Two-Stage Design

26 Multistage Designs Joint analysis has more power than replication p-value in Stage 1 must be liberal Lower cost—do not gain power http://www.sph.umich.edu/csg/abecasis/CaTS/index.html

27 Complex diseases Diabetes Obesity Diet Physical activity Hypertension Hyperlipidemia Vulnerable plaques Atherosclerosis MI Genetic susceptibility Complex diseases: Many causes = many causal pathways!

28 Pathways Many websites / companies provide ‘dynamic’ graphic models of molecular and biochemical pathways. Example: BioCarta: http://www.biocarta.com/http://www.biocarta.com/ May be interested in potential joint and/or interaction effects of multiple genes in one pathway.

29 Interactions “The interdependent operation of two or more causes to produce or prevent an effect” “Differences in the effects of one or more factors according to the level of the remaining factor(s)” Last, 2001 AAAaaa BBAt risk No risk BbAt risk No risk bbNo risk

30 Why look for interactions? Improve detection of genetic (& environmental) risks. Understand etiology/biology New hypotheses? Diagnostics Prevention and interventions

31 Dilution of effects OR=1.5 5.2 2.1 0.1 2.8 Drinker? Micronutrient X 2.7 0.6 Environmental exposure Y Gene A 19 0.1 25 21 0.2 0.1 16 Other gene Z Within particular subgroups, effect of gene may be quite high or low

32 Statistical vs. Biological Interactions Not identical. One hypothesizes biological interaction But ‘tests’ for statistical interaction Does statistical evidence support our biological hypothesis?

33 Multiplicative vs. Additive Interactions gG e1.01.4 E2.02.4 gG e1.01.4 E2.02.8 gG e1.01.4 E2.07.8 Multiplicative “effect” (ORs, RRs) Multiplicative interaction (ORs, RRs) 2.8/2.0 1.4/1.0  = = 1.0 7.8/2.0 1.4/1.0  = = 2.8 Departure from =1 is a multiplicative interaction Additive “effect” RER = (OR(E,G)-1)/((OR(E,g)-1)+(OR(e,G)-1)) = (2.4-1)/((2.0-1)+(1.4-1)) = 1.0 RER = relative excess risk

34 Brennan, P. Carcinogenesis 2002 23:381-387 Two possible causal pathways: additive and multiplicative interaction for colorectal cancer Additive interaction: G1 and E5: independent risk factors Multiplicative interaction: G2 and E2: work through same pathway If factors are not known to act independently, use multiplicative.

35 Analysis of Multiple Genes Joint / Additive Multiplicative Increasing complexity

36 More Complex Modeling Multifactor-dimensionality reduction –(Moore & Williams, Ann Med 2002) Logic regression –(Kooperberg & Ruczinski, Genetic Epi 2005) Multi-loci analysis –(Marchini, Donnelly, Cardon, Nat Genet 2005) Bayesian epistasis association mapping –(Zhang & Liu, Nat Genet 2007)

37

38


Download ppt "Molecular & Genetic Epi 217 Association Studies: Indirect John Witte."

Similar presentations


Ads by Google