Presentation is loading. Please wait.

Presentation is loading. Please wait.

From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics

Similar presentations


Presentation on theme: "From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics"— Presentation transcript:

1 From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics schmitzberger@stanford.edu

2 Topics 1. Altshuler et al. Genetic Mapping in Human Disease. Science 322, 881 (2008); 2. Zacho et al. Genetically Elevated C-Reactive Protein and Ischemic Vascular Disease. N Engl J Med 359, 18 (2008); 3. Jakobsdottir et al. Interpretation of Genetic Association Studies: Markers with Replicated Highly Significant Odds Ratios May Be Poor Classifiers. PLoS Genetics 5, 2 (2009)

3 Topics 1. Altshuler et al. Genetic Mapping in Human Disease. Science 322, 881 (2008); 2. Zacho et al. Genetically Elevated C-Reactive Protein and Ischemic Vascular Disease. N Engl J Med 359, 18 (2008); 3. Jakobsdottir et al. Interpretation of Genetic Association Studies: Markers with Replicated Highly Significant Odds Ratios May Be Poor Classifiers. PLoS Genetics 5, 2 (2009)

4 Genome-wide association studies

5

6 Source: Hardy et al. Genomewide Association Studies and Human Disease. N Eng J Med, 360:1759-1768; 17 (2009)

7 Genome-wide association studies

8 Source: Hardy et al. Genomewide Association Studies and Human Disease. N Eng J Med, 360:1759-1768; 17 (2009)

9 Human Genome Research Over Time Source: Altshuler et al. Genetic Mapping in Human Disease. Science 322, 881 (2008);

10 Linkage Analysis Source: genome.wellcome.ac.uk

11 Human Genome Research Over Time Information source: Altshuler et al. Genetic Mapping in Human Disease. Science 322, 881 (2008);

12 Initial Lessons 1. “Candidate gene” approach inadequate

13 Initial Lessons 2. Mutations that cause disease often change protein structure Hemoglobin subunit beta mutation in sickle-cell disease.

14 Initial Lessons 3. Loci often have many rare disease-causing alleles

15 Initial Lessons 4. 90% of sites of genetic variation are common variants in the population

16 Common disease – common variant (CDCV) Common polymorphisms (minor allele freq > 1%) contributes to susceptibility to disease.

17 Common disease – common variant (CDCV) Common polymorphisms (minor allele freq > 1%) contributes to susceptibility to disease. We can use GWAS to see how common variants contribute to disease. Gives us ideas on which positions to investigate.

18 Tag SNPs Source: The International HapMap Consortium The International HapMap Project Nature Vol 426 18/25 2003

19 Tag SNPs Source: The International HapMap Consortium The International HapMap Project Nature Vol 426 18/25 2003

20 Tag SNPs Source: The International HapMap Consortium The International HapMap Project Nature Vol 426 18/25 2003

21 GWAS – General Lessons Learned 1. GWAS work - 2006: ~two dozen reproducible associations - 2008: >150

22 GWAS – General Lessons Learned 1. GWAS work - 2006: ~two dozen reproducible associations - 2008: >150 2. Effect-sizes are modest for common variants (mostly increases by 1.1-1.5)

23 GWAS – General Lessons Learned 1. GWAS work - 2006: ~two dozen reproducible associations - 2008: >150 2. Effect-sizes are modest for common variants (mostly increases by 1.1-1.5) 3. Power to detect associations has been low

24 GWAS – General Lessons Learned 1. GWAS work - 2006: ~two dozen reproducible associations - 2008: >150 2. Effect-sizes are modest for common variants (mostly increases by 1.1-1.5) 3. Power to detect associations has been low 4. Association studies have identified regions rather than causal genes

25 GWAS – General Lessons Learned 1. GWAS work - 2006: ~two dozen reproducible associations - 2008: >150 2. Effect-sizes are modest for common variants (mostly increases by 1.1-1.5) 3. Power to detect associations has been low 4. Association studies have identified regions rather than causal genes 5. A single locus may contain more than one risk variant

26 GWAS – General Lessons Learned 1. GWAS work - 2006: ~two dozen reproducible associations - 2008: >150 2. Effect-sizes are modest for common variants (mostly increases by 1.1-1.5) 3. Power to detect associations has been low 4. Association studies have identified regions rather than causal genes 5. A single locus may contain more than one risk variant 6. A single locus may contain both common and rare variants

27 GWAS – General Lessons Learned 1. GWAS work - 2006: ~two dozen reproducible associations - 2008: >150 2. Effect-sizes are modest for common variants (mostly increases by 1.1-1.5) 3. Power to detect associations has been low 4. Association studies have identified regions rather than causal genes 5. A single locus may contain more than one risk variant 6. A single locus may contain both common and rare variants 7. There is great variation between ethnic groups

28 Sample size required For P < 10 −8. Source: Altshuler et al.

29 Sample size required For P < 10 −8. Source: Altshuler et al.

30 GWAS – Common Diseases: Lessons Learned 1. The risk for loci already identified by GWAS is currently underestimated due to currently unknown mutations.

31 GWAS – Common Diseases: Lessons Learned 1. The risk for loci already identified by GWAS is currently underestimated due to currently unknown mutations. 2. Many more disease loci remain to be found. (low statistical power with studies so far)

32 GWAS – Common Diseases: Lessons Learned 1. The risk for loci already identified by GWAS is currently underestimated due to currently unknown mutations. 2. Many more disease loci remain to be found. (low statistical power with studies so far) 3. Some loci will only contain rare variants (won’t be found using common polymorphisms)

33 Disease Risk VS Disease Mechanism Primary value of genetic mapping is not risk prediction but gaining knowledge about mechanisms of disease.

34 GWAS: The Path Ahead 1.Increased sample sizes: 1000 cases,1000 controls, 20% variant, 1.3 increase in risk 1% power 5000 cases, 5000 controls 98% power

35 GWAS: The Path Ahead 1.Increased sample sizes: 1000 cases,1000 controls, 20% variant, 1.3 increase in risk 1% power 5000 cases, 5000 controls 98% power 2.Different ancestry groups

36 GWAS: The Path Ahead 1.Increased sample sizes: 1000 cases,1000 controls, 20% variant, 1.3 increase in risk 1% power 5000 cases, 5000 controls 98% power 2.Different ancestry groups 3.Find rare mutations in suspect loci 1000 genomes project

37 Topics 1. Altshuler et al. Genetic Mapping in Human Disease. Science 322, 881 (2008); 2. Zacho et al. Genetically Elevated C-Reactive Protein and Ischemic Vascular Disease. N Engl J Med 359, 18 (2008); 3. Jakobsdottir et al. Interpretation of Genetic Association Studies: Markers with Replicated Highly Significant Odds Ratios May Be Poor Classifiers. PLoS Genetics 5, 2 (2009)

38 C-Reactive Protein (CRP) Elevated levels of CRP lead to increased risk of ischemic heart disease and cerebrovascular disease Studies of >40,000 people with ~4,000 with disease Followed for 12-15 years Measured levels of CRP Genotyping for four CRP polymorphisms

39 Results Increased CRP levels CRP PolymorphismsIncreased CRP levels Increased likelihood of disease

40 Results Increased CRP levels CRP PolymorphismsIncreased CRP levels Increased likelihood of disease

41 Zacho et al.

42

43

44

45

46 Results Increased CRP levels CRP PolymorphismsIncreased CRP levels Increased likelihood of disease

47 Increased CRP levels lead to increased disease risk Zacho et al.

48 Increased CRP levels lead to increased disease risk

49 Increased CRP levels lead to increased disease risk

50 Increased CRP levels lead to increased disease risk

51 Results Increased CRP levels CRP PolymorphismsIncreased CRP levels Increased likelihood of disease ?

52 Zacho et al.

53

54

55 Results Increased CRP levels CRP PolymorphismsIncreased CRP levels Increased likelihood of disease

56 Zacho et al.

57

58

59

60

61

62

63

64 Possible issues with this study CRP polymorphisms could lead to higher plasma levels of less active CRP (unlikely, polymorphisms not near coding region) Limitations of the four individual studies Variability with race (only white participants studied) Potential lack of statistical power

65 Conclusion Genetic variants that lead to increased CRP levels do not lead to an increased risk of heart-disease (and cerebrovascular disease) Increased CRP levels are likely to be a marker rather than cause for disease.

66 Topics 1. Altshuler et al. Genetic Mapping in Human Disease. Science 322, 881 (2008); 2. Zacho et al. Genetically Elevated C-Reactive Protein and Ischemic Vascular Disease. N Engl J Med 359, 18 (2008); 3. Jakobsdottir et al. Interpretation of Genetic Association Studies: Markers with Replicated Highly Significant Odds Ratios May Be Poor Classifiers. PLoS Genetics 5, 2 (2009)

67 Statistical methods to evaluate markers in genetic testing 1.ROC (receiver operating characteristic) curves 2.Logistic regression

68 Genetic testing for the public Sources: 23andme.com decodeme.com navigenics.com

69 Classification based statistics Evaluates how well one can distinguish between cases and controls.

70 Disease YES NO Diagnostic Test Negative Positive

71 True Positive Disease YES NO Diagnostic Test Negative Positive

72 True Positive False Positive Disease YES NO Diagnostic Test Negative Positive

73 True Positive False Positive False Negative Disease YES NO Diagnostic Test Negative Positive

74 True Positive False Positive False Negative True Negative Disease YES NO Diagnostic Test Negative Positive

75 True Positive False Positive False Negative True Negative Disease YES NO Diagnostic Test Negative Positive Sensitivity = TP TP + FN

76 True Positive False Positive False Negative True Negative Disease YES NO Diagnostic Test Negative Positive Sensitivity = TP TP + FN With this test, how many people that are actually ill will I catch?

77 True Positive False Positive False Negative True Negative Disease YES NO Diagnostic Test Negative Positive Sensitivity = TP TP + FN Specificity = TN TN + FP

78 True Positive False Positive False Negative True Negative Disease YES NO Diagnostic Test Negative Positive Sensitivity = TP TP + FN Specificity = TN TN + FP With this test, will I tell too many people they might be ill?

79 ROC curves Source: medcalc.be Important measure: area under the curve (AUC)

80 Odds Ratios (risk analysis) The odds of an event occurring in one group The odds of an event occurring in the control group

81 Odds Ratios (risk analysis) The odds of an event occurring in one group The odds of an event occurring in the control group event less likely in first group < 1 < event more likely in first group equal likelihood

82 Take-home message OR “Strong association (low p-value) does not guarantee effective discrimination between cases and controls (classification). Excellent classification (high AUC) does not guarantee good prediction of actual risk” - Jakobsdottir et al.

83 Source: newscientist.com


Download ppt "From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics"

Similar presentations


Ads by Google