Genome-Wide Association (GWA) Studies National Human Genome Research Institute National Institutes of Health U.S. Department of Health and Human Services.

Slides:



Advertisements
Similar presentations
II. Potential Errors In Epidemiologic Studies Random Error Dr. Sherine Shawky.
Advertisements

What is an association study? Define linkage disequilibrium
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Genetics for Epidemiologists Lecture 5: Analysis of Genetic Association Studies National Human Genome Research Institute National Institutes of Health.
November 17, 2007Georgia Tech / ORNL Bioinformatics Conference1 Opportunities at the NIH Peter Good, Ph.D. NHGRI November 17, th Georgia Tech / ORNL.
Perspectives from Human Studies and Low Density Chip Jeffrey R. O’Connell University of Maryland School of Medicine October 28, 2008.
Ingredients for a successful genome-wide association studies: A statistical view Scott Weiss and Christoph Lange Channing Laboratory Pulmonary and Critical.
Bias in Studies of the Human Genome Thomas A. Pearson, MD, PhD University of Rochester School of Medicine Visiting Scientist, NHGRI.
Office hours Wednesday 3-4pm 304A Stanley Hall. Fig Association mapping (qualitative)
Genome-Wide Association for the Rest of Us: Introduction and Goals National Human Genome Research Institute National Institutes of Health U.S. Department.
Comunicación y Gerencia 1Case control studies15/12/2010.
Genetics for Epidemiologists Lecture 5: Analysis of Genetic Association Studies National Human Genome Research Institute National Institutes of Health.
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Study Design Discussion The Ghost of Candidate Gene Past and the Ghost of Genome-wide Association Yet to Come Stephen S. Rich, Ph.D. Wake Forest University.
Are exposures associated with disease?
Understanding Genetics of Schizophrenia
Georgia Wiesner, MD CREC June 20, GATACAATGCATCATATG TATCAGATGCAATATATC ATTGTATCATGTATCATG TATCATGTATCATGTATC ATGTATCATGTCTCCAGA TGCTATGGATCTTATGTA.
Multiple Choice Questions for discussion
Dr. Abdulaziz BinSaeed & Dr. Hayfaa A. Wahabi Department of Family & Community medicine  Case-Control Studies.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Causation and the Rules of Inference Classes 4 and 5.
CHP400: Community Health Program- lI Research Methodology STUDY DESIGNS Observational / Analytical Studies Case Control Studies Present: Disease Past:
Advances in Genomics Since the Publication of the Human Genome Greg Feero, M.D., Ph.D. Faculty, Maine-Dartmouth Family Practice Residency Program, Fairfield,
A Primer on Genetic Variation Variety Lawrence Brody - NHGRI.
Introduction to Epidemiologic Methods Tuesday 9:30 – 10:30 am.
Measures of Association
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
CS177 Lecture 10 SNPs and Human Genetic Variation
A Genome-wide association study of Copy number variation in schizophrenia Andrés Ingason CNS Division, deCODE Genetics. Research Institute of Biological.
From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics
Contingency tables Brian Healy, PhD. Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomous t-test, Wilcoxon test ContinuousCategorical.
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
BGRS 2006 SEARCH FOR MULTI-SNP DISEASE ASSOCIATION D. Brinza, A. Perelygin, M. Brinton and A. Zelikovsky Georgia State University, Atlanta, GA, USA 123.
Complement Factor H Polymorphism in Age- Related Macular Degeneration* *Klein RJ, et al. Science. 2005; 308:
Measures of Association and Impact Michael O’Reilly, MD, MPH FETP Thailand Introductory Course.
Case Control Study Dr. Ashry Gad Mohamed MB, ChB, MPH, Dr.P.H. Prof. Of Epidemiology.
Jianfeng Xu, M.D., Dr.PH Professor of Public Health and Cancer Biology Director, Program for Genetic and Molecular Epidemiology of Cancer Associate Director,
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
Future Directions Pak Sham, HKU Boulder Genetics of Complex Traits Quantitative GeneticsGene Mapping Functional Genomics.
Case-Control Studies Abdualziz BinSaeed. Case-Control Studies Type of analytic study Unit of observation and analysis: Individual (not group)
Instructor Resource Chapter 17 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
Genome wide association studies (A Brief Start)
The International Consortium. The International HapMap Project.
Case Control Studies Dr Amna Rehana Siddiqui Department of Family and Community Medicine October 17, 2010.
Matched Case-Control Study Duanping Liao, MD, Ph.D Phone:
Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Contingency Tables.
Chapter 2. **The frequency distribution is a table which displays how many people fall into each category of a variable such as age, income level, or.
Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.
Methods of Presenting and Interpreting Information Class 9.
Interpreting exomes and genomes: a beginner’s guide
SNPs and complex traits: where is the hidden heritability?
Genomic Analysis: GWAS
Present: Disease Past: Exposure
Comparison of three Observational Analytical strategies
Genetics of common complex diseases: a view from Iceland
Matched Case-Control Study
Epidemiologic Measures of Association
Epidemiological Methods
Migrant Studies Migrant Studies: vary environment, keep genetics constant: Evaluate incidence of disorder among ethnically-similar individuals living.
High level GWAS analysis
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Mpundu MKC MSc Epidemiology and Biostatistics, BSc Nursing, RM, RN
Interpreting Epidemiologic Results.
Research Techniques Made Simple: Interpreting Measures of Association in Clinical Research Michelle Roberts PhD,1,2 Sepideh Ashrafzadeh,1,2 Maryam Asgari.
Presentation transcript:

Genome-Wide Association (GWA) Studies National Human Genome Research Institute National Institutes of Health U.S. Department of Health and Human Services National Institutes of Health National Human Genome Research Institute Teri A. Manolio, M.D., Ph.D. Senior Advisor to the Director, NHGRI, for Population Genomics Director, Office of Population Genomics

There’s a revolution going on… Williard, AM. Spirit of ‘76

There’s a revolution going on… Williard, AM. Spirit of ‘76 Technologic advances now allow us to measure hundreds of thousands of variable points across human genome –Relatively low cost –Relatively little DNA Can be applied to unrelated individuals studied over years or decades Can identify multitude of subtle genetic effects increasing risk of “complex” disease

What is a GWA Study? Method for interrogating all 10 million variable points across human genome Variation inherited in groups, or blocks, so not all 10 million points have to be tested Blocks are shorter (so need to test more points) the less closely people are related Technology now allows studies in unrelated persons, assuming ~10,000 base pair lengths in common (300, ,000 markers)

Christensen and Murray, N Engl J Med 2007; 356: Mapping the Relationships Among SNPs

Christensen and Murray, N Engl J Med 2007; 356: One SNP May Serve as Proxy for Many

Progress in Genotyping Technology Nb of SNPs Cost per genotype (Cents, USD) ABI TaqMan ABI SNPlex Illumina Golden Gate Illumina Infinium/Sentrix Affymetrix 100K/500K Perlegen Affymetrix MegAllele Affymetrix 10K Courtesy S. Chanock, NCI

Affymetrix 500K Illumina 317K Illumina 550K Illumina 650Y Continued Progress in Genotyping Technology Courtesy S. Gabriel, Broad/MIT July 2005Oct 2006 Cost per person (USD)

Courtesy, K. Doheny, Johns Hopkins Intensity Data for Three Combinations of Two Alleles

GWA Genotyping Data, Chromosome 22, Parkinson’s Study Study ID Case/ Control Status rs rs Allele 1Allele 2Allele 1Allele 2 14CaseTTGG 20CaseTCGG 41CaseTCGG 412ControlTCGG 592ControlCCGG 665ControlTCAG

Association of rs Alleles with Development of Parkinson Disease (Made Up!) Development of Disease Variant Allele (A) Develop Disease Do Not Develop Disease Total Present Absent Total509501,000 Relative Risk = Risk in Exposed = 10/80 = 12.5% =2.9 Risk in Unexposed40/9204.3%

Measures of Association: The Odds Ratio Odds are related to probability: odds = p/(1-p) –If probability of horse winning race is 50%, odds are 1/1 –If probability of horse winning race is 25%, odds are 1/3 for win or 3 to 1 against win If probability of exposed person getting disease is 25%, odds = p/(1-p) = 25/75 = 1/3 When don’t have denominators for risk estimates, can calculate odds ratio = cross- product ratio (“ad/bc”); computationally easier If disease is rare, odds ratio approximates relative risk but always overestimates effect

Association of rsxxxx3207 Alleles with Occurrence of Myocardial Infarction Presence of Disease Variant Allele (G) PresentAbsentTotal Present 8133,061? Absent 7943,667? Total1,5076,728?? OR = Odds in Exposed = 813 / 3,061 = 813 x 3,667 =1.23 Odds in Unexposed794 / 3, x 3,061 Helgadottir et al, Sciencexpress 3 May 2007.

Association of rsxxxx3207 Alleles with Occurrence of Myocardial Infarction Presence of Disease Variant Allele (G) PresentAbsentTotal Present 8133,061? Absent 7943,667? Total1,5076,728?? OR = Odds in Exposed = 813 / 3,061 = 813 x 3,667 =1.23 Odds in Unexposed794 / 3, x 3,061 Helgadottir et al, Sciencexpress 3 May 2007.

This is a tsunami of data… Hokusai, K. The Great Wave

This is a tsunami of data… Hokusai, K. The Great Wave New approaches needed for accessing, manipulating, visualizing Requires entirely new perspective Recognize potential for differences to be observed by chance alone

A Few Epidemiologic Definitions P-Value Probability of finding result as extreme or more extreme by chance alone ( or 1 x ) Type I error (α) Probability of finding a difference when in fact none exists (also called “spurious association”) Type II error (β) Probability of failing to find a difference when in fact one does exist Power Probability of finding a difference when one in fact does exist, = 1 - β Effect Size Magnitude of risk associated with variant Sample Size P-value Effect size Allele frequency Variability of measure

Klein et al, Science 2005; 308: P Values of GWA Scan for Age-Related Macular Degeneration

Genome-Wide Scan for Type 2 Diabetes in a Scandinavian Cohort

P-values for 8q24 SNPs Most Strongly Associated with Prostate Cancer Haiman et al, Nat Genet 2007; 39:

P-values for Chromosome 11 SNPs Most Strongly Associated with Diabetes Scott et al, Sciencexpress 26 April 2007.

Of 600 Gene-Disease Associations, Only 6 Significant in > 75% of Identified Studies Disease/TraitGenePolymorphismFrequency DVTF5Arg506Gln0.015 Graves’ DiseaseCTLA4Thr17Ala0.62 Type 1 DMINS5’ VNTR0.67 HIV/AIDSCCR532 bp Ins/Del Alzheimer’sAPOEEpsilon 2/3/ Creutzfeldt- Jakob Disease PRNPMet129Val0.37 Hirschhorn J et al, Genet Med 2002; 4:45-61.

Aspects of GWA Studies that Make Data Sharing Crucial Expensive, generate many “false positives” Replication held as sine qua non of valid association Large sample sizes and multiple studies needed to replicate findings Massive data sets, analysis requires huge and specialized effort Better analytic methods needed Once genome is measured can be related to just about anything

Larson, G. The Complete Far Side

The revolution is here… Williard, AM. Spirit of ‘76

The revolution is here… Williard, AM. Spirit of ‘76 Extensive characterization of individual person’s genome now feasible Can be applied to unrelated individuals Many existing studies have carefully characterized thousands of persons New approaches to manipulating and interpreting data needed Responsible and widespread data sharing key to fully exploring GWA datasets Collaboration for replication and functional determination is crucial

Measures of Public Health Impact: Population Attributable Risk Measures the proportion of disease that would be eliminated if particular causal factor were eliminated Directly related to prevalence of risk factor and risk it conveys Almost always over-estimates proportion attributable to risk factor PAR = (Prevalence of exposure) x (Relative risk - 1) 1 + (Prevalence of exposure) x (Relative risk - 1)

GWA Genotyping Data, Chromosome 22, Parkinson’s Study Study ID Case/ Control Status rs rs Allele 1Allele 2Allele 1Allele 2 14CaseTTGG 20CaseTCGG 41CaseTCGG 412ControlTCGG 592ControlCCGG 665ControlTCAG

GWA Genotyping Data, Chromosome 22, One Person T C G G G G A A A A C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T …

Chromosome 22, One Person, Continued …G A A A A C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T TC C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T …

Chromosome 22, One Person, Continued… …A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T TC C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T T T C C A A A A T C A G C C T C T T T C T C T T A G C C C A A A A T C A G C C T C T T T C T C T T A G C C A G A A T C C A A A A T C A G C C T C …