Hunting, shooting and fishing... Shooting –Prior hypothesis Hunting –“Promising area” Fishing –Angling –Trawling –Floundering.

Slides:



Advertisements
Similar presentations
Statistical methods for genetic association studies
Advertisements

Prenatal Nutrition and IQ: A causal analysis using a Mendelian randomization approach Sarah Lewis.
Gene-by-Environment and Meta-Analysis Eleazar Eskin University of California, Los Angeles.
AllerGen / Vancouver - 01/03//2009 Meta-Analysis of GABRIEL GWAS Asthma & IgE F. Demenais, M. Farrall, D. Strachan GABRIEL Statistical Group.
Deriving Biological Inferences From Epidemiologic Studies.
GENETICS AND VARIABILITY IN CROP PLANTS. Genetics and variability of traits are grouped by:  Qualitative traits Traits that show variability that can.
Efficient Algorithms for Imputation of Missing SNP Genotype Data A.Mihajlović, V. Milutinović,
Mendelian Genetics The term ‘Mendelian genetics’ typically relates to the outcomes of simple dominant and recessive gene pairings Shows specific ratios.
Meta-analysis for GWAS BST775 Fall DEMO Replication Criteria for a successful GWAS P
The British 1958 cohort (National Child Development Study)
Gene-gene and gene-environment interactions Manuel Ferreira Massachusetts General Hospital Harvard Medical School Center for Human Genetic Research.
3%20GWASancestry.pptx.
Genome-wide association studies Usman Roshan. SNP Single nucleotide polymorphism Specific position and specific chromosome.
Population Genetics.
Gene-Environment Interaction: Definitions and Study Designs
Using biological networks to search for interacting loci in genome-wide association studies Mathieu Emily et. al. European journal of human genetics, e-pub.
Gene-gene and gene-environment interactions Manuel Ferreira Massachusetts General Hospital Harvard Medical School Center for Human Genetic Research.
Lesson #11 Relative Risk and the Odds Ratio. The risk of disease, given exposure, is: The risk of disease, given no exposure, is: The relative risk is.
BIOST 536 Lecture 4 1 Lecture 4 – Logistic regression: estimation and confounding Linear model.
Writing a formal Scientific report for an investigation.
Lecture 9: p-value functions and intro to Bayesian thinking Matthew Fox Advanced Epidemiology.
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Analysis of genome-wide association studies
Measuring Associations Between Exposure and Outcomes.
Understanding Health: Theoretical challenges and possible approaches September 25, 2006.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
TWO-STAGE CASE-CONTROL STUDIES USING EXPOSURE ESTIMATES FROM A GEOGRAPHICAL INFORMATION SYSTEM Jonas Björk 1 & Ulf Strömberg 2 1 Competence Center for.
Multifactor Dimensionality Reduction Laura Mustavich Introduction to Data Mining Final Project Presentation April 26, 2007.
Figure S1. Quantile-quantile plot in –log10 scale for the individual studies The red line represents concordance of observed and expected values. The shaded.
The Campbell Collaborationwww.campbellcollaboration.org C2 Training: May 9 – 10, 2011 Introduction to meta-analysis.
Contingency tables Brian Healy, PhD. Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomous t-test, Wilcoxon test ContinuousCategorical.
Type 1 Error and Power Calculation for Association Analysis Pak Sham & Shaun Purcell Advanced Workshop Boulder, CO, 2005.
Jianfeng Xu, M.D., Dr.PH Professor of Public Health and Cancer Biology Director, Program for Genetic and Molecular Epidemiology of Cancer Associate Director,
1 Risk Assessment Tests Marina Kondratovich, Ph.D. OIVD/CDRH/FDA March 9, 2011 Molecular and Clinical Genetics Panel for Direct-to-Consumer (DTC) Genetic.
Sampling Design in Regional Fine Mapping of a Quantitative Trait Shelley B. Bull, Lunenfeld-Tanenbaum Research Institute, & Dalla Lana School of Public.
1 B-b B-B B-b b-b Lecture 2 - Segregation Analysis 1/15/04 Biomath 207B / Biostat 237 / HG 207B.
Allele Frequencies: Staying Constant Chapter 14. What is Allele Frequency? How frequent any allele is in a given population: –Within one race –Within.
A physical characteristic like eye color.. A small part on a chromosome that controls a trait.
Future Directions Pak Sham, HKU Boulder Genetics of Complex Traits Quantitative GeneticsGene Mapping Functional Genomics.
Measuring Associations Between Exposure and Outcomes Chapter 3, Szklo and Nieto.
Retain H o Refute hypothesis and model MODELS Explanations or Theories OBSERVATIONS Pattern in Space or Time HYPOTHESIS Predictions based on model NULL.
BC Jung A Brief Introduction to Epidemiology - XIII (Critiquing the Research: Statistical Considerations) Betty C. Jung, RN, MPH, CHES.
Errors in Genetic Data Gonçalo Abecasis. Errors in Genetic Data Pedigree Errors Genotyping Errors Phenotyping Errors.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Lecture 22: Quantitative Traits II
Some Methodological Considerations in Mendelian Randomization Studies Eric J. Tchetgen Tchetgen Depts of Epidemiology and Biostatistics.
Confounding Biost/Stat 579 David Yanez Department of Biostatistics University of Washington July 7, 2005.
Linkage. Announcements Problem set 1 is available for download. Due April 14. class videos are available from a link on the schedule web page, and at.
Association tests. Basics of association testing Consider the evolutionary history of individuals proximal to the disease carrying mutation.
11-3: Exploring Mendelian Genetics Objectives:  Explain the principle of independent assortment.  Describe the inheritance patterns that exist aside.
An atlas of genetic influences on human blood metabolites Nature Genetics 2014 Jun;46(6)
Case control & cohort studies
Purpose of Epi Studies Discover factors associated with diseases, physical conditions and behaviors Identify the causal factors Show the efficacy of intervening.
Genome-Wides Association Studies (GWAS) Veryan Codd.
Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Methods of Presenting and Interpreting Information Class 9.
Power Calculations for GWAS
Punnett Squares.
Migrant Studies Migrant Studies: vary environment, keep genetics constant: Evaluate incidence of disorder among ethnically-similar individuals living.
Genetics definitions Label each chromosome pair as homozygous dominant, homozygous recessive, or heterozygous with definitions Label dominant.
Punnett Squares.

Mendelian Randomization (Using genes to tell us about the environment)
Vocab #18 Mr. Addeo.
Relationship Relation: Association: real and spurious Statistical:
Punnett Squares.
Patterns of Inheritance
Effect Modifiers.
Presentation transcript:

Hunting, shooting and fishing... Shooting –Prior hypothesis Hunting –“Promising area” Fishing –Angling –Trawling –Floundering

Hunting, shooting and fishing... Shooting –Prior hypothesis Hunting –“Promising area” Fishing –Angling –Trawling –Floundering G*E suspected –Prior publication G known, E not –E * (top SNPs) E known, G not –E * (GWAS) Neither G nor E is known as a risk factor for asthma

Hunting, shooting and fishing... Single test –p<0.05 or tests –p<0.001 ? 550,000 tests –p<10 -6 / ? 25 million tests –p<10 -9 ? –plus biology…? G*E suspected –Prior publication G known, E not –E * (top SNPs) E known, G not –E * (GWAS) Neither G nor E is known as a risk factor for asthma

Hunting, shooting and fishing... 1-stage test –interaction analysis 1-stage test (?) –interaction analysis 2-stage test (?) –screen then confirm 2-stage test –screen then confirm –plus biology…? G*E suspected –Prior publication G known, E not –E * (top SNPs) E known, G not –E * (GWAS) Neither G nor E is known as a risk factor for asthma

Association of chr17q21 variant rs with childhood asthma / wheezy bronchitis Trend p = 2 x MAF = 44% PARF = 27%

Association of chr17q21 variant rs with childhood asthma / wheezy bronchitis Trend p = 2 x MAF = 44% PARF = 27%

PersonsChromosomes GGGgggG+G- CasesnnnD+ab ControlsnnnD-cd Logistic regression (G=0,1,2)Odds ratio ad/bc Same results for per-allele odds ratio and significance

Formal case-control interaction analysis (1) ExposedGGGgggE+G+G- CasesnnnD+ab ControlsnnnD-cd Unexp.GGGgggE-G+G- CasesnnnD+ef ControlsnnnD-gh Interaction OR = (ad/bc) / (eh/fg) = (adfg) / (bceh)

Formal case-control interaction analysis (2) CasesGGGgggD+G+G- ExposednnnE+ab Unexp.nnnE-ef ControlsGGGgggD-G+G- ExposednnnE+cd Unexp.nnnE-gh Interaction OR = (af/be) / (ch/dg) = (adfg) / (bceh)

Case-only approach to G*E interaction analysis CasesGGGgggD+G+G- ExposednnnE+ab Unexp.nnnE-ef Interaction OR = (af/be) / 1 Assumes no association between exposure and genotype in undiseased (Mendelian randomization). Gains statistical power as no error in (ch/dg) term. Not statistically independent of (af/be) / (ch/dg)

Two-stage case-control interaction analysis D+ & D-GGGgggAllG+G- Exposedn+nn+nn+nE+a+cb+d Unexp.n+nn+nn+nE-e+gf+h The “screening” OR = (a+c)(f+h)/(b+d)(e+g), an “average” of G*E associations across cases & controls is statistically independent of the interaction OR. No assumption of Mendelian randomization. Gains statistical power in GWAS if used to select SNPs for formal interaction testing in 2nd stage at p<0.01.

Interaction analysis with multi-level exposures D+ & D-GGGgggAllG+G- Exposedn+nn+nn+nE+a+cb+d Unexp.n+nn+nn+nE-e+gf+h The “screening” OR = (a+c)(f+h)/(b+d)(e+g) (or the “case-only” OR in a case-only design) can also be derived by logistic regression: Modelling exposure as a function of genotype … or … Modelling G (0,1) as a function of exposure (0,1).

Interaction with multi-level exposures (step 1) D+ & D-GGGgggAllG+G- High3 Medium2 Low1 None0 Test for association of G with E by logistic regression among cases and controls combined: Modelling G (0,1) as a function of exposure (0-3).

Interaction with multi-level exposures (step 2) Compare association of G with E between cases and controls, for SNPs with “promising” screening ORs: Model G (presence of effect allele or genotype) as a function of level of exposure (0,1,2...k) in cases Model G (presence of effect allele or genotype) as a function of level of exposure (0,1,2...k) in controls Calculate difference between betas for E=1, 2, 3 … k within each study (exposure definition is consistent). Pool results for these differences (log interaction ORs) across studies (but exposure definition or groupings may not be consistent between studies).

Hunting, shooting and fishing... 1-stage test –interaction analysis 1-stage test (?) –interaction analysis 2-stage test (?) –screen then confirm 2-stage test –screen then confirm –plus biology…? G*E suspected –Prior publication G known, E not –E * (top SNPs) E known, G not –E * (GWAS) Neither G nor E is known as a risk factor for asthma

Analytical strategy: interactions (1) How many asthma-related SNPs should be tested for interactions with: –each other (G*G interactions)? –environment and lifestyle factors (G*E)? Should biological candidate genes be prioritised for G*E interactions? –If so, only for biologically plausible interaction effects? Should interactions be tested only for disease outcomes/subgroups that are associated with the genotypes? –At what p value cut-off for the main effect?

Analytical strategy: interactions (2) Should interactions be tested only for the genetic model (additive, dominant or recessive) that best fits phenotype? –Probably, to maximise statistical power Should interactions be tested in all cohorts (with relevant exposure data) even if the environmental effect is non-significant in one or more cohorts? –Yes, if we want to meta-analyse eventually Should we seek “replication” of new G*E interactions before publication? –Yes, if single study. No, if meta-analysis.

Why study interactions? “Antidote to determinism” –Genetic susceptibility, programming –Lifecourse approach to disease causation More certain identification of causes –Interaction RR larger than overall RR –Bias and confounding more easily excluded Guidance for public health policy –“Safety for susceptibles”

Mr Blobby Fat Free- Blotchy wheeler Bloke