Mendelian Randomization: Genes as Instrumental Variables David Evans University of Queensland.

Slides:



Advertisements
Similar presentations
Epidemiologic Study Designs Clinical Studies & Objective Medicine
Advertisements

Prenatal Nutrition and IQ: A causal analysis using a Mendelian randomization approach Sarah Lewis.
Department of Public Health and Primary Care, Cardiovascular Epidemiology Unit, Strangeways Research Laboratory, Cambridge, UK Mendelian randomization:
Observational Studies and RCT Libby Brewin. What are the 3 types of observational studies? Cross-sectional studies Case-control Cohort.
Meta-analysis for GWAS BST775 Fall DEMO Replication Criteria for a successful GWAS P
What do you mean Mendel’s Random? Talk by Timothy Bates 12 November 2010.
Review of Univariate Linear Regression BMTRY 726 3/4/14.
UNIVERSITY OF CAMBRIDGE
New concepts and guidelines in the management of LDL-c and CV Risk: Need for early intervention Prof. Ulf Landmesser University Hospital Zürich Switzerland.
Sensitivity Analysis for Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
School of Social and Community Medicine University of BRISTOL Environmental and genetic influences on childhood growth trajectories Laura D Howe.
Chance, bias and confounding
What is Mendelian Randomisation? Frank Dudbridge.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
SOME ADDITIONAL POINTS ON MEASUREMENT ERROR IN EPIDEMIOLOGY Sholom May 28, 2011 Supplement to Prof. Carroll’s talk II.
Study Design and Measures of Disease Frequency Intermediate Epidemiology.
Higher BMI (body mass index) is linked to greater brain atrophy in 700 MCI and AD patients, and in healthy elderly ADNI (N=587,critical P-value: 0.025)
INTRODUCTION TO EPIDEMIOLO FOR POME 105. Lesson 3: R H THEKISO:SENIOR PAT TIME LECTURER INE OF PRESENTATION 1.Epidemiologic measures of association 2.Study.
Global impact of ischemic heart disease World Heart Federation, 2011.
Broad societal determinants of CVD and health Dubai 6/1/2006.
 Combines linear regression and ANOVA  Can be used to compare g treatments, after controlling for quantitative factor believed to be related to response.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 5: Analysis Issues in Large Observational Studies.
ALCOHOL AND HEALTH Morten Grønbæk National Institute of Public Health Copenhagen, Denmark.
Lecture 3: Inference in Simple Linear Regression BMTRY 701 Biostatistical Methods II.
Amsterdam Rehabilitation Research Center | Reade Multiple regression analysis Analysis of confounding and effectmodification Martin van de Esch, PhD.
Mendelian Randomization A deconfounding method Dr. Jørn Olsen Epi 200B February 23 & 25, 2010.
Prospective Evaluation of B-type Natriuretic Peptide Concentrations and the Risk of Type 2 Diabetes in Women B.M. Everett, N. Cook, D.I. Chasman, M.C.
C Reactive Protein Coronary Heart Disease Genetics Collaboration BMJ 2011;342:d548.
FACTORS AFFECTING HOUSING PRICES IN SYRACUSE Sample collected from Zillow in January, 2015 Urban Policy Class Exercise - Lecy.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
FEDME-EPIDEMIEN OG ALMEN PRAKSIS Thorkild I.A. Sørensen, Professor, Dr.Med.
Design and Analysis of Clinical Study 10. Cohort Study Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
Tutorial 4 MBP 1010 Kevin Brown. Correlation Review Pearson’s correlation coefficient – Varies between – 1 (perfect negative linear correlation) and 1.
Lecture 21: Quantitative Traits I Date: 11/05/02  Review: covariance, regression, etc  Introduction to quantitative genetics.
Epidemiological Research. Epidemiology A branch of medical science that deals with the incidence, distribution, and control of disease in a population.
Some Methodological Considerations in Mendelian Randomization Studies Eric J. Tchetgen Tchetgen Depts of Epidemiology and Biostatistics.
Stat 1510: Statistical Thinking and Concepts REGRESSION.
Interleukin 6 pathway and coronary heart disease
Tutorial 5 Thursday February 14 MBP 1010 Kevin Brown.
CHAPTER 14  MENDEL & THE GENE IDEA 14.1  Mendel used the scientific approach to identify two laws of inheritance 14.2  The laws of probability govern.
Chapter 14 Patterns in Health and Disease: Epidemiology and Physiology EXERCISE PHYSIOLOGY Theory and Application to Fitness and Performance, 6th edition.
ARE STRATIFICATION EFFECTS REAL? A CASE STUDY FROM THE ALCOHOL FIELD Andrew C. Heath, D.Phil. Washington University School of Medicine St. Louis, Missouri.
THE USE OF GENETIC VARIANTS AS TOOLS FOR EPIDEMIOLOGISTS George Davey Smith MRC Integrative Epidemiology Unit University of Bristol.
Genome-Wides Association Studies (GWAS) Veryan Codd.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Date of download: 9/18/2016 Copyright © The American College of Cardiology. All rights reserved. From: Nonfasting Glucose, Ischemic Heart Disease, and.
Validity in epidemiological research Deepti Gurdasani.
Mendelian Randomization
Director, Lipid Research Center The Johns Hopkins Medical Institutions
Mendelian randomization with invalid instruments: Egger regression and Weighted Median Approaches David Evans.
George Dedoussis Professor of Human Biology
Copyright © 2009 American Medical Association. All rights reserved.
Fig. 1. Mean plasma lipid and lipoprotein levels as a function of the common genetic variants in LCAT, g.-293G>A, S208T, and L369L in the CCHS and S208T.
Cardiometabolic effects of genetic upregulation of the interleukin 1 receptor antagonist: a Mendelian randomisation analysis    The Lancet Diabetes &
HDL cholesterol and cardiovascular risk Epidemiological evidence
Migrant Studies Migrant Studies: vary environment, keep genetics constant: Evaluate incidence of disorder among ethnically-similar individuals living.
Figure 1 Mendelian randomization study
Cardiometabolic effects of genetic upregulation of the interleukin 1 receptor antagonist: a Mendelian randomisation analysis    The Lancet Diabetes &
Correlation for a pair of relatives
Joshua A. Bell et al. JACC 2018;72:
Kanguk Samsung Hospital, Sungkyunkwan University
Evaluating Effect Measure Modification
Challenges in Interpreting Multivariable Mendelian Randomization: Might “Good Cholesterol” Be Good After All?  Michael V. Holmes, George Davey Smith 
Mendelian Randomization (Using genes to tell us about the environment)
The Healthy Beverage Index Is Associated with Reduced Cardio-metabolic Risk in US Adults: A Preliminary Analysis Kiyah J. Duffey, PhD Brenda M. Davy, PhD,
Schematic representation of an MR analysis.
Mendelian Randomization (Using genes to tell us about the environment)
Alcohol, Other Drugs, and Health: Current Evidence May–June 2019
Mendelian Randomization: Genes as Instrumental Variables
Presentation transcript:

Mendelian Randomization: Genes as Instrumental Variables David Evans University of Queensland

This Session What is Mendelian Randomization (MR)? Examples of MR in research Some ideas Using R to perform MR

Some Criticisms of GWA Studies… So you have a new GWAS hit for a disease… so what!? You can’t change people’s genotypes (at least not yet) You can however modify people’s environments… Mendelian Randomization is a method of using genetics to inform us about associations in traditional observational epidemiology

RCTs are the Gold Standard in Inferring Causality RANDOMISATION METHOD RANDOMISED CONTROLLED TRIAL CONFOUNDERS EQUAL BETWEEN GROUPS EXPOSED: INTERVENTION CONTROL: NO INTERVENTION OUTCOMES COMPARED BETWEEN GROUPS

Observational Studies RCTs are expensive and not always ethical or practically feasible Association between environmental exposures and disease can be assessed by observational epidemiological studies like case-control studies or cohort studies The interpretation of these studies in terms of causality is problematic

CHD risk according to duration of current Vitamin E supplement use compared to no use Rimm et al NEJM 1993; 328: RR

Use of vitamin supplements by US adults, Source: Millen AE, Journal of American Dietetic Assoc 2004;104: Percent

Vitamin E levels and risk factors: Women’s Heart Health Study Childhood SES Manual social class No car access State pension only Smoker Daily alcohol Exercise Low fat diet Obese Height Leg length Lawlor et al, Lancet 2004

Vitamin E supplement use and risk of Coronary Heart Disease Stampfer et al NEJM 1993; 328: 144-9; Rimm et al NEJM 1993; 328: ; Eidelman et al Arch Intern Med 2004; 164:

“Well, so much for antioxidants.”

Classic limitations to “observational” science Confounding Reverse Causation Bias

An Alternative to RCTs: Mendelian randomization Mendel in 1862 In genetic association studies the laws of Mendelian genetics imply that comparison of groups of individuals defined by genotype should only differ with respect to the locus under study (and closely related loci in linkage disequilibrium with the locus under study) Genotypes can proxy for some modifiable risk factors, and there should be no confounding of genotype by behavioural, socioeconomic or physiological factors (excepting those influenced by alleles at closely proximate loci or due to population stratification)

Mendelian randomisation and RCTs RANDOMISATION METHOD RANDOMISED CONTROLLED TRIAL CONFOUNDERS EQUAL BETWEEN GROUPS MENDELIAN RANDOMISATION RANDOM SEGREGATION OF ALLELES CONFOUNDERS EQUAL BETWEEN GROUPS EXPOSED: FUNCTIONAL ALLELLES EXPOSED: INTERVENTION CONTROL: NULL ALLELLES CONTROL: NO INTERVENTION OUTCOMES COMPARED BETWEEN GROUPS

Assumptions of Mendelian randomisation analysis Z associated with X Z is independent of U Z is independent of Y given U and X ZX U Y

Examples – using instruments for adiposity ZX U Y FTO Adiposity

Examples – using instruments for adiposity U Traits of metabolism CRP/BMI FTO Adiposity

In a Nutshell If adiposity DOES NOT causally affect metabolic traits, then the FTO variant should NOT be related to these metabolic traits If adiposity causally affects metabolic traits, then the FTO variant should also be related to these metabolic traits In this situation, the causal effect of adiposity can be estimated using an “instrumental variables analysis” as fitted by two stage least squares

PhenotypeExpected ChangeObserved ChangeObserved Change (BMI adjusted) Fasting insulin (0.033, 0.043)0.039 (0.013,0.064) (-0.027, 0.018) Fasting Glucose (0.014, 0.021)0.024 (0.001, 0.048)0.006 (-0.017, 0.029) Fasting HDL (-0.029, ) (-0.057, ) (-0.027, 0.019) … N~12,000 samples of European ancestry Do intermediate metabolic traits differ as one would expect given a FTO-BMI effect? Given the per allele FTO effect of ~0.1SD and known observational estimates one can derive an expected, per allele, effect on metabolic traits

Examples – using instruments for adiposity U Traits of metabolism CRP FTO Adiposity

Bidirectional MR

CRP and BMI C-Reactive Protein (CRP) is a biomarker of inflammation It is associated with BMI, metabolic syndrome, CHD and a number of other diseases It is unclear whether these observational relationships are causal or due to confounding or reverse causality This question is important from the perspective of drug development

“Bi-directional Mendelian Randomization” ???

Informative Interactions

Kelly Y, Sacker A, Gray R et al. J Epidemiol Community Health (2010), doi: /jech

Maternal Alcohol Dehydrogenase and Offspring IQ

Alcohol dehydrogenase (ADH) risk allele score in offspring and offspring IQ, stratified by maternal alcohol intake during pregnancy P value for interaction of risk allele score and drinking during pregnancy equals Lewis S et al, PLoS One Nov IQ Change All women Women not drinking in pregnancy Women drinking during pregnancy

Association of LDL-C, HDL-C, and risk for coronary heart disease (CHD) Emerging Risk Factors Collaboration, JAMA K participants in 68 prospective studies LDL-C HDL-C

LDL and CHD Risk Ference et al, JACC 2012

HDL: endothelial lipase Asn396Ser 2.6% of population carry Serine allele higher HDL-C No effect on other lipid fractions No effect on other MI risk factors Edmondson, J Clin Invest 2009

LIPG N396S and plasma HDL-C HDL Difference 396S carriers have 5.5 mg/dl higher HDL-C P<10 -8

After testing in 116,320 people, summary OR for LIPG Asn396Ser is 0.99

Individuals who carry the HDL-boosting variant have the same risk for heart attack as those who do not carry the variant Individuals who carry the HDL-boosting variant have the same risk for heart attack as those who do not carry the variant

Using Multiple Genetic Variants as Instruments Palmer et al (2011) Stat Method Res Allelic scores Testing multiple variants individually

Mining the Phenome Using Allelic Scores Could be applied to hundreds of thousands of molecular phenotypes simultaneously (gene expression, methylation, metabolomics etc)

Limitations to Mendelian Randomisation 1- Pleiotropy 2- Population stratification 3- Canalisation 4- Power (also “weak instrument bias”) 5- The existence of instruments

Mendelian Randomization in R There is a positive observational association between body mass index (BMI) and bone mineral density (BMD) It is unclear whether this represents a causal relationship We will use two stage least squares as implemented in R to address this question and estimate the causal effect of BMI on BMD

BMI BMD BMI_SCORE Fitting in R - Datafile

library(sem) #Load BMI and BMD Data x <- read.table(file="BMI_BMD_known.txt", header=TRUE, na.strings=-9) #Observational regression of BMD on BMI print("Observational regression of BMD on BMI") summary(lm(x$BMD ~ x$BMI)) #Regression of BMI on BMI score – Check Instrument Strength print("Regression of BMI on BMI score") summary(lm(x$BMI ~ x$BMI_SCORE)) #Perform two stage least squares analysis print("Two stage least squares analysis of BMD on BMI score") summary(tsls(x$BMD ~ x$BMI, ~ x$BMI_SCORE)) Fitting in R

[1] "Observational regression of BMD on BMI" Call: lm(formula = x$BMD ~ x$BMI) Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) <2e-16 *** x$BMI <2e-16 *** #Observational regression of BMD on BMI print("Observational regression of BMD on BMI") summary(lm(x$BMD ~ x$BMI)) COMMAND: OUTPUT:

[1] "Regression of BMI on BMI score" Call: lm(formula = x$BMI ~ x$BMI_SCORE) Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) <2e-16 *** x$BMI_SCORE <2e-16 *** Residual standard error: on 5552 degrees of freedom Multiple R-squared: , Adjusted R-squared: F-statistic: on 1 and 5552 DF, p-value: < 2.2e-16 #Regression of BMI on BMI score – Check Instrument Strength print("Regression of BMI on BMI score") summary(lm(x$BMI ~ x$BMI_SCORE)) COMMAND: OUTPUT:

[1] "Two stage least squares analysis of BMD on BMI score" 2SLS Estimates Model Formula: x$BMD ~ x$BMI Instruments: ~x$BMI_SCORE Estimate Std. Error t value Pr(>|t|) (Intercept) e+00 x$BMI e-05 #Perform two stage least squares analysis print("Two stage least squares analysis of BMD on BMI score") summary(tsls(x$BMD ~ x$BMI, ~ x$BMI_SCORE)) COMMAND: OUTPUT:

Acknowledgments George Davey Smith Nic Timpson Sek Kathiresan