Beyond GWAS. Outline Multiple testing Gene-environment interaction Gene-gene interaction Rare variants Pharmacogenetics, Phamacogenomics.

Slides:

Advertisements

Similar presentations

Statistical methods for genetic association studies

Advertisements

Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee.

Gene-by-Environment and Meta-Analysis Eleazar Eskin University of California, Los Angeles.

CZ5225 Methods in Computational Biology Lecture 9: Pharmacogenetics and individual variation of drug response CZ5225 Methods in Computational Biology.

Association Tests for Rare Variants Using Sequence Data

Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.

Meta-analysis for GWAS BST775 Fall DEMO Replication Criteria for a successful GWAS P

Gene-gene and gene-environment interactions Manuel Ferreira Massachusetts General Hospital Harvard Medical School Center for Human Genetic Research.

Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.

Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College

MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.

Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.

Integrating domain knowledge with statistical and data mining methods for high-density genomic SNP disease association analysis Dinu et al, J. Biomedical.

Using biological networks to search for interacting loci in genome-wide association studies Mathieu Emily et. al. European journal of human genetics, e-pub.

Gene-gene and gene-environment interactions Manuel Ferreira Massachusetts General Hospital Harvard Medical School Center for Human Genetic Research.

Testing Dose-Response with Multivariate Ordinal Data Bernhard Klingenberg Asst. Prof. of Statistics Williams College, MA Paper available at

Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.

Study Design Discussion The Ghost of Candidate Gene Past and the Ghost of Genome-wide Association Yet to Come Stephen S. Rich, Ph.D. Wake Forest University.

Sequence comparison: Significance of similarity scores Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.

Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &

Kaitlyn Cook Carleton College Northfield Undergraduate Mathematics Symposium October 7, 2014 A METHOD FOR COMBINING FAMILY-BASED RARE VARIANT TESTS OF.

Comments on Rare Variants Analyses Ryo Yamada Kyoto University 2012/08/27 Japan.

CCEB Pharmacogenetics of Leukemia Treatment Response Richard Aplenc May 2 nd, 2008.

Pharmacogenomics Eric Jorgenson.

Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology

IAP workshop, Ghent, Sept. 18 th, 2008 Mixed model analysis to discover cis- regulatory haplotypes in A. Thaliana Fanghong Zhang*, Stijn Vansteelandt*,

Pharmacogenetics and Pharmacogenomics Eric Jorgenson 2/24/9.

The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,

1 Association Analysis of Rare Genetic Variants Qunyuan Zhang Division of Statistical Genomics Course M Computational Statistical Genetics.

From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics

Genome-Wide Association Study (GWAS)

Interactions Eric Jorgenson EPI 217 2/22/11. Outline Gene-Environment Interaction Gene-Gene Interaction Pharmacogenetics Pharmacogenomics.

Quantitative Genetics

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (1) J. R. O’Connell 1 and P. M. VanRaden 2 1 University of Maryland School of Medicine,

Jianfeng Xu, M.D., Dr.PH Professor of Public Health and Cancer Biology Director, Program for Genetic and Molecular Epidemiology of Cancer Associate Director,

Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.

Multiple Testing Matthew Kowgier. Multiple Testing In statistics, the multiple comparisons/testing problem occurs when one considers a set of statistical.

Statistical Methods for Rare Variant Association Test Using Summarized Data Qunyuan Zhang Ingrid Borecki, Michael A. Province Division of Statistical Genomics.

Qunyuan Zhang Ingrid Borecki, Michael A. Province

Future Directions Pak Sham, HKU Boulder Genetics of Complex Traits Quantitative GeneticsGene Mapping Functional Genomics.

The Broad Institute of MIT and Harvard Differential Analysis.

1 Paper Outline Specific Aim Background & Significance Research Description Potential Pitfalls and Alternate Approaches Class Paper: 5-7 pages (with figures)

Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College

Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 –Multiple hypothesis testing Marshall University Genomics.

Analysis of Next Generation Sequence Data BIOST /06/2015.

Sequence Kernel Association Tests (SKAT) for the Combined Effect of Rare and Common Variants 統計論文奈良原.

An atlas of genetic influences on human blood metabolites Nature Genetics 2014 Jun;46(6)

약물유전체학 Pharmacogenomics Kangwon National Univ School of Medicine Hee Jae Lee PhD.

Genome-Wides Association Studies (GWAS) Veryan Codd.

Increasing Power in Association Studies by using Linkage Disequilibrium Structure and Molecular Function as Prior Information Eleazar Eskin UCLA.

Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.

Pharmacogenetics/Pharmacogenomics. Outline Introduction  Differential drug efficacy  People react differently to drugs Why does drug response vary?

Common variation, GWAS & PLINK

Nucleotide variation in the human genome

Differential Gene Expression

Pharmacogenomics Identification of genes variants that influence drug effects. Is it possible to predict the effect of a drug in a certain patient? Pharmacogenetics/genomics.

Genome Wide Association Studies using SNP

Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam

Mahla sattarzadeh Kerman University of Medical Sciences

Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.

Beyond GWAS Erik Fransen.

Pharmacogenomics Genes and Drugs.

Medical genomics BI420 Department of Biology, Boston College

Genetics of Human Cardiovascular Disease

Medical genomics BI420 Department of Biology, Boston College

Introduction to Pharmacogenetics

Pharmacogenomics Identification of genes variants that influence drug effects. Is it possible to predict the effect of a drug in a certain patient? Pharmacogenetics/genomics.

Detecting Treatment by Biomarker Interaction with Binary Endpoints

Hong Zhang, Judong Shen & Devan V. Mehrotra

Presentation transcript:

Beyond GWAS

Outline Multiple testing Gene-environment interaction Gene-gene interaction Rare variants Pharmacogenetics, Phamacogenomics

Multiple testing Recall we are testing ~1 Million markers, more or less Several strategies to adjust the p-values for doing so many tests – Bonferroni – False Discovery Rate (FDR) – Permutation

Multiple testing - Bonferroni Bonferroni adjustment – 0.05/{# tests, i.e., # markers, M} – most widely used in practice – Pr(Reject any test | null hypothesis true) = 0.05

Multiple testing - FDR False Discovery Rate (FDR) limits the expected number of false positives Less stringent control than Bonferroni, e.g. “Another way to look at the difference is that a p-value of 0.05 implies that 5% of all tests will result in false positives. An FDR adjusted p-value (or q-value) of 0.05 implies that 5% of significant tests will result in false positives. The latter is clearly a far smaller quantity.” values.aspx (Your textbook)

Multiple testing - Permutation Many of the tested genotype markers are correlated with each other (in LD), and so the tests are correlated Bonferroni adjusts as if they were completely independent Permutation will be more powerful, but… [max(T) in plink, --mperm]

Summary: Multiple testing Most people just use Bonferroni correction Other methods more powerful (and people have reasonable arguments for them) Nan Laird comments (text for the course) “Given the many false positive findings in the history of genetic association studies, one rather errs on being too conservative.” – Initial GWAS had a lot of false positives (recall, replication, replication, replication...)

Outline Multiple testing Gene-environment interaction Gene-gene interaction Rare variants Pharmocogenetics, Phamacogenomics

Gene environment interaction ● Need strong initial hypothesis about the environment ● e.g., Chronic Obstructive Pulmonary Disease (COPD) and smoking (DeMeo et al., AJHG 2006, SERPINE2 gene) ● Environmental exposures can be difficult to characterize (e.g., pollution)

Gene-Environment Interaction Example – Phenylketoneuria (PKU) (Gene) (Environment)

Gene-Environment Interaction Odds Ratio (OR) ah / bg ch / dg eh / fg 1 ● OR Interaction = OR G+E+ / OR G+E- OR G-E+ ● If OR Interaction = 1, multiplicative effects ● Example: OR Interaction = 15 / 5 x 3 = 1

Example 2: Factor V Leiden Mutations, Oral Contraceptive Use, and Venous Thrombosis OR G+E+: 34.7 G+E-: 6.9 G-E+: 3.7 G-E-: Reference Total Vanderbroucke et al., The Lancet 1994 OR Interaction = OR G+E+ / OR G+E- OR G- E+ = 34.7 / 6.9 x 3.7 = 1.4

Testing for GxE in regression logit{P(Y=1|g,E)}=  0 +  g X(g)+  e E+  ge X(g)E E could also be continuous, as could Y (then linear regression instead of logistic)... Tricky! - Scale dependent – Continuous environmental exposure - What if we modeled E differently, i.e. log(E) or added in E 2, etc.? Also can adjust for E 2, E 3 to make sure an interaction. – Can model X(g)=(I g=AA, I g=AB ) Tricky! Statistical interaction  biological interaction

Outline Multiple testing Gene-environment interaction Gene-gene interaction Rare variants Pharmocogenetics, Phamacogenomics

Gene-gene interaction Similar to gene-environment interaction, in terms of scale, etc. Also called epistasis

Gene-gene interaction P(Y=1|g 1,g 2 )=  0 +  1 X(g 1 ) +  2 X(g 2 ) +  12 X(g 1 ) X(g 2 ) Usually test when g 1 is from one gene, and g 2 from another gene OR from a GWAS, take the hits Feasible to do all pairwise: plink: --fast-epistasis – “4.5 billion two-locus tests generated from a 100K data set took just over 24 hours to run” (

Gene-Gene Interaction Models Marchini et al. Nature Genetics 2005

Example: GWAS of Psoriasis Strange et al. Nature Genetics 2010 Take the hits, and follow up on gene-gene interaction test --(nextslide)-->

Gene-Gene Interaction Strange et al. Nature Genetics 2010 Only example I am currently aware of where took GWAS hits and found something when looking for interactions.

Outline Multiple testing Gene-environment interaction Gene-gene interaction Rare variants Pharmocogenetics, Phamacogenomics

Minor Allele Frequency (MAF) for Rare variants “Common”: MAF > 0.05 “Less common”: 0.05>MAF>0.01 “Rare”: 0.01<MAF SNP: MAF>0.01 (Single Nucleotide Polymorphism) SNV: MAF<0.01 (Single Nucleotide Variant)

Rare variants Previous GWAS focused on chips designed for MAF > 0.05 (most powered for MAF > 0.10) Sequencing (de novo) Exome arrays How do we analyze them?

Analysis of rare variants Still an open area of research: One-at-a-time analysis Multi-marker tests Cohort Allelic Sums Test (CAST) Combined multivariate and collapsing (CMC) More flexible methods...

One-at-a-time analysis Standard univariate test we’ve been talking about Univariate analysis will have low power unless a very large sample size Nejentsev et al., Science 2009 MAF = ( ) / [ *( )] =

Standard Multi-marker tests Evaluate multiple rare variants simultaneously in a single model logit(P(Y=1|X))=  +   x 1 +   x 2 +…+   x M H 0 :  =0 Standard approach (likelihood ratio, score test) may have difficulty fitting the model due to sparse data (e.g., singleton SNP in case OR?) (Recap: one of the approaches we brought up last time to analyze groups of common variants also)

Cohort Allelic Sums Test (CAST) Collapsing method: group rare variants, e.g., within a gene Assumes same effect size of each variant in a group, logit(P(Y=1|X))=  +  {  k=1,…,M x k } – Like regressing count of number of minor alleles across multiple loci Cohen et al., Science 2004; Morgenthaler Mut Res 2007 >95%

Combined multivariate and Collapsing (CMC) Test rare and common togther? Only rare? Only common? Combines the previous two approaches, but simultaneously models rare and common variants Rare variants collapsed together per MAF, and treated as a single variant logit(P(Y=1|X))=  +   k=common variants}  k x k +  rare {  k=1,…,M x k }

Other rare variant approaches Many, many other rare variants methods out there Different assumptions (or lack there of) on how rare variants effect disease, e.g., how smoothed together, prior knowledge,… A common approach with less assumptions is SKAT, a more flexible multivariate test (Wu et al., AJHG, 2011)

Summary: Rare variants Need to aggregate rare variants for increased efficiency Difficult to choose aggregation a priori, more data-driven approaches may be more useful

Outline Multiple testing Gene-environment interaction Gene-gene interaction Rare variants Pharmocogenetics, Phamacogenomics

What is Pharmacogenetics? The study of the role of inheritance in the individual variation in drug response. Efficacy Toxicity

Phillips et al. JAMA 2001 Adverse Drug Reactions are common

Pharmacodynamics How a drug acts Drug target

Pharmacokinetics How a drug is processed ADME o Absorption o Distribution o Metabolism o Excretion Drug Levels (dosage) o Efficacy o Toxicity

Measure drug levels in the body Plasma concentration Metabolic Ratio o Compare blood vs. urine o Can be measured over time

Example: TPMT ● TMPT gene: Thiopurine methythyltransferase gene ● TPMT controls metabolism of the thiopurine drugs azathioprine, 6-mercaptopurine, and 6- thioguanine ● Chemotherapeutic agents and immunosuppresive drugs sensitivity and toxicity altered by variant

Standard TPMT Dosing

Standard Dosing: Drug Exposure and Toxicity

Genotype Specific TPMT Dosing

Genotype Specific: Drug Exposure and Toxicity

Outline Multiple testing Gene-environment interaction Gene-gene interaction Rare variants Pharmacogenetics, Phamacogenomics

Outline Gene-Environment Interaction Gene-Gene Interaction Pharmacogenetics Pharmacogenomics

What is Pharmacogenomics and how is it different from Pharmacogenetics? Genomic scale Array based platforms

Pharmacogenomics Evans and Relling Nature 2004

Challenges for Pharmacogenomics How predictive is a test? Does the test apply to all groups? Is a test superior to current clinical practice? Will testing improve outcomes? Is testing cost effective?