A Method for Detecting Pleiotropy

Slides:



Advertisements
Similar presentations
Agenda of Week VII Review of Week VI Multiple regression Canonical correlation.
Advertisements

To go further: intra- versus interindividual variability.
Analysis of imputed rare variants
Gene-by-Environment and Meta-Analysis Eleazar Eskin University of California, Los Angeles.
SEM PURPOSE Model phenomena from observed or theoretical stances
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Structural Equation Modeling
Empirical Estimator for GxE using imputed data Shuo Jiao.
Meta-analysis for GWAS BST775 Fall DEMO Replication Criteria for a successful GWAS P
Perspectives from Human Studies and Low Density Chip Jeffrey R. O’Connell University of Maryland School of Medicine October 28, 2008.
1 Associating Genomic Variations with Phenotypes Model comparison, rare variants, and analysis pipeline Qunyuan Zhang Division of Statistical Genomics.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
More Powerful Genome-wide Association Methods for Case-control Data Robert C. Elston, PhD Case Western Reserve University Cleveland Ohio.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Correlation Patterns. Correlation Coefficient A statistical measure of the covariation or association between two variables. Are dollar sales.
Chapter 11 Multiple Regression.
Business Statistics - QBM117 Interval estimation for the slope and y-intercept Hypothesis tests for regression.
Analysis of Variance & Multivariate Analysis of Variance
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Lorelei Howard and Nick Wright MfD 2008
Simple Linear Regression Analysis
Multivariate Analysis of Variance, Part 1 BMTRY 726.
Regression and Correlation Methods Judy Zhong Ph.D.
1 MULTI VARIATE VARIABLE n-th OBJECT m-th VARIABLE.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
Covariance structures in longitudinal analysis Which one to choose?
Generalized Linear Mixed Model (GLMM) & Weighted Sum Test (WST) Detecting Association between Rare Variants and Complex Traits Qunyuan Zhang, Ingrid Borecki,
Population Stratification
Xuhua Xia Slide 1 MANOVA All statistical methods we have learned so far have only one continuous DV and one or more IVs which may be continuous or categorical.
Regression. Population Covariance and Correlation.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
1 Association Analysis of Rare Genetic Variants Qunyuan Zhang Division of Statistical Genomics Course M Computational Statistical Genetics.
MANOVA Mechanics. MANOVA is a multivariate generalization of ANOVA, so there are analogous parts to the simpler ANOVA equations First lets revisit Anova.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Sequential Multiple Decision Procedures (SMDP) for Genome Scans Q.Y. Zhang and M.A. Province Division of Statistical Genomics Washington University School.
Statistical Methods for Rare Variant Association Test Using Summarized Data Qunyuan Zhang Ingrid Borecki, Michael A. Province Division of Statistical Genomics.
Adjusted from slides attributed to Andrew Ainsworth
PSYC 3030 Review Session April 19, Housekeeping Exam: –April 26, 2004 (Monday) –RN 203 –Use pencil, bring calculator & eraser –Make use of your.
Qunyuan Zhang Ingrid Borecki, Michael A. Province
Sequential & Multiple Hypothesis Testing Procedures for Genome-wide Association Scans Qunyuan Zhang Division of Statistical Genomics Washington University.
Session 13: Correlation (Zar, Chapter 19). (1)Regression vs. correlation Regression: R 2 is the proportion that the model explains of the variability.
Environmental Modeling Basic Testing Methods - Statistics III.
Correlation Matrix Diagonal Segmentation (CMDS) A Fast Genome-wide Approach for Identifying Recurrent DNA Copy Number Alterations across Cancer Patients.
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Randomized block designs  Environmental sampling and analysis (Quinn & Keough, 2002)
Bootstrap Event Study Tests Peter Westfall ISQS Dept. Joint work with Scott Hein, Finance.
[Topic 1-Regression] 1/37 1. Descriptive Tools, Regression, Panel Data.
KNN Ch. 3 Diagnostics and Remedial Measures Applied Regression Analysis BUSI 6220.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
A simple method to localise pleiotropic QTL using univariate linkage analyses of correlated traits Manuel Ferreira Peter Visscher Nick Martin David Duffy.
Université d’Ottawa / University of Ottawa 1999 Bio 8100s Multivariate biostatistics L5.1 Two sample comparisons l Univariate 2-sample comparisons l The.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
29 October 2009 MRC CBU Graduate Statistics Lectures 4: GLM: The General Linear Model - ANOVA & ANCOVA1 MRC Cognition and Brain Sciences Unit Graduate.
Date of download: 7/2/2016 Copyright © 2016 American Medical Association. All rights reserved. From: How to Interpret a Genome-wide Association Study JAMA.
Multiple Imputation using SOLAS for Missing Data Analysis
Genome Wide Association Studies using SNP
Quantitative-Trait Loci Influencing Body-Mass Index Reside on Chromosomes 7 and 13: The National Heart, Lung, and Blood Institute Family Heart Study 
BA 275 Quantitative Business Methods
Chapter 9 Hypothesis Testing
From GLM to HLM Working with Continuous Outcomes
Quantitative-Trait Loci Influencing Body-Mass Index Reside on Chromosomes 7 and 13: The National Heart, Lung, and Blood Institute Family Heart Study 
Chapter 3 Statistical Concepts.
Chapter 9: Differences among Groups
Multivariate Linear Regression
Maximizing the Power of Principal-Component Analysis of Correlated Phenotypes in Genome-wide Association Studies  Hugues Aschard, Bjarni J. Vilhjálmsson,
Perspectives from Human Studies and Low Density Chip
Hugues Aschard, Bjarni J. Vilhjálmsson, Amit D. Joshi, Alkes L
Rare-Variant Association Testing for Sequencing Data with the Sequence Kernel Association Test  Michael C. Wu, Seunggeun Lee, Tianxi Cai, Yun Li, Michael.
Presentation transcript:

A Method for Detecting Pleiotropy Ingrid Borecki, Qunyuan Zhang, Michael Province Division of Statistical Genomics Washington University School of Medicine

Pleiotropy Biological question: Does a genetic variant have independent effects on both of two traits? Statistical question: Can the correlation or a portion of the correlation between two traits be explained by a genetic variant?

Hypotheses & Models Compound null: no pleiotropy Alternative: X Y1 Y2 Y1 Y2 Y1 Y2 X X X

Statistical Parameter (δ) of Pleiotropy & Hypotheses to Be Tested Compound null: no pleiotropy Alternative: pleiotropy

Estimating δ Two traits are simultaneously fit into a mixed model T is the trait indicating variable; R is block diagonal covariance matrix (after re-ordering by individuals), with blocks corresponding to the individuals and each block having the compound-symmetry structure When excluding X from the model When including X in the model

Pleiotropy Estimation Test (PET) Testing δ Q-Q Plot under the null Pleiotropy Estimation Test (PET) Estimated by bootstrap re-sampling 100 times with replacement -LOG10(P)

Other Methods for Comparison MANOVA (Wilks' test, wrong null) FCP: Fisher’s combined p-value test (meta-analysis ignoring correlations, wrong null) RCM: Reverse compound model (two tests) SUM: Simple univariate model (two tests) Testing if β1≠0 and β2≠0 =Residual of Y1 adjusted by Y2 =Residual of Y2 adjusted by Y1

Power Comparison PET FCP MANOVA RCM SUM

Power Comparison PET FCP MANOVA RCM SUM

Application SNP WC HOMA PET Cov(%) 1 3.33E-06 8.35E-06 2.87E-09 1.74 2 Correlation (WC, HOMA)= 0.542 SNP WC HOMA PET Cov(%) 1 3.33E-06 8.35E-06 2.87E-09 1.74 2 1.77E-04 8.96E-06 8.33E-07 1.29 3 2.25E-03 8.06E-06 1.93E-06 1.39 4 2.29E-04 4.04E-06 5.68E-06 1.15 5 2.28E-04 4.15E-06 4.91E-05 6 1.92E-04 9.84E-06 7.18E-05 1.28 7 1.42E-02 3.02E-05 7.68E-05 1.04 Correlation (TG, CAC)= 0.089 SNP TG CAC PET Cov(%) 1 1.83E-18 5.95E-01 2.57E-01 3.63 2 7.47E-01 3.28E-09 6.76E-01 0.42 3 2.61E-02 1.85E-05 1.30E-04 5.36

Conclusions The PET Method Tests proper compound null for pleiotropy; Gives estimation of covariance due to pleiotropy; Has greater power other alternatives; Under mixed model framework, can easily be expanded to other data (covariates, family data etc.) ; Practical to GWAS data (with 300 blades, R version takes less than 1 day for the analysis of 2M SNPs and ~3000 subjects) ; Must be fit to primary phenotype and (typed or imputed) genotype data.

Acknowledgement Ling-Yun Chang (programming & testing) Mary Feitosa (GWAS data and application)