Power Calculation for QTL Association

Slides:



Advertisements
Similar presentations
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Advertisements

Hypothesis Testing Steps in Hypothesis Testing:
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Introduction to Regression Analysis
Structural Equation Modeling
Biometrical genetics Manuel Ferreira Shaun Purcell Pak Sham Boulder Introductory Course 2006.
Biometrical genetics Manuel Ferreira Shaun Purcell Pak Sham Boulder Introductory Course 2006.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Power in QTL linkage: single and multilocus analysis Shaun Purcell 1,2 & Pak Sham 1 1 SGDP, IoP, London, UK 2 Whitehead Institute, MIT, Cambridge, MA,
Chapter 10 Simple Regression.
Confidence intervals. Population mean Assumption: sample from normal distribution.
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Association analysis Shaun Purcell Boulder Twin Workshop 2004.
Correlation. Two variables: Which test? X Y Contingency analysis t-test Logistic regression Correlation Regression.
Biometrical Genetics Pak Sham & Shaun Purcell Twin Workshop, March 2002.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Simple Linear Regression Analysis
Statistical Power Calculations Boulder, 2007 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Shaun Purcell & Pak Sham Advanced Workshop Boulder, CO, 2003
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Type 1 Error and Power Calculation for Association Analysis Pak Sham & Shaun Purcell Advanced Workshop Boulder, CO, 2005.
Power of linkage analysis Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Regression-Based Linkage Analysis of General Pedigrees Pak Sham, Shaun Purcell, Stacey Cherny, Gonçalo Abecasis.
Environmental Modeling Basic Testing Methods - Statistics III.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
Lecture 21: Quantitative Traits I Date: 11/05/02  Review: covariance, regression, etc  Introduction to quantitative genetics.
Genetic Theory Pak Sham SGDP, IoP, London, UK. Theory Model Data Inference Experiment Formulation Interpretation.
Epistasis / Multi-locus Modelling Shaun Purcell, Pak Sham SGDP, IoP, London, UK.
Linkage Disequilibrium Mapping of Complex Binary Diseases Two types of complex traits Quantitative traits–continuous variation Dichotomous traits–discontinuous.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Lecture 22: Quantitative Traits II
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
Powerful Regression-based Quantitative Trait Linkage Analysis of General Pedigrees Pak Sham, Shaun Purcell, Stacey Cherny, Gonçalo Abecasis.
David M. Evans Multivariate QTL Linkage Analysis Queensland Institute of Medical Research Brisbane Australia Twin Workshop Boulder 2003.
Introduction to Genetic Theory
Biometrical Genetics Shaun Purcell Twin Workshop, March 2004.
Lecture 2 Survey Data Analysis Principal Component Analysis Factor Analysis Exemplified by SPSS Taylan Mavruk.
Power in QTL linkage analysis
The Chi Square Test A statistical method used to determine goodness of fit Chi-square requires no assumptions about the shape of the population distribution.
Regression Models for Linkage: Merlin Regress
Basic Estimation Techniques
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
B&A ; and REGRESSION - ANCOVA B&A ; and
Estimation & Hypothesis Testing for Two Population Parameters
Inference for Regression
Genome Wide Association Studies using SNP
Part Three. Data Analysis
Slope of the regression line:
I Have the Power in QTL linkage: single and multilocus analysis
Basic Estimation Techniques
Regression-based linkage analysis
Linkage in Selected Samples
Harald H.H. Göring, Joseph D. Terwilliger 
Modelling data and curve fitting
Lecture 4: Testing for Departures from Hardy-Weinberg Equilibrium
Linear regression Fitting a straight line to observations.
Why general modeling framework?
Pak Sham & Shaun Purcell Twin Workshop, March 2002
Simple Linear Regression
Tutorial 1: Misspecification
p(A) = p(AA) + ½ p(Aa) p(a) = p(aa)+ ½ p(Aa)
Chapter 7 Lecture 3 Section: 7.5.
Lecture 9: QTL Mapping II: Outbred Populations
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
Causal Relationships with measurement error in the data
Chapter 7 Lecture 3 Section: 7.5.
Presentation transcript:

Power Calculation for QTL Association Pak Sham, Shaun Purcell Twin Workshop 2001

Biometrical model Genotype AA Aa aa Frequency (1-p) 2 2p(1-p) p2 Trait mean -a d a Trait variance 2 2 2 Overall mean a(2p-1)+2dp(1-p)

P(X) = GP(X|G)P(G) P(X) Aa aa AA X

Equal allele frequencies

Rare increaser allele

Linear regression analysis aa Aa AA

Power of QTL association - regression analysis N = [z - z1-] 2 / A2 z : standard normal deviate for significance  z1- : standard normal deviate for power 1- A2 : proportion of variance due to additive QTL

Required Sample Sizes QTL variance 10%

Power of likelihood ratio tests For chi-squared tests on large samples, power is determined by non-centrality parameter () and degrees of freedom (df)  = E(2lnL1 - 2lnL0) = E(2lnL1 ) - E(2lnL0) where expectations are taken at asymptotic values of maximum likelihood estimates (MLE) under an assumed true model

Between and within sibships components of means

Variance/Covariance explained The better the fit of a means model: - the greater the explained variances and covariances - the smaller the residual variances and covariances

Variance of b- component

Variance of w- component

Covariance between b- and w- components

Null model

Between model

Within model

Full model

NCPs for component tests

Determinant of a uniform covariance matrix

Determinants of residual covariance matrices

NCPs of b- and w- tests

Definitions of LD parameters B1 B2 A1 pr + D ps - D p A2 qr - D qs + D q r s pr + D < min(p, r) D < min(p, r) - pr  DMAX = min(ps, rq) = min(p-pr, r-pr) D’ = D / DMAX = min(ps, rq) R2 = D2 / pqrs

Apparent variance components at marker locus where

Exercise: Genetic Power Calculator Use Genetic Power Calculator, Association Analysis option Investigate the sample size requirement for the between and within sibship tests under a range of assumptions Vary sibship size additive QTL variance sibling correlation QTL allele frequencies marker allele frequencies D’

N for 90% power Individuals 0 - 10% QTL variance QTL, Marker allele freqs = 0.50 D-prime = 1 No dominance Type I error rate = 0.05 Test for total association

QTL variance

QTL variance

Effect of sibship size Sibship size 1 - 5 Sib correlation = 0.25 , 0.75 5% QTL variance QTL, Marker allele freqs = 0.50 D-prime = 1 No dominance Type I error rate = 0.05

Total

Within

Between

Exercises 1. What effect does the QTL allele frequency have on power if the test is at the QTL ? 2. What effect does D’ have? 3. What is the effect of differences between QTL and marker allele frequency?

Allele frequency & LD QTL allele freq = 0.05, no dominance Sample sizes for 90% power : Marker allele freq 0.1 0.25 0.5 D’ 1 1 1 N 205 625 1886 D’ 0.5 0.5 0.5 N 835 2517 7560