Power Calculation Practical

Slides:



Advertisements
Similar presentations
Bivariate analysis HGEN619 class 2007.
Advertisements

ACE estimates under different sample sizes Benjamin Neale Michael Neale Boulder 2006.
Univariate Model Fitting
Parametric/Nonparametric Tests. Chi-Square Test It is a technique through the use of which it is possible for all researchers to:  test the goodness.
A Mathematica ® based regression analysis program Analisys … A Curve Fitting Application.
Multiple Testing, Permutation, False Discovery Benjamin Neale Pak Sham Shaun Purcell.
Path Analysis SPSS/AMOS
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Multivariate Genetic Analysis: Introduction(II) Frühling Rijsdijk & Shaun Purcell Wednesday March 6, 2002.
Power and Sample Size + Principles of Simulation Benjamin Neale March 4 th, 2010 International Twin Workshop, Boulder, CO.
ACDE model and estimability Why can’t we estimate (co)variances due to A, C, D and E simultaneously in a standard twin design?
(Re)introduction to Mx. Starting at the beginning Data preparation Mx expects 1 line per case/family Almost limitless number of families and variables.
Univariate Analysis in Mx Boulder, Group Structure Title Type: Data/ Calculation/ Constraint Reading Data Matrices Declaration Assigning Specifications/
Introduction to Multivariate Genetic Analysis Kate Morley and Frühling Rijsdijk 21st Twin and Family Methodology Workshop, March 2008.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Power and Sample Size Adapted from: Boulder 2004 Benjamin Neale Shaun Purcell I HAVE THE POWER!!!
Linkage Analysis in Merlin
Karri Silventoinen University of Helsinki Osaka University.
Data Analysis II Anthony E. Butterfield CH EN "There is a theory which states that if ever anybody discovers exactly what the Universe is for and.
Kayla Jordan D. Wayne Mitchell RStats Institute Missouri State University.
Power & Sample Size and Principles of Simulation Michael C Neale HGEN 691 Sept Based on Benjamin Neale March 2014 International Twin Workshop,
Karri Silventoinen University of Helsinki Osaka University.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
Power and Sample Size Boulder 2004 Benjamin Neale Shaun Purcell.
Random Sampling Approximations of E(X), p.m.f, and p.d.f.
Distributions, Iteration, Simulation Why R will rock your world (if it hasn’t already)
Achievement & Ascription in Educational Attainment Genetic & Environmental Influences on Adolescent Schooling François Nielsen.
Power Simulation Ascertainment Benjamin Neale March 6 th, 2014 International Twin Workshop, Boulder, CO Denotes practical Denotes He-man.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Mx Practical TC20, 2007 Hermine H. Maes Nick Martin, Dorret Boomsma.
Frühling Rijsdijk & Kate Morley
Categorical Data Frühling Rijsdijk 1 & Caroline van Baal 2 1 IoP, London 2 Vrije Universiteit, A’dam Twin Workshop, Boulder Tuesday March 2, 2004.
Nonparametric Statistics
QTL Mapping Using Mx Michael C Neale Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Extended Pedigrees HGEN619 class 2007.
Nonparametric Statistics
Hypothesis Tests l Chapter 7 l 7.1 Developing Null and Alternative
Advanced Statistical Methods: Continuous Variables
BINARY LOGISTIC REGRESSION
Measurement, Quantification and Analysis
HGEN Thanks to Fruhling Rijsdijk
Power and p-values Benjamin Neale March 10th, 2016
Normal Distribution and Parameter Estimation
Structural Equation Modeling using MPlus
Randomization Tests PSU /2/14.
Univariate Twin Analysis
Opening Routine.
Multiple testing etc. Benjamin Neale Leuven 2008.
Re-introduction to openMx
Classification of unlabeled data:
Path Analysis Danielle Dick Boulder 2008
CJT 765: Structural Equation Modeling
Quick Start Guide for Visual Studio 2010
Univariate modeling Sarah Medland.
Nonparametric Statistics
(Re)introduction to Mx Sarah Medland
Sarah Medland faculty/sarah/2018/Tuesday
Statistics II: An Overview of Statistics
Power and Sample Size I HAVE THE POWER!!! Boulder 2006 Benjamin Neale.
This is where R scripts will load
Lecture Slides Elementary Statistics Twelfth Edition
Power Calculation for QTL Association
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
ACE estimates under different sample sizes
Power Calculation Practical
Multivariate Genetic Analysis: Introduction
Threshold Autoregressive
Presentation transcript:

Power Calculation Practical Benjamin Neale

Power Calculations Empirical Attempt to Grasp the NCP from Null Simulate Data under theorized model Calculate Statistics and Perform Test Given α, how many tests p < α Power = (#hits)/(#tests)

Practical: Empirical Power 1 We will Simulate Data under a model online We will run an ACE model, and test for C We will then submit our results and Jeff will collate the empirical values While that is being calculated, we’ll talk about theoretical power calculations

Practical: Empirical Power 2 First get ace.mx and rprog.R from /faculty/ben/2006/power/practical/. We’ll talk about what the R program does before we run it

Simulation of the MZs: model rGmz rCmz 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 A C E A C E C C . 5477 . 5477 E E A A . 4472 . 4472 . 7071 . 7071 MZ twin 1 MZ twin

Redrawn MZ model 1 . 00 1 . 00 E 1 . 00 E A E A A . 4472 . 7071 . 7071 . 4472 . 7071 . 7071 E . 4472 MZ twin 1 MZ twin 2 C C . 5477 . 5477 1 . 00 C

When we simulate From a path diagram, we can simulate trait values from simulating each latent trait These latent traits are assumed to be normal (μ=0,σ2=1 or =0,2=1) The latent trait is then multiplied by the path coefficient

What’s a random normal 0.4 0.3 frequency) 0.2 0.1 0.0 -4 -2 2 4 x

Redrawn MZ model Random Normal 1 MZ twin 1 trait: Norm1*A(0.7071) + 00 1 . 00 E 1 . 00 E A A E . 7071 A . 4472 . 7071 E . 4472 MZ twin 1 MZ twin 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + MZ twin 2 trait:

Redrawn MZ model Random Normal 1 MZ twin 1 trait: Norm1*A(0.7071) + 00 1 . 00 E 1 . 00 E A A E A . 7071 . 7071 E . 4472 . 4472 MZ twin 1 MZ twin 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + MZ twin 2 trait: Norm1*A(0.7071) +

Redrawn MZ model Random Normal 2 1 . 00 1 . 00 E 1 . 00 E A E A A . 7071 . 7071 E . 4472 . 4472 MZ twin 1 MZ twin 2 Random Normal 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + Norm2*C(0.5477) MZ twin 2 trait: Norm1*A(0.7071) +

Redrawn MZ model Random Normal 2 1 . 00 1 . 00 E 1 . 00 E A E A A . 7071 . 7071 E . 4472 . 4472 MZ twin 1 MZ twin 2 Random Normal 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + Norm2*C(0.5477) MZ twin 2 trait: Norm1*A(0.7071) + Norm2*C(0.5477)

Redrawn MZ model Random Normal 3 1 . 00 1 . 00 E 1 . 00 E A E A A . 7071 . 7071 E Random Normal 3 . 4472 . 4472 MZ twin 1 MZ twin 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + Norm2*C(0.5477) + Norm3*E(0.4472) MZ twin 2 trait: Norm1*A(0.7071) + Norm2*C(0.5477) +

Redrawn MZ model Random Normal 4 1 . 00 1 . 00 E 1 . 00 E A E A A . 4472 . 7071 . 7071 E . 4472 Random Normal 4 MZ twin 1 MZ twin 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + Norm2*C(0.5477) + Norm3*E(0.4472) MZ twin 2 trait: Norm1*A(0.7071) + Norm2*C(0.5477) + Norm4*E(0.4472)

Simulation of the DZs: model rGmz rCmz . 50 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 A C E A C E C C . 5477 . 5477 E E A A . 4472 . 4472 . 7071 . 7071 MZ twin 1 MZ twin

Redrawn DZ model . 50 . 50 1 . 00 1 . 00 Asp Asp E . 50 E Asp Asp . 7071 Aco . 7071 E E . 4472 . 4472 Aco Aco . 7071 . 7071 DZ twin 1 DZ twin 2 C C . 5477 . 5477 C 1 . 00 How many random normals will we need to supply a trait value for both DZ twins?

Redrawn DZ model Note: σ2(K*X) = K2*σ2(x) . 50 . 50 1 . 00 1 . 00 Asp Asp E . 50 E Asp Note: σ2(K*X) = K2*σ2(x) When K is a constant hence 0.7071*norm5 Asp . 7071 Aco . 7071 E E . 4472 . 4472 Aco Aco . 7071 . 7071 DZ twin 1 DZ twin 2 C C . 5477 . 5477 C 1 . 00 DZ twin 1 trait: 0.7071*Norm5*Aco(0.7071) + 0.7071*Norm6*Asp(0.7071) + Norm7*C(0.5477) + Norm8*E(0.4472) DZ twin 2 trait: 0.7071*Norm5*Aco(0.7071) + 0.7071*Norm9*Asp(0.7071) + Norm7*C(0.5477) + Norm10*E(0.4472)

Simulation conditions 50% additive genetic variance 30% common environment variance 20% specific environment variance

Notes on the R program When you run the R program it is essential that you change your working directory to where you saved the Mx script. File menu then Change dir… After changing directory, load the R program. A visual guide to this follows this slide

Picture of the menu CHANGE DIR… This is the menu item you must change to change where the simulated data will be placed Note you must have the R console highlighted

Picture of the dialog box Either type the path name or browse to where you saved ACE.mx

Running the R script SOURCE R CODE… This is where we load the R program that simulates data

Screenshot of source code selection This is the file rprog.R for the source code

How do I know if it has worked? If you have run the R program correctly, then the file sim.fun ought to be in the directory where your rprog.R and ACE.mx is. If not, try again or raise your hand.

When you have finished Note your likelihoods and your parameter estimates and complete the survey at: https://ibgwww.colorado.edu/phpsurveyor/index.php?sid=4

Theoretical power calculations Either derive the power solutions by hand (though this requires lots of time and more IQ points than I have) Use Mx to setup the variance covariance structure and use option power to generate power levels

Quick note on the power calculations for Mx Total sample size is reported at the end of the script The sample size proportions for your groups are maintained. For example if we say 50 MZ pairs and 100 DZ pairs, then Mx will assume 1/3 of your sample is MZ and 2/3 is DZ

Time to look at a script Open power.mx, and we’ll chat about it. Quick overview of what the script does: Generates the variance covariance structure under the full model (1st half) Intentionally fits the wrong model (by dropping the parameter of interest for power calculations) (2nd half) Based on the number of observations that you supply generates power estimates.

Theoretical script Following chatting, depending on time, here are some suggestions: Change ratio of MZ and DZ keeping same total sample size Drop A rather than C Change effect sizes for A, C, or E