Power Calculation Practical

Slides:



Advertisements
Similar presentations
ACE estimates under different sample sizes Benjamin Neale Michael Neale Boulder 2006.
Advertisements

Univariate Model Fitting
A Mathematica ® based regression analysis program Analisys … A Curve Fitting Application.
Estimating the Standard Deviation © Christine Crisp “Teach A Level Maths” Statistics 2.
Multiple Testing, Permutation, False Discovery Benjamin Neale Pak Sham Shaun Purcell.
Path Analysis SPSS/AMOS
Path Analysis Danielle Dick Boulder Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Multivariate Genetic Analysis: Introduction(II) Frühling Rijsdijk & Shaun Purcell Wednesday March 6, 2002.
Power and Sample Size + Principles of Simulation Benjamin Neale March 4 th, 2010 International Twin Workshop, Boulder, CO.
ACDE model and estimability Why can’t we estimate (co)variances due to A, C, D and E simultaneously in a standard twin design?
(Re)introduction to Mx. Starting at the beginning Data preparation Mx expects 1 line per case/family Almost limitless number of families and variables.
Univariate Analysis in Mx Boulder, Group Structure Title Type: Data/ Calculation/ Constraint Reading Data Matrices Declaration Assigning Specifications/
Introduction to Multivariate Genetic Analysis Kate Morley and Frühling Rijsdijk 21st Twin and Family Methodology Workshop, March 2008.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
LECTURE 16 STRUCTURAL EQUATION MODELING.
Power and Sample Size Adapted from: Boulder 2004 Benjamin Neale Shaun Purcell I HAVE THE POWER!!!
Linkage Analysis in Merlin
Karri Silventoinen University of Helsinki Osaka University.
Kayla Jordan D. Wayne Mitchell RStats Institute Missouri State University.
Power & Sample Size and Principles of Simulation Michael C Neale HGEN 691 Sept Based on Benjamin Neale March 2014 International Twin Workshop,
Karri Silventoinen University of Helsinki Osaka University.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Power and Sample Size Boulder 2004 Benjamin Neale Shaun Purcell.
Macro Recording. Macros Image-pro Plus has an internal programming language called Auto-Pro. We can use Auto-pro to create: Executable Routines Executable.
Random Sampling Approximations of E(X), p.m.f, and p.d.f.
Distributions, Iteration, Simulation Why R will rock your world (if it hasn’t already)
Achievement & Ascription in Educational Attainment Genetic & Environmental Influences on Adolescent Schooling François Nielsen.
Estimating and Testing Hypotheses about Means James G. Anderson, Ph.D. Purdue University.
Power Simulation Ascertainment Benjamin Neale March 6 th, 2014 International Twin Workshop, Boulder, CO Denotes practical Denotes He-man.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Mx Practical TC20, 2007 Hermine H. Maes Nick Martin, Dorret Boomsma.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Frühling Rijsdijk & Kate Morley
QTL Mapping Using Mx Michael C Neale Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Nonparametric Statistics
Hypothesis Tests l Chapter 7 l 7.1 Developing Null and Alternative
BINARY LOGISTIC REGRESSION
Measurement, Quantification and Analysis
HGEN Thanks to Fruhling Rijsdijk
Power and p-values Benjamin Neale March 10th, 2016
Structural Equation Modeling using MPlus
Randomization Tests PSU /2/14.
Multiple testing etc. Benjamin Neale Leuven 2008.
Re-introduction to openMx
Classification of unlabeled data:
MRC SGDP Centre, Institute of Psychiatry, Psychology & Neuroscience
Path Analysis Danielle Dick Boulder 2008
CJT 765: Structural Equation Modeling
Quick Start Guide for Visual Studio 2010
Slope of the regression line:
SA3202 Statistical Methods for Social Sciences
Univariate modeling Sarah Medland.
Nonparametric Statistics
Quantitative Methods in HPELS HPELS 6210
(Re)introduction to Mx Sarah Medland
Sarah Medland faculty/sarah/2018/Tuesday
Statistics II: An Overview of Statistics
Power Calculation for QTL Association
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
Power Calculation Practical
ACE estimates under different sample sizes
UNIT-4.
Multivariate Genetic Analysis: Introduction
SEM: Step by Step In AMOS and Mplus.
Heritability of the Specific Cognitive Ability of Face Perception
Presentation transcript:

Power Calculation Practical Benjamin Neale

Power Calculations Empirical Attempt to Grasp the NCP from Null Simulate Data under theorized model Calculate Statistics and Perform Test Given α, how many tests p < α Power = (#hits)/(#tests)

Practical: Empirical Power 1 We will Simulate Data under a model online We will run an ACE model, and test for C We will then submit our results and Jeff will collate the empirical values While that is being calculated, we’ll talk about theoretical power calculations

Practical: Empirical Power 2 First get 2008powerpractical.ppt, ace.mx and rprog.R from /faculty/ben/2008/power/practical/. We’ll talk about what the R program does before we run it

Simulation of the MZs: model rGmz rCmz 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 A C E A C E C C . 5477 . 5477 E E A A . 4472 . 4472 . 7071 . 7071 MZ twin 1 MZ twin

Redrawn MZ model 1 . 00 1 . 00 E 1 . 00 E A E A A . 4472 . 7071 . 7071 . 4472 . 7071 . 7071 E . 4472 MZ twin 1 MZ twin 2 C C . 5477 . 5477 1 . 00 C

When we simulate From a path diagram, we can simulate trait values from simulating each latent trait These latent traits are assumed to be normal (μ=0,σ2=1) The latent trait is then multiplied by the path coefficient

What’s a random normal 0.4 0.3 frequency) 0.2 0.1 0.0 -4 -2 2 4 x

Redrawn MZ model Random Normal 1 MZ twin 1 trait: Norm1*A(0.7071) + 00 1 . 00 E 1 . 00 E A A E . 7071 A . 4472 . 7071 E . 4472 MZ twin 1 MZ twin 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + MZ twin 2 trait:

Redrawn MZ model Random Normal 1 MZ twin 1 trait: Norm1*A(0.7071) + 00 1 . 00 E 1 . 00 E A A E A . 7071 . 7071 E . 4472 . 4472 MZ twin 1 MZ twin 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + MZ twin 2 trait: Norm1*A(0.7071) +

Redrawn MZ model Random Normal 2 1 . 00 1 . 00 E 1 . 00 E A E A A . 7071 . 7071 E . 4472 . 4472 MZ twin 1 MZ twin 2 Random Normal 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + Norm2*C(0.5477) MZ twin 2 trait: Norm1*A(0.7071) +

Redrawn MZ model Random Normal 2 1 . 00 1 . 00 E 1 . 00 E A E A A . 7071 . 7071 E . 4472 . 4472 MZ twin 1 MZ twin 2 Random Normal 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + Norm2*C(0.5477) MZ twin 2 trait: Norm1*A(0.7071) + Norm2*C(0.5477)

Redrawn MZ model Random Normal 3 1 . 00 1 . 00 E 1 . 00 E A E A A . 7071 . 7071 E Random Normal 3 . 4472 . 4472 MZ twin 1 MZ twin 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + Norm2*C(0.5477) + Norm3*E(0.4472) MZ twin 2 trait: Norm1*A(0.7071) + Norm2*C(0.5477) +

Redrawn MZ model Random Normal 4 1 . 00 1 . 00 E 1 . 00 E A E A A . 4472 . 7071 . 7071 E . 4472 Random Normal 4 MZ twin 1 MZ twin 2 C C . 5477 . 5477 C 1 . 00 MZ twin 1 trait: Norm1*A(0.7071) + Norm2*C(0.5477) + Norm3*E(0.4472) MZ twin 2 trait: Norm1*A(0.7071) + Norm2*C(0.5477) + Norm4*E(0.4472)

Simulation of the DZs: model rGmz rCmz . 50 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 1 . 00 A C E A C E C C . 5477 . 5477 E E A A . 4472 . 4472 . 7071 . 7071 MZ twin 1 MZ twin

Redrawn DZ model . 50 . 50 1 . 00 1 . 00 Asp Asp E . 50 E Asp Asp . 7071 Aco . 7071 E E . 4472 . 4472 Aco Aco . 7071 . 7071 DZ twin 1 DZ twin 2 C C . 5477 . 5477 C 1 . 00 How many random normals will we need to supply a trait value for both DZ twins?

Redrawn DZ model Note: σ2(K*X) = K2*σ2(x) . 50 . 50 1 . 00 1 . 00 Asp Asp E . 50 E Asp Note: σ2(K*X) = K2*σ2(x) When K is a constant hence 0.7071*norm5 Asp . 7071 Aco . 7071 E E . 4472 . 4472 Aco Aco . 7071 . 7071 DZ twin 1 DZ twin 2 C C . 5477 . 5477 C 1 . 00 DZ twin 1 trait: 0.7071*Norm5*Aco(0.7071) + 0.7071*Norm6*Asp(0.7071) + Norm7*C(0.5477) + Norm8*E(0.4472) DZ twin 2 trait: 0.7071*Norm5*Aco(0.7071) + 0.7071*Norm9*Asp(0.7071) + Norm7*C(0.5477) + Norm10*E(0.4472)

Simulation conditions 50% additive genetic variance 30% common environment variance 20% specific environment variance

Notes on the R program When you run the R program it is essential that you change your working directory to where you saved the Mx script. File menu then Change dir… After changing directory, load the R program. A visual guide to this follows this slide

Picture of the menu CHANGE DIR… This is the menu item you must change to change where the simulated data will be placed Note you must have the R console highlighted

Picture of the dialog box Either type the path name or browse to where you saved ACE.mx

Running the R script SOURCE R CODE… This is where we load the R program that simulates data

Screenshot of source code selection This is the file rprog.R for the source code

How do I know if it has worked? If you have run the R program correctly, then the file sim.fun ought to be in the directory where your rprog.R and ACE.mx is. If not, try again or raise your hand.

When you have finished Record your A2, C2, and E2 from the full model [1st part of the script] Note the chi square test from dropping C2 Report these online to Jeff, and he’ll make some pretty pictures: https://ibgwww.colorado.edu/limesurvey/index.php?sid=98196&newtest=Y

Theoretical power calculations Either derive the power solutions by hand (though this requires lots of time and more IQ points than I have) Use Mx to setup the variance covariance structure and use option power to generate power levels

Quick note on the power calculations for Mx Total sample size is reported at the end of the script The sample size proportions for your groups are maintained. For example if we say 50 MZ pairs and 100 DZ pairs, then Mx will assume 1/3 of your sample is MZ and 2/3 is DZ

Time to look at a script Open power.mx, and we’ll chat about it. Quick overview of what the script does: Generates the variance covariance structure under the full model (1st half) Intentionally fits the wrong model (by dropping the parameter of interest for power calculations) (2nd half) Based on the number of observations that you supply generates power estimates.

Theoretical script Following chatting, depending on time, here are some suggestions: Change ratio of MZ and DZ keeping same total sample size Drop A rather than C Change effect sizes for A, C, or E