Ordinal Data Sarah Medland.

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

Structural Equation Modeling. What is SEM Swiss Army Knife of Statistics Can replicate virtually any model from “canned” stats packages (some limitations.
SEM PURPOSE Model phenomena from observed or theoretical stances
Fitting Bivariate Models October 21, 2014 Elizabeth Prom-Wormley & Hermine Maes
Correlation and regression
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Ch11 Curve Fitting Dr. Deshi Ye
(Re)introduction to Mx Sarah Medland. KiwiChinese Gooseberry.
(Re)introduction to Mx. Starting at the beginning Data preparation Mx expects 1 line per case/family Almost limitless number of families and variables.
Thresholds and ordinal data Sarah Medland – Boulder 2010.
Introduction to Multivariate Genetic Analysis Kate Morley and Frühling Rijsdijk 21st Twin and Family Methodology Workshop, March 2008.
Objectives of Multiple Regression
Karri Silventoinen University of Helsinki Osaka University.
Regression with 2 IVs Generalization of Regression from 1 to 2 Independent Variables.
1 More about the Sampling Distribution of the Sample Mean and introduction to the t-distribution Presentation 3.
MULTIPLE REGRESSION Using more than one variable to predict another.
Continuously moderated effects of A,C, and E in the twin design Conor V Dolan & Sanja Franić (BioPsy VU) Boulder Twin Workshop March 4, 2014 Based on PPTs.
Introduction to OpenMx Sarah Medland. What is OpenMx? Free, Open-source, full–featured SEM package Software which runs on Windows, Mac OSX, and Linux.
Ordinal (yet again) Sarah Medland – Boulder 2010.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
The importance of the “Means Model” in Mx for modeling regression and association Dorret Boomsma, Nick Martin Boulder 2008.
Measurement Models: Identification and Estimation James G. Anderson, Ph.D. Purdue University.
Going from data to analysis Dr. Nancy Mayo. Getting it right Research is about getting the right answer, not just an answer An answer is easy The right.
SEM Basics 2 Byrne Chapter 2 Kline pg 7-15, 50-51, ,
Over-fitting and Regularization Chapter 4 textbook Lectures 11 and 12 on amlbook.com.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Tutorial I: Missing Value Analysis
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
Means, Thresholds and Moderation Sarah Medland – Boulder 2008 Corrected Version Thanks to Hongyan Du for pointing out the error on the regression examples.
Frühling Rijsdijk & Kate Morley
Categorical Data Frühling Rijsdijk 1 & Caroline van Baal 2 1 IoP, London 2 Vrije Universiteit, A’dam Twin Workshop, Boulder Tuesday March 2, 2004.
Welcome  Log on using the username and password you received at registration  Copy the folder: F:/sarah/mon-morning To your H drive.
Copy folder (and subfolders) F:\sarah\linkage2. Linkage in Mx Sarah Medland.
More on thresholds Sarah Medland. A plug for OpenMx? Very few packages can handle ordinal data adequately… OpenMx can also be used for more than just.
QTL Mapping Using Mx Michael C Neale Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
GIS and Spatial Analysis1 Summary  Parametric Test  Interval/ratio data  Based on normal distribution  Difference in Variances  Differences are just.
Categorical Data HGEN
Extended Pedigrees HGEN619 class 2007.
HGEN Thanks to Fruhling Rijsdijk
Lecture Nine - Twelve Tests of Significance.
Univariate Twin Analysis
M.Sc. in Economics Econometrics Module I
Introduction to Multivariate Genetic Analysis
Correlation, Regression & Nested Models
Fitting Univariate Models to Continuous and Categorical Data
Re-introduction to openMx
Ordinal data, matrix algebra & factor analysis
More on thresholds Sarah Medland.
Univariate modeling Sarah Medland.
Factor model of Ordered-Categorical Measures: Measurement Invariance
Statistical Methods For Engineers
CHAPTER 29: Multiple Regression*
Comparing Populations
Liability Threshold Models
Why general modeling framework?
(Re)introduction to Mx Sarah Medland
Longitudinal Modeling
Sarah Medland & Nathan Gillespie
Sarah Medland faculty/sarah/2018/Tuesday
More on thresholds Sarah Medland.
Multiple Regression Analysis: Estimation
Lucía Colodro Conde Sarah Medland & Matt Keller
Lecture 11 – Increasing Model Complexity
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
Brad Verhulst & Sarah Medland
Testing Causal Hypotheses
Presentation transcript:

Ordinal Data Sarah Medland

A plug for OpenMx? Very few packages can handle ordinal data adequately… OpenMx can also be used for more than just genetic analyses Regression Polycohoric correlations Factor analysis...

Two approaches to the liability threshold model Problem Ordinal data has 1 less degree of freedom MZcov, DZcov, Prevalence No information on the variance Thinking about our ACE/ADE model 4 parameters being estimated A C E mean ACE/ADE model is unidentified without adding a constraint

Two approaches to the liability threshold model Solution? Traditional Maps data to a standard normal distribution Total variance constrained to be 1 Alternate Fixes an alternate parameter (usually E) Estimates the remaining parameters

Traditional Approach Imagine we have a set of binary data Trait – lifetime cannabis use Never Smoked/Ever Smoked

Twin 1 cannabis use 0 = never used

Twin 1 cannabis use

Twin 1 cannabis use Liability or ‘risk’ of initiation distribution Just because an individual has never used cannabis does not mean their ‘risk’ of initiation is zero

Mean = .47 SD =.499 Non Smokers =53% The observed phenotype is an imperfect measurement of an underlying continuous distribution ie Obesity vs BMI MDD vs quantitative depression scales

Raw data distribution Mean = .47 SD =.499 Non Smokers =53% Threshold =.53 Standard normal distribution Mean = 0 SD =1 Non Smokers =53% Threshold =.074

Threshold = .074 – Huh what? How can I work this out Excell =NORMSINV() Thresholds.xls

Why rescale the data this way? Convenience Variance always 1 Mean is always 0 We can interpret the area under a curve between two z-values as a probability or percentage

Threshold.R

Threshold.R

Threshold = .075 – Huh what?

What about more than 2 categories? Very similar We create a matrix containing the 1st threshold and the displacements between subsequent matrices We then add the 1st threshold and the displacement to obtain the subsequent thresholds

Mx Threshold Specification: 3+ Cat. -3 3 1.2 -1 2.2 Threshold matrix: T Full 2 2 Free Twin 1 Twin 2 1st threshold increment

Mx Threshold Specification: 3+ Cat. -3 3 1.2 -1 2.2 Threshold matrix: T Full 2 2 Free Twin 1 Twin 2 1st threshold increment MxAlgebra L%*%T

Mx Threshold Specification: 3+ Cat. -3 3 1.2 -1 2.2 Threshold matrix: T Full 2 2 Free Twin 1 Twin 2 1st threshold increment MxAlgebra L%*%T 2nd threshold

Check the xls spreadsheet…

Two approaches to the liability threshold model Solution? Traditional Maps data to a standard normal distribution Total variance constrained to be 1 Alternate Fixes an alternate parameter Binary or Ordinal data fix E Ordinal data fix 1st two thresholds (aka invariant threshold approach) Estimate the remaining parameters

Models are equivalent, but… Alternate approach means the data is no longer mapped to a standard normal No easy conversion to % Makes it difficult to compare between groups as the scaling is now arbitrary

We are going to run traditional and FixedE ACE models with binary data twinAceBin-Traditional.R twinAceBin-FixE.R There is are scripts in the folder that have multiple threshold models in them – take a look later twinAceOrd-Traditional.R twinAceOrd-FixE.R twinAceOrd-FixThresholds.R

Lisbon Castle