More on thresholds Sarah Medland.

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

SEM PURPOSE Model phenomena from observed or theoretical stances
Fitting Bivariate Models October 21, 2014 Elizabeth Prom-Wormley & Hermine Maes
Correlation and regression
Presentation title Date Responder endpoint and continuous endpoint, logistic regression or ANOVA? DSBS 24 OCT 2013 Søren Andersen.
Ch11 Curve Fitting Dr. Deshi Ye
Statistical Tests Karen H. Hagglund, M.S.
(Re)introduction to Mx Sarah Medland. KiwiChinese Gooseberry.
(Re)introduction to Mx. Starting at the beginning Data preparation Mx expects 1 line per case/family Almost limitless number of families and variables.
Thresholds and ordinal data Sarah Medland – Boulder 2010.
Introduction to Multivariate Genetic Analysis Kate Morley and Frühling Rijsdijk 21st Twin and Family Methodology Workshop, March 2008.
FE Mathematics Review Dr. Scott Molitor Associate Professor
Karri Silventoinen University of Helsinki Osaka University.
1 More about the Sampling Distribution of the Sample Mean and introduction to the t-distribution Presentation 3.
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
Continuously moderated effects of A,C, and E in the twin design Conor V Dolan & Sanja Franić (BioPsy VU) Boulder Twin Workshop March 4, 2014 Based on PPTs.
Introduction to OpenMx Sarah Medland. What is OpenMx? Free, Open-source, full–featured SEM package Software which runs on Windows, Mac OSX, and Linux.
Ordinal (yet again) Sarah Medland – Boulder 2010.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
The importance of the “Means Model” in Mx for modeling regression and association Dorret Boomsma, Nick Martin Boulder 2008.
Measurement Models: Identification and Estimation James G. Anderson, Ph.D. Purdue University.
SEM Basics 2 Byrne Chapter 2 Kline pg 7-15, 50-51, ,
Over-fitting and Regularization Chapter 4 textbook Lectures 11 and 12 on amlbook.com.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Tutorial I: Missing Value Analysis
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
Means, Thresholds and Moderation Sarah Medland – Boulder 2008 Corrected Version Thanks to Hongyan Du for pointing out the error on the regression examples.
Frühling Rijsdijk & Kate Morley
Categorical Data Frühling Rijsdijk 1 & Caroline van Baal 2 1 IoP, London 2 Vrije Universiteit, A’dam Twin Workshop, Boulder Tuesday March 2, 2004.
Welcome  Log on using the username and password you received at registration  Copy the folder: F:/sarah/mon-morning To your H drive.
Copy folder (and subfolders) F:\sarah\linkage2. Linkage in Mx Sarah Medland.
More on thresholds Sarah Medland. A plug for OpenMx? Very few packages can handle ordinal data adequately… OpenMx can also be used for more than just.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
QTL Mapping Using Mx Michael C Neale Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
GIS and Spatial Analysis1 Summary  Parametric Test  Interval/ratio data  Based on normal distribution  Difference in Variances  Differences are just.
Categorical Data HGEN
Extended Pedigrees HGEN619 class 2007.
Regression Models for Linkage: Merlin Regress
HGEN Thanks to Fruhling Rijsdijk
Lecture Nine - Twelve Tests of Significance.
Ordinal Data Sarah Medland.
Univariate Twin Analysis
Introduction to Multivariate Genetic Analysis
Correlation, Regression & Nested Models
Fitting Univariate Models to Continuous and Categorical Data
Re-introduction to openMx
Ordinal data, matrix algebra & factor analysis
More on thresholds Sarah Medland.
Univariate modeling Sarah Medland.
Factor model of Ordered-Categorical Measures: Measurement Invariance
Statistical Methods For Engineers
The t distribution and the independent sample t-test
Hypothesis testing. Chi-square test
Comparing Populations
Model Comparison: some basic concepts
Liability Threshold Models
Why general modeling framework?
(Re)introduction to Mx Sarah Medland
Longitudinal Modeling
Sarah Medland & Nathan Gillespie
Sarah Medland faculty/sarah/2018/Tuesday
Lucía Colodro Conde Sarah Medland & Matt Keller
Multivariate Linkage Continued
Lecture 11 – Increasing Model Complexity
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
Brad Verhulst & Sarah Medland
Testing Causal Hypotheses
Presentation transcript:

More on thresholds Sarah Medland

A plug for OpenMx? Very few packages can handle ordinal data adequately… OpenMx can also be used for more than just genetic analyses Regression Polycohoric correlations Factor analysis...

Names for different types of thresholds http://ibgwww.colorado.edu/workshop2004/cdrom/HTML/book2004a.pdf

Two approaches to the liability threshold model Problem Ordinal data has 1 less degree of freedom MZcov, DZcov, Prevalence No information on the variance Thinking about our ACE/ADE model 4 parameters being estimated A C E mean ACE/ADE model is unidentified without adding a constraint

Two approaches to the liability threshold model with binary data Solution? Traditional Maps data to a standard normal distribution Total variance constrained to be 1 Alternate Fixes an alternate parameter (usually E) Estimates the remaining parameters

Traditional Approach Imagine we have a set of binary data Trait – lifetime cannabis use Never Smoked/Ever Smoked

Twin 1 cannabis use 0 = never used

Twin 1 cannabis use

Twin 1 cannabis use Liability or ‘risk’ of initiation distribution Just because an individual has never used cannabis does not mean their ‘risk’ of initiation is zero

Mean = .47 SD =.499 Non Smokers =53% The observed phenotype is an imperfect measurement of an underlying continuous distribution ie Obesity vs BMI MDD vs quantitative depression scales

Raw data distribution Mean = .47 SD =.499 Non Smokers =53% Threshold =.53 Standard normal distribution Mean = 0 SD =1 Non Smokers =53% Threshold =.074

Threshold = .074 – Huh what? How can I work this out Excell R =NORMSINV() Thresholds.xls R qnorm(.5296)

Why rescale the data this way? Convenience Variance always 1 Mean is always 0 We can interpret the area under a curve between two z-values as a probability or percentage

Threshold.R

Threshold.R

Threshold = .075 – Huh what?

What about more than 2 categories? Very similar We create a matrix containing the 1st threshold and the displacements between subsequent matrices We then add the 1st threshold and the displacement to obtain the subsequent thresholds

Mx Threshold Specification: 3+ Cat. -3 3 1.2 -1 2.2 Threshold matrix: T Full 2 2 Free Twin 1 Twin 2 1st threshold increment

Mx Threshold Specification: 3+ Cat. -3 3 1.2 -1 2.2 Threshold matrix: T Full 2 2 Free Twin 1 Twin 2 1st threshold increment MxAlgebra L%*%T

Mx Threshold Specification: 3+ Cat. -3 3 1.2 -1 2.2 Threshold matrix: T Full 2 2 Free Twin 1 Twin 2 1st threshold increment MxAlgebra L%*%T 2nd threshold

Check the xls spreadsheet…

Two approaches to the liability threshold model Solution? Traditional Maps data to a standard normal distribution Total variance constrained to be 1 Alternate Fixes an alternate parameter Binary or Ordinal data fix E Ordinal data fix 1st two thresholds (aka invariant threshold approach) Estimate the remaining parameters

Fixed Thresholds 1 ?

Models are equivalent, but… Alternate approach means the data is no longer mapped to a standard normal Need to know mean and variance to convert to % Makes it difficult to compare between groups as the scaling is now arbitrary

There is are other scripts in the folder – take a look later We are going to run direct variance estimation traditional and Fixed Thresholds ACE models with ordinal data twinAceOrd-Traditional.R twinAceOrd-FixThreshold.R There is are other scripts in the folder – take a look later twinAceBin-Traditional.R twinAceBin-FixE.R twinAceOrd-FixE.R

Lisbon Castle