G89.2247 Lecture 101 Examples of Binary Data Binary Data and Correlation Measurement Models and Binary Data Measurement Models and Ordinal Data Analyzing.

Presentation transcript:

G89.2247 Lecture 10, Slide 1:
Examples of Binary Data
Binary Data and Correlation
Measurement Models and Binary Data
Measurement Models and Ordinal Data
Analyzing binary data with different SEM software packages

Slide 2: Examples of Binary Data
Some binary outcomes have categorical meaning
  • Did Tasha get an academic job? (yes/no)
  • Has Jimmy ever injected heroin? (yes/no)
Other binary outcomes reflect passing some threshold
  • Did Jenna make the Dean's list this semester?
Other binary outcomes may reflect some complex position on an ordered dimension
  • True or False: I am an outgoing person
  • True or False: I smoked marijuana last year

Slide 3: Dichotomized Data: A Bad Habit of Psychologists
Sometimes perfectly good quantitative data are made binary because it seems easier to talk about "High" vs. "Low"
  • The worst habit is the median split
Usually the High and Low groups are mixtures of the continua
Rarely is the median interpreted rationally
  • See references
Cohen, J. (1983). The cost of dichotomization. Applied Psychological Measurement, 7.
MacCallum, R.C., Zhang, S., Preacher, K.J., & Rucker, D.D. (2002). On the practice of dichotomization of quantitative variables. Psychological Methods, 7.

Slide 4: Correlations of Binary Data
Product moment correlations computed on binary data are called phi coefficients
Phi depends on the means of the two variables as well as their “strength of relationship”
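A minimal simulation sketch of this point, assuming the underlying variables are bivariate normal with correlation .66 (the value used in the lecture's example). The cutpoints, sample size, seed, and function names are illustrative assumptions, not taken from the slides. The underlying correlation is held fixed and only the cutpoints change, so any change in phi comes from the means of the binary variables.

```python
import math
import random


def pearson(x, y):
    """Product moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate_phi(rho, cut1, cut2, n=20_000, seed=1):
    """Phi coefficient after cutting two correlated standard normals.

    rho is the correlation of the underlying continuous variables;
    cut1 and cut2 are hypothetical cutpoints on each variable.
    """
    rng = random.Random(seed)
    x1, x2 = [], []
    for _ in range(n):
        z1 = rng.gauss(0.0, 1.0)
        z2 = rho * z1 + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, 1.0)
        x1.append(1 if z1 > cut1 else 0)
        x2.append(1 if z2 > cut2 else 0)
    # Phi is simply the product moment correlation of the 0/1 codes.
    return pearson(x1, x2)


if __name__ == "__main__":
    # Same underlying correlation, three different pairs of cutpoints.
    for cut1, cut2 in [(0.0, 0.0), (1.0, 1.0), (1.0, -1.0)]:
        phi = simulate_phi(0.66, cut1, cut2)
        print(f"cuts ({cut1:+.1f}, {cut2:+.1f}): phi = {phi:.2f}")
```

In runs of this sketch, phi should fall below the underlying correlation for every pair of cutpoints, and it should be smallest when the two variables are cut at opposite ends.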

Slide 5: Example: phi is .13, underlying r is .66

Slide 6: Factor Analysis of Phi Coefficients
Loadings tend to be low
In exploratory factor analysis, some factors emerge that cluster together variables that have the same proportion positive (mean values)
  • In educational psychology these are called "difficulty factors"
  • Considered to be an artifact of the cutpoint
Conventional psychometric wisdom says factor analysis of phi correlations is incorrect

Slide 7: Phi Factor Analysis as Incorrect
Mislevy (1986) summarized problems with the analysis of phi coefficients in an often-cited paper on factor analysis of categorical data:
  • Phi coefficients depend on the means of the X variables as well as their “strength of relationship”
  • The linear factor model is inherently misspecified
  • More appropriate models exist

Slide 8: The linear phi factor model is inherently misspecified
Suppose that the binary X variables are coded as (0, 1). Consider the linear factor model:
$X_j = \lambda_{1j} f_1 + \lambda_{2j} f_2 + e_j, \quad j = 1, 2, \ldots, q.$
Even if we assume that the model is meaningful for values between 0 and 1, there is no guarantee that the fitted values of $X_j$ will be in that interval.
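To make the range problem concrete, here is a small sketch under assumed values: a single standard normal factor, an arbitrary loading of 0.8, and a binary item produced by thresholding. For simplicity it treats the factor scores as known and fits the linear model by ordinary least squares. That is a simplification of a factor analysis, not the lecture's procedure, and the function name and defaults are hypothetical.

```python
import math
import random


def fitted_range(loading=0.8, threshold=0.0, n=5_000, seed=2):
    """Fit X = a + b*f by least squares to a 0/1 item.

    Returns the smallest and largest fitted values so they can be
    compared with the (0, 1) coding of X.
    """
    rng = random.Random(seed)
    f = [rng.gauss(0.0, 1.0) for _ in range(n)]
    resid_sd = math.sqrt(1.0 - loading ** 2)
    # Binary item: 1 when the underlying continuous score passes the threshold.
    x = [
        1 if loading * fi + resid_sd * rng.gauss(0.0, 1.0) > threshold else 0
        for fi in f
    ]
    mean_f, mean_x = sum(f) / n, sum(x) / n
    slope = sum((fi - mean_f) * (xi - mean_x) for fi, xi in zip(f, x)) / sum(
        (fi - mean_f) ** 2 for fi in f
    )
    intercept = mean_x - slope * mean_f
    fitted = [intercept + slope * fi for fi in f]
    return min(fitted), max(fitted)


if __name__ == "__main__":
    low, high = fitted_range()
    print(f"fitted values run from {low:.2f} to {high:.2f}")
```

Because the assumed factor is unbounded, cases far out in either tail receive fitted values below 0 or above 1, even though X itself only takes the values 0 and 1.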

Slide 9: Modern "appropriate" methods
Suppose X is a dichotomized variable
  • $X^*$ is the original continuous variable
$X_j = 1$ if $X_j^* > \tau_j$ and $X_j = 0$ otherwise
  • Tetrachoric correlations estimate the correlations among the $X^*$ variables rather than the dichotomized ones.
  • When the sample size is large, SEM software will compute the tetrachoric correlations, assuming that the underlying distribution is bivariate normal.
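One simple way to see what a tetrachoric correlation does is to work backwards from the proportions in a 2 × 2 table. The sketch below assumes bivariate normality, recovers the two thresholds from the marginal proportions, and then searches by bisection for the correlation that reproduces the proportion of cases scoring 1 on both variables. The function names, the Simpson's-rule integration, and the example thresholds are assumptions made for illustration; SEM packages use their own estimators.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()


def prob_both_above(tau1, tau2, rho, steps=2000):
    """P(X1* > tau1 and X2* > tau2) for standard bivariate normal X*.

    Integrates the density of X1* times P(X2* > tau2 | X1*) with
    Simpson's rule; the upper limit of 8 SDs is a practical cutoff.
    """
    upper = 8.0
    scale = math.sqrt(1.0 - rho * rho)
    step = (upper - tau1) / steps

    def integrand(x):
        return STD_NORMAL.pdf(x) * (1.0 - STD_NORMAL.cdf((tau2 - rho * x) / scale))

    total = integrand(tau1) + integrand(upper)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(tau1 + i * step)
    return total * step / 3.0


def tetrachoric(p1, p2, p11, tol=1e-6):
    """Correlation of the X* variables implied by a 2 x 2 table.

    p1 = P(X1 = 1), p2 = P(X2 = 1), p11 = P(X1 = 1 and X2 = 1).
    """
    tau1 = STD_NORMAL.inv_cdf(1.0 - p1)
    tau2 = STD_NORMAL.inv_cdf(1.0 - p2)
    low, high = -0.999, 0.999
    # The joint proportion increases with rho, so bisection is enough.
    while high - low > tol:
        mid = (low + high) / 2.0
        if prob_both_above(tau1, tau2, mid) < p11:
            low = mid
        else:
            high = mid
    return (low + high) / 2.0


if __name__ == "__main__":
    # Hypothetical population: rho = .66, cut high on X1* and low on X2*.
    rho, tau1, tau2 = 0.66, 1.0, -1.0
    p1 = 1.0 - STD_NORMAL.cdf(tau1)
    p2 = 1.0 - STD_NORMAL.cdf(tau2)
    p11 = prob_both_above(tau1, tau2, rho)
    print(f"recovered correlation: {tetrachoric(p1, p2, p11):.3f}")
```

With exact population proportions the search returns the underlying correlation. With sample proportions the estimate inherits their sampling error, which is larger when the cells of the table are sparse.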

Slide 10: Example: ρ = .66, phi = .13, tetrachoric = .875
[2 × 2 table of X1 (0, 1) by X2 (1, 0); cell entries not recovered]

Slide 11: Example of Factor Analysis
Use EQS to simulate a simple one-factor model
Check solution with SPSS
Dichotomize variables at two thresholds
Compute biased factor analysis
Compute analysis based on tetrachoric correlations. Note the standard errors!
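The first steps of this exercise can be imitated in a few lines, with the caveat that this is a Python stand-in and not the EQS/SPSS analysis from the lecture. The loading (0.7), the two thresholds (-1 and +1), the sample size, and all function names are assumptions chosen for illustration. The sketch generates four items from one factor, cuts two items low and two high, and prints the phi coefficients so that pairs with the same threshold can be compared with pairs that differ.

```python
import math
import random


def pearson(x, y):
    """Product moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate_items(n=20_000, loading=0.7, thresholds=(-1.0, -1.0, 1.0, 1.0), seed=3):
    """Binary items from a one-factor model with equal loadings."""
    rng = random.Random(seed)
    resid_sd = math.sqrt(1.0 - loading ** 2)
    items = [[] for _ in thresholds]
    for _ in range(n):
        factor = rng.gauss(0.0, 1.0)
        for j, tau in enumerate(thresholds):
            x_star = loading * factor + resid_sd * rng.gauss(0.0, 1.0)
            items[j].append(1 if x_star > tau else 0)
    return items


def phi_matrix(items):
    """Matrix of phi coefficients among the binary items."""
    q = len(items)
    return [
        [1.0 if i == j else pearson(items[i], items[j]) for j in range(q)]
        for i in range(q)
    ]


if __name__ == "__main__":
    for row in phi_matrix(simulate_items()):
        print("  ".join(f"{value:5.2f}" for value in row))
```

Every pair of items has the same underlying correlation here (0.7 × 0.7 = 0.49), so any block pattern in the phi matrix is produced by the cutpoints alone. That is the kind of structure a "difficulty factor" picks up.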

Slide 12: Possible Overstatement of Conventional Wisdom
In many substantive fields, binary data are included in factor analyses and measurement models
  • Inferences not necessarily wrong
Means of binary data may be similar
Binary outcomes conceived more as categorical events than measures of some underlying continuum

Slide 13: Model Specification: Always a problem?
$X_1 = \lambda_{11} f_1 + e_1$
$X_2 = \lambda_{12} f_1 + e_2$
$\vdots$
$X_q = \lambda_{1q} f_1 + e_q$
Whether the term $\lambda_{1j} f_1$ exceeds the interval (0, 1) depends on the distribution of f.
What do we know about the distribution of f?
  • ONLY WHAT WE ASSUME
Normal (Gibbons et al)
Continuous and unbounded (Mislevy)
Arbitrary (Bartholomew)
The distribution may be some other one that prevents out-of-range scores in the factor model

Slide 14: Generalization: Ordinal data, mixed data (binary, ordinal, quantitative)
When one variable is quantitative and the other is binary
  • Product moment correlation is called the point-biserial correlation
  • Analogue of the tetrachoric is simply the biserial correlation
When variables are ordinal
  • Product moment r computed on the ranks is the Spearman rank correlation
  • Inferred process correlation is the polychoric correlation
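For the mixed case, the relation between the two coefficients can be sketched as follows, assuming the binary variable comes from cutting a normal variable. The biserial estimate rescales the point-biserial correlation by sqrt(pq) divided by the normal ordinate at the cutpoint, which is the standard conversion. The simulated correlation (.50), the cutpoint, and the function names are illustrative assumptions.

```python
import math
import random
from statistics import NormalDist


def pearson(x, y):
    """Product moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def point_biserial_and_biserial(y, x):
    """Return (point-biserial, biserial) for quantitative y and 0/1 x."""
    r_pb = pearson(y, x)
    p = sum(x) / len(x)          # proportion scoring 1
    q = 1.0 - p
    cut = NormalDist().inv_cdf(q)        # implied cutpoint on the X* scale
    ordinate = NormalDist().pdf(cut)     # normal density at that cutpoint
    r_b = r_pb * math.sqrt(p * q) / ordinate
    return r_pb, r_b


def simulate(rho=0.5, cut=1.0, n=20_000, seed=4):
    """Quantitative y and a binary x obtained by cutting a correlated normal."""
    rng = random.Random(seed)
    y, x = [], []
    for _ in range(n):
        yi = rng.gauss(0.0, 1.0)
        x_star = rho * yi + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, 1.0)
        y.append(yi)
        x.append(1 if x_star > cut else 0)
    return y, x


if __name__ == "__main__":
    r_pb, r_b = point_biserial_and_biserial(*simulate())
    print(f"point-biserial = {r_pb:.2f}, biserial = {r_b:.2f}")
```

In this setup the point-biserial value understates the correlation of the underlying variables, while the biserial value should land near the simulated .50, provided the normality assumption holds.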

Slide 15: Tetrachoric and polychoric correlations require large samples (1000s) to estimate
For small n's the estimates can be unstable
Unstable estimates lead to covariance structures that have problems
  • Not positive definite
  • Cannot be inverted
  • Cannot be fit with SEM
Muthén's software Mplus has better estimators of the polychoric and tetrachoric values.
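Because each tetrachoric or polychoric value is estimated from its own pair of variables, the assembled matrix is not guaranteed to be one that any set of variables could actually have. The sketch below checks positive definiteness by attempting a Cholesky factorization. The 3 × 3 matrix is a made-up example of an inconsistent set of pairwise estimates, not output from the lecture, and the function name is hypothetical.

```python
import math


def is_positive_definite(matrix):
    """True if a symmetric matrix has a Cholesky factorization.

    The factorization breaks down (a non-positive pivot appears)
    exactly when the matrix is not positive definite.
    """
    n = len(matrix)
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            partial = sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                pivot = matrix[i][i] - partial
                if pivot <= 0.0:
                    return False
                lower[i][j] = math.sqrt(pivot)
            else:
                lower[i][j] = (matrix[i][j] - partial) / lower[j][j]
    return True


if __name__ == "__main__":
    # Hypothetical pairwise estimates: variables 2 and 3 both correlate .9
    # with variable 1, yet correlate -.2 with each other.
    unstable = [
        [1.0, 0.9, 0.9],
        [0.9, 1.0, -0.2],
        [0.9, -0.2, 1.0],
    ]
    print("positive definite:", is_positive_definite(unstable))
```

A check of this kind before fitting can explain why an SEM program refuses a matrix of pairwise estimates.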

Slide 16: Interpretation of SEM models based on categorical data
Latent variables represent processes inferred from RECONSTRUCTED quantitative variables
  • Think in terms of $X^*$ rather than X
  • Unit is the standard deviation of the implied continuum
Effects are often larger
Work on standard errors is still being done