CFA 1. 2 3 4 5 6 7 Rationale: Adjust, repair measurement model as needed without violating the integrity of the structural model test. Often we.

Slides:



Advertisements
Similar presentations
Making Inferences about Causality In general, children who watch violent television programs tend to behave more aggressively toward their peers and siblings.
Advertisements

Ch 8: Experimental Design Ch 9: Conducting Experiments
Writing up results from Structural Equation Models
Treatment of missing values
Random Assignment Experiments
Holland on Rubin’s Model Part II. Formalizing These Intuitions. In the 1920 ’ s and 30 ’ s Jerzy Neyman, a Polish statistician, developed a mathematical.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
The Basics of Experimentation I: Variables and Control
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Alvin Kwan Division of Information & Technology Studies
1 Language of Research Partially Adapted from: 1. The Research Methods Knowledge Base, William Trochim (2006). 2. Methods for Social Researchers in Developing.
PHSSR IG CyberSeminar Introductory Remarks Bryan Dowd Division of Health Policy and Management School of Public Health University of Minnesota.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
Who are the participants? Creating a Quality Sample 47:269: Research Methods I Dr. Leonard March 22, 2010.
Validity, Sampling & Experimental Control Psych 231: Research Methods in Psychology.
Concept of Measurement
Non-Experimental designs: Developmental designs & Small-N designs
Common Problems in Writing Statistical Plan of Clinical Trial Protocol Liying XU CCTER CUHK.
Non-Experimental designs: Developmental designs & Small-N designs
Chapter 2 – Tools of Positive Analysis
How to deal with missing data: INTRODUCTION
Personality, 9e Jerry M. Burger
TOOLS OF POSITIVE ANALYSIS
G Lect 31 G Lecture 3 SEM Model notation Review of mediation Estimating SEM models Moderation.
Survey Experiments. Defined Uses a survey question as its measurement device Manipulates the content, order, format, or other characteristics of the survey.
What is Descriptive Research Method also known as statistical research describes data and characteristics about the population or phenomenon the questions.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Chapter 2: The Research Enterprise in Psychology
HYPOTHESIS TESTING Dr. Aidah Abu Elsoud Alkaissi
Determining Sample Size
Chapter 2 The Research Enterprise in Psychology. n Basic assumption: events are governed by some lawful order  Goals: Measurement and description Understanding.
Chapter 1: Introduction to Statistics
EVAL 6970: Cost Analysis for Evaluation Dr. Chris L. S. Coryn Nick Saxton Fall 2014.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Chapter 1: Research Methods
G Lect 21 G Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.
Causal inferences During the last two lectures we have been discussing ways to make inferences about the causal relationships between variables. One of.
Correlational Research Chapter Fifteen Bring Schraw et al.
Between groups designs (2) – outline 1.Block randomization 2.Natural groups designs 3.Subject loss 4.Some unsatisfactory alternatives to true experiments.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
1 f02kitchenham5 Preliminary Guidelines for Empirical Research in Software Engineering Barbara A. Kitchenham etal IEEE TSE Aug 02.
G Lecture 11 G Session 12 Analyses with missing data What should be reported?  Hoyle and Panter  McDonald and Moon-Ho (2002)
Why is Research Important?. Basic Research Pure science or research Research for the sake of finding new information and expanding the knowledge base.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
Research Strategies. Why is Research Important? Answer in complete sentences in your bell work spiral. Discuss the consequences of good or poor research.
1 G Lect 13W Imputation (data augmentation) of missing data Multiple imputation Examples G Multiple Regression Week 13 (Wednesday)
Question paper 1997.
1.) *Experiment* 2.) Quasi-Experiment 3.) Correlation 4.) Naturalistic Observation 5.) Case Study 6.) Survey Research.
1 G Lect 13M Why might data be missing in psychological studies? Missing data patterns Overview of statistical approaches Example G Multiple.
Correlation & Regression Analysis
Outline of Today’s Discussion 1.The Chi-Square Test of Independence 2.The Chi-Square Test of Goodness of Fit.
Tutorial I: Missing Value Analysis
Causal inferences This week we have been discussing ways to make inferences about the causal relationships between variables. One of the strongest ways.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
Mediation: The Causal Inference Approach David A. Kenny.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
DATA STRUCTURES AND LONGITUDINAL DATA ANALYSIS Nidhi Kohli, Ph.D. Quantitative Methods in Education (QME) Department of Educational Psychology 1.
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
Methods of Presenting and Interpreting Information Class 9.
MODULE 2 Myers’ Exploring Psychology 5th Ed.
Explanation of slide: Logos, to show while the audience arrive.
G Structural Equation Models
Common Problems in Writing Statistical Plan of Clinical Trial Protocol
RESEARCH METHODS Lecture 33
Experimental Design: The Basic Building Blocks
Product moment correlation
RESEARCH METHODS Lecture 33
Presentation transcript:

CFA 1

2

3

4

5

6

7

Rationale: Adjust, repair measurement model as needed without violating the integrity of the structural model test. Often we have no particular theoretic interest in measurement, except as a means of testing theory at the construct level. Not without controversy, however. 8

Chi-square structural model minus chi-square measurement model with df(s)-df(m) degrees of freedom. 9

Reliability (Really validity?) (∑λ) 2 / (∑λ) 2 + ∑θ AVE (∑λ 2 ) / (∑λ 2 ) + ∑θ 10

Discriminant Validity AVE > φ 2 φ not equal to

12

13

14

15

16

17 Causal Inference Issues Causal inference is often illusive in social and behavioral sciences Prototypes of Causal Effects seem to implicate primary (single) causes. –billiard balls –bacteria or viruses In reality, effects usually have multiple causes –For distress Stressors Personal dispositions Familial factors Social environment Biological environment

18 Causal Inference, continued Effects of causes are not always constant social buffers developmental stages immune system interventions synergistic causal effects stochastic variation in causal factor strength stochastic measurement factors

19 David Hume's framework for Causality If E is said to be the effect of C, then –1) C and E must have temporal and spatial contiguity: ASSOCIATION –2) C must precede E temporally: DIRECTION –3) There must be CONSTANT CONJUNCTION: If C, then E for all situations

20 Although still influential, Hume's analysis is known to have limitations. Analysis of any cause C must be isolated from competing causes (ISOLATION) Constant conjunction is too restrictive: stochastic processes affect causal relations, and mechanisms may vary across situations. –Causal relations may be expressed in terms of expectations over stochastic variation

21 Formal causal analyses have led to important advances Robert Koch, the Nobel Prize winning bacteriologist, investigated bacteria as causes of disease using three principles: –The organism must be found in all cases of the disease in question. (association) –The organism must be isolated and grown in pure culture (isolation) –When inoculated with the isolated organism, susceptible subjects must reproduce the disease (direction and hedged constant conjunction)

22 Causal Process in Time In the behavioral, social, and biological sciences, the units of observation cannot be trusted to stay the same over time. For example, in Koch's inoculation test, how do we know that the subject had not been infected by chance? For studies of distress, we expect both stress and distress to change over time.

23 Statisticians developed the randomized experiment to address causal issues: Randomly assign subjects to one of two conditions, Treatment (T) or Control (C), Administer treatment and control procedures Measure outcome variable Y (assumed to reflect the process of interest) blind to treatment group Infer effect of treatment from difference in group means

24 Holland’s formal analysis of randomized experiments: Suppose Y(u) is a measurement on subject u that reflects the process that is supposed to be affected by treatment, T. If subject u is given treatment T, then Y T (u) is observed. If subject u is given a control treatment, C, then Y C (u) is observed. We would like to compare Y T (u) with Y C (u), but only one of these can be available as u is either in T or C. – Let the desired comparison be called D = Y T (u) - Y C (u). – Holland calls this the Effect of cause T Although D can not be observed, its average can be estimated by computing

25 Between-subject is substituted for within-subject information. Within subject analyses are intuitively appealing, but require strong assumptions about constancy over time. When D≠0, then ASSOCIATION is established. Randomization prior to treatment deals with the causal issue of DIRECTION. It also partially supports ISOLATION (double blind trials, manipulation checks help address other aspects of isolation). Randomization does not establish CONSTANT CONJUNCTION. The effect is only established for the specific experimental conditions used in the study.

26 Key Feature: Treatment is applied to subjects sampled into group T Holland argues that this manipulation is critical to guarantee DIRECTION, and ISOLATION. Holland and Rubin go on to assert that clear causal inference is only possible if manipulation is at least conceivable. They propose the motto, NO CAUSATION WITHOUT MANIPULATION

27 NO CAUSATION WITHOUT MANIPULATION This motto is not popular with sociologists and economists. It explicitly denies causal status to personal attributes, such as race, sex, age, nationality, and family history. Instead, it encourages the investigation of processes such as discrimination, physical changes corresponding to age, government policy, and biochemical consequences of genetic makeup.

28 NO CAUSATION WITHOUT MANIPULATION To illustrate, Holland would not say that my height causes me to hit my head going into my suburban cellar, as my height cannot be manipulated. My failure to duck, and the dangerous obstruction could be shown to be causally related to my bumped head.

29 Structural Equation Models Researchers of topics such as stress, discrimination, poverty, coping and so on cannot easily design randomized experiments Structural Equation Models (SEM) are often presented as a major tool for establishing causes.

30 SEM and ISOLATION, ASSOCIATION, and DIRECTION Consider a simple SEM model: –Y = b1 X + e For every unit change in X, Y is expected to change by b1 units. This equation implies clear association of Y and X, and it makes the assumed direction underlying the association unambiguous. For the equation to be meaningful in terms of causation, we must also assume that alternative causes of Y are accounted by the independent stochastic term, e. Bollen calls the requirement that e be uncorrelated with X, the pseudo-isolation condition.

31 Analysis of Randomized Experiment through SEM Y = b0 + b1 X + e Let X take one of two values representing whether a subject received the treatment (X=1) or the control placebo(X=0). b1 estimates D. Because the assignment is randomized, X is expected to be uncorrelated with residual causes of Y. –Randomization justifies the pseudo-isolation condition. The randomized experiment also reminds us that between subject comparisons can be informative about average within subject effects. We can contemplate what would have happened if a given subject had been assigned to a different group.

32 In non-experimental studies, Isolation is difficult to establish We need to specify EVERY causal factor that is correlated with X, the causal variable of interest. X W2W2 W3W3 W4W4 Y e

33 The effects of model misspecification Suppose some W 2 is missing in the data set, even though we know it is correlated with both Y and X. If we know that W is a causal factor for both X and Y, then we would portray the model as on the right: If we consider the misspecified model, in which W 2 is missing, we can see that the estimated effect of X will include the indirect effect of W 2 on Y. The causal impact of X will be overestimated in the misspecified model.

34 Missing Data Mechanisms Terms suggested by Rubin – Rubin (1976), Little & Rubin (1987) MISSING COMPLETELY AT RANDOM (MCAR) – Which data point is missing cannot be predicted by any variable, measured or unmeasured. Prob(M|Y)=Prob(M) – The missing data pattern is ignorable. Analyzing available complete data is just fine.

35 Missing Data Mechanisms MISSING AT RANDOM (MAR) – Which data point is missing is systematically related to subject characteristics, but these are all measured Conditional on observed variables, missingness is random Prob(M|Y)=Prob(M|Y observed ) – E.g. Lower educated respondents might not answer a certain question. – Missingness can be treated as ignorable

36 Missing Data Mechanisms NOT MISSING AT RANDOM (NMAR) – Data are missing because of process related to value that is unavailable Someone was too depressed to come report about depression Abused woman is not allowed to meet interviewer – Missing data pattern is not ignorable. – Whether missing data are MAR or NMAR can not usually be established empirically.

37 Approaches to Missing Data Listwise deletion – If a person is missing on any analysis variable, he is dropped from the analysis. Pairwise deletion – Correlations/Covariances are computed using all available pairs of data. Imputation of missing data values. Model-based use of complete data – E-M (estimation-maximization approach) SEM-based FIML

38 EM and FIML Use available data to infer sample moment matrix. Uses information from assumed multivariate distribution Patterns of associations can be structured or unstructured. Now implemented in AMOS, EQS, Mplus

39 Example of CFA with Means Model

40 Multiple Imputation Substitute expected values plus noise for missing values. Repeat >5 times. Combine estimates and standard errors using formulas described by Rubin (1987). See also Schafer & Grahm (2002) Missing data: Our view of the state of the art. Psychological Methods, 7:

41

42

43

44 Communicating SEM Results Keeping up with the expert recommendations – Psychological Methods – Specialty journals Structural Equation Models Multivariate Behavioral Research Applied Psychological Measurement Psychometrika Two kinds of audiences – Researchers interested in the substance of the empirical contribution – Experts in SEM

45 Talking Points of Hoyle&Panter, McDonald&Ho Model specification – Theoretical justification – Identifiability Measurement Model Structural Model Model estimation – Characteristics of data Distribution form Sample size Missing data

46 Talking Points of Hoyle&Panter, McDonald&Ho Model estimation – Estimation method: ML, GLS, ULS, ADF – Goodness of estimates and standard errors Model Selection and Fit Statistics Alternative and Equivalent Models Reporting Results – Path diagrams – Tabular information – Use software conventions?