© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 1 More details can be found in the “Course Objectives and Content”

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

Unit 6a: Motivating Principal Components Analysis © Andrew Ho, Harvard Graduate School of EducationUnit 6a– Slide 1
Some (Simplified) Steps for Creating a Personality Questionnaire Generate an item pool Administer the items to a sample of people Assess the uni-dimensionality.
Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008.
Inference for Regression
STA305 week 31 Assessing Model Adequacy A number of assumptions were made about the model, and these need to be verified in order to use the model for.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
© Willett, Harvard University Graduate School of Education, 5/21/2015S052/I.3(b) – Slide 1 More details can be found in the “Course Objectives and Content”
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Econ 140 Lecture 131 Multiple Regression Models Lecture 13.
Topic 3: Regression.
Analysis of Variance & Multivariate Analysis of Variance
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Unit 5c: Adding Predictors to the Discrete Time Hazard Model © Andrew Ho, Harvard Graduate School of EducationUnit 5c– Slide 1
Structural Equation Modeling Intro to SEM Psy 524 Ainsworth.
Linear Regression/Correlation
S052/Shopping Presentation – Slide #1 © Willett, Harvard University Graduate School of Education S052: Applied Data Analysis Shopping Presentation: A.
Unit 5c: Adding Predictors to the Discrete Time Hazard Model © Andrew Ho, Harvard Graduate School of EducationUnit 5c– Slide 1
Unit 4c: Taxonomies of Logistic Regression Models © Andrew Ho, Harvard Graduate School of EducationUnit 4c – Slide 1
Unit 3b: From Fixed to Random Intercepts © Andrew Ho, Harvard Graduate School of EducationUnit 3b – Slide 1
Unit 4c: Taxonomies of Logistic Regression Models © Andrew Ho, Harvard Graduate School of EducationUnit 4c – Slide 1
Unit 4b: Fitting the Logistic Model to Data © Andrew Ho, Harvard Graduate School of EducationUnit 4b – Slide 1
MEASUREMENT MODELS. BASIC EQUATION x =  + e x = observed score  = true (latent) score: represents the score that would be obtained over many independent.
Inferential Statistics: SPSS
Overview of Meta-Analytic Data Analysis
Unit 6b: Principal Components Analysis © Andrew Ho, Harvard Graduate School of EducationUnit 6b – Slide 1
Andrew Ho Harvard Graduate School of Education Tuesday, January 22, 2013 S-052 Shopping – Applied Data Analysis.
Unit 5b: The Logistic Regression Approach to Life Table Analysis © Andrew Ho, Harvard Graduate School of EducationUnit 5b– Slide 1
Unit 1c: Detecting Influential Data Points and Assessing Their Impact © Andrew Ho, Harvard Graduate School of EducationUnit 1c – Slide 1
© Willett, Harvard University Graduate School of Education, 10/23/2015S052/I.1(b) – Slide 1 S052/§I.1(b): Applied Data Analysis Roadmap of the Course.
S052/Shopping Presentation – Slide #1 © Willett, Harvard University Graduate School of Education S052: Applied Data Analysis Shopping Presentation: Six.
Inferential Statistics
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
S052/Shopping Presentation – Slide #1 © Willett, Harvard University Graduate School of Education S052: Applied Data Analysis What Would You Like To Know.
© Willett, Harvard University Graduate School of Education, 11/13/2015S052/I.1(c) – Slide 1 More details can be found in the “Course Objectives and Content”
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 12 Making Sense of Advanced Statistical.
Analysis of Variance (ANOVA) Brian Healy, PhD BIO203.
INDE 6335 ENGINEERING ADMINISTRATION SURVEY DESIGN Dr. Christopher A. Chung Dept. of Industrial Engineering.
Unit 3a: Introducing the Multilevel Regression Model © Andrew Ho, Harvard Graduate School of EducationUnit 3a – Slide 1
One-Way Analysis of Covariance (ANCOVA)
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
© Willett, Harvard University Graduate School of Education, 12/16/2015S052/I.1(d) – Slide 1 More details can be found in the “Course Objectives and Content”
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
What Types Of Data Are Collected? What Kinds Of Question Can Be Asked Of Those Data?  Do people who say they study for more hours also think they’ll.
© Willett, Harvard University Graduate School of Education, 1/19/2016S052/I.2(a) – Slide 1 More details can be found in the “Course Objectives and Content”
Tutorial I: Missing Value Analysis
© Willett, Harvard University Graduate School of Education, 2/19/2016S052/II.1(c) – Slide 1 S052/II.1(c): Applied Data Analysis Roadmap of the Course.
Assumptions of Multiple Regression 1. Form of Relationship: –linear vs nonlinear –Main effects vs interaction effects 2. All relevant variables present.
© Willett, Harvard University Graduate School of Education, 3/1/2016S052/III.1(b) – Slide 1 S052/III.1(b): Applied Data Analysis Roadmap of the Course.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Unit 2a: Dealing “Empirically” with Nonlinear Relationships © Andrew Ho, Harvard Graduate School of EducationUnit 2a – Slide 1
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
© Willett, Harvard University Graduate School of Education, 6/13/2016S052/II.2(a3) – Slide 1 S052/II.2(a3): Applied Data Analysis Roadmap of the Course.
What Types Of Data Are Collected? What Kinds Of Question Can Be Asked Of Those Data?  Do people who say they study for more hours also think they’ll.
© Murnane & Willett, Harvard University Graduate School of Education, 6/28/2016S290/Class #09 – Slide 1 S052: Applied Data Analysis Everything in the.
Chapter 13 LOGISTIC REGRESSION. Set of independent variables Categorical outcome measure, generally dichotomous.
CJT 765: Structural Equation Modeling
Eastern Michigan University
Applied Statistical Analysis
12 Inferential Analysis.
Making Sense of Advanced Statistical Procedures in Research Articles
Linear Regression/Correlation
12 Inferential Analysis.
Checking the data and assumptions before the final analysis.
MGS 3100 Business Analysis Regression Feb 18, 2016
Structural Equation Modeling
REVIEW Course Review.
Presentation transcript:

© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 1 More details can be found in the “Course Objectives and Content” handout on the course webpage. Multiple Regression Analysis (MRA) Multiple Regression Analysis (MRA) Do your residuals meet the required assumptions? Test for residual normality Use influence statistics to detect atypical datapoints If your residuals are not independent, replace OLS by GLS regression analysis Use Individual growth modeling Specify a Multi-level Model If your sole predictor is continuous, MRA is identical to correlational analysis If your sole predictor is dichotomous, MRA is identical to a t-test If your several predictors are categorical, MRA is identical to ANOVA If time is a predictor, you need discrete- time survival analysis… If your outcome is categorical, you need to use… Binomial logistic regression analysis (dichotomous outcome) Multinomial logistic regression analysis (polytomous outcome) If you have more predictors than you can deal with, Create taxonomies of fitted models and compare them. Form composites of the indicators of any common construct. Conduct a Principal Components Analysis Use Cluster Analysis Use non-linear regression analysis. Transform the outcome or predictor If your outcome vs. predictor relationship is non-linear, How do you deal with missing data? S052/I.3(c): Applied Data Analysis Roadmap of the Course – What Is Today’s Topic Area? Today’s Topic Area

© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 2 S052/I.3(c): Multilevel Modeling & Internal Consistency Reliability Printed Syllabus – What Is Today’s Topic? Please check inter-connections among the Roadmap, the Daily Topic Area, the Printed Syllabus, and the content of today’s class when you pre-read the day’s materials. Using The Multilevel Model To Obtain Internal Consistency Estimates Of Test Reliability Today, in Section I.3(c) on Using The Multilevel Model To Obtain Internal Consistency Estimates Of Test Reliability, I will: Estimate the internal consistency of a six-item scale that measures teacher satisfaction, using Cronbach’s  statistic (Slides #3-#4) Show how to restack the dataset from its original multivariate format into the corresponind univariate or “person-item” format that is required for multilevel modeling (Slides #5-#6). Demonstrate estimation of internal consistency reliability of the same scale by fitting of an unconditional multilevel model with teacher satisfaction as the outcome, in the person-item dataset. Using The Multilevel Model To Obtain Internal Consistency Estimates Of Test Reliability Today, in Section I.3(c) on Using The Multilevel Model To Obtain Internal Consistency Estimates Of Test Reliability, I will: Estimate the internal consistency of a six-item scale that measures teacher satisfaction, using Cronbach’s  statistic (Slides #3-#4) Show how to restack the dataset from its original multivariate format into the corresponind univariate or “person-item” format that is required for multilevel modeling (Slides #5-#6). Demonstrate estimation of internal consistency reliability of the same scale by fitting of an unconditional multilevel model with teacher satisfaction as the outcome, in the person-item dataset.

© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 3 S052/I.3(c): Multilevel Modeling & Internal Consistency Reliability TSUCCESS data “Alpha” option in PROC CORR requests a “Cronbach’s Alpha” estimate of internal consistency reliability for the set of indicators listed in the VAR statement I’ve instituted “listwise deletion” to eliminate all cases that are missing data on one or more of the variables being composited. In this analysis, I assume that variables X1 through X6 are observed “indicators” of an underlying latent construct of teacher satisfaction and that we would like to know the reliability that we could anticipate for a scale that composited them together _N_ is a SAS system variable that is present in every SAS dataset to count the cases, beginning with the first.

© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 4 S052/I.3(c): Multilevel Modeling & Internal Consistency Reliability TSUCCESS data These are “deleted-indicator” estimates of Cronbach’s Alpha internal consistency reliability for a “temporarily reduced” composite that does not include the current indicator. Inspection of these estimates, indicator by indicator, helps identify whether an indicator should be retained in the composite: If an indicator truly belongs in the composite, then the “deleted-indicator” reliability estimate will be lower than the reliability of the full composite. If an indicator does not belong in the composite, then the “deleted-indicator” reliability estimate will be higher than the reliability of the full composite, These are “deleted-indicator” estimates of Cronbach’s Alpha internal consistency reliability for a “temporarily reduced” composite that does not include the current indicator. Inspection of these estimates, indicator by indicator, helps identify whether an indicator should be retained in the composite: If an indicator truly belongs in the composite, then the “deleted-indicator” reliability estimate will be lower than the reliability of the full composite. If an indicator does not belong in the composite, then the “deleted-indicator” reliability estimate will be higher than the reliability of the full composite, Here are estimates of Cronbach’s Alpha internal consistency reliability for the full composite formed from all six standardized indicators. Notice that both the “raw” and the “standardized” reliability estimates are identical in this case, because I chose to standardize the raw scores explicitly before entering them into the reliability estimation. If I had not standardized them explicitly in advance, then the the “raw” and “standardized” relaibility estimates would have differed by an amount that depended on how heterogeneous the sample variances of the raw indicators were. Here are estimates of Cronbach’s Alpha internal consistency reliability for the full composite formed from all six standardized indicators. Notice that both the “raw” and the “standardized” reliability estimates are identical in this case, because I chose to standardize the raw scores explicitly before entering them into the reliability estimation. If I had not standardized them explicitly in advance, then the the “raw” and “standardized” relaibility estimates would have differed by an amount that depended on how heterogeneous the sample variances of the raw indicators were.

© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 5 S052/I.3(c): Multilevel Modeling & Internal Consistency Reliability TSUCCESS data Teacher #1 Teacher #2 Teacher #3 Teacher #5 PROC PRINT reveals the current organization of the data, in which a teacher’s responses to each of the six indicators are stored in a “multivariate” or “horizontal” format. A teacher’s responses to the six indicators of satisfaction are listed within a single row, each response contained in a separate variable, X1 through X6. PROC PRINT reveals the current organization of the data, in which a teacher’s responses to each of the six indicators are stored in a “multivariate” or “horizontal” format. A teacher’s responses to the six indicators of satisfaction are listed within a single row, each response contained in a separate variable, X1 through X6.

© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 6 S052/I.3(c): Multilevel Modeling & Internal Consistency Reliability TSUCCESS data To estimate internal consistency reliability using multilevel modeling, we must reorganize the data from a “multivariate” into a “longitudinal” or “univariate” format, in which a teacher’s responses to each of the six indicators of satisfaction are now listed “vertically” beneath each other, in a single variable representing satisfaction, here called X. Teacher #1 Teacher #2 PROC PRINT reveals the impact on the dataset of this reorganization from a “multivariate” to a “univariate” format. Notice that there is now a single measure, X, of teacher satisfaction, containing six values for each teacher, but now distributed across six rows (rather than across six columns, as in the earlier multivariate format).

© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 7 S052/I.3(c): Multilevel Modeling & Internal Consistency Reliability TSUCCESS data In this new format, we have replicated measurement over teachers. So, we can use multilevel modeling to fit an unconditional model, with X as the outcome and teacher ID as the grouping variable. This lets us estimate the L1 and L2 variance components: Between-teacher variance component captures the true variability in satisfaction across teachers. If we assume that the measurement errors implicit in the different indicators tend to cancel each other out, then we can regard the estimated between-teacher variance component as a measure of the true variability in satisfaction in each indicator. Within-teacher variance component summarizes the average flutter in a single teacher’s ratings of their satisfaction across indicators. As we assume that each indicator is a measure of the same latent construct of satisfaction, then these differences must be due to measurement error. Thus, estimating the within-teacher variance component estimates the measurement error variance of each indicator. In this new format, we have replicated measurement over teachers. So, we can use multilevel modeling to fit an unconditional model, with X as the outcome and teacher ID as the grouping variable. This lets us estimate the L1 and L2 variance components: Between-teacher variance component captures the true variability in satisfaction across teachers. If we assume that the measurement errors implicit in the different indicators tend to cancel each other out, then we can regard the estimated between-teacher variance component as a measure of the true variability in satisfaction in each indicator. Within-teacher variance component summarizes the average flutter in a single teacher’s ratings of their satisfaction across indicators. As we assume that each indicator is a measure of the same latent construct of satisfaction, then these differences must be due to measurement error. Thus, estimating the within-teacher variance component estimates the measurement error variance of each indicator.

© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 8 S052/I.3(c): Multilevel Modeling & Internal Consistency Reliability TSUCCESS data There are six indicators, X 1 through X 6. Each measures the same true score, . Each has a unique error term,  1 through  6. So, the TOTAL observed score is just the sum of these: Population variance of TOTAL … So Cronbach’s  reliability can be written in terms of the L1 and L2 residual variances in an unconditional multilevel model for X :

© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 9 S052/I.3(c): Multilevel Modeling & Internal Consistency Reliability TSUCCESS data Estimated true variance in total teacher satisfaction:Estimated error variance in single indicator of teacher satisfaction: