® The Use of Confirmatory Factor Analysis to Support the Use of Subscores for a Large-Scale Student Outcomes Assessment Rochelle S. Michel, Ph.D. Evaluation.

Slides:



Advertisements
Similar presentations
Abstract Two studies examined the internal and external validity of three school climate scales on the School Climate Bullying Survey (SCBS), a self–report.
Advertisements

The Technology-Rich Outcomes-Focused Learning Environment Inventory (TROFLEI): A Cross-Cultural Validation Anita Welch, Claudette Peterson, Chris Ray,
® Introduction Low Back Pain and Physical Function Among Different Ethnicities Adelle A Safo, Sarah Holder DO, Sandra Burge PhD The University of Texas.
1]Knoll, N., Burkert, S., & Schwartzer, R. (2006). Reciprocal support provision: Personality as a moderator? European Journal of Personality, 20,
Confidential and Proprietary. Copyright © 2010 Educational Testing Service. All rights reserved. Catherine Trapani Educational Testing Service ECOLT: October.
Adding Value to the MSP Evaluations Norman Webb, Rob Meyer, and Paula White Wisconsin Center for Education Research University of Wisconsin-Madison 1025.
Chapter 4 Validity.
Developing and validating a stress appraisal measure for minority adolescents Journal of Adolescence 28 (2005) 547–557 Impact Factor: A.A. Rowley.
NAEP 2008 Trends in Academic Progress in Reading and Mathematics.
Work Session 1 Locating Baseline Data Office of Research, Evaluation and Policy Studies Ximena D. Burgin, Ed.D. November 30 th, 2010.
Multivariate Data Analysis Chapter 11 - Structural Equation Modeling.
© 2008 McGraw-Hill Higher Education. All rights reserved. CHAPTER 16 Classroom Assessment.
A Structural Equation Modeling Approach to Students’ Homework Assignment Web Sites Usage Emel DIKBAS TORUN Hacettepe University Eralp ALTUN Ege University.
Title I Needs Assessment and Program Evaluation
Jan Weiss, PT, DHS, CLT-LANA
Student Achievement and Predictors of Student Achievement in a State Level Agricultural Mechanics Career Development Event Edward Franklin Glen Miller.
The Learning Behaviors Scale
Development and Validation of Elder’s Satisfaction Questionnaire for patients Attending Dental Clinics By: Marjan Haghi PhD student in Gerontology University.
Measuring the Impact of Robotics and GIS/GPS on Youth STEM Attitudes Gwen Nugent, Bradley Barker, Michael Toland, Neal Grandgenett, Slava Adumchuk.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
A Reliability Generalization of the Life Satisfaction Index K. A. Wallace & J. C. Caruso University of Montana Presented at the Annual Meeting of The Gerontological.
Purpose The present study examined the psychometric properties of the SCARED in order to begin establishing an evidence base for using the SCARED in pediatric.
Ethnic Identity among Mexican American Adolescents: The Role of Maternal Cultural Values and Parenting Practices 1 Miriam M. Martinez, 1 Gustavo Carlo,
Factor validation of the Consideration of Future Consequences Scale: An Assessment and Review Tom R. EikebrokkEllen K. NyhusUniversity of Agder.
CALIFORNIA DEPARTMENT OF EDUCATION Jack O’Connell, State Superintendent of Public Instruction Results of the 2005 National Assessment of Educational Progress.
Information Systems Use Among Ohio Registered Nurses: Testing Validity and Reliability of Nursing Informatics Measurements Amany A. Abdrbo, RN, MSN, PhD.
By Cao Hao Thi - Fredric W. Swierczek
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
Assessing Mental Health Among Latino Consumers of Mental Health Services Susan V. Eisen, PhD, Mariana Gerena, PhD, Gayatri Ranganathan, MS, Pradipta Seal.
Measurement Issues by Ron D. Hays UCLA Division of General Internal Medicine & Health Services Research (June 30, 2008, 4:55-5:35pm) Diabetes & Obesity.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Physical Education Teacher Education Keven A. Prusak, PhD and William F. Christensen, PhD Keven A. Prusak, PhD and William F. Christensen, PhD Brigham.
CHI SQUARE TESTS.
Measurement Models: Identification and Estimation James G. Anderson, Ph.D. Purdue University.
METHODS Sample: The Institute for Survey Research of Temple University conducted face-to-face interviews for the 1995 National Alcohol Survey (NAS). The.
The Development and Validation of the Evaluation Involvement Scale for Use in Multi-site Evaluations Stacie A. ToalUniversity of Minnesota Why Validate.
College Comparisons. Mean Total Score by College (Possible Score Range 400 to 500) SSD = Total Scores for Colleges of Business, Education, Health and.
Applied Quantitative Analysis and Practices
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Steven W. Evans, Christine Brady, Lee Kern, Christiana Andrews and the CARS Research Team Measurement Development and Inclusion Criteria: Developing Meaningful.
Diversity Awareness Training Sanchez & Medkik Hypothesis Nature of quasi-experimental design Measures used & their validity Tests of Hypotheses Alternative.
1 Information Systems Use Among Ohio Registered Nurses: Testing Validity and Reliability of Nursing Informatics Measurements Amany A. Abdrbo, RN, MSN,
Applied Quantitative Analysis and Practices LECTURE#17 By Dr. Osman Sadiq Paracha.
How to Examine Your State's Family Outcomes Data: Asking and Answering Critical Questions Melissa Raspa (ECTA) Gary Harmon (NC) Alice Ridgway (CT) Lisa.
Dan Thompson Oklahoma State University Center for Health Science Evaluating Assessments: Utilizing ExamSoft’s item-analysis to better understand student.
Evaluating Multi-item Scales Ron D. Hays, Ph.D. UCLA Division of General Internal Medicine/Health Services Research HS237A 11/10/09, 2-4pm,
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Admission and Transfer Policy Review Task Force 1.
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
Terra Nova By Tammy Stegman, Robyn Ourada, Sandy Perry, & Kim Cotton.
Statistics & Evidence-Based Practice
Measuring Correlates of HIV Risk Avoidance in Sub-Saharan Preadolescents Comfort Enah, Ph.D. RN Wright State University College of Nursing and Health Ian.
Further Validation of the Personal Growth Initiative Scale – II: Gender Measurement Invariance Harmon, K. A., Shigemoto, Y., Borowa, D., Robitschek, C.,
Psychometric Properties of the Seeking of Noetic Goals Test Brandy J
Chapter 15 Confirmatory Factor Analysis
Study Limitations and Future Directions See Handout for References
Introduction Results Hypotheses Discussion Method
Carrie O’Reilly, Ph.D., M.S.N., RN Touro University Nevada
Examination of the Relationship Between Nutrition Media Literacy and Soft Drink Consumption Among Adolescents – Preliminary Findings Martin H. Evans*,
Dr. Chin and Dr. Nettelhorst Winter 2018
MIRACOSTA COLLEGE ISLOs
Validation of the Portuguese DSM-IV-MR-J
תוקף ומהימנות של ה- Dementia Quality of Life בארה"בMeasure
MIRACOSTA COLLEGE ISLOs
Meaningful Methods For Outcomes Assessment: A Case Study
Evaluating Multi-item Scales
Evaluating Multi-item Scales
UCLA Department of Medicine
Presentation transcript:

® The Use of Confirmatory Factor Analysis to Support the Use of Subscores for a Large-Scale Student Outcomes Assessment Rochelle S. Michel, Ph.D. Evaluation 2009 Annual Conference of the American Evaluation Association November 11-14, 2009

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 2 Introduction Institutions consistently request assessment information beyond the total score –Subscores –Item analysis –Diagnostic information –Comparative data

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 3 Introduction Standard A4 – Program Evaluation Standards (1994) –Calls for the use of valid information in the evaluation of programs, establishing the validity of inferences drawn from information obtained in relation to the evaluation questions (p.147)

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 4 Subscores “Easiest” information that can be provided Often used as the data by which to measure and document program effectiveness Research on issues related to subscore reporting –Haberman, 2005; Haberman, Sinharay, Puhan, 2007; High correlation between subscores and total score Additional information of subscores beyond total score

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 5 General Education Measure 108 multiple-choice items 4 skills (each measured by 27 items) –Reading –Written communication –Critical thinking –Mathematics 3 contexts (each measured by 18 items) –Humanities –Social Science –Natural Science

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 6 Research Questions How well does a four-factor model fit the data from the general education student outcomes measure? How well does a one-factor model fit the data from the general education student outcomes measure? What do the results of the CFA indicate about subscore usage for the measure under analysis?

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 7 Demographics Test takers (n=15,504) –Gender Females – 59% Males – 41% –Ethnicity White – 75% African American – 12% Hispanic – 4% Institutions (n=54)

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 8 Psychometric Properties Test reliability, 0.94 Skills –Reading,.85 –Critical Thinking,.80 –Writing,.77 –Math,.85 Contexts –Natural Sciences,.80 –Social Sciences,.77 –Humanities,.71

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 9 Correlations among skills SkillsTotalRDCTWM Total1.00 RD CT W M Note. RD=Reading, CT=Critical Thinking, W=Writing, M=Mathematics

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 10 Correlations among contexts ContextsTotalHumanitiesNatural Sciences Social Sciences Total1.00 Humanities Natural Sciences Social Sciences

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 11 Confirmatory Factor Analysis Analyses performed using LISREL 8.8 (Joreskog & Sorbom, 2006) Used the tetrachoric correlation matrix to create the covariance matrix to be analyzed Chi-square statistic –Traditional measure for evaluating overall model fit –Hypothesizes that the model fits the population data perfectly –Both one-factor and four-factor models had statistically significant chi-square values

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 12 Confirmatory Factor Analysis Summary of Fit Indices for the One-Factor and Four-Factor Models of a General Education Measure of Student Outcomes SampleRMSEAECVISRMRGFICFINNFI M M Note. M1=one-factor model; M4=four-factor model; RMSEA=root mean square error of approximation; ECVI=expected cross-validation index; SRMR=standardized root mean square residual; GFI=goodness-of-fit index; CFI=comparative fit index; NNFI=non-normed fit index.

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 13 Summary Results are consistent with previous exploratory factor analysis conducted –Conflicting results between various fit indices –Neither one-factor nor four-factor model provide an appropriate fit to the data

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 14 Implications Examination of alternative approaches to subscore reporting –Use of scale anchoring methodology/proficiency scaling Responsible use of current subscores –Total score may provide more information about test takers’ skills than the subscores –Care should be taken making decisions based solely on skill subscores

® Confidential and Proprietary. Copyright © 2009 Educational Testing Service. All rights reserved. 15 Thank you! Thank you to Michael Chajewski and Veleka Allen who performed some of the preliminary analyses on the measure. Contact: