1 Artifact Corrected Correlations between theoretical text complexity and empirical text complexity PCRC – San Diego February 7-10, 2013 A.Jackson Stenner.

Slides:



Advertisements
Similar presentations
Chapter 16 Inferential Statistics
Advertisements

Individual Centered Growth PCRC San Diego, California February, 2012 A. Jackson Stenner Chairman & CEO, MetaMetrics Research Professor University of North.
Lesson 10: Linear Regression and Correlation
1 Hypothesis Testing William P. Wattles, Ph.D. Psychology 302.
Measurement Reliability and Validity
Statistical Issues in Research Planning and Evaluation
Objectives (BPS chapter 24)
Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules.
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
1. Estimation ESTIMATION.
Review: What influences confidence intervals?
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
Experimental Design, Statistical Analysis CSCI 4800/6800 University of Georgia Spring 2007 Eileen Kraemer.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Meta-analysis & psychotherapy outcome research
Meta-Analysis and Strategy Research Dan R. Dalton Kelley School of Business Indiana University.
Today Concepts underlying inferential statistics
Chapter 7 Correlational Research Gay, Mills, and Airasian
Introduction to Regression Analysis, Chapter 13,
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Statistical Analysis & Techniques Ali Alkhafaji & Brian Grey.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Measurement in Exercise and Sport Psychology Research EPHE 348.
1 Causal Rasch Models IOMW April 11-12, 2012 Vancouver, Canada A.Jackson Stenner Donald S. Burdick Mark H. Stone.
5.1 Basic Estimation Techniques  The relationships we theoretically develop in the text can be estimated statistically using regression analysis,  Regression.
Section #6 November 13 th 2009 Regression. First, Review Scatter Plots A scatter plot (x, y) x y A scatter plot is a graph of the ordered pairs (x, y)
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
Test item analysis: When are statistics a good thing? Andrew Martin Purdue Pesticide Programs.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
PARAMETRIC STATISTICAL INFERENCE
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Between groups designs (2) – outline 1.Block randomization 2.Natural groups designs 3.Subject loss 4.Some unsatisfactory alternatives to true experiments.
Introduction Osborn. Daubert is a benchmark!!!: Daubert (1993)- Judges are the “gatekeepers” of scientific evidence. Must determine if the science is.
Tests and Measurements Intersession 2006.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Appraisal and Its Application to Counseling COUN 550 Saint Joseph College For Class # 3 Copyright © 2005 by R. Halstead. All rights reserved.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Section 10.1 Confidence Intervals
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Schmidt & Hunter Approach to r Bare Bones. Statistical Artifacts Extraneous factors that influence observed effect Sampling error* Reliability Range restriction.
6.1 Inference for a Single Proportion  Statistical confidence  Confidence intervals  How confidence intervals behave.
META-ANALYSIS, RESEARCH SYNTHESES AND SYSTEMATIC REVIEWS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
 Descriptive Methods ◦ Observation ◦ Survey Research  Experimental Methods ◦ Independent Groups Designs ◦ Repeated Measures Designs ◦ Complex Designs.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
The Theory of Sampling and Measurement. Sampling First step in implementing any research design is to create a sample. First step in implementing any.
1 How to Model and Test for the Mechanisms that make Measurement Systems Tick IMEKO Jena, Germany Wednesday, August 31, 2011 A.Jackson Stenner Chairman.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Chapter 6: Analyzing and Interpreting Quantitative Data
Measurement MANA 4328 Dr. Jeanne Michalski
Reliability: Introduction. Reliability Session 1.Definitions & Basic Concepts of Reliability 2.Theoretical Approaches 3.Empirical Assessments of Reliability.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Reliability: Introduction. Reliability Session Definitions & Basic Concepts of Reliability Theoretical Approaches Empirical Assessments of Reliability.
Hypothesis Testing and Statistical Significance
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Chapter 9 Introduction to the t Statistic
Copyright © 2009 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 47 Critiquing Assessments.
Quantitative Data Analysis and Interpretation
1. Estimation ESTIMATION.
Understanding Results
Quantitative Methods PSY302 Quiz Chapter 9 Statistical Significance
A 95% confidence interval for the mean, μ, of a population is (13, 20)
Schmidt & Hunter Approach to r
How Confident Are You?.
Presentation transcript:

1 Artifact Corrected Correlations between theoretical text complexity and empirical text complexity PCRC – San Diego February 7-10, 2013 A.Jackson Stenner

2 A Story worth remembering!  Oasis 475  Common Core  Sarah Kershaw (FSU)  Art Graesser (University of Memphis) and Danielle McNamara (Arizona)  Bob Calfee (UC Riverside

3 Theoretical versus Empirical Text Complexity for 719 Articles* Reliability = SEM = 12.8L r = r” = R 2” = RMSE” = 89.6L * Inclusion criteria: 50 encounters and 1,000 items Mean Theoretical = 884.4L (356.2) Mean Empirical = 884.4L (355.0)

4 A Brief Digression: Correlation corrected for measurement error Works for effect sizes (e.g. Cohens d) also!

5 Two ends of the stick Test for a non-nil null.P values are suggestive of a signal. These data are inconsistent with the hypothesis that there is no relationship. Have I found anything? Test the hypothesis that the population correlation between theory and observation is r = 1.0 after correction for artifacts. Is there anything left to find?

6  Hunter and Schmidt (2004) identified six more artifacts relevant to our problem

7 Artifacts 1.Error of measurement (a 1 ) in the dependent variable: Study validity will be systematically lower than true validity to the extent that empirical text complexity is measured with random error. Measurement error is ubiquitous (a 1 = √0.997 = ). 2.Error of measurement in the independent variable: Theory validity will systematically understate the true validity of the attribute measured because the theory is imperfectly represented (a 2 = 1.0*). 3.Range variation (a 3 ) in the independent variable: Study validity will be systematically lower than true validity to the extent that the range of theoretical text complexities in the study is smaller than the population range.

8 Artifacts cont’d. 4. Range variation (a 4 ) in the dependent variable: Study validity will be systematically lower than true validity to the extent that the range of empirical text complexities is smaller than the population range. Correction for double range restriction (a 3 + a 4 =.9853). 5. Deviation from perfect construct validity (a 6 ) in the dependent variable: Task type used to measure empirical text complexity may have some specificity not shared by alternative task types, thus, resulting in slightly different text orderings depending upon which task type is used (a 5 = 1.0*). 6. Deviation from perfect construct validity (a 5 ) in the independent variable: Construct mis-specification due to less than perfect operationalization of the constructs in the specification equation (a 6 =√.9833=.9916).

9 Artifacts cont’d. 7. Linear Bias (a 7 ) is quite small for large sample studies. Bias = (2N-2)/(2N-1) =.9993 Transcription and reporting error (a 8 = 1.0*).

10 A = a 1 a 2 (a 3 + a 4 ) a 5 a 6 a 7 a 8 = (.9985) (1) (.9853) (.9916) (1) (.9993) (1) =.9748 Corrected = Observed = =.9926 CorrelationCorrelation.9748 A 95% confidence interval for the observed correlation (se =.00475).96287≤P≤ % confidence interval for the corrected correlation.988≤P≤.998

11 Conclusion The fact that correlations between theoretical estimates and empirical estimates are influenced by a dozen or more artifactual sources of variation poses threats to inference whenever raw correlations are interpreted. These problems are particularly troublesome when the population correlation is in fact r = 1.0. Because of attenuation due to artifacts the raw (uncorrected) correlation, say r =.90, invites speculation about what moderator variables may have been omitted in the study. And so begins a time consuming, potentially expensive, search for moderators that don’t exist. When researchers fail to produce moderator variables that account for the unexplained variance, which they must, then it is concluded that better theories or better operationalizations of key constructs are needed. When these repair strategies also fail the research is dead-ended and assigned to the dustbin of science. We conjecture that social and human science literatures are littered with correlational studies that conform to the above depressing scenario: “Failure to correct for these artifacts results in massive underestimation of the mean correlation. Failure to control for variation in these artifacts results in a large overstatement of the variance of population correlations and, thus, in a potential false assertion of moderator variables where there are none” (Hunter and Schmidt, 2004, 132.

12 A. Jackson Stenner Chairman & CEO, MetaMetrics University of North Carolina, Chapel Hill Contact Info: