Regression to the Mean.

Slides:



Advertisements
Similar presentations
Experimental Design and the struggle to control threats to validity.
Advertisements

Experimental and Quasi-Experimental Research
Regression Towards the Mean David A. Kenny University of Connecticut
Increasing your confidence that you really found what you think you found. Reliability and Validity.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
1 Research Design Procedures. 2 Research Design Forming your action plan Deciding on the Who and When Defining all concepts and terms.
Group Discussion Describe the fundamental flaw that prevents a nonequivalent group design from being a true experiment? (That is, why can’t these designs.
Estimating the Effects of Treatment on Outcomes with Confidence Sebastian Galiani Washington University in St. Louis.
Internal Threats to Validity
QUANTITATIVE DATA ANALYSIS
Matching level of measurement to statistical procedures
Statistical Analysis of the Nonequivalent Groups Design.
Edpsy 511 Homework 1: Due 2/6.
Correlations and T-tests
1 EXPLORING PSYCHOLOGY (7th Edition) David Myers PowerPoint Slides Aneeq Ahmad Henderson State University Worth Publishers, © 2008.
Chapters 10 and 11: Using Regression to Predict Math 1680.
The following lecture has been approved for University Undergraduate Students This lecture may contain information, ideas, concepts and discursive anecdotes.
Hypothesis Testing – Introduction
Applying Science Towards Understanding Behavior in Organizations Chapters 2 & 3.
Psychology’s Statistics Statistical Methods. Statistics  The overall purpose of statistics is to make to organize and make data more meaningful.  Ex.
Statistical Inference Two Statistical Tasks 1. Description 2. Inference.
Completing the ANOVA From the Summary Statistics.
Statistical Reasoning Chapter 1, Lecture 3 “To be an educated person today is to be able to apply simple statistical principles to everyday reasoning.
Descriptive Observational Survey Cross-Sectional (Polls)Longitudinal Panel Trend Cohort Casual Comparative Types of Research EDF 6481 Edwin Benitez Jr.
Analysis of Covariance adjusting for potential confounds.
Hotness Activity. Descriptives! Yay! Inferentials Basic info about sample “Simple” statistics.
Chapter 2 Means to an End: Computing and Understanding Averages Part II  igma Freud & Descriptive Statistics.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Multiple-Group Threats to Internal Validity. The Central Issue l When you move from single to multiple group research the big concern is whether the groups.
Probability. probability The chance or likelihood that an event will occur. - It is always a number between zero and one. - It is stated as a fraction,
3-1 MGMG 522 : Session #3 Hypothesis Testing (Ch. 5)
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Chapter 6 Research Validity. Research Validity: Truthfulness of inferences made from a research study.
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Experimental Research Methods in Language Learning Chapter 5 Validity in Experimental Research.
Correlation. Up Until Now T Tests, Anova: Categories Predicting a Continuous Dependent Variable Correlation: Very different way of thinking about variables.
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
Standard Deviation. Two classes took a recent quiz. There were 10 students in each class, and each class had an average score of 81.5.
Organizing and Analyzing Data. Types of statistical analysis DESCRIPTIVE STATISTICS: Organizes data measures of central tendency mean, median, mode measures.
Inferential Statistics. Population Curve Mean Mean Group of 30.
Multiple Regression Scott Hudson January 24, 2011.
Statistics. “Oh, people can come up with statistics to prove anything. 14% of people know that” Homer Simpson.
AP Review Exploring Data. Describing a Distribution Discuss center, shape, and spread in context. Center: Mean or Median Shape: Roughly Symmetrical, Right.
Stats Methods at IC Lecture 3: Regression.
GS/PPAL Section N Research Methods and Information Systems
Causality, Null Hypothesis Testing, and Bivariate Analysis
Business Decision Making
URBDP 591 A Lecture 10: Causality
Basic Estimation Techniques
The Non-Experimental and Quasi- Experimental Strategies:
Experimental Research Designs
Change Score Analysis versus ANCOVA in Pretest/Posttest Designs:
Chapter 8 Experimental Design The nature of an experimental design
Internal Validity and Confounding Variables
Hypothesis Testing – Introduction
Making Causal Inferences and Ruling out Rival Explanations
Introduction to Design
Setting significance levels at the correct level
Analysis based on normal distributions
Introduction Second report for TEGoVA ‘Assessing the Accuracy of Individual Property Values Estimated by Automated Valuation Models’ Objective.
Applied Statistical Analysis
Hypothesis Testing.
Correcting for Regression to the Mean in Behavior and Ecology
Sampling Distributions
Double Negatives and Absolute Value
Regression To The Mean 林 建 甫 C.F. Jeff Lin, MD. PhD.
Basic Practice of Statistics - 3rd Edition Introduction to Inference
Hypothesis Testing and Confidence Intervals (Part 2): Cohen’s d, Logic of Testing, and Confidence Intervals Lecture 9 Justin Kern April 9, 2018.
What’s the plan? First, we are going to look at the correlation between two variables: studying for calculus and the final percentage grade a student gets.
Presentation transcript:

Regression to the Mean

The Simple Explanation... When you select a group from the extreme end of a distribution...

The Simple Explanation... When you select a group from the extreme end of a distribution... Selected group’s mean Overall mean

The Simple Explanation... When you select a group from the extreme end of a distribution... Selected group’s mean Overall mean the group will do better on a subsequent measure.

The Simple Explanation... When you select a group from the extreme end of a distribution... Selected group’s mean Overall mean the group will do better on a subsequent measure. Where it would have been with no regression

The Simple Explanation... When you select a group from the extreme end of a distribution... Selected group’s mean Overall mean the group will do better on a subsequent measure Where its mean is

The Simple Explanation... When you select a group from the extreme end of a distribution... Selected group’s mean Overall mean the group will do better on a subsequent measure. The group mean on the first measure appears to “regress toward the mean” of the second measure. Overall mean

The Simple Explanation... When you select a group from the extreme end of a distribution... Selected group’s mean Overall mean the group will do better on a subsequent measure. The group mean on the first measure appears to “regress toward the mean” of the second measure. Overall mean Regression to the mean

If the first measure is a pretest and you select the low scorers... Example I: Pretest If the first measure is a pretest and you select the low scorers...

Example I: Pretest Posttest If the first measure is a pretest and you select the low scorers... ...and the second measure is a posttest Posttest

Example I: Pretest Posttest if the first measure is a pretest and you select the low scorers... ...and the second measure is a posttest, regression to the mean will make it appear as though the group gained from pre to post. Posttest Pseudo-effect

If the first measure is a pretest and you select the high scorers... Example II: Pretest If the first measure is a pretest and you select the high scorers...

Example II: Pretest Posttest if the first measure is a pretest and you select the high scorers... ...and the second measure is a posttest, Posttest

Example I: Pretest Posttest If the first measure is a pretest and you select the high scorers... ...and the second measure is a posttest, regression to the mean will make it appear as though the group lost from pre to post. Posttest Pseudo-effect

Some Facts This is purely a statistical phenomenon. This is a group phenomenon. Some individuals will move opposite to this group trend.

Why Does It Happen? For low scorers, you have taken the lowest x%. What are the chances they will be the lowest x% on the second measure? For high scorers, you have taken the highest x%. What are the chances they will be the highest x% on the second measure?

Why Does It Happen? Regression artifacts occur whenever you sample asymmetrically from a distribution. Regression artifacts occur with any two variables (not just pre and posttest) and even backwards in time!

The absolute amount of regression to the mean depends on two factors: What Does It Depend On? The absolute amount of regression to the mean depends on two factors: The degree of asymmetry (i.e., how far from the overall mean of the first measure the selected group's mean is) The correlation between the two measures

The percent of regression to the mean is A Simple Formula The percent of regression to the mean is

The percent of regression to the mean is: A Simple Formula The percent of regression to the mean is: Prm = 100(1 - r)

A Simple Formula The percent of regression to the mean is Prm = 100(1 - r) Where r is the correlation between the two measures.

A Simple Formula The percent of regression to the mean is: Prm = 100(1 - r) Where r is the correlation between the two measures. The formula tells the %, but the actual amount depends on how far the group mean is from the overall mean on the selection variable.

For Example: Prm = 100(1 - r) If r = 1, there is no (i.e., 0%) regression to the mean. If r = 0, there is 100% regression to the mean. If r = .2, there is 80% regression to the mean. If r = .5, there is 50% regression to the mean.

Assume a standardized test with a mean of 50. Example Assume a standardized test with a mean of 50. Pretest 50

Example Pretest Assume a standardized test with a mean of 50 You give your program to the lowest scorers and their mean is 30. 30 50

Example Pretest Posttest Assume a standardized test with a mean of 50. You give your program to the lowest scorers and their mean is 30. Assume that the correlation of pre-post is .5. 30 50 Posttest

Example Pretest Posttest Assume a standardized test with a mean of 50. You give your program to the lowest scorers and their mean is 30. Assume that the correlation of pre-post is .5. 30 50 The formula is… Posttest

Example Pretest Posttest Assume a standardized test with a mean of 50. You give your program to the lowest scorers and their mean is 30. Assume that the correlation of pre-post is .5. 30 50 The formula is Prm = 100(1 - r) = 100(1-.5) = 50% Posttest 50%

Example Pretest Posttest Assume a standardized test with a mean of 50. You give your program to the lowest scorers and their mean is 30. Assume that the correlation of pre-post is .5. 30 50 The formula is Prm = 100(1 - r) = 100(1-.5) = 50% Therefore the mean will regress up 50% (from 30 to 50), leaving a final mean of 40 and a 10 point pseudo-gain. 40 Posttest Pseudo-effect