Stat 1301 More on Regression. Outline of Lecture 1. Regression Effect and Regression Fallacy 2. Regression Line as Least Squares Line 3. Extrapolation.

Slides:



Advertisements
Similar presentations
Statistics lecture 4 Relationships Between Measurement Variables.
Advertisements

Statistical Analysis Regression & Correlation Psyc 250 Winter, 2013.
Stat 13 lecture 23 correlation and regression A cartoon from handout The taller the father, the taller the son Tall father’s son is taller than short farther’s.
Normal distribution. An example from class HEIGHTS OF MOTHERS CLASS LIMITS(in.)FREQUENCY
Chapter 8 Linear Regression © 2010 Pearson Education 1.
Math 3680 Lecture #19 Correlation and Regression.
Correlations and scatterplots -- Optical illusion ? -- Finding the marginal distributions in the scatterplots (shoe size vs. hours of TV) Regressions --
MA-250 Probability and Statistics
Measures of Central Tendency
Optical illusion ? Correlation ( r or R or  ) -- One-number summary of the strength of a relationship -- How to recognize -- How to compute Regressions.
Linear regression (continued) In a test-retest situation, almost always the bottom group on the 1 st test will on average show some improvement on the.
5-1 bivar. Unit 5 Correlation and Regression: Examining and Modeling Relationships Between Variables Chapters Outline: Two variables Scatter Diagrams.
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
LINEAR REGRESSIONS: Cricket example About lines Line as a model:
Linear Regression MARE 250 Dr. Jason Turner.
Crash Course in Correlation and Regression MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central.
LINEAR REGRESSIONS: About lines Line as a model: Understanding the slope Predicted values Residuals How to pick a line? Least squares criterion “Point.
Forecasting Outside the Range of the Explanatory Variable: Chapter
Chapters 10 and 11: Using Regression to Predict Math 1680.
Inference for Linear Regression Conditions for Regression Inference: Suppose we have n observations on an explanatory variable x and a response variable.
Please turn off cell phones, pagers, etc. The lecture will begin shortly. There will be a quiz at the end of today’s lecture.
And the Rule THE NORMAL DISTRIBUTION. SKEWED DISTRIBUTIONS & OUTLIERS.
Review Multiple Choice Chapters 1-10
Correlation and Regression PS397 Testing and Measurement January 16, 2007 Thanh-Thanh Tieu.
Applied Quantitative Analysis and Practices LECTURE#22 By Dr. Osman Sadiq Paracha.
STAT 1301 Chapter 8 Scatter Plots, Correlation. For Regression Unit You Should Know n How to plot points n Equation of a line Y = mX + b m = slope b =
Simple Linear Regression. Deterministic Relationship If the value of y (dependent) is completely determined by the value of x (Independent variable) (Like.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
3-1 Stats Unit 3 Summary Statistics (Descriptive Statistics) FPP Chapter 4 For one variable - - Center of distribution "central value", "typical value"
Hotness Activity. Descriptives! Yay! Inferentials Basic info about sample “Simple” statistics.
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
Competition to get into a Post- Secondary institution has greatly increased and the change in the school curriculum has, and will continue to, affect.
STA291 Statistical Methods Lecture LINEar Association o r measures “closeness” of data to the “best” line. What line is that? And best in what terms.
MARE 250 Dr. Jason Turner Linear Regression. Linear regression investigates and models the linear relationship between a response (Y) and predictor(s)
Introduction to Regression (Dr. Monticino). Assignment Sheet Math 1680  Read Chapter 9 and 10  Assignment # 7 (Due March 2)  Chapter 9 Exercise Set.
9.2 Linear Regression Key Concepts: –Residuals –Least Squares Criterion –Regression Line –Using a Regression Equation to Make Predictions.
3.2 Least-Squares Regression Objectives SWBAT: INTERPRET the slope and y intercept of a least-squares regression line. USE the least-squares regression.
SPSS Problem and slides Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in.
SPSS Homework Practice The Neuroticism Measure = S = 6.24 n = 54 How many people likely have a neuroticism score between 29 and 34?
Chapter 8 (3-4), 9 More about Correlation. Today’s Lecture l SD Line l Calculating r l correlation vs causation.
The Line of Best Fit CHAPTER 2 LESSON 3  Observed Values- Data collected from sources such as experiments or surveys  Predicted (Expected) Values-
Chapter 4 Measures of Spread. RMS RMS size of a list: (S) (S) square values in list (M) (M) sum squared values and divide by total # of values in list.
SPSS Homework Practice The Neuroticism Measure = S = 6.24 n = 54 How many people likely have a neuroticism score between 29 and 34?
The simple linear regression model and parameter estimation
Statistical analysis.
CHAPTER 3 Describing Relationships
Simple Linear Regression
CHAPTER 12 More About Regression
Statistics 200 Lecture #5 Tuesday, September 6, 2016
Statistical analysis.
Regression Chapter 6 I Introduction to Regression
3.2 (part 1)
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2017 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Which of the following would be necessary to establish a cause-and- effect relationship between two variables? Strong association between the variables.
Regression Fallacy.
Warm Up 1) Find the mean & median for this data set: 7, 6, 5, 8, 10, 20, 4, 3 2) Which measure of center, the mean or median should be used to describe.
Do Now What is the predicted English score when the math score is 10?
Modeling a Linear Relationship
Practice The Neuroticism Measure = S = 6.24 n = 54
Multiple Linear Regression
Lecturer’s desk Projection Booth Screen Screen Harvill 150 renumbered
What is regression?.
Lecture 8: Chapter 4, Section 4 Quantitative Variables (Normal)
Lesson 2.2 Linear Regression.
Modeling a Linear Relationship
Standard Deviation Mean - the average of a set of data
Standard Deviation.
The Mean Variance Standard Deviation and Z-Scores
Presentation transcript:

Stat 1301 More on Regression

Outline of Lecture 1. Regression Effect and Regression Fallacy 2. Regression Line as Least Squares Line 3. Extrapolation 4. Multiple Regression

1. Regression Effect and Regression Fallacy

Hypothetical Grades for the First 2 Tests in a Class of STAT 1301 Hypothetical Grades for the First 2 Tests in a Class of STAT 1301 AVG x = 75SD x = 10 (Test 1) AVG y = 75SD y = 10 (Test 2) r = 0.7 Test - Retest Situation

Predict the score on Test 2 for a student whose Test 1 score was... Predict the score on Test 2 for a student whose Test 1 score was... (a) 95 (a) 95 (b) 60 Regression Line: Y =.7X ^

l Test-retest situation: - Bottom group on Test 1 does better on Test 2 - Top group on Test 1 falls back on Test 2 The Regression Fallacy l attributing the regression effect to something besides natural spread around the line. The Regression Effect

Regression Effect - Explanation Students scoring 95 on Test 1 3 categories (a) Students who will average 95 for the course (a) Students who will average 95 for the course (b) Great students having a bad day (c) “Pretty good” students having a good day - There are more students in category (c) than in (b) - Thus, we expect the “average” performance for those who scored 95 on Test 1 to drop

Regression Effect - Examples 4-yr-olds with IQ’s of 120 typically have adult IQ’s around yr-olds with IQ’s of 70 typically have adult IQ’s around 85. Of major league baseball teams with winning records, typically 2/3 win fewer games the next year.

Note: n The regression effect does not explain a change in averages n If r > 0:  if X is above AVGx, then the predicted Y must be above AVGy  if X is below AVGx, then the predicted Y must be below AVGy  if X is below AVGx, then the predicted Y must be below AVGy

2. Regression Line as Least Squares Line

What line is “closest” to the points ?

The regression line has smallest RMS size of deviations from points to the line.

The regression line is also called the least squares line.

3. Extrapolation l Predicting beyond the range of predictor variables

3. Extrapolation l Predicting beyond the range of predictor variables 6 NOT a good idea

4. Multiple Regression Using more than one independent variable to predict dependent variable. Using more than one independent variable to predict dependent variable.Example: PredictY = son’s height UsingX 1 = father’s height X 2 = mother’s height

4. Multiple Regression Using more than one independent variable to predict dependent variable. Using more than one independent variable to predict dependent variable.Example: PredictY = son’s height UsingX 1 = father’s height X 2 = mother’s height Equation:Y = m 1 X 1 + m 2 X 2 + b