Lecture 18 Simple Linear Regression (Chapters 18.1- 18.5)

Slides:



Advertisements
Similar presentations
MATH 2400 Chapter 5 Notes. Regression Line Uses data to create a linear equation in the form y = ax + b where “a” is the slope of the line (unit rate.
Advertisements

Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Analysis of Variance The contents in this chapter are from Chapter 15 and Chapter 16 of the textbook. One-Way Analysis of Variance Multiple Comparisons.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Lecture 15 – Tues., Oct. 28 Review example of one-way layout Simple Linear Regression: –Simple Linear Regression Model, 7.2 –Least Squares Regression Estimation,
Class 6: Tuesday, Sep. 28 Section 2.4. Checking the assumptions of the simple linear regression model: –Residual plots –Normal quantile plots Outliers.
Statistics 350 Lecture 1. Today Course outline Stuff Section
Lecture 19 Transformations, Predictions after Transformations Other diagnostic tools: Residual plot for nonconstant variance, histogram to check normality.
Stat Notes 4 Chapter 3.5 Chapter 3.7.
Lecture 19 Simple linear regression (Review, 18.5, 18.8)
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
Correlation and Regression Analysis
1 1 Slide Simple Linear Regression Chapter 14 BA 303 – Spring 2011.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
C HAPTER 4: M ORE ON T WO V ARIABLE D ATA Sec. 4.2 – Cautions about Correlation and Regression.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 Chapter 10, Part 2 Linear Regression. 2 Last Time: A scatterplot gives a picture of the relationship between two quantitative variables. One variable.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Multivariate Analysis. One-way ANOVA Tests the difference in the means of 2 or more nominal groups Tests the difference in the means of 2 or more nominal.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
MBP1010H – Lecture 4: March 26, Multiple regression 2.Survival analysis Reading: Introduction to the Practice of Statistics: Chapters 2, 10 and 11.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
Chapter 14 Inference for Regression © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Section 4.2: Least-Squares Regression Goal: Fit a straight line to a set of points as a way to describe the relationship between the X and Y variables.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
AP STATISTICS LESSON 4 – 2 ( DAY 1 ) Cautions About Correlation and Regression.
3.3 Correlation: The Strength of a Linear Trend Estimating the Correlation Measure strength of a linear trend using: r (between -1 to 1) Positive, Negative.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Business Statistics for Managerial Decision Making
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Inference for Regression
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
The Question of Causation 4.2:Establishing Causation AP Statistics.
AP Statistics. Issues Interpreting Correlation and Regression  Limitations for r, r 2, and LSRL :  Can only be used to describe linear relationships.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Cautions About Correlation and Regression
Daniela Stan Raicu School of CTI, DePaul University
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
CHAPTER 29: Multiple Regression*
EQ: How well does the line fit the data?
Unit 3 – Linear regression
CHAPTER 3 Describing Relationships
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Least-Squares Regression
The Question of Causation
Daniela Stan Raicu School of CTI, DePaul University
Least-Squares Regression
Least-Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
3.2 – Least Squares Regression
CHAPTER 3 Describing Relationships
3.3 Cautions Correlation and Regression Wisdom Correlation and regression describe ONLY LINEAR relationships Extrapolations (using data to.
CHAPTER 3 Describing Relationships
Chapter 14 Multiple Regression
CHAPTER 3 Describing Relationships
Presentation transcript:

Lecture 18 Simple Linear Regression (Chapters )

Interaction plots in ANOVA It is a good idea to always look at the interaction plots when doing a two-way ANOVA, regardless of whether or not the test for interactions is significant. Interaction plots display the basic results of study. If there really are no interactions, then the interaction plots will consist of parallel lines.

Regression Analysis The goal: Estimate E(Y|X) = conditional mean of Y given X based on a sample. Simple Linear Regression: Assumes E(Y|X) is a straight line in X.

Uses of Regression Analysis Descriptive. Describe the association between y and x in the population observed. Passive prediction. Predict y based on x where you do not plan to manipulate x, e.g., predict today’s stock price based on yesterday’s stock price. Control. Predict what y will be if you change x, e.g., predict what your earnings will be if you obtain different levels of education.

Lurking Variables A lurking variable is a variable that has an important effect on the relationship among the variables in a study but is not included among the variables studied. Examples: –Y=Salaries of Presbyterian Ministers over time, X=Price of rum in Havana over time, Lurking Variable = Inflation rate over time. –Y=Pellagra rate in village, X=Amount of flies in village, Lurking Variable = Amount of corn in diet.

Pitfalls in Regression Analysis (1) Descriptive: If using simple linear regression, need to make sure E(Y|X) is actually approximately a straight line. (2) Passive Prediction: Need to beware of pitfall for (1) plus extrapolation and lurking variables Control: Need to beware of pitfalls for (1) and (2) plus extra caution about lurking variables. Requires a cause-and-effect relationship. Best found through a controlled experiment.

Example of Pitfall A researcher measures the number of television sets per person X and the average life expectancy Y for the world’s nations. The regression line has a positive slope – nations with many TV sets have higher life expectancies. Could we lengthen the lives of people in Rwanda by shipping them TV sets?

Residual Plots Against Time Many lurking variables change systematically over time. Useful method for detecting lurking variables: Plot residuals against time order of observation is available. If a systematic pattern is found, an understanding of the background of the data might allow you to guess what the lurking variables are. Another useful residual plot: Plot residuals vs. location of observations.

Residual Plot vs. Time Example Goal: Predict elementary mathematics enrollment (X) at college based on number of freshman students (Y). Linear Fit Math enrollment = Freshman students

Residual Plot vs. Time Residual plot suggests that a change took place between 1994 and 1995 that caused higher proportion of students to take math courses. In fact, one of schools in university changed its program in 1995 to require entering students to take another math course. Conclusion: The math dept. shouldn’t use data from before 1995 for predicting future enrollment.