Lecture 9 Sections 3.3 Objectives:


Lecture 9, Section 3.3. Objectives: Bivariate and Multivariate Data and Distributions; Fitting a Straight Line; Assessing the Fit; Residuals

Fitting a Line to Bivariate Data. We first draw a scatter plot of two quantitative variables to inspect their relationship visually. If the scatter plot shows a linear relationship, we measure its strength by computing a correlation coefficient. We may then want to fit a line to the scatter plot to summarize the overall pattern; this is done using regression analysis. Fitting a straight line to data (regression line): we want to draw a line 1) to summarize the relationship between x and y, 2) to describe how a response variable y changes as an explanatory variable x changes, and 3) to predict the value of y for a given value of x.

Least Squares Regression Line. The least-squares regression line is the unique line for which the sum of the squared vertical (y) distances between the data points and the line is as small as possible. The distances between the points and the line are squared so that all are positive values and can be properly added (Pythagoras).

Least Squares Regression Line. The least squares line is the regression line that minimizes the sum of the squared vertical distances between the observed points and the line. That is, we find a and b that minimize $Q(a, b) = \sum_{i=1}^{n} \bigl(y_i - (a + b x_i)\bigr)^2$. To find the minimum of this function, take the partial derivatives of Q with respect to a and b and set them equal to zero. Those two equations (called the normal equations) give the estimated y-intercept $a = \bar{y} - b\bar{x}$ and the estimated slope $b = \sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y}) \big/ \sum_{i=1}^{n} (x_i - \bar{x})^2$. The equation of the least squares line is often written as $\hat{y} = a + bx$, where the "hat" above y emphasizes that $\hat{y}$ is a prediction of y that results from substituting any particular x value into the equation.
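As a rough illustration of the normal-equation solution, the sketch below computes the slope as the sum of cross-deviations divided by the sum of squared x-deviations, and the intercept as the mean of y minus the slope times the mean of x. The function name `least_squares_line` and the five data points are made up for this illustration and are not from the lecture.

```python
def least_squares_line(xs, ys):
    """Return (a, b) for the line y-hat = a + b*x that minimizes
    the sum of squared vertical distances Q."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sum of cross-deviations and sum of squared x-deviations.
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    if s_xx == 0:
        raise ValueError("all x values are identical; the slope is undefined")
    b = s_xy / s_xx          # estimated slope
    a = y_bar - b * x_bar    # estimated y-intercept
    return a, b


# Hypothetical illustration data (not from the lecture).
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
a, b = least_squares_line(xs, ys)
print(f"y-hat = {a:.2f} + {b:.2f} x")  # prints: y-hat = 2.20 + 0.60 x
```

Here the means are 3 and 4, the sum of cross-deviations is 6, and the sum of squared x-deviations is 10, so the slope is 0.6 and the intercept is 4 - 0.6(3) = 2.2.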

Example. A sample of pizza restaurants located near college campuses. a. Find the least squares line. b. Predict the quarterly sales when the student populations are 18,000 and 30,000, respectively.

Extrapolation. Extrapolation is the use of a regression line for predictions outside the range of x values used to obtain the line. This is not recommended, as seen in the figure. [Figure: two plots, each with the axis label "Height in Inches".]
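One simple way to guard against accidental extrapolation is to record the range of x values used in the fit and flag any prediction requested outside it. The sketch below assumes a line that has already been fitted; the function name `predict`, the coefficients, and the range are hypothetical.

```python
def predict(a, b, x, x_min, x_max):
    """Return (y_hat, is_extrapolation) for the line y-hat = a + b*x.

    x_min and x_max are the smallest and largest x values that were
    used to obtain the line.
    """
    y_hat = a + b * x
    is_extrapolation = x < x_min or x > x_max
    return y_hat, is_extrapolation


# Hypothetical fitted line and fitting range (not from the lecture).
a, b = 2.2, 0.6
x_min, x_max = 1, 5

inside = predict(a, b, 3, x_min, x_max)    # x within the observed range
outside = predict(a, b, 12, x_min, x_max)  # x far beyond the observed range

for label, (y_hat, flagged) in [("x = 3", inside), ("x = 12", outside)]:
    note = "extrapolation, treat with caution" if flagged else "within range"
    print(f"{label}: y-hat = {y_hat:.1f} ({note})")
```

The arithmetic is the same in both cases; the flag only records that the second prediction rests on the unverified assumption that the linear pattern continues beyond the data.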

LS Regression Line: Regression and Correlation Coefficient. The coefficient of determination, denoted by $r^2$, is the proportion of variation in the observed y values that can be explained by the regression line: $r^2 = 1 - \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 \big/ \sum_{i=1}^{n} (y_i - \bar{y})^2$. Note that 1) $0 \le r^2 \le 1$; 2) the closer this proportion is to 100%, the more successful the regression line is in explaining the variation in y; 3) (correlation coefficient)$^2$ = coefficient of determination. Example. Revisit the pizza sales example. What percent of the variation in the quarterly sales is explained by the regression line?
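The relationship in note 3 can be checked numerically. The sketch below computes the coefficient of determination as one minus the ratio of residual variation to total variation, computes the correlation coefficient separately, and compares the two. The function name `r_squared_and_r` and the data are hypothetical.

```python
import math


def r_squared_and_r(xs, ys):
    """Return (r2, r): the coefficient of determination of the least
    squares line and the correlation coefficient of the data."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_yy = sum((y - y_bar) ** 2 for y in ys)   # total variation in y
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b = s_xy / s_xx
    a = y_bar - b * x_bar
    # Variation left unexplained by the line (sum of squared residuals).
    ss_resid = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    r2 = 1 - ss_resid / s_yy
    r = s_xy / math.sqrt(s_xx * s_yy)
    return r2, r


# Hypothetical illustration data (not from the lecture).
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r2, r = r_squared_and_r(xs, ys)
print(f"r^2 = {r2:.3f}, r = {r:.3f}, r squared again = {r * r:.3f}")
```

For these five points the squared residuals sum to 2.4 and the total variation is 6, so the line explains 1 - 2.4/6 = 60% of the variation in y.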

Residuals. Predicted value (or fitted value) = $\hat{y}_i = a + b x_i$. Residual = observed value - predicted value = $y_i - \hat{y}_i$.
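A minimal sketch of these two definitions follows, assuming the line is fitted by least squares. The helper name `fit_and_residuals` and the data are hypothetical. For a least squares line with an intercept the residuals sum to zero, which the sketch checks up to rounding error.

```python
def fit_and_residuals(xs, ys):
    """Fit y-hat = a + b*x by least squares and return
    (predicted values, residuals)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    b = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
         / sum((x - x_bar) ** 2 for x in xs))
    a = y_bar - b * x_bar
    predicted = [a + b * x for x in xs]
    # Residual = observed value - predicted value.
    residuals = [y - y_hat for y, y_hat in zip(ys, predicted)]
    return predicted, residuals


# Hypothetical illustration data (not from the lecture).
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
predicted, residuals = fit_and_residuals(xs, ys)
for x, y, y_hat, e in zip(xs, ys, predicted, residuals):
    print(f"x = {x}: observed {y}, predicted {y_hat:.1f}, residual {e:+.1f}")
print(f"sum of residuals = {sum(residuals):.6f}")
```

A positive residual means the observed point lies above the line, and a negative residual means it lies below.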

Residual Plot. A residual plot is a scatter plot of the regression residuals against the predicted values or the explanatory variable. Residual plots help us assess the fit of a regression line. If the residuals are scattered randomly around 0, chances are that your data fit a linear model, were normally distributed, and had no outliers.

Residual Plot. Residuals randomly scattered: good! A curved pattern means the relationship you are looking at is not linear. A change in variability across the plot is a warning sign: you need to find out why it occurs, and remember that predictions made in areas of larger variability will not be as good.

Example. Consider the following data on x = height (in) and y = average weight (lb) for American females aged 30-39 (taken from The World Almanac and Book of Facts).

x: 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72
y: 113 115 118 121 124 128 131 134 137 141 145 150 153 159 164

Draw the scatter plot and residual plots. Find the least squares regression line.
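The numerical part of this example can be worked as sketched below. Python and the variable names are choices made here, not part of the lecture, and the plots themselves are left to the reader's plotting tool of choice. The code applies the least squares formulas for the slope and intercept to the data above, then lists the residuals that a residual plot would display.

```python
heights = [58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72]
weights = [113, 115, 118, 121, 124, 128, 131, 134, 137, 141, 145, 150,
           153, 159, 164]

n = len(heights)
x_bar = sum(heights) / n
y_bar = sum(weights) / n

s_xx = sum((x - x_bar) ** 2 for x in heights)
s_yy = sum((y - y_bar) ** 2 for y in weights)
s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(heights, weights))

b = s_xy / s_xx           # estimated slope
a = y_bar - b * x_bar     # estimated y-intercept
r2 = s_xy ** 2 / (s_xx * s_yy)  # coefficient of determination

# Residual = observed weight - predicted weight.
residuals = [y - (a + b * x) for x, y in zip(heights, weights)]

print(f"y-hat = {a:.2f} + {b:.3f} x,  r^2 = {r2:.3f}")
for x, e in zip(heights, residuals):
    print(f"height {x}: residual {e:+.2f}")
```

With these data the fit comes out near y-hat = -98.23 + 3.596x with r^2 of about 0.99. Even so, the residuals are positive at both ends of the height range and negative around the middle, which is the kind of curved pattern a residual plot is meant to reveal.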