Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.

Presentation transcript:

Chapter 3 Association: Contingency, Correlation, and Regression
Section 3.3 Predicting the Outcome of a Variable

Regression Line
The first step of a regression analysis is to identify the response and explanatory variables.
• We use y to denote the response variable.
• We use x to denote the explanatory variable.

Regression Line: An Equation for Predicting the Response Outcome
The regression line predicts the value for the response variable y as a straight-line function of the value x of the explanatory variable. Let $\hat{y}$ denote the predicted value of y. The equation for the regression line has the form $\hat{y} = a + bx$. In this formula, a denotes the y-intercept and b denotes the slope.

Example: Height Based on Human Remains
Regression equation: $\hat{y} = 61.4 + 2.4x$, where $\hat{y}$ is the predicted height and $x$ is the length of a femur (thighbone), both measured in centimeters. Use the regression equation to predict the height of a person whose femur length was 50 centimeters.
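The prediction asked for in this example amounts to plugging x = 50 into the regression equation. A minimal sketch follows; the function name `predict_height` is hypothetical, the slope 2.4 is the value given on the slope slide, and the intercept 61.4 is an assumption taken from the textbook version of this example.

```python
# Hypothetical helper: evaluates y_hat = a + b*x for the femur example.
# The slope 2.4 is stated in the slides; the intercept 61.4 is an assumption.
def predict_height(femur_cm, intercept=61.4, slope=2.4):
    """Return the predicted height (cm) for a given femur length (cm)."""
    return intercept + slope * femur_cm


predicted = predict_height(50)
print(f"Predicted height: {predicted:.1f} cm")
```

Under these assumed coefficients the predicted height for a 50 cm femur is 61.4 + 2.4(50) = 181.4 cm.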

Interpreting the y-Intercept
y-Intercept:
• The predicted value for y when $x = 0$.
• This fact helps in plotting the line.
• May not have any interpretative value if no observations had x values near 0.
It does not make sense for femur length to be 0 cm, so the y-intercept for the equation is not a relevant predicted height.

Interpreting the Slope
Slope: measures the change in the predicted value of the response variable (y) for a 1-unit increase in the explanatory variable (x).
Example: A 1 cm increase in femur length corresponds to a 2.4 cm increase in predicted height.

Slope Values: Positive, Negative, Equal to 0
Figure 3.12 Three Regression Lines Showing Positive Association (slope > 0), Negative Association (slope < 0), and No Association (slope = 0).
Question: Would you expect a positive or negative slope when y = annual income and x = number of years of education?

Residuals Measure the Size of Prediction Errors
Residuals measure the size of the prediction errors: the vertical distance between each point and the regression line.
• Each observation has a residual.
• Calculation for each residual: $y - \hat{y}$.
• A large residual indicates an unusual observation.
• The smaller the absolute value of a residual, the closer the predicted value is to the actual value, so the better the prediction.
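As a rough illustration of the residual calculation (observed y minus predicted y), the sketch below uses three made-up observations and an assumed line with intercept 61.4 and slope 2.4. The data and the function name `residuals` are hypothetical and serve only to show the arithmetic.

```python
def residuals(xs, ys, a, b):
    """Return y - y_hat for each observation, where y_hat = a + b*x."""
    return [y - (a + b * x) for x, y in zip(xs, ys)]


# Made-up observations (femur length in cm, height in cm), for illustration only.
femur = [42.0, 45.0, 50.0]
height = [160.0, 172.0, 181.0]

# Assumed regression line: intercept 61.4 and slope 2.4 are illustrative values.
res = residuals(femur, height, a=61.4, b=2.4)
for x, y, e in zip(femur, height, res):
    print(f"x={x:.0f}  y={y:.0f}  residual={e:+.1f}")
```

A positive residual means the observed value lies above the line; a negative residual means it lies below.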

The Method of Least Squares Yields the Regression Line
Residual sum of squares: $\sum (\text{residual})^2 = \sum (y - \hat{y})^2$
The least squares regression line is the line that minimizes the sum of the squared vertical distances between the points and their predictions, i.e., it minimizes the residual sum of squares.
Note: The sum of the residuals about the regression line will always be zero.
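Both claims on this slide can be checked numerically on a small made-up data set. The sketch below fits the line with the standard closed-form least squares solution, then compares its residual sum of squares with that of slightly perturbed lines. The data and the helper names `fit_least_squares` and `rss` are hypothetical.

```python
def fit_least_squares(xs, ys):
    """Closed-form least squares fit; returns (intercept a, slope b)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b = sxy / sxx
    a = y_bar - b * x_bar
    return a, b


def rss(xs, ys, a, b):
    """Residual sum of squares for the line y_hat = a + b*x."""
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))


# Made-up data, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]

a, b = fit_least_squares(xs, ys)
best = rss(xs, ys, a, b)

# RSS of lines whose intercept and/or slope is nudged away from the fit.
nearby = [
    rss(xs, ys, a + da, b + db)
    for da in (-0.1, 0.0, 0.1)
    for db in (-0.1, 0.0, 0.1)
    if (da, db) != (0.0, 0.0)
]
residual_sum = sum(y - (a + b * x) for x, y in zip(xs, ys))

print(f"a={a:.2f}, b={b:.2f}, RSS={best:.2f}")
print("every perturbed line has larger RSS:", all(v > best for v in nearby))
print("sum of residuals is (numerically) zero:", abs(residual_sum) < 1e-9)
```

Checking a grid of nearby lines does not prove optimality, but it shows the fitted line behaving as a minimum of the residual sum of squares.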

Regression Formulas for y-Intercept and Slope
Slope: $b = r \left( \frac{s_y}{s_x} \right)$
y-Intercept: $a = \bar{y} - b\bar{x}$
Notice that the slope b is directly related to the correlation r, and the y-intercept depends on the slope.
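The standard textbook formulas, slope equal to r times s_y / s_x and intercept equal to the mean of y minus b times the mean of x, can be sketched with the Python standard library as follows. The data are made up and the function names are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def correlation(xs, ys):
    """Pearson correlation r computed from deviations about the means."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def slope_intercept(xs, ys):
    """Return (a, b) using b = r * s_y / s_x and a = y_bar - b * x_bar."""
    r = correlation(xs, ys)
    b = r * stdev(ys) / stdev(xs)
    a = mean(ys) - b * mean(xs)
    return a, b


# Made-up data, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]

a, b = slope_intercept(xs, ys)
print(f"r = {correlation(xs, ys):.4f}")
print(f"y_hat = {a:.2f} + {b:.2f} x")
```

Because the standard deviations are always positive, the slope has the same sign as r.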

Calculating the Slope and y-Intercept for the Regression Line
We use the baseball data in Example 9 to illustrate the calculations. The regression line to predict team scoring from batting average is.

The Slope and the Correlation
Correlation:
• Describes the strength of the linear association between two variables.
• Does not change when the units of measurement change.
• Does not depend upon which variable is the response and which is the explanatory.

The Slope and the Correlation
Slope:
• Numerical value depends on the units used to measure the variables.
• Does not tell us whether the association is strong or weak.
• The two variables must be identified as response and explanatory variables.
• The regression equation can be used to predict values of the response variable for given values of the explanatory variable.
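The contrast between correlation and slope can be seen on a small made-up data set: converting x from centimeters to inches, or swapping the roles of x and y, leaves r unchanged but changes the slope. The data and the helper name `corr_and_slope` are hypothetical.

```python
from math import sqrt


def corr_and_slope(xs, ys):
    """Return (r, b): the correlation and the slope for predicting ys from xs."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy), sxy / sxx


# Made-up femur lengths (cm) and heights (cm), for illustration only.
xs_cm = [40.0, 43.0, 45.0, 48.0, 50.0]
ys = [158.0, 165.0, 170.0, 175.0, 182.0]
xs_in = [x / 2.54 for x in xs_cm]  # same femur lengths expressed in inches

r_cm, b_cm = corr_and_slope(xs_cm, ys)
r_in, b_in = corr_and_slope(xs_in, ys)
r_swap, b_swap = corr_and_slope(ys, xs_cm)  # roles of x and y exchanged

print(f"x in cm:     r = {r_cm:.4f}, slope = {b_cm:.3f}")
print(f"x in inches: r = {r_in:.4f}, slope = {b_in:.3f}")
print(f"x, y swapped: r = {r_swap:.4f}, slope = {b_swap:.3f}")
```

Measuring x in inches multiplies the slope by 2.54, since one inch is a larger step than one centimeter, while r stays the same in all three cases.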

The Squared Correlation
When a strong linear association exists, the regression equation predictions $\hat{y}$ tend to be much better than the predictions using only $\bar{y}$. We measure the proportional reduction in error and call it $r^2$. The typical way to interpret $r^2$ is as the proportion of the variation in the y-values that is accounted for by the linear relationship of y with x.

The Squared Correlation
$r^2$ measures the proportion of the variation in the y-values that is accounted for by the linear relationship of y with x. A correlation of 0.9 means that $r^2 = 0.81$, so 81% of the variation in the y-values can be explained by the explanatory variable, x.
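The proportional-reduction-in-error reading of the squared correlation can be checked numerically. The sketch below compares the squared error from predicting every y by the mean of y with the squared error from the least squares line, using made-up data and a hypothetical function name.

```python
from math import sqrt


def error_reduction_and_r(xs, ys):
    """Return (proportional reduction in error, correlation r)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    b = sxy / sxx
    a = y_bar - b * x_bar
    # Squared error when every y is predicted by y_bar alone.
    error_mean_only = syy
    # Squared error when y is predicted by the regression line.
    error_regression = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    reduction = (error_mean_only - error_regression) / error_mean_only
    return reduction, sxy / sqrt(sxx * syy)


# Made-up data, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]

reduction, r = error_reduction_and_r(xs, ys)
print(f"proportional reduction in error = {reduction:.3f}")
print(f"r squared                       = {r ** 2:.3f}")
```

For these data the two quantities agree at 0.6, meaning the line removes 60% of the squared prediction error relative to using the mean alone.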