Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester 2007-2008 Eng. Tamer Eshtawi First Semester 2007-2008.

Slides:



Advertisements
Similar presentations
Section 10-3 Regression.
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-4 Variation and Prediction Intervals.
Correlation and Regression
Chapter 4 The Relation between Two Variables
Definition  Regression Model  Regression Equation Y i =  0 +  1 X i ^ Given a collection of paired data, the regression equation algebraically describes.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Summarizing Bivariate Data Introduction to Linear Regression.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
CHAPTER 3 Describing Relationships
Simple Linear Regression and Correlation
Linear Regression/Correlation
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Correlation & Regression
Descriptive Methods in Regression and Correlation
Linear Regression.
Chapter 10 Correlation and Regression
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Relationship of two variables
Chapter 11 Simple Regression
Correlation.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Chapter 10 Correlation and Regression
Correlation & Regression
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12: Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Slide Slide 1 Warm Up Page 536; #16 and #18 For each number, answer the question in the book but also: 1)Prove whether or not there is a linear correlation.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
CHAPTER 3 Describing Relationships
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Describing the Relation between Two Variables 4.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Overview Overview 10-2 Correlation 10-3 Regression-3 Regression.
1 Objective Given two linearly correlated variables (x and y), find the linear function (equation) that best describes the trend. Section 10.3 Regression.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Lecture Slides Elementary Statistics Twelfth Edition
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHS 221 Biostatistics Dr. wajed Hatamleh
SIMPLE LINEAR REGRESSION MODEL
Lecture Slides Elementary Statistics Thirteenth Edition
Correlation and Regression
AP Stats: 3.3 Least-Squares Regression Line
Chapter 10 Correlation and Regression
Lecture Notes The Relation between Two Variables Q Q
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Lecture Slides Elementary Statistics Eleventh Edition
Lecture Slides Elementary Statistics Tenth Edition
Correlation and Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Lecture Slides Elementary Statistics Eleventh Edition
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Algebra Review The equation of a straight line y = mx + b
9/27/ A Least-Squares Regression.
Created by Erin Hodgess, Houston, Texas
CHAPTER 3 Describing Relationships
Presentation transcript:

Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester

Lecture 13 Chapter 10 (part 2) Correlation and Regression Main Reference: Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Section 10-3 Regression

Key Concept The key concept of this section is to describe the relationship between two variables by finding the graph and the equation of the straight line that best represents the relationship. The straight line is called a regression line and its equation is called the regression equation.

Regression y = mx + b y = b 0 + b 1 x b 0 b 1 The typical equation of a straight line y = mx + b is expressed in the form y = b 0 + b 1 x, where b 0 is the y -intercept and b 1 is the slope. x y The regression equation expresses a relationship between x (called the independent variable, predictor variable or explanatory variable), and y (called the dependent variable or response variable). ^

Requirements 1. The sample of paired ( x, y ) data is a random sample of quantitative data. 2. Visual examination of the scatter plot shows that the points approximate a straight-line pattern. 3. Any outliers must be removed if they are known to be errors. Consider the effects of any outliers that are not known errors.

Definitions  Regression Equation Given a collection of paired data, the regression equation  Regression Line The graph of the regression equation is called the regression line (or line of best fit, or least squares line). algebraically describes the relationship between the two variables.

Regression

Notation for Regression Equation Y - intercept of regression equation  0 b 0 Slope of regression equation  1 b 1 Equation of the regression line Population Parameter Sample Statistic

Formulas for b 0 and b 1 The regression line fits the sample points best.

Example 1 Using the simple random sample of data below, find the value of r. In the last lecture, we used these values to find that the linear correlation coefficient of r = – Use this sample to find the regression equation.

Calculating the Regression Equation - cont

Calculating the Regression Equation - cont The estimated equation of the regression line is:

Example 2 given find the regression equation.

In predicting a value of y based on some given value of x If there is not a linear correlation, the best predicted y -value is y. Predictions 2.If there is a linear correlation, the best predicted y -value is found by substituting the x -value into the regression equation.

From a previous example, we found that the regression equation is y = x, ( r = 0.926). Assuming that x = 180 sec, find the best predicted value of y Example 3 We must consider whether there is a linear correlation that justifies the use of that equation. We do have a significant linear correlation (with r = 0.926).

1. If there is no linear correlation, don’t use the regression equation to make predictions. 2. A regression equation based on old data is not necessarily valid now. 3. Don’t make predictions about a population that is different from the population from which the sample data were drawn. Guidelines for Using The Regression Equation

Definitions  Marginal Change The marginal change is the amount that a variable changes when the other variable changes by exactly one unit.  Outlier An outlier is a point lying far away from the other data points.  Influential Point An influential point strongly affects the graph of the regression line.

Residual The residual for a sample of paired ( x, y ) data, is the difference ( y - y ) between an observed sample y -value and the value of y, which is the value of y that is predicted by using the regression equation. ^ Definition residual = observed y – predicted y = y - y ^

Linear regression with small residual error Linear regression with large residual error

 Least-Squares Property A straight line has the least-squares property if the sum of the squares of the residuals is the smallest sum possible.  Residual Plot A scatter plot of the ( x, y ) values after each of the y - coordinate values have been replaced by the residual value. That is, a residual plot is a graph of the points ( x, ) Definitions

Residual Plot Analysis  If a residual plot does not reveal any pattern, the regression equation is a good representation of the association between the two variables.  If a residual plot reveals some systematic pattern, the regression equation is not a good representation of the association between the two variables.

Residual Plots

Section 10-4 Variation and Prediction Intervals

Key Concept In this section we proceed to consider a method for constructing a prediction interval, which is an interval estimate of a predicted value of y.

Unexplained, Explained, and Total Deviation

Definition Total Deviation The total deviation of ( x, y ) is the vertical distance, which is the distance between the point ( x, y ) and the horizontal line passing through the sample mean y.

Definition Explained Deviation The explained deviation is the vertical distance, which is the distance between the predicted y - value and the horizontal line passing through the sample mean y.

Definition Unexplained Deviation The unexplained deviation is the vertical distance which is the vertical distance between the point ( x, y ) and the regression line. (The distance is also called a residual).

(total deviation) = (explained deviation) + (unexplained deviation) Relationships (total variation) = (explained variation) + (unexplained variation)

Definition r2 =r2 = Explained variation. Total variation The value of r 2 is the proportion of the variation in y that is explained by the linear relationship between x and y. Coefficient of determination r 2 is the amount of the variation in y that is explained by the regression line.

Standard Error of Estimate The standard error of estimate, denoted by S e is a measure of the differences (or distances) between the observed sample y -values and the predicted values y that are obtained using the regression equation. Definition ^

Standard Error of Estimate or

find the standard error S e based on the following: Given: n = 8  y 2 = 60,204  y = 688  xy = 154,378 b 0 = b 1 = Example:

Example (a) Calculate the least squares estimates of the slope and intercept. (b) Use the equation of the fitted line to predict what pavement deflection would be observed when the surface temperature is 85F. (c) Determine the standard error Regression methods were used to analyze the data from a study investigating the relationship between roadway surface temperature ( x ) and pavement deflection ( y ). Summary quantities were

(a) Calculate the least squares estimates of the slope and intercept.

(c) Determine the standard error (b) Use the equation of the fitted line to predict what pavement deflection would be observed when the surface temperature is 85 F.