Copyright © Cengage Learning. All rights reserved. 12 Simple Linear Regression and Correlation

Slides:



Advertisements
Similar presentations
Test of (µ 1 – µ 2 ),  1 =  2, Populations Normal Test Statistic and df = n 1 + n 2 – 2 2– )1– 2 ( 2 1 )1– 1 ( 2 where ] 2 – 1 [–
Advertisements

Topic 12: Multiple Linear Regression
 Population multiple regression model  Data for multiple regression  Multiple linear regression model  Confidence intervals and significance tests.
Chapter 12 Simple Linear Regression
Forecasting Using the Simple Linear Regression Model and Correlation
Hypothesis Testing Steps in Hypothesis Testing:
Linear regression models
EPI 809/Spring Probability Distribution of Random Error.
Simple Linear Regression and Correlation
Definition  Regression Model  Regression Equation Y i =  0 +  1 X i ^ Given a collection of paired data, the regression equation algebraically describes.
1 Chapter 2 Simple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Chapter 12 Simple Linear Regression
Chapter 10: Inferential for Regression 1.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Chapter 11: Inferential methods in Regression and Correlation
Reading – Linear Regression Le (Chapter 8 through 8.1.6) C &S (Chapter 5:F,G,H)
SIMPLE LINEAR REGRESSION
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
Chapter Topics Types of Regression Models
1 Regression Analysis Regression used to estimate relationship between dependent variable (Y) and one or more independent variables (X). Consider the variable.
SIMPLE LINEAR REGRESSION
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
This Week Continue with linear regression Begin multiple regression –Le 8.2 –C & S 9:A-E Handout: Class examples and assignment 3.
1 732G21/732A35/732G28. Formal statement  Y i is i th response value  β 0 β 1 model parameters, regression parameters (intercept, slope)  X i is i.
Simple Linear Regression and Correlation
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Simple Linear Regression Analysis
Correlation & Regression
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
Time Series Analysis – Chapter 2 Simple Regression Essentially, all models are wrong, but some are useful. - George Box Empirical Model-Building and Response.
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Chapter 11 Linear Regression Straight Lines, Least-Squares and More Chapter 11A Can you pick out the straight lines and find the least-square?
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Simple Linear Regression ANOVA for regression (10.2)
CORRELATION: Correlation analysis Correlation analysis is used to measure the strength of association (linear relationship) between two quantitative variables.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Chapter 12: Correlation and Linear Regression 1.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Dependent (response) Variable Independent (control) Variable Random Error XY x1x1 y1y1 x2x2 y2y2 …… xnxn ynyn Raw data: Assumption:  i ‘s are independent.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
1 Experimental Statistics - week 12 Chapter 11: Linear Regression and Correlation Chapter 12: Multiple Regression.
Regression Analysis Presentation 13. Regression In Chapter 15, we looked at associations between two categorical variables. We will now focus on relationships.
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
Chapter 12: Correlation and Linear Regression 1.
The simple linear regression model and parameter estimation
Chapter 20 Linear and Multiple Regression
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Prepared by Lee Revere and John Large
PENGOLAHAN DAN PENYAJIAN
Correlation and Simple Linear Regression
Review of Chapter 2 Some Basic Concepts: Sample center
Simple Linear Regression and Correlation
Linear Regression and Correlation
Linear Regression and Correlation
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Copyright © Cengage Learning. All rights reserved. 12 Simple Linear Regression and Correlation

Regression and Causality

Linear Regression: Definitions X: predictor, explanatory, independent variable Y: response, dependent variable

Example: Scatterplot The following data is to determine the relationship between age and change in systolic blood pressure (BP, mm Hg) after 24 hours in response to a particular treatment. a) Draw a scatterplot of this data. Obs Age BP

Example: Scatterplot (cont) Age

Error in the Regression Line

Distribution of Y

Linear Regression: Assumptions 1.There is a linear relationship between X and Y. 2.Each (X,Y) pair is random and independent of the other pairs. 3.Variance of the residuals is constant.

Principle of Least Squares

Point Estimations

X

Example: Least Squares The following data is to determine the relationship between age and change in systolic blood pressure (BP, mm Hg) after 24 hours in response to a particular treatment. a)What is the regression line for this data? x̄ = , ȳ = , S xy = , S xx =

Example: Least Squares (cont)

Extrapolation

Example: Least Squares point estimate The following data is to determine the relationship between age and change in systolic blood pressure (BP, mm Hg) after 24 hours in response to a particular treatment. c) What is the point estimate for the change in BP for someone who is 56 years old? 51 years old? d) What is the residual at this age 51? The actual data point is (51, -10).

ANOVA Table SourcedfSSMS Model (Regression) 1SSM Errorn - 2 Totaln - 1

Meaning of σ 2

Meaning of R 2

Cautions about R 2 1.Linearity 2.Association 3.Outliers 4.Prediction

Example: Least Squares point estimate The following data is to determine the relationship between age and change in systolic blood pressure (BP, mm Hg) after 24 hours in response to a particular treatment. e) What proportion of the observed variation in y can be attributed to the simple linear regression relationship between x and y?

Example: Least Squares (cont) outlier?

Inference on the slope Regression_line.htm

Normality of Y

Example: Regression Suppose the mean daily peak load (MW) for a power plant and the maximum outdoor temperature ( o F) for a sample of 10 days is given below. a)What is the estimated regression line (besides the equation of the line, include R 2 )? S xx = , S xy = , x̄ = , ȳ = SSE = , SST = x i ( o F) y i (MW)

Example: Regression (cont) Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > |t| Intercept temp <.0001 X

Example: Regression (cont) X

Example: Regression The cetane number is a critical property in specifying the ignition quality of a fuel used in a diesel engine. Determination of this number for a biodiesel fuel is expensive and time-consuming. Therefore a way of predicting this number is wanted. The data on the next slide is x = iodine value (g) and y = cetane number for a sample of 14 biofuels. The iodine value is the amount of iodine necessary to saturate a sample of 100g of oil. S xx = , S xy = , x̄ = , ȳ = , SSE = , SST = a)What is the estimated regression line (besides the equation of the line, include R 2 )?

Example: Scatterplot

Example (Example 12.4): CI The cetane number is a critical property in specifying the ignition quality of a fuel used in a diesel engine. Determination of this number for a biodiesel fuel is expensive and time-consuming. Therefore a way of predicting this number is wanted. The data on the next slide is x = iodine value (g) and y = cetane number for a sample of 14 biofuels. The iodine value is the amount of iodine necessary to saturate a sample of 100g of oil. S xx = , S xy = , x̄ = , ȳ = , SSE = , SST = b) What is the 95% CI for the true slope?

β 1 Hypothesis test: Summary

Example (Example 12.4): Hypothesis test The cetane number is a critical property in specifying the ignition quality of a fuel used in a diesel engine. Determination of this number for a biodiesel fuel is expensive and time-consuming. Therefore a way of predicting this number is wanted. The data on the next slide is x = iodine value (g) and y = cetane number for a sample of 14 biofuels. The iodine value is the amount of iodine necessary to saturate a sample of 100g of oil. c) Is the model useful (that is, is there a useful linear relationship between x and y)?

ANOVA Table SourcedfSSMS Model (Regression) 1SSM Errorn - 2 Totaln - 1

Copyright © Cengage Learning. All rights reserved Inferences Concerning  Y  x  and the Prediction of Future Y Values

 Hypothesis test: Summary

Example (12.4): Hypothesis test for  The cetane number is a critical property in specifying the ignition quality of a fuel used in a diesel engine. Determination of this number for a biodiesel fuel is expensive and time-consuming. Therefore a way of predicting this number is wanted. The data on the next slide is x = iodine value (g) and y = cetane number for a sample of 14 biofuels. The iodine value is the amount of iodine necessary to saturate a sample of 100g of oil. d) Is the model useful (that is, is there a useful linear relationship between x and y) using the population correlation coefficient? SSE = , SST = S yy = S xx = , S xy =

Residual Plots Good Linearity Violation

Residual Plots Good Constant variance violation

Residual Plots

Example: SLR 1 – Residual Plot

Example: SLR 1 – Normality

Residual Plots X