BCOR 1020 Business Statistics Lecture 24 – April 17, 2008.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Forecasting Using the Simple Linear Regression Model and Correlation
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Chapter 13 Multiple Regression
Chapter 12 Simple Regression
Chapter 12 Multiple Regression
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
The Simple Regression Model
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Business Statistics - QBM117 Interval estimation for the slope and y-intercept Hypothesis tests for regression.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Chapter 9: Correlation and Regression
Linear Regression Example Data
SIMPLE LINEAR REGRESSION
BCOR 1020 Business Statistics
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Correlation and Regression Analysis
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.
Lecture 5 Correlation and Regression
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Correlation and Regression
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Correlation and Regression
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Correlation and Linear Regression
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Correlation and Regression
Bivariate Regression (Part 1) Chapter1212 Visual Displays and Correlation Analysis Bivariate Regression Regression Terminology Ordinary Least Squares Formulas.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide Simple Linear Regression Part A n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Correlation & Regression
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Lecture 10: Correlation and Regression Model.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Correlation & Regression Analysis
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Correlation and Regression
Chapter 13 Simple Linear Regression
Regression and Correlation
Chapter 4 Basic Estimation Techniques
Correlation and Regression
Correlation and Regression
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

BCOR 1020 Business Statistics Lecture 24 – April 17, 2008

Overview Chapter 12 – Linear Regression –Visual Displays and Correlation Analysis –Bivariate Regression –Regression Terminology

Chapter 12 –Visual Displays Begin the analysis of bivariate data (i.e., two variables) with a scatter plot. A scatter plot - displays each observed data pair (x i, y i ) as a dot on an X/Y grid. - indicates visually the strength of the relationship between the two variables. Visual Displays:

Chapter 12 –Visual Displays Visual Displays: The price of Regular Unleaded appears to have a positively sloped linear relationship with the price of Diesel. These variables appear to be correlated.

Chapter 12 –Correlation Analysis The sample correlation coefficient (r) measures the degree of linearity in the relationship between X and Y. -1 < r < +1 r = 0 indicates no linear relationship. Correlation functions are available in Excel, MegaStat and on your calculators. Correlation Analysis: Strong negative relationship Strong positive relationship

Chapter 12 –Correlation Analysis Correlation Analysis (Computing r): This value can be calculated on your calculator or using a software package like Excel or MegaStat.

Chapter 12 –Correlation Analysis Example: Data Set for problem 12.3 (“CallWait”)… Y = Hold time (minutes) for concert tickets X = number of operators Operators (X)Wait Time (Y) There appears to be “some” negative correlation between the variables. Does this make sense? We can calculate the sample correlation coefficient… r = (overhead)

Excel/MegaStat Demo…

Chapter 12 –Correlation Analysis Strong Positive Correlation Weak Positive Correlation Weak Negative Correlation Strong Negative Correlation

Chapter 12 –Correlation Analysis No Correlation Nonlinear Relation

Chapter 12 –Correlation Analysis r is an estimate of the population correlation coefficient  (rho). To test the hypothesis H 0 :  = 0, the test statistic is: The critical value t  is obtained from Appendix D using = n – 2 degrees of freedom for any . We can bound the p-value for this test using the t table or we can find it exactly using Excel or MegaStat. Tests for Significance:

Chapter 12 –Correlation Analysis Equivalently, you can calculate the critical value for the correlation coefficient using This method gives a benchmark for the correlation coefficient. However, there is no p-value and is inflexible if you change your mind about . Tests for Significance:

Chapter 12 –Correlation Analysis Step 1: State the Hypotheses Determine whether you are using a one or two-tailed test and the level of significance (a). H 0 :  = 0 H 1 :  ≠ 0 Step 2: Calculate the Critical Value For degrees of freedom = n -2, look up the critical value t  in Appendix D, then calculate Steps in Testing if  = 0: Step 3: Make the Decision If the sample correlation coefficient r exceeds the critical value r , then reject H 0. If using the t statistic method, reject H 0 if t > t  or if the p-value < .

Chapter 12 –Correlation Analysis Example: In our earlier example on the data set “CallWait”, we calculated the sample correlation, r = , based on n = 5 data points. Calculate the Critical Value, r , to test the hypothesis H 0 :  = 0 vs. H 1 :  ≠ 0 at the 10% level of significance. Since | r | is not greater than r , we cannot reject H 0. There is not a significant correlation between these variables at the 10 % level of significance.

Clickers For our example on the data set “CallWait”, we calculated the sample correlation, r = , based on n = 5 data points. Instead of calculating the Critical Value, r , to test the hypothesis H 0 :  = 0 vs. H 1 :  ≠ 0, we could have calculated the test statistic What are the bounds for the p-value on this test statistic? (A) 0.10 < p-value < 0.20 (B) < p-value < 0.05 (C) 0.05 < p-value < 0.10 t distribution with = n-2 d.f. under H 0.

Chapter 12 –Correlation Analysis As sample size increases, the critical value of r becomes smaller. This makes it easier for smaller values of the sample correlation coefficient to be considered significant. A larger sample does not mean that the correlation is stronger nor does its significance imply importance. Role of Sample Size:

Chapter 12 –Bivariate Regression Bivariate Regression analyzes the relationship between two variables. It specifies one dependent (response) variable and one independent (predictor) variable. This hypothesized relationship may be linear, quadratic, or whatever. What is Bivariate Regression?

Chapter 12 –Bivariate Regression Some Model Forms:

Chapter 12 –Bivariate Regression The intercept and slope of a fitted regression can provide useful information. For example, consider the fitted regression model… Sales(Y) = Ads(X) –Each extra $1 million of advertising will generate $7.37 million of sales on average. –The firm would average $268 million of sales with zero advertising. –However, the intercept may not be meaningful because Ads = 0 may be outside the range of the observed data. Prediction Using Regression:

Chapter 12 –Bivariate Regression One of the main uses of regression is to make predictions. Once you have a fitted regression equation that shows the estimated relationship between X and Y, we can plug in any value of X to make a prediction for Y. Consider our example… Sales(Y) = Ads(X) If the firm spends $10 million on advertising, its expected sales would be… –Sales(Y) = (10) = $341.7 million. Prediction Using Regression:

Chapter 12 –Regression Terminology Unknown parameters that we will estimate are  0 =Intercept  1 =Slope The assumed model for a linear relationship is y i =  0 +  1 x i +  i for all observations (i = 1, 2, …, n) The error term is not observable, but is assumed normally distributed with mean of 0 and standard deviation . Models and Parameters:

Chapter 12 –Regression Terminology The fitted model used to predict the expected value of Y for a given value of X is y i = b 0 + b 1 x i Models and Parameters: The fitted coefficients are b 0 the estimated intercept b 1 the estimated slope Residual is e i = y i - y i. Residuals may be used to estimate , the standard deviation of the errors. We will discuss how b 0 and b 1 are found next lecture. ^ ^

Chapter 12 –Regression Terminology Step 1: - Highlight the data columns. - Click on the Chart Wizard and choose Scatter Plot - In the completed graph, click once on the points in the scatter plot to select the data - Right-click and choose Add Trendline - Choose Options and check Display Equation Fitting a Regression on a Scatter Plot in Excel:

Chapter 12 –Regression Terminology Example: Data Set for problem 12.3 (“CallWait”)… Y = Hold time (minutes) for concert tickets X = number of operators Operators (X)Wait Time (Y) From this output, we have the linear model: y = 458 – 18.5x b 0 = 458 b 1 = Discussion…

Clickers For our example on the data set “CallWait”, we have now calculated the regression model: Wait time (Y) = 458 – 18.5 Operators (X) If the there are 7 operators, what is the expected wait time? (A) 458 (B) (C) (D) 587.5

Chapter 12 –Regression Terminology Regression Caveats: The “fit” of the regression does not depend on the sign of its slope. The sign of the fitted slope merely tells whether X has a positive or negative association with Y. View the intercept with skepticism unless X = 0 is logically possible and was actually observed in the data set. Be wary of extrapolating the model beyond the observed range in the data. Regression does not demonstrate cause-and-effect between X and Y. A good fit shows that X and Y vary together. Both could be affected by another variable or by the way the data are defined.