Class 4 Simple Linear Regression. Regression Analysis Reality is thought to behave in a manner which may be simulated (predicted) to an acceptable degree.

Slides:



Advertisements
Similar presentations
Test of (µ 1 – µ 2 ),  1 =  2, Populations Normal Test Statistic and df = n 1 + n 2 – 2 2– )1– 2 ( 2 1 )1– 1 ( 2 where ] 2 – 1 [–
Advertisements

Chapter 12 Simple Linear Regression
Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Chapter 14, part D Statistical Significance. IV. Model Assumptions The error term is a normally distributed random variable and The variance of  is constant.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Regression Analysis Using Excel. Econometrics Econometrics is simply the statistical analysis of economic phenomena Here, we just summarize some of the.
Chapter 12 Simple Linear Regression
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 10 Simple Regression.
Chapter 12 Simple Regression
The Simple Regression Model
SIMPLE LINEAR REGRESSION
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter Topics Types of Regression Models
Simple Linear Regression Analysis
Introduction to Probability and Statistics Linear Regression and Correlation.
Korelasi dalam Regresi Linear Sederhana Pertemuan 03 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
BCOR 1020 Business Statistics
Introduction to Regression Analysis, Chapter 13,
Linear Regression/Correlation
Lecture 15 Basics of Regression Analysis
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Simple Linear Regression Models
EQT 272 PROBABILITY AND STATISTICS
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
CHAPTER 15 Simple Linear Regression and Correlation
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
INTRODUCTORY LINEAR REGRESSION SIMPLE LINEAR REGRESSION - Curve fitting - Inferences about estimated parameter - Adequacy of the models - Linear.
1 1 Slide Simple Linear Regression Coefficient of Determination Chapter 14 BA 303 – Spring 2011.
Linear Regression Least Squares Method: the Meaning of r 2.
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Chapter 11 Linear Regression Straight Lines, Least-Squares and More Chapter 11A Can you pick out the straight lines and find the least-square?
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Go to Table of Content Single Variable Regression Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Chapter 5: Regression Analysis Part 1: Simple Linear Regression.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Chapter 13 Multiple Regression
Regression Analysis Relationship with one independent variable.
Lecture 10: Correlation and Regression Model.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
REGRESSION AND CORRELATION SIMPLE LINEAR REGRESSION 10.2 SCATTER DIAGRAM 10.3 GRAPHICAL METHOD FOR DETERMINING REGRESSION 10.4 LEAST SQUARE METHOD.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Inference about the slope parameter and correlation
The simple linear regression model and parameter estimation
Statistics for Managers using Microsoft Excel 3rd Edition
Simple Linear Regression
Relationship with one independent variable
Simple Linear Regression
Quantitative Methods Simple Regression.
Simple Linear Regression
Relationship with one independent variable
Simple Linear Regression
MGS 3100 Business Analysis Regression Feb 18, 2016
St. Edward’s University
Presentation transcript:

Class 4 Simple Linear Regression

Regression Analysis Reality is thought to behave in a manner which may be simulated (predicted) to an acceptable degree of accuracy by a simplified mathematical model. Statistical models (which include regression) permit some degree of random error, because some variable of interest cannot be duplicated under seemingly identical conditions.

An Example We would like to predict test scores on an academic test. Ten such scores are shown below: A possible model of test scores: A test score, y, is obtained by taking the average test score, , and adding a random value, , to it y =  + 

Example (cont.) How might we estimate  ? How do we tell if our model is useful?

Improving the Model We would have a more useful model if we could remove (explain) some of the variability that we see in the data. Perhaps there exists other factors that cause variability in the test score. Can you think of some?

Improving the Model (cont.) Here is the data including the hours of study.

Improving the Model (cont.) We have the same problem: Select the best line that minimizes the (squared) distance of the data points to the line. This line is referred to as the least square line. Our model now looks like Our estimated or fitted line will be called

Another view of the Model This (and all linear regression) model(s) can be expressed as y = E(y) + . So in our model, E(y) =  0 +  1 x, that is, the mean test score falls on a straight line as a function of hours of study. The random error term, , is assumed to have a normal distribution with mean 0 and variance  2. Our ability to effectively use the model depends on this variation.

Analysis of Variance It turns out that the variation displayed by the variable y, referred to as the total sum of squares (SST), can be broken into two pieces: The part caused by the variable x, called the regression sum of squares (SSR), The part left over (the distance from the data points to the regression line), called the sum of squared errors or residual sum of squares (SSE). SST = SSR + SSE

Getting it done with EXCEL Select tools/data analysis/regression. r, correlation r 2 = SSR/SST s, the square root of s 2, our estimate of  2 r a 2 = 1 - (1-r 2 )[(n-1)/(n-p-1)]

Getting it done with EXCEL SSR SSE SST Actual sums of squares Sums of squares divided by degrees of freedom MSR MSE, also s 2 MSR/MSE p-value For example,

Getting it done with EXCEL Least square estimates Standard deviation of our estimate t-test for the hypothesis that the coefficient (  ) is 0 The important part: the p-value for the t-test Confidence Intervals for our estimates b0b0 b1b1

Hypothesis Testing The F-test tests to see if all of the coefficients of the independent variables are zero. For our model: The t-test tests to see if each coefficient of an independent variables is zero.

Using the Model The model has two basic purposes: (1) It can be used to provide partial confirmation of the theory that a particular factor is, indeed, influencing the response variable, y. (2) It can be used to estimate the mean, E(y), and predict an actual value of y. Under the function wizard, select forecast(new_x, known_y, known_x).

Confidence interval can be generated (page 555 for more discussion). Let Using the Model

EXCEL does not provide an automatic calculation for confidence and prediction intervals The authors have included a macro in the spreadsheet called PredInt.xls on your data disk. Simply open the file and follow the instructions! Using the Model

A Note on Correlation Many people prefer to perform a correlational analysis before they build regression models. In EXCEL this can be accomplished in two ways: Under the function wizard, use correl(array1,array2) to find the correlation between two variables. Under tools/data analysis/correlation to determine the correlation between several variables.

Correlation (cont.) What correlation does: Provides an easy measure to determine if two variables have a linear relationship. Positive correlation implies if one variable goes up, the other also tends to go up. Negative correlation implies if one variable goes up, the other tends to go down. What correlation does not do: There is no implication of cause and effect. There may exist some lurking factor that produces the behavior being witnessed.