Inference about the Slope and Intercept

Slides:



Advertisements
Similar presentations
Properties of Least Squares Regression Coefficients
Advertisements

Lesson 10: Linear Regression and Correlation
Chapter 12 Simple Linear Regression
Statistical Techniques I EXST7005 Simple Linear Regression.
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Inference for Regression
Objectives (BPS chapter 24)
The Simple Linear Regression Model: Specification and Estimation
Chapter 10 Simple Regression.
Correlation and Simple Regression Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
Dr. Mario MazzocchiResearch Methods & Data Analysis1 Correlation and regression analysis Week 8 Research Methods & Data Analysis.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
Business Statistics - QBM117 Statistical inference for regression.
Correlation and Regression Analysis
Simple Linear Regression and Correlation
Lecture 5 Correlation and Regression
Introduction to Linear Regression and Correlation Analysis
Chapter 11 Simple Regression
Ms. Khatijahhusna Abd Rani School of Electrical System Engineering Sem II 2014/2015.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Week 21 Statistical Assumptions for SLR  Recall, the simple linear regression model is Y i = β 0 + β 1 X i + ε i where i = 1, …, n.  The assumptions.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Applied Regression Analysis BUSI 6220
Inference about the slope parameter and correlation
Chapter 13 Simple Linear Regression
The simple linear regression model and parameter estimation
GS/PPAL Section N Research Methods and Information Systems
Regression and Correlation
Decomposition of Sum of Squares
Regression Analysis: Statistical Inference
26134 Business Statistics Week 5 Tutorial
Multiple Regression Analysis: Estimation
Correlation and Simple Linear Regression
Linear Regression and Correlation Analysis
Chapter 11: Simple Linear Regression
Simple Linear Regression
Chapter 14: Correlation and Regression
Simple Linear Regression - Introduction
Correlation and Simple Linear Regression
I271B Quantitative Methods
Inference about the Slope and Intercept
Statistical Assumptions for SLR
Correlation and Simple Linear Regression
Simple Linear Regression
Simple Linear Regression
Basic Practice of Statistics - 3rd Edition Inference for Regression
SIMPLE LINEAR REGRESSION
Simple Linear Regression and Correlation
Product moment correlation
SIMPLE LINEAR REGRESSION
Topic 8 Correlation and Regression Analysis
Chapter 14 Inference for Regression
Regression & Correlation (1)
Chapter Thirteen McGraw-Hill/Irwin
Inference about the Slope and Intercept
Regression Models - Introduction
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Inference about the Slope and Intercept Recall, we have established that the least square estimates and are linear combinations of the Yi’s. Further, we have showed that they are unbiased and have the following variances In order to make inference we assume that εi’s have a Normal distribution, that is εi ~ N(0, σ2). This in turn means that the Yi’s are normally distributed. Since both and are linear combination of the Yi’s they also have a Normal distribution. STA302/1001 - week 3

Inference for β1 in Normal Error Regression Model The least square estimate of β1 is , because it is a linear combination of normally distributed random variables (Yi’s) we have the following result: We estimate the variance of by S2/SXX where S2 is the MSE which has n-2 df. Claim: The distribution of is t with n-2 df. Proof: STA302/1001 - week 3

Tests and CIs for β1 The hypothesis of interest about the slope in a Normal linear regression model is H0: β1 = 0. The test statistic for this hypothesis is We compare the above test statistic to a t with n-2 df distribution to obtain the P-value…. Further, 100(1-α)% CI for β1 is: STA302/1001 - week 3

Important Comment Similar results can be obtained about the intercept in a Normal linear regression model. See the book for more details. However, in many cases the intercept does not have any practical meaning and therefore it is not necessary to make inference about it. STA302/1001 - week 3

Example We have Data on Violent and Property Crimes in 23 US Metropolitan Areas.The data contains the following three variables: violcrim = number of violent crimes propcrim = number of property crimes popn = population in 1000's We are interested in the relationship between the size of the city and the number of violent crimes…. STA302/1001 - week 3

Comments Regarding the Crime Example A regression model fit to data from all 23 cities finds a statistically significant linear relationship between numbers of violent crimes in American cities and their populations. For each increase of 1,000 in population, the number of violent crimes increases by 0.1093 on average. 48.6% of the variation in number of violent crimes can be explained by its relationship with population. Because these data are observational, i.e. collected without experimental intervention, it cannot be said that larger populations cause larger numbers of crimes, but only that such an association appears to exist. However, this linear relationship is mostly determined by New York City whose population and number of violent crimes are much larger than any other city, and thus accounts for a large fraction of the variation in the data. When New York is removed from the analysis there is no longer a statistically significant linear relationship and the linear relationship with population explains less than 9% of the variation in number of violent crimes. STA302/1001 - week 3

Bivariate Normal Distribution X and Y are jointly normally distributed if their joint density is where - ∞ < x < ∞ and - ∞ < y < ∞. Can show that the marginal distributions are: and ρ is the correlation between X and Y, i.e., STA302/1001 - week 3

Properties of Bivariate Normal Distribution It can be shown that the conditional distribution of Y given X = x is: Linear combinations of X and Y are normally distributed. A zero covariance between any X and Y implies that they are statistically independent. Note that this is not true in general for any two random variables. STA302/1001 - week 3

Sample Correlation If X and Y are random variables, and we would like a symmetric measure of the direction and strength of the linear relationship between them we can use correlation. Based on n observed pairs (xi , yi) i =1,…,n, the estimate of the population correlation ρ is the Pearson’s Product-Moment Correlation given by It is the MLE of ρ. STA302/1001 - week 3

Facts about r It measures the strength of the linear relationship between X and Y. It is distribution free. r is always a number between –1 and 1. r = 0 indicates no linear association. r = –1 or 1 indicates that the points fall perfectly on a straight line with negative slope. r = 1 or 1 indicates that the points fall perfectly on a straight line with positive slope. The strength of the linear relationship increases as r moves away from 0. STA302/1001 - week 3

STA302/1001 - week 3

Relationship between Regression and Correlation STA302/1001 - week 3