Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.

Slides:



Advertisements
Similar presentations
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Advertisements

Ch11 Curve Fitting Dr. Deshi Ye
Objectives (BPS chapter 24)
Inference for Regression 1Section 13.3, Page 284.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
Multiple regression analysis
Chapter Seventeen HYPOTHESIS TESTING
Chapter 10 Simple Regression.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
The Simple Regression Model
Stat 112 – Notes 3 Homework 1 is due at the beginning of class next Thursday.
Chapter 11 Multiple Regression.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Chapter 11: Inference for Distributions
Chapter 9 Hypothesis Testing.
Introduction to Regression Analysis, Chapter 13,
Chapter 12 Section 1 Inference for Linear Regression.
Simple Linear Regression Analysis
Regression and Correlation Methods Judy Zhong Ph.D.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Chapter 10 Hypothesis Testing
Means Tests Hypothesis Testing Assumptions Testing (Normality)
Regression Method.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Ch4 Describing Relationships Between Variables. Pressure.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
1 Lecture note 4 Hypothesis Testing Significant Difference ©
MBP1010H – Lecture 4: March 26, Multiple regression 2.Survival analysis Reading: Introduction to the Practice of Statistics: Chapters 2, 10 and 11.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Review of Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Chapter 10 The t Test for Two Independent Samples
© Copyright McGraw-Hill 2004
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Model adequacy checking in the ANOVA Checking assumptions is important –Normality –Constant variance –Independence –Have we fit the right model? Later.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
CHAPTER 12 More About Regression
The simple linear regression model and parameter estimation
BINARY LOGISTIC REGRESSION
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
AP Statistics Chapter 14 Section 1.
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
CHAPTER 12 More About Regression
Hypothesis Tests: One Sample
Hypothesis Testing: Hypotheses
CONCEPTS OF HYPOTHESIS TESTING
CHAPTER 29: Multiple Regression*
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Interval Estimation and Hypothesis Testing
CHAPTER 12 More About Regression
Power Section 9.7.
Intro to Confidence Intervals Introduction to Inference
CHAPTER 12 More About Regression
Making Inferences about Slopes
Inference for Regression
Presentation transcript:

Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both null and alternative. Under null, var(Y)=σ 0 2. Under alternative, β 1 >0, and var(Y)=σ 1 2.

Step 1: Choose the test statistic and specify its null distribution Use conditions of the null to find:

Bringing sample size into regression design The sample size n is hidden in the regression results. That is, let:

Step 2: Define the critical value For the univariate linear regression test:

Step 3: Define the Rejection Rule Each test is a right sided test, and so the rule is to reject when the test statistic is greater than the critical value.

Step 4: Specify the Distribution of Test Statistic under Alternative Use conditions of the null to find:

Step 5: Define a Type II Error For the univariate linear regression test:

Step 6: Find β For a univariate linear regression test:

Basic Insight Notice that all three problems have the same basic structure. That is, if you understand the solution of the one sample test, then you can derive the answer to the other problems.

Step 7: Phrase requirement on β For example, we seek to “choose n so that β=0.01.” That is, “choose n so that Pr 1 {Accept H 0 }=β=0.01.

Step 7: Phrase requirement on β For example, we seek to “choose n so that

Step 7: Phrase requirement on β Notice the parallel phrasing:

Step 7: Phrase requirement on β That is, “choose n so that (note that E 0 =0):

Step 7: Phrase requirement on β That is, choose n so that (after algebraic clearing out):

Step 8: State the conclusion The result for a left sided test has to be worked through but is similar. You must remember to keep all entries positive. This is reasonable if both α and β are constrained to be less than or equal to 0.5. The restriction is not a hardship in practice.

Univariate Linear Regression Note that the σ 0 factor is changed to σ 0 /σ X. There is a similar adjustment for the alternative standard deviation.

Example Problem Group Two hundred values of an independent variable x i are chosen so that Σ(x i -xbar) 2 is equal to 400,000. For each setting of x i, the random variable Y i =β 0 +β 1 x i +σZ i is observed. Here β 0 and β 1 are fixed but unknown parameters, σ=400, and the Z i are independent standard normal random variables.

Example Problem Group The null hypothesis to be tested is H 0 : β 1 =0, α=0.01, and the alternative is H 1 : β 1 <0. The random variable B 1 is the OLS estimate of β 1.

Example Question 1 When H 0 is true, what is the standard deviation of B 1, the OLS estimate of the slope? Var(B 1 )=σ 2 /Σ(x i -xbar) 2 =400 2 /400,000=0.4. sd(B 1 )=0.632.

Example Question 2 What is the probability of a Type II error in the test specified in the common section using B 1, the OLS estimator of the slope, as test statistic when β 1 =-4, α=0.01, σ=400, and Σ(x i -xbar) 2 is equal to 400,000?

Solution to Question 2 The critical value is (0.632)=-1.47 A Type II error occurs when B 1 > Under alternative B 1 is normal with expected value -4 and standard deviation (error) Pr{B 1 >-1.47}=Pr{Z>(-1.47-(-4))/0.632} =Pr{Z>4.00}= The answer is

Example Question 3 How many observations n are necessary so that the probability of a Type II error in the test specified in the common section when β 1 =-4, α=0.01, σ=400, and Σ(x i -xbar n ) 2 is equal to 2,000n?

Outline of Solution to Problem 3 For σ o term, use (400 2 /2000) 0.5 =8.94. Use same value for σ 1 term. Use |z 0.01 |= Use |E 1 -E 0 |=|-4-0|=4. Square root of sample size is Sample size is 109 or more.

Chapter 21: Residual Analysis If the assumptions in regression are violated: –Residuals are one way of checking model: R i = Y i - Fitted value at x i

Checking the Assumptions –Check for normality (test of normality, histogram, q-q plots) –Check variance if it is the same for all values of the independent variable (plot residuals against predicted values) –Check independence (plot residuals against sequence variable) –Check for linearity (plot dependent variable against independent variable)

Residual Plots Plot residuals against independent variable. –Plot should be flat indicating the same variance. –There should be no fanning out pattern. –Check for influential observations. Plot residuals against predicted variable. –For univariate regression this is the same as the above plot. There should be no pattern.

What to do if problem? Can look for transformations of either independent or dependent variable or both. Using computer this is easy: compute option from menu bar.

Influential Points An easier way to look for points that have a large impact on the slope is to plot the change in slope against an arbitrary case sequence number.

Example Data set in the web page aim: predict final exam score from midterm score dependent variable: final exam score independent variable: midterm score model, check assumptions, predict

Output Model: Y=  0 +  1 X +  R 2 = F statistics=60.91, Significance=0.0  1 = , t statistic=7.805, Significance=0.0  0 =238.95, t statistic=8.329, Significance=0.0

Next Class Multiple Regression! Check web site for your data file