Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.

Slides:



Advertisements
Similar presentations
Lecture 17: Tues., March 16 Inference for simple linear regression (Ch ) R2 statistic (Ch ) Association is not causation (Ch ) Next.
Advertisements

Chapter 12 Inference for Linear Regression
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Objectives 10.1 Simple linear regression
Chapter 27 Inferences for Regression This is just for one sample We want to talk about the relation between waist size and %body fat for the complete population.
Inference for Regression
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Sampling: Final and Initial Sample Size Determination
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Linear regression models
Objectives (BPS chapter 24)
Inference for Regression 1Section 13.3, Page 284.
Chapter 10 Simple Regression.
Class 8: Tues., Oct. 5 Causation, Lurking Variables in Regression (Ch. 2.4, 2.5) Inference for Simple Linear Regression (Ch. 10.1) Where we’re headed:
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
The Simple Regression Model
Stat 112 – Notes 3 Homework 1 is due at the beginning of class next Thursday.
Chapter 11 Multiple Regression.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Pertemua 19 Regresi Linier
Stat Notes 4 Chapter 3.5 Chapter 3.7.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Lecture 19 Simple linear regression (Review, 18.5, 18.8)
Stat 112: Notes 2 This class: Start Section 3.3. Thursday’s class: Finish Section 3.3. I will and post on the web site the first homework tonight.
Simple Linear Regression and Correlation
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Chapter 12 Section 1 Inference for Linear Regression.
Simple Linear Regression Analysis
Correlation & Regression
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Inference for regression - Simple linear regression
STA291 Statistical Methods Lecture 27. Inference for Regression.
Hypothesis Testing in Linear Regression Analysis
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Chapter 11: Estimation Estimation Defined Confidence Levels
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
QBM117 Business Statistics Estimating the population mean , when the population variance  2, is known.
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
AP STATISTICS LESSON 14 – 1 ( DAY 1 ) INFERENCE ABOUT THE MODEL.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Chapter 8: Confidence Intervals based on a Single Sample
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Stat 112 Notes 14 Assessing the assumptions of the multiple regression model and remedies when assumptions are not met (Chapter 6).
BPS - 5th Ed. Chapter 231 Inference for Regression.
Chapter 13 Simple Linear Regression
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
AP Statistics Chapter 14 Section 1.
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Statistics 350 Lecture 4.
Stat 112 Notes 4 Today: Review of p-values for one-sided tests
CHAPTER 29: Multiple Regression*
Review of Hypothesis Testing
Regression Chapter 8.
CHAPTER 12 More About Regression
Simple Linear Regression
Inference for Regression
Presentation transcript:

Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression model. –Inferences for simple linear regression model.

Wages and Education A random sample of 100 men (ages 18-70) was surveyed about their weekly wages in 1988 and their education (part of the 1988 March U.S. Current Population Survey) (in file wagedatasubset.JMP) How much more on average do men with one extra year of education make? If a man has a high school diploma but no further education, what’s the best prediction of his earnings? Regression addresses these two questions X=Education, Y= Weekly Wage

Simple Linear Regression Model

Sample vs. Population We can view the data – -- as a sample from a population. Our goal is to learn about the relationship between X and Y in the population: –We don’t care about the particular 100 men sampled but about the population of US men ages –From Notes 1, we don’t care about the relationship between tracks counted and the density of deer for the particular sample, but the relationship among the population of all tracks; this enables to predict in the future the density of deer from the number of tracks counted.

Simple Linear Regression Model

Assumptions of the Simple Linear Regression Model

Checking the Assumptions

Residual Plot

Checking Linearity Assumption

Violation of Linearity

Checking Constant Variance

Checking Normality

Checking Assumptions It is important to check the assumptions of a regression model because the inferences depend on the assumptions approximately holding. The assumptions don’t need to hold exactly but only approximately. We will study more about checking assumptions and how to deal with violations of the assumptions in Chapters 5 and 6.

Inferences

Sampling Distribution of b 0,b 1 The sampling distribution of describes the probability distribution of the estimates over repeated samples from the simple linear regression model. Understanding the sampling distribution is the key to drawing inferences from the sample to the population.

Sampling distribution in wage data To see how the least squares estimates can differ over different samples from the population, we consider the “population” to be all 25,632 men surveyed in the March 1988 Current Population Survey in wagedata1988.JMP and the sample to be random samples of size 100 like the one in wagedatasubset.JMP.

Samples of wage data To take samples in JMP, click the Tables menu, then click Subset and then click the circle next to Random Sample Size and set the sample size. JMP will create a new data subset which is a random sample of the original data set.

Sampling distributions Only sample, not population, is usually available so we need to understand sampling distribution. Sampling distribution of – –Sampling distribution is normally distributed. –Even if normality assumption fails, sampling distributions of are still approximately normal if n>30.

Properties of and as estimators of and Unbiased Estimators: Mean of the sampling distribution is equal to the population parameter being estimated. Consistent Estimators: As the sample size n increases, the probability that the estimator will become as close as you specify to the true parameter converges to 1. Minimum Variance Estimator: The variance of the estimator is smaller than the variance of any other linear unbiased estimator of, say

Confidence Intervals Point Estimate: Confidence interval: range of plausible values for the true slope Confidence Interval: where is an estimate of the standard deviation of ( ) Typically we use a 95% CI. 95% CI is approximately 95% CIs for a parameter are usually approximately where the standard error of the point estimate is an estimate of the standard deviation of the point estimate.

Computing Confidence Interval with JMP

Summary We have described the assumptions of the simple linear regression model and how to check them. We have come up with a method of describing the uncertainty in our estimates of the slope and the intercept via confidence intervals. Note: These confidence intervals are only accurate if the assumptions of the simple linear regression model are approximately correct. Next class: Hypothesis tests.