Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Inference for Regression
Objectives (BPS chapter 24)
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Chapter 13 Multiple Regression
Chapter 10 Simple Regression.
Chapter 12 Simple Regression
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Stat 512 – Lecture 18 Multiple Regression (Ch. 11)
Chapter 12 Multiple Regression
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 13-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Stat 217 – Day 26 Regression, cont.. Last Time – Two quantitative variables Graphical summary  Scatterplot: direction, form (linear?), strength Numerical.
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Stat 217 – Week 10. Outline Exam 2 Lab 7 Questions on Chi-square, ANOVA, Regression  HW 7  Lab 8 Notes for Thursday’s lab Notes for final exam Notes.
Stat 512 – Lecture 17 Inference for Regression (9.5, 9.6)
Linear Regression Example Data
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 10 th Edition.
Correlation and Regression Analysis
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 7 Forecasting with Simple Regression
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Least squares line Interpreting coefficients Prediction Cautions The formal model Section 2.6, 9.1, 9.2 Professor Kari Lock Morgan.
Chapter 13 Simple Linear Regression
Correlation & Regression
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Inference for regression - Simple linear regression
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Correlation and Linear Regression
Maths Study Centre CB Open 11am – 5pm Semester Weekdays
Chapter 14 Simple Regression
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Applied Quantitative Analysis and Practices LECTURE#22 By Dr. Osman Sadiq Paracha.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Lecture 10: Correlation and Regression Model.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
June 30, 2008Stat Lecture 16 - Regression1 Inference for relationships between variables Statistics Lecture 16.
Chapter 10 Inference for Regression
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 12 Simple Linear Regression.
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Conceptual Foundations © 2008 Pearson Education Australia Lecture slides for this course are based on teaching materials provided/referred by: (1) Statistics.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Chapter 13 Simple Linear Regression
Inference for Regression
Simple Linear Regression
Chapter 11 Simple Regression
Chapter 13 Simple Linear Regression
CHAPTER 26: Inference for Regression
CHAPTER 12 More About Regression
Chapter 13 Simple Linear Regression
Presentation transcript:

Stat 217 – Day 25 Regression

Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null hypothesis:  1 =  2 = … =  I (no association between the two variables)  Alternative hypothesis: at least one  differs (there is an association between the two variables)

Example (with 3 groups…) Not significant Significant

How?  Compare differences in means vs. the natural variability in the data (s)  Compare test statistic to F distribution, p-value  Output: test statistic, p-value, ratio of variability between groups to variability within groups Demo Strong evidence (p-value =.03 <.05) that the type of disability affected the ratings, on average, of these 70 students

Technical Conditions Technical conditions:  Randomness: random sampling or random assignment  Sample sizes: Normal populations  Equal standard deviations: Check ratio of sample standard deviations  Kinda need same shape and spread for a comparison of just means to be reasonable

Technical Conditions 1) Randomness Random assignment 2) Each population follows a normal distribution 3) Each population has the same standard deviation 1.794/1.482 < 2

Summary: Comparing several groups Categorical response H 0 :  1 =  2 = … =  I Ha: at least one  differs Is test statistic large?  Chi-square test Expands 2 sample z-test Quantitative response H 0 :  1 =  2 = … =  I Ha: at least one  differs Is test statistic large?  ANOVA Expands 2 sample t-test No association between variables Is an association between variables

Exam 2 comments Pet owners and CPR (a) Make sure interpret the calculated interval “55% of pet owners “ – sample or population? (b) Technical conditions Using the ones for categorical data (c) See whether.5 is inside CI (d) Interpretation of p-value: chance of data at least this extreme if null hypothesis is true (e) Why is sample size information important? Sampling variability

Exam 2 comments Anchoring (a) Make sure clear which is which (b) “TC met”, TOS applet with 2 means (c) Chicago average estimate is 51K to 1.6 million higher than Green Bay average (direction!) (d) What does it mean to say it’s significant? What is the actual conclusion to the research question

Exam 2 comments Lab 6

Exam 2 comments Multiple choice 1. B 2. C – either is possible 3. B – small p-value eliminates “random chance” as a plausible explanation 4. B – it’s only unusual if she’s guessing (7s and 11s are only unusual for fair dice) Extra Credit  More likely to get a value far from mean with smaller sample size (e.g., n =1)

Next Topic: Two quantitative variables Graphical summary Numerical summary Model to allow predictions Inference beyond sample data

Activity 26-1 (p. 532) Have a sample of 20 homes for sale in Arroyo Grande in 2007  Variable 1 = house price  Variable 2 = house size Is there a relationship between these 2 variables?  Does knowing the house size help us predict its price?

1) Graphical summary: scatterplot Price vs. size 1. Direction Positive or negative? 2. Strength How closely follow the pattern 3. Form Linear?

Describing Scatterplots Activity 26-3 (p. 536) Positive  None  Negative Strong  Weak  Strong Direction Strength Form: Linear or not

2) Numerical summary: Correlation coefficient (Act 27-1)

Temperatures vs. Month Direction: positive then negative Form: nonlinear Strength: very strong r =.257

Example 1: Price vs. Size r =.780 What do you learn from these numerical and graphical summaries?

Turn in, with partner  Activity 26-6 parts b, c, and e For Thursday  Pre-lab for Lab 9 For Monday  Activity 26-7  HW 7

2) Guess the correlation Applet

3) Model IF it is linear, what line best summarizes the relationship?  Demo Demo Moral: The “least squares regression line” minimizes the sum of the squared residuals

Interpreting the equation (p. 577)  a = intercept, b = slope  Slope = predicted change in response associated with a one-unit increase in the explanatory  Intercept = predicted value of response when explanatory variable = 0 Explanatory variable Response variable

3) Model? Price-hat =  size  Slope = each additional square foot in house size is associated with a $169 increase in predicted price (price per foot) Be a little careful here, don’t sound too “causal” I really do like the “predicted” in here  Intercept = a house of size zero (empty lot?) is predicted to cost $265,222 Be a little careful here, don’t have any houses in data set with size near 0…

Using the model Price-hat =  size  Predicted price for a 1250 square foot house?  Predicted price for a 3000 square foot house? Extrapolation: Very risky to use regression equation to predict values far outside the range of x values used to derive the line!

4) Is this relationship statistically significant? Is it possible there is no relationship between house price and size in the population of all homes for sale at that time, and we just happened to coincidently obtain this relationship in our random sample? Or is this relationship strong enough to convince us it didn’t happen just by chance but reflects a genuine relationship in the population?

p. 605 Let  represent the slope of the population regression line H 0 :  = 0; no relationship between price and size in population H a :  ≠ 0; is a relationship  positive Idea: Want to compare the observed sample slope to zero, does it differ more than we would expect by chance?

Assume  = 0 How many standard deviations away? Variation in sample slopes  Sample slopes  our slope? Standard error = SE(b) 169

Minitab The regression equation is Price = Size (sq ft) Predictor Coef SE Coef T P Constant Size (sq ft) Regression equation (add hat) b a SE(b) Two-sided t=(observed slope-hypothesized slope) standard error of slope = (b – 0)/SE(b) = ( )/31.88 = 5.29

Turn in, with partner  Price vs. pages: Interpret slope/evaluate p-value For Tuesday  Activities 26-7, 28-5  Be working on Lab 9 and HW 7 The regression equation is Price = Pages Predictor Coef SE Coef T P Constant Pages

Describing Scatterplots Activity 26-6 (p. 539) Positive, nonlinear, fairly strong Causation? Strength: How closely do the points follow the pattern? Direction Strength Form: Linear or not

For Monday Activities 26-7, 28-5 Be working on Lab 9 and HW 7