Stat 31, Section 1, Last Time 2-way tables –Testing for Independence –Chi-Square distance between data & model –Chi-Square Distribution –Gives P-values.

Slides:



Advertisements
Similar presentations
Chapter 12 Inference for Linear Regression
Advertisements

Lesson 10: Linear Regression and Correlation
Objectives 10.1 Simple linear regression
Copyright © 2010 Pearson Education, Inc. Slide
Inference for Regression
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Chapter 8 Linear Regression © 2010 Pearson Education 1.
Objectives (BPS chapter 24)
Simple Linear Regression Analysis
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Ch. 14: The Multiple Regression Model building
Stor 155, Section 2, Last Time 2-way Tables –Sliced populations in 2 different ways –Look for independence of factors –Chi Square Hypothesis test Simpson’s.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Chapter 12 Section 1 Inference for Linear Regression.
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Linear Regression and Correlation
September In Chapter 14: 14.1 Data 14.2 Scatterplots 14.3 Correlation 14.4 Regression.
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Stor 155, Section 2, Last Time Inference for Regression –Least Square Fits –Sampling distrib’ns for slope and intercept Regression Tool –Gave many useful.
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Chapter 15 Inference for Regression
Inference for Linear Regression Conditions for Regression Inference: Suppose we have n observations on an explanatory variable x and a response variable.
1 1 Slide Simple Linear Regression Part A n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Stat 31, Section 1, Last Time Inference for Proportions –Hypothesis Tests 2 Sample Proportions Inference –Skipped 2-way Tables –Sliced populations in 2.
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Chapter 11 Linear Regression Straight Lines, Least-Squares and More Chapter 11A Can you pick out the straight lines and find the least-square?
Inference for Regression Section Starter The Goodwill second-hand stores did a survey of their customers in Walnut Creek and Oakland. Among.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
Section 12.3 Regression Analysis HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
AP STATISTICS LESSON 14 – 1 ( DAY 1 ) INFERENCE ABOUT THE MODEL.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Regression. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other words, there is a distribution.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Chapter 26: Inference for Slope. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other.
Chapter 12 Inference for Linear Regression. Reminder of Linear Regression First thing you should do is examine your data… First thing you should do is.
BPS - 5th Ed. Chapter 231 Inference for Regression.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
CHAPTER 12 More About Regression
CHAPTER 3 Describing Relationships
Stat 31, Section 1, Last Time Sampling Distributions
Inference for Regression
CHAPTER 12 More About Regression
Statistical Methods For Engineers
CHAPTER 29: Multiple Regression*
CHAPTER 26: Inference for Regression
Simple Linear Regression
Basic Practice of Statistics - 3rd Edition Inference for Regression
CHAPTER 12 More About Regression
Simple Linear Regression
Simple Linear Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 12 More About Regression
CHAPTER 3 Describing Relationships
Linear Regression and Correlation
Presentation transcript:

Stat 31, Section 1, Last Time 2-way tables –Testing for Independence –Chi-Square distance between data & model –Chi-Square Distribution –Gives P-values (CHIDIST) Simpson’s Paradox: –Lurking variables can reverse comparisons Recall Linear Regression –Fit a line to a scatterplot

Recall Linear Regression Idea: Fit a line to data in a scatterplot Recall Class Example 14 To learn about “basic structure” To “model data” To provide “prediction of new values”

Inference for Regression Goal: develop Hypothesis Tests and Confidence Int’s For slope & intercept parameters, a & b Also study prediction

Inference for Regression Idea: do statistical inference on: –Slope a –Intercept b Model: Assume: are random, independent and

Inference for Regression Viewpoint: Data generated as: y = ax + b Y i chosen from X i Note: a and b are “parameters”

Inference for Regression Parameters and determine the underlying model (distribution) Estimate with the Least Squares Estimates: and (Using SLOPE and INTERCEPT in Excel, based on data)

Inference for Regression Distributions of and ? Under the above assumptions, the sampling distributions are: Centerpoints are right (unbiased) Spreads are more complicated

Inference for Regression Formula for SD of : Big (small) for big (small, resp.) –Accurate data  Accurate est. of slope Small for x’s more spread out –Data more spread  More accurate Small for more data –More data  More accuracy

Inference for Regression Formula for SD of : Big (small) for big (small, resp.) –Accurate data  Accur’te est. of intercept Smaller for –Centered data  More accurate intercept Smaller for more data –More data  More accuracy

Inference for Regression One more detail: Need to estimate using data For this use: Similar to earlier sd estimate, Except variation is about fit line is similar to from before

Inference for Regression Now for Probability Distributions, Since are estimating by Use TDIST and TINV With degrees of freedom =

Inference for Regression Convenient Packaged Analysis in Excel: Tools  Data Analysis  Regression Illustrate application using: Class Example 27, Old Text Problem 8.6 (now 10.12)

Inference for Regression Class Example 27, Old Text Problem 8.6 (now 10.12) Utility companies estimate energy used by their customers. Natural gas consumption depends on temperature. A study recorded average daily gas consumption y (100s of cubic feet) for each month. The explanatory variable x is the average number of heating degree days for that month. Data for October through June are:

Inference for Regression Data for October through June are: MonthX = Deg. DaysY = Gas Cons’n Oct Nov Dec Jan Feb Mar Apr May Jun01.1

Inference for Regression Class Example 27, Old Text Problem 8.6 (now 10.12) Excel Analysis: Good News: Lots of things done automatically Bad News: Different language, so need careful interpretation

Inference for Regression Excel Glossary: ExcelStat 31 R2R2 r 2 = Prop’n of Sum of Squares Explained by Line interceptIntercept b X VariableSlope a CoefficientEstimates &.

Inference for Regression Excel Glossary: ExcelStat 31 Standard Errors Estimates of &. (recall from Sampling Dist’ns) T – Stat.(Est. – mean) / SE, i.e. put on scale of T – distribution P-valueFor 2-sided test of:

Inference for Regression Excel Glossary: ExcelStat 31 Lower 95% Upper 95% Ends of 95% Confidence Interval for a and b (since chose 0.95 for Confidence level) Predicted.Points on line at, i.e..

Inference for Regression Excel Glossary: ExcelStat 31 Residual for. Recall: gave useful information about quality of fit Standard Residuals: on standardized scale

Inference for Regression Some useful variations: Class Example 28, Old Text Problems (now – 10.15) Excel Analysis:

Inference for Regression Class Example 28, (now – 10.15) Old 10.8: Engineers made measurements of the Leaning Tower of Pisa over the years 1975 – “Lean” is the difference between a points position if the tower were straight, and its actual position, in tenths of a meter, in excess of 2.9 meters. The data are:

Inference for Regression Class Example 28, (now – 10.15) Old 10.8: The data are: YearLean

Inference for Regression Class Example 28, (now – 10.15) Old 10.8: (a)Plot the data, does the trend in lean over time appear to be linear? (b)What is the equation of the least squares fit line? (c)Give a 95% confidence interval for the average rate of change of the lean.

Inference for Regression HW: 10.3 b,c 10.5

And Now for Something Completely Different Etymology of: “And now for something completely different” Anybody heard of this before?

And Now for Something Completely Different What is “etymology”? Google responses to: define: etymology The history of words; the study of the history of words. csmp.ucop.edu/crlp/resources/glossary.html csmp.ucop.edu/crlp/resources/glossary.html The history of a word shown by tracing its development from another language.

And Now for Something Completely Different What is “etymology”? Etymology is derived from the Greek word e/)tymon(etymon) meaning "a sense" and logo/j(logos) meaning "word." Etymology is the study of the original meaning and development of a word tracing its meaning back as far as possible.

And Now for Something Completely Different Google response to: define: and now for something completely different And Now For Something Completely Different is a film spinoff from the television comedy series Monty Python's Flying Circus. The title originated as a catchphrase in the TV show. Many Python fans feel that it excellently describes the nonsensical, non sequitur feel of the program. en.wikipedia.org/wiki/And_Now_For_Somethi ng_Completely_Different en.wikipedia.org/wiki/And_Now_For_Somethi ng_Completely_Different

And Now for Something Completely Different Google Search for: “And now for something completely different” Gives more than 100 results…. A perhaps interesting one:

And Now for Something Completely Different Google Search for: “Stat 31 and now for something completely different” Gives: [PPT] Slide 1File Format: Microsoft Powerpoint 97 - View as HTML... But what is missing? And now for something completely different… Review Ideas on State Lotteries,. from our study of Expected Value... UNCstat /Stat ppt - Similar pagesSlide 1View as HTMLSimilar pages

Prediction in Regression Idea: Given data Can find the Least Squares Fit Line, and do inference for the parameters. Given a new X value, say, what will the new Y value be?

Prediction in Regression Dealing with variation in prediction: Under the model: A sensible guess about, based on the given, is: (point on the fit line above )

Prediction in Regression What about variation about this guess? Natural Approach: present an interval (as done with Confidence Intervals) Careful: Two Notions of this: 1.Confidence Interval for mean of 2.Prediction Interval for value of

Prediction in Regression 1.Confidence Interval for mean of : Use: where: and where

Prediction in Regression Interpretation of: Smaller for closer to But never 0 Smaller for more spread out Larger for larger

Prediction in Regression 2.Prediction Interval for value of Use: where: And again

Prediction in Regression Interpretation of: Similar remarks to above … Additional “1 + ” accounts for added variation in compared to

Prediction in Regression Revisit Class Example 28, (now – 10.15) Old 10.8: Engineers made measurements of the Leaning Tower of Pisa over the years 1975 – “Lean” is the difference between a points position if the tower were straight, and its actual position, in tenths of a meter, in excess of 2.9 meters. The data are listed above…

Prediction in Regression Class Example 28, (now – 10.15) Old 10.9: (a)Plot the data, Does the trend in lean over time appear to be linear? (b)What is the equation of the least squares fit line? (c)Give a 95% confidence interval for the average rate of change of the lean.

Prediction in Regression HW: and add part: (f) Calculate a 95% Confidence Interval for the mean oxygen uptake of individuals having heart rate 96, and heart rate 115.

Additional Issues in Regression Robustness Outliers via Java Applet HW on outliers