Stor 155, Section 2, Last Time Inference for Regression –Least Square Fits –Sampling distrib’ns for slope and intercept Regression Tool –Gave many useful.

Slides:



Advertisements
Similar presentations
Objectives 10.1 Simple linear regression
Advertisements

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Copyright © 2010 Pearson Education, Inc. Slide
Stat 31, Section 1, Last Time Hypothesis Testing –Careful about 1-sided vs. 2-sided Connection: CIs - Hypo Tests 3 Traps of Hypo Testing –Statistically.
Objectives (BPS chapter 24)
Inference for Regression 1Section 13.3, Page 284.
Section 4.2 Fitting Curves and Surfaces by Least Squares.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Lecture 23: Tues., Dec. 2 Today: Thursday:
Stat 512 – Lecture 18 Multiple Regression (Ch. 11)
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
The Basics of Regression continued
Stat 217 – Day 26 Regression, cont.. Last Time – Two quantitative variables Graphical summary  Scatterplot: direction, form (linear?), strength Numerical.
Stat 217 – Week 10. Outline Exam 2 Lab 7 Questions on Chi-square, ANOVA, Regression  HW 7  Lab 8 Notes for Thursday’s lab Notes for final exam Notes.
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Need to know in order to do the normal dist problems How to calculate Z How to read a probability from the table, knowing Z **** how to convert table values.
Stor 155, Section 2, Last Time 2-way Tables –Sliced populations in 2 different ways –Look for independence of factors –Chi Square Hypothesis test Simpson’s.
Chapter 12 Section 1 Inference for Linear Regression.
Simple Linear Regression Least squares line Interpreting coefficients Prediction Cautions The formal model Section 2.6, 9.1, 9.2 Professor Kari Lock Morgan.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTIONS 9.3 Confidence and prediction intervals.
Inference for regression - Simple linear regression
Prediction concerning Y variable. Three different research questions What is the mean response, E(Y h ), for a given level, X h, of the predictor variable?
Stat 31, Section 1, Last Time T distribution –For unknown, replace with –Compute with TDIST & TINV (different!) Paired Samples –Similar to above, work.
September In Chapter 14: 14.1 Data 14.2 Scatterplots 14.3 Correlation 14.4 Regression.
Inference for Regression
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Stat 31, Section 1, Last Time 2-way tables –Testing for Independence –Chi-Square distance between data & model –Chi-Square Distribution –Gives P-values.
Ch4 Describing Relationships Between Variables. Pressure.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory variables.
Chapter 10 Correlation and Regression
Stor 155, Section 2, Last Time Review…. Stat 31 Final Exam: Date & Time: Tuesday, May 8, 8:00-11:00 Last Office Hours: Thursday, May 3, 12:00 - 5:00 Monday,
Stor 155, Section 2, Last Time Prediction in Regression –Given new point X 0, predict Y 0 –Confidence interval for mean –Prediction Interval for value.
Lesson Inference for Regression. Knowledge Objectives Identify the conditions necessary to do inference for regression. Explain what is meant by.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Stat 155, Section 2, Last Time Binomial Distribution –Normal Approximation –Continuity Correction –Proportions (different scale from “counts”) Distribution.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/6/12 Simple Linear Regression SECTIONS 9.1, 9.3 Inference for slope (9.1)
Statistics – OR 155, Section 2 J. S. Marron, Professor Department of Statistics and Operations Research.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/20/12 Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory.
Administrative Matters Midterm II Results Take max of two midterm scores:
Math 4030 Final Exam Review. Probability (Continuous) Definition of pdf (axioms, finding k) Cdf and probability (integration) Mean and variance (short-cut.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
BPS - 5th Ed. Chapter 231 Inference for Regression.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
CHAPTER 12 More About Regression
Regression and Correlation
Final Examination Thursday, April 30, 4:00 – 7:00
CHAPTER 3 Describing Relationships
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Stat 31, Section 1, Last Time Sampling Distributions
Stat 112 Notes 4 Today: Review of p-values for one-sided tests
Statistical Methods For Engineers
CHAPTER 29: Multiple Regression*
CHAPTER 26: Inference for Regression
Chapter 12 Review Inference for Regression
Basic Practice of Statistics - 3rd Edition Inference for Regression
CHAPTER 12 More About Regression
Inference for Regression
Algebra Review The equation of a straight line y = mx + b
Linear Regression and Correlation
Presentation transcript:

Stor 155, Section 2, Last Time Inference for Regression –Least Square Fits –Sampling distrib’ns for slope and intercept Regression Tool –Gave many useful answers (CIs, Hypo Tests, Graphics,…) –But had to “translate language”

Reading In Textbook Approximate Reading for Today’s Material: Pages Next Time: All review

Stat 31 Final Exam: Date & Time: Tuesday, May 8, 8:00-11:00 Last Office Hours: Thursday, May 3, 12:00 - 5:00 Monday, May 7, 10:00 - 5:00 & by appointment (earlier) Bring with you, to exam: Single (8.5" x 11") sheet of formulas Front & Back OK

Prediction in Regression Idea: Given data Can find the Least Squares Fit Line, and do inference for the parameters. Given a new X value, say, what will the new Y value be?

Prediction in Regression Dealing with variation in prediction: Under the model: A sensible guess about, based on the given, is: (point on the fit line above )

Prediction in Regression What about variation about this guess? Natural Approach: present an interval (as done with Confidence Intervals) Careful: Two Notions of this: 1.Confidence Interval for mean of 2.Prediction Interval for value of

Prediction in Regression 1.Confidence Interval for mean of : Use: where: and where

Prediction in Regression Interpretation of: Smaller for closer to But never 0 Smaller for more spread out Larger for larger

Prediction in Regression 2.Prediction Interval for value of Use: where: And again

Prediction in Regression Interpretation of: Similar remarks to above … Additional “1 + ” accounts for added variation in compared to

Prediction in Regression Revisit Class Example 33, Textbook Problem : Engineers made measurements of the Leaning Tower of Pisa over the years 1975 – “Lean” is the difference between a points position if the tower were straight, and its actual position, in tenths of a meter, in excess of 2.9 meters. The data are listed above…

Prediction in Regression ??? Next time: spruce up these examples a lot ???

Prediction in Regression Class Example 33, Textbook Problems – 10.25: (a)Plot the data, Does the trend in lean over time appear to be linear? (b)What is the equation of the least squares fit line? (c)Give a 95% confidence interval for the average rate of change of the lean.

Prediction in Regression HW: 10.7 b, c, d 10.8 ((c) [11610, 12660], [9554, 14720])

Prediction in Regression Revisit Class Example 33, Textbook – 10.25: Engineers made measurements of the Leaning Tower of Pisa over the years 1975 – “Lean” is the difference between a points position if the tower were straight, and its actual position, in tenths of a meter, in excess of 2.9 meters. The data are listed above…

Prediction in Regression Class Example 33, Problem 10.24: (a)In 1918 the lean was (the coded value is 71). Using the least squares equation for the years 1975 to 1987, calculate a predicted value for the lean in 1918 (b)Although the least squares line gives an excellent fit for 1975 – 1987, this did not extend back to Why?

Prediction in Regression Class Example 33, Problem 10.25: (a)How would you code the explanatory variable for the year 2002? (b)The engineers working on the tower were most interested in how much it would lean if no corrective action were taken. Use the least squares equation line to predict the lean in

Prediction in Regression Class Example 33, Problem 10.25: (c)To give a margin of error for the lean in 2005, would you use a confidence interval for the mean, or a prediction interval? Explain your choice.

Prediction in Regression Class Example 33, Problem (d)Give the values of the 95% confidence interval for the mean, and the 95% prediction interval. How do they compare? Recall generic formula (same for both):

Prediction in Regression Class Example 33, Problem Difference was in form for SE: CI for mean: PI for value:

Outliers in Regression Caution about regression: Outliers can have a major impact Single point can throw slope way off And intercept too Can watch for this, using plot And residual plot show this, too

Nonlinear Regression When lines don’t fit data: How do we know? What can we do? There is a lot… But beyond scope of this course Some indication…

Nonlinear Regression Class Example 34: World Population Main lessons: Data can be non-linear Identify with plot Residuals even more powerful at this Look for systematic structure

Nonlinear Regression Class Example 34: World Population When data are non-linear: There is non-linear regression But not covered here Can use lin. reg’n on transformed data Log transform often useful

Next time: Additional Issues in Regression Robustness Outliers via Java Applet HW on outliers

And Now for Something Completely Different Etymology of: “And now for something completely different” Anybody heard of this before? (really 2 questions…)

And Now for Something Completely Different What is “etymology”? Google responses to: define: etymology The history of words; the study of the history of words. csmp.ucop.edu/crlp/resources/glossary.html csmp.ucop.edu/crlp/resources/glossary.html The history of a word shown by tracing its development from another language.

And Now for Something Completely Different What is “etymology”? Etymology is derived from the Greek word e/)tymon(etymon) meaning "a sense" and logo/j(logos) meaning "word." Etymology is the study of the original meaning and development of a word tracing its meaning back as far as possible.

And Now for Something Completely Different Google response to: define: and now for something completely different And Now For Something Completely Different is a film spinoff from the television comedy series Monty Python's Flying Circus. The title originated as a catchphrase in the TV show. Many Python fans feel that it excellently describes the nonsensical, non sequitur feel of the program. en.wikipedia.org/wiki/And_Now_For_Somethi ng_Completely_Different en.wikipedia.org/wiki/And_Now_For_Somethi ng_Completely_Different

And Now for Something Completely Different Google Search for: “And now for something completely different” Gives more than 100 results…. A perhaps interesting one:

And Now for Something Completely Different Google Search for: “Stor 155 and now for something completely different” Gives: [PPT] Slide 1Slide 1 File Format: Microsoft Powerpoint - View as HTML or.unc.edu/webspace/postscript/marron/Teaching/st or /... And Now for Something Completely Different. P: Dead bugs on windshield.... stat- or.unc.edu/webspace/postscript/marron/Teaching/st or /Stor ppt - Similar pagesView as HTMLSimilar pages

Review Slippery Issues Major Confusion: Population Quantities Vs. Sample Quantities

Review Slippery Issues Population Mathematical Notation: (fixed & unknown) Sample Mathematical Notation : (summaries of data, have numbers)

Hypothesis Testing – Z scores E.g. Fast Food Menus: Test Using P-value = P{what saw or m.c.| H 0 & H A bd’ry} (guides where to put $21k & $20k)

Hypothesis Testing – Z scores P-value = P{what saw or or m.c.| H 0 & H A bd’ry}

Hypothesis Testing – Z scores P-value This is the Z-score Computation: Class E.g. 24, Part 6 Distribution: N(0,1)

Hypothesis Testing – Z scores P-value So instead of reporting tail probability, Report this cutoff instead, as “SDs away from mean $20,000”

Review for Final An Important Mode of Thinking: Ideas vs. “Cookbook”

How do you view your sheet of formulas? A set of recipes? “Look through list” to solve problems? Getting harder to find now? Problems: Too many decisions to make Hard to sort out while looking through…

Ideas vs. “Cookbook” Too many decisions to make, e.g.: Binomial vs. Normal 1-sided vs. 2-sided Hypo. Tests TDIST (INV) vs. NORMDIST (INV) CI vs. Sample Size calculation 1 sample vs. 2 sample Which is H 0, H A ? And what direction? What is “m.c.”? What is “Bdry”?

Ideas vs. “Cookbook” Suggested Approach: Use concepts to guide choice This is what I try to teach And is what I am testing for… How to learn? Go through old HW (random order) When stumped, look through notes (look for main Ideas, not “the right formula”)

A useful concept Perhaps not well taught? |a – b| - “Number of spaces between a and b on the numberline” e.g. Midterm II, problem 3c (x = 0, 1, 2, 3) {|X – 1| > 1} = {“number of spaces between X and 1 is more than 1”} = {X = 3}

A useful concept e.g. Midterm II, problem 3c (x = 0, 1, 2, 3) {|X – 1| > 1} = {“number of spaces between X and 1 is more than 1”} = {X = 3} Because:

Response to a Request You said at the end of today's class that you would be willing to take class time to "reteach" concepts that might still be unknown to us. Well, in my case, it seems that probability and probability distribution is a hard concept for me to grasp. On the first midterm, I missed … and on the second midterm, I missed … I seem to be able to grasp the other concepts involving binomial distribution, normal distribution, t- distribution, etc fairly well, but probability is really killing me on the exams. If you could reteach these or brush up on them I would greatly appreciate it.

A Flash from the Past Two HW “Traps” 1.Working together: Great, if the relationship is equal But don’t be the “yes, I get it” person… 2.The HW “Consortium”: You do HW 1, and I’ll do HW 2… Easy with electronic HW Trap: HW is about learning You don’t learn on your off weeks…