SE-280 Dr. Mark L. Hornick 1 Prediction Intervals.

Slides:



Advertisements
Similar presentations
Chapter 9: Simple Regression Continued
Advertisements

This material is approved for public release. Distribution is limited by the Software Engineering Institute to attendees. Sponsored by the U.S. Department.
Confidence Intervals Underlying model: Unknown parameter We know how to calculate point estimates E.g. regression analysis But different data would change.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Chapter 12 Simple Regression
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Chapter 8 Estimation: Single Population
Lecture 23 Multiple Regression (Sections )
Fall 2006 – Fundamentals of Business Statistics 1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 7 Estimating Population Values.
Inferences About Process Quality
Chapter 7 Estimating Population Values
Statistics for Managers Using Microsoft® Excel 7th Edition
Chapter 12 Section 1 Inference for Linear Regression.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Statistics for Managers Using Microsoft® Excel 7th Edition
Lecture 5 Correlation and Regression
Standard Error of the Mean
Inference for regression - Simple linear regression
Linear Regression Inference
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
SE-280 Dr. Mark L. Hornick Numerical Integration.
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
CORRELATION & REGRESSION
Correlation and Regression
SE-280 Dr. Mark L. Hornick 1 Statistics Review Linear Regression & Correlation.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Confidence Interval Estimation
Chap 8-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Business Statistics: A First Course.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Disciplined Software Engineering Lecture #4 Software Engineering Institute Carnegie Mellon University Pittsburgh, PA Sponsored by the U.S. Department.
Disciplined Software Engineering Lecture #6 Software Engineering Institute Carnegie Mellon University Pittsburgh, PA Sponsored by the U.S. Department.
Inferential Statistics 2 Maarten Buis January 11, 2006.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Correlation and Prediction Error The amount of prediction error is associated with the strength of the correlation between X and Y.
Brian Macpherson Ph.D, Professor of Statistics, University of Manitoba Tom Bingham Statistician, The Boeing Company.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Copyright © 1994 Carnegie Mellon University Disciplined Software Engineering - Lecture 1 1 Disciplined Software Engineering Lecture #4 Software Engineering.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
10B11PD311 Economics REGRESSION ANALYSIS. 10B11PD311 Economics Regression Techniques and Demand Estimation Some important questions before a firm are.
This material is approved for public release. Distribution is limited by the Software Engineering Institute to attendees. Sponsored by the U.S. Department.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
SE-280 Dr. Mark L. Hornick Multiple Regression (Cycle 4)
Diagnostics – Part II Using statistical tests to check to see if the assumptions we made about the model are realistic.
Linear correlation and linear regression + summary of tests Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
Chapter 11: Estimation of Population Means. We’ll examine two types of estimates: point estimates and interval estimates.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
Chapter 8 Confidence Interval Estimation Statistics For Managers 5 th Edition.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Sampling Distributions and Estimation
Estimating with PROBE II
Confidence Interval Estimation
Correlation and Regression
Correlation and Regression
Confidence and Prediction Intervals
Presentation transcript:

SE-280 Dr. Mark L. Hornick 1 Prediction Intervals

SE-280 Dr. Mark L. Hornick 2 Review: PROBE’s regression calculations give us estimates (projections) for size (A+M LOC)…

SE-280 Dr. Mark L. Hornick 3 …and time

SE-280 Dr. Mark L. Hornick 4 But just how good are these estimates??? Off by 5%, 10%, 50%, 100%, 500%? Does it matter? Do you want to bet: Your weekends? Your reputation? Your JOB?

SE-280 Dr. Mark L. Hornick 5 Which of the following regression projections would you trust more?

SE-280 Dr. Mark L. Hornick 6 Example A 10 data points Correlation = Estimated Object LOC Actual Total LOC

SE-280 Dr. Mark L. Hornick 7 Example B 25 data points Correlation = Estimated Object LOC Actual Total LOC

A Prediction Interval calculation computes the bounds on the likely error of an estimate Range UPI = estimated A+M LOC + Range LPI = estimated A+M LOC - Range UPI LPI Projection (Estimate) Strictly speaking, the UPI/LPI "lines" are parabolas, and Range varies. Range

SE-280 Dr. Mark L. Hornick 9 If you had this kind of information about your estimates, how would you use it? Suppose your time projection said that a project would take 8 weeks. But, your prediction interval has a range of 3 weeks How should you make your plan? What should you tell management?

SE-280 Dr. Mark L. Hornick 10 If you had this kind of information about your estimates, how would you use it? Suppose your time projection said that a project would take 8 weeks. But, your prediction interval has a range of 3 weeks How should you make your plan? What should you tell management? 3 3

SE-280 Dr. Mark L. Hornick 11 If you had this kind of information about your estimates, how would you use it? Suppose your time projection said that a project would take 8 weeks How should you make your plan? What should you tell management? What if the range was 6 weeks? 6 6

SE-280 Dr. Mark L. Hornick 12 The prediction interval is based on the t distribution. 70% limits (area) Regression- projected value Range Lower prediction interval limit (LPI) Upper prediction interval limit (UPI)

SE-280 Dr. Mark L. Hornick 13 Prediction Interval Usage Range within which data is likely to fall Assuming variation is this estimate is similar to that in prior estimates PSP uses 70% and 90% limits Computes range in which actual value will likely fall 70% of the time 90% of the time Helps to assess planning quality

To get the prediction interval, we must calculate the range: Text, page 128; may have error in formula (n instead of d), depending on textbook revision. Note: this is for one independent variable.

For multiple regression, the range calculation is just extended a little.

The  ("sigma") value is computed in the following way. x i,j = previous independent variable values y i = previous dependent variable (estimate) values n= number of previous estimates m= number of independent variables d= n-(m+1) [degrees of freedom]  j = regression coefficients calculated from previous data Same for one independent variable. Alternate form:

The range formula requires us to find the integration limit that yields the correct integral value. For a 70% interval, we want p = 0.70 Question: what integration limit “t” gives this value? -tt Two-sided integral value = p We have to search (try t values) in order to find out. For a 90% interval, we want p = 0.90

When thinking about searching for the desired integral value, it may be helpful to plot the integral of the t-distribution function. Hint: create a "function object" that calculates the two-sided p integral, given a specified t value.

SE-280 Dr. Mark L. Hornick 19 The calculation needed is the reverse of that used in the significance calculation, since we are seeking "t" instead of "p". t p For a specified "p" (integral) value, we want to find the corresponding "t" (integration limit). How should we do the search?

In the significance calculation, we calculated "p" for a given "t"; now we are seeking "t" that will give us a desired "p". t p For a specified "p" (2-sided integral) value, we want to find the corresponding "t" (integration limit). How should we do the search?

The textbook's suggests a state-machine approach that requires the function to be monotonic (pg. 246). The sign of the error (desired versus actual "p") tells you whether to increase or decrease the trial "t" value to get closer to the desired answer. p t The increase/decrease step size is halved when changing search direction.

An alternative search method brackets the answer and bisects the interval.

How does the interval bisection method work? t p error (+) error (-)