Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.

Slides:



Advertisements
Similar presentations
Regression and correlation methods
Advertisements

Lesson 10: Linear Regression and Correlation
Forecasting Using the Simple Linear Regression Model and Correlation
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Chapter 4 The Relation between Two Variables
Simple Linear Regression and Correlation
Objectives (BPS chapter 24)
Scatter Diagrams and Linear Correlation
Simple Linear Regression
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Statistics for Business and Economics
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Simple Linear Regression Analysis
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
SIMPLE LINEAR REGRESSION
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Pertemua 19 Regresi Linier
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Correlation and Regression Analysis
Chapter 7 Forecasting with Simple Regression
Simple Linear Regression Analysis
Relationships Among Variables
Lecture 5 Correlation and Regression
Correlation & Regression
Correlation and Linear Regression
Regression and Correlation Methods Judy Zhong Ph.D.
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 11 Simple Regression
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
Simple Linear Regression Models
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Chapter 6 & 7 Linear Regression & Correlation
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Introduction to Linear Regression
Correlation & Regression
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Lecture 10: Correlation and Regression Model.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Lesson 14 - R Chapter 14 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Chapter 12: Correlation and Linear Regression 1.
Stats Methods at IC Lecture 3: Regression.
The simple linear regression model and parameter estimation
Regression Analysis AGEC 784.
Topic 10 - Linear Regression
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Correlation and Regression
Correlation and Simple Linear Regression
Simple Linear Regression and Correlation
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Regression Analysis

Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000 to repair, on average.

You’ve got the Data… Now What? Unscheduled Maintenance Actions (UMAs)

What do you want to know? l How many UMAs will there be next month? l What is the average number of UMAs ?

Sample Mean

Sample Standard Deviation

UMA Sample Statistics

UMAs Next Month 95% Confidence Interval

Average UMAs 95% Confidence Interval

Model: Cost of UMAs for one squadron If the cost per UMA = $1000, the Expected cost for one squadron = $60,000

Model: Total Cost of UMAs Expected Cost for all squadrons = 60 * $1000 * 36 = $2,160,000

Model: Total Cost of UMAs Expected Cost for all squadrons = 60 * $1000 * 36 = $2,160,000 How confident are we about this estimate?

~ 95% mean (=60) standard error =12/  36 = 2

~ 95% ~56 ~58 60 ~62 ~64 (1 standard unit = 2)

95% Confidence Interval on our estimate of UMAs and costs l (2) = [56, 64] l low cost: 56 * $1000 * 36 = $2,016,000 l high cost: 64 * $1000 * 36 = $2,304,000

What do you want to know? l How many UMAs will there be next month? l What is the average number of UMAs ? l Is there a relationship between UMAs and and some other variable that may be used to predict UMAs? l What is that relationship?

Relationships l What might be related to UMAs? n Pilot Experience ? n Flight hours ? n Sorties flown ? n Mean time to failure (for specific parts) ? n Number of landings / takeoffs ?

Regression: l To estimate the expected or mean value of UMAs for next month: n look for a linear relationship between UMAs and a “predictive” variable n If a linear relationship exists, use regression analysis

Regression analysis: describes and evaluates relationships between one variable (dependent or explained variable), and one or more other variables (called the independent or explanatory variables).

What is a good estimating variable for UMAs? l quantifiable l predictable l logical relationship with dependent variable l must be a linear relationship: Y = a + bX

Sorties

Pilot Experience

Sample Statistics

Describing the Relationship l Is there a relationship? n Do the two variables (UMAs and sorties or experience) move together? n Do they move in the same direction or in opposite directions? l How strong is the relationship? n How closely do they move together?

Positive Relationship

Strong Positive Relationship

Negative Relationship

Strong Negative Relationship

No Relationship

Relationship?

Correlation Coefficient l Statistical measure of how closely two variables are moving together in a coordinated fashion n Measures strength and direction l Value ranges from -1.0 to +1.0 n +1.0 indicates “perfect” positive linear relation n -1.0 indicates “perfect” negative linear relation n 0 indicates no relation between the two variables

Correlation Coefficient

Sorties vs. UMAs r =.9788

Experience vs. UMAs r =.1896

Correlation Matrix

A Word of Caution... l Correlation does NOT imply causation n It simply measures the coordinated movement of two variables l Variation in two variables may be due to a third common variable l The observed relationship may be due to chance alone

What is the Relationship? l In order to use the correlation information to help describe the relationship between two variables we need a model l The simplest one is a linear model:

Fitting a Line to the Data

One Possibility Sum of errors = 0

Another Possibility Sum of errors = 0

Which is Better? l Both have sum of errors = 0 l Compare sum of absolute errors:

Fitting a Line to the Data

One Possibility Sum of absolute errors = 6

Another Possibility Sum of absolute errors = 6

Which is Better? l Sum of the absolute errors are equal l Compare sum of errors squared:

X Y The Correct Relationship: Y = a + bX + U systematic random

X Y The correct relationship: Y = a + bX + U systematic random

Least-Squares Method l Penalizes large absolute errors l Y- intercept: l Slope:

Assumptions l Linear relationship: l Errors are random and normally distributed with mean = 0 and variance = n Supported by Central Limit Theorem

Least Squares Regression for Sorties and UMAs

Regression Calculations

Sorties vs. UMAs

Regression Calculations: Confidence in the predictions

Confidence Interval for Estimate

95% Confidence Interval for the model (b) X Y

Testing Model Parameters l How well does the model explain the variation in the dependent variable? l Does the independent variable really seem to matter? l Is the intercept constant statistically significant?

Variation

Coefficient of Determination l Values between 0 and 1 l R 2 = 1 when all data on line (r=1) l R 2 = 0 when no correlation (r=0)

Regression Calculations: How well does the model explain the variation?

Does the Independent Variable Matter? l If sorties do not help predict UMAs we expect b = 0 l If b is not 0, is it statistically significant?

Regression Calculations: Does the Independent Variable Matter?

95% Confidence Interval for the slope (a) Mean of Y Mean of XX Y

Confidence Interval for Slope

Is the Intercept Statistically Significant?

Confidence Interval for Y-intercept

Basic Steps of Regression Analysis l Formulate the model l Plot scatter diagram for visual inspection l Compute correlation coefficient l Fit the regression line l Test the model

Factors affecting estimation accuracy l Sample size (larger is better) l Range of X values (wider is better) l Standard deviation of U (smaller is better)

Uses and Limitations of Regression Analysis l Identifying relationships n Not necessarily cause n May be due to chance only l Forecasting future outcomes n Only valid over the range of the data n Past may not be good predictor of future

Common pitfalls in regression l Failure to draw scatter diagrams l Omitting important variables from the model l The “two point” phenomenon l Unfounded claims of model sophistication l Insufficient attention to interval estimates and predictions l Predicting too far outside of known range

Lines can be deceiving... R 2 =.6662

Nonlinear Relationship

Best fit?

Misleading data

Summary l Regression Analysis is a useful tool n Helps quantify relationships l But be careful n Does not imply cause and effect n Don’t go outside range of data n Check linearity assumptions n Use common sense!

Non-linear relationship between output and cost