Naval Center for Cost Analysis (NCCA) Jake Mender and Ann Hawpe

Slides:



Advertisements
Similar presentations
Request Dispatching for Cheap Energy Prices in Cloud Data Centers
Advertisements

SpringerLink Training Kit
Luminosity measurements at Hadron Colliders
From Word Embeddings To Document Distances
Choosing a Dental Plan Student Name
Virtual Environments and Computer Graphics
Chương 1: CÁC PHƯƠNG THỨC GIAO DỊCH TRÊN THỊ TRƯỜNG THẾ GIỚI
THỰC TIỄN KINH DOANH TRONG CỘNG ĐỒNG KINH TẾ ASEAN –
D. Phát triển thương hiệu
NHỮNG VẤN ĐỀ NỔI BẬT CỦA NỀN KINH TẾ VIỆT NAM GIAI ĐOẠN
Điều trị chống huyết khối trong tai biến mạch máu não
BÖnh Parkinson PGS.TS.BS NGUYỄN TRỌNG HƯNG BỆNH VIỆN LÃO KHOA TRUNG ƯƠNG TRƯỜNG ĐẠI HỌC Y HÀ NỘI Bác Ninh 2013.
Nasal Cannula X particulate mask
Evolving Architecture for Beyond the Standard Model
HF NOISE FILTERS PERFORMANCE
Electronics for Pedestrians – Passive Components –
Parameterization of Tabulated BRDFs Ian Mallett (me), Cem Yuksel
L-Systems and Affine Transformations
CMSC423: Bioinformatic Algorithms, Databases and Tools
Some aspect concerning the LMDZ dynamical core and its use
Bayesian Confidence Limits and Intervals
实习总结 (Internship Summary)
Current State of Japanese Economy under Negative Interest Rate and Proposed Remedies Naoyuki Yoshino Dean Asian Development Bank Institute Professor Emeritus,
Front End Electronics for SOI Monolithic Pixel Sensor
Face Recognition Monday, February 1, 2016.
Solving Rubik's Cube By: Etai Nativ.
CS284 Paper Presentation Arpad Kovacs
انتقال حرارت 2 خانم خسرویار.
Summer Student Program First results
Theoretical Results on Neutrinos
HERMESでのHard Exclusive生成過程による 核子内クォーク全角運動量についての研究
Wavelet Coherence & Cross-Wavelet Transform
yaSpMV: Yet Another SpMV Framework on GPUs
Creating Synthetic Microdata for Higher Educational Use in Japan: Reproduction of Distribution Type based on the Descriptive Statistics Kiyomi Shirakawa.
MOCLA02 Design of a Compact L-­band Transverse Deflecting Cavity with Arbitrary Polarizations for the SACLA Injector Sep. 14th, 2015 H. Maesaka, T. Asaka,
Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,
Fuel cell development program for electric vehicle
Overview of TST-2 Experiment
Optomechanics with atoms
داده کاوی سئوالات نمونه
Inter-system biases estimation in multi-GNSS relative positioning with GPS and Galileo Cecile Deprez and Rene Warnant University of Liege, Belgium  
ლექცია 4 - ფული და ინფლაცია
10. predavanje Novac i financijski sustav
Wissenschaftliche Aussprache zur Dissertation
FLUORECENCE MICROSCOPY SUPERRESOLUTION BLINK MICROSCOPY ON THE BASIS OF ENGINEERED DARK STATES* *Christian Steinhauer, Carsten Forthmann, Jan Vogelsang,
Particle acceleration during the gamma-ray flares of the Crab Nebular
Interpretations of the Derivative Gottfried Wilhelm Leibniz
Advisor: Chiuyuan Chen Student: Shao-Chun Lin
Widow Rockfish Assessment
SiW-ECAL Beam Test 2015 Kick-Off meeting
On Robust Neighbor Discovery in Mobile Wireless Networks
Chapter 6 并发:死锁和饥饿 Operating Systems: Internals and Design Principles
You NEED your book!!! Frequency Distribution
Y V =0 a V =V0 x b b V =0 z
Fairness-oriented Scheduling Support for Multicore Systems
Climate-Energy-Policy Interaction
Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,
Ch48 Statistics by Chtan FYHSKulai
The ABCD matrix for parabolic reflectors and its application to astigmatism free four-mirror cavities.
Measure Twice and Cut Once: Robust Dynamic Voltage Scaling for FPGAs
Online Learning: An Introduction
Factor Based Index of Systemic Stress (FISS)
What is Chemistry? Chemistry is: the study of matter & the changes it undergoes Composition Structure Properties Energy changes.
THE BERRY PHASE OF A BOGOLIUBOV QUASIPARTICLE IN AN ABRIKOSOV VORTEX*
Quantum-classical transition in optical twin beams and experimental applications to quantum metrology Ivano Ruo-Berchera Frascati.
The Toroidal Sporadic Source: Understanding Temporal Variations
FW 3.4: More Circle Practice
ارائه یک روش حل مبتنی بر استراتژی های تکاملی گروه بندی برای حل مسئله بسته بندی اقلام در ظروف
Decision Procedures Christoph M. Wintersteiger 9/11/2017 3:14 PM
Limits on Anomalous WWγ and WWZ Couplings from DØ
Presentation transcript:

Naval Center for Cost Analysis (NCCA) Jake Mender and Ann Hawpe Modeling with Gumby: Pros and Cons of the Weibull Curve Jake Mender and Ann Hawpe 18 MAY 2017 1

Overview Motivation Parametric Phasing Curve Overview Overfitting Rayleigh Function Details Rayleigh vs. Weibull Problem with Rayleigh Solutions Take Aways Future Study

Motivation NCCA was asked to evaluate the use of the Weibull curve, instead of the commonly used Rayleigh curve, as a tool for early R&D project estimating Our review highlighted some potential pitfalls when using the Weibull for predictive purposes with small data sets Results are relevant to other modeling problems in the cost estimating domain

Parametric Phasing Curves Functions that give cost, hours, or effort as a function of time elapsed Allows for a forecast of effort and schedule to-go by fitting the chosen function to actuals to-date (by period) Useful when estimating cost or schedule independent of Program Manager projections

Weibull Distribution 𝑓(𝑡)=d*( 𝜅 𝜆 𝑡−𝜇 𝜆 (𝜅−1) 𝑒 − 𝑡−𝜇 𝜆 𝜅 ) x ≥μ; 𝜆, 𝜅 >0 (𝜅): shape (μ): location (usually set to 0) (𝜆): scale (d): cost scalar Three parameters result in a function that is extremely flexible (like Gumby) Pro: Can fit wide range of projects Con: 30-45 data points required to avoid overfitting Most popular solution is to use the Rayleigh function, a degenerate of the Weibull

Flexibility of the Weibull Curve 𝜆 = 1, k = 0.5 𝜆 = 1, k = 1.0 𝜆 = 1, k = 1.5 𝜆 = 1, k = 2.0 𝜆 = 1, k = 5.0 At k = 2; Weibull is equal to Rayleigh

Overfitting When a model is too complex for the data set An overfitted model will poorly predict performance outside training data set Rule of thumb: 10-15 data points per parameter Defense R&D projects rarely have 30-45 data points available Is a linear model (2 parameters) accurate with 3 or 4 data points? Weibull model (with 3 parameters) requires a lot of data (30-45)

Slopes and Y-intercepts change while R2 is similar Overfitting: Part 2 Slopes and Y-intercepts change while R2 is similar Estimates change greatly 10 points 20 points Applying rule: linear model requires 20-30 points before parameter estimates will stabilize around the population mean

Similar Slopes and Y-intercepts Overfitting: Part 3 Similar Slopes and Y-intercepts Estimates stable 20 points 30 points Weibull model (with 4 parameters) requires a lot of data (40-60)

Projections very different depending on amount of data used Overfitting: Part 4 Projections very different depending on amount of data used Predictions stabilize once sufficient data is available

Overfitting: Solutions Three ways to deal with this: Use Weibull degenerates (like the Rayleigh) with fewer parameters Reduce the amount of required data Estimate parameters from other features of the project (AKA, get more data) “Create” more data by considering other information Use a different method Find another method that isn’t as data hungry

Major Simplification = Mode assumed to fall at the 39th percentile The Rayleigh Curve Developed from Manpower Utilization model developed by P.V. Norden in the 1960s Pattern is approximated by Rayleigh Function If true, allows for total effort and duration to be estimated from the trajectory of early effort For our purposes this is conveyed by ACWP as reported in EVM CPRs Major Simplification = Mode assumed to fall at the 39th percentile Rayleigh is a degenerate (or restricted version) of the Weibull

Rayleigh Model Cost Parameter Scale Parameter Rayleigh shape or “profile” is assumed – parameter fits simply scale function to data Scale Parameter By assuming a fixed shape parameter, we are scaling the cost and duration of the fixed Rayleigh profile to find the combination of the two that best fits our data

Rayleigh vs. Weibull Demo to be done in Excel

The Problem Restricted models, like the Rayleigh, require a major assumption that projects have the Rayleigh profile Evidence is mixed on this assumption Rayleigh has predictive power because of the assumed profile Weibull lacks predictive power because it can take on a variety of forms An effective heuristic to determine which Weibull fits the problem at hand is needed

NCCA hasn’t done this yet, but other people have tried… Possible Solution Rayleigh method assumes a profile Instead of assuming a profile, why not use two-stage model to predict profile, and then predict cost and duration? Won’t work off existing CPR data Could work if additional independent information about the project allowed for classification NCCA hasn’t done this yet, but other people have tried…

Air Force Institute of Technology (AFIT) Weibull Model Attempted to address the overfitting problem via a two-stage model Step 1: Develop regression using program characteristics to predict best Shape and Scale parameters Step 2: Fit a Weibull model using estimated parameters and optimizing for budget parameter See: “Forecasting Research and Development Program Budgets Using the Weibull Model” Captain Thomas W. Brown USAF, March 2002, Air Force Institute of Technology

AFIT Model: Results Predict Scale and Shape as a function of service, system type, total cost, and duration Results: success predicting scale (and to a lesser extent Shape) – Forecasted Weibull models correlated well to project expenditures, but the associated duration and cost estimates were not evaluated for accuracy

NCCA Evaluation Criteria In DoD use, phasing curves are used to make specific predictions regarding three things: Profile: the period by period expenditures Duration: The total amount of time required Total Cost: The total amount required to complete the project Any proposed Phasing Model should be evaluated in all 3 dimensions before use AFIT model appears promising, but was not fully evaluated for this use case

Summary Parametric phasing models can be an effective tool for independent evaluation of R&D projects Estimators must be careful when using complex models early in a project Complex models require larger amounts of data Rayleigh model is well supported by a large body of research, but has known limitations The more complex Weibull shows promise, but requires further research regarding the overfitting problem Analysts must think critically regarding their use case and ensure the model selected is appropriate for the problem at hand

Future Study Evaluate AFIT model in the context of the early R&D forecasting use case Explore alternate predictors for shape, scale, and location parameters (beyond what was considered in the AFIT study)

Questions?