1 The Output Analyzer Separate application, also accessible via Tools menu in Arena Reads binary files saved by Arena Various kinds of output-data display,

Slides:



Advertisements
Similar presentations
Materials for Lecture 11 Chapters 3 and 6 Chapter 16 Section 4.0 and 5.0 Lecture 11 Pseudo Random LHC.xls Lecture 11 Validation Tests.xls Next 4 slides.
Advertisements

Outline input analysis input analyzer of ARENA parameter estimation
Statistics review of basic probability and statistics.
Eastern Mediterranean University Department of Industrial Engineering IENG461 Modeling and Simulation Systems Computer Lab 2 nd session ARENA (Input Analysis)
Simulation with ArenaAppendix C – A Refresher on Probability and StatisticsSlide 1 of 33 A Refresher on Probability and Statistics Appendix C.
Hydrologic Statistics
Simulation with ArenaChapter 4 – Modeling Basic Operations and InputsSlide 1 of 58 Schedules Vary Capacity (number of units) of a resource over time In.
Chapter 8 Random-Variate Generation
Chapter 8 Random-Variate Generation Banks, Carson, Nelson & Nicol Discrete-Event System Simulation.
Random variable Distribution. 200 trials where I flipped the coin 50 times and counted heads no_of_heads in a trial.
Probability Distributions and Stochastic Budgeting AEC 851 – Agribusiness Operations Management Spring, 2006.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Random-Variate Generation. Need for Random-Variates We, usually, model uncertainty and unpredictability with statistical distributions Thereby, in order.
A Review of Probability and Statistics
Simulation Modeling and Analysis
CS 351/ IT 351 Modelling and Simulation Technologies Random Variates Dr. Jim Holten.
Analysis of Simulation Input.. Simulation Machine n Simulation can be considered as an Engine with input and output as follows: Simulation Engine Input.
- 1 - Summary of P-box Probability bound analysis (PBA) PBA can be implemented by nested Monte Carlo simulation. –Generate CDF for different instances.
Inferences About Process Quality
Properties of Random Numbers
Lecture 6 Data Collection and Parameter Estimation.
Monté Carlo Simulation MGS 3100 – Chapter 9. Simulation Defined A computer-based model used to run experiments on a real system.  Typically done on a.
Hydrologic Statistics
SIMULATION MODELING AND ANALYSIS WITH ARENA
Chapter 5 Modeling & Analyzing Inputs
Graduate Program in Engineering and Technology Management
 1  Outline  stages and topics in simulation  generation of random variates.
On Model Validation Techniques Alex Karagrigoriou University of Cyprus "Quality - Theory and Practice”, ORT Braude College of Engineering, Karmiel, May.
Chapter 4 – Modeling Basic Operations and Inputs  Structural modeling: what we’ve done so far ◦ Logical aspects – entities, resources, paths, etc. 
1 CS 475/575 Slide Set 6 M. Overstreet Spring 2005.
Random Variables & Probability Distributions Outcomes of experiments are, in part, random E.g. Let X 7 be the gender of the 7 th randomly selected student.
Modeling and Simulation CS 313
Modeling and Simulation Input Modeling and Goodness-of-fit tests
Traffic Modeling.
ESD.70J Engineering Economy Module - Session 21 ESD.70J Engineering Economy Module Fall 2005 Session One Alex Fadeev - Link for this PPT:
2 Input models provide the driving force for a simulation model. The quality of the output is no better than the quality of inputs. We will discuss the.
1 Statistical Distribution Fitting Dr. Jason Merrick.
CS433: Modeling and Simulation Dr. Anis Koubâa Al-Imam Mohammad bin Saud University 15 October 2010 Lecture 05: Statistical Analysis Tools.
Week 10 Nov 3-7 Two Mini-Lectures QMM 510 Fall 2014.
IE 429, Parisay, January 2010 What you need to know from Probability and Statistics: Experiment outcome: constant, random variable Random variable: discrete,
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Chapter 9 Input Modeling Banks, Carson, Nelson & Nicol Discrete-Event System Simulation.
ETM 607 – Input Modeling General Idea of Input Modeling Data Collection Identifying Distributions Parameter estimation Goodness of Fit tests Selecting.
ESD.70J Engineering Economy Module - Session 21 ESD.70J Engineering Economy Fall 2006 Session Two Alex Fadeev - Link for this PPT:
Goodness-of-Fit Chi-Square Test: 1- Select intervals, k=number of intervals 2- Count number of observations in each interval O i 3- Guess the fitted distribution.
ESD.70J Engineering Economy Module - Session 21 ESD.70J Engineering Economy Fall 2009 Session Two Michel-Alexandre Cardin – Prof. Richard.
Selecting Input Probability Distribution. Simulation Machine Simulation can be considered as an Engine with input and output as follows: Simulation Engine.
5-1 ANSYS, Inc. Proprietary © 2009 ANSYS, Inc. All rights reserved. May 28, 2009 Inventory # Chapter 5 Six Sigma.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Learning Simio Chapter 10 Analyzing Input Data
CY1B2 Statistics1 (ii) Poisson distribution The Poisson distribution resembles the binomial distribution if the probability of an accident is very small.
Testing Random-Number Generators Andy Wang CIS Computer Systems Performance Analysis.
Chapter 6 Chapter 16 Sections , 4.0, Lecture 16 GRKS.XLSX Lecture 16 Low Prob Extremes.XLSX Lecture 16 Uncertain Emp Dist.XLSX Lecture 16 Combined.
Probability and Distributions. Deterministic vs. Random Processes In deterministic processes, the outcome can be predicted exactly in advance Eg. Force.
ESD.70J Engineering Economy Module - Session 21 ESD.70J Engineering Economy Fall 2010 Session Two Xin Zhang – Prof. Richard de Neufville.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
2 Input models provide the driving force for a simulation model. The quality of the output is no better than the quality of inputs. In this chapter, we.
Choosing A Distribution Risk Analysis for Water Resources Planning and Management Institute for Water Resources May 2008.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Selecting Input Probability Distributions. 2 Introduction Part of modeling—what input probability distributions to use as input to simulation for: –Interarrival.
Modeling and Simulation CS 313
BAE 5333 Applied Water Resources Statistics
Modeling and Simulation CS 313
Materials for Lecture 18 Chapters 3 and 6
Modeling Basic Operations and Inputs
Using the Input Analyzer
Discrete Event Simulation - 4
Modelling Input Data Chapter5.
Chapter 8 Random-Variate Generation
Chapter 4 – Modeling Basic Operations and Inputs
Presentation transcript:

1 The Output Analyzer Separate application, also accessible via Tools menu in Arena Reads binary files saved by Arena Various kinds of output-data display, analysis –For now: just data-display functions Source: Systems Modeling Co.

2 The Output Analyzer (cont’d.) Plot time-persistent data –Graph/Plot or –Can overlay several curves (Sensible? Units?) –Options for plot Title, axis Labels, crop axes Moving-average plots : “smooth” over time –Moving-average window Value –Exponential smoothing, Forecasting Barcharts : like Plot, cosmetically different Histograms of data –Beware: autocorrelation Source: Systems Modeling Co.

3 Deterministic vs. Random Inputs Deterministic: nonrandom, fixed values –Number of units of a resource –Entity transfer time (?) –Interarrival, processing times (?) Random (a.k.a. stochastic): model as a distribution, “draw” or “generate” values from to drive simulation –Transfer, Interarrival, Processing times –What distribution? What distributional parameters? –Causes simulation output to be random, too Don’t just assume randomness away — validity Source: Systems Modeling Co.

4 Collecting Data Generally hard, expensive, frustrating, boring –System might not exist –Data available on the wrong things — might have to change model according to what’s available –Incomplete, “dirty” data –Too much data (!) Sensitivity of outputs to uncertainty in inputs Match model detail to quality of data Cost — should be budgeted in project Capture variability in data — model validity Garbage In, Garbage Out (GIGO) Source: Systems Modeling Co.

5 Using Data: Alternatives and Issues Use data “directly” in simulation –Read actual observed values to drive the model inputs (interarrivals, service times, part types, …) –All values will be “legal” and realistic –But can never go outside your observed data –May not have enough data for long or many runs –Computationally slow (reading disk files) Or, fit probability distribution to data –“Draw” or “generate” synthetic observations from this distribution to drive the model inputs –We’ve done it this way so far –Can go beyond observed data (good and bad) –May not get a good “fit” to data — validity? Source: Systems Modeling Co.

6 Fitting Distributions via the Arena Input Analyzer Assume: –Have sample data: Independent and Identically Distributed (IID) list of observed values from the actual physical system –Want to select or fit a probability distribution for use in generating inputs for the simulation model Arena Input Analyzer –Separate application, also accessible via Tools menu in Arena –Fits distributions, gives valid Arena expression for generation to paste directly into simulation model Source: Systems Modeling Co.

7 Fitting Distributions via the Arena Input Analyzer (cont’d.) Fitting = deciding on distribution form (exponential, gamma, empirical, etc.) and estimating its parameters –Several different methods (Maximum likelihood, moment matching, least squares, …) –Assess goodness of fit via hypothesis tests H 0 : fitted distribution adequately represents the data Get p value for test (small = poor fit) Fitted “theoretical” vs. empirical distribution Continuous vs. discrete data, distribution “Best” fit from among several distributions Source: Systems Modeling Co.

8 Data Files for the Input Analyzer Create the data file (editor, word processor, spreadsheet,...) –Must be plain ASCII text (save as text or export) –Data values separated by white space (blanks, tabs, linefeeds) –Otherwise free format Open data file from within Input Analyzer –File/New menu or –File/Data File/Use Existing … menu or –Get histogram, basic summary of data –To see data file: Window/Input Data menu Can generate “fake” data file to play around –File/Data File/Generate New … menu Source: Systems Modeling Co.

9 The Fit Menu Fits distributions, does goodness-of-fit tests Fit a specific distribution form –Plots density over histogram for visual “test” –Gives exact expression to Copy and Paste (Ctrl+C, Ctrl+V) over into simulation model –May include “offset” depending on distribution –Gives results of goodness-of-fit tests Chi square, Kolmogorov-Smirnov tests Most important part: p-value, always between 0 and 1: Probability of getting a data set that’s more inconsistent with the fitted distribution than the data set you actually have, if the the fitted distribution is truly “the truth” “Small” p (< 0.05 or so): poor fit (try again or give up) Source: Systems Modeling Co.

10 The Fit Menu (cont’d.) Fit all Arena’s (theoretical) distributions at once –Fit/Fit All menu or –Returns the minimum square-error distribution Square error = sum of squared discrepancies between histogram frequencies and fitted- distribution frequencies Can depend on histogram intervals chosen: different intervals can lead to different “best” distribution –Could still be a poor fit, though (check p value) –To see all distributions, ranked: Window/Fit All Summary or Source: Systems Modeling Co.

11 Each distribution for random variable has: Definition Parameters Density or Mass function f(x) = P(X =x) Cumulative function f(x) = P(X =< x) Mean Variance IE 429, Jan 99

12 Goodness-of-fit Test, Chi-Square Test i = histogram cell i = 1, …, N n = number of observations O i = real number of observations in cell i P i = theoretical number of observations in cell i = F(a i ) - F(a i-1 ) a i = upper bound of cell i a i-1 = lower bound of cell i F(x) = cumulative density function = cdf If Then cdf is a good fit k = number of parameters of distribution IE 429, Jan 99

13 Some Issues in Fitting Input Distributions Not an exact science — no “right” answer Consider theoretical vs. empirical Consider range of distribution –Infinite both ways (e.g., normal) –Positive (e.g., exponential, gamma) –Bounded (e.g., beta, uniform) Consider ease of parameter manipulation to affect means, variances Simulation model sensitivity analysis Outliers, multimodal data –Maybe split data set (see textbook for details) Source: Systems Modeling Co.

14 No Data? Happens more often than you’d like No good solution; some (bad) options: –Interview “experts” Min, Max: Uniform Avg., % error or absolute error: Uniform Min, Mode, Max: Triangular –Mode can be different from Mean — allows asymmetry –Interarrivals — independent, stationary Exponential— still need some value for mean –Number of “random” events in an interval: Poisson –Sum of independent “pieces”: normal –Product of independent “pieces”: lognormal Source: Systems Modeling Co.

15 Multivariate and Correlated Input Data Usually we assume that all generated random observations across a simulation are independent (though from possibly different distributions) Sometimes this isn’t true: –A “difficult” part requires long processing in both the Prep and Sealer operations –This is positive correlation Ignoring such relations can invalidate model See textbook for ideas, references Source: Systems Modeling Co.