The AutoSimOA Project Katy Hoad, Stewart Robinson, Ruth Davies Warwick Business School OR49 Sept 07 A 3 year, EPSRC funded project in collaboration with.

Slides:



Advertisements
Similar presentations
Automating the Analysis of Simulation Output Data Katy Hoad Stewart Robinson, Ruth Davies, Mark Elder Funded by EPSRC and SIMUL8.
Advertisements

Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Chapter 11- Confidence Intervals for Univariate Data Math 22 Introductory Statistics.
1 Statistical Inference H Plan: –Discuss statistical methods in simulations –Define concepts and terminology –Traditional approaches: u Hypothesis testing.
Research Methods in MIS: Sampling Design
Automating estimation of warm-up length Katy Hoad, Stewart Robinson, Ruth Davies Warwick Business School WSC08 The AutoSimOA Project A 3 year, EPSRC funded.
Chapter 12 Simple Regression
The AutoSimOA Project Katy Hoad, Stewart Robinson, Ruth Davies Warwick Business School WSC 07 A 3 year, EPSRC funded project in collaboration with SIMUL8.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Automating estimation of warm-up length Katy Hoad, Stewart Robinson, Ruth Davies Warwick Business School Simulation Workshop - April 2008 The AutoSimOA.
Simulation Modeling and Analysis Session 12 Comparing Alternative System Designs.
Automated Analysis of Simulation Output Data and the AutoSimOA Project
Automating the Analysis of Simulation Output Data Katy Hoad, Stewart Robinson, Ruth Davies SSIG Meeting, 24th October 2007
Simulation.
Evaluating Hypotheses
Classification of Discrete Event Simulation Models and Output Data: Creating a Sufficient Model Set. Katy Hoad Stewart Robinson,
Automating the Analysis of Simulation Output Data Stewart Robinson, Katy Hoad, Ruth Davies OR48, September 2006.
Automating the Analysis of Simulation Output Data Katy Hoad Stewart Robinson, Ruth Davies, Mark Elder Funded by EPSRC and SIMUL8.
Chapter 8 Estimation: Single Population
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 7 Sampling.
1.  Why understanding probability is important?  What is normal curve  How to compute and interpret z scores. 2.
Inferences About Process Quality
Automating the Analysis of Simulation Output Data Stewart Robinson, Katy Hoad, Ruth Davies ORGS Meeting, 4th October 2007
1 Simulation Modeling and Analysis Output Analysis.
191 Drawing Statistical Inference from Simulation Runs......the "fun" stuff!
Automating The Selection of a Simulation Warm-up Period Stewart Robinson, Katy Hoad, Ruth Davies Warwick Business School Cardiff University - October 2008.
Determining the Size of
AutoSimOA : A Framework for Automated Analysis of Simulation Output Stewart Robinson Katy Hoad, Ruth Davies Funded by.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Introduction to Statistical Inferences
Determining Sample Size
Verification & Validation
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Confidence Interval Estimation
Chap 8-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Business Statistics: A First Course.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
PARAMETRIC STATISTICAL INFERENCE
MS 305 Recitation 11 Output Analysis I
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Brian Macpherson Ph.D, Professor of Statistics, University of Manitoba Tom Bingham Statistician, The Boeing Company.
Lecture 2 Forestry 3218 Lecture 2 Statistical Methods Avery and Burkhart, Chapter 2 Forest Mensuration II Avery and Burkhart, Chapter 2.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Statistical Inference Statistical Inference is the process of making judgments about a population based on properties of the sample Statistical Inference.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Chapter 8: Confidence Intervals based on a Single Sample
1 OUTPUT ANALYSIS FOR SIMULATIONS. 2 Introduction Analysis of One System Terminating vs. Steady-State Simulations Analysis of Terminating Simulations.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 15: Sample size and Power Marshall University Genomics.
Stats Methods at IC Lecture 3: Regression.
Slides by JOHN LOUCKS St. Edward’s University.
ESTIMATION.
Confidence Interval Estimation
Sampling Distributions and Estimation
CPSC 531: System Modeling and Simulation
Statistical Methods Carey Williamson Department of Computer Science
Discrete Event Simulation - 4
Confidence Interval Estimation
Carey Williamson Department of Computer Science University of Calgary
Objectives 6.1 Estimating with confidence Statistical confidence
Objectives 6.1 Estimating with confidence Statistical confidence
Presentation transcript:

The AutoSimOA Project Katy Hoad, Stewart Robinson, Ruth Davies Warwick Business School OR49 Sept 07 A 3 year, EPSRC funded project in collaboration with SIMUL8 Corporation.

OUTLINE Introduction Methods Algorithm Test Methodology Test Results Extended Algorithm & Results Discussion Summary

Objective To provide an easy to use method, that can be incorporated into existing simulation software, that enables practitioners to obtain results of a specified accuracy from their discrete event simulation model. (Only looking at analysis of a single scenario)

Introduction Initial Setup:  Any warm-up problems already dealt with.  Run length (m) decided upon.  Modeller decided to use multiple replications to obtain better estimate of mean performance. Multiple replications performed by changing the random number streams used by the model and re-running the simulation. Output data from model Response measure of interest = summary statistic from rep1 = summary statistic from repN N replications

QUESTION IS… How many replications are needed? Limiting factors: computing time and expense. If performing N replications achieves a sufficient estimate of mean performance: > N replications: Unnecessary use of computer time and money. < N replications: Inaccurate results → incorrect decisions.

4 main methods found in the literature for choosing N: 1. Rule of Thumb Run at least 3 to 5 replications. Advantage: Very simple. Disadvantage: Does not use characteristics of model output. No measured precision level.

2. Simple Graphical Method Plot Cumulative mean -v- number of replications Visually select point where cumulative mean line becomes “flat”. Use this as N. Advantages: Simple Uses output of interest in decision. Disadvantages: Subjective No measured precision level.

3. Confidence Interval (with Specified Precision) Method User decides size of error they can tolerate. Run increasing numbers of replications, Construct Confidence Intervals around sequential cumulative mean of output variable until desired precision achieved. Advantages: Relies upon statistical inference to determine number of replications required. Allows the user to tailor accuracy of output results to their particular requirement or purpose for that model and result. Disadvantage: Many simulation users do not have the skills to apply such an approach.

4. Prediction Formula Method User decides size of error they can tolerate. Run a few replications, estimate variance & mean Use formula to predict N. Check desired precision achieved – if not amend N and repeat Advantages: Simple. Uses data from model. Provides specified precision. Disadvantage: Can be very inaccurate especially for small number of replications. If variance estimate low underestimate N If variance estimate high overestimate N

Chose to automate: Confidence Interval (with Specified Precision) Method

The replication algorithm interacts with the simulation model sequentially.

is the student t value for n-1 df and a significance of 1-α, s n is the estimate of the standard deviation, calculated using results X i (i = 1 to n) of the n current replications. Where n is the current number of replications carried out, We define the precision, d n, as the ½ width of the Confidence Interval expressed as a percentage of the cumulative mean: is the cumulative mean, ALGORITHM DEFINITIONS

Stopping Criteria Simplest method: Stop when d n 1st found to be ≤ desired precision, d required, and recommend that number of replications, Nsol, to the user. Problem: Data series could prematurely converge, by chance, to incorrect estimate of the mean, with precision d required, then diverge again. ‘Look-ahead’ procedure: When d n 1st found to be ≤ d required, algorithm performs set number of extra replications, to check that precision remains ≤ d required.

‘Look-ahead’ procedure kLimit = ‘look ahead’ value. Actual number of replications checked ahead is a function of this user defined value: Function relates ‘look ahead’ period length with current value of n.

Nsol Nsol + f(kLimit) f(kLimit) Precision ≤ 5% 95% confidence limits Cumulative mean, Replication Algorithm

Precision≤ 5% Precision> 5% Precision ≤ 5% f(kLimit) Nsol Nsol + f(kLimit) Nsol

24 artificial data sets created: Left skewed, symmetric, right skewed; Varying values of relative standard deviation (stdev/mean). Advantage: true mean and variance known. Artificial data set: 100 sequences of 2000 data values. 8 real models selected. Different lengths of ‘look ahead’ period looked at: kLimit values = 0 (i.e. no ‘look ahead’ period), 5, 10, 25. d required value kept constant at 5%. TESTING METHODOLOGY

5 performance measures 1.Coverage of the true mean 2.Bias 3.Absolute Bias 4.Average Nsol value 5.Comparison of 4. with Theoretical Nsol value For real models: ‘true’ mean and st.dev values - estimated from whole sets of output data (3000 to data points).

Results Nsol values for individual algorithm runs are very variable. Average Nsol values for 100 runs per model close to the theoretical values of Nsol. Normality assumption appears robust. Using a ‘look ahead’ period improves performance of the algorithm.

Mean bias significantly different to zero Failed in coverage of true mean Mean est. Nsol significantly different to theoretical Nsol (>3) No ‘look- ahead’ period Proportion of Artificial models 4/242/249/18 Proportion of Real models 1/8 3/5 kLimit = 5 Proportion of Artificial models 1/2401/18 Proportion of Real models 000

% decrease in absolute mean bias kLimit = 0 to kLimit = 5 kLimit = 5 to kLimit = 10 kLimit = 10 to kLimit = 25 Artificial Models 8.76%0.07%0.26% Real Models 10.45%0.14%0.33% Impact of different look ahead periods on performance of algorithm

Number of times the Nsol value changes (out of 100 runs of the algorithm per model) because of the lengthening of the ‘look ahead’ period. Model ID kLimit = 0 to kLimit = 5 kLimit = 5 to kLimit = 10 kLimit = 10 to kLimit = 25 R1000 R3200 R52401 R82441 A53013 A62663 A15100 A A A243700

Model IDkLimitNsolTheoretical Nsol (approx) Mean estimate significantly different to the true mean? A220464Yes 554No A904112Yes 5120No A Yes 5718No A210837Yes 538No R70310Yes 58No R4036Yes 57No R80345Yes 546No Eg.s of changes in Nsol & improvement in estimate of true mean

Model ID kLimitNsolTheoretical Nsol (approx) Mean estimate significantly different to the true mean? A904112Yes 5120No A Yes 5718No R70310Yes 58No R4036Yes 57No R80345Yes 546No Examples of changes in Nsol & improvement in estimate of true mean

DISCUSSION kLimit default value set to 5. Initial number of replications set to 3. Multiple response variables - Algorithm run with each response - use maximum estimated value for Nsol. Different scenarios - advisable to repeat algorithm every few scenarios to check that precision has not degraded significantly. Inclusion into simulation package: Full explanations of algorithm and results.

SUMMARY Selection and automation of Confidence Interval (with Specified Precision) Method for estimating the number of replications to be run in a simulation. Algorithm created with ‘look ahead’ period - efficient and performs well on wide selection of artificial and real model output. ‘Black box’ - fully automated and does not require user intervention.

ACKNOWLEDGMENTS This work is part of the Automating Simulation Output Analysis (AutoSimOA) project ( that is funded by the UK Engineering and Physical Sciences Research Council (EP/D033640/1). The work is being carried out in collaboration with SIMUL8 Corporation, who are also providing sponsorship for the project. Katy Hoad, Stewart Robinson, Ruth Davies Warwick Business School OR49 Sept 07