Selecting Input Probability Distributions. 2 Introduction Part of modeling—what input probability distributions to use as input to simulation for: –Interarrival.

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

Exponential Distribution. = mean interval between consequent events = rate = mean number of counts in the unit interval > 0 X = distance between events.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Chapter 5 Statistical Models in Simulation
Selecting Input Probability Distribution. Introduction need to specify probability distributions of random inputs –processing times at a specific machine.
Continuous Probability Distributions.  Experiments can lead to continuous responses i.e. values that do not have to be whole numbers. For example: height.
Use of moment generating functions. Definition Let X denote a random variable with probability density function f(x) if continuous (probability mass function.
Probability Densities
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Chapter 6 Continuous Random Variables and Probability Distributions
Chapter 6 The Normal Distribution and Other Continuous Distributions
Probability and Statistics Review
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Continuous Random Variables and Probability Distributions
Chapter 5 Continuous Random Variables and Probability Distributions
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Continuous Random Variables and Probability Distributions.
Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 4 Continuous Random Variables and Probability Distributions
Describing Data: Numerical
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Chapter 3 – Descriptive Statistics
Chapter 4 – Modeling Basic Operations and Inputs  Structural modeling: what we’ve done so far ◦ Logical aspects – entities, resources, paths, etc. 
Overview Summarizing Data – Central Tendency - revisited Summarizing Data – Central Tendency - revisited –Mean, Median, Mode Deviation scores Deviation.
Topic 4 - Continuous distributions
PROBABILITY & STATISTICAL INFERENCE LECTURE 3 MSc in Computing (Data Analytics)
Moment Generating Functions
Traffic Modeling.
Random Sampling, Point Estimation and Maximum Likelihood.
Theory of Probability Statistics for Business and Economics.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Applied Quantitative Analysis and Practices LECTURE#11 By Dr. Osman Sadiq Paracha.
Continuous Distributions The Uniform distribution from a to b.
ENGR 610 Applied Statistics Fall Week 3 Marshall University CITE Jack Smith.
Ch5. Probability Densities II Dr. Deshi Ye
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
קורס סימולציה ד " ר אמנון גונן 1 ההתפלגויות ב ARENA Summary of Arena’s Probability Distributions Distribution Parameter Values Beta BETA Beta, Alpha Continuous.
Stats Probability Theory Summary. The sample Space, S The sample space, S, for a random phenomena is the set of all possible outcomes.
Expectation. Let X denote a discrete random variable with probability function p(x) (probability density function f(x) if X is continuous) then the expected.
Random Variable The outcome of an experiment need not be a number, for example, the outcome when a coin is tossed can be 'heads' or 'tails'. However, we.
IE 300, Fall 2012 Richard Sowers IESE. 8/30/2012 Goals: Rules of Probability Counting Equally likely Some examples.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
© 2002 Prentice-Hall, Inc.Chap 5-1 Statistics for Managers Using Microsoft Excel 3 rd Edition Chapter 5 The Normal Distribution and Sampling Distributions.
Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions Basic Business.
© 1999 Prentice-Hall, Inc. Chap Statistics for Managers Using Microsoft Excel Chapter 6 The Normal Distribution And Other Continuous Distributions.
Chap 5-1 Discrete and Continuous Probability Distributions.
Yandell – Econ 216 Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Chapter 6 The Normal Distribution and Other Continuous Distributions
Continuous Distributions
Modeling and Simulation CS 313
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
MECH 373 Instrumentation and Measurements
Normal Distribution and Parameter Estimation
Modeling and Simulation CS 313
Chapter 7: Sampling Distributions
Chapter 6 Introduction to Continuous Probability Distributions
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 6 Continuous Probability Distributions
Continuous Distributions
The Normal Distribution
Presentation transcript:

Selecting Input Probability Distributions

2 Introduction Part of modeling—what input probability distributions to use as input to simulation for: –Interarrival times –Service/machining times –Demand/batch sizes –Machine up/down times Inappropriate input distribution(s) can lead to incorrect output, bad decisions Given observed data on input quantities, we can use them in different ways

3 Data Usage UseProsCons Trace-driven Use actual data values to drive simulation Valid vis à vis real world Direct Not generalizable Empirical distribution Use data values to define a “ connect-the-dots ” distribution (several specific ways) Fairly valid Simple Fairly direct May limit range of generated variates (depending on form) Fitted “ standard ” distribution Use data to fit a classical distribution (exponential, uniform, Poisson, etc.) Generalizable — fills in “ holes ” in data May not be valid May be difficult

4 Parameterization of Distributions - 1 There are alternative ways to parameterize most distributions Typically, parameters can be classified as one of: –Location parameter γ (also called shift parameter): specifies an abscissa (x axis) location point of a distribution ’ s range of values, often some kind of midpoint of the distribution Example: μ for normal distribution As γ changes, distribution just shifts left or right without changing its spread or shape If X has location parameter 0, then X + γ has location parameter γ

5 Parameterization of Distributions - 2 –Scale parameter β: determines scale, or units of measurement, or spread, of a distribution Example: σ for normal distribution, β for exponential distribution As β changes, the distribution is compressed or expanded without changing its shape If X has scale parameter 1, then βX has scale parameter β

6 Parameterization of Distributions - 3 –Shape parameter α: determines, separately from location and scale, the basic form or shape of a distribution Examples: normal and exponential distribution do not have shape parameter; α for Gamma and Weibull distributions May have more than one shape parameter (Beta distribution has two shape parameters) Change in shape parameter(s) alters distribution ’ s shape more fundamentally than changes in scale or location parameters

7 Continuous and Discrete Distributions Compendium of 13 continuous and 6 discrete distributions given in the textbook with details on –Possible applications –Density and distribution functions (where applicable) –Parameter definitions and ranges –Range of possible values –Mean, variance, mode –Maximum-likelihood estimator formula or method –General comments, including relationships to other distributions –Plots of densities

8 Summary Measures from Moments Mean and variance –Coefficient of Variation is a measure of variability relative to the mean: CV(X)=  X /  X. Higher moments also give useful information –Skewness coefficient gives information about the shape. –Kurtosis coefficient gives information about the tail weight (likelihood of extreme-value).

Example Find: Mean Variance Coefficient of variation Median Skewness coefficient

10 Exponential Expo(β)

11 Exponential Expo(β) Expo(1) density function

Exponential: Properties Coefficient of Variation is a measure of variability relative to the mean: CV(X)=  X /  X. Its Coefficient of Variation is 1 (unless it is shifted). The density function is monotonically decreasing (at an exponential rate). Times of events: most likely to be small but can be large with small probabilities. Skewness  = 2, Kurtosis (tail weight)  =9. 12

13 Poisson(λ) Bimodal: Two modes

14 Poisson(λ)

Poisson: Properties Counts the number of events of over time. If arrivals occur according to a Poisson process with rate, times between arrivals are exponential with mean 1/  Its Coefficient of Variation is 1/Sqrt( ). Events (i.e.) are generated by a large potential population where each customer chooses to arrive at a given small interval with a very small probability. Number of outbreaks of war over time, number of goals scored in World Cup games. 15

Normal Distribution: Properties Supported by Central Limit Theorem: the random variable is a sum of several small random variables (i.e. total consumer demand). It is symmetrical (skewness = 0, mean=median). Kurtosis=3. It’s usually not appropriate for modeling times between events (can take negative values). 16

Gamma Distribution: Properties Shape parameter:  >0, scale parameter  >0 A special case: sum of exponential random variables (  =1, corresponds to exponential (  ). In general, skewness is positive. The CV is less than one if shape parameter  > 1. Scale = 1, shape=2Scale = 1, shape=20

Weibull Distribution: Properties Shape parameter:  >0, scale parameter  >0 Very versatile 18 Scale = 1, shape=1.5Scale = 1, shape=10

Lognormal Distribution: Properties Y=ln(X) is Normal( ,  ). Models product of several independent random factors ( X=X 1 X 2 …X n ). Very versatile: like gamma and Weibull but can have a spike near zero. 19 Scale = 1, shape=0.5Scale = 2, shape=0.1

20 Empirical Distributions There may be no standard distribution that fits the data adequately: use observed data themselves to specify directly an empirical distribution There are many different ways to specify empirical distributions, resulting in different distributions with different properties.

21 Continuous Empirical Distributions If original individual data points are available (i.e., data are not grouped) –Sort data X 1, X 2,..., X n into increasing order: X (i) is ith smallest –Define F(X (i) ) = (i – 1)/(n – 1), approximately (for large n ) the proportion of the data less than X (i), and interpolate linearly between observed data points:

22 Continuous Empirical Distributions Rises most steeply over regions where observations are dense, as desired. Sample: 3,5,6,7,9,12 F(3)=0, F(5)=1/5, F(6)=2/5, F(7)=3/5, F(9)=4/5, F(12)=1,

23 Potential disadvantages: –Generated data will be within range of observed data –Expected value of this distribution is not the sample mean There are other ways to define continuous empirical distributions, including putting an exponential tail on the right to make the range infinite on the right If only grouped data are available –Don ’ t know individual data values, but counts of observations in adjacent intervals –Define empirical distribution function G(x) with properties similar to F(x) above for individual data points Continuous Empirical Distributions

24 Discrete Empirical Distributions If original individual data points are available (i.e., data are not grouped) –For each possible value x, define p(x) = proportion of the data values that are equal to x If only grouped data are available –Define a probability mass function such that the sum of the p(x) ’ s for the x ’ s in an interval is equal to the proportion of the data in that interval –Allocation of p(x) ’ s for x ’ s in an interval is arbitrary