Copyright © 2010 Lumina Decision Systems, Inc. Common Parametric Distributions Gentle Introduction to Modeling Uncertainty Series #6 Lonnie Chrisman, Ph.D.

Slides:



Advertisements
Similar presentations
Chapter 6 Continuous Random Variables and Probability Distributions
Advertisements

Exponential Distribution. = mean interval between consequent events = rate = mean number of counts in the unit interval > 0 X = distance between events.
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
SE503 Advanced Project Management Dr. Ahmed Sameh, Ph.D. Professor, CS & IS Project Uncertainty Management.
Copyright © 2010 Lumina Decision Systems, Inc. Measures of Risk and Utility Analytica Users Group Gentle Intro to Modeling Uncertainty Webinar Series Session.
Lecture (7) Random Variables and Distribution Functions.
Copyright © 2010 Lumina Decision Systems, Inc. Risk Analysis for Portfolios Analytica Users Group Modeling Uncertainty Webinar Series, #5 3 June 2010 Lonnie.
Sampling Distributions (§ )
Lecture 10 – Introduction to Probability Topics Events, sample space, random variables Examples Probability distribution function Conditional probabilities.
Copyright © 2010 Lumina Decision Systems, Inc. Common Parametric Distributions Gentle Introduction to Modeling Uncertainty Series #6 Lonnie Chrisman, Ph.D.
Chapter 4 Discrete Random Variables and Probability Distributions
Test 2 Stock Option Pricing
Probability Densities
Simulation Modeling and Analysis
Chapter 6 Continuous Random Variables and Probability Distributions
Introduction Experiment  measurement Random component  the measurement might differ in day-to-day replicates because of small variations.
A random variable that has the following pmf is said to be a binomial random variable with parameters n, p The Binomial random variable.
CHAPTER 6 Statistical Analysis of Experimental Data
Continuous Random Variables and Probability Distributions
Chapter 5 Continuous Random Variables and Probability Distributions
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Continuous Random Variables and Probability Distributions.
Great Theoretical Ideas in Computer Science.
Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.
Chapter 4 Continuous Random Variables and Probability Distributions
Lecture 10 – Introduction to Probability Topics Events, sample space, random variables Examples Probability distribution function Conditional probabilities.
Copyright © 2010 Lumina Decision Systems, Inc. Statistical Hypothesis Testing (8 th Session in “Gentle Introduction to Modeling Uncertainty”) Lonnie Chrisman,
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 4 and 5 Probability and Discrete Random Variables.
QA in Finance/ Ch 3 Probability in Finance Probability.
Short Resume of Statistical Terms Fall 2013 By Yaohang Li, Ph.D.
Copyright © 2010 Lumina Decision Systems, Inc. Modeling Uncertainty: Probability Distributions Lonnie Chrisman, Ph.D. Lumina Decision Systems Analytica.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Chapter 4 – Modeling Basic Operations and Inputs  Structural modeling: what we’ve done so far ◦ Logical aspects – entities, resources, paths, etc. 
Topic 4 - Continuous distributions
Chapter 5 Statistical Models in Simulation
Random Variables & Probability Distributions Outcomes of experiments are, in part, random E.g. Let X 7 be the gender of the 7 th randomly selected student.
Ch4: 4.3The Normal distribution 4.4The Exponential Distribution.
Modeling and Simulation CS 313
Copyright © 2010 Lumina Decision Systems, Inc. Monte Carlo Simulation Analytica User Group Modeling Uncertainty Series #3 13 May 2010 Lonnie Chrisman,
Theory of Probability Statistics for Business and Economics.
1 Topic 4 - Continuous distributions Basics of continuous distributions Uniform distribution Normal distribution Gamma distribution.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Random Variables. A random variable X is a real valued function defined on the sample space, X : S  R. The set { s  S : X ( s )  [ a, b ] is an event}.
ENGR 610 Applied Statistics Fall Week 3 Marshall University CITE Jack Smith.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Continuous Random Variables.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
STA347 - week 31 Random Variables Example: We roll a fair die 6 times. Suppose we are interested in the number of 5’s in the 6 rolls. Let X = number of.
EAS31116/B9036: Statistics in Earth & Atmospheric Sciences Lecture 3: Probability Distributions (cont’d) Instructor: Prof. Johnny Luo
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
Random Variable The outcome of an experiment need not be a number, for example, the outcome when a coin is tossed can be 'heads' or 'tails'. However, we.
Learning Simio Chapter 10 Analyzing Input Data
EMIS 7300 SYSTEMS ANALYSIS METHODS Spring 2006 Dr. John Lipp Copyright © Dr. John Lipp.
Random Variables Example:
SADC Course in Statistics The Poisson distribution.
Chapter 31Introduction to Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2012 John Wiley & Sons, Inc.
Introduction A probability distribution is obtained when probability values are assigned to all possible numerical values of a random variable. It may.
3.1 Statistical Distributions. Random Variable Observation = Variable Outcome = Random Variable Examples: – Weight/Size of animals – Animal surveys: detection.
Chap 5-1 Chapter 5 Discrete Random Variables and Probability Distributions Statistics for Business and Economics 6 th Edition.
1 Math 10 Part 4 Slides Continuous Random Variables and the Central Limit Theorem © Maurice Geraghty, 2015.
Introduction to Probability Distributions
Chapter 4 Continuous Random Variables and Probability Distributions
Engineering Probability and Statistics - SE-205 -Chap 4
Chapter 7: Sampling Distributions
Multinomial Distribution
CPSC 531: System Modeling and Simulation
Some Discrete Probability Distributions
Sampling Distributions (§ )
Common Families of Probability Distributions
Geometric Poisson Negative Binomial Gamma
Presentation transcript:

Copyright © 2010 Lumina Decision Systems, Inc. Common Parametric Distributions Gentle Introduction to Modeling Uncertainty Series #6 Lonnie Chrisman, Ph.D. Lumina Decision Systems Analytica Users Group Webinar 10 June 2010

Copyright © 2010 Lumina Decision Systems, Inc. Course Syllabus Over the coming weeks: What is uncertainty? Probability. Probability Distributions Monte Carlo Sampling Measures of Risk and Utility Risk analysis for portfolios Common parametric distributions Assessment of Uncertainty Hypothesis testing

Copyright © 2010 Lumina Decision Systems, Inc. Today’s Topics Continuous vs. discrete. Non-parametric distributions. A handful of the most common distributions. The cases where each is useful. How to encode each in Analytica. Lots of model building exercises…

Copyright © 2010 Lumina Decision Systems, Inc. Outline (Order of exercises) “Pre-test” questions Discrete non-parametric: Monte Hall game Continuous non-parametric: Data resampling Event counts: Durations between events Uncertain percentages Bounded Bell shapes

Copyright © 2010 Lumina Decision Systems, Inc. Distribution Types Discrete Continuous

Copyright © 2010 Lumina Decision Systems, Inc. Custom (Non-parametric) Discrete ChanceDist(P,A,I) Parameters: P = Array of probabilities. Sum(P,I)=1 A = Array of possible outcomes I = Index shared by P and A Note: When A is the index, you can use: ChanceDist(P,A)

Copyright © 2010 Lumina Decision Systems, Inc. ChanceDist Exercise An event occurs on one of the 7 days of the week. Each weekday  8% Each day of weekend  20% Create a chance variable named Day_of_event with this distribution.

Copyright © 2010 Lumina Decision Systems, Inc. ChanceDist Exercise 2: Monte Hall Game You are a contestant on a game show. A prize is hidden behind 1 of three curtains. You select curtain 1. “Before opening your curtain,” says the host, “let me reveal one of the unselected curtains that does not contain the prize… Curtain 2 is empty! Would you now like to change curtains?” Task: Build an Analytica model, computing the probability of winning the prize if you do or do not change curtains.

Copyright © 2010 Lumina Decision Systems, Inc. Monte Hall Steps 1.Chance: Start with the uncertain real location of the prize. 2.Model how the host decides which curtain to show you. He will never reveal the prize or your selected curtain. Otherwise he picks randomly. 3.Decision: Change or not? 4.Objective: Probability that your final selection is the one with the prize.

Copyright © 2010 Lumina Decision Systems, Inc. Custom (non-parametric) Continuous Distributions CumDist(p,x,i) Parameters: p : Probabilities that value <= x x : Ascending set of values i : index shared CumDist(p,x,x) or just CumDist(p,x)

Copyright © 2010 Lumina Decision Systems, Inc. CumDist Exercise A geologist estimates the capacity of a recently discovered oil deposit. He expresses is assessments as follows: 100% that 100K < capacity < 1B barrels 90% that 5M < capacity < 500M barrels 75% that 50M < capacity < 100M barrels Median estimate: 75M barrels Use CumDist to encode these estimates as a distribution for capacity.

Copyright © 2010 Lumina Decision Systems, Inc. Homework challenge: Using CumDist to Resample You have 143 measured values of a quantities. Define an uncertain variable with the same implied distribution (even though your sample size doesn’t match). Here is your synthetic data: Index Data_i := Variable Data := ArcCos(Random( over:data_i)) Steps (the parameters to CumDist): Sort Data in ascending order: Sort(Data,Data_i) Compute p – equal probability steps along Data_I, starting at 0 and ending at 1.

Copyright © 2010 Lumina Decision Systems, Inc. The Most Commonly used Parametric Distributions Discrete: Bernoulli Poisson Binomial Uniform integer Continuous: Normal LogNormal Uniform Triangular Exponential Gamma Beta

Copyright © 2010 Lumina Decision Systems, Inc. Why chose one distribution over another? Discrete or continuous? Bounded quantity or infinite tails? Bounded both sides One-sided tail Two tailed Continuous Uniform Triangular Beta LogNormal Gamma Exponential Normal StudentT Logistic Discrete Binomial Uniform int Poisson

Copyright © 2010 Lumina Decision Systems, Inc. Why chose one distribution over another? Discrete or continuous? Bounded quantity or infinite tails? Convenience Some distributions are more “natural” for certain types of quantities. Ease of assessment. Analytical properties for mathematicians – not model builders. Correctness Other than broad properties, the sensitivity of computed results to specific choice of distributions for assessments is usually extremely low. x

Copyright © 2010 Lumina Decision Systems, Inc. Distributions for Integer-valued Counts #1 Poisson(mean) Count of events per unit time. # Earthquakes >6.0 in a given year # Vehicles that pass in a given hour # Alarms in a given month # Pelicans rescued from oil spill today When the occurrence of each event is independent of the time of occurrence of other events, the # of occurrences in any given window is Poisson distributed.

Copyright © 2010 Lumina Decision Systems, Inc. Distributions for Integer-valued Counts #2 Binomial(n,p) Number of times an event occurs in n repeated independent trials, each having probability p. # oil well blowouts in the next 100 deep-water wells drilled. # people that visit a store in its first month out of the 10,000 residents of the town. # of positive test results in 50 samples tested.

Copyright © 2010 Lumina Decision Systems, Inc. Exercise with event counts In a certain region, malaria infections occur at an average rate of 500 infections per year. 10% of infections are fatal. Build an Analytica model to compute the distribution for the number of people expected to die from a malaria infection in a given year.

Copyright © 2010 Lumina Decision Systems, Inc. Duration between events Exponential(rate) When events occur independently at a given rate, this gives the time between successive events. Note: rate = 1 / meanArrivalTime Gamma(a,1/rate) Time for a independent events to occur, each having a mean arrival time of 1/rate.

Copyright © 2010 Lumina Decision Systems, Inc. Arrival times exercise Cars arrive at a stoplight at a rate of 5 per minute. There is room for 10 cars before nearby freeway traffic is blocked. Graph the CDF for the amount of time until cars begin to block freeway traffic when the light is red. If the light stays red for 90 seconds, what fraction of red light-change cycles will result in blocked traffic?

Copyright © 2010 Lumina Decision Systems, Inc. Uncertain Percentages Beta(a,b) Useful for modeling uncertainty about a probability or percentage. Beta(a,b) expresses uncertainty on a [0,1] bounded quantity. Suppose you’ve seen s true instances out of n observations, with no further information. You’d estimate the true proportion as p=s/n. The uncertainty in this estimate can be modeled as: Beta(s+1,n-s+1) Exercise: Of 100 sampled voters, 55 supported Candidate A. Model the uncertainty on the true proportion.

Copyright © 2010 Lumina Decision Systems, Inc. Bounded Distributions Triangular(min,mode,max) Often very convenient & natural for expressing estimates when only the range and a best guess are available. Pert(min,mode,max) Same idea as Triangular. To use, include “Distribution Variations.ana” Uniform(min,max) All values between are equally likely. Uniform(min,max,integer:true) All integer values are equally likely.

Copyright © 2010 Lumina Decision Systems, Inc. Bounded comparisons Using: Min = 10 Mode = 15 Max = 40 Compare distributions (on same PDF & CDF plot): Triangular Pert Uniform

Copyright © 2010 Lumina Decision Systems, Inc. Central Limit Theorem Suppose y = x 1 ·x 2 ·x 3 ·.. ·x N Each x i ~ P(·), where P(·) is any distribution. (each x i is independent) As N → ∞, y’s distribution approaches a LogNormal(..) distribution. Example: Visualize the change in a stock price as the product of zillions of independent disturbances, each disturbance changing it by some percentage. Because of this, “bell curve” shaped distributions are both common and natural.