4. Review of Basic Probability and Statistics

Outline:
4.1. Random Variables and Their Properties
4.2. Simulation Output Data and Stochastic Processes
4.3. Estimation of Means and Variances
4.4. Confidence Interval for the Mean

Simulation Modeling and Analysis – Chapter 4 – Review of Basic Probability and Statistics

4.1. Random Variables and Their Properties

A random variable X is said to be discrete if it can take on at most a countable number of values, say, x1, x2, ... . The probability that X is equal to xi is given by

$p(x_i) = P(X = x_i)$ for $i = 1, 2, \ldots$, and $\sum_{i=1}^{\infty} p(x_i) = 1$,

where p(x) is the probability mass function. The distribution function F(x) is

$F(x) = P(X \le x) = \sum_{x_i \le x} p(x_i)$ for all $-\infty < x < \infty$.

Example 4.1: Consider the demand-size random variable of Section 1.5 of Law and Kelton, which takes on the values 1, 2, 3, 4 with respective probabilities 1/6, 1/3, 1/3, 1/6. The probability mass function and the distribution function are given in Figures 4.1 and 4.2.

Figure 4.1. p(x) for the demand-size random variable.

Figure 4.2. F(x) for the demand-size random variable.
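The probability mass function and distribution function of Example 4.1 can be tabulated directly. The following is a minimal sketch, not part of the original slides; the names `pmf` and `cdf` are illustrative, and exact fractions are used so that the probabilities sum to exactly 1.

```python
from fractions import Fraction

# Demand-size distribution from Example 4.1: values 1..4 with
# probabilities 1/6, 1/3, 1/3, 1/6.
pmf = {1: Fraction(1, 6), 2: Fraction(1, 3), 3: Fraction(1, 3), 4: Fraction(1, 6)}


def cdf(x):
    """F(x) = P(X <= x): the sum of p(x_i) over all x_i <= x."""
    return sum((p for xi, p in pmf.items() if xi <= x), Fraction(0))


total = sum(pmf.values())  # should be exactly 1

# F(x) is a step function: it jumps at 1, 2, 3, 4 and is flat in between.
for x in (0, 1, 2, 2.5, 3, 4, 5):
    print(f"F({x}) = {cdf(x)}")
```

Evaluating `cdf` between the jump points (for example at 2.5) shows the flat segments of the step function sketched in Figure 4.2.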

A random variable X is said to be continuous if there exists a nonnegative function f(x), the probability density function, such that for any set of real numbers B,

$P(X \in B) = \int_B f(x)\,dx$ and $\int_{-\infty}^{\infty} f(x)\,dx = 1$

(where "∈" means "contained in").

If x is a number and ∆x > 0, then

$P(X \in [x, x + \Delta x]) = \int_x^{x + \Delta x} f(y)\,dy$,

which is the left shaded area in Figure 4.3.

Figure 4.3. Interpretation of the probability density function f(x).

The distribution function F(x) for a continuous random variable X is

$F(x) = P(X \in (-\infty, x]) = \int_{-\infty}^{x} f(y)\,dy$ for all $-\infty < x < \infty$.

Example 4.2: The probability density function and distribution function for an exponential random variable with mean β are defined as follows (see Figures 4.4 and 4.5):

$f(x) = \frac{1}{\beta} e^{-x/\beta}$ if $x \ge 0$, and $f(x) = 0$ otherwise

and

$F(x) = 1 - e^{-x/\beta}$ if $x \ge 0$, and $F(x) = 0$ otherwise.

Figure 4.4. f(x) for an exponential random variable with mean β.

Figure 4.5. F(x) for an exponential random variable with mean β.
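The relationship between the density and the distribution function in Example 4.2 can be checked numerically. The sketch below is illustrative only: the function names, the midpoint-rule integrator, and the choice β = 2 are assumptions made for the example, not part of the original slides.

```python
import math


def expo_pdf(x, beta):
    """Density of an exponential random variable with mean beta."""
    return math.exp(-x / beta) / beta if x >= 0 else 0.0


def expo_cdf(x, beta):
    """Distribution function of an exponential random variable with mean beta."""
    return 1.0 - math.exp(-x / beta) if x >= 0 else 0.0


def integrate(f, a, b, steps=10_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    h = (b - a) / steps
    return h * sum(f(a + (k + 0.5) * h) for k in range(steps))


beta = 2.0  # illustrative mean
# The area under f from 0 to x should match F(x).
area = integrate(lambda y: expo_pdf(y, beta), 0.0, 3.0)
print(area, expo_cdf(3.0, beta))
```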

The random variables X and Y are independent if knowing the value that one takes on tells us nothing about the distribution of the other. The mean or expected value of the random variable X, denoted by μ or E(X), is given by

$\mu = \sum_{i=1}^{\infty} x_i\, p(x_i)$ if X is discrete, and $\mu = \int_{-\infty}^{\infty} x f(x)\,dx$ if X is continuous.

The mean is one measure of the central tendency of a random variable.

Properties:
1. E(cX) = cE(X)
2. E(X + Y) = E(X) + E(Y), regardless of whether X and Y are independent

The variance of the random variable X, denoted by σ² or Var(X), is given by

$\sigma^2 = E[(X - \mu)^2] = E(X^2) - \mu^2$

The variance is a measure of the dispersion of a random variable about its mean (see Figure 4.6).

Figure 4.6. Density functions for continuous random variables with large and small variances.

Properties:
1. Var(cX) = c²Var(X)
2. Var(X + Y) = Var(X) + Var(Y) if X and Y are independent

The square root of the variance is called the standard deviation and is denoted by σ. It can be given the most definitive interpretation when X has a normal distribution (see Figure 4.7).
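The definitions of the mean and variance, and the scaling property Var(cX) = c²Var(X), can be verified on the demand-size random variable of Example 4.1. This is a small sketch with illustrative names (`expect`, `var_scaled`) and an arbitrary constant c = 3; it is not taken from the original slides.

```python
from fractions import Fraction

# Demand-size distribution from Example 4.1.
pmf = {1: Fraction(1, 6), 2: Fraction(1, 3), 3: Fraction(1, 3), 4: Fraction(1, 6)}


def expect(g):
    """E[g(X)] for the discrete random variable X with the pmf above."""
    return sum((g(x) * p for x, p in pmf.items()), Fraction(0))


mu = expect(lambda x: x)                        # E(X)
var = expect(lambda x: (x - mu) ** 2)           # E[(X - mu)^2]
var_alt = expect(lambda x: x * x) - mu ** 2     # E(X^2) - mu^2

c = 3  # arbitrary constant for the scaling check
var_scaled = expect(lambda x: (c * x) ** 2) - expect(lambda x: c * x) ** 2

print(mu, var, var_alt, var_scaled)
```

Both forms of the variance agree, and scaling X by c multiplies the variance by c².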

Figure 4.7. Density function for a N(μ, σ²) distribution. The area between μ - σ and μ + σ is approximately 0.68.

The covariance between the random variables X and Y, denoted by Cov(X, Y), is defined by

Cov(X, Y) = E{[X - E(X)][Y - E(Y)]} = E(XY) - E(X)E(Y)

The covariance is a measure of the dependence between X and Y. Note that Cov(X, X) = Var(X).

Definitions:
• If Cov(X, Y) = 0, X and Y are uncorrelated
• If Cov(X, Y) > 0, X and Y are positively correlated
• If Cov(X, Y) < 0, X and Y are negatively correlated

Independent random variables are also uncorrelated.

Note that, in general, we have

Var(X - Y) = Var(X) + Var(Y) - 2Cov(X, Y)

If X and Y are independent, then

Var(X - Y) = Var(X) + Var(Y)

The correlation between the random variables X and Y, denoted by Cor(X, Y), is defined by

$\mathrm{Cor}(X, Y) = \dfrac{\mathrm{Cov}(X, Y)}{\sqrt{\mathrm{Var}(X)\,\mathrm{Var}(Y)}}$

It can be shown that -1 ≤ Cor(X, Y) ≤ 1.
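The covariance and correlation definitions, and the identity for Var(X - Y), can be illustrated with a small joint distribution. The joint probabilities below are hypothetical values chosen only for this sketch; the helper `expect` is likewise an illustrative name.

```python
import math
from fractions import Fraction

# Hypothetical joint pmf p(x, y) on {0, 1} x {0, 1}, for illustration only.
joint = {
    (0, 0): Fraction(2, 5),
    (0, 1): Fraction(1, 10),
    (1, 0): Fraction(1, 10),
    (1, 1): Fraction(2, 5),
}


def expect(g):
    """E[g(X, Y)] under the joint pmf above."""
    return sum((g(x, y) * p for (x, y), p in joint.items()), Fraction(0))


ex = expect(lambda x, y: x)
ey = expect(lambda x, y: y)
var_x = expect(lambda x, y: x * x) - ex ** 2
var_y = expect(lambda x, y: y * y) - ey ** 2

cov = expect(lambda x, y: x * y) - ex * ey                 # E(XY) - E(X)E(Y)
cor = float(cov) / math.sqrt(float(var_x * var_y))         # lies in [-1, 1]

# Var(X - Y) computed directly, to compare with Var(X) + Var(Y) - 2 Cov(X, Y).
var_diff = expect(lambda x, y: (x - y) ** 2) - expect(lambda x, y: x - y) ** 2

print(cov, cor, var_diff, var_x + var_y - 2 * cov)
```

Here the covariance is positive, so X and Y are positively correlated, and the directly computed Var(X - Y) matches the general formula.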

4.2. Simulation Output Data and Stochastic Processes

A stochastic process is a collection of "similar" random variables ordered over time, all defined relative to the same experiment. If the collection is X1, X2, ..., then we have a discrete-time stochastic process. If the collection is {X(t), t ≥ 0}, then we have a continuous-time stochastic process.

Example 4.3: Consider the single-server queueing system of Chapter 1 with independent interarrival times A1, A2, ... and independent processing times P1, P2, ... . Relative to the experiment of generating the Ai's and Pi's, one can define the discrete-time stochastic process of delays in queue D1, D2, ... as follows:

$D_1 = 0$
$D_{i+1} = \max\{D_i + P_i - A_{i+1},\, 0\}$ for $i = 1, 2, \ldots$
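The recursion of Example 4.3 translates almost line for line into code. The sketch below is a minimal illustration: the function name `delays_in_queue` and the input values are assumptions made for the example, and the lists are 0-indexed, so `interarrival[i]` holds A_{i+1} and `processing[i]` holds P_{i+1}.

```python
def delays_in_queue(interarrival, processing):
    """Return [D_1, ..., D_n] given A_1..A_n and P_1..P_n.

    Implements D_1 = 0 and D_{i+1} = max(D_i + P_i - A_{i+1}, 0).
    """
    if len(interarrival) != len(processing):
        raise ValueError("need one processing time per interarrival time")
    if not interarrival:
        return []
    delays = [0.0]  # D_1 = 0: the first customer never waits
    for i in range(len(interarrival) - 1):
        delays.append(max(delays[i] + processing[i] - interarrival[i + 1], 0.0))
    return delays


# Hypothetical inputs, chosen so the recursion is easy to follow by hand.
A = [0.5, 1.0, 1.0, 3.0]
P = [2.0, 2.0, 1.0, 1.0]
D = delays_in_queue(A, P)
print(D)
```

With these inputs the second and third customers arrive while the server is still busy and must wait, while the fourth arrives after the backlog has cleared.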

Thus, the simulation maps the input random variables into the output process of interest.

Other examples of stochastic processes:
• N1, N2, ... , where Ni = number of parts produced in the ith hour for a manufacturing system
• T1, T2, ... , where Ti = time in system of the ith part for a manufacturing system
• {Q(t), t ≥ 0}, where Q(t) = number of customers in queue at time t
• C1, C2, ... , where Ci = total cost in the ith month for an inventory system
• E1, E2, ... , where Ei = end-to-end delay of the ith message to reach its destination in a communications network
• {R(t), t ≥ 0}, where R(t) = number of red tanks in a battle at time t

Example 4.4: Consider the delay-in-queue process D1, D2, ... for the M/M/1 queue with utilization factor ρ. Then the correlation function ρj between Di and Di+j is given in Figure 4.8.

Figure 4.8. Correlation function ρj of the process D1, D2, ... for the M/M/1 queue, plotted against the lag j = 1, 2, ..., 10 for ρ = 0.5 and ρ = 0.9.

4.3. Estimation of Means and Variances

Let X1, X2, ..., Xn be independent, identically distributed (IID) random variables with population mean and variance μ and σ², respectively.

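Population parameters and their sample estimates:
• $\mu$ is estimated by the sample mean $\bar{X}(n) = \frac{1}{n}\sum_{i=1}^{n} X_i$
• $\sigma^2$ is estimated by the sample variance $S^2(n) = \frac{1}{n-1}\sum_{i=1}^{n} [X_i - \bar{X}(n)]^2$
• $\mathrm{Var}[\bar{X}(n)] = \sigma^2/n$ is estimated by $S^2(n)/n$

Note that $\bar{X}(n)$ is an unbiased estimator of μ, i.e., $E[\bar{X}(n)] = E(X) = \mu$.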

The difficulty with using $\bar{X}(n)$ as an estimator of μ without any additional information is that we have no way of assessing how close $\bar{X}(n)$ is to μ. Because $\bar{X}(n)$ is a random variable with variance $\mathrm{Var}[\bar{X}(n)]$, on one experiment $\bar{X}(n)$ may be close to μ while on another $\bar{X}(n)$ may differ from μ by a large amount (see Figure 4.9).

Figure 4.9. Two observations of the random variable $\bar{X}(n)$.

The usual way to assess the precision of $\bar{X}(n)$ as an estimator of μ is to construct a confidence interval for μ, which we discuss in the next section.
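The standard estimators for IID data (sample mean, sample variance with divisor n - 1, and the estimated variance of the sample mean) can be computed with Python's standard library. This is a sketch under the IID assumption stated above; the function name `estimate` and the data are hypothetical.

```python
import statistics


def estimate(data):
    """Return (sample mean, sample variance, estimated variance of the mean)."""
    n = len(data)
    xbar = statistics.mean(data)
    s2 = statistics.variance(data)  # unbiased form: divides by n - 1
    return xbar, s2, s2 / n


# Hypothetical observations, for illustration only.
xbar, s2, var_of_mean = estimate([1.0, 2.0, 3.0, 4.0])
print(xbar, s2, var_of_mean)
```

Note that `statistics.variance` uses the n - 1 divisor, whereas `statistics.pvariance` divides by n and would not give the unbiased sample variance.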

4.4. Confidence Interval for the Mean

Let X1, X2, ..., Xn be IID random variables with mean μ. Then an (approximate) 100(1 - α) percent (0 < α < 1) confidence interval for μ is

$\bar{X}(n) \pm t_{n-1,\,1-\alpha/2} \sqrt{\dfrac{S^2(n)}{n}}$

where $t_{n-1,\,1-\alpha/2}$ is the upper 1 - α/2 critical point for a t distribution with n - 1 df (see Figure 4.10).

Figure 4.10. Standard normal distribution and t distribution with n - 1 df.
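A t confidence interval for the mean can be sketched as follows. Python's standard library has no t distribution, so this example assumes the critical point is looked up in a t table and passed in; the function name, the data, and the value 2.262 (the tabulated 0.975 critical point for 9 df, giving a 95 percent interval) are illustrative choices.

```python
import math
import statistics


def t_confidence_interval(data, t_crit):
    """Return (lower, upper) = mean -/+ t_crit * sqrt(S^2(n) / n).

    t_crit must be the upper 1 - alpha/2 critical point of the
    t distribution with len(data) - 1 degrees of freedom.
    """
    n = len(data)
    mean = statistics.mean(data)
    s2 = statistics.variance(data)  # sample variance, divisor n - 1
    half_length = t_crit * math.sqrt(s2 / n)
    return mean - half_length, mean + half_length


# Hypothetical sample of n = 10 observations.
data = [4.1, 5.3, 3.8, 6.0, 4.7, 5.1, 4.4, 5.6, 4.9, 5.2]
lower, upper = t_confidence_interval(data, t_crit=2.262)
print(lower, upper)
```

Passing the critical point as a parameter keeps the sketch dependency-free; a statistics package with a t quantile function could supply it instead.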

Interpretation of a confidence interval: If one constructs a very large number of independent 100(1 - α) percent confidence intervals each based on n observations, where n is sufficiently large, then the proportion of these confidence intervals that contain μ should be 1 - α (regardless of the distribution of X).

Alternatively, if X is N(μ, σ²), then the coverage probability will be 1 - α regardless of the value of n. If X is not N(μ, σ²), then there will be a degradation in coverage for "small" n. The greater the skewness of the distribution of X, the greater the degradation (see pp. 256-257).
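The coverage statement can be explored with a small Monte Carlo experiment. The sketch below is an illustration, not a reproduction of any result in Law and Kelton: it assumes n = 10, a nominal 90 percent level (using the tabulated critical point 1.833 for 9 df), a fixed seed, and compares normal data with exponential data, both with mean 1.

```python
import math
import random

T_CRIT = 1.833  # tabulated 0.95 critical point of the t distribution, 9 df


def covers(sample, mu, t_crit):
    """True if the t confidence interval built from sample contains mu."""
    n = len(sample)
    mean = sum(sample) / n
    s2 = sum((x - mean) ** 2 for x in sample) / (n - 1)
    half_length = t_crit * math.sqrt(s2 / n)
    return mean - half_length <= mu <= mean + half_length


def estimate_coverage(draw, mu, n=10, reps=20_000, seed=1):
    """Fraction of reps independent intervals (n observations each) covering mu."""
    rng = random.Random(seed)
    hits = sum(
        covers([draw(rng) for _ in range(n)], mu, T_CRIT) for _ in range(reps)
    )
    return hits / reps


# Both distributions have mean 1; n must stay at 10 to match T_CRIT.
cov_normal = estimate_coverage(lambda rng: rng.gauss(1.0, 1.0), mu=1.0)
cov_expo = estimate_coverage(lambda rng: rng.expovariate(1.0), mu=1.0)
print(cov_normal, cov_expo)
```

The estimated coverage for normal data should sit near the nominal 0.90, while the skewed exponential data should fall somewhat below it at this small sample size.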

Important characteristics:
• Confidence level (e.g., 90 percent)
• Half-length (see also p. 511)

Problem 4.1: If we want to decrease the half-length by a factor of approximately 2 and n is "large" (e.g., 50), then to what value does n need to be increased?
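One way to reason about Problem 4.1 is sketched below, under the assumption that for large n both the critical point and the sample variance change little as n grows, so the half-length behaves roughly like a constant divided by the square root of n.

```latex
\[
\text{half-length}(n) = t_{n-1,\,1-\alpha/2}\sqrt{\frac{S^2(n)}{n}}
\;\approx\; \frac{c}{\sqrt{n}}
\quad\Longrightarrow\quad
\frac{\text{half-length}(4n)}{\text{half-length}(n)} \approx \frac{1}{2}
\]
% Under this approximation, halving the half-length requires about four
% times as many observations: from n = 50 to roughly 4 x 50 = 200.
```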

Recommended reading: Chapter 4 in Law and Kelton