Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Statistical Inference H Plan: –Discuss statistical methods in simulations –Define concepts and terminology –Traditional approaches: u Hypothesis testing.

Similar presentations


Presentation on theme: "1 Statistical Inference H Plan: –Discuss statistical methods in simulations –Define concepts and terminology –Traditional approaches: u Hypothesis testing."— Presentation transcript:

1 1 Statistical Inference H Plan: –Discuss statistical methods in simulations –Define concepts and terminology –Traditional approaches: u Hypothesis testing u Confidence intervals u Batch means u Analysis of Variance (ANOVA)

2 2 Motivation H Simulations rely on pRNG to produce one or more “sample paths” in the stochastic evaluation of a system H Results represent probabilistic answers to the initial perf eval questions of interest H Simulation results must be interpreted accordingly, using the appropriate statistical approaches and methodology

3 3 Hypothesis Testing H A technique used to determine whether or not to believe a certain statement (to what degree) H Statement is usually regarding a statistic, and some postulated property of the statistic H Formulate the “null hypothesis” H 0 H Alternative hypothesis H 1 H Decide on statistic to use, and significance level H Collect sample data and calculate test statistic H Decide whether to accept null hypothesis or not

4 4 Chi-Squared Test H A technique used to determine if sample data follows a certain known distribution H Used for discrete distributions H Requires large samples (at least 30) H Compute D = Σ ------------------ H Check value against Chi-Squared quantiles i=1 k expected i (observed i – expected i ) 2

5 5 Kolmogorov-Smirnov Test H A technique used to determine if sample data follows a certain known distribution H Used for continuous distributions H Any number of samples is okay (small/large) H Uses CDF (known distn vs empirical distn) H Compute max vertical deviation from CDF H Check value(s) against K-S quantiles K + = √n max ( F obs (x) – F exp (x) )K - = √n max ( F exp (x) – F obs (x) )

6 6 Simulation Run Length H Choosing the right duration for a simulation is a bit of an art (inexact step) H A bit like Goldilocks + the “three bears” H Too short: results may not be “typical” H Too long: excessive CPU time required H Just right: good results, reasonable time H Usual approach: guessing; bigger is better

7 7 Simulation Warmup H One reason why simulation run-length matters is that simulation results might exhibit some temporal bias –Example: the first few customers arrive to an empty system, and are never lost H Need to determine “steady-state”, and discard (biased) transient results from either before (warmup) or after (cooldown)

8 8 Simulation Replications H One way to establish statistical confidence in simulation results is to repeat an experiment multiple times H Multiple replications, with exact same config parameters, but different seeds H Assumes independent results + normality H Can compute the “mean of means” and the “variance of the global mean”

9 9 Statistical Inference H Methods to estimate the characteristics of an entire population based on data collected from a (random) sample (subset) H Many different statistics are possible H Desirable properties: –Consistent: convergence toward true value as the sample size is increased –Unbiased: sample is representative of population H Usually works best if samples are independent

10 10 Random Sampling H Different samples typically produce different estimates, since they themselves represent a random variable with some inherent sampling distribution (known/not) H Statistics can be used to get point estimates (e.g., mean, variance) or interval estimates (e.g., confidence interval) H True values: μ (mean), σ (std deviation)

11 11 Sample Mean and Variance H Sample mean: H Sample variance: H Sample standard deviation: s = √s 2 x = 1/n Σ x i i=1 n s 2 = 1/(n-1) Σ (x i – x) 2 i=1 n

12 12 Chebyshev’s Inequality H Expresses a general result about the “goodness” of a sample mean x as an estimate of the true mean μ (for any distn) H Want to be within error ε of true mean μ H Pr[ x - ε < μ < x + ε] ≥ 1 – Var(x) / ε 2 H The lower the variance, the better H The tighter ε is, the harder it is to be sure!

13 13 Central Limit Theorem H The Central Limit Theorem states that the distribution of Z approaches the standard normal distribution as n approaches ∞ H N(0,1) has mean 0, variance 1 H Recall that Normal distribution is symmetric about the mean H About 67% of obs within 1 std dev H About 95% of obs within 2 std dev

14 14 Confidence Intervals H There is inherent error when estimating the true mean μ with the sample mean x H How many samples n are needed so that the error is tolerable? (i.e., within some specified threshold value ε) H Pr[|x – μ| < ε] ≥ k (confidence level) H Depends on variance of sampled process H Depends on size of interval ε

15 15 F-tests and t-tests H A statistical technique to assess the level of significance associated with a result H Computes a “p value” for a result H Loosely stated, this reflects the likelihood (or not) of the observed result occurring, relative to the initial hypothesis made H F-tests: relies on the F distribution H t-tests: relies on the student-t distribution

16 16 Batch Means Analysis H A lengthy simulation run can be split into N batches, each of which is (assumed to be) independent of the other batches H Can compute mean for each batch i H Can compute mean of means H Can compute variance of means H Can provide confidence intervals

17 17 Analysis of Variance (ANOVA) H Often the results from a simulation or an experiment will depend on more than one factor (e.g., job size, service class, load) H ANOVA is a technique to determine which factor has the most impact H Focuses on variability (variance) of results H Attributes portion of variability to each of the factors involved, or their interaction

18 18 Summary H Simulations use pRNG to produce probabilistic answers to the performance evaluation questions of interest H It is important to interpret simulation results appropriately, using the correct statistical approaches and methodology H Basic techniques include confidence intervals, significance tests, and ANOVA


Download ppt "1 Statistical Inference H Plan: –Discuss statistical methods in simulations –Define concepts and terminology –Traditional approaches: u Hypothesis testing."

Similar presentations


Ads by Google