1 Statistical Inference H Plan: –Discuss statistical methods in simulations –Define concepts and terminology –Traditional approaches: u Hypothesis testing.

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Sampling Distributions (§ )
Chapter Seventeen HYPOTHESIS TESTING
Output Analysis and Experimentation for Systems Simulation.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Statistical inference form observational data Parameter estimation: Method of moments Use the data you have to calculate first and second moment To fit.
Chapter 2 Simple Comparative Experiments
Chapter 7 Estimation: Single Population
Experimental Evaluation
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
Today Concepts underlying inferential statistics
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
Chapter Nine: Evaluating Results from Samples Review of concepts of testing a null hypothesis. Test statistic and its null distribution Type I and Type.
AM Recitation 2/10/11.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Confidence Intervals and Hypothesis Testing - II
Hypothesis Testing. Distribution of Estimator To see the impact of the sample on estimates, try different samples Plot histogram of answers –Is it “normal”
Statistical inference: confidence intervals and hypothesis testing.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Simulation Output Analysis
Topic 5 Statistical inference: point and interval estimate
Population: a data set representing the entire entity of interest - What is a population? Sample: a data set representing a portion of a population Population.
Review: Two Main Uses of Statistics 1)Descriptive : To describe or summarize a collection of data points The data set in hand = all the data points of.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Experimental Psychology PSY 433 Appendix B Statistics.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Academic Research Academic Research Dr Kishor Bhanushali M
1 OUTPUT ANALYSIS FOR SIMULATIONS. 2 Introduction Analysis of One System Terminating vs. Steady-State Simulations Analysis of Terminating Simulations.
Sampling and estimation Petter Mostad
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
© Copyright McGraw-Hill 2004
Review of Statistics.  Estimation of the Population Mean  Hypothesis Testing  Confidence Intervals  Comparing Means from Different Populations  Scatterplots.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
1 Estimation of Population Mean Dr. T. T. Kachwala.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
European Patients’ Academy on Therapeutic Innovation The Purpose and Fundamentals of Statistics in Clinical Trials.
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
Review: Stages in Research Process Formulate Problem Determine Research Design Determine Data Collection Method Design Data Collection Forms Design Sample.
Chapter 13 Understanding research results: statistical inference.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 5 STATISTICAL INFERENCE: ESTIMATION AND HYPOTHESES TESTING
Making inferences from collected data involve two possible tasks:
CPSC 531: System Modeling and Simulation
Statistical Methods Carey Williamson Department of Computer Science
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Discrete Event Simulation - 4
Carey Williamson Department of Computer Science University of Calgary
Sampling Distributions (§ )
Presentation transcript:

1 Statistical Inference H Plan: –Discuss statistical methods in simulations –Define concepts and terminology –Traditional approaches: u Hypothesis testing u Confidence intervals u Batch means u Analysis of Variance (ANOVA)

2 Motivation H Simulations rely on pRNG to produce one or more “sample paths” in the stochastic evaluation of a system H Results represent probabilistic answers to the initial perf eval questions of interest H Simulation results must be interpreted accordingly, using the appropriate statistical approaches and methodology

3 Hypothesis Testing H A technique used to determine whether or not to believe a certain statement (to what degree) H Statement is usually regarding a statistic, and some postulated property of the statistic H Formulate the “null hypothesis” H 0 H Alternative hypothesis H 1 H Decide on statistic to use, and significance level H Collect sample data and calculate test statistic H Decide whether to accept null hypothesis or not

4 Chi-Squared Test H A technique used to determine if sample data follows a certain known distribution H Used for discrete distributions H Requires large samples (at least 30) H Compute D = Σ H Check value against Chi-Squared quantiles i=1 k expected i (observed i – expected i ) 2

5 Kolmogorov-Smirnov Test H A technique used to determine if sample data follows a certain known distribution H Used for continuous distributions H Any number of samples is okay (small/large) H Uses CDF (known distn vs empirical distn) H Compute max vertical deviation from CDF H Check value(s) against K-S quantiles K + = √n max ( F obs (x) – F exp (x) )K - = √n max ( F exp (x) – F obs (x) )

6 Simulation Run Length H Choosing the right duration for a simulation is a bit of an art (inexact step) H A bit like Goldilocks + the “three bears” H Too short: results may not be “typical” H Too long: excessive CPU time required H Just right: good results, reasonable time H Usual approach: guessing; bigger is better

7 Simulation Warmup H One reason why simulation run-length matters is that simulation results might exhibit some temporal bias –Example: the first few customers arrive to an empty system, and are never lost H Need to determine “steady-state”, and discard (biased) transient results from either before (warmup) or after (cooldown)

8 Simulation Replications H One way to establish statistical confidence in simulation results is to repeat an experiment multiple times H Multiple replications, with exact same config parameters, but different seeds H Assumes independent results + normality H Can compute the “mean of means” and the “variance of the global mean”

9 Statistical Inference H Methods to estimate the characteristics of an entire population based on data collected from a (random) sample (subset) H Many different statistics are possible H Desirable properties: –Consistent: convergence toward true value as the sample size is increased –Unbiased: sample is representative of population H Usually works best if samples are independent

10 Random Sampling H Different samples typically produce different estimates, since they themselves represent a random variable with some inherent sampling distribution (known/not) H Statistics can be used to get point estimates (e.g., mean, variance) or interval estimates (e.g., confidence interval) H True values: μ (mean), σ (std deviation)

11 Sample Mean and Variance H Sample mean: H Sample variance: H Sample standard deviation: s = √s 2 x = 1/n Σ x i i=1 n s 2 = 1/(n-1) Σ (x i – x) 2 i=1 n

12 Chebyshev’s Inequality H Expresses a general result about the “goodness” of a sample mean x as an estimate of the true mean μ (for any distn) H Want to be within error ε of true mean μ H Pr[ x - ε < μ < x + ε] ≥ 1 – Var(x) / ε 2 H The lower the variance, the better H The tighter ε is, the harder it is to be sure!

13 Central Limit Theorem H The Central Limit Theorem states that the distribution of Z approaches the standard normal distribution as n approaches ∞ H N(0,1) has mean 0, variance 1 H Recall that Normal distribution is symmetric about the mean H About 67% of obs within 1 std dev H About 95% of obs within 2 std dev

14 Confidence Intervals H There is inherent error when estimating the true mean μ with the sample mean x H How many samples n are needed so that the error is tolerable? (i.e., within some specified threshold value ε) H Pr[|x – μ| < ε] ≥ k (confidence level) H Depends on variance of sampled process H Depends on size of interval ε

15 F-tests and t-tests H A statistical technique to assess the level of significance associated with a result H Computes a “p value” for a result H Loosely stated, this reflects the likelihood (or not) of the observed result occurring, relative to the initial hypothesis made H F-tests: relies on the F distribution H t-tests: relies on the student-t distribution

16 Batch Means Analysis H A lengthy simulation run can be split into N batches, each of which is (assumed to be) independent of the other batches H Can compute mean for each batch i H Can compute mean of means H Can compute variance of means H Can provide confidence intervals

17 Analysis of Variance (ANOVA) H Often the results from a simulation or an experiment will depend on more than one factor (e.g., job size, service class, load) H ANOVA is a technique to determine which factor has the most impact H Focuses on variability (variance) of results H Attributes portion of variability to each of the factors involved, or their interaction

18 Summary H Simulations use pRNG to produce probabilistic answers to the performance evaluation questions of interest H It is important to interpret simulation results appropriately, using the correct statistical approaches and methodology H Basic techniques include confidence intervals, significance tests, and ANOVA