Comparing Systems Using Sample Data Andy Wang CIS 5930-03 Computer Systems Performance Analysis.

Slides:



Advertisements
Similar presentations
Objectives 10.1 Simple linear regression
Advertisements

CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Statistics and Quantitative Analysis U4320
STATISTICAL INFERENCE PART V
Confidence Interval and Hypothesis Testing for:
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Chapter 10 Simple Regression.
Section 7.1 Hypothesis Testing: Hypothesis: Null Hypothesis (H 0 ): Alternative Hypothesis (H 1 ): a statistical analysis used to decide which of two competing.
Statistics: Data Analysis and Presentation Fr Clinic II.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Comparing Systems Using Sample Data
The Simple Regression Model
Topic 2: Statistical Concepts and Market Returns
Experimental Evaluation
Inferences About Process Quality
Statistical Comparison of Two Learning Algorithms Presented by: Payam Refaeilzadeh.
5-3 Inference on the Means of Two Populations, Variances Unknown
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Quantitative Business Methods for Decision Making Estimation and Testing of Hypotheses.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
AM Recitation 2/10/11.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
More About Significance Tests
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
STATISTICAL INFERENCE PART VII
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
One Sample Inf-1 If sample came from a normal distribution, t has a t-distribution with n-1 degrees of freedom. 1)Symmetric about 0. 2)Looks like a standard.
Mid-Term Review Final Review Statistical for Business (1)(2)
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Large sample CI for μ Small sample CI for μ Large sample CI for p
Manijeh Keshtgary Chapter 13.  How to report the performance as a single number? Is specifying the mean the correct way?  How to report the variability.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistics : Statistical Inference Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University 1.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
CPE 619 Comparing Systems Using Sample Data Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of.
© Copyright McGraw-Hill 2004
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Data Analysis Overview Experimental environment prototype real sys exec- driven sim trace- driven sim stochastic sim Workload parameters System Config.
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
8.1 Estimating µ with large samples Large sample: n > 30 Error of estimate – the magnitude of the difference between the point estimate and the true parameter.
Lecture 3 Page 1 CS 239, Spring 2007 Variability in Data CS 239 Experimental Methodologies for System Software Peter Reiher April 10, 2007.
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
© 1998, Geoff Kuenning Comparison Methodology Meaning of a sample Confidence intervals Making decisions and comparing alternatives Special considerations.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Comparing Systems Using Sample Data
Towson University - J. Jung
Chapter 9 Hypothesis Testing.
Chapter 23 Comparing Means.
STATISTICS Topic 1 IB Biology Miss Werba.
Hypothesis Testing: The Difference Between Two Population Means
Variance and Hypothesis Tests
Presentation transcript:

Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis

2 Comparison Methodology Meaning of a sample Confidence intervals Making decisions and comparing alternatives Special considerations in confidence intervals Sample sizes

3 What is a Sample? How tall is a human? –Could measure every person in the world –Or could measure everyone in this room Population has parameters –Real and meaningful Sample has statistics –Drawn from population –Inherently erroneous

4 Sample Statistics How tall is a human? –People in Lov 103 have a mean height –People in Lov 301 have a different mean Sample mean is itself a random variable –Has own distribution

5 Estimating Population from Samples How tall is a human? –Measure everybody in this room –Calculate sample mean –Assume population mean  equals What is the error in our estimate?

6 Estimating Error Sample mean is a random variable  Mean has some distribution  Multiple sample means have “mean of means” Knowing distribution of means, we can estimate error

7 Estimating the Value of a Random Variable How tall is Fred? Suppose average human height is 170 cm  Fred is 170 cm tall –Yeah, right Safer to assume a range

8 Confidence Intervals How tall is Fred? –Suppose 90% of humans are between 155 and 190 cm  Fred is between 155 and 190 cm We are 90% confident that Fred is between 155 and 190 cm

9 Confidence Interval of Sample Mean Knowing where 90% of sample means fall, we can state a 90% confidence interval Key is Central Limit Theorem: –Sample means are normally distributed –Only if independent –Mean of sample means is population mean  –Standard deviation of sample means (standard error) is

10 Estimating Confidence Intervals Two formulas for confidence intervals –Over 30 samples from any distribution: z- distribution –Small sample from normally distributed population: t-distribution Common error: using t-distribution for non-normal population –Central Limit Theorem often saves us

11 The z Distribution Interval on either side of mean: Significance level  is small for large confidence levels Tables of z are tricky: be careful!

12 Example of z Distribution 35 samples: 10, 16, 47, 48, 74, 30, 81, 42, 57, 67, 7, 13, 56, 44, 54, 17, 60, 32, 45, 28, 33, 60, 36, 59, 73, 46, 10, 40, 35, 65, 34, 25, 18, 48, 63

13 Example of z Distribution Sample mean = Standard deviation s = n = 35 Confidence interval: 90% α = 1 – 90% = 0.1, p = 1 – α/2 = 0.95 z [p] = z [0.95] = 1.645

14 Graph of z Distribution Example 90% C.I.

15 The t Distribution Formula is almost the same: Usable only for normally distributed populations! But works with small samples

16 Example of t Distribution 10 height samples: 148, 166, 170, 191, 187, 114, 168, 180, 177, 204

17 Example of t Distribution Sample mean = Standard deviation s = 25.1, n = 10. Confidence interval: 90% α = 1 – 90% = 0.1, p = 1 – α/2 = 0.95 t [p, n - 1] = t [0.95, 9] = % interval is (144.7, 196.3)

18 Graph of t Distribution Example 90% C.I. 99% C.I.

19 Getting More Confidence Asking for a higher confidence level widens the confidence interval –Counterintuitive? How tall is Fred? –90% sure he’s between 155 and 190 cm –We want to be 99% sure we’re right –So we need more room: 99% sure he’s between 145 and 200 cm

Confidence Intervals vs. Standard Deviations 20

Confidence Intervals vs. Standard Deviations 21

22 Making Decisions Why do we use confidence intervals? –Summarizes error in sample mean –Gives way to decide if measurement is meaningful –Allows comparisons in face of error But remember: at 90% confidence, 10% of sample C.I.s do not include population mean

23 Testing for Zero Mean Is population mean significantly  0? If confidence interval includes 0, answer is no Can test for any value (mean of sums is sum of means) Our height samples are consistent with average height of 170 cm –Also consistent with 160 and 180!

24 Comparing Alternatives Often need to find better system –Choose fastest computer to buy –Prove our algorithm runs faster Different methods for paired/unpaired observations –Paired if ith test on each system was same –Unpaired otherwise

25 Comparing Paired Observations For each test calculate performance difference Calculate confidence interval for differences If interval includes zero, systems aren’t different –If not, sign indicates which is better

26 Example: Comparing Paired Observations Do home baseball teams outscore visitors? Sample from : –H –V –H-V Mean 1.4, 90% interval (-0.75, 3.6) –Can’t tell from this data –70% interval is (0.10, 2.76)

Comparing Unpaired Observations CIs do not overlap  A > BCis overlap and mean of one is in the CI of the other  A ~= B Cis overlap but mean of one is not in the CI of the other  t-test 27 Mean A B A B A B A B

28 The t-test (1) 1. Compute sample means and 2. Compute sample standard deviations s a and s b 3. Compute mean difference = 4. Compute standard deviation of difference:

29 The t-test (2) 5. Compute effective degrees of freedom: Note when n a = n b, v = 2n a when n b  ∞, v  n a - 1

30 The t-test (2) 6. Compute the confidence interval 7. If interval includes zero, no difference

31 Comparing Proportions If k of n trials give a certain result (category), then confidence interval is: If interval includes 0.5, can’t say which outcome is statistically meaningful Must have k  10 to get valid results

32 Special Considerations Selecting a confidence level Hypothesis testing One-sided confidence intervals

33 Selecting a Confidence Level Depends on cost of being wrong 90%, 95% are common values for scientific papers Generally, use highest value that lets you make a firm statement –But it’s better to be consistent throughout a given paper

34 Hypothesis Testing The null hypothesis (H 0 ) is common in statistics –Confusing due to double negative –Gives less information than confidence interval –Often harder to compute Should understand that rejecting null hypothesis implies result is meaningful

35 One-Sided Confidence Intervals Two-sided intervals test for mean being outside a certain range (see “error bands” in previous graphs) One-sided tests useful if only interested in one limit Use z 1-   or t 1-  ;n instead of z 1-  /2  or t 1-  /2;n in formulas

36 Sample Sizes Bigger sample sizes give narrower intervals –Smaller values of t, as n increases – in formulas But sample collection is often expensive –What is minimum we can get away with?

37 Choosing a Sample Size To get a given percentage error ±r %: Here, z represents either z or t as appropriate For a proportion p = k/n:

38 Example of Choosing Sample Size Five runs of a compilation took 22.5, 19.8, 21.1, 26.7, 20.2 seconds How many runs to get ±5% confidence interval at 90% confidence level? = 22.1, s = 2.8, t 0.95;4 = Note that first 5 runs can’t be reused! –Think about lottery drawings

White Slide