CPE 619 Comparing Systems Using Sample Data Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of.

Slides:



Advertisements
Similar presentations
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Advertisements

Chapter 6 Sampling and Sampling Distributions
Sampling: Final and Initial Sample Size Determination
Statistics and Quantitative Analysis U4320
BCOR 1020 Business Statistics Lecture 17 – March 18, 2008.
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
Chapter 7 Sampling and Sampling Distributions
Comparing Systems Using Sample Data
Ka-fu Wong © 2003 Chap 9- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
k r Factorial Designs with Replications r replications of 2 k Experiments –2 k r observations. –Allows estimation of experimental errors Model:
CONFIDENCE INTERVALS What is the Purpose of a Confidence Interval?
7-2 Estimating a Population Proportion
Experimental Evaluation
IENG 486 Statistical Quality & Process Control
Inferences About Process Quality
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
5-3 Inference on the Means of Two Populations, Variances Unknown
Hypothesis Testing Using The One-Sample t-Test
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
AM Recitation 2/10/11.
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Estimation of Statistical Parameters
Topic 5 Statistical inference: point and interval estimate
Chapter 7 Statistical Inference: Confidence Intervals
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 10. Hypothesis Testing II: Single-Sample Hypothesis Tests: Establishing the Representativeness.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
1 10 Statistical Inference for Two Samples 10-1 Inference on the Difference in Means of Two Normal Distributions, Variances Known Hypothesis tests.
Determination of Sample Size: A Review of Statistical Theory
Manijeh Keshtgary Chapter 13.  How to report the performance as a single number? Is specifying the mean the correct way?  How to report the variability.
© Copyright McGraw-Hill 2000
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved. Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
CPE 619 One Factor Experiments Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama in.
Inferential Statistics 4 Maarten Buis 18/01/2006.
Mystery 1Mystery 2Mystery 3.
© Copyright McGraw-Hill 2004
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
1 Mean Analysis. 2 Introduction l If we use sample mean (the mean of the sample) to approximate the population mean (the mean of the population), errors.
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
CHAPTER 7: TESTING HYPOTHESES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Chapter 6 Sampling and Sampling Distributions
Chapter 9 Introduction to the t Statistic
Confidence Intervals Cont.
Comparing Systems Using Sample Data
More on Inference.
ESTIMATION.
Chapter 4. Inference about Process Quality
Sampling Distributions and Estimation
Chapter 9 Hypothesis Testing.
More on Inference.
CONCEPTS OF ESTIMATION
Econ 3790: Business and Economics Statistics
Presentation transcript:

CPE 619 Comparing Systems Using Sample Data Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama in Huntsville

2 Part III: Probability Theory and Statistics How to report the performance as a single number? Is specifying the mean the correct way? How to report the variability of measured quantities? What are the alternatives to variance and when are they appropriate? How to interpret the variability? How much confidence can you put on data with a large variability? How many measurements are required to get a desired level of statistical confidence? How to summarize the results of several different workloads on a single computer system? How to compare two or more computer systems using several different workloads? Is comparing the mean sufficient? What model best describes the relationship between two variables? Also, how good is the model?

3 Overview Sample Versus Population Confidence Interval for The Mean Approximate Visual Test One Sided Confidence Intervals Confidence Intervals for Proportions Sample Size for Determining Mean and proportions

4 Sample Old French word `essample'  `sample' and `example' One example  theory One sample  Definite statement

5 Sample Versus Population Generate several million random numbers with mean  and standard deviation  Draw a sample of n observations: {x 1, x 2, …, x n } Sample mean (x)  population mean (  ) Parameters: population characteristics Unknown, Use Greek letters (  Statistics: Sample estimates Random, Use English letters (x, s)

6  c1c1 c2c2 Confidence Interval for The Mean k samples  k Sample means  Can't get a single estimate of   Use bounds c 1 and c 2 : Probability{c 1    c 2 } = 1-  (  is very small) Confidence interval: [(c 1, c 2 )] Significance level:  Confidence level: 100(1-  ) Confidence coefficient: 1- 

7 Determining Confidence Interval Use 5-percentile and 95-percentile of the sample means to get 90% Confidence interval  Need many samples (n > 30) Central limit theorem: Sample mean of independent and identically distributed observations: Where  = population mean,  = population standard deviation Standard Error: Standard deviation of the sample mean 100(1-  )% confidence interval for  : z 1-  /2 = (1-  /2)-quantile of N(0,1)  -z 1-  /2 z 1-  /2

8 Example 13.1 x = 3.90, s = 0.95 and n = 32 A 90% confidence interval for the mean = We can state with 90% confidence that the population mean is between 3.62 and The chance of error in this statement is 10%.

9 Confidence Interval: Meaning If we take 100 samples and construct confidence interval for each sample, the interval would include the population mean in 90 cases.  c1c1 c2c2 Total yes > 100(1-  ) Total no  100 

10 Confidence Interval for Small Samples 100(1-  ) % confidence interval for n < 30 Note: can be constructed only if observations come from a normally distributed population t [1-  /2; n-1] = (1-  /2)-quantile of a t-variate with n-1 degrees of freedom Listed in Table A.4 in the Appendix

11 Example 13.2 Sample -0.04, -0.19, 0.14, -0.09, -0.14, 0.19, 0.04, and Mean = 0, Sample standard deviation = For 90% interval: t [0.95;7] = Confidence interval for the mean

12 Testing For A Zero Mean

13 Example 13.3 Difference in processor times: {1.5, 2.6, -1.8, 1.3, -0.5, 1.7, 2.4} Question: Can we say with 99% confidence that one is superior to the other? Sample size = n = 7 Mean = 7.20/7 = 1.03 Sample variance = ( *7.20/7)/6 = 2.57 Sample standard deviation = = 1.60 t [0.995; 6] = % confidence interval = (-1.21, 3.27)

14 Example 13.3 (cont’d) Opposite signs  we cannot say with 99% confidence that the mean difference is significantly different from zero Answer: They are same Answer: The difference is zero

15 Example 13.4 Difference in processor times {1.5, 2.6, -1.8, 1.3, -0.5, 1.7, 2.4}. Question: Is the difference 1? 99% Confidence interval = (-1.21, 3.27) The confidence interval includes 1 => Yes: The difference is 1 with 99% of confidence

16 Paired vs. Unpaired Comparisons Paired: one-to-one correspondence between the i th test of system A and the i th test on system B Example: Performance on i th workload Straightforward analysis: the two samples are treated as one sample of n pairs Use confidence interval of the difference Unpaired: No correspondence Example: n people on System A, n on System B  Need more sophisticated method t-test procedure

17 Example 13.5; Paired Observations Performance: {(5.4, 19.1), (16.6, 3.5), (0.6, 3.4), (1.4, 2.5), (0.6, 3.6), (7.3, 1.7)}. Is one system better? Differences: {-13.7, 13.1, -2.8, -1.1, -3.0, 5.6}. Answer: No. They are not different (the confidence interval includes zero)

18 Unpaired Observations 1. Compute the sample means 2. Compute the sample standard deviations

19 Unpaired Observations (cont ’ d) 3. Compute the mean difference 4. Compute the standard deviation of the mean difference 5. Compute the effective number of degrees of freedom 6. Compute the confidence interval for the mean difference 7. If the confidence interval includes zero, the difference is not significant

20 Example 13.6 Times on System A: {5.36, 16.57, 0.62, 1.41, 0.64, 7.26} Times on system B: {19.12, 3.52, 3.38, 2.50, 3.60, 1.74} Question: Are the two systems significantly different? For system A: For System B:

21 Example 13.6 (cont ’ d) The confidence interval includes zero  the two systems are not different

22 Approximate Visual Test

23 Example 13.7 Times on System A: {5.36, 16.57, 0.62, 1.41, 0.64, 7.26} Times on system B: {19.12, 3.52, 3.38, 2.50, 3.60, 1.74} t [0.95, 5] = The 90% confidence interval for the mean of A = 5.31  (2.015) = (0.24, 10.38) The 90% confidence interval for the mean of B = 5.64  (2.015) = (0.18, 11.10) Confidence intervals overlap and the mean of one falls in the confidence interval for the other  Two systems are not different at this level of confidence

24 What Confidence Level To Use? Need not always be 90% or 95% or 99% Based on the loss that you would sustain if the parameter is outside the range and the gain you would have if the parameter is inside the range Low loss  Low confidence level is fine E.g., lottery of 5 Million, one dollar ticket cost, with probability of winning (one in 10 million) 90% confidence  buy 9 million tickets (and pay $9M) 0.01% confidence level is fine 50% confidence level may or may not be too low 99% confidence level may or may not be too high

25 Hypothesis Testing vs. Confidence Intervals Confidence interval provides more information Hypothesis test = yes-no decision Confidence interval also provides possible range Narrow confidence interval  high degree of precision Wide confidence interval  Low precision Example (-100,100)  No difference (-1,1)  No difference Confidence intervals tell us not only what to say but also how loudly to say it CI is easier to explain to decision makers CI is more useful E.g., parameter range (100, 200) vs. Probability of (parameter = 110) = 3%

26 One Sided Confidence Intervals Two side intervals: 90% Confidence  P(Difference > upper limit) = 5%  P(Difference < Lower limit) = 5% One sided Question: Is the mean greater than 0?  One side confidence interval One sided lower confidence interval for  : Note t at 1-  (not 1-  /2) One sided upper confidence interval for  : For large samples: Use z values instead of t values

27 Example 13.8 Time between crashes Is System A more susceptible to failures than System B? Assume unpaired observations Mean difference Standard deviation of the difference

28 Example 13.8 (cont’d) Effective number of degrees of freedom n > 30  Use z rather than t One sided test  Use z 0.90 =1.28 for 90% confidence 90% Confidence interval (-17.37, * 19.35)=(-17.37, 7.402) CI includes zero  System A is not more susceptible to crashes than system B

29 Confidence Intervals for Proportions Proportion = probabilities of various categories E.g., P(error)=0.01, P(No error)=0.99 n 1 of n observations are of type 1  Assumes Normal approximation of Binomial distribution  Valid only if np  10. Need to use binomial tables if np < 10 Can't use t-values

30 CI for Proportions (cont ’ d) 100(1-  )% one sided confidence interval for the proportion: * * Provided np  10.

31 Example out of 1000 pages printed on a laser printer are illegible np  10 90% confidence interval = 0.01  (1.645)(0.003) = (0.005, 0.015) 95% confidence interval = 0.01  (1.960)(0.003) = (0.004, 0.016)

32 Example 13.9 (cont ’ d) At 90% confidence: 0.5% to 1.5% of the pages are illegible Chances of error = 10% At 95% Confidence: 0.4% to 1.6% of the pages are illegible Chances of error = 5%

33 Example Repetitions on two systems: System A superior in 26 repetitions Question: With 99% confidence, is System A superior? p = 26/40 = 0.65 Standard deviation = 99% confidence interval = 0.65  (2.576)(0.075) = (0.46, 0.84) CI includes 0.5  we cannot say with 99% confidence that system A is superior 90% confidence interval = 0.65  (1.645)(0.075) = (0.53, 0.77) CI does not include 0.5  Can say with 90% confidence that system A is superior.

34 Sample Size for Determining Mean Larger sample  Narrower confidence interval resulting in higher confidence Question: How many observations n to get an accuracy of § r% and a confidence level of 100(1-  )%? r% accuracy implies that confidence interval should be

35 Example Sample mean of the response time = 20 seconds Sample standard deviation = 5 Question: How many repetitions are needed to get the response time accurate within 1 second at 95% confidence? Required accuracy = 1 in 20 = 5% Here, = 20, s= 5, z= 1.960, and r=5, n = A total of 97 observations are needed.

36 Sample Size for Determining Proportions To get a half-width (accuracy of) r:

37 Example Preliminary measurement : illegible print rate of 1 in 10,000 Question: How many pages must be observed to get an accuracy of 1 per million at 95% confidence? Answer: A total of million pages must be observed.

38 Example Algorithm A loses 0.5% of packets and algorithm B loses 0.6% Question: How many packets do we need to observe to state with 95% confidence that algorithm A is better than the algorithm B? Answer:

39 Example (cont ’ d) For non-overlapping intervals: n =  We need to observe 85,000 packets

40 Summary All statistics based on a sample are random and should be specified with a confidence interval If the confidence interval includes zero, the hypothesis that the population mean is zero cannot be rejected Paired observations  Test the difference for zero mean Unpaired observations  More sophisticated t-test Confidence intervals apply to proportions too

41 To Do Read chapter 13