Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

“Students” t-test.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.5 Estimation of a Population Variance This section presents methods.
1 Difference Between the Means of Two Populations.
PSY 307 – Statistics for the Behavioral Sciences
BCOR 1020 Business Statistics Lecture 17 – March 18, 2008.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
Sampling and Sampling Distributions
7-3 Estimating a Population Mean
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
1 (Student’s) T Distribution. 2 Z vs. T Many applications involve making conclusions about an unknown mean . Because a second unknown, , is present,
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Confidence Interval A confidence interval (or interval estimate) is a range (or an interval) of values used to estimate the true value of a population.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.3 Estimating a Population Mean.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Confidence Intervals (Chapter 8) Confidence Intervals for numerical data: –Standard deviation known –Standard deviation unknown Confidence Intervals for.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Estimating a Population Mean: σ Known 7-3, pg 355.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
© Copyright McGraw-Hill 2000
Chapter Twelve The Two-Sample t-Test. Copyright © Houghton Mifflin Company. All rights reserved.Chapter is the mean of the first sample is the.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Estimates and Sample Sizes Chapter 6 M A R I O F. T R I O L A Copyright © 1998,
Estimating a Population Mean:  Known
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-4 Estimating a Population Mean:  Not Known.
Chapter 10 The t Test for Two Independent Samples
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.4 Estimation of a Population Mean  is unknown  This section presents.
1 ES Chapter 18 & 20: Inferences Involving One Population Student’s t, df = 5 Student’s t, df = 15 Student’s t, df = 25.
Lesoon Statistics for Management Confidence Interval Estimation.
Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3 Estimating a Population Mean: σ Known 7-4 Estimating a Population.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.4: Estimation of a population mean   is not known  This section.
+ Z-Interval for µ So, the formula for a Confidence Interval for a population mean is To be honest, σ is never known. So, this formula isn’t used very.
+ Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population Mean.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 9 Introduction to the t Statistic
Chapter 8 Confidence Interval Estimation Statistics For Managers 5 th Edition.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Chapter 8: Estimating with Confidence
Lecture Slides Elementary Statistics Twelfth Edition
Chapter Eight Estimation.
Chapter 8: Estimating with Confidence
Chapter 9: Inferences Involving One Population
Math 4030 – 10a Tests for Population Mean(s)
Chapter 8: Inference for Proportions
Chapter 8: Estimating with Confidence
Elementary Statistics
CONCEPTS OF ESTIMATION
Warmup To check the accuracy of a scale, a weight is weighed repeatedly. The scale readings are normally distributed with a standard deviation of
Chapter 8: Estimating with Confidence
Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
2/5/ Estimating a Population Mean.
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Estimating a Population Variance
Presentation transcript:

CS 321: Human-Computer Interaction Design Lesson Sixteen Sample Size Iterative Procedure Problem Discovery Large Samples Chapters 6-7

Determining Sample Size could be solved for n, the needed sample size. When conducting a usability test, how large should you make the sample size? Essentially, if you can estimate the critical difference from the test (i.e., d = the smallest difference between the obtained and true value that you need to detect), the sample’s standard deviation (which might be estimated from previous similar experiments), and the critical t-value (for the desired level of statistical confidence), then the formula for t: 𝒕= 𝒅 ( 𝒔 𝒏 ) could be solved for n, the needed sample size. Unlike the z-value, however, which uses a normal distribution, estimating the t-value complicates matters by also being dependent on the degrees of freedom (for a one-sample t-test, df = n - 1). To overcome this problem, an iterative procedure is suggested… CS 321 Lesson Sixteen Sample Size Page 207

Determining Sample Size: Iterative Procedure d = critical difference between obtained and true value 𝒕= 𝒅 ( 𝒔 𝒏 ) s = sample standard deviation estimate (from previous experiments) Use the z-score with the desired level of confidence (from a unit normal table) as an initial estimate of the t-value. Solve the above equation for n. Use a t-distribution table to find the t-score for that value of n (with df = n-1). Recalculate n by using this new t-value in the equation above. Revise the t-score from the t-distribution table. Continue this iteration until two consecutive cycles yield the same n value. CS 321 Lesson Sixteen Sample Size Page 208

Solving the basic t formula for n yields: Sample Size Example Assume that you have been using a 100-point item as a post-task measure of ease-of-use in past usability tests. One of the tasks that you routinely conduct is software installation. For the most recent usability study of the current version of the software package, the variability of this measurement on the 100-point scale is 25 (i.e., s = 5). You’re planning your first usability study with a new version of the software, and you want to get an estimate of this measure with 90% confidence and to be within 2.5 points of the true value. Let’s calculate how many participants you need to run in the study. Solving the basic t formula for n yields: n = 𝒕 𝟐 𝒔 𝟐 𝒅 𝟐 The question indicates that s = 5 and d = 2.5, so an appropriate t-value needs to be determined. CS 321 Lesson Sixteen Sample Size Page 209

Sample Size Example (Continued) For two-sided testing with a 90% confidence interval (i.e., 5% in each tail), a unit normal table indicates that a z-value of 1.645 would make a good first estimate for the t-value. Using the above formula, this yields an n-value of 10.8241, which rounds up to 11. Switching to a t-distribution table, n = 11 (i.e., df = 10) gives us a t-value of 1.812 for a 2-tailed 90% confidence interval, which produces an n-value of 13.133376 in the formula, rounding up to 14. Using n = 14 (df = 13) yields a t-value of 1.771, yielding an n-value of 12.545764, rounding up to 13. Using n = 13 (df = 12) yields a t-value of 1.782, yielding an n-value of 12.702096, again rounding up to 13. Therefore, the final sample estimate size for this study is 13 participants. CS 321 Lesson Sixteen Sample Size Page 210

Problem Discovery Sample Size Recall the binomial probability function, which determines the probability that a problem with probability p will occur k times during a study with n subjects: 𝒏! 𝒌! 𝒏−𝒌 ! 𝒑 𝒌 (𝟏−𝒑) 𝒏−𝒌 By setting k to zero, we get the probability that the problem won’t occur at all with n subjects: 𝒏! 𝟎! 𝒏−𝟎 ! 𝒑 𝟎 (𝟏−𝒑) 𝒏−𝟎 = (𝟏−𝒑) 𝒏 So, for n test subjects, the probability that the problem will occur at least once is: 𝒑 𝒙≥𝟏 =𝟏− (𝟏−𝒑) 𝒏 Solving this for the sample size, we get: 𝒏= ln (𝟏−𝒑 𝒙≥𝟏 ) ln (𝟏−𝒑) CS 321 Lesson Sixteen Sample Size Page 211

Problem Discovery Sample Size (Continued) So, for example, if we want to be 85% certain of observing a problem that affects 31% of all users, then our usability study will need a sample size of: 𝒏= ln (𝟏−𝒑 𝒙≥𝟏 ) ln (𝟏−𝒑) = ln (𝟏−𝟎.𝟖𝟓) ln (𝟏−𝟎.𝟑𝟏) = ln (𝟎.𝟏𝟓) ln (𝟎.𝟔𝟗) ≈ −𝟏.𝟖𝟗𝟕 −𝟎.𝟑𝟕𝟏 ≈𝟓 If we want to be 85% certain of observing a problem that affects 10% of all users: 𝒏= ln (𝟏−𝒑 𝒙≥𝟏 ) ln (𝟏−𝒑) = ln (𝟏−𝟎.𝟖𝟓) ln (𝟏−𝟎.𝟏) = ln (𝟎.𝟏𝟓) ln (𝟎.𝟗) ≈ −𝟏.𝟖𝟗𝟕 −𝟎.𝟏𝟎𝟓 ≈𝟏𝟖 And to be 85% certain of observing a problem that affects only 1% of all users: 𝒏= ln (𝟏−𝒑 𝒙≥𝟏 ) ln (𝟏−𝒑) = ln (𝟏−𝟎.𝟖𝟓) ln (𝟏−𝟎.𝟏) = ln (𝟎.𝟏𝟓) ln (𝟎.𝟗𝟗) ≈ −𝟏.𝟖𝟗𝟕 −𝟎.𝟎𝟏 ≈𝟏𝟖𝟗 CS 321 Lesson Sixteen Sample Size Page 212

Weak Arguments For Large Samples “If the population is large, then the sample needs to be large.” The variance in statistical sampling is determined by the sample size, not the size of the overall population. The evaluation of a design element’s quality is independent of how many people are going to use it. “The more features in the interface, the larger the sample size.” When the interface is loaded with features, more tests are needed, not more users in each test. Test subjects will be overwhelmed if asked to evaluate too many features. “The interface is being designed to accommodate many target audiences.” This only requires larger sample sizes if the different target audiences will use the interface in very different ways (e.g., buyers vs. sellers, teachers vs. students, doctors vs. patients). CS 321 Lesson Sixteen Sample Size Page 213