Summary of Test Sizing Sample size calculations will use different equations and/or techniques, depending on the application. Test sizing is a rational.

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

“Students” t-test.
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Estimation in Sampling
1 1 Slide Chapter 9 Hypothesis Tests Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Statistical Issues in Research Planning and Evaluation
Chapter 11- Confidence Intervals for Univariate Data Math 22 Introductory Statistics.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Sampling distributions. Example Take random sample of students. Ask “how many courses did you study for this past weekend?” Calculate a statistic, say,
1 Difference Between the Means of Two Populations.
What z-scores represent
Sample size computations Petter Mostad
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Part III: Inference Topic 6 Sampling and Sampling Distributions
1 1 Slide © 2006 Thomson/South-Western Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses.
Inferences About Process Quality
Inferential Statistics
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
1 1 Slide © 2005 Thomson/South-Western Chapter 9, Part A Hypothesis Tests Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
1 1 Slide © 2005 Thomson/South-Western Chapter 9, Part B Hypothesis Tests Population Proportion Population Proportion Hypothesis Testing and Decision Making.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Statistics for Data Miners: Part I (continued) S.T. Balke.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Normal Distributions Z Transformations Central Limit Theorem Standard Normal Distribution Z Distribution Table Confidence Intervals Levels of Significance.
Chapter 10 Quality Control.
Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Confidence intervals: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
26134 Business Statistics Tutorial 11: Hypothesis Testing Introduction: Key concepts in this tutorial are listed below 1. Difference.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Green Belt – SIX SIGMA OPERATIONAL Central Limit Theorem.
Chapter 8: Inferences Based on a Single Sample: Tests of Hypotheses
Chapter 9 Hypothesis Testing
Slides by JOHN LOUCKS St. Edward’s University.
Confidence Intervals Topics: Essentials Inferential Statistics
Chapter 9 Hypothesis Testing.
Two-Sample Hypothesis Testing
INF397C Introduction to Research in Information Studies Spring, Day 12
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Tests of Significance The reasoning of significance tests
Chapter 21 More About Tests.
Estimation & Hypothesis Testing for Two Population Parameters
Basic Practice of Statistics - 5th Edition
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Process capability in JMP 12 –A new look
Statistical Process Control
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 7-9
What is the point of these sports?
Step M2 – Variable Process Capability
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Chapter 9 Hypothesis Testing.
St. Edward’s University
Confidence Intervals Topics: Essentials Inferential Statistics
Slides by JOHN LOUCKS St. Edward’s University.
Inference on the Mean of a Population -Variance Known
DESIGN OF EXPERIMENT (DOE)
Presentation transcript:

Summary of Test Sizing Sample size calculations will use different equations and/or techniques, depending on the application. Test sizing is a rational process Test sizing is hard But it’s important But it’s hard

The Wise Approach to Sample Size The situation drives the analytical approach to sample size - One size does NOT fit all Analysts should have tools for your situation Binomial data? (nomograph from Test 101 or AFOTEC program) T-distributed data (see Montgomery’s* operating characteristic curves, p40) ANOVA? (see Montgomery’s operating characteristic curves, p188, 647-654) Etc. Just to give a sense of the variety of situations, each with their different tools. I’ll probably pass out hard copies of a t distribution operating characteristic curve set, just to help walk them through an example – or at least as a visual aid. *Douglas Montgomery. Design and Analysis of Experiments, 5th ed. 1997

The Wise Approach (Continued) Understand basic stats. In particular, understand that test size depends on four quantities: Alpha (Producers Risk) Beta (Consumers Risk) Sigma (inherent variability in MOE – standard deviation) Delta (Operationally significant amount of improvement in MOE over spec. The Militarily Significant Difference.) Get Alpha and Beta from decision-makers Get Sigma and Delta from subject matter experts (or try to calculate Sigma from test data, if you have data) Now SEEK PROFESSIONAL HELP This is the rational approach. It amounts to asking your decisionmakers how much risk they can tolerate, determining in advance how much variability there will be, and finding out how much of a difference is operationally significant. Your professionals are those who have a lot of experience and education in statistics. No, your typical engineer, scientist, or even mathematician may not be up on the approaches to test sizing and may have to hit the books or go get educated. No, this isn’t taught at AFIT or NPS. Yes, you still need that expertise, even though it’s hard to find.

Sample Size Calculation - JPATS Example Sample Size controls magnitude of Producer’s (a) & Consumer’s (b) Risks We will calculate sample size (n), based on a given “militarily significant difference” with desired values of a and b, for a given system. Militarily Significant Difference – The difference (in the measured value) that is operationally significant to the warfighter. The test objective is to determine whether JPATS speed meets the specification

JPATS Aircraft Example Sample Size Calculation Calculate sample size needed to detect a 20 KTAS militarily significant difference (D) Normally Distributed Data Assumed s = 15 KTAS Let a (producer’s risk) = .05 Let b (consumer’s risk) = .10 KTAS = Knots True Airspeed KTAS is Knots True air speed

JPATS Sample Size Calculations m0 - mT = D = militarily significant difference The 20 KTAS militarily significant difference means that a 20 KTAS improvement is significant to the warfighter. m0 - mT (s / n. 5 ) = Za + Zb This example is for a one tailed case, that is, we are only interested in meeting or exceeding a value. The technique is similar for two tailed cases, except that the a and b values used are divided by 2, and there may be different thresholds for upper and lower limits. Showing the formulas for these cases is beyond the scope of the course. In this example, the variance is small compared to the mean values and the difference between the objective and the threshold values. This yields a small sample size and a high confidence levels. The usual case, would have a higher variance compared to the mean value, which would cause the required sample size to increase greatly. The equation on the left of the slide looks just like the equation we used for the central limit theorem as applied to the normal distribution, and for the t distribution. The denominator is the standard deviation of the sample mean, and the numerator is the physical distance between the values. The right hand side of the equation is simply the number of standard deviations the two physical measurements are apart.

Determining Za and Zb Za and Zb are the statistical distance (number of standard deviations from the mean) a = .05 b= .10 Assuming normal distribution, Za and Zb can be obtained from the normal distribution chart on the next slide. Za = Z.05 = 1.65 Zb = Z.10 = 1.28

Cumulative Normal Distribution Values of Zp Given P

JPATS Sample Size Calculations 2 = ( ) 15(1.65 +1.28) (20) m0 - mT (s / n. 5 ) n = 4.83 ~ 5 = Za + Zb This example is for a one tailed case, that is, we are only interested in meeting or exceeding a value. The technique is similar for two tailed cases, except that the a and b values used are divided by 2, and there may be different thresholds for upper and lower limits. Showing the formulas for these cases is beyond the scope of the course. In this example, the variance is small compared to the mean values and the difference between the objective and the threshold values. This yields a small sample size and a high confidence levels. The usual case, would have a higher variance compared to the mean value, which would cause the required sample size to increase greatly. The equation on the left of the slide looks just like the equation we used for the central limit theorem as applied to the normal distribution, and for the t distribution. The denominator is the standard deviation of the sample mean, and the numerator is the physical distance between the values. The right hand side of the equation is simply the number of standard deviations the two physical measurements are apart. A sample size of 5 will give the required protection against Type I and Type II errors. Z.05 = 1.65 Z.10 = 1.28

Distribution of Sample Averages 20 KTAS 5% Producers Risk 10% Consumers Risk The true average is an unknown number A sample size of 5 will allow the sample average to be within 20 KTAS of the true average, with a = .05 and b = .10 The math we used is -only- valid for normally distributed data.