Statistical Techniques I EXST7005 Sample Size Calculation.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Tests of Hypotheses Based on a Single Sample
“Students” t-test.
Statistical Techniques I
Hypothesis testing Another judgment method of sampling data.
Statistical Issues in Research Planning and Evaluation
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
AP Statistics – Chapter 9 Test Review
1 Difference Between the Means of Two Populations.
Statistics Are Fun! Analysis of Variance
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
BCOR 1020 Business Statistics
Overview of Lecture Parametric Analysis is used for
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Inference about a Mean Part II
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Sample Size Determination In the Context of Hypothesis Testing
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
BCOR 1020 Business Statistics
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
The one sample t-test November 14, From Z to t… In a Z test, you compare your sample to a known population, with a known mean and standard deviation.
Chapter 9: Introduction to the t statistic
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Hypothesis testing.
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Chapter 10 Hypothesis Testing
Intermediate Statistical Analysis Professor K. Leppel.
Statistical Techniques I
Statistical Analysis Statistical Analysis
Statistical Techniques I EXST7005 Conceptual Intro to ANOVA.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
June 18, 2008Stat Lecture 11 - Confidence Intervals 1 Introduction to Inference Sampling Distributions, Confidence Intervals and Hypothesis Testing.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Statistics and Quantitative Analysis U4320
Chapter 10 Hypothesis Testing
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.3 Estimating a Population Mean.
Approximate letter grade assignments ~ D C B 85 & up A.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
Two-Sample Hypothesis Testing. Suppose you want to know if two populations have the same mean or, equivalently, if the difference between the population.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
© Copyright McGraw-Hill 2004
Statistical Techniques
Example x y We wish to check for a non zero correlation.
T tests comparing two means t tests comparing two means.
While you wait: Enter the following in your calculator. Find the mean and sample variation of each group. Bluman, Chapter 121.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
+ Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population Mean.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Statistical Inferences for Variance Objectives: Learn to compare variance of a sample with variance of a population Learn to compare variance of a sample.
Inferential Statistics Psych 231: Research Methods in Psychology.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Testing for a difference
Chapter 9 Hypothesis Testing.
Hypothesis Testing: Hypotheses
Chapter 9 Hypothesis Testing.
Presentation transcript:

Statistical Techniques I EXST7005 Sample Size Calculation

The sample size formula n The Z-test and t-test use a similar formula.

The sample size formula (continued) n Lets suppose we know everything in the formula except n. Do we really? Maybe not, but we can get some pretty good estimates. Call the numerator (  Y -  0) a difference,  d. It is some mean difference we want to be able to detect, so  d =  Y -  0. The value  2 is a variance, the variance of the data that we will be sampling. We need this variance, or an estimate, S2.

The sample size formula (continued) n So we alter the formula to read.

The sample size formula (continued) What other values do we know? Do we know Z? No, but we know what Z we need to obtain significance. If we are doing a 2-tailed test, and we set  = 0.05, then Z will be n Any calculated value larger will be "more significant", any value smaller will not be significant. n So, if we want to detect significance at the 5% level, we can state that...

The sample size formula (continued) n We will get a significant difference if

The sample size formula (continued) n We square both sides and solve for n. Then we will also get a significant difference if

The sample size formula (continued) Then, if we have an idea of values for  d,  2, and Z, we can solve the formula for n.  If we are going to use a Z distribution we should have a known value of the variance (  2). If the variance is calculated from the sample, use the t distribution. n This would give us the sample size needed to obtain "significance", in accordance with whatever Z value is chosen.

Generic Example n Try an example where   d = 2   2 = 5 è Z = 1.96 n So what value of n would detect this difference with this variance and produce a value of Z equal to 1.96 (or greater)?  n  (Z2  2)/  d2 = (1.962 * 52)/22 = (25)/4 =  since n  24.01, round up to 25.

Generic Example (continued) Answer, n  25 would produce significant results. n Guaranteed? è Wouldn't this always produce significant results? Theoretically, within the limits of statistical probability of error, yes. But only IF THE DIFFERENCE WAS REALLY 2.  If the null hypothesis (no difference,  Y-  0=0) was really true and we took larger samples, then we would get a better estimate of 0, and may never show significance.

Considering Type II Error The formula we have seen contains only Z  /2 or t  /2, depending on whether we have  2 or S2. However, a fuller version can contain consideration of the probability of Type II error (  ). Remember that to work with  we need to know the mean of the real distribution. However, in calculating sample size we have a difference,  d =  Y -  0. So we can include consideration of type II error.

Considering Type II Error (continued)  error consideration would be done by adding another Z or t for the error rate. Notice that below I switch to t distributions and use S2.

Other examples n We have done a number of tests, some yielding significant results and others not. è If a test that yields significant results (showing a significant difference between the observed and hypothesized values), then we don't need to examine sample size because the sample was big enough. è However, some utility may be made of this information if we FAIL to reject the null hypothesis.

An example with t values and  error included n Recall the Rhesus monkey experiment. We hypothesized no effect of a drug, and with a sample size of 10 were unable to reject the null hypothesis. è However, we did observe a difference of +0.8 change in blood pressure after administering the drug. è What if this change was real? What if we made a Type II error? How large a sample would we need to test for a difference of 0.8 if we also wanted 90% power?

An example with t values and  error included (continued) So we want to know how large a sample we would need to get significance at the  =0.05 level if power was In this case  =0.10. To do this calculation we need a two tailed  and a one tailed  (we know that the change is +0.8). n We will estimate the variance from the sample so we will use the t distribution. However, since we don't know the sample size we don't know the d.f.!

An example with t values and  error included (continued) n So we will approximate to start with. n Given the information,   = 0.05 so t will be approximately 2   = 0.10 so t will be roughly 1.3   d =  Y-  0 = 0.8 from our previous results, and è S2 = from our previous results.

An example with t values and  error included (continued) n We do the calculations. And now we have an estimate of n and the degrees of freedom. n = 155 and d.f.=154. We can refine our values for t  /2 and t   for d.f. = 154, t  /2 = 1.97 approx.  for d.f. = 154, t  = approx.

An example with t values and  error included (continued) n So we redo the calculations with improved estimates. n A little improvement. If we saw much change in the estimate of n, we could recalculate as often as necessary. Usually 3 or 4 recalculations is enough.

Summary n We developed a formula for calculating sample size. n This formula can be adapted for either t or Z distributions.

Summary (continued) n We learned that n We need input values of  ,  ,  S2 (or  2)  and we need to know what difference we want to detect (  d).

Summary (continued) n We saw that for the t-test, the first calculation was only approximate since we didn't know the degrees of freedom and could not get the appropriate value of t. n However, after the initial calculation the estimate could be improved by iteratively recalculating the estimate of the value of n until it was stable.