Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.

Slides:



Advertisements
Similar presentations
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Advertisements

Chapter 9 Hypothesis Testing Understandable Statistics Ninth Edition
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Significance Tests About
AP Statistics – Chapter 9 Test Review
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 9_part I ( and 9.7) Tests of Significance.
Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Chapter 9 Hypothesis Testing.
Inference about Population Parameters: Hypothesis Testing
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Statistical hypothesis testing – Inferential statistics I.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Experimental Statistics - week 2
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
BPS - 3rd Ed. Chapter 141 Tests of Significance: The Basics.
Comparing Two Population Means
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Significance Toolbox 1) Identify the population of interest (What is the topic of discussion?) and parameter (mean, standard deviation, probability) you.
Essential Statistics Chapter 131 Introduction to Inference.
INTRODUCTION TO INFERENCE BPS - 5th Ed. Chapter 14 1.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
STA Lecture 251 STA 291 Lecture 25 Testing the hypothesis about Population Mean Inference about a Population Mean, or compare two population means.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
CHAPTER 9 Testing a Claim
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
AP Statistics Section 11.1 B More on Significance Tests.
9.3/9.4 Hypothesis tests concerning a population mean when  is known- Goals Be able to state the test statistic. Be able to define, interpret and calculate.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
BPS - 5th Ed. Chapter 151 Thinking about Inference.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
WS 2007/08Prof. Dr. J. Schütze, FB GW KI 1 Hypothesis testing Statistical Tests Sometimes you have to make a decision about a characteristic of a population.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
+ Testing a Claim Significance Tests: The Basics.
+ Homework 9.1:1-8, 21 & 22 Reading Guide 9.2 Section 9.1 Significance Tests: The Basics.
Unit 5: Hypothesis Testing
CHAPTER 9 Testing a Claim
Hypothesis Testing: Hypotheses
Chapter 9 Hypothesis Testing.
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Basic Practice of Statistics - 3rd Edition Introduction to Inference
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Hypothesis Testing Hypothesis Testing Topic 11

Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question like: “Is the mean glucose level of infants small-for-gestational age (SGA) the same as those for normal infants?” i.e.  SGA =  normal ?

Hypothesis Testing “Is the mean systolic blood pressure of people taking calcium supplement the same as those in the general population?” i.e.  calcium =  gen ? “Is the mean systolic pulse rate of people taking a stress test the same as those in the control group?” i.e.  stress =  control ?

Null and alternative hypothesis Null Hypothesis H 0 is usually a statement of no effect or no difference. The aim of testing is to assess the strength of evidence presented in the data against the null hypothesis. Alternative Hypothesis H 1 is the hypothesis that we hope or suspect to be true instead of H 0 in order to demonstrate the presence of some physical phenomenon or the usefulness of a treatment (e.g., new drug better than old)

Asymmetric treatment of H 0 and H 1 H 0 is usually the more conservative hypothesis (no change, no effect, etc) which is assumed to be true until sufficient evidence is obtained to warrant its rejection H 1 is usually the scientifically more interesting hypothesis that we hope or suspect is true (e.g., new drug better than existing drug, smokers more at risk) The burden of proof is placed on H 1. Substantial supporting evidence is required before it will be accepted The consequence of wrongly rejecting H 0 (Type I error) is considered more severe than that of wrongly accepting H 0 (Type II error)

Type I and Type II errors Type I error Correct action Type II error No difference (H 0 is correct) Difference exists (H 0 is incorrect) True situation Conclusion from Hypothesis test Do not reject H 0 Reject H 0

An analogy

Another analogy

The 4 steps to hypothesis testing 1. Formulate H o and H Select an appropriate test statistic which is usually some kind of standardized difference between the estimated and hypothesized value or between the expected and observed. 3. Use the null distribution (the sampling distribution of the test statistic under H o ) to calculate the probability due to chance alone of getting a difference larger than or equal to that actually observed in the data. This probability is called the p-value. 4. Draw the appropriate conclusion in the context of the medical problem Note: A small p-value means it is difficult to attribute the observed difference to chance alone and this can be taken as evidence against the null hypothesis of no difference.

How small is small?

Some people prefer to report the p-value rather than using terms like “accept/reject” the hypothesis at level The acceptance/rejection rule is too rigid and exaggerate the difference between, say, p-value = and More informative to report the p-value. Suppose we are told that H o is rejected at level 0.05, we wouldn’t know whether H o will still be rejected at level The p-value gives us the whole story, e.g., if p-value = 0.012, then we know the hypothesis will be rejected at level 0.05, 0.02 but not 0.01.

Birthweight example revisited Suppose a random sample of 50 Malay male livebirths gave a sample mean birthweight of 3.55kg and a sample standard deviation of 0.92 kg. Question of interest: What is the likelihood that the mean birth-weight from the sampled population (ie all Malay male livebirths) is the same as the mean birth-weight of all male livebirths in the general population which is known to be 3.27kg, after taking into consideration sampling error?

2. Define and compute test statistic The 4 steps to hypothesis testing 1. Formulate hypotheses Null Hypothesis H o :  =  0 = 3.27 kg Alternative Hypothesis H 1 :  3.27 kg Difference between observed sample mean and hypothesized value = = 0.28 kg Test statistic = T = standardized difference =.28/.13 = 2.15 ^

Distribution of the standardized difference under the null hypothesis ^

Two-sided t test 3. Calculate p-value T = standardized difference Observed T = 2.15 More extreme difference means |T|>2.15. Now H 1 is 2-sided, so p-value = Pr( |T| >= 2.15 | H 0 ) = Pr( | t 49 | >= 2.15) = ^

4. Interpretation of results There is only a 3.6% chance of observing a standardized difference of magnitude 2.15 or more if the random sample of 50 Malay males had come from a population with the same mean birthweight as that of the general population. This difference is statistically significant as it cannot be reasonably attributed to chance. Thus there is strong evidence that the mean birthweight for male Malay livebirth is different from the general population.

An equivalent way to test H 0 :  =  0 at level 0.05 is to see if  0 is contained in a 95% C.I. for . A 95% C.I. for the mean Malay male birthweight was calculated earlier as (3.29kg, 3.81kg). Since the general population mean 3.27kg is not in this C.I., we reach the same conclusion of rejecting H 0 :  =3.27 at level The 95% C.I. (3.29, 3.81) tells us more. It tells us that we can reject  =3.27 but not  =3.3 at level In fact, the 95% C.I. consists of all those  0 such that H 0 :  =  0 cannot be rejected at level 0.05 by the 2-sided t-test. Relationship between C.I. and hypothesis testing Recommendation: Report both, p-value and C. I.

One-sided vs Two-sided Alternatives One-sided alternative is appropriate when we can anticipate a priori the direction of the difference or when we are primarily interested in detecting difference in one direction only, e.g., mean glucose level lower for SGA infants, new drug better than old. If H 1 is 1-sided, the test should also be 1-sided, i.e., reject H o only for large difference in the same direction as indicated by the alternative. For example, if H 1 :  >  0, then p-value=Pr(T>observed t | H o )=(1/2) p-value for 2-sided test Two-sided alternatives are conventionally used because, most of the time, we are not sure of the direction of the difference.

Example of a one-sided test

What to do with non-normal data? If sample size is large, the t-test/C.I. is still approximately valid by virtue of the Central Limit Theorem Transform the data so that the transformed data is closer to normally distributed Use nonparametric or distribution- free procedures

How large is large? n >= 15 should suffice if the underlying distribution is not too skewed and there are no outliers A smaller sample size may suffice if the data are closed to normally distributed n >= 40 should suffice even for fairly skewed populations