Hypothesis Testing.

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

Chapter 7 Hypothesis Testing
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Inference Sampling distributions Hypothesis testing.
Statistical Issues in Research Planning and Evaluation
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
8-2 Basics of Hypothesis Testing
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Richard M. Jacobs, OSA, Ph.D.
Inferential Statistics
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Inferential Statistics
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Lecture Slides Elementary Statistics Twelfth Edition
Overview Definition Hypothesis
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 8-1 Review and Preview.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Overview Basics of Hypothesis Testing
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Slide Slide 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim about a Proportion 8-4 Testing a Claim About.
Chapter 8 Introduction to Hypothesis Testing
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Hypothesis Testing PowerPoint Prepared by Alfred.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Agresti/Franklin Statistics, 1 of 122 Chapter 8 Statistical inference: Significance Tests About Hypotheses Learn …. To use an inferential method called.
CHAPTER 17: Tests of Significance: The Basics
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
9.3/9.4 Hypothesis tests concerning a population mean when  is known- Goals Be able to state the test statistic. Be able to define, interpret and calculate.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
1 Section 8.2 Basics of Hypothesis Testing Objective For a population parameter (p, µ, σ) we wish to test whether a predicted value is close to the actual.
Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.
Chapter 13 Understanding research results: statistical inference.
Created by Erin Hodgess, Houston, Texas Section 7-1 & 7-2 Overview and Basics of Hypothesis Testing.
Today: Hypothesis testing p-value Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is.
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Hypothesis Tests Hypothesis Tests Large Sample 1- Proportion z-test.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Slide Slide 1 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
+ Homework 9.1:1-8, 21 & 22 Reading Guide 9.2 Section 9.1 Significance Tests: The Basics.
Lecture Slides Elementary Statistics Twelfth Edition
FINAL EXAMINATION STUDY MATERIAL III
Unit 5: Hypothesis Testing
Review and Preview and Basics of Hypothesis Testing
P-value Approach for Test Conclusion
Presentation transcript:

Hypothesis Testing

A decision-making process for evaluating claims about a population Based on information obtained from samples Hypotheses are made a priori before the experiment was conducted Usually tests a prediction that some kind of effect exists in the population A method for scientists to test the scientific questions they generate

Usually conjecture about a population parameter A numerical value that characterizes some aspect of a population Can never be completely sure that a hypothesis is true; instead, we work with probabilities Generally, we will calculate the probability that results we have obtained in an experiment occurred by chance We are studying whether the results we see are a random occurrence or due to some mechanism

Testing a Statistical Hypothesis Steps: Define the population to study State the hypothesis that will be investigated Give a statistical level Select a sample from the population Collect the data Calculate statistical test based on sample data Draw a conclusion Testing a Statistical Hypothesis

Two types of hypothesis Null Hypothesis Usually written as H0 (H-knot) A statement that there is no difference between a population parameter and a specific value (or no difference between two parameters) Alternative Hypothesis (Research Hypothesis) Usually written as HA A statement that there is a difference between a population parameter and a specific value (or there is a difference between two parameters) Two types of hypothesis

One and Two Tailed Tests A hypothesis test that is directional is called a one-tailed test Examples: Older than age 21 H0: μ = 21 versus HA: μ > 25 Treatment A will do better than treatment B H0: μA = μB versus HA: μA > μB One and Two Tailed Tests

One and Two Tailed Tests A hypothesis test that is non-directional is called a two-tailed test Examples: The population mean age is 21 H0: μ = 21 versus HA: μ ≠ 25 Treatment A has the same effect as treatment B H0: μA = μB versus HA: μA ≠ μB One and Two Tailed Tests

One and Two Tailed Tests Notice the statement of equality – a statement of no effect – is written as the null hypothesis Preview: If we reject the null hypothesis, this is the same thing as saying have detected an effect One and Two Tailed Tests

Significance Level of a Statistical Test Significance level is the probability we use as our criterion for an unlikely outcome, supposing that H0 is true Denoted by α (alpha) Must be decided a priori, this means that α must be determined before conducting a statistical test Choosing α after the data as been analyzed lacks objectivity Significance Level of a Statistical Test

Significance Level of a Statistical Test α is our criterion for rejecting the null hypothesis Reflects how careful the researcher wishes to be The smaller the α that is specified, the stronger the evidence needed to reject H0 Scientific and medical literature commonly test hypotheses with α levels of 0.05 or 0.01 Significance Level of a Statistical Test

Calculating a Statistical Test Remember, statistical inference uses sample statistics to make inferences about population parameters We use sample data to calculate a test statistic A test statistic is a univariate statistic (a number) Sample data is used to calculate the test statistic Test statistic is used in hypothesis testing to make decisions To use the test statistic, we have to determine its sampling distribution under the null hypothesis (assuming the null hypothesis is true) Calculating a Statistical Test

Calculating a Statistical Test Once we know the distribution of the test statistic, we use this to calculate its p-value Each time we calculate a test statistic, we will couple it with a corresponding p-value In the literature, statistical test results are often presented in the form of (test statistic, p-value) Examples: (t=-1.98, p=0.04) (t=2.5, p<0.013) (χ(2)=12.32, p<0.001) (F=6.17, p=0.0001) Calculating a Statistical Test

Calculating a Statistical Test Examples: (t=-1.98, p=0.04) (t (28)=2.5, p<0.013) (χ(2)=12.32, p<0.001) (F=6.17, p=0.0001) In these examples, the letter denotes the sampling distribution of the test statistic For example: t denotes a t-distribution t(28) denotes a t-distribution with 28 degrees of freedom χ(2) denotes a Chi-squared distribution with 2 degrees of freedom F denotes a F-distribution Calculating a Statistical Test

Calculating a Statistical Test The p-value is calculated using the test statistic and its sampling distribution To calculate the p-value, we calculate the area under the curve for specific values of the test statistic and its sampling distribution We will focus on how to use a p-value to make decisions about statistical significance Calculating a Statistical Test

p - values On the slides that follow, we: Define a p-value Describe its properties Clarify what it does and does not tell us Following this, we will then go on to Step 7 of Hypothesis Testing In Step 7, we will talk about Significance Testing Significance Testing describes how the p-value is used to make decision p - values

p-values Let’s not confuse The definition of a p-value The process of using a p-value in making a decision p-values

A p-value is the probability of obtaining the same sample statistic (mean value) or a more extreme value if the null hypothesis is true The p-value is the most commonly reported result of a significance test It enables us to judge the extent of the evidence against H0 Definition of p-value

Properties of a p-value Ranges from 0 to 1 Summarizes the evidence in the data about H0 A large p-value (e.g., 0.58) indicates the observed data would not be unusual if H0 were true A small p-value (e.g., 0.0003) indicates the observed data would be very doubtful if H0 were true Properties of a p-value

P (A | B) = “the probability of A given B” No Is P(A|B) = P(B|A) ? Example; A = dramatic overdose of medication B = death P(A|B): Given a person dies, what is the probability of death due to overdose? (this probability will likely be very low) P(B|A): Given a person overdoses, what is the probability of this person dying? (this probability will likely be very high) Is P(A|B) = P(B|A) here? p-value

Properties of p-values A p-value does not give information about trend, direction, strength of an association, size of an effect, or magnitude of a difference p=0.04 is not less significant than p=0.0000000001 Statistical significance is influenced by the sample size With a large enough sample size anything can be statistically significant When reporting p-values, footnotes such as *p<.05, **p<0.01 suggest a trend and are misleading Properties of p-values

Properties of p-values A smaller p-value does not indicate a more important result Magnitude of the p-value is not a guide to clinical significance p-values do not take Into account the size of the effect A small effect in a large study can have the same p-value as a large effect In a small one Conclusions should not be based only on p-values p=0.06 is not evidence of 'marginal significance' or proof of a 'trend towards significance'; these types of conclusions are untrue Properties of p-values

Properties of p-values p-value does give the probability the result observed is due to chance The p-value does give the chance of obtaining the effect we have observed, assuming the null hypothesis is true (in other words, assuming there is no real effect) Whenever a p-value is reported, it is recommended to also report a measure of effect size Confidence Interval Correlation coefficient Regression parameter Etc. Properties of p-values

We have not discussed how the p-value is used in making a decision Significance Testing describes the process for using the p-value to make a decision Next, we talk about Step 7; drawing conclusions and making inferences We need to discuss the decision we are making, and possible errors in making that decision p-values

Errors of Inference In hypothesis testing, we make a decision Only two possible outcomes We decide to either Reject H0 Fail to reject H0 Errors in inference describes each of these two possible incorrect decisions Errors of Inference

Hypothesis Testing Language "Accepting the null hypothesis" is NOT the same conclusion as "Failing to reject the null hypothesis" Hypothesis Testing Language

Hypothesis Testing Language If we do not reject the null hypothesis We do not conclude it is true We can only recognize the null hypothesis is a possibility Showing the null hypothesis is true is not the same thing as failing to reject it Failing to find an effect is not the same thing as showing there is no effect Hypothesis Testing Language

Errors of Inference These two errors are given specific names: Type I Error We draw a conclusion that H0 is false when it is in fact true Occurs when we conclude there is an effect in our population, when in fact there is not Type II Error We draw a conclusion that H0 is true when it is in fact false The probability of a type II error is denoted by the Greek letter β ('beta’) In other words, P(Type II Error)= β Errors of Inference

We draw a conclusion that H0 is false when it is in fact true Occurs when we conclude there is an effect in our population, when in fact there is not Example 1: Incorrectly conclude an HIV prevention intervention is effective (Reject H0, conclude HA is true) in preventing HIV infections through behavioral changes when, in fact, it is not (H0 Is actually true) Example 2: Incorrectly conclude a new chemo agent is effective (Reject H0, conclude HA is true) in reducing tumor size when, in fact, it is not (H0 Is actually true) Type I Error

We draw a conclusion that H0 is true when it is In fact false The probability of a type II error Is denoted by the Greek letter β ('beta') Example 1: Incorrectly conclude an HIV prevention intervention is not effective (Fall to reject H0, conclude HA is false) in preventing HIV infections through behavioral changes when, in fact, it is (HA is actually true) Example 2: Incorrectly conclude a new chemo agent is not effective (Fall to reject H0, conclude HA is false) in reducing tumor size when, in fact, It is (HA is actually true) Type II Error

Type I and Type II Errors Decision Null Hypothesis True False Fail to Reject H0 Correct Type II Error (False Negative) Reject H0 Type I Error (False Positive) False Positive: We reject H0 and conclude there is an effect (Positive; Alternative Hypothesis) when, in fact, there is not an effect (False Conclusion) False Negative: We fail to reject H0 and conclude there is not an effect (Negative; Null Hypothesis) when, in fact, there is an effect (False Conclusion) Type I and Type II Errors

This describes the process of using the p-value to make a decision We compare the calculated p-value from Step 6 to the predetermined significance level (α) from Step 3 If p < α, we reject H0 and conclude the result is statistically significant If p > α, we fail to reject H0, and conclude we have failed to find statistical significance Significance Testing

Conclusion: Significance Essential to understand the difference between statistical and clinical significance Statistical Significance - the likelihood that the difference could have occurred by chance alone Clinical Significance - the smallest clinically beneficial and harmful values of the effect; in other words, the smallest values that matter to the patient Conclusion: Significance

A research article gives a p-value of. 001 in the analysis section A research article gives a p-value of .001 in the analysis section. Which definition of a p-value is the most accurate? the probability that the observed outcome will occur again. the probability of observing an outcome as extreme or more extreme than the one observed if the null hypothesis is true. the value that an observed outcome must reach in order to be considered significant under the null hypothesis. the probability that the null hypothesis is true. Question

Reference: Dr. Matt Hayat Statistics Lecture – Rutgers University 2013 Plichta SB, Kelvin E. (2012) Munro’s Statistical Methods for Healthcare Research, 6th Edition