Chapter 8: Introduction to Hypothesis Testing. 2 Hypothesis Testing An inferential procedure that uses sample data to evaluate the credibility of a hypothesis.

Slides:



Advertisements
Similar presentations
The t Test for Two Independent Samples
Advertisements

1/2/2014 (c) 2001, Ron S. Kenett, Ph.D.1 Parametric Statistical Inference Instructor: Ron S. Kenett Course Website:
Introduction to Hypothesis Testing
Lecture 8: Hypothesis Testing
Feichter_DPG-SYKL03_Bild-01. Feichter_DPG-SYKL03_Bild-02.
© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Introductory Mathematics & Statistics for Business
Author: Julia Richards and R. Scott Hawley
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
STATISTICS HYPOTHESES TEST (I)
STATISTICS HYPOTHESES TEST (II) One-sample tests on the mean and variance Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National.
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
UNITED NATIONS Shipment Details Report – January 2006.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Chapter 7 Sampling and Sampling Distributions
REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.
You will need Your text Your calculator
Department of Engineering Management, Information and Systems
On Comparing Classifiers : Pitfalls to Avoid and Recommended Approach
The t-distribution William Gosset lived from 1876 to 1937 Gosset invented the t -test to handle small samples for quality control in brewing. He wrote.
Introduction to Hypothesis Testing
HYPOTHESIS TESTING.
6. Statistical Inference: Example: Anorexia study Weight measured before and after period of treatment y i = weight at end – weight at beginning For n=17.
Detection Chia-Hsin Cheng. Wireless Access Tech. Lab. CCU Wireless Access Tech. Lab. 2 Outlines Detection Theory Simple Binary Hypothesis Tests Bayes.
Chapter 16 Goodness-of-Fit Tests and Contingency Tables
Chi-Square and Analysis of Variance (ANOVA)
Hypothesis Tests: Two Independent Samples
Chapter 4 Inference About Process Quality
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
Student’s t test This test was invented by a statistician working for the brewer Guinness. He was called WS Gosset ( ), but preferred to keep.
Module 17: Two-Sample t-tests, with equal variances for the two populations This module describes one of the most utilized statistical tests, the.
McGraw-Hill, Bluman, 7th ed., Chapter 9
Please enter data on page 477 in your calculator.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
Putting Statistics to Work
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Statistical Inferences Based on Two Samples
© The McGraw-Hill Companies, Inc., Chapter 10 Testing the Difference between Means and Variances.
Analyzing Genes and Genomes
Hypothesis Testing Variance known?. Sampling Distribution n Over-the-counter stock selling prices calculate average price of all stocks listed [  ]calculate.
© The McGraw-Hill Companies, Inc., Chapter 12 Chi-Square.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Essential Cell Biology
Chapter Thirteen The One-Way Analysis of Variance.
Ch 14 實習(2).
Chapter 8 Estimation Understandable Statistics Ninth Edition
Intracellular Compartments and Transport
PSSA Preparation.
Experimental Design and Analysis of Variance
Essential Cell Biology
Testing Hypotheses About Proportions
Part 13: Statistical Tests – Part /37 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of.
Multiple Regression and Model Building
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
4/4/2015Slide 1 SOLVING THE PROBLEM A one-sample t-test of a population mean requires that the variable be quantitative. A one-sample test of a population.
Adapted by Peter Au, George Brown College McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited.
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Chapter 8 Introduction to Hypothesis Testing. Hypothesis Testing Hypothesis testing is a statistical procedure Allows researchers to use sample data to.
Chapter 8 Introduction to Hypothesis Testing
Chapter 8: Introduction to Hypothesis Testing. 2 Hypothesis Testing An inferential procedure that uses sample data to evaluate the credibility of a hypothesis.
Chapter 8 Introduction to Hypothesis Testing
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Introduction to Hypothesis Test – Part 2
Introduction to Hypothesis Testing
Presentation transcript:

Chapter 8: Introduction to Hypothesis Testing

2 Hypothesis Testing An inferential procedure that uses sample data to evaluate the credibility of a hypothesis about a population.

3 Before and after treatment comparisons µ = 18 µ = ?  = 4 Known population before treatment Unknown population after treatment TreatmentTreatment

4 Hypothesis Testing Step 1: State hypothesis Step 2: Set criteria for decision Step 3: Collect sample data Step 4: Evaluate null hypothesis Step 5: Conclusion

5 Infant handling problem We know that the average GRE score for college seniors is µ = 500 and  = 100. A researcher is interested in effect of the Kaplan course on GRE scores. A random sample of 100 college seniors is selected and take the Kaplan GRE Training course. Afterwards, each is given the GRE exam. Average scores after training are 525 for the sample of 100 students.

6 Before and after treatment comparisons µ = 500 µ = ?  =100 Known population before treatment Unknown population after treatment TreatmentTreatment

7 Hypothesis testing example problem µ = 500 g  = 100 g Step 1: Ho:Ho: µ = 500 GRE after Kaplan Course GRE after Kaplan Course ( There is no effect of the Kaplan training course on average GRE scores ) H1:H1: µ  500 (There is an effect…)  = 0.05 Step 2: Set criteria z Critical Region z > 1.96 z < or Step 3:n = 100X = 525 Step 4: Reject H o because Z obt of 2.5 is in the critical region. Step 5: Conclusion. The Kaplan training course significantly increased GRE scores on average, z = 2.5, p <.05.

8 Evaluating Hypotheses Reject H o Middle 95% Very Unlikely X µ z

What is the effect of alcohol on fetal development? 9

10 Before and after treatment comparisons µ = 18 µ = ?  = 4 Known population before treatment Unknown population after treatment TreatmentTreatment

11 Hypothesis testing example problem µ = 18 g  = 4 g Step 1: Ho:Ho:µ = 18 g Weight of rats of alcoholic mothers (There is no effect of alcohol on the average birth weight of new born rat pups) H1:H1: µ  18 g (There is an effect…)  = 0.01 Step 2: Set criteria z Critical Region z > 2.58 z < or Step 3:n = 25 ratsX = 15.5 g Step 4: Reject H o because Z obt of is in the critical region. Step 5: Conclusion: Alcohol significantly reduced mean birth weight of newborn rat pups, z = 3.125, p <.01.

12 Evaluating Hypotheses Reject H o Middle 99% Very Unlikely X µ z

13 The distribution of sample means (all possible experimental outcomes) if the null hypothesis is true Reject H o Extreme values (probability < alpha) if H o is true Possible, but “very unlikely”, outcomes

14 Normal curve with different alpha levels z  =.05  =.01  =.001

15 Reject H o Null Hypothesis Rejection areas on distribution curve p >  p < 

16 Hypothesis testing example problem What if the sample mean does not indicate a large enough change to reject the null? Step 1: Ho:Ho: µ = 500 GRE after Kaplan Course GRE after Kaplan Course ( There is no effect of the Kaplan training course on average GRE scores ) H1:H1: µ  500 (There is an effect…)  = 0.05 Step 2: Set criteria z Critical Region z > 1.96 z < or Step 3:n = 100X = 515 Step 4: Retain H o because Z obt of 1.5 is not in the critical region. Step 5: Conclusion. There was no effect of the Kaplan training on GRE scores on average, z = 1.5, p >.05.

17 Hypothesis testing example problem What if the sample mean is not in the critical region? Step 1: Ho:Ho:µ = 18 g Weight of rats of alcoholic mothers (There is no effect of alcohol on the average birth weight of new born rat pups) H1:H1: µ  18 g (There is an effect…)  = 0.01 Step 2: Set criteria z Critical Region z > 2.58 z < or Step 3:n = 25 ratsX = 17g Step 4: Retain H o because Z obt of is not in the critical region. Step 5: Conclusion: There was no effect of alcohol on the average birth weight of newborn rat pups, z = 1.25, p >.01.

18 Hypothesis testing procedure 1.State hypothesis and set alpha level 2.Locate critical region e.g.z > | 1.96 | z > 1.96 or z < Obtain sample data and compute test statistic 4. Make a decision about the H o 5. State the conclusion

19 Error Type / Correct Decision table Type I Error Type II Error Decision Correct Reject H o Retain H o Experimenter’s Decision Actual Situation No Effect, H o True Effect Exists, H o False

20 Jury’s Verdict vs. Actual Situation Type I Error Type II Error Verdict Correct Guilty Innocent Jury’s Verdict Actual Situation Did Not Commit Crime Committed Crime

21 Fixed coin (cheating) vs. OK coin (fair) Type I Error Type II Error Correct Decision Coin Fixed (Cheating) Coin O.K. (Fair) Your Decision Actual Situation Coin O.K. (Fair) Coin Fixed (Cheating)

22 Assumptions for Hypothesis Tests with z-scores: 1.Random sampling 2.Value of  unchanged by treatment 3.Sampling distribution normal 4.Independent observations

One-Tailed Hypothesis Tests 23

24 Reading Improvement Problem A researcher wants to assess the “miraculous” claims of improvement made in a TV ad about a phonetic reading instruction program or package. We know that scores on a standardized reading test for 9-year olds form a normal distribution with µ = 45 and  = 10. A random sample of n = 25 8-year olds is given the reading program package for a year. At age 9, the sample is given the standardized reading test.

25 Normal curve with rejection area on the positive side Reject H o Data indicates that H o is wrong 45 µ z X

26 Two-tailed vs. One-tailed Tests 1.In general, use two-tailed test 2.Journals generally require two-tailed tests 3.Testing H o not H 1 4.Downside of one-tailed tests: what if you get a large effect in the unpredicted direction? Must retain the H o

27 Error and Power Type I error =  –Probability of a false alarm Type II error =  –Probability of missing an effect when H o is really false Power = 1 -  –Probability of correctly detecting effect when H o is really false

28 Factors Influencing Power (1 -  ) 1.Sample size 2.One-tailed versus two-tailed test 3.Criterion (  level) 4.Size of treatment effect 5.Design of study

29 Distributions demonstrating power at the.05 and.01 levels  Treatment Distribution Null Distribution  =.05 Reject H o X  Treatment Distribution Null Distribution  =.01 Reject H o X 1 -  Reject H o Reject H o 1 -  µ = 180 µ = 200 (a) (b)

30 Distributions demonstrating power with different size n’s  Treatment Distribution Null Distribution n = 25 Reject H o X 1 -  Reject H o µ = 180µ = 200 (a)  Treatment Distribution Null Distribution n = 100 Reject H o X 1 -  (b) Reject H o

31 Large distribution demonstrating power  Treatment Distribution Null Distribution Reject H o X 1 -  Reject H o µ 180 µ 200

32 Large distribution demonstrating power (with closer means)  Treatment Distribution Null Distribution Reject H o X 1 -  Reject H o µ 190 µ 200

33 Frequency distributions Frequency Frequency X X

34 Are birth weights for babies of mothers who smoked during pregnancy significantly different? µ = 2.9 kg  = 2.9 kg Random Sample:n = , 2.0, 2.2, 2.8, 3.2, 2.2, 2.5, 2.4, 2.4, 2.1, 2.3, 2.6, 2.0, 2.3

35 The distribution of sample means if the null hypothesis is true (all the possible outcomes) Sample means close to H o : high-probability values if H o is true µ from H o Extreme, low-probability values if H o is true Extreme, low-probability values if H o is true

36 Distribution of sample means based on percentages Middle 95%: high-probability values if H o is true µ from H o Reject H o z = z = 1.96 Critical Region: Extreme 5%