Hypothesis Testing Quantitative Methods in HPELS 440:210.

Slides:



Advertisements
Similar presentations
Statistical Issues in Research Planning and Evaluation
Advertisements

Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
Review: What influences confidence intervals?
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Hypothesis testing & Inferential Statistics
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
8-2 Basics of Hypothesis Testing
Chapter 9 Hypothesis Testing.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
BCOR 1020 Business Statistics
PSY 307 – Statistics for the Behavioral Sciences
Statistics for Managers Using Microsoft® Excel 5th Edition
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Getting Started with Hypothesis Testing The Single Sample.
Inferential Statistics
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Chapter 8 Introduction to Hypothesis Testing. Hypothesis Testing Hypothesis testing is a statistical procedure Allows researchers to use sample data to.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Hypothesis Testing:.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Lecture Slides Elementary Statistics Twelfth Edition
Overview Definition Hypothesis
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Introduction to Hypothesis Testing for μ Research Problem: Infant Touch Intervention Designed to increase child growth/weight Weight at age 2: Known population:
Mid-semester feedback In-class exercise. Chapter 8 Introduction to Hypothesis Testing.
Introduction to Biostatistics and Bioinformatics
Chapter 8 Introduction to Hypothesis Testing
Single-Sample T-Test Quantitative Methods in HPELS 440:210.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Week 8 Chapter 8 - Hypothesis Testing I: The One-Sample Case.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 8: Introduction to Hypothesis Testing. 2 Hypothesis Testing An inferential procedure that uses sample data to evaluate the credibility of a hypothesis.
1 Statistical Inference. 2 The larger the sample size (n) the more confident you can be that your sample mean is a good representation of the population.
Chapter 8 Introduction to Hypothesis Testing
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Chapter 9 Power. Decisions A null hypothesis significance test tells us the probability of obtaining our results when the null hypothesis is true p(Results|H.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
HYPOTHESIS TESTING. Statistical Methods Estimation Hypothesis Testing Inferential Statistics Descriptive Statistics Statistical Methods.
Hypothesis Testing A procedure for determining which of two (or more) mutually exclusive statements is more likely true We classify hypothesis tests in.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Related Samples T-Test Quantitative Methods in HPELS HPELS 6210.
Sampling Distribution (a.k.a. “Distribution of Sample Outcomes”) – Based on the laws of probability – “OUTCOMES” = proportions, means, test statistics.
Created by Erin Hodgess, Houston, Texas Section 7-1 & 7-2 Overview and Basics of Hypothesis Testing.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Chapter 9 Introduction to the t Statistic
Lecture Slides Elementary Statistics Twelfth Edition
Hypothesis Testing Is It Significant?.
Review: What influences confidence intervals?
Quantitative Methods in HPELS HPELS 6210
Presentation transcript:

Hypothesis Testing Quantitative Methods in HPELS 440:210

Agenda Introduction Hypothesis Testing  General Process Errors in Hypothesis Testing One vs. Two Tailed Tests Effect Size and Power Instat Example

Introduction  Hypothesis Testing Recall:  Inferential Statistics: Calculation of sample statistic to make predictions about population parameter  Two potential problems with samples: Sampling error Variation between samples  Infinite # of samples  predictable pattern  sampling distribution Normal µ = µ M  M =  /√n

Introduction  Hypothesis Testing Common statistical procedure Allows for comparison of means General process: 1. State hypotheses 2. Set criteria for decision making 3. Collect data  calculate statistic 4. Make decision

Introduction  Hypothesis Testing Remainder of presentation will use following concepts to perform a hypothesis test:  Z-score  Probability  Sampling distribution

Agenda Introduction Hypothesis Testing  General Process Errors in Hypothesis Testing One vs. Two Tailed Tests Effect Size and Power Instat Example

General Process of HT Step 1: State hypotheses Step 2: Set criteria for decision making Step 3: Collect data and calculate statistic Step 4: Make decision

Step 1: State Hypotheses Two types of hypotheses: 1. Null Hypothesis (H 0 ): 2. Alternative Hypothesis (H 1 ):  Directional  Non-directional Only one can be true Example 8.1, p 223

Assume the following about 2-year olds: µ = 26  = 4  M =  /√n = 1 n = 16 Researchers want to know if extra handling/stimulation of babies will result in increased body weight once the baby reaches 2 years of age

Null Hypothesis: H 0 : Sample mean = 26 Alternative Hypothesis: H 1 : Sample mean ≠ 26

Assume that this distribution is the “TRUE” representation of the population Recall: If an INFINITE number of samples are taken, the SAMPLING DISTRIBUTION will be NORMAL with µ = µ M and will be identical to the population distribution Reality: Only ONE sample will be chose What is the probability of choosing a sample with a mean (M) that is 1, 2, or 3 SD above or below the mean (µ M )? µMµM

µMµM µMµM µMµM p(M > µ M + 1  ) = 15.87%p(M > µ M + 2  ) = 2.28% p(M > µ M + 3  ) = 0.13%

It is much more PROBABLE that our sample mean (M) will fall closer to the mean of the means (µ M ) as well as the population mean (µ) µMµM Inferential statistics is based on the assumption that our sample is PROBABLY representative of the population

Our sample could be here,or here, but we assume that it is here! µMµM

H 0 : Sample mean = 26 If true (no effect): 1.) It is PROBABLE that the sample mean (M) will fall in the middle 2.) It is IMPROBABLE that the sample mean (M) will fall in the extreme edges H 1 : Sample mean ≠ 26 If true (effect): 1.) It is PROBABLE that the sample mean (M) will fall in the extreme edges 2.) It is IMPROBABLE that the sample mean (M) will fall in the middle

Assume that M = 30 lbs (n = 16) µ = 26M = 30 Accept or reject? H 0 : Sample mean = 26 What criteria do you use to make the decision?

Step 2: Set Criteria for Decision A sampling distribution can be divided into two sections:  Middle: Sample means likely to be obtained if H 0 is accepted  Ends: Sample means not likely to be obtained if H 0 is rejected Alpha (  ) is the criteria that defines the boundaries of each section

Step 2: Set Criteria for Decision Alpha:  AKA  level of significance Ask this question:  What degree of certainty do I need to reject the H 0 ? 90% certain:  = % certain:  = % certain:  = 0.01

Step 2: Set Criteria for Decision As level of certainty increases:   decreases  Middle section gets larger  Critical regions (edges) get smaller Bottom line: A larger test statistic is needed to reject the H 0

Step 2: Set Criteria for Decision Directional vs. non- directional alternative hypotheses Directional:  H1: M > or < X Non-directional:  H1: M ≠ X Which is more difficult to reject H 0 ?

Step 2: Set Criteria for Decision Z-scores represent boundaries that divide sampling distribution Non-directional:   = 0.10 defined by Z = 1.64   = 0.05 defined by Z = 1.96   = 0.01 defined by Z = 2.57 Directional:   = 0.10 defined by Z = 1.28   = 0.05 defined by Z = 1.64   = 0.01 defined by Z = 2.33

Critical Z-Scores  Non-Directional Hypotheses 90% 95% 99% Z=1.64 Z=1.96 Z=2.58 Z=1.64 Z=1.96 Z=2.58

Critical Z-Scores  Directional Hypotheses Z=1.28 Z=1.64 Z= % 95% 99%

Step 2: Set Criteria for Decision Where should you set alpha?  Exploratory research  0.10  Most common  0.05  0.01 or lower?

Step 3: Collect Data/Calculate Statistic Z = M - µ /  M where:  M = sample mean  µ = value from the null hypothesis H 0 : sample = X   M =  /√n Note: Population  must be known otherwise the Z-score is an inappropriate statistic!!!!!

Example 8.1  Continued Step 3: Collect Data/Calculate Statistic

Assume the following about 2-year olds: µ = 26  = 4  M =  /√n = 1 n = 16 Researchers want to know if extra handling/stimulation of babies will result in increased body weight once the baby reaches 2 years of age M = 30 Z = M - µ /  M Z = 30 – 26 / 1 Z = 4 / 1 = 4.0

Process: 1. Draw a sketch with critical Z-score  Assume non-directional  Alpha = Plot Z-score statistic on sketch 3. Make decision Step 4: Make Decision

µ = 26 M = 30 Z = 4.0 Step 1: Draw sketch Critical Z-score Z = 1.96 Critical Z-score Z = 1.96 Step 3: Make Decision: Z = 4.0  falls inside the critical region If H 0 is false, it is PROBABLE that the Z-score will fall in the critical region ACCEPT OR REJECT? Step 2: Plot Z-score

Agenda Introduction Hypothesis Testing  General Process Errors in Hypothesis Testing One vs. Two Tailed Tests Effect Size and Power Instat Example

Errors in Hypothesis Testing Recall  Problems with samples:  Sampling error  Variability of samples Inferential statistics use sample statistics to predict population parameters There is ALWAYS chance for error

Errors in Hypothesis Testing There is potential for two kinds of error: 1. Type I error 2. Type II error

Type I Error Rejection of a true H 0 Recall  alpha = certainty of rejecting H 0  Example: Alpha = % certain of correctly rejecting the H 0 Therefore 5% certain of incorrectly rejecting the H 0 Alpha maybe thought of as the “probability of making a Type I error Consequences:  False report  Waste of time/resources

Type II Error Acceptance of a false H 0 Consequences:  Not as serious as Type I error  Researcher may repeat experiment if type II error is suspected

Agenda Introduction Hypothesis Testing  General Process Errors in Hypothesis Testing One vs. Two Tailed Tests Effect Size and Power Instat Example

One vs. Two-Tailed Tests One-Tailed (Directional) Tests:  Specify an increase or decrease in the alternative hypothesis  Advantage: More powerful  Disadvantage: Prior knowledge required

One vs. Two-Tailed Tests Two-Tailed (Non-Directional) Tests:  Do not specify an increase or decrease in the alternative hypothesis  Advantage: No prior knowledge required  Disadvantage: Less powerful

Agenda Introduction Hypothesis Testing  General Process Errors in Hypothesis Testing One vs. Two Tailed Tests Effect Size and Power Instat Example

Statistical Software  p-value The p-value is the probability of a type I error Recall alpha (  ) Recall  Step 4: Make a Decision

If the p-value >   accept the H 0  Probability of type I error is too high  Researcher is not “comfortable” stating that differences are real and not due to chance If the p-value <   reject the H 0  Probability of type I error is low enough  Researcher is “comfortable” stating that differences are real and not due to chance

Statistical vs. Practical Significance Distinction: 1. Statistical significance: There is an acceptably low chance of a type I error 2. Practical significance: The actual difference between the means are not trivial in their practical applications

Practically Significant? Knowledge and experience Examine effect size  The magnitude of the effect Examples of measures of effect size:  Eta-squared (  2 )  Cohen’s d R2R2 Interpretation of effect size:  0.0 – 0.2 = small effect  0.21 – 0.8 = moderate effect  > 0.8 = large effect Examine power of test

Statistical Power Statistical power: The probability that you will correctly reject a false H 0 Power = 1 –  where   = probability of type II error Example: Statistical power = 0.80 therefore:  80% chance of correctly rejecting a false H 0  20% of accepting a false H 0 (type II error)

Researcher Conclusion Accept H 0 Reject H 0 Reality About Test No real difference exists Correct Conclusion Type I error Real difference exists Type II error Correct Conclusion

Statistical Power What influences power? 1. Sample size: As n increases, power increases - Under researcher’s control 2. Alpha: As  increases,  decreases therefore power increases - Under researcher’s control (to an extent) 3. Effect size: As ES increases, power increases - Not under researcher’s control

Statistical Power How much power is desirable? General rule: Set  as 4*  Example:   = 0.05, therfore  = 4*0.05 = 0.20  Power = 1 –  = 1 – 0.20 = 0.80

Statistical Power What if you don’t have enough power?  More subjects What if you can’t recruit more subjects and you want to prevent not having enough power?  Estimate optimal sample size a priori  See statistician with following information: Alpha Desired power Knowledge about effect size  what constitutes a small, moderate or large effect size relative to your dependent variable

Statistical Power Examples: 1. Novice athlete improves vertical jump height by 2 inches after 8 weeks of training 2. Elite athlete improves vertical jump height by 2 inches after 8 weeks of training

Agenda Introduction Hypothesis Testing  General Process Errors in Hypothesis Testing One vs. Two Tailed Tests Instat Example

Instat Type data from sample into a column.  Label column appropriately. Choose “Manage” Choose “Column Properties” Choose “Name” Choose “Statistics”  Choose “Simple Models” Choose “Normal, One Sample” Layout Menu:  Choose “Single Data Column”

Instat Data Column Menu:  Choose variable of interest. Parameter Menu  Choose “Mean, Known Variance (z-interval)”  Enter known SD or variance value. Confidence Level:  90% = alpha 0.10  95% = alpha 0.05

Instat Check “Significance Test” box:  Check “Two-Sided” if using non-directional hypothesis.  Enter value from null hypothesis. What population value are you basing your sample comparison? Click OK. Interpret the p-value!!!

Agenda Introduction Hypothesis Testing  General Process Errors in Hypothesis Testing One vs. Two Tailed Tests Instat Example

Example (p 246) Researchers want to investigate the effect of prenatal alcohol on birth weight in rats  Independent variable?  Dependent variable? Assume:  µ = 18 g   = 4  n = 16   M =  /√n = 4/4 = 1  M = 15 g

Step 1: State hypotheses (directional or non-directional) H 0 : µ alcohol = 18 g H 1 : µ alcohol ≠ 18 g Step 2: Set criteria for decision making Alpha (  ) = 0.05 Step 3: Sample data and calculate statistic Z = M - µ /  M Z = 15 – 18 / 1 = -3.0

Step 4: Make decision Does Z-score fall inside or outside of the critical region? Accept or reject? Statistical Software: p-value = 0.02  Accept or reject? p-value = 0.15  Accept or reject?

Homework Problems: 3, 5, 6, 7, 8, 11, 21