PSY 1950 Null Hypothesis Significance Testing September 29, 2008.

Slides:



Advertisements
Similar presentations
Inferential Statistics
Advertisements

Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Inference Sampling distributions Hypothesis testing.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Null Hypothesis Significance Testing What the heck have we been doing this whole time?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
8-2 Basics of Hypothesis Testing
Stat 217 – Day 15 Statistical Inference (Topics 17 and 18)
BCOR 1020 Business Statistics
Null Hypothesis Signficance Testing
Probability Population:
Inferential Statistics
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
The problem of sampling error in psychological research We previously noted that sampling error is problematic in psychological research because differences.
Statistical hypothesis testing – Inferential statistics I.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Overview of Statistical Hypothesis Testing: The z-Test
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Overview Definition Hypothesis
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Introduction to Hypothesis Testing for μ Research Problem: Infant Touch Intervention Designed to increase child growth/weight Weight at age 2: Known population:
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Hypothesis Testing (Statistical Significance). Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample.
7 Elementary Statistics Hypothesis Testing. Introduction to Hypothesis Testing Section 7.1.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Chapter 8 Introduction to Hypothesis Testing
P Values - part 4 The P value and ‘rules’ Robin Beaumont 10/03/2012 With much help from Professor Geoff Cumming.
Chapter 9 Power. Decisions A null hypothesis significance test tells us the probability of obtaining our results when the null hypothesis is true p(Results|H.
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
No criminal on the run The concept of test of significance FETP India.
Chapter 10 AP Statistics St. Francis High School Fr. Chris.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Unit 8 Section 8-1 & : Steps in Hypothesis Testing- Traditional Method  Hypothesis Testing – a decision making process for evaluating a claim.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
Chapter 3: Statistical Significance Testing Warner (2007). Applied statistics: From bivariate through multivariate. Sage Publications, Inc.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
© Copyright McGraw-Hill 2004
Inferential Statistics Inferential statistics allow us to infer the characteristic(s) of a population from sample data Slightly different terms and symbols.
Introduction to Hypothesis Testing
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Psych 230 Psychological Measurement and Statistics Pedro Wolf October 21, 2009.
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Learning Objectives Describe the hypothesis testing process Distinguish the types of hypotheses Explain hypothesis testing errors Solve hypothesis testing.
Logic of Hypothesis Testing
Statistical inference
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Testing Hypotheses I Lesson 9.
Presentation transcript:

PSY 1950 Null Hypothesis Significance Testing September 29, 2008

vs

Finite Population Correction Factor SEM and central limit theorem calculations are based on sampling with replacement from idealized, infinite populations Real-life research involves sampling without replacement from actual, finite populations When n/N<.05, this doesn’t matter When n/N>.05, use a correction factor:

Controversy of NHST “backbone of psychological research” –Gerrig & Zimbardo (2002, p. 42) “a potent but sterile intellectual rake who leaves in his merry path a long train of ravished maidens but no viable scientific offspring” –Meehl (1967, p. 265) “…surely the most bone-headedly misguided procedure ever institutionalized in the rote training of science students” –Rozeboom (1997, p. 335)

NHST example (z-test) 1.State the null and alternative hypotheses H 0 : µ infant = 26 lbs H 1 : µ infant  26 lbs 2.Set the criteria for a decision  =.05 |z| ≥ Collect data and compute sample statistics M infant = 30 lbs with n = 16 and  = 4 z = (M - µ)/  M = ( )/1 = 4 4.Make a decision Reject H 0

NHST Errors   Type III error?

Power The probability of correctly rejecting a false null hypothesis = 1 - 

 =.05 “It is convenient to take this point as a limit in judging whether a deviation is to be considered significant or not. Deviations exceeding twice the standard deviation are thus formally regard as significant” –Fisher (1925, p. 47) Historical roots prior to Fisher’s definition Corresponds to subjective demarcation of chance from non-chance events "... surely, God loves the.06 nearly as much as the.05” –(Rosnow & Rosenthal, 1989)

NHST Rationale Why try to reject null hypothesis? –Philosophical: Popperian falsifiability accept H 1 : the projector occasionally malfunctions reject H 0 : the projector always works –Practical: defining the sampling distribution H 1 : the projector failure rate = ? H 0 : the projector failure rate = 0%

History of NHST Fisher’s (1925) NHST: 1.Set up null hypothesis (not necessarily) 2.Report exact significance 3.Only do this when you know little else Neyman & Pearson (1950) 1.Set up two competing hypothesis, H 1 and H 2, and make a priori decisions about  and  2.If data falls into rejection region of H1, accept H2; otherwise accept H1. Acceptance  belief. 3.Only do this when you have a disjunction of hypotheses (either H 1 or H 2 is true) Current NHST (according to some): 1.Set up null hypothesis as nil hypothesis 2.Reject null at p<.05 and accept your hypothesis 3.Always do this

Criticisms of NHST Affirming the consequent –If P then Q. Q. Therefore P. The straw person argument –Tukey (1991): “It is foolish to ask ‘Are the effects of A and B different?’ They are always different—for some decimal place”(p. 100) –“Statistical significance does not necessarily imply practical significance!” The replication fallacy –If you conduct an experiment that results in p =.05 (two-tailed), what is the chance that a replication of that experiment will produce a statistically significant (p<.05) effect? 50% (see Cumming, 2008, Appendix B) “Confusion of the inverse” –“absence of proof is not proof of absence” –“presence of proof is not proof of presence”

Affirming the Consequent NHST commits logical fallacy –NHST: If the null hypothesis is correct, then these data are highly unlikely These data have occurred Therefore, the null hypothesis is highly unlikely –Analog: If a person is an American, then he is probably not a member of Congress This person is a member of Congress Therefore, he is probably not an American Response: Science progresses through testing its predictions –Logic may be flawed, but success is hard to deny

The Straw Person Argument Often null hypothesis = nil hypothesis –The nil hypothesis is always (or almost always) false The “crud factor” in correlational research (Meehl, 1990) The “princess and the pea” effect in experimental research –If the null hypothesis is always false, how does rejecting it increase knowledge? Response: effect size matters, statistical significance is not practical significance, test interactions

Replication Fallacy p-values don’t say much about replicability, yet most everyone thinks they do –Replication is NOT 1 -  (Tversky & Kahneman, 1971) Response: p-values inform replicability, just less than one might think –All else equal, the lower the p-value, the higher the replicability

“Confusion of the Inverse” Criticism: NHST calculates the probability of obtaining the data given a hypothesis, p(D|H 0 ), not the probability of obtaining a hypothesis given the data, p(H 0 |D) –A p-value of.05 does NOT necessarily indicate that the null hypothesis is unlikely to be true Response: logically faulty but productive inferences is better than nothing –p(D|H 0 ) approximates p(H 0 |D) under typical experimental settings where p(H 0 ) is low, i.e., p(H 1 ) > p(H 0 ) –p(H 0 |D) varies monotonically with p(D|H 0 ) p(H 0 |D) –When p(H 0 ) =.35, p(H 0 |D) =.35 –p(D|H 0 ) and p(H 0 |D) are correlated (r =.38)

NHST gives p(D|H 0 ) not p(H 0 |D)

Reconciliation “Inductive inference cannot be logically justified, but they can be defended pragmatically” (Krueger, 2001) Use NHST mindfully –“There is no God-given rule about when and how to make up your mind in general.” Hays (1973, p. 353) Don’t rely exclusively on p-values

Alternatives to p-values Effect size –Meta-analysis Confidence intervals p rep