Interpreting Null Hypothesis Significance Tests (NHSTs) Joe Stevens, Ph.D. Educational Psychology 120 Simpson Hall 277-4203,

Slides:



Advertisements
Similar presentations
Probability models- the Normal especially.
Advertisements

THE ART (AND SCIENCE?) OF INTERPRETING TABLES. BACKGROUND READINGS Pollock, Essentials, chs. 3-4.
Statistical vs. Practical Significance
Using and Reporting Measures of Effect Size Roger E. Kirk Department of Psychology & Neuroscience Baylor University.
Null Hypothesis Significance Testing What the heck have we been doing this whole time?
One-Way BG ANOVA Andrew Ainsworth Psy 420 Obtained from Way%20BG%20ANOVA.ppt.
Modeling in Undergraduate Statistics Michael M Granaas Psychology University of South Dakota.
LECTURE 8 HYPOTHESIS TESTING AND STATISTICAL SIGNIFICANCE MSc (Addictions) Addictions Department.
When Measurement Models and Factor Models Conflict: Maximizing Internal Consistency James M. Graham, Ph.D. Western Washington University ABSTRACT: The.
Statistics. Overview 1. Confidence interval for the mean 2. Comparing means of 2 sampled populations (or treatments): t-test 3. Determining the strength.
Understanding Research Results. Effect Size Effect Size – strength of relationship & magnitude of effect Effect size r = √ (t2/(t2+df))
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
American Psychological Association’s Task Force on Statistical Inference TFSI 1999.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
STATISTICS & HYPOTHESIS TESTING PRESENTATION Tiesha Amison, Lorraine Lorentz, Vicky Mintz, Diana Morris and Trish Zimmerer BSHS/382 November 24, 2014 Angela.
Chapter 9 Hypothesis Testing.
Chapter 12 Inferring from the Data. Inferring from Data Estimation and Significance testing.
Simple Linear Regression Analysis
Rosnow, Beginning Behavioral Research, 5/e. Copyright 2005 by Prentice Hall Ch. 12: Statistical Significance, Effect Size, and Power Analysis.
The problem of sampling error in psychological research We previously noted that sampling error is problematic in psychological research because differences.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Statistics for the Behavioral Sciences Second Edition Chapter 11: The Independent-Samples t Test iClicker Questions Copyright © 2012 by Worth Publishers.
AP STATISTICS LESSON COMPARING TWO PROPORTIONS.
Examining Relationships in Quantitative Research
What’s in the Bag? Lecture 4 Section Wed, Sep 7, 2005.
On teaching statistical inference: What do p values (not) mean? Bruce Blaine, PhD, PStat® Department of Mathematical and Computing Sciences St. John Fisher.
Inferential Statistics Body of statistical computations relevant to making inferences from findings based on sample observations to some larger population.
United States Military Academy Duty, Honor, Country 1 Assessment, Evaluation, & Statistics: A Brief Overview WISTPC 2014 Florida International University.
S-012 Testing statistical hypotheses The CI approach The NHST approach.
Essential Statistics Chapter 141 Thinking about Inference.
AP STATISTICS LESSON INFERENCE FOR A POPULATION PROPORTION.
Chapter 9 Introduction to the t Statistic. 9.1 Review Hypothesis Testing with z-Scores Sample mean (M) estimates (& approximates) population mean (μ)
Section A Confidence Interval for the Difference of Two Proportions Objectives: 1.To find the mean and standard error of the sampling distribution.
Module 3: Research in Psychology Learning Objectives What is the scientific method? How do psychologist use theory and research to answer questions of.
Chapter 3: Statistical Significance Testing Warner (2007). Applied statistics: From bivariate through multivariate. Sage Publications, Inc.
Scientific Method Probability and Significance Probability Q: What does ‘probability’ mean? A: The likelihood that something will happen Probability.
 Descriptive Methods ◦ Observation ◦ Survey Research  Experimental Methods ◦ Independent Groups Designs ◦ Repeated Measures Designs ◦ Complex Designs.
1 CHAPTER 4 CHAPTER 4 WHAT IS A CONFIDENCE INTERVAL? WHAT IS A CONFIDENCE INTERVAL? confidence interval A confidence interval estimates a population parameter.
9.3/9.4 Hypothesis tests concerning a population mean when  is known- Goals Be able to state the test statistic. Be able to define, interpret and calculate.
CHAPTER 16: Inference in Practice ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Innovative Teaching Article (slides with auxiliary information: © 2014) James W. Grice Oklahoma State University Department of Psychology.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 3 – Slide 1 of 27 Chapter 11 Section 3 Inference about Two Population Proportions.
Sample Size Mahmoud Alhussami, DSc., PhD. Sample Size Determination Is the act of choosing the number of observations or replicates to include in a statistical.
AP Statistics Section Statistical significance is valued because it points to an effect that is __________ to occur simply by chance. Carrying out.
AP STATISTICS LESSON 11 – 1 (DAY 2) The t Confidence Intervals and Tests.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Chapter 9 Introduction to the t Statistic
Chapter 8  INFERENTIAL STATISTICS: CORRELATION
Correlation and Linear Regression
Lecture 9-I Data Analysis: Bivariate Analysis and Hypothesis Testing
Statistical power Learning outcomes
Chapter 6  PROBABILITY AND HYPOTHESIS TESTING
Hypothesis Testing.
Methods of Observation
Research Methods With Statistics 8-10% of AP Exam
Calculating Sample Size: Cohen’s Tables and G. Power
Effect size measures for single-case designs: General considerations
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
Chapter 6 Making Sense of Statistical Significance: Decision Errors, Effect Size and Statistical Power Part 1: Sept. 18, 2014.
Richard S. Balkin, Ph.D., LPC
Lecture 4 Section Wed, Jan 19, 2005
CHAPTER 10 Comparing Two Populations or Groups
Correlation A measure of the strength of the linear association between two numerical variables.
Primer of statistics in dental research: Part I
Comparing Two Proportions
Presentation transcript:

Interpreting Null Hypothesis Significance Tests (NHSTs) Joe Stevens, Ph.D. Educational Psychology 120 Simpson Hall , Presentation available at: © Stevens, 2002

Which study is “stronger”? Study A: t (398) = 2.30, p =.022 Study B: t (88) = 2.30, p =.024 Examples inspired by Rosenthal & Gaito (1963).

Answer: Study B ω 2 for Study A =.0106 ω 2 for Study B =.0455

Which study is “stronger”? Study P: F(2, 697) = 5.30, p = Study Q: F(2, 97) = 5.30, p =.00654

Answer: Study Q  2 for Study P =.0150  2 for Study Q =.0985

Which is “significant”/important? Study T: F = 63.62, p < Study U: F = 5.40, p =.0486

 2 for Study T =.01, N = 6,300  2 for Study U =.40, N = 10 Study T is a very small effect, Study U is a much larger effect Significance”/importance cannot be determined statistically

Significance Test Results = Effect Size X Size of Study t = X

Significance Test Results = Effect Size X Size of Study t = X

Significance Test Results = Effect Size X Size of Study t = d X

Significance Test Results = Effect Size X Size of Study F = Xdf

Significance Test Results = Effect Size X Size of Study F = X

Interpreting Statistical Results NHST results (t, F, p, etc.) answer the question: Is there a relationship between variables (yes/no)? Effect size (d, g) and strength of association measures (r 2, ω 2, η 2 ) answer the question: How strong is the relationship?

NHST results should always be accompanied by estimation of effect size; report both Estimation of statistical power is also advisable, especially when NHST results are not statistically significant Use of Confidence Intervals can also aid proper interpretation of NHST results

Bibliography Cohen, J. (1994). The earth is round (p <.05). American Psychologist, 49, 997– Harlow, L. L. Mulaik, S. A., & Steiger, J. H. (1997). What if there were no significance tests? Hillsdale, NJ: Erlbaum. Rosenthal & Gaito (1963). The interpretation of levels of significance by psychological researchers. Journal of Psychology, 55, Rosenthal, R. & Rosnow, R. L. (1991). Essentials of behavioral research (2 nd Ed.). New York: McGraw-Hill, Inc. Thompson, B. (1996). AERA editorial policies regarding statistical significance testing: Three suggested reforms. Educational Researcher, 25 (2), 26– 30. Wilkinson, L. & Task Force on Statistical Inference (1999). Statistical Methods in Psychology Journals: Guidelines and Explanations, American Psychologist, 54 (8), 594–604. [available at: