The Search for Significance: A Practical Guide to

Slides:



Advertisements
Similar presentations
Multiple Regression W&W, Chapter 13, 15(3-4). Introduction Multiple regression is an extension of bivariate regression to take into account more than.
Advertisements

Vocabulary Lesson Unit Title Here. first vocabulary word.
Type Title Here for Tic-Tac-Toe Type names of students in group here.
Dante’s Inferno By: Dante AlighieriDante Alighieri.
AgendaWriting Prompt  Please pick up a lit book  Writing prompt-Have lit. texts out to be checked.  Finish Dante Introduction if we didn’t Friday 
PSY 307 – Statistics for the Behavioral Sciences
D ANTE ’ S I NFERNO The Circles of Hell. A CTIVATOR : P ERSONAL C ONNECTION INSTRUCTIONS: You have received nine post-it notes that you should number.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Statistics for the Social Sciences Psychology 340 Spring 2005 Analysis of Variance (ANOVA)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Statistics made simple Modified from Dr. Tammy Frank’s presentation, NOVA.
The Divine Comedy Dante Alighieri. The Divine Comedy Written between 1308 and 1321 Central epic poem of Italian literature Divided into three parts Inferno.
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
4.2 One Sided Tests -Before we construct a rule for rejecting H 0, we need to pick an ALTERNATE HYPOTHESIS -an example of a ONE SIDED ALTERNATIVE would.
Slide 23-1 Copyright © 2004 Pearson Education, Inc.
P-Hacking: A Practical Guide
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
The Scientific Method Physics.
Two Sample Problems  Compare the responses of two treatments or compare the characteristics of 2 populations  Separate samples from each population.
How do you know what you know?. How do you know what you know? 1)Maybe you can measure something directly. 2)You can interpret what you have measured.
Instructor Resource Chapter 5 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
1 Chapters 6-8. UNIT 2 VOCABULARY – Chap 6 2 ( 2) THE NOTATION “P” REPRESENTS THE TRUE PROBABILITY OF AN EVENT HAPPENING, ACCORDING TO AN IDEAL DISTRIBUTION.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 10. Hypothesis Testing II: Single-Sample Hypothesis Tests: Establishing the Representativeness.
CONSORT: Consolidated Standards of Reporting Trials Evidence-based, minimum set of recommendations for reporting clinical trials Rennie (JAMA) urged the.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Dante Alighieri By Derick and Tanner. -Born in florence italy -Exact date of birth is unknown but believed to be around Not much is known about.
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
How to read a scientific paper
Statistics (cont.) Psych 231: Research Methods in Psychology.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
Chi Squared Test. Why Chi Squared? To test to see if, when we collect data, is the variation we see due to chance or due to something else?
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
CHAPTER 2 Research Methods in Industrial/Organizational Psychology
Linear Models One-Way ANOVA. 2 A researcher is interested in the effect of irrigation on fruit production by raspberry plants. The researcher has determined.
Multiplying Common Fractions Multiplying next Using the Corn Bread © Math As A Second Language All Rights Reserved.
Happy New Year 2012 The top ten New Year’s Resolutions.
This tutorial will talk you through a very basic workbench queueing simulation. The queueing system modelled is of customers entering an infinite capacity.
Example 1 Writing Powers Write the product as a power and describe it in words. a. 44= to the second power, or 4 squared 9 to the third power,
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Inference About Means Chapter 23. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it’d be nice.
Essential Questions What is biology? What are possible benefits of studying biology? What are the characteristics of living things? Introduction to Biology.
2.16 A researcher with a sample of 50 individuals with similar education but differing amounts of training hypothesizes that hourly earnings, EARNINGS,
THE SCIENTIFIC METHOD: It’s the method you use to study a question scientifically.
Lec. 19 – Hypothesis Testing: The Null and Types of Error.
Scientific Method. OVERVIEW What is the Scientific Method? It’s a way to solve/explain a problem or natural phenomenon, while removing human bias and.
Practical Steps for Increasing Openness and Reproducibility Courtney Soderberg Statistical and Methodological Consultant Center for Open Science.
Inferential Statistics Psych 231: Research Methods in Psychology.
April Center for Open Fostering openness, integrity, and reproducibility of scientific research.
Practical Steps for Increasing Openness and Reproducibility Courtney Soderberg Statistical and Methodological Consultant Center for Open Science.
Simine Vazire UC DAVIS SPSP GETTING PAPERS ACCEPTED IN SOCIAL/PERSONALITY JOURNALS POST REPLICABILITY CRISIS.
Dante Alighieri ( ).
Why do so many researchers misreport p-values?
David Preregistration David
Physics 114: Lecture 13 Probability Tests & Linear Fitting
Methods of Science Chapter 1 Section 3.
Preregistration on the Open Science Framework
CHAPTER 2 Research Methods in Industrial/Organizational Psychology
Study Pre-Registration
The Resistible Rise of Questionable Research Practices
The Divine Comedy: Dante’s Inferno
The Inferno by Dante Alighieri.
Unit 15 Power Analysis and Statistical Validity
Section 3: Methods of Science
The Divine Comedy: Dante’s Inferno
Student #7 starts with Locker 7 and changes every seventh door
School of Psychology, Cardiff University
Methods of Science Chapter 1 Section 3.
Ms. Lindsey’s Kindergarten Class 11/1/16
Presentation transcript:

The Search for Significance: A Practical Guide to

The Nine Circles of Dante’s Inferno First Circle: Limbo Second Circle: Lust Third Circle: Gluttony Fourth Circle: Greed Fifth Circle: Anger Sixth Circle: Heresy Seventh Circle: Violence Eighth Circle: Fraud Ninth Circle: Treachery

The Nine Circles of Scientific Hell First Circle: Limbo Second Circle: Overselling Third Circle: Post-Hoc Storytelling Fourth Circle: P-Value Fishing Fifth Circle: Creative Use of Outliers Sixth Circle: Plagiarism Seventh Circle: Non-Publication of Data Eighth Circle: Partial Publication of Data Ninth Circle: Inventing Data

P-Fishing Fourth Circle: P-Value Fishing “Those who tried every statistical test in the book until they got a p value less than 0.05 find themselves here, an enormous lake of murky water. Sinners sit on boats and must fish for their food. Fortunately, they have a huge selection of different fishing rods and nets. Unfortunately, only one in 20 fish are edible, so they are constantly hungry.”

P-Fishing Also known as… ▫P-Hacking ▫Questionable Research Practices (QRPs)) ▫Torturing the data ▫Outcome reporting bias ▫Undisclosed flexibility ▫Researcher Degrees of Freedom ▫…and more.

Related to… Publication bias P-hacking is about using multiple methods or attempts to find a significant result. Publication bias is the tendency to only publish significant results. Both go hand in hand in practice. But each could, in theory, occur without the other.

P-Hacking Works! Collect some data

P-Hacking Works! Collect some data Try many statistical tests on the same data

P-Hacking Works! Collect some data Try many statistical tests on the same data Or try many variants of the same data (e.g. removing ‘outliers’.)

P-Hacking Works! Collect some data Try many statistical tests on the same data Or try many variants of the same data (e.g. removing ‘outliers’.) Or try looking at different variables within the dataset

P-Hacking Works! Collect some data Try many statistical tests on the same data Or try many variants of the same data (e.g. removing ‘outliers’.) Or try looking at different variables within the dataset Report the analyses that give the most favourable results (usually the lowest p-values).

“P-Hack the numbers, HARK the text” Hypothesizing After the Results Are Known

“P-Hack the numbers, HARK the text” Hypothesizing After the Results Are Known Allows any significant result to become an interesting, hypothesis-confirming finding

“P-Hack the numbers, HARK the text” Hypothesizing After the Results Are Known Allows any significant result to become an interesting, hypothesis-confirming finding HARKing is not to be confused with revising or rejecting hypotheses in the light of new data – which is essential (!)

“P-Hack the numbers, HARK the text” Hypothesizing After the Results Are Known Allows any significant result to become an interesting, hypothesis-confirming finding HARKing is not to be confused with revising or rejecting hypotheses in the light of new data – which is essential (!) Rather, HARKing means that hypotheses are never tested. The hypotheses are always “one step ahead” of the data.

And now a demonstration…

fMRI Simulator

Why P-Hacking Is So Effective There are many choices (‘researcher degrees of freedom’) in data analysis. For example, in a simple task-based fMRI data analysis, Joshua Carp found 7000 combinations of parameters (very conservative). Carp, J. (2012).On the plurality of (methodological) worlds: estimating the analytic flexibility of fMRI experiments Frontiers in Neuroscience Carp, J. (2012).On the plurality of (methodological) worlds: estimating the analytic flexibility of fMRI experiments Frontiers in Neuroscience

How To Spot It The p-curve… Simonsohn, U. Nelson, L. D. Simmons, J. P. (2013). P-curve: a key to the file-drawer. Journal of Exp. Psychol General Simonsohn, U. Nelson, L. D. Simmons, J. P. (2013). P-curve: a key to the file-drawer. Journal of Exp. Psychol General Try it now! Try it now!

Although it’s complicated “Publication bias and underpowered studies might be a bigger problem for science than inflated Type 1 error rates…”

The Root of the Problem

The Root of the Problem (and Fixes) Smulders YM (2013). A two-step manuscript submission process can reduce publication bias. Journal of Clinical Epidemiology Smulders YM (2013). A two-step manuscript submission process can reduce publication bias. Journal of Clinical Epidemiology

The Root of the Problem (and Fixes) Smulders YM (2013). A two-step manuscript submission process can reduce publication bias. Journal of Clinical Epidemiology Smulders YM (2013). A two-step manuscript submission process can reduce publication bias. Journal of Clinical Epidemiology Chambers CD (2013). Registered Reports: a new publishing initiative at Cortex Cortex Chambers CD (2013). Registered Reports: a new publishing initiative at Cortex Cortex

Happy