Presentation is loading. Please wait.

Presentation is loading. Please wait.

Dorothy V. M. Bishop University of

Similar presentations


Presentation on theme: "Dorothy V. M. Bishop University of"— Presentation transcript:

1 Dorothy V. M. Bishop University of Oxford @deevybee
Hacking a way through the garden of forking paths: A cause of poor reproducibility Dorothy V. M. Bishop University of Oxford @deevybee

2 This not new Failure to distinguish between hypothesis-testing and hypothesis-generating (exploratory) research -> misuse of statistical tests ‘If the processing of empirically obtained material has in any way an “exploratory character”, i.e. if the attempt to let the material speak leads to ad hoc decisions in terms of processing, as described above, then this precludes the exact interpretability of possible outcomes of statistical tests’. De Groot, 1956 de Groot, A. D. (2014). The meaning of “significance” for different types of research [translated and annotated by EJ Wagenmakers et al]. Acta Psychologica, 148,

3 Writing the Empirical Journal Article
Daryl J. Bem The Compleat Academic: A Practical Guide for the Beginning Social Scientist, 2nd Edition. Washington, DC: American Psychological Association, 2004. Which Article Should You Write? There are two possible articles you can write: (a) the article you planned to write when you designed your study or (b) the article that makes the most sense now that you have seen the results. They are rarely the same, and the correct answer is (b). re Data Analysis: Examine them from every angle. Analyze the sexes separately. Make up new composite indexes. If a datum suggests a new hypothesis, try to find additional evidence for it elsewhere in the data. If you see dim traces of interesting patterns, try to reorganize the data to bring them into bolder relief. If there are participants you don’t like, or trials, observers, or interviewers who gave you anomalous results, drop them (temporarily). Go on a fishing expedition for something— anything —interesting. “This book provides invaluable guidance that will help new academics plan, play, and ultimately win the academic career game.”

4 Large population database used to explore link between ADHD and handedness
1 contrast Probability of a ‘significant’ p-value < .05 = .05

5 Large population database used to explore link between ADHD and handedness
Focus just on Young subgroup: 2 contrasts at this level Probability of a ‘significant’ p-value < .05 = .10

6 Large population database used to explore link between ADHD and handedness
Focus just on Young on measure of hand skill: 4 contrasts at this level Probability of a ‘significant’ p-value < .05 = .19

7 Large population database used to explore link between ADHD and handedness
Focus just on Young, Females on measure of hand skill: 8 contrasts at this level Probability of a ‘significant’ p-value < .05 = .34

8 Large population database used to explore link between ADHD and handedness
Focus just on Young, Urban, Females on measure of hand skill: 16 contrasts at this level Probability of a ‘significant’ p-value < .05 = .56

9 Problem exacerbated because
Can now easily gather huge multivariate datasets Can easily do complex statistical analyses Problems with exploratory analyses that use methods that presuppose hypothesis-testing approach

10 Huge bias for type I error

11 Solutions a. Using simulated datasets to give insight into statistical methods

12 Correlation matrix: 8 random normal deviates

13 Correlation matrix: 8 random normal deviates

14 Correlation matrix: 8 random normal deviates

15 Correlation matrix: 8 random normal deviates

16 We must overcome our natural bias to over-interpret observed patterns

17 Another example: Multiway ANOVA Illustrated with field of ERP/EEG
Flexibility in analysis in terms of: Electrodes Time intervals Frequency ranges Measurement of peaks etc, etc Often see analyses with 4- or 5-way ANOVA (group x side x site x condition x interval) Standard stats packages correct p-values for N levels WITHIN a factor, but not for overall N factors and interactions . Cramer AOJ, et al Hidden multiplicity in exploratory multiway ANOVA: Prevalence and remedies. Psychonomic Bulletin & Review 23:

18

19 Solutions b. Distinguish exploration from hypothesis-testing analyses
Subdivide data into exploration and replication sets. Or replicate in another dataset

20 Solutions c. Preregistration of analyses

21 Solutions for journals
More emphasis on methodological rigour Publish well-powered null findings/replications

22 Solutions for institutions
Problems caused by employers Reward research reproducibility over impact factor in evaluation Consider ‘bang for your buck’ rather than amount of grant income Reward those who adopt open science practices Solutions for institutions Marcia McNutt Science 2014 • VOL 346 ISSUE 6214 "This is Dr Bagshaw, discoverer of the infinitely expanding research grant“ ©Cartoonstock

23 Solutions for funders Problems caused by funders Promote data-sharing; require that all data are reported Fund well-designed replications of key findings Encourage pre-registration More scrutiny of methods – can’t assume competence

24

25 Resources


Download ppt "Dorothy V. M. Bishop University of"

Similar presentations


Ads by Google