Rick Hoyle Duke Dept. of Psychology & Neuroscience

Slides:



Advertisements
Similar presentations
Overview of Inferential Statistics
Advertisements

Reliability and Validity
1 G Lect 2a G Lecture 2a Thinking about variability Samples and variability Null hypothesis testing.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Non-Experimental designs: Developmental designs & Small-N designs
Non-Experimental designs: Developmental designs & Small-N designs
Meta-analysis & psychotherapy outcome research
Non-Experimental designs
Measurement, Quantification and Analysis Some Basic Principles.
The Characteristics of an Experimental Hypothesis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Statistical Computing
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
 Internal Validity  Construct Validity  External Validity * In the context of a research study, i.e., not measurement validity.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Reliability, Validity, and Bias. Reliability Reliability Reliability is the extent to which an experiment, test, or any measuring procedure yields the.
 Descriptive Methods ◦ Observation ◦ Survey Research  Experimental Methods ◦ Independent Groups Designs ◦ Repeated Measures Designs ◦ Complex Designs.
Chapter 16 Social Statistics. Chapter Outline The Origins of the Elaboration Model The Elaboration Paradigm Elaboration and Ex Post Facto Hypothesizing.
Replication in Prevention Science Valentine, et al.
URBDP 591 A Lecture 16: Research Validity and Replication Objectives Guidelines for Writing Final Paper Statistical Conclusion Validity Montecarlo Simulation/Randomization.
April Center for Open Fostering openness, integrity, and reproducibility of scientific research.
The Scientific Method. Scientifically Solving a Problem Observe Define a Problem Review the Literature Observe some More Develop a Theoretical Framework.
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
Prior beliefs Prior belief is knowledge that one has about a parameter of interest before any events have been observed – For example, you may have an.
Chapter 8: Inferences Based on a Single Sample: Tests of Hypotheses
Ch. 2: Planning a Study.
More on Inference.
Measurement, Quantification and Analysis
Confidence Intervals.
Selecting the Best Measure for Your Study
Hallmarks of scientific research
Unit 5: Hypothesis Testing
Inference and Tests of Hypotheses
Question 1: What is the baseline of high power?
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Analysis by: Kevin Gallagher (PPG Industries) Sept 26, 2017
CHAPTER 2 Research Methods in Industrial/Organizational Psychology
Meta-Analytic Thinking
Communicating Methods, Results, and Intentions in Empirical Research
Summarising and presenting data - Univariate analysis continued
Hypothesis Tests for a Population Mean in Practice
More on Inference.
CHAPTER 26: Inference for Regression
Fundamentals of Statistics
Experiments and Quasi-Experiments
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
Devil physics The baddest class on campus IB Physics
Discussions and Conclusions
Statistical Inference about Regression
Reliability, Validity, and Bias
Experiments and Quasi-Experiments
Research in Psychology
Hypothesis Testing.
Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations Greenland et al (2016)
Lecture: statistical significance.
Project collaborators’ names
Sampling and Power Slides by Jishnu Das.
Psych 231: Research Methods in Psychology
CHAPTER 10 Comparing Two Populations or Groups
Thinking critically with psychological science
Inferential Statistics
Positive analysis in public finance
CS639: Data Management for Data Science
Non-Experimental designs
Managerial Decision Making and Evaluating Research
Chapter 4 Summary.
Alternative Scenarios and Related Techniques
Meta-analysis, systematic reviews and research syntheses
What to look at in fire engineering analysis
The end of statistical significance
Presentation transcript:

Rick Hoyle Duke Dept. of Psychology & Neuroscience rhoyle@duke.edu Is Observation Oriented Modeling a Way Forward for the Behavioral Sciences? Rick Hoyle Duke Dept. of Psychology & Neuroscience rhoyle@duke.edu

Crisis of confidence in psychological science 2011 – Stapel suspended for fabricating data used in research publications leading to 58 retractions 2011 – Bem’s paper reported findings from 9 experiments seeming to show pre-cognition 2012 – John et al. publish findings from survey or researchers showing that more than half routinely engage in questionable research practices 2011 – Bakker & Wicherts reported widespread statistical reporting errors, some of which, if corrected, would change the inference

Are the prevailing research methods capable of yielding an authentic science of psychology? [I]s significance testing the appropriate tool for evaluating data? Is the randomized controlled experiment sufficient for determining causality? Are parametric statistics appropriate for the attributes studied by psychologists? Grice (2011, p. 2)

Problems to be solved concerns about pervasive Type 1 errors uncertainty about reproducibility of effects concerns about validity of measurement dearth of translatable research outcomes that could inform policy and practice It’s time for something new!

OOM is an appealing alternative to the status quo more than just an attempt to fix what is broken shifts the focus from population focused group-level patterns and comparisons and variable based data modeling to observation-focused methods What might a specific person think or feel or do under certain circumstances? “. . . observation oriented modeling shifts the focus of analysis away from computed aggregates such as means and variances onto the observations themselves.” (Grice, 2011, p. 45)

How do we get there from here? things to stop doing NHST treating measures as if they were continuous (or even interval scaled) testing hypotheses using summary statistics inferring from sample to population

How do we get there from here? things to start doing using techniques that do not make untenable assumptions about samples and measures addressing generalizability and reproducibility concerns through replication and theory-development rather than sampling and p-values aligning hypotheses and methods to focus directly on what individuals (rather than populations) do designing studies that involve intensive repeated assessments

The worst-case scenario is clearly dire: It is plausible that inattention to nonergodicity and a lack of group-to-individual generalizability threaten the veracity of countless studies, conclusions, and best- practice recommendations. . . . even in the best-case scenario, we should not think of a correlation in group data as an estimate that generalizes to any given individual in the population. Stated bluntly, this implies that the temptation to use aggregate estimates to draw inferences at the basic unit of social and psychological organization—the person—is far less accurate or valid than it may appear in the literature. Fisher et al. (2018)

Practical matters How receptive are journals likely to be to the OOM approach? How would findings from observation-oriented research be synthesized? Is there a place for “big data” in the behavioral sciences given the concerns that OOM addresses? Is it time to abandon classical statistics in behavioral science research?