Communicating Methods, Results, and Intentions in Empirical Research

Slides:



Advertisements
Similar presentations
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Advertisements

Statement of the Problem Goal Establishes Setting of the Problem hypothesis Additional information to comprehend fully the meaning of the problem scopedefinitionsassumptions.
Introduction, Acquiring Knowledge, and the Scientific Method
Chapter One: The Science of Psychology
Inferential Statistics
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Chapter One: The Science of Psychology. Ways to Acquire Knowledge Tenacity Tenacity Refers to the continued presentation of a particular bit of information.
T tests comparing two means t tests comparing two means.
CHAPTER 18: Inference about a Population Mean
The Scientific Method in Psychology.  Descriptive Studies: naturalistic observations; case studies. Individuals observed in their environment.  Correlational.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
4 Hypothesis & Testing. CHAPTER OUTLINE 4-1 STATISTICAL INFERENCE 4-2 POINT ESTIMATION 4-3 HYPOTHESIS TESTING Statistical Hypotheses Testing.
Chapter 3: Statistical Significance Testing Warner (2007). Applied statistics: From bivariate through multivariate. Sage Publications, Inc.
 Descriptive Methods ◦ Observation ◦ Survey Research  Experimental Methods ◦ Independent Groups Designs ◦ Repeated Measures Designs ◦ Complex Designs.
CHAPTER 2 Research Methods in Industrial/Organizational Psychology
The Scientific Method: Terminology Operational definitions are used to clarify precisely what is meant by each variable Participants or subjects are the.
Paired Samples and Blocks
Chapter Eight: Quantitative Methods
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
T tests comparing two means t tests comparing two means.
How Psychologists Do Research Chapter 2. How Psychologists Do Research What makes psychological research scientific? Research Methods Descriptive studies.
Copyright © 2009 Pearson Education, Inc. Chapter 25 Paired Samples and Blocks.
Chapter 9 Introduction to the t Statistic
Introduction to Marketing Research
Logic of Hypothesis Testing
Writing a sound proposal
CHAPTER 9 Testing a Claim
Confidence Intervals.
Selecting the Best Measure for Your Study
Paired Samples and Blocks
Chapter 23 Comparing Means.
Chapter 25: Paired t-Test
CHAPTER 10 Comparing Two Populations or Groups
Unit 5: Hypothesis Testing
Paired Samples and Blocks
Warm Up Check your understanding P. 586 (You have 5 minutes to complete) I WILL be collecting these.
Chapter 2 Simple Comparative Experiments
CHAPTER 2 Research Methods in Industrial/Organizational Psychology
CHAPTER 10 Comparing Two Populations or Groups
© 2011 The McGraw-Hill Companies, Inc.
Null Hypothesis Testing
Chapter Eight: Quantitative Methods
Stat 217 – Day 28 Review Stat 217.
Paired Samples and Blocks
CHAPTER 9 Testing a Claim
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Discussions and Conclusions
SCIENCE AND ENGINEERING PRACTICES
Paired Samples and Blocks
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 9 Testing a Claim
Paired Samples and Blocks
Psych 231: Research Methods in Psychology
Statistical Data Analysis
CHAPTER 9 Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Paired Samples and Blocks
CHAPTER 9 Testing a Claim
CS639: Data Management for Data Science
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 18: Inference about a Population Mean
CHAPTER 9 Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Presentation transcript:

Communicating Methods, Results, and Intentions in Empirical Research Jessica Hullman Assistant Professor University of Washington steveh.co/panel

Research as a conversation Developing good communication habits is critical to success. WITH EACH OTHER WITH OURSELVES --If we think about pipeilne – from initial research to define question, to communicating our results, to years after we published a finding … --Develop good communication habits - Communicating methods, results, intentions steveh.co/panel

COMMUNICATING METHODS steveh.co/panel

What analysis details are critical to provide? Explain your experiment design using standard terms before getting into details Why did you choose this statistical test? If other reasonable alternatives exist, do they produce congruent results? Submit, and expect to receive, supplemental material Reviewers should be able to take the study themselves Videos are useful for empirical papers! steveh.co/panel

Should we require sample size rationale? How did you decide the sample size? Could my study detect an effect of a given size if one did exist? Pilot, estimate variance, simulate with different Ns Concisely describing an experiment can be difficult. steveh.co/panel

COMMUNICATING RESULTS steveh.co/panel

How to deal with surprises in analysis? If you didn’t have a hypothesis, be wary of drawing conclusions (and avoid testing everything) Don’t attribute causality to only part of a treatment Use priors, if only informally A test with a low error rate can have a high rate of false positives conditional on a positive finding Robustness checks Confirm randomization, analyze with and without outliers, assumptions Doing all possible pairwise comparisons is rarely necessary If you didn’t set out to explain / identify the mechanism, beware of posthoc speculation on what mechanism produced your results Don’t just trust the result because you chose a low alpha – expectations still matter, even if you don’t want to go all the way with Bayesian analysis SAFER TO AVOID OVERFITTING THAN TO DO IT, BC EVEN IF YOU ARE CLEAR ON WHATS SPECULATION, OTHERS MAY NOT BE Particularly about causation -- Inferring causation generally requires hypothesis testing ( Conversely, you might find that there’s an effect where you wouldn’t have expected one. Particularly when using frequentist methods In the context of probability mathematics, textbooks carefully explain that p(A|B) != p(B|A), and how a test with a low error rate can have a high rate of errors conditional on a positive finding, if the underlying rate of positives is low, but the textbooks typically confine this problem to the probability chapters and don’t explain its relevance to accept/reject decisions in statistical hypothesis testing. steveh.co/panel

NHST alternatives: Are 95% CIs the answer? Communicate effect size + uncertainty, but beware of misinterpretation. Evidence that even researchers using CIs regularly struggle with their interpretation (Belia et al. 2005, Hoekstra et al. 2014) Confidence intervals … State confidence in the algorithm Indicate no relationship between sample variance and bias Have a direct relationship with conventional hypothesis testing (⍺=0.05) when 95% CIs are used. Pierre’s talk : CIs are as unintuitive and misunderstand as NHST 95 % confidence is a confidence that in the long-run 95% of the CIs will include the population mean. steveh.co/panel

How can I make uncertainty in results concrete? Present and walk reader through realizations Posterior predictive checks Use inferred model to simulate input data, compare to observed Plot observed + simulated Plot observed - simulated We need more research on visualizing experiment results! Hullman et al. 2015 Kay et al. 2016 steveh.co/panel Gelman 2004

steveh.co/panel COMMUNICATING INTENTIONS Develop good communication habits - Communicating methods, results, intentions Empirical research is a conversation. Groups of researchers across disciplines and subfields draw on one another’s results (e.g., InfoVis and perceptual psychology), researchers interpret and publish their findings for discussion and iteration by others in their field, and each individual researcher must communicate honestly with him or herself to avoid to biasing empirical work toward desired, but not necessarily supported, conclusions. When communication around empirical research is not clear at any of these levels, evaluations of that work, and the subsequent research that builds on those findings, can be subject to false and/or superficial interpretations that threaten the validity of knowledge. I will describe communication challenges spanning different levels of the "research conversation" around empirical work in InfoVis. For example, researchers may overlook the uncertainty or ambiguity in the findings of prior work that they use to motivate their own hypotheses; reviewers may not thoroughly understand a reported experimental design, leading them to accept claims based on overinterpreted evidence; or researchers may fail to acknowledge the degree to which analysis procedures are constructed ad-hoc in an effort to confirm their hypothesis. I will offer ideas for discussion around how revising practices at different points in the pipeline of empirical research—from formulation of hypotheses, to consulting of the literature, to experimental design, to analysis and presentation of results, to review by the community—can help enforce clear communication. steveh.co/panel

Am I really susceptible to my own biases? Garden of forking paths / Researcher degrees- of-freedom: Outlier removal Which DV matters Defining comparisons Whether to transform variables Which test When to stop data collection Compare control and all Compare control and the best two … steveh.co/panel

Should we pre-specify (pre-register) all studies? Specify all decisions about data collection and analysis before doing the study (publicly) Easy version: Implement in your lab by having students write out analysis plans. You Your lab Pre-registration steveh.co/panel

Thanks! @jessicahullman faculty.washington.edu/jhullman steveh.co/panel