The role of journals and publishers in reproducible research Iain Hrynaszkiewicz Head of Data and HSS Publishing, Open Research Nature Publishing Group.

Slides:



Advertisements
Similar presentations
OVERVIEW OF FACULTY OF 1000’S SERVICES
Advertisements

Building Support for a Discipline-Based Data Repository Ryan Scherle 1, Sarah Carrier 2, Jane Greenberg 2, Hilmar Lapp 1, Abbey Thompson 2, Todd Vision.
SCIENTIFIC DATA Presentation to the California Digital Library, 20 th June 2014 Ruth Wilson – Head of Publishing Services Andrew Hufton – Managing Editor.
Data archiving in evolutionary biology Michael Whitlock.
The journal as index and incentive for data publication Myles Axton Editor, Nature Genetics Cambridge Oct 23 rd 2011.
Doug Altman Centre for Statistics in Medicine, Oxford, UK
Making small data big! The Biodiversity Data Journal (BDJ) Lyubomir Penev, Teodor Georgiev, Pavel Stoev, David Roberts, Vincent Smith ViBRANT.
NATIONAL LIBRARY OF MEDICINE PubMed Central Brooke Dine National Library of Medicine Medical Library Association Conference May 2004.
The Thomson Reuters CITATION CONNECTION Digital Library st March – 3 rd April 2014, Jasná David Horký Country Manager – Central and Eastern Europe.
IDENTIFIERS & THE DATA CITATION INDEX DISCOVERY, ACCESS, AND CITATION OF PUBLISHED RESEARCH DATA NIGEL ROBINSON 17 OCTOBER 2013.
BioMed Central’s open data initiatives Alliance for Permanent Access conference 7 th November 2012 Iain Hrynaszkiewicz Publisher (Open Science), BioMed.
An Open Access publisher’s perspective on data publishing Matthew Cockerill Managing Director, BioMed Central Dryad-UK meeting HEFCE, London, 28 April.
Open access innovations in clinical research reporting October 22 nd 2012 Iain Hrynaszkiewicz Publisher (Open Science), BioMed Central
Order of speakers Iain Hrynaszkiewicz Nature Publishing Group & Palgrave Macmillan Michele Acuto University College London Timothy M Shaw University of.
THE NEED AND DRIVE FOR HIGH QUALITY DATA PUBLICATION Iain Hrynaszkiewicz Head of Data and HSS Publishing, Open Research Nature Publishing Group & Palgrave.
Data Publishing Workflows: Strategies and Standards
FROM DATA REPOSITORIES TO DATA JOURNALS – WHERE, WHEN AND HOW TO SUBMIT Andrew L. Hufton Managing Editor, Scientific Data Nature Publishing Group
An introduction to BioMed Central and Open Access publishing Matthew Cockerill Managing Director, BioMed Central.
Thomas Lemberger Chief Editor, Molecular Systems Biology Deputy Head, Scientific Publications, EMBO Publishing actionable data.
American Medical Association Journals include: JAMA (journal of the American Medical Association.), Archives of surgery, Archives of ophthalmology and.
THE DATA CITATION INDEX AN INNOVATIVE SOLUTION TO EASE THE DISCOVERY, USE AND ATTRIBUTION OF RESEARCH DATA MEGAN FORCE 22 FEBRUARY 2014.
Towards an Integrated Transparent Journal Publishing Workflow
Presented by Ansie van der Westhuizen Unisa Institutional Repository: Sharing knowledge to advance research
Promoting data dissemination and reproducibility. Christopher I. Hunter, Scott C. Edmunds, Peter Li, Xiao Si Zhe, Robert L Davidson, Laurie Goodman. Submit.
Open Data, Open Source: preparing for Big Data in Metabolomics Rob L Davidson #MetSoc2015 This presentation DOI: /m9.figshare
TODAY’S SCIENTIFIC ARTICLE – HOW REPRODUCIBLE IS IT? MICHAEL MARKIE Associate Publisher,
Data Citation: the next big thing… ?!?! 1 Victoria University 20 Nov
Open Data, Open Source: preparing for Big Data in Metabolomics Rob L Davidson #MetSoc2015 This presentation DOI: /m9.figshare
Supporting scientific communities by publishing data Dryad Digital Repository Peggy Schaeffer OpenAIRE/LIBER Workshop May 28, 2013 Ghent, Belgium.
Introduction to GigaScience journal & database Chris I Hunter & Rob L Davidson ISI CODATA International Training Workshop on Big Data 11 th March 2015.
PLoS Enlivening Scientific Culture Dr Chris Surridge Managing Editor, PLoS ONE Public Library of Science.
Extensible Markup Language (XML) Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879).ISO 8879 XML is a.
A paradigm shift in biodiversity publishing: mobilization, mark up, reuse and integration of small data Lyubomir D. Penev 1,3, Teodor A. Georgiev 3, Pavel.
Biodiversity Data Journal: mobilization, reuse and integration of small data Lyubomir D. Penev 1,3, Teodor A. Georgiev 3, Pavel E. Stoev 2,3, Jordan Bisserkov.
Resolving the publishing bottleneck and increasing data interoperability in biodiversity science Lyubomir Penev, Teodor Georgiev, Pavel Stoev, David Roberts,
8 October 2009Microbial Research Commons1 Toward a biomedical research commons: A view from NLM-NIH Jerry Sheehan Assistant Director for Policy Development.
Data Management in Scholarly Journals and possible Roles for Libraries – Some Insights from EDaWaX Sven Vlaeminck | Leibniz-Information Centre for Economics.
The role of journals in research data sharing EPFL 2014 Damian Pattinson, PhD
BMJ and Data Sharing Claire Bower, Digital Communications
Now launched! Visit nature.com/scientificdata Honorary Academic Editor Susanna-Assunta Sansone Advisory.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
It’s the data that makes a paper Joerg Heber Executive Editor Nature Communications.
Professor Phillipa Hay Centre for Health Research, School of Medicine.
GigaScience ( is an online, open-access journal that includes, as part of its publishing activities, the database GigaDB.
DOE Data Management Plan Requirements
The BioCADDIE / FORCE11 Data Citation Pilot © 2015 FORCE11.orgFORCE11.org Tim Clark, Ph.D. Harvard Medical School & Massachusetts General Hospital Maryann.
Dryad UK discussion meeting Mark Patterson, Director of Publishing April 27, 2010 Committed to making the world’s scientific and medical literature.
What is the role of journals and publishers in driving research standards? Véronique Kiermer, PhD Director, Author & Reviewer Services Nature Publishing.
Beyond the PDF: New modes of dissemination Experiments from PLOS Theo Bloom, Editorial Director for Biology, PLOS Amsterdam, March 2013.
Publication Ethics Webinar: Jan 2016 (Ethical) framework for author-driven publishing Dr Michaela Torkar Editorial Director, F1000Research
OPEN SCIENCE PUBLISHING: BEYOND OPEN ACCESS MAX PLANCK OPEN ACCESS AMBASSADORS CONFERENCE, 4 December 2014 Michaela Torkar Editorial Director, F1000 Research.
Practical Steps for Increasing Openness and Reproducibility Courtney Soderberg Statistical and Methodological Consultant Center for Open Science.
Webinar on increasing openness and reproducibility April Clyburne-Sherin Reproducible Research Evangelist
Practical Steps for Increasing Openness and Reproducibility Courtney Soderberg Statistical and Methodological Consultant Center for Open Science.
| 1 Anita de Waard, VP Research Data Collaborations Elsevier RDM Services May 20, 2016 Publishing The Full Research Cycle To Support.
Updating image To update the background image: Go to ‘View’ Select ‘Slide Master’ Select the page with the image Right click on the image and select ‘Change.
Sara Bowman Center for Open Science | Promoting, Supporting, and Incentivizing Openness in Scientific Research.
Efforts to Improve Transparency at the JBC Roger J. Colbran Associate Editor Credit: Amanda Fosang, Associate Editor.
Publish your data. The Data Journal concept Data must be well described before others can use it and benefit from it. Scientists who share data in a reusable.
NRF Open Access Statement
F1000: Open for science Hollydawn Murray
Center for Open Science: Practical Steps for Increasing Openness
Transparency increases the credibility and relevance of research
Publishing software and data
Policy and publishing developments for sharing data and code
Data publishing from the viewpoint of a biodiversity publisher
Role of peer review in journal evaluation
What, why and best practices in open research
BMC Research Notes A peer-reviewed forum for micro publications across all scientific disciplines; launched 2008 Editor: Dirk Krueger, PhD Focused on brief.
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Presentation transcript:

The role of journals and publishers in reproducible research Iain Hrynaszkiewicz Head of Data and HSS Publishing, Open Research Nature Publishing Group & Palgrave CASIM Reproducible Research Workshop, 27 th November 2015

Why do publishers care? More reliable evidence and papers Supporting journal and society goals Supporting research community expectations and expectations of funding agencies Content innovation More visible and widely reused publications CASIM workshop Nov 20152

PloS Medicine 2005 doi: /journal.pmed Nature 2015 doi: /525426a

Irreproducibility: underlying issues Misconduct Publication bias and refutations – where? Experimental design Statistics Lab supervision and training Reporting and sharing information Gels, microscopy images Statistical reporting Methods description Data deposition 4

Transparency vs. Reproducibility Both require significant effort but transparency more pragmatic/achievable Promoting transparency and reuse helps reproducibility Access to materials to reduce bias and support reproducible research: Methods Protocols Code Data Pre-registration CASIM workshop Nov Miguel et al. (2014). Promoting transparency in social science research. Science (New York, N.Y.), 343(6166), 30–1. doi: /science

Reproducibility: roles for publishers Content Policies Incentives Licenses Access Reliability Innovation CASIM workshop Nov Image credit: DS Pugh [CC-BY-SA-2.0 ( via Wikimedia Commons. Further reading: Hrynaszkiewicz I, Li P, Edmunds SC: Open science and the role of publishers in reproducible research. In: Implementing Reproducible Research. Edited by Stodden V, Leisch F, Peng RD. Chapman & Hall/CRC; 2014

Reproducibility: Content - details Glasziou et al. (2008) BMJ – inadequate methods descriptions for medical interventions BMJ 2008;336:1472 Length restrictions removed on Methods (Nature) No length restrictions in open access journals Reporting guidelines e.g. MIAME but implementation/enforcement is patchy Format of content also important when literature used a resource for research e.g. structured XML versions of articles in PubMed Central CASIM workshop Nov 20157

Reporting checklist of statistical and methodological details Reproducibility checklist also currently being trialled at various BMC journals, including BMC Biology, BMC Neuroscience, Genome Biology, and GigaScience.

Example (a) Western blot of cell lysates of control and Rac1-siRNA-treated MTLn3 cells, blotted for Rac1 and β-actin. A representative image is shown from 3 blots. (b) MTLn3 cells transfected with control or Rac1 siRNA and plated on Alexa-405-conjugated gelatin overnight. Arrows point to invadopodia and sites of degradation. Scale bars, 10 μm. Representative image sets are shown from 50 image sets each for the control and Rac1 siRNA. (c) Quantification of mean degradation area per cell from b, including Rac1 inhibitor NSC23766 treatment at 100 μM. n = 60 fields for each condition, pooled from 5 independent experiments; error bars are s.e.m. Student’s t-test was used. **P = ,^ ^P = Uncropped images of blots are shown in Supplementary Fig. 9. CASIM workshop Nov statement of replication definition of n definition of statistic tests Nature Cell Biology 16, 571–583 (2014) doi: /ncb2972 raw source data

Reproducibility: Content - format Format of content also important when literature used a resource for research e.g. structured XML versions of articles in PubMed Central Building a “GenBank for the published literature” (Roberts, Varmus et al Science, 2001) Growing amount of open access articles (e.g. >60% of articles at NPG in 2015) CASIM workshop Nov

Reproducibility: Content - types 11CASIM workshop Nov 2015

Get Credit for Sharing Your Data Publications will be indexed and citeable. Open-access Creative Commons licenses (CC-BY/CC-BY-NC) for the main Data Descriptor. Each publication supported by CCO metadata. Focused on Data Reuse All the information others need to reuse the data; no interpretative analysis, or hypothesis testing Peer-reviewed Rigorous peer-review focused on technical data quality and reuse value Promoting Community Data Repositories Not a new data repository; data stored in community data repositories 13

Sequence variants (EVA) Associated Nature Genetics article Data at European Variation Archive

Gene expression Associated Nature Article Data at figshare & NCBI GEO Integrated figshare data viewer

Neuroscience Code in GitHub New Dataset Data in OpenfMRI Source code in GitHub Big Data 16

Policies: on data Willingness to share stated (Annals Internal Medicine) Data sharing implied by submission (BioMed Central*) Data sharing implied as a condition of publication (Nature*) Mandated data sharing with statement in paper (PLOS, BMJ - for clinical trials) Mandated data sharing with statement and link to data (non- medical journals e.g. ecology, animal genomics) Mandated open data as a condition of submission (Scientific Data, GigaScience, F1000Research) *Minimum requirement – some disciplines/journals may mandate 17 STRONGER 1. Vines, T. H. et al. Mandated data archiving greatly improves access to research data. FASEB J. fj.12–218164– (2013). doi: /fj CASIM workshop Nov 2015

Finding the right repository Lists more than 80 repositories, across the biological, physical and social sciences Advise authors on the best place to store their data List made available under CC-BY in figshare 18

Policies: on code CASIM workshop Nov

Policies: it’s in the implementation CASIM workshop Nov Meta-analysis fails when <40% data available Systematic Reviews 2014, 3:97 doi: / Poor availability of psychological datasets (64/249 available) American Psychologist, Vol 61(7), Oct 2006, doi: / X Data received from 1/10 PLOS Medicine and PLOS Clinical Trials authors PLoS ONE 4(9): e7078. doi: /journal.pone % of 394 researchers contacted sent their data Collabra (1) doi: /collabra.13

Reproducibility: Incentives Enabling data and code citation Data articles and journals Recognising reproducibility – collaborating with challenges, awards CASIM workshop Nov

Data citation CASIM workshop Nov Scientific Data (2014) ​ doi: /sdata

Reproducibility: Licenses Data: depends on public repositories. Some repositories e.g. figshare and Dryad both use the CC0 waiver. Metadata: released under the CC0 waiver to maximize reuse and aid data miners Articles: Creative Commons licenses 23

Licensing for maximum reuse Further reading: BMC Research Notes (2012) doi: /

Reproducibility: Access Discoverability and links to other digital products of research More useful links between papers CASIM workshop Nov BMC “Threaded Publications” Nature ENCODE explorer

Reproducibility: Reliability/quality Peer review at Scientific Data focuses on: Completeness (can others reproduce?) Consistency (were community standards followed?) Integrity (are data in the best repository?) Experimental rigour and technical quality (were the methods sound?) Does not focus on: Perceived impact/importance Size/complexity of data 26

Reproducibility: Innovation Collaboration between publishers and software/tools for science Connect doing with communicating Data and article submission integration (figshare, Dryad) Various publisher-repository partnerships 27

Reproducibility: Innovation 28

Thank you for listening Iain Hrynaszkiewicz Head of Data and HSS Publishing, Open Research Nature Publishing Group & Palgrave