Research Data Management From A Publisher’s Perspective

Slides:



Advertisements
Similar presentations
The Dryad Data Repository Ryan Scherle 1, Hilmar Lapp 1, Amol Bapat 2, Sarah Carrier 2, Jane Greenberg 2, Peggy Schaeffer 1, Todd Vision 1,3, Hollie White.
Advertisements

Supporting Engagement in Open Access: a Publishers Perspective
Selecting a Data Sharing Repository. 2 Why Share Data? Enabling others to replicate and verify results as part of the scientific process Allows researchers.
Data citation from the perspective of a scholarly publisher Lyubomir Penev TDWG Data Citation Workshop, New Orleans, Oct 2011 ViBRANT.
THE NEED AND DRIVE FOR HIGH QUALITY DATA PUBLICATION Iain Hrynaszkiewicz Head of Data and HSS Publishing, Open Research Nature Publishing Group & Palgrave.
Release 4 of the COUNTER Code of Practice for e- Resources and new usage- based measures of impact Peter Shepherd COUNTER May 2014.
INTRODUCTION TO RESEARCH DATA MANAGEMENT Robin Desmeules Janice Kung J W Scott Health Sciences Library University of Alberta Libraries.
THE DATA CITATION INDEX AN INNOVATIVE SOLUTION TO EASE THE DISCOVERY, USE AND ATTRIBUTION OF RESEARCH DATA MEGAN FORCE 22 FEBRUARY 2014.
Presented by Ansie van der Westhuizen Unisa Institutional Repository: Sharing knowledge to advance research
EPSRC expectations on research data: What researchers need to know 12/03/2015 Masud Khokhar and Hardy Schwamm.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Managing journals: challenges and opportunities How to get started (with OJS) Jackie Proven.
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
Libraries as Partners in Research: the UC Curation Center’s Tools and Services UC3 Team University of California Curation Center California Digital Library.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
The Department of Energy’s Public Access Solution Giving Voice to Energy and Science R&D Results Jeffrey Salmon Deputy Director for Resource Management.
Panel Discussion Part I Methodology Ideas from adult MR brain segmentation are used in neonatal MR brain segmentation. However, additional challenges.
1 Why should “WE” CARE about data?. International initiatives OECD principles and guidelines for access to research data from public funding 2007 “Access.
Online Editorial Management On-line Management of Scholarly Journals Mahmoud Saghaei.
Data Management in Scholarly Journals and possible Roles for Libraries – Some Insights from EDaWaX Sven Vlaeminck | Leibniz-Information Centre for Economics.
BMJ and Data Sharing Claire Bower, Digital Communications
Now launched! Visit nature.com/scientificdata Honorary Academic Editor Susanna-Assunta Sansone Advisory.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
It’s the data that makes a paper Joerg Heber Executive Editor Nature Communications.
Responsible Data Use: Copyright and Data Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review Date.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
Linking Embargoed Datasets: A Plan for Improving How Research Data Can Be Shared, Linked and Tracked Arlington, VA, November 19, 2015 Anita de Waard VP.
Copyright and Data Matthew Mayernik National Center for Atmospheric Research Section: Responsible Data Use Version 1.0 October 2012 Copyright 2012 Matthew.
Data Citation Implementation Pilot Workshop
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
Using the DMPTool for data management plans Kathleen Fear February 27, 2014.
Writing a Data Management Plan with the DMPTool Kathleen Fear January 15, 2015.
| 1 Anita de Waard, VP Research Data Collaborations Elsevier RDM Services May 20, 2016 Publishing The Full Research Cycle To Support.
Updating image To update the background image: Go to ‘View’ Select ‘Slide Master’ Select the page with the image Right click on the image and select ‘Change.
RCUK Policy on Open Access: Terms and Compliance Repositories Support Project Event London, May 2013 Mari Williams BBSRC.
Beyond the Repository: Research Systems, REF & New Opportunities William J Nixon Digital Library Development Manager.
| 1 Data Citation Implementation Evolving the Elsevier infrastructure Mike Taylor January 2016.
Scholarly Link Exchange WG Wrap up – Wouter Haak
NRF Open Access Statement
Jeff Moon Data Librarian &
Open Exeter Project Team
BIO1130 Lab 2 Scientific literature
Open Access and Research Data Management: An Overview for LLOs
Open Access infrastructure and Open Data
The role and contributions of the Information Specialist to the core business/functions of the University of Pretoria Suzy Nyakale: Faculty Library.
Research Data Management From A Publisher’s Perspective
ACS 2016 Moving research forward with persistent identifiers
GFBio – Education module
Publishing software and data
Institutional role in supporting open access, open science, open data
General Finnish DMP Guidance
Jay Bhatt Drexel University Libraries
CNI Spring 2010 Membership Meeting
Getting Started with Data Management
Data Management: Documentation & Metadata
Digital Curation Centre
Open Access to your Research Papers and Data
OpenML Workshop Eindhoven TU/e,
Research Data Management
BIO1130 Lab 2 Scientific literature
Comparing your papers to the rest of the world
USER MANUAL - WORLDSCINET
Measuring Your Research Impact
Jisc Research Data Shared Service (RDSS)
Research data lifecycle²
Track Your Research Impact
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
USER MANUAL - WORLDSCINET
Fundamental Science Practices (FSP) of the U.S. Geological Survey
Presentation transcript:

Research Data Management From A Publisher’s Perspective Presentation for RDMI Meeting, Industry Panel September 14, 2017 Anita de Waard, a.dewaard@elsevier.com VP Research Data Management, RDS Elsevier

Outline: How has your work in data management enabled research and discovery? What key areas of success has your organization achieved in delivering research data management solutions? What are the greatest challenges you are facing in developing solutions that meet the needs of research data management?

10 Properties of Highly Effective Research Data 9. Re-usable (allow tools to run on it) 8. Reproducible Use 7. Trusted (e.g. reviewed) 6. Comprehensible (description / method is available) 10. Integrate upstream and downstream – make metadata to serve use. 5. Citable 4. Discoverable (data is indexed or data is linked from article) Share 3. Accessible 2. Preserved (long-term & format-independent) Save 1. Stored (existing in some form)

10 Properties of Highly Effective Research Data Repeat. Replicate. Reproduce.  Reuse. Review.  Data Journals: Research Elements 9. Re-usable (allow tools to run on it) 8. Reproducible Use Research Data Guidelines for Journal 7. Trusted (e.g. reviewed) 6. Comprehensible (description / method is available) 10. Integrate upstream and downstream – make metadata to serve use. Mendeley Data Repository 5. Citable 4. Discoverable (data is indexed or data is linked from article) Share DataSearch 3. Accessible 2. Preserved (long-term & format-independent) Hivebench Lab Notebook Save 1. Stored (existing in some form)

HIvebench: Store protocols in an Electronic Lab Notebook. Edit, export, share Keep collection of protocols online https://www.hivebench.com/

Hivebench: Run experiments from this Lab Notebook. Base on saved Protocols Edit, export, share Save and Export Outputs https://www.hivebench.com/

Mendeley Data: Export results to a trusted data repository. Describe how exoeriment can be reproduced Create DOI for Citation Link back to protocols Store up to 5 GB of data in many formats Keep track of versions of dataset https://data.mendeley.com/

DataSearch: Search over collection of repositories Chemistry data are retrievable from NIST, but only by going to their page in a browser and using their search tools. What about accessible within other applications, or accessible in assistive devices for those with vision impairment? What guarantee do we have the data will remain accessible in case of government funding problems? https://datasearch.elsevier.com

Data With Journals: Research Data Guidelines For Journals: https://www.elsevier.com/authors/author-services/research-data/data-guidelines Data With Journals: Research Data Guidelines For Journals: Option A: Research Data deposit and citation You are encouraged to: Deposit your research data in a relevant data repository Cite this dataset in your article Option B: Research Data deposit, citation and linking (or Research Data Availability Statement) You are encouraged to: Deposit your research data in a relevant data repository Cite and link to this dataset in your article If this is not possible, make a statement explaining why research data cannot be shared Option C: Research Data deposit, citation and linking (or Research Data Availability Statement) You are required to: Deposit your research data in a relevant data repository Cite and link to this dataset in your article If this is not possible, make a statement explaining why research data cannot be shared Option D: Research Data deposit, citation and linking You are required to: Deposit your research data in a relevant data repository Cite and link to this dataset in your article Option E: Research Data deposit, citation and linking (or Research Data Availability Statement); You are required to: Deposit your research data in a relevant data repository Cite and link to this dataset in your article. If this is not possible, make a statement explaining why research data cannot be shared Peer reviewers are asked to review the data prior to publication

Data Journals: E.g. MethodsX Link to protocols Data Journals: E.g. MethodsX Journal focuses on Method reporiduction Link to Data Fully OA https://www.journals.elsevier.com/methodsx

Reproducibility: Reproducibility Papers To increase the practice of reproducibility in computational science, we have two main goals: 1. Usability: development of tools that make it easier and significantly less time-consuming for authors to do reproducible research, and for reviewers to execute computational artifacts (and modify them) corresponding to published results. 2. Incentives: a new publication model that recognizes the efforts of making experiments reproducible (for authors) and verifying published scientific results (for reviewers). we have implemented a new publication model for the Reproducibility Section of Information Systems Journal. In this section, authors submit a reproducibility paper that explains in detail the computational assets from a previous published manuscript in Information Systems. Submission is by invitation only. Using Mendeley Data authors also submit their code, data, and optionally a ReproZip package or a Docker container to make the review process easier. Reviewers not only review the reproducibility paper, but also validate the results and claims published in the original manuscript. Once the paper is accepted, reviewers also become co-authors and are encouraged to add a section in the paper that states the extent to which the software is portable, is robust to changes, and is likely to be usable as a subcomponent or as a basis for comparison by future researchers. The review is not blinded, so authors and reviewers are encouraged to engage in a discussion about the validity of the experimental results as many times as necessary.

Currently In Development: Mendeley Data Management Platform: Integration with Existing Standards/Systems at Institution

Underway: “Basket of Metrics” & Elsevier Tracking Solutions Goal: Metric: How to measure More data is saved: 1 Stored, i.e. safely available in long-term repository) Nr of datasets stored in long-term storage MD, Pure; Plum Indexes Figshare, Dryad, MD and working on Dataverse. 2. Published, i.e. long-term preserved, accessible via web, have a GUID, citeable, with proper metadata Nr of datasets published, in some form Scholix, ScienceDirect/Scopus 3. Linked, to articles or other datasets Nr of datasets linked to articles Scholix, Scopus 4. Validated, by a reviewer/curated Nr of datasets in curated databases/peer reviewed in data articles Science Direct, DataSearch (for curated Dbses) More data is seen and used: 5. Discovered: found by users Nr of datasets viewed in databases/websites/search engines Datasearch, metrics from other search engines/repositories 6. Identified: Resolved through a GUID Broker DOI is resolved DataCite has DOI resolution: made available? 7. Mentioned: Social media and news Social media and news mentions Plum and Newsflo 8. Cited: Formal citations of data Nr of datasets cited in articles Scopus 9. Downloaded: Distinct downloads Downloaded from repositories Downloads from MD, access data from Figshare/Dryad 10. Reused: Dataset is used for new research Mention of usage in article or other dataset SD, access to other data repositories

We need baselines! Example: University of Manchester Data sharing = 19% (well above the average of 5.5%) 886 random articles checked 570 articles without any supplementary/associated data (64%); +151 articles with supplementary docs (but not data) 2 data journal articles (0.2%) 86 articles with associated data in repositories (9.7%) 81 articles linked to associated data in a repository (9.1%) 5 articles with no link to a repository (0.6%) 79 articles with supplementary data (8.9%) 9. Re-usable 8. Reproducible 0.2% 7. Trusted 6. Comprehensible 9.1% 5. Citable 4. Discoverable 0.6% 3. Accessible 8.9% 2. Preserved 1. Stored Random Selection Articles 886 Links found manually 81 Links found through Scholix 5 Total links 86 (9.8%) Courtesy Sean Husen and Helena Cousijn (Elsevier)

Open Data Report Reveals Some Challenges: Data sharing survey (with 1167 respondents): Although 69% of respondents found that sharing data was very important in their field And 73% wanted to have access to other people’s data, Only 37% believe there was credit in doing so, And only 25% felt they had adequate training to properly share their data with others. The main barriers for sharing data were: privacy concerns, ethical issues, intellectual property rights issues. Furthermore: Mandates from publishers or funding agencies were largely not seen as a driving force => Gap between desire and practice concerning data sharing. https://data.mendeley.com/datasets/bwrnfb4bvh/1

Further Challenge: Who Do We Talk to At An institution?

Further Challenge: How do you ‘Play Well With Others’ when there are so many others (e.g. 47 tools on NDS Labs Workbench) and they are mostly ‘academic’ (i.e. OS, constantly renewed, etc etc)?

How has your work in data management enabled research and discovery? Summary: How has your work in data management enabled research and discovery? Providing a suite of tools and standards that encourage open, integrated RDM solutions. What key areas of success has your organization achieved in delivering research data management solutions? Tools are used (ergo: useful); Developing institutional solutions and data metrics with partners. What are the greatest challenges you are facing in developing solutions that meet the needs of research data management? No great urgency for researchers, inadequate knowledge of possibilities; Distributed responsibility/decision-making processes for RDM; Plethora of tools to integrate with; Difficult to see what the market is (OS, completely? Academic/government?) > How can publisher play a role? Feel free to email me with any questions! a.dewaard@elsevier.com