Managing data and being open Sarah Jones Digital Curation Centre, Glasgow Data Management Plans: principles and.

Presentation transcript:

Managing data and being open Sarah Jones Digital Curation Centre, Glasgow Data Management Plans: principles and practice workshop, 19 November 2015, Bologna

What is research data management?
“the active management and appraisal of data over the lifecycle of scholarly and scientific interest”
Data management is part of good research practice.
Plan, Create, Document, Use, Publish, Share

What does data management involve?
- Data Management Planning
- Creating data
- Analysing data
- Documenting data
- Accessing / (re)using data
- Storage and backup
- Selecting what to keep
- Sharing data
- Data licensing and citation
- Preserving data
- …
CC-BY-NC-SA

Reasons to manage and share data

Benefits for researchers
- To make research easier! Stop yourself drowning in irrelevant stuff
- Make sure you can understand and reuse your data again later
- Advance your career: data is growing in significance

Research integrity
- To avoid accusations of fraud or bad science
- Evidence findings and enable validation of research methods
- Meet codes of practice on research conduct
- Many research funders worldwide now require Data Management and Sharing Plans

Potential to share data
- So others can reuse and build on your data
- To gain credit: several studies have shown higher citation rates when data are shared
- For greater visibility, impact and new research collaborations
- Promote innovation and allow research in your field to advance faster

Managing and sharing data: a best practice guide

MANTRA online training for RDM

DCC ‘How to’ guides
- How to appraise and select research data
- How to cite datasets and link to publications
- How to develop a data management plan
- How to license research data
- How to track the impact of research data with metrics
- How to write a lay summary

WHAT DOES IT MEAN TO BE OPEN? Image by biblioteekje CC-BY-NC-SA

What is open science?
“science carried out and communicated in a manner which allows others to contribute, collaborate and add to the research effort, with all kinds of data, results and protocols made freely available at different stages of the research process.”
Research Information Network, Open Science case studies
open-science-case-studies

More than open access publishing CC-BY Andreas Neuhold

Open data
Tim Berners-Lee’s proposal for five star open data:
1. make your stuff available on the Web (whatever format) under an open licence
2. make it available as structured data (e.g. Excel instead of a scan of a table)
3. use non-proprietary formats (e.g. CSV instead of Excel)
4. use URIs to denote things, so that people can point at your stuff
5. link your data to other data to provide context
“Open data and content can be freely used, modified and shared by anyone for any purpose”
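The five star scheme is cumulative: each star assumes the earlier criteria are already met. A minimal sketch of that logic follows. The function name and its boolean parameters are hypothetical and are not part of the original proposal or the slides.

```python
def five_star_rating(open_licence, structured, non_proprietary, uses_uris, linked):
    """Return the number of stars (0-5) earned under the five star scheme.

    Stars are cumulative: counting stops at the first unmet criterion.
    Parameter names are illustrative assumptions, not an official API.
    """
    criteria = [open_licence, structured, non_proprietary, uses_uris, linked]
    stars = 0
    for met in criteria:
        if not met:
            break
        stars += 1
    return stars


# A CSV file published on the web under an open licence, without URIs or links.
csv_on_web = five_star_rating(True, True, True, False, False)
```

Under these assumptions, a dataset that is structured and linked but not openly licensed earns no stars, because the first criterion is the foundation for the rest.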

Open methods
- Documenting and sharing workflows and methods
- Sharing code and tools to allow others to reproduce work
- Using web based tools to facilitate collaboration and interaction from the outside world
Open notebook science: “when there is a URL to a laboratory notebook that is freely available and indexed on common search engines.”

Reliance on specialist research software
Slide from Neil Chue-Hong, Software Sustainability Institute
Do you use research software?
What would happen to your research without software?
Survey of researchers from 15 UK Russell Group universities conducted by SSI between August - October
DOI: /zenodo
% Develop their own software
71% Have no formal software training

Openness at every stage
Design, Experiment, Analysis, Publication, Release
Open science image CC BY-SA 3.0 by Greg Emmerich
- Change the typical lifecycle
- Publish earlier and release more
- Papers + Data + Methods + Code…
- Support reproducibility

Degrees of openness

Open
Content that can be freely used, modified and shared by anyone for any purpose
- Five star open data

Restricted
Limits on who can use the data, how or for what purpose
- Charges for use
- Data sharing agreements
- Restrictive licences
- Peer-to-peer exchange
- …

Closed
- Unable to share
- Under embargo

MANAGING RESEARCH DATA AND MAKING IT OPEN Things to consider Image CC-BY-NC-SA by Leo Reynolds

How to make data open?
1. Choose your dataset(s)
   - What can you make open? You may need to revisit this step if you encounter problems later.
2. Apply an open licence
   - Determine what IP exists. Apply a suitable licence, e.g. CC-BY.
3. Make the data available
   - Provide the data in a suitable format. Use repositories.
4. Make it discoverable
   - Post on the web, register in catalogues…

Licensing research data openly
This DCC guide outlines the pros and cons of each approach and gives practical advice on how to implement your licence.

CREATIVE COMMONS LIMITATIONS
- NC (Non-Commercial): What counts as commercial?
- ND (No Derivatives): Severely restricts use
Licences that include these clauses are not open licences.

Horizon 2020 Open Access guidelines point to: or

EUDAT licensing tool
Answer questions to determine which licence(s) are appropriate to use.

Metadata standards to use
Use relevant standards for interoperability.

Choosing appropriate file formats
If you want your data to be re-used and sustainable in the long term, you typically want to opt for open, non-proprietary formats. You may create your data in one format and convert it.

Type | Recommended | Avoid for data sharing
Tabular data | CSV, TSV, SPSS portable | Excel
Text | Plain text, HTML, RTF; PDF/A only if layout matters | Word
Media | Container: MP4, Ogg; Codec: Theora, Dirac, FLAC | Quicktime, H264
Images | TIFF, JPEG2000, PNG | GIF, JPG
Structured data | XML, RDF | RDBMS

Further examples:
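Converting tabular data to an open format can be a small scripted step. The sketch below uses only the Python standard library and assumes the records have already been read from the working format into dictionaries. The function name, column names and values are hypothetical examples, not taken from the slides.

```python
import csv
import io


def rows_to_csv(rows, fieldnames):
    """Serialise a list of dict records as CSV text with a header row."""
    buffer = io.StringIO()
    # lineterminator is set explicitly so the output uses plain "\n" newlines.
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical example records; the column names are illustrative only.
sample_rows = [
    {"site": "A", "temp_c": 12.5},
    {"site": "B", "temp_c": 14.0},
]
csv_text = rows_to_csv(sample_rows, ["site", "temp_c"])
```

Writing the result to a file with UTF-8 encoding would give a plain-text copy that can be deposited alongside, or instead of, the proprietary original.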

Data repositories
- Does your publisher or funder suggest a repository?
- Are there data centres or community databases for your discipline?
- Does your university offer support for long-term preservation?

Zenodo
- OpenAIRE-CERN joint effort
- Multidisciplinary repository
- Multiple data types
  - Publications
  - Long tail of research data
- Citable data (DOI)
- Links funding, publications, data & software

Citing research data: why?

How to cite data
Key citation elements
- Author
- Publication date
- Title
- Location (= identifier)
- Funder (if applicable)
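The key citation elements can be assembled into a citation string, sketched below. The ordering and punctuation are an assumption for illustration, not a style prescribed by the slides. The function name, the author, the title, the DOI and the funder are all hypothetical placeholders.

```python
def format_data_citation(author, year, title, identifier, funder=None):
    """Build a citation string from the key elements listed on the slide.

    The layout "Author (Year). Title. Identifier." is an illustrative
    assumption; follow your repository's or publisher's preferred style.
    """
    citation = f"{author} ({year}). {title}. {identifier}."
    if funder:
        # The funder is optional, matching "if applicable" on the slide.
        citation += f" Funded by {funder}."
    return citation


# Hypothetical dataset details, used only to show the output shape.
citation = format_data_citation(
    author="Doe, J.",
    year=2015,
    title="Example survey dataset",
    identifier="https://doi.org/10.1234/example",
    funder="Example Funder",
)
```

Using a persistent identifier such as a DOI for the location element keeps the citation resolvable even if the dataset moves.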

Conduct science in the open
- Use open lab notebooks
- Share protocols
- Blog about your work
- Publish assertions to get ideas out sooner (nanopublication)

Collaborate & share: MyExperiment

Plan for openness from the outset
Many decisions taken early on in the project will affect whether the data can be made openly available.
- Think about where you want to publish and include APCs in grant applications if needed.
- Ensure consent agreements also include permission to archive and share data for reuse by others.
- Seek permissions for more than just the primary project purpose if signing licences to reuse third-party data. Derivative data may not be able to be shared if it includes somebody else’s IP.
- Explore the potential for openness when drafting agreements with commercial partners.

Thanks for listening
DCC case studies and guidance documents:
Follow us on and #ukdcc