Kerry Miller Digital Curation Centre University of Edinburgh

Slides:



Advertisements
Similar presentations
April 2010 MRC Data Sharing Policy Peter Dukes Policy Lead – Data Sharing & Preservation.
Advertisements

Reconciling the sharing of research data with ethical review for research with people as participants Veerle Van den Eynden UK Data Archive Data Support.
Useful tools for ESRC Research Centres
A centre of expertise in data curation and preservation DCC Workshop: Curating sApril 24 – 25, 2006 Funded by: This work is licensed under the Creative.
A centre of expertise in data curation and preservation UKOLN Open ForumIWMW June 2006 Funded by: This work is licensed under the Creative Commons.
A centre of expertise in data curation and preservation London :: ARK Group Workshop: Archiving the Web :: 28 Sept 2006 Funded by: This work is licensed.
A centre of expertise in data curation and preservation National FoI Group Birmingham07 March 2007 Funded by: This work is licensed under the Creative.
A centre of expertise in data curation and preservation SoA Annual Conference::York::August 2008 Funded by: This work is licensed under the Creative Commons.
A centre of expertise in data curation and preservation CETIS MDR SIG::28 June 2006::University of Bath Funded by: This work is licensed under the Creative.
Data Management Tools David Wallom. YOUR DATA DOES NOT BELONG TO YOU! IT BELONGS TO YOUR EMPLOYING INSTITUTION!
APA CONFERENCE, FRASCATI 6 November 2012 Data management planning at the DCC Martin Donnelly Digital Curation Centre University of Edinburgh.
MANAGING YOUR DATA WELL …………………………………………
JISC Managing Research Data 2 Programme Launch Nottingham 1-2 December,2011 Robin Rice Data Librarian University of Edinburgh.
A centre of expertise in data curation and preservation DC 101 Lite, September 10, 2010, London Funded by: This work is licensed under the Creative Commons.
… because good research needs good data DMP Online, Lincoln, 28 th Feb 2013 DMP Online Kerry Miller Digital Curation Centre University of Edinburgh
Data Management Planning Kerry Miller Digital Curation Centre University of Edinburgh DIY Research Data Management Training Kit for.
<<Date>><<SDLC Phase>>
Creating a Data Management Plan for your Research EPFL Workshop Lausaunne, 28 Oct 2014 Robin Rice, Laine Ruus EDINA and Data Library.
How to Write a Data Management Plan Gareth Cole, Data Curation Officer, Open Access Team.
A centre of expertise in data curation and preservation MIS Seminar :: University of Edinburgh :: 2 October 2006 Funded by: This work is licensed under.
Good practice in Research Data Management Module 2: RDM Introduction.
Data Seal of Approval Overview Lightning Talk RDA Plenary 5 – San Diego March 11, 2015 Mary Vardigan University of Michigan Inter-university Consortium.
DATA LIFECYCLE & DATA MANAGEMENT PLANNING ……………………………………………………………………………………………………………………………….…………………………….. ……………………………………………………………......…... RESEARCH DATA.
Open Exeter Project Team
Undertaken by the ………………………………
EPSRC expectations on research data: What researchers need to know 12/03/2015 Masud Khokhar and Hardy Schwamm.
DATA MANAGEMENT SUPPORT FOR RESEARCHERS …………………………………………
+ Sarah Jones Digital Curation Centre Supporting researchers with Data Management Plans.
Data Management Planning and DMPonline
U.S. Department of the Interior U.S. Geological Survey Planning for Data Management Creating data management plans for your project.
… because good research needs good data PEKin: Developing Data Management Expertise in Research, 21 October 2010 The DCC’s Data Management Planning: Encouraging.
Managing Research Data – The Organisational Challenge at Oxford James A J Wilson Friday 6 th December,
UVa Library Research Data Services
Data Management Planning
Data Archiving and Networked Services DANS is an institute of KNAW en NWO Data Archiving and Networked Services Introduction to Data Management Planning.
Because good research needs good data Data Management Planning Anglia Ruskin University 1 st June 2015 Jonathan Rans Digital Curation Centre This work.
Queensland University of Technology CRICOS No J The OAK Law Project Legal Issues in Data Management: A Practical Approach.
1 The Technical Standards and Your Bid Sarah Ormes UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by Resource: The Council for Museums, Archives.
Because good research needs good data The DCC lifecycle model, Exeter Uni, 19 May 2012 Funded by: The Digital Curation Lifecycle Model Joy Davidson and.
Data Management Planning. What is a DMP? A short plan that outlines  what data you will create and how  how you will manage it (storage, back-up, access…)
Research Data Management for Research Support staff 30 th June 2015 Isabel Chadwick, Research Data Management Librarian
Because good research needs good data Funded by: Digital Curation for Researchers, 28th February 2013 The Shifting Research Data Management Policy Landscape.
Introduction to Data Management Plans Sarah Jones Digital Curation Centre, Glasgow DMPonline workshop, 9-10 November.
DOE Data Management Plan Requirements
11 Researcher practice in data management Margaret Henty.
Options for customising DMPonline Sarah Jones Digital Curation Centre, Glasgow DMPonline workshop, 9-10 November.
Funded by: Data Management Planning Sarah Jones Digital Curation Centre Twitter: sjDCC.
Using RMS to comply with Open Access Requirements Betsy Fuller Research Repository Librarian Information Services.
Preserving your research data for future use This work is licensed under a Creative Commons Attribution 3.0 Unported License.Creative Commons Attribution.
Aalto Research Data Management Policy Ella Bingham 8 April 2016 This work is licensed under the Creative Commons Attribution 4.0 International License.
Introduction to Research Data Management Joy Davidson and Sarah Jones Digital Curation Centre
Research Data Management 26 th April 2016 Federica Fina, Data Scientist, University of St Andrews Library.
DMPonline Adaption of template Sacha Zurcher & Stine Vejlebo Hansen RUb.
DATUM for Health – Healthy research needs healthy data I’ve collected my data, so what do I do with it now? Research data management Session 2 Data Curation.
Research Data Management in the Humanities: an Introduction to the Basics Open Exeter Project Team.
Writing a data management plan (DMP) Stephen Grace and David McElroy Writing a DMP workshop, UEL 5 March 2015.
Because good research needs good data The DCC lifecycle model, Exeter Uni, May 2011 Funded by: The Digital Curation Lifecycle Model Joy Davidson.
Data Management Planning Sarah Jones & Joy Davidson Digital Curation Centre
Open Exeter Project Team
DMPonline Adaption of template
Open Access and Research Data Management: An Overview for LLOs
Frameworks for Sensitive Data in the Research Lifecycle
EPSRC research data expectations and research software management
Institutional role in supporting open access, open science, open data
General Finnish DMP Guidance
Open Access to your Research Papers and Data
Sabrina Iavarone Senior User Services Officer
Research Data Management
Research data management plans: How to write one
Research Data Dr Aoife Coffey, Research Data Coordinator
Presentation transcript:

Kerry Miller Digital Curation Centre University of Edinburgh Data Management Planning Kerry Miller Digital Curation Centre University of Edinburgh This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 2.5 UK: Scotland License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/2.5/scotland/ ; or, (b) send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.

Overview About the Digital Curation Centre Data Management Planning Group exercise DMP Online Keywords: best practice; collaboration; community; data management; life cycle; sharing; guidance; policy; adherence. MIGHT BE WORTH SQUEEZING SOMETHING IN ABOUT CLOUD TO CONNECT WITH AUDIENCE

The Digital Curation Centre Founded in 2004 Mission to support the UK research community in the management and curation of digital resources, with a special focus on research data Three partners: Edinburgh, Glasgow and Bath Primary funder is JISC

Collaborations Generic data management guidance in conjunction with Funders, Institutions, & Disciplines to provide specific guidance Joint training programmes organized and delivered by DCC and UKDA Provided advice to US consortium Provide support to a variety of JISC MRD projects

What is Research Data Management (RDM)? An umbrella term to describe all aspects of planning, organising, documenting, storing and sharing data. It also takes into account issues such as data protection and confidentiality. It provides a framework that supports researchers and their data throughout the course of their research and beyond.

The RDM Lifecycle Expose National Repository or Slide borrowed with permission from Anthony Beitz, Monash University. Presented at OR 2012, Edinburgh Expose National Repository or Institutional Repository or Electronic Journal or Community Portal Design Data Management Planning Research Data Management Platform Collaborate Experiment Publish Analyse Collaborate Conceive Design Experiment Publish Expose Analyse

Benefits of RDM By managing your data properly you will: Meet funder / university / industry requirements Ensure data are accurate, complete, authentic and reliable – as per good research practice Ensure research integrity and replication Ensure data security & minimise the risk of loss Increase efficiency - save time & resources Ensure data is available for your own future use

RCUK Common Principles on Data Policy Key messages: Data are a public good Adherence to community standards and best practice Metadata for discoverability and access Recognise constraints on what data to release Permit embargo periods delaying data release Acknowledgement of / compliance with T&Cs Data management and sharing activities should be explicitly funded http://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx

Funder Requirements AHRC, BBSRC, ESRC, MRC, NERC, and STFC all require some form of data management or sharing plan as part of a funding application. The requirements are diverse, but they all have the RCUK Common Principles as their foundation. Cancer Research UK and the Wellcome Trust are not part of RCUK but both require data sharing plans. http://www.dcc.ac.uk/resources/data-management-plans/funders-requirements

STFC Requirements STFC has a ‘Scientific Data Policy’ which applies to ALL scientific data produced as a result of STFC funding. Data management plans should exist for all data within the scope of the policy. Proposals for grant funding, for those projects which result in the production or collection of scientific data, should include a data management plan. STFC recommends that data management plans be formulated following the guidance provided by the Digital Curation Centre. http://www.dcc.ac.uk/resources/data-management-plans

The DMP Checklist The DMP Checklist (Donnelly/Jones) is the underlying intellectual framework which supports all of the DCC’s data management planning work. The next part of this talk gives a very quick overview of these sections.

The DMP Checklist Pulled together a list of all the UK funders’requirements, gathered them into thematic groups, and supplemented this list with their own expertise... (Donnelly & Jones, 2009) This became the first ‘Data Management Plan Content Checklist’ (Checklist v1.0’) – it had 51 questions/headings. Post-consultation, v2.0 had 115 questions/headings. Version 3 was produced in March 2011 and has 118 questions – much too long! Version 4 has just been completed, it is shorter and more suited to the needs of researchers and institutions

DMP Checklist Headings The checklist covers all of the following areas: Administrative Data Data Collection Documentation and Metadata Ethics and Legal Compliance Storage and Backup Selection and Preservation Data Sharing Responsibilities and Resources

1: Administrative Data Overview of functionality... DCC Checklist DCC Guidance and questions to consider Administrative Data ID A pertinent ID as determined by the funder and/or institution. Funder State research funder if relevant Grant Reference Number Enter grant reference number if applicable [POST-AWARD DMPs ONLY] Project Name If applying for funding, state the name exactly as in the grant proposal. Project Description Questions to consider: What is the nature of your research project? What research questions are you addressing? For what purpose are the data being collected or created? Guidance: Briefly summarise the type of study (or studies) to help others understand the purposes for which the data are being collected or created. PI / Researcher Name of Principal Investigator(s) or main researcher(s) on the project. PI / Researcher ID E.g ORCID http://orcid.org/ Project Data Contact Name (if different to above), telephone and email contact details Date of First Version Date the first version of the DMP was completed Date of Last Update Date the DMP was last changed Related Policies Are there any existing procedures that you will base your approach on? Does your department/group have data management guidelines? Does your institution have a data protection or security policy that you will follow? Does your institution have a Research Data Management (RDM) policy? Does your funder have a Research Data Management policy? Are there any formal standards that you will adopt? List any other relevant funder, institutional, departmental or group policies on data management, data sharing and data security. Some of the information you give in the remainder of the DMP will be determined by the content of other policies. If so, point/link to them here. Overview of functionality...

2: Data Collection DCC Checklist DCC Guidance and questions to consider Data Collection What data will you collect or create? Questions to consider: - What type, format and volume of data? - Do your chosen formats and software enable sharing and long-term access to the data? - Are there any existing data that you can reuse? Guidance: Give a brief description of the data, including any existing data or third-party sources that will be used, in each case noting its content, type and coverage. Outline and justify your choice of format and consider the implications of data format and data volumes in terms of storage, backup and access. How will the data be collected or created? Questions to Consider: - What standards or methodologies will you use? - How will you structure and name your folders and files? - How will you handle versioning? - What quality assurance processes will you adopt? Guidance: Outline how the data will be collected/created and which community data standards (if any) will be used. Consider how the data will be organised during the project, mentioning for example naming conventions, version control and folder structures. Explain how the consistency and quality of data collection will be controlled and documented. This may include processes such as calibration, repeat samples or measurements, standardised data capture or recording, data entry validation, peer review of data or representation with controlled vocabularies. Overview of functionality...

3: Documentation and Metadata DCC Checklist DCC Guidance and questions to consider Documentation and Metadata What documentation and metadata will accompany the data? Questions to consider: - What information is needed for the data to be to be read and interpreted in the future? - How will you capture / create this documentation and metadata? - What metadata standards will you use and why? Guidance: Describe the types of documentation that will accompany the data to help secondary users to understand and reuse it. This should at least include basic details that will help people to find the data, including who created or contributed to the data, its title, date of creation and under what conditions it can be accessed. Documentation may also include details on the methodology used, analytical and procedural information, definitions of variables, vocabularies, units of measurement, any assumptions made, and the format and file type of the data. Consider how you will capture this information and where it will be recorded. Wherever possible you should identify and use existing community standards. Overview of functionality...

4: Ethics and Legal Compliance DCC Checklist DCC Guidance and questions to consider Ethics and Legal Compliance How will you manage any ethical issues? Questions to consider: - Have you gained consent for data preservation and sharing? - How will you protect the identity of participants if required? e.g. via anonymisation - How will sensitive data be handled to ensure it is stored and transferred securely? Guidance: Ethical issues affect how you store data, who can see/use it and how long it is kept. Managing ethical concerns may include: anonymisation of data; referral to departmental or institutional ethics committees; and formal consent agreements. You should show that you are aware of any issues and have planned accordingly. If you are carrying out research involving human participants, you must also ensure that consent is requested to allow data to be shared and reused. How will you manage copyright and Intellectual Property Rights (IPR) issues? Questions to consider: - Who owns the data? - How will the data be licensed for reuse? - Are there any restrictions on the reuse of third-party data? - Will data sharing be postponed / restricted e.g. to publish or seek patents? Guidance: State who will own the copyright and IPR of any data that you will collect or create, along with the licence(s) for its use and reuse. For multi-partner projects, IPR ownership may be worth covering in a consortium agreement. Consider any relevant funder, institutional, departmental or group policies on copyright or IPR. Also consider permissions to reuse third-party data and any restrictions needed on data sharing. Overview of functionality...

5: Storage and Backup DCC Checklist DCC Guidance and questions to consider Storage and Backup How will the data be stored and backed up during the research? Questions to consider: - Do you have sufficient storage or will you need to include charges for additional services? - How will the data be backed up? - Who will be responsible for backup and recovery? - How will the data be recovered in the event of an incident? Guidance: State how often the data will be backed up and to which locations. How many copies are being made? Storing data on laptops, computer hard drives or external storage devices alone is very risky. The use of robust, managed storage provided by university IT teams is preferable. Similarly, it is normally better to use automatic backup services provided by IT Services than rely on manual processes. If you choose to use a third-party service, you should ensure that this does not conflict with any funder, institutional, departmental or group policies, for example in terms of the legal jurisdiction in which data are held or the protection of sensitive data. How will you manage access and security? Questions to consider: - What are the risks to data security and how will these be managed? - How will you control access to keep the data secure? - How will you ensure that collaborators can access your data securely? - If creating or collecting data in the field how will you ensure its safe transfer into your main secured systems? Guidance: If your data is confidential (e.g. personal data not already in the public domain, confidential information or trade secrets), you should outline any appropriate security measures and note any formal standards that you will comply with e.g. ISO 27001. Overview of functionality...

6: Selection and Preservation DCC Checklist DCC Guidance and questions to consider Selection and Preservation Which data should be retained, shared, and/or preserved? Questions to consider: - What data must be retained/destroyed for contractual, legal, or regulatory purposes? - How will you decide what other data to keep? - What are the foreseeable research uses for the data? - How long will the data be retained and preserved? - Guidance: Consider how the data may be reused e.g. to validate your research findings, conduct new studies, or for teaching. Decide which data to keep and for how long. This could be based on any obligations to retain certain data, the potential reuse value, what is economically viable to keep, and any additional effort required to prepare the data for data sharing and preservation. Remember to consider any additional effort required to prepare the data for sharing and preservation, such as changing file formats. What is the long-term preservation plan for the dataset? Questions to consider: - Where e.g. in which repository or archive will the data be held? - What costs if any will your selected data repository or archive charge? - Have you costed in time and effort to prepare the data for sharing / preservation? Guidance: Consider how datasets that have long-term value will be preserved and curated beyond the lifetime of the grant. Also outline the plans for preparing and documenting data for sharing and archiving. If you do not propose to use an established repository, the data management plan should demonstrate that resources and systems will be in place to enable the data to be curated effectively beyond the lifetime of the grant. Overview of functionality...

7: Data Sharing DCC Checklist DCC Guidance and questions to consider How will you share the data? Questions to consider: - How will potential users find out about your data? - With whom will you share the data, and under what conditions? - Will you share data via a repository, handle requests directly or use another mechanism? - When will you make the data available? - Will you pursue getting a persistent identifier for your data? Guidance: Consider where, how, and to whom data with acknowledged long-term value should be made available. The methods used to share data will be dependent on a number of factors such as the type, size, complexity and sensitivity of data. If possible, mention earlier examples to show a track record of effective data sharing. Consider how people might acknowledge the reuse of your data. Are any restrictions on data sharing required? Questions to consider: - What action will you take to overcome or minimise restrictions? - For how long do you need exclusive use of the data and why? - Will a data sharing agreement (or equivalent) be required? Guidance: Outline any expected difficulties in sharing data with acknowledged long-term value, along with causes and possible measures to overcome these. Restrictions may be due to confidentiality, lack of consent agreements or IPR, for example. Consider whether a non-disclosure agreement would give sufficient protection for confidential data. Overview of functionality...

8: Responsibilities and Resources DCC Checklist DCC Guidance and questions to consider Responsibilities and Resources Who will be responsible for data management? Questions to consider: - Who is responsible for implementing the DMP, and ensuring it is reviewed and revised? - Who will be responsible for each data management activity? - How will responsibilities be split across partner sites in collaborative research projects? - Will data ownership and responsibilities for RDM be part of any consortium agreement or contract agreed between partners? Guidance: Outline the roles and responsibilities for all activities e.g. data capture, metadata production, data quality, storage and backup, data archiving & data sharing. Consider who will be responsible for ensuring relevant policies will be respected. Individuals should be named where possible. What resources will you require to deliver your plan? Questions to consider: - Is additional specialist expertise (or training for existing staff) required? - Do you require hardware or software which is additional or exceptional to existing institutional provision? - Will charges be applied by data repositories? Guidance: Carefully consider any resources needed to deliver the plan, e.g. software, hardware, technical expertise, etc. Where dedicated resources are needed, these should be outlined and justified. Overview of functionality...

Group exercise (20 minutes) In groups of 4 or 5: Read the Data Management Plan; Compare responses to the STFC Guidelines; If you were a reviewer would you approve this plan? If not then what changes would you want the author to make?

DMP Online – What is it? A web-based tool that enables users to... Create, store and update multiple versions of Data Management Plans across the research lifecycle Meet a variety of specific data-related requirements (from funders, institutions, publishers, etc.) using carefully designed templates Get tailored guidance on best practice and helpful contacts, at the point of need Customise exports of DMPs in a variety of formats Share DMPs with others in order to facilitate communications within and beyond research projects * N.B. The templates have varying degrees of endorsement from funders, stakeholder communities, etc.

DMP Online V3.0 released in Spring 2012 Extensive user testing of v3.0 conducted in Winter 2013 Version 4.0 now being developed to better meet the needs of researchers and institutions

DMP Online V4.0 V4.0 will: Simplify the entire tool Ask Funder questions directly Give institutions more flexibility and control over their customisations Make guidance more obvious and easier to read Let users delve into more detailed guidance as needed

DMP Online V4.0

DMP Online V4.0

DMP Online V4.0

DMP Online V4.0

Thank you Kerry Miller Digital Curation Centre University of Edinburgh www.dcc.ac.uk/dmponline Kerry.miller@ed.ac.uk