Research data: lifecycle, plans and planning

Slides:



Advertisements
Similar presentations
Linking research & learning technologies through standards June Handle Workshop Towards a National Persistent Identifier Infrastructure Handle.
Advertisements

Depositing Data for Archiving Libby Bishop ESDS Qualidata, University of Essex Changing Families, Changing Food Meeting University of Sheffield 15 March.
The PREMIS Data Dictionary Michael Day Digital Curation Centre UKOLN, University of Bath JORUM, JISC and DCC.
A centre of expertise in digital information management UKOLN is supported by: Research Data & Institutions Roles & Responsibilities? Dr.
MANAGING YOUR DATA WELL …………………………………………
Introduction to Metadata: Describing datasets CQU Librarians workshop, 11 February 2015 Kathryn Unsworth - Data Librarian This work is licensed under a.
“Can you digitise this for me please?” The University of Auckland's approach to managing digitisation proposals John Garraway Digital Services & Information.
Data Lifecycles & Management Plans
How Libraries are involved in RDM CQU Librarians workshop, 11 February 2015 Kathryn Unsworth – Data Librarian This work is licensed under a Creative Commons.
When can I begin submitting works to the Digital Repository ?
Repositioning for repositories: making the move to science data management Gerry Ryder CSIRO Information Management & Technology 21 January 2009.
Research data: lifecycle, plans and planning SEQld Data Intensive 29 th January 2015 Kathryn Unsworth.
DATA LIFECYCLE & DATA MANAGEMENT PLANNING ……………………………………………………………………………………………………………………………….…………………………….. ……………………………………………………………......…... RESEARCH DATA.
Good practice in Research Data Management Module 1: Research data and lifecycles.
Undertaken by the ………………………………
Open Data & overview of data management in H February 2015.
Integrating Digital Curation in a Digital Library curriculum: the International Master DILL case study Anna Maria Tammaro University of Parma Florence,
DATA MANAGEMENT SUPPORT FOR RESEARCHERS …………………………………………
Makaba Macanda Research Data Management: Unisa experience LIASA: 03/10/2012.
Demystifying data … - management planning - management plans - interviews This work is licensed under a Creative Commons Attribution 4.0 International.
Procurement Innovation for Cloud Services in Europe CERN – 14 May 2014 Bob Jones (CERN) This document produced by Members of the Helix Nebula consortium.
SAVING AND STORING YOUR RESEARCH DATA : TIPS AND TOOLS Jane Fry Wendy Watkins Carleton University Library Data Centre and the Carleton Scholar Program.
What are research data? Kathryn Unsworth (Natasha Simons' slides) James Cook University Workshop 17 September 2015.
DIVISION OF LIBRARY SERVICES Research Data Management at CSU.
ACCESS for VALIDITY ACCESS for INNOVATION. Starting January 2011 for NEW proposals Not voluntary – “integral part” of proposal and FastLane Required for.
UVa Library Research Data Services
Research data management: an introduction Basic training course for information specialists Library – RDM Support Project.
Keeping abreast: How can librarians upskill for RDM? Kathryn Unsworth RDM for JCU Librarians workshop, 17 September 2015.
Getting to grips with Research Data Management 3 rd June 2015 Isabel Chadwick, Research Data Management Librarian
A survey based analysis on training opportunities Dr. Jūratė Kuprienė Framing the digital curation curriculum International Conference Florence, Italy.
Introduction to the RDMRose module Session 1.1 Sep-2012 Learning material produced by RDMRose RDMRose:
Elements of a Data Management Plan: Roles and Responsibilities Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date.
Preservation Strategies: Emerging standards for preservation Ronald Weaver National Snow and Ice Data Center Version 1.0 Review Date.
Research Data Management for Research Support staff 30 th June 2015 Isabel Chadwick, Research Data Management Librarian
Life Cycle Models & Principles Jake Carlson Associate Professor of Library Science Data Services Specialist Purdue University Libraries.
Because good research needs good data Funded by: Digital Curation for Researchers, 28th February 2013 The Shifting Research Data Management Policy Landscape.
OPPORTUNITIES FOR LIBRARIES 1 Image: This work is licensed under a Creative Commons Attribution 4.0 International LicenseCreative.
UKOLN is supported by: Digital Preservation Benefits Tools Project Dissemination Workshop Dr Liz Lyon, Associate Director, UK Digital Curation Centre Director,
Data in the NEES Data Repository Conditions for Current and Future Use and Re-Use Quake Summit 2012, Boston, Massachusetts July 12, 2012 Stanislav Pejša.
Research Grants and Projects Discovery Service ANDS Webinar 12th August 2015 Monica Omodei, ANDS.
Introduction to the Australian National Data Service (ANDS) CQU Librarians workshop, 11 February 2015 Kathryn Unsworth – Data Librarian 1 This work is.
Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth.
How can librarians upskill in RDM? CQU Librarians workshop, 11 February 2015 Kathryn Unsworth – Data Librarian This work is licensed under a Creative Commons.
11 Researcher practice in data management Margaret Henty.
Institutional Repositories July 2007 Intellectual property management : the DISA experience Dr D Peters DISA: Digital Innovation South Africa.
U.S. Department of the Interior U.S. Geological Survey August 24-25, 2011 Data Management Best Practices: FY11 Report.
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
EDLproject WP3 “Developing the European Digital Library” LIBER – EBLIDA workshop Digitisation of Library Material in Europe Copenhagen, October.
Open Access and Institutional Repositories. Accra, June 2007 Institutional repositories in SA research institutions: the DISA experience Dr D Peters.
Open Access & Institutional Repositories, Accra June 2007 Metadata and e-preservation Dr D Peters DISA: Digital Innovation South Africa.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Institutional data curation implementation 1st African Digital Curation Conference 12 February 2008.
Issues in RDM This work is licensed under a Creative Commons Attribution 4.0 International LicenseCreative Commons Attribution 4.0 International License.
United Nations Economic Commission for Europe Statistical Division GSBPM and Other Standards Steven Vale UNECE
Library and IT Services. Marc van den Berg Some UvT facts.
Digital Stewardship Lee Dotson Digital Initiatives Librarian University of Central Florida John C. Hitt Library Presentation available at
Preserving your research data for future use This work is licensed under a Creative Commons Attribution 3.0 Unported License.Creative Commons Attribution.
RoaDMaP LEEDS RESEARCH DATA MANAGEMENT PILOT Research data Management Workshop Welcome!
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
Research and Innovation Support Conference Library Support for Research Dr Stella Butler, University Librarian.
DATUM for Health – Healthy research needs healthy data I’ve collected my data, so what do I do with it now? Research data management Session 2 Data Curation.
PAUL STACEY Except where otherwise noted these materials are licensed under a Creative Commons Attribution 3.0 (CC BY)CC BY Open Licensing Requirements.
This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement number Irena Vipavc.
Publish your Data on the Tropical Data Hub Seeding the Commons Project Australian National Data Service e-Research Centre James Cook University This work.
Jeff Moon Data Librarian &
Accessing data – a user’s perspective
SowiDataNet - A User-Driven Repository for Data Sharing and Centralizing Research Data from the Social and Economic Sciences in Germany Monika Linne, 30.
A [very] short introduction to Data Management
Research data lifecycle²
Presentation transcript:

Research data: lifecycle, plans and planning CQU Librarians’ workshop, 11 February 2015 Kathryn Unsworth – Data Librarian Research data: lifecycle, plans and planning Kathryn Unsworth, ANDS How and why do we manage data through the data lifecycle? This segment unpacks the difference between “data management plan” and “planning for data management”. Q&A This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.

Research data management and lifecycle models… Why? There must be something in it as a lot of people have gone to the trouble of creating many different models – refer to link on this slide, which takes you to a document with various lifecycle models. Because research activity often takes place in stages it can be viewed as forming a lifecycle Similarly the activities around research data can be seen as cyclic or going through often iterative phases within the wider research lifecycle. Essentially, the Data lifecycle is not a separate or standalone set of activities, instead is an embedded part of the research lifecycle. [Data Lifecycle Models and Concepts Version 8]

“Data often have a longer lifespan than the research project that creates them. Researchers may continue to work on data after funding has ceased, follow-up projects may analyse or add to the data, and data may be re-used by other researchers. Well organised, well documented, preserved and shared data are invaluable to advance scientific inquiry and to increase opportunities for learning and innovation.” However the life of research data often extends past the discrete nature of a research project. Researchers tend to think about their research data within the scope of the research project, but the research lifecycle extends past this through to preservation and data re-use by the researcher or other researchers or might end their life via secure disposal (where required) after the required retention period. http://www.data-archive.ac.uk/create-manage/life-cycle

Some examples of research/data related lifecycle models So let’s have a look at some examples. Many of which you may well have seen before.

Re-using data follow-up research new research undertake research reviews scrutinise findings teach and learn Processing data enter data, digitise, transcribe, translate check, validate, clean data anonymise data where necessary describe data manage and store data Giving access to data distribute data share data control access establish copyright promote data Analysing data interpret data derive data produce research outputs author publications prepare data for preservation A simple version of the lifecycle outlining actions for each of the phases of the cycle. Preserving data migrate data to best format migrate data to suitable medium back-up and store data create metadata and documentation archive data http://www.data-archive.ac.uk/create-manage/life-cycle

ANDS data curation continuum Quite a high level view of the curation process – concentrating mainly on sharing (with collaborators and partners) and the dissemination/publication of research data as a research output. The data curation continuum begins in the private domain, with the creation of research data by a researcher (see Figure 1). There may be a large number of data objects which are updated frequently. At this stage, researchers typically manage their own data. Preservation and metadata may not be needed, and access to the data is limited At the other end of the continuum is the public domain. There are likely to be a smaller number of selected static data objects which have accrued more metadata, and which may be managed and preserved through institutional arrangements such as repositories. This data is more likely to be publicly accessible, possibly in association with print publications. http://ands.org.au/guides/curation.continuum.html

Research lifecycle - JISC JISC – Joint Information Systems Committee UK This lifecycle models appears to represent the research project and not beyond it – no mention of preservation or reuse I do however think Conceptualising the research idea, Seeking partners, Proposal writing and Publication are all part of the research process?? Any ideas for a new label? The larger blue circle, excepting “research process”, represents activities not directly a part of the “doing” of research, almost administrative tasks. http://webarchive.nationalarchives.gov.uk/20140702233839/http://www.jisc.ac.uk/whatwedo/campaigns/res3/jischelp.aspx

The DCC Curation Lifecycle Model Most popular or well-known of the research/data related lifecycle models. Note: additional labels and arrows have been added by presenter to point out the three “action” stages: Full lifecycle, sequential and occasional actions Breaks the stages of the lifecycle into three levels of actions: full lifecycle actions, sequential actions and occasional actions. My main criticism of this and some other data lifecycle models is that they provide an impression that data curation is a separate set of or extraneous activities. Also not sure about the placement of some of the sequential actions, e.g. as soon as a researcher creates or receives data they usually have to “store” it otherwise they won’t have data to appraise and select from. With the UK Data Archive model and this the DCC curation lifecycle model we can see the areas of consideration which assist researchers in shaping a data management plan, and assist institutions and funders with wider planning around the management of research data. http://www.dcc.ac.uk/resources/curation-lifecycle-model

United States Geological Survey Science (USGS) Data Lifecycle Model Really like the interactive version on the website. Well worth a look. Good explanations of each of the stages. Criticism: too linear – research is not always this sequential. Process and analyse are greyed out because these phases are seen as being quite unique to a research project and its methodology – domain specific. Like the representation of the full lifecycle actions. http://www.usgs.gov/datamanagement/why-dm/lifecycleoverview.php

OK, so what’s the practical use for these lifecycle models?

A Lifecycle model can provide the framework that informs planning for the management of data, from which researchers are given a clear roadmap for creating their data management plans The lifecycle models provide a great framework that informs planning for the management of data at the funder, institution and researcher levels. *Note: There is a difference between planning for the management of data and Data Management Plans

Data management planning (or planning for the management of data) & Data Management Plans Is there a difference? Just a quick word on terminology.

Let’s look at this visually. Data management planning is a strategic action undertaken by funders and institutions, with heavy investment by business units iwho support research. Data management planning (or planning the management of data) informs service development and delivery. Data management plans on the other hand are undertaken at the individual project level, but play a role in informing funder and institutional DM planning initiatives. Planning for the management of data is a strategic endeavour on the behalf of funders, institutions and researchers. Whilst Data Management Plans can inform strategic directions, they are more simply the formal documentation of how data will be managed by a researcher or research team over the course of their research project (and hopefully beyond) to improve the efficiency of the research project and to provide appropriate care for the data created, collected or compiled – safekeeping for later use/reuse. Let’s look at this visually.

To further illustrate practical applications To further illustrate practical applications. Let’s take a look at my favourite model. The UCF research lifecycle shows how these models can be used in planning – here we see a really nice referral map for research support at UCF, including the grey circles “activity not yet supported” Note: This lifecycle represents this university’s current research support status, your institution’s research lifecycle map would look quite different, particularly in who is providing what service and when within the research lifecycle. Your institution may have different gaps, i.e. services not yet supported, or may even provide services not even listed in this lifecycle. When looking at planning to manage research data, it’s important to take into account both the research lifecycle and the data lifecycle; the intersections between the two are interdependent and critical to informing all stakeholders of their roles and responsibilities in context of the whole [research process]. http://library.ucf.edu/ScholarlyCommunication/ResearchLifecycleUCF.php

Taken from the ANDS website, this table provides an overview of what needs to come together within an institution in order to meet funder requirements and at the same time provide researchers with the path of least resistance to good data management practice – through the creation of a living data management plan and the planning required to manage data to provide appropriate and fit for purpose services.

Data management plans are not mandatory inclusions in grant applications for Australian-funded research YET! It is worth noting that Data Management Plans are not mandatory yet, but recent changes with the inclusion of the “Management of Data” section in ARC grant applications, suggest that further change may come. A data management plan is, as the name suggests, a formal document which outlines how data will be managed over the course of a research project, quite different to the current ARC application requirements. In brief the DM plan gives details of what sort of data the project expects to create, collect or compile, and what will be done with it. This might include: A description of the type of data that will be used and where it will come from – how it will be created, or where it will be obtained from if pre-existing datasets are being used. How the data will be stored and kept safe during the project What plans there are for preserving the data after the end of the project, and for sharing it with other researchers

Group discussion: Lifecycle models: how practical are they? Consider the models you’ve just seen. As a group let’s discuss your overall impressions and whether you think they have a practical application in your role. Which of these, if any, would you prefer to use when discussing RDM with your library colleagues and/or researchers? Why? EXERCISE: Group discussion 10 minutes Scenario discussion: Consider the models you’ve just seen. Which of these, if any, would you prefer to use when discussing RDM with your library colleagues and/or researchers, and why?

This work is licensed under a Creative Commons Attribution 3 This work is licensed under a Creative Commons Attribution 3.0 Australia License ANDS is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy (NCRIS).