Co-Chairs: Mike Hildreth (Notre Dame), Ruth Duerr (Ronin Inst.) + ?

Presentation transcript:

Preservation Tools, Techniques, and Policies IG: Research Meets Preservation – Tools at the Interface
Co-Chairs: Mike Hildreth (Notre Dame), Ruth Duerr (Ronin Inst.) + ?

PTTP IG: Introduction
NEW: Officially-approved IG!
Co-Chairs:
Ruth Duerr: Research Scholar, Ronin Institute; PI or Co-I on several NSF, NASA, and NOAA cyberinfrastructure and informatics grants; current President of AGU Earth and Space Science Informatics group
Mike Hildreth: Experimental Particle Physicist (CERN); PI of US-based DASPOS project; co-leader of various NSF workshops on open access to research data
+ ? (still) looking for a non-US-based co-chair

Today’s Agenda
Group Business, name discussion
PresQT Update (from Sandra Gesing)
Jerad Bales/David Tarboton: HydroShare & CUAHSI
Sunje Dallmeier-Tiessen: CERN Analysis Preservation Portal
Ramona Walls: CyVerse
Rachel Drysdale: ELIXIR
Ruth Duerr: EarthCube
Discussion: WG ideas? Future IG focus

Question/Discussion of our Name
Originally: “Preservation Tools, Techniques, and Policies” (PTTP)
Since then:
  A separate Policy group has arisen: different focus, but confusing
  We found that “preservation” means something different on different continents
  Title lacks the desired emphasis on researchers
What about: “ReTAiN” = “Researcher Tools for Attaining kNowledge preservation”
  The “p” is silent
Not sure if we are allowed to change our name mid-stream

Motivation for a Research Data & Software Preservation Quality Tool (PresQT)
Researchers often respond reluctantly and retroactively to funder and publisher mandates for data and software sharing. Depositing data can be quite labor intensive. Metadata enhancement, provenance reconstruction, reformatting and data documentation efforts can present significant barriers to timely and complete data sharing. Curators engaged near the end of the research life cycle often receive incomplete metadata, at-risk formats, and a paucity of data documentation. Reuse and reproducibility are jeopardized.
PresQT bridges gaps between existing digital library infrastructure, repositories and software reuse.
Hesburgh Libraries

Planning Phase Finished: Outlook for Implementation
https://presqt.crc.nd.edu/
presqt-contact-list@nd.edu

Additional Information

Some Thoughts…
This IG is intended to host a conversation that has been mostly lacking in the RDA discussions so far
Researcher/Data-Generator
Archivist/Data Scientist

Some Thoughts…
This session is intended to start a conversation that extends the RDA discussions so far
Researcher/Data-Generator
Archivist/Data Scientist

Some Thoughts…
This session is intended to start a conversation that extends the RDA discussions so far
Researcher/Data-Generator
How to bridge the gap? Tools & Techniques
Archivist/Data Scientist

From our Charter:
What data/software/artifacts/documentation (“knowledge products”) should be preserved for sharing, re-use, and reproducibility for a given research domain?
What tools are available for researchers to preserve these elements in a manner that does not obstruct or hinder their research?
What are the strengths and weaknesses of these tools?
Are there common features that could allow tools from one domain to be re-used elsewhere?
Are there tools that archives/repositories could provide that could make preservation much easier for researchers?
What are the longer-term development goals of each of these tools?

Tools & Techniques
Preservation Tools & Techniques are essential for data sharing, curation, reproducibility, and re-use
  you have to preserve the data before you share it!
  decidedly non-trivial, even daunting for researchers
Overlap between preservation and the needs of computational portability
  e.g. if you can wrap up your workflow for remote execution, you can preserve it
  potential synergies with data sharing here

Provocative(?) Statements
No generic preservation tool(s) exists
  workflows, data structures, use cases etc. are very discipline-specific
But at some base level, all preservation is the same
  must capture:
    data generation and/or processing and filtering algorithms (sometimes called data provenance)
    algorithm input parameters (metadata, workflow information)
    repeat until result is obtained
  capture should be “complete” for re-use
how do we find middle ground of commonality?
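
To make the "must capture" list above concrete, here is a minimal Python sketch of a per-step provenance record. It is an illustration only, not something taken from the slides or from any tool they mention: the names ProcessingStep, ProvenanceRecord, digest, drop_below and scale are all hypothetical, and hashing byte strings with SHA-256 stands in for whatever data identification a real discipline would use.

```python
import hashlib
import json
from dataclasses import dataclass, field, asdict


def digest(data: bytes) -> str:
    """Identify a piece of data by its SHA-256 digest (an assumption of this sketch)."""
    return hashlib.sha256(data).hexdigest()


@dataclass
class ProcessingStep:
    """One captured step: which algorithm ran, with which parameters, on which data."""
    algorithm: str        # name/version of the generation, processing or filtering code
    parameters: dict      # algorithm input parameters
    input_digests: list   # digests of the data consumed
    output_digest: str    # digest of the data produced


@dataclass
class ProvenanceRecord:
    """Hypothetical container that accumulates steps until the result is obtained."""
    steps: list = field(default_factory=list)

    def run(self, algorithm: str, func, data: bytes, **parameters) -> bytes:
        # Execute the step and record what was needed to repeat it.
        result = func(data, **parameters)
        self.steps.append(
            ProcessingStep(algorithm, parameters, [digest(data)], digest(result))
        )
        return result

    def to_json(self) -> str:
        return json.dumps(asdict(self), indent=2, sort_keys=True)


# Two toy "algorithms" standing in for discipline-specific processing.
def drop_below(data: bytes, threshold: int) -> bytes:
    return bytes(b for b in data if b >= threshold)


def scale(data: bytes, factor: int) -> bytes:
    return bytes(min(255, b * factor) for b in data)


record = ProvenanceRecord()
raw = bytes([1, 5, 9, 20])
filtered = record.run("drop_below-v1", drop_below, raw, threshold=5)
final = record.run("scale-v1", scale, filtered, factor=2)
```

Each step's output digest matches the next step's input digest, so the chain from raw data to final result can be checked after the fact. Calling record.to_json() gives a plain-text form that could be deposited alongside the data. What counts as "complete" capture for re-use remains the discipline-specific question the slide raises.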

Provocative(?) Statements
No tool will be adopted by researchers unless there is an “Economic Incentive” (enlightened self-interest)
  tool makes doing science easier, more efficient
  use of tools is mandated in order to obtain $$$
  use of tools is mandated in order to publish
  use of tools improves the training of students
  re-use possibilities worth the effort
  outreach/training worth the effort