Data Services at CSC ©2016 OKM ATT 2014-2017 initiative www.openscience.fi Licensed under Creative Commons BY 4.0.


CSC and Customers. Services: Computing Services; Research Information Management Services; Funet Network Services; Education Management and Student Administration Services; Identity and Access Management Services; Datacenter and Capacity Services (IaaS); Training Services; Consultation and Tailored Solutions. Customers: Ministry of Education and Culture; other ministries and state administration; higher education institutions; research institutions; companies.

Data Service Portfolio. Data services for open science (details later). HPC archive: 20 GB to 5 TB default quota per user, iRODS (see IDA). Cloud (ePouta, cPouta): ePouta is secure and private for organizations; cPouta is general-purpose IaaS (includes FGCI resources). Databases for HPC and others. HPC data analysis: off-the-shelf and tailored services. EUDAT: pan-European research data infrastructure, training and consultancy; B2DROP*, B2Share*, B2Safe, B2Stage, B2Find*; coordinated by CSC.

Open Science Services Development. Policy work: e.g. requirements for higher education institutions. Framework architecture: a target-level description of open science and research processes, services, information, data structures, actors, roles and information system services; defines the framework for national common solutions, components, data management, information systems, and local service design and implementation; finished in 2015, put into practice in 2016. Long-term preservation model for research data. Recognize the most important or unique research output. Ensure linkages between publications, data and methods. Make services easy to use, efficient and adaptable. Enable organizations to easily adopt the services in their own operations.

Data Services for Open Science: Etsin research data finder; IDA research data storage service; AVAA open data publishing portal; PAS digital preservation solution; Tuuli data management planning tool; Research infrastructure databank.

Data lifecycle and services (diagram). Lifecycle stages: data planning, data search, data analysis, data storage, data sharing, data reuse. Also shown: Open science & research handbook; PAS.

Current architecture (diagram). Components and labels shown: AVAA, Liferay, AVAA sites, IDA, iRODS, Davis, SUI, My files, Etsin, Apache, CKAN, Reetta, REMS, IDA-AVAA download, IDA-REMS download. Users shown: anyone (browser, browser/API), Haka user (browser, browser/API), IDA user (browser, folder, command line). Connections shown: http, https, WebDAVS, irods, SQL, OAI-PMH, RESTful API.

research data storage service IDA

IDA research data storage. Offered since 2012 for projects in Finnish universities, universities of applied sciences and the Academy of Finland. Organizational usage quotas vary according to size, from 30 TB to 1260 TB. Open-source iRODS technology provides secure storage procedures with data replication. openscience.fi/ida

IDA research data storage. Currently 130 projects, 500 registered users, 19 million data files and 470 TB used. The data owner decides on openness and use policy. Diagram labels: user, producer, data, metadata, metadata catalogue and open data portal, research organization’s service.

Plans for new IDA. Needs: everyday storage and sharing; a medium-term (~10 years) preservation buffer for PAS (long term); data lifecycle support (storage, “freezing” and hand-over); centralized metadata management (data registration in an external metadata resource, linking files to datasets to storage packages); access management improvements (roles, organizations). Upgrades are planned for multiple layers: software (iRODS vs. OwnCloud?), storage solution (scale-out) and system architecture.

research data finder Etsin

Etsin research data finder. National metadata catalogue for research data. Adheres to the national metadata model. URN PIDs are assigned, with support for other IDs as well. Currently 9000+ dataset metadata entries published. etsin.avointiede.fi
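As an illustration of the URN persistent identifiers mentioned above, here is a minimal Python sketch that splits an identifier into its namespace identifier (NID) and namespace-specific string (NSS). It only checks the general `urn:<NID>:<NSS>` shape and is not a full validator. The function name `split_urn` and the example identifier are hypothetical and do not come from Etsin.

```python
import re

# Simplified URN shape: "urn:", a 2-32 character namespace identifier (NID),
# a colon, then a non-empty namespace-specific string (NSS).
# This is an illustrative pattern, not a complete URN syntax check.
URN_PATTERN = re.compile(
    r"^urn:(?P<nid>[a-z0-9][a-z0-9-]{0,30}[a-z0-9]):(?P<nss>\S+)$",
    re.IGNORECASE,
)


def split_urn(identifier):
    """Return (nid, nss) for a URN-shaped string, or None otherwise."""
    match = URN_PATTERN.match(identifier.strip())
    if match is None:
        return None
    # The NID is case-insensitive, so normalise it to lower case.
    return match.group("nid").lower(), match.group("nss")


# Hypothetical identifier, used only to show the call.
print(split_urn("urn:nbn:fi:example-123"))
```

A catalogue that also supports other ID types could use a check like this to decide which resolver to hand an identifier to.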

Etsin research data finder. Extension of the CKAN data portal and the Solr search engine. DDI and OAI-PMH metadata harvesting from outside sources. Lately: UX improvements, new datasets harvested, plans for integration with research organization catalogues.
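To make the OAI-PMH harvesting step more concrete, the following Python sketch builds a `ListRecords` request URL and extracts record identifiers and the resumption token from a response. It is a sketch under stated assumptions, not Etsin's harvester: the base URL, the set name and the sample response are invented for illustration, and only the `verb`, `metadataPrefix` and `set` parameters and the XML namespace come from the OAI-PMH protocol itself.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# XML namespace used by OAI-PMH 2.0 responses.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build a ListRecords request URL for a repository's OAI-PMH endpoint."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


def parse_list_records(xml_text):
    """Return (record identifiers, resumption token or None)."""
    root = ET.fromstring(xml_text)
    identifiers = [el.text for el in root.iter(f"{OAI_NS}identifier")]
    token_el = root.find(f"{OAI_NS}ListRecords/{OAI_NS}resumptionToken")
    token = token_el.text if token_el is not None and token_el.text else None
    return identifiers, token


# Invented sample response; a real harvester would fetch this over HTTP.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:1</identifier></header></record>
    <record><header><identifier>oai:example.org:2</identifier></header></record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

print(list_records_url("https://harvest.example.org/oai", set_spec="demo"))
print(parse_list_records(SAMPLE))
```

A harvester would repeat the request with the returned resumption token until no token remains, then map each record into the catalogue's metadata model.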

open data publishing portal AVAA

AVAA open data publishing platform. For producers and users of open data since 2013. Pilot cases of research data and access tools developed together with researchers. Open data from IDA. Roughly 3000 users yearly and more than 10 million API requests. avaa.tdata.fi

AVAA open data publishing platform. Applications and interfaces for data download, analysis and visualizations. Applications are developed as open source: github.com/avaa-csc/

digital preservation solution PAS

Layers in data storage and discovery. Managing status (is data integrity intact? is the data available?). Managing location (where is the data?). Managing roles (who owns rights to the data? who is responsible for sustainability?). Managing risks (how to keep data discoverable and usable? what actions are needed?). Source: McDonald 2008.
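The "managing status" layer can be illustrated with a simple fixity check: compare a file's current checksum with one recorded earlier, and report whether the file is available and intact. This is a minimal Python sketch, not a description of how PAS or IDA implement it. The function names, the choice of SHA-256 and the three status labels are assumptions made for the example.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1024 * 1024):
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_status(path, expected_sha256):
    """Answer both status questions: is the file available, and is it intact?"""
    if not Path(path).is_file():
        return "missing"
    if sha256_of(path) == expected_sha256:
        return "intact"
    return "corrupted"
```

In practice the expected checksum would be stored with the dataset's metadata at ingest, and a check like this would run periodically against each stored copy.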

PAS digital preservation solution. PAS infrastructure is operational. National Digital Library digital preservation (KDK-PAS): in production since 5 November 2015; under the administration of the Ministry of Education and Culture; preserving cultural heritage; ISO 27001 audited service. Research Data PAS: same infrastructure as KDK-PAS; preservation model published in 12/2015; in the piloting phase; to production in stages starting in 2017.

data management planning tool dmpTuuli

dmpTuuli data management planning tool. What: a data management planning (DMP) tool for Finnish research organizations. How: a collaborative project with a user-driven approach. Why: a DMP is an integral part of good research practice and ensures research integrity and quality. Where: www.dmptuuli.fi. When: piloting with national funders in 2016.

dmpTuuli. A data management plan (DMP) helps you manage your data, meet funder requirements and help others use your data if it is shared. DMPTuuli helps you write data management plans. DMPTuuli is provided by the Finnish Tuuli project. The project has worked closely with researchers and research funders to produce guidance and templates that help researchers write an effective DMP covering the whole lifecycle of a project, from the bid-preparation stage through to completion. DMPTuuli is based on the DMPonline code developed by the UK's Digital Curation Centre.

Data management plan. A living document: updatable and reviewable. Create your data management plan early and review it regularly throughout the research project. Describes what data will be collected and how, the usage and storage of your data, and how to enable the reuse of your data after the project. Covers issues concerning responsibilities, data ownership and licensing, and costs.

Research infrastructure databank

Research infrastructure databank. Unified descriptions of RIs and services. Promotes openness and sharing. Centralized and easily updatable. For researchers, RI service providers and funders. infras.openscience.fi

RI DB: features in development. PIDs for RIs. Open API. Updates through harvesting. Linking data: publications, data, funding, projects, organizations, resources etc.

Common challenges

Common Challenges. Metadata management and creation, metadata reserve. Levels of abstraction in research data management: file vs. dataset. Researcher vs. organization, handover. Roles of funders. International data.

Thank you!