Virtual Research Environments The story of Scratchpads

Slides:



Advertisements
Similar presentations
Near East Plant Protection Network for Regional Cooperation & Knowledge Sharing Food and Agriculture Organization of the United Nations An Overview on.
Advertisements

Facilitating biodiversity science through
Connect communicate collaborate View on eResearch 2020 study Draft report on “The Role of e-Infrastructures in the Creation of Global Virtual Research.
Supporting education and research E-learning tools, standards and systems Sarah Porter Head of Development, JISC.
Thee-Framework for Education & Research The e-Framework for Education & Research an Overview TEN Competence, Jan 2007 Bill Olivier,
Dimitris Koureas, Vince Smith & Simon Rycroft Natural History Museum London Linking data, services and communities using Virtual Research Environments.
Fourth Annual Summit | Feb | Tucson, AZ Scratchpads for community involvement for natural history collections Dr Dimitris Koureas Biodiversity.
Nurturing a community based sustainability model Support and outreach structures in Scratchpads Livermore L. & Koureas D. Biodiversity Informatics Group.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
1 Common Challenges Across Scientific Disciplines Laurence Field CERN 18 th November 2013.
Managing Research Data – The Organisational Challenge at Oxford James A J Wilson Friday 6 th December,
@dimitriskoureas making small data… big. Publications based on countless specimens, images, maps, keys and datasets Typically generated by small communities.
1 INFRA : INFRA : Scientific Information Repository supporting FP7 “The views expressed in this presentation are those of the author.
Session Chair: Peter Doorn Director, Data Archiving and Networked Services (DANS), The Netherlands.
Man-Sze Li IC Focus Enterprise Interoperability Research Roadmap SME aspects.
Dimitris Koureas, PhD Natural History Museum London Linking layers of biodiversity data: Informatics challenges for the long tail research RDA - Long Tail.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
DataTAG Research and Technological Development for a Transatlantic Grid Abstract Several major international Grid development projects are underway at.
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
GEO Implementation Boards Considerations and Lessons Learned (Document 8) Max Craglia (EC) Co-chair of the Infrastructure Implementation Board (IIB) On.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI strategy and Grand Vision Ludek Matyska EGI Council Chair EGI InSPIRE.
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number EGI vision for the EOSC Tiziana.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
On the Road towards Arts and Humanities e-Infrastructure in Germany Dr. Heike Neuroth Göttingen State & University Library Max Planck Digital Library Berlin.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
EGI-InSPIRE EGI-InSPIRE RI EGI strategy towards the Open Science Commons Tiziana Ferrari EGI-InSPIRE Director at EGI.eu.
Announcing the 2014 National Digital Stewardship Agenda.
Name - Date Technology-enhanced Learning: tomorrow’s school and beyond Pat Manson Head of Unit Technology Enhanced Learning Directorate General.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Aalto Data Repository.
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
E-Business Infrastructure PRESENTED BY IKA NOVITA DEWI, MCS.
Accessing the VI-SEEM infrastructure
Towards a European Open Science Cloud for research
By: Raza Usmani SaaS, PaaS & TaaS By: Raza Usmani
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Disciplinary Interoperability Framework
EOSC MODEL Pasquale Pagano CNR - ISTI
Steven Newhouse EGI-InSPIRE Project Director, EGI.eu
Summit 2017 Breakout Group 2: Data Management (DM)
Carlos Morais Pires European Commission Information Society and Media
Programme Board 6th Meeting May 2017 Craig Larlee
Presentation on Copernicus Dissemination
Short to Medium Term Priority issues for EGI, EMI, anD others
Low Carbon Development for climate resilient societies
EGI-Engage Engaging the EGI Community towards an Open Science Commons
Antonella Fresa Technical Coordinator
Open is not enough! Sustainability and Innovation in Scholarly Communication
VI-SEEM Data Repository
Who’s Who in Bioinformatics: The European Landscape
Connecting the European Grid Infrastructure to Research Communities
.Stat Suite built by the SIS-CC
EOSC & e-Science: enabling the digital transformation of Science
EOSC Governance Development Forum
EGI Webinar - Introduction -
EOSC services architecture
The Globus Toolkit™: Information Services
EOSCpilot All Hands Meeting 8 March 2018 Pisa
An ecosystem of contributions
ESciDoc Introduction M. Dreyer.
European Open Science Cloud All Hands Meeting Pisa 8-9 March 2018
Common Solutions to Common Problems
Three Uses for a Technology Roadmap
Brian Matthews STFC EOSCpilot Brian Matthews STFC
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Jisc Research Data Shared Service (RDSS)
Bird of Feather Session
EOSC-hub Contribution to the EOSC WGs
Interoperability and data for open science
OU BATTLECARD: Oracle Identity Management Training
Open Science Conference Ljubljani 22 May 2019
Presentation transcript:

Virtual Research Environments The story of Scratchpads Dimitris Koureas Natural History Museum London Coordinator, Distributed System of Scientific Collections (DiSSCo) Chair, Biodiversity Information Standards (TDWG) Co-chair, DCF IG, BDI IG & ASDOC WG @DimitrisKoureas

1 2 4 3 Facilitate the research data lifecycle Data collection & generation Facilitate the research data lifecycle Functional and operational characteristics differ significantly according to the communities of practice addressed 1 2 3 4 Data curation VREs enable communities to transfer their typical scientific practice into the digital realm Data publishing Data analysis

Successful examples of VREs across disciplines Past Projects (EC-funded) Current Projects (EC-funded) VI-SEEM MuG OpenDreamKit BlueBRIDGE VRE4EIC West-Life Despite their success for Virtual Research Environments to continue serving their communities we need to reinvent their role within the ever developing research e-infrastructures landscape

The challenges with VREs Built in isolation Developed redundant technological solutions for storage, authentication, computing Failed to develop good sustainability models (failed to attract users)

Build it and they will come? No! Confidence Commitment Longevity Agility Adaptability User monitoring Marketing Visibility Intuitive interface

650 Scratchpads Communities by 7,500 active registered users covering 125,000 taxa in 1,200,000 pages. An example from the Biodiversity domain Per month unique visitors to Scratchpads sites In total more than 5,000,000 visitors 70,000 unique visitors/month

What are Scratchpads? Hosted websites for biodiversity data Virtual research & publication platform Completely open access & open source Modular & flexible What are Scratchpads? They are hosted websites for biodiversity data, but the specifics and the kind of data are entirely up to the user. Scratchpads are designed to be a virtual research and publication platform, a place where you should be able to store, save, publish and reuse your work. They are completely open access, open source and free to use. They are modular and flexible. It is possible for other developers to work with us to write modules for specific functions or communities.

The Scratchpads concept Your data External data & services Scratchpads are fed by your data. Scratchpads help you structure your data in a way that makes them both human and machine readable. Allows you to contribute to global biodiversity databases and also aggregates all related to your data information from external resources.

The Scratchpads concept

What is the challenges today? Hard to benefit from technological advances and external services (e.g. cloud storage and computing) Locked-in certain technologies leads to substantial limitations (e.g. recruiting) The hybrid sustainability model (core funds and community in-kind support) not always working Continuity and uninterrupted service provision, while changing underlying architecture is difficult (expensive)

The future of Scratchpads Retain a thin layer of interfaces that: Enable standardised capture and share of data Collaboration and communication tools Robustly link to underlying e-infrastructures Cloud storage and cloud computing AAI services

The future of Scratchpads JSON Schema Microservice API CRUD Validation OpenAPI Docs JSON-LD The future of Scratchpads Create a database & custom schema matching internally modelled data Input data Map the custom schema Linked Open Data Schemas. Output Linked Open Data

Vertical Integration Utilise the common services of the European Open Science Cloud Currently: 7,000 Target: 15,000 Domain aggregators & repositories Scientific Outputs Authentication Storage Computing Backbone services

Horizontal Integration Interoperability across tools and services Science-Policy IPBES - EBV

The future of Scratchpads Merge with other similar community solutions Shift ownership from the institution to the corresponding Research Infrastructures

Not all research communities are ‘created equal’ For instance, researchers working on genomics, physics or astronomy have long appreciated the value of data sharing. By nurturing a culture of shared physical and computational infrastructure, open-source software and open data, they have embraced the principles of open science. The need to respect diversity and continuously developing needs Research communities are developing new practices organically and that they are best placed to explore which of these contribute to the advancement of their discipline The challenge starts with identifying how professionals work within their research communities and understanding the processes that lead to innovation becoming embedded into common practice (May and Finch 2009). Providing einfrastructures that seamlessly couple with the work practices of a particular professionrequires layers that abstract from a technical level and use the language of each profession. Such layers are typically web-based applications that address elements across the lifecycle of data and research, i.e. data mobilisation and discovery, experimentation, analyses, publication, and open collaboration. Such “Virtual Research Environments” (VREs) should act as intuitive and responsive interventions between researchers and core services.

European Investments in e-infrastructures 2007-2015 more than €740 Million Digital Agenda for Europe Over the last years investments have focused towards robust, reliable and interoperable services that generate global solutions for data sharing and preservation, high performance and cloud computing, user-authorisation and authentication

A global priority of changing the modus operandi of science Investments in development of e-infrastructures Provision of tools and services Training and incentives Digital Agenda for Europe Digital Collaborative Data-intensive research

Developing the European Open Science Cloud A set of robust, sustainable infrastructures that provide cloud services (authorisation, storage, computing) An ecosystem of common backbone services

Researchers are still reluctant to engage European Grid Infrastructure (a flagship initiative that delivers integrated computing services to European researchers) announced (Dec 2015) a user base of 35,959 (European Grid Infrastructure 2015)

Developing the Backbone infrastructure (core services) is not enough Services need to reach the client’s ‘house’

The last mile challenge for research e-infrastructures Koureas D. et al. 10.3897/rio.2.e9933 The last mile challenge for research e-infrastructures Borrowed from telecommunications and transportation Cost and complexity of ‘connecting’ end-users to the backbone (core) infrastructures is high when compared to the core infrastructure itself. VREs act more as the socio-cultural and less as technical brokers. Intermediate environments that connect communities of practice to a common e-infrastructure landscape.

Socio-cultural distance from open digital practices VRE Backbone service Bus (e.g. EOSC) VRE VRE

The development of EOSC does not reduce the importance of VREs benefits In fact, it strengthens it. As they become the needed mechanisms that enable communities of practice to engage with core services VREs benefit The EOSC reduces the operational costs of VRE up keeping and hardwires interoperability minimising fragmentation and redundancy

Developing the next generation of VREs User communities need to be able to: articulate and communicate their community-specific needs in regards to data and services, and translate these needs into clear functional requirements that will drive the development of VREs.

Developing the next generation of VREs VRE operators need to: develop VREs looking beyond the ephemeral timeframes of project-based approaches, invest early in building public-public and public-private partnerships that ensure sustainability and, robustly link VREs with existing underlying e-infrastructure, building on top of available backbone services.

Developing the next generation of VREs Funders need to: further acknowledge the pivotal role of VREs in support of user community engagement and, develop, with a particular eye to long-term sustainability, dedicated VRE funding programmes with targeted calls to discipline-specific communities.

Thank you @DimitrisKoureas d.koureas@nhm.ac.uk