Core Data Resources and FAIRification of Data

Presentation transcript:

Core Data Resources and FAIRification of Data
Biodiversity Data Integration IG
Presentation for Joint meeting: IG ELIXIR Bridging Force, IG Biodiversity Data Integration, WG BioSharing Registry
RDA P11 Berlin 2018
Wouter Addink, DiSSCo Coordination team member

From data... to knowledge?

Moving away from siloed data
Global Biodiversity Knowledgebase
Data resources: Atlas of Living Australia; Biodiversity Heritage Library; Catalogue of Life; GBIF; Barcode of Life; iDigBio; Encyclopedia of Life; Treebase

Hot topics in the Biodiversity Data community
A.H. Ariño et al.: TDWG Now and Then, TDWG, Costa Rica, 7-XII-2016

UN Sustainable Development Goals
Required: high quality integrated data and services; coordinated strategy

Linking dispersed information is imperative
Example: Invasive Alien Species (IAS)
Urgent challenge
UN Sustainable Development Goals (Target 15.8)
Economic costs of IAS for the EU: €20 billion / year (Kettunen et al. 2009)
Institutional collections; Facilities & information; Climate data; Ecological monitoring data; Genomic information; Species distribution & genomics
Linked Data; Other Research Infrastructures
Analysis / Interpretation Services; Modelling / Prevention / Early detection

RI landscape for linking biodiversity information (ENVRIplus, 2017)
DATA, MEASUREMENTS, MODELLING
Species/organisms observatories; Experiments; RIs providing data on external factors; Integrative RIs; Biodiversity standards / Reference data; Taxonomic backbone; System; Modelling / Prevention / Early detection
Species distribution & genomics; Institutional collections
Alien Invasive species use case

Alignment of Projects for effective RI development - DiSSCo example
SAP: Strategic Alignment of Projects
Figure: project timeline with budgets and periods: ICEDIG €3M | 2018-2020 CoL+ €0.5M | 2017-2020 €10M | 2014-2017 DiSSCo Design Study €10M | 2019-2021 SYNTHESYS+ €2M | 2024-2025 DiSSCo Deploy €53M | 2021-2024 DiSSCo Construct DiSSCo Prepare €20M | 2019-2023 €0.5M | 2018-2022 MOBILISE

DiSSCo: A new European infrastructure
114 National Facilities, 21 Countries
Largest ever formal agreement between natural science collection facilities
Centralised governance model already in place
Supporting network of working groups
DiSSCo builds on top of a mature community of institutions
Strategic collaboration already underpinned by sound governance and decision-making structures

Challenges in the Biodiversity Data domain
Accelerate generation and linking of information into research data objects
Ensure provenance and quality
Provide reliable, unified, certified services and harmonised policies
Provide services to other Research Infrastructures
Connect publishing and use
Improve feedback and ability to reference data

Stakeholders in FAIRification of data
Stakeholders: e-Infrastructures; Standardisation bodies; Technical communities; Research Infrastructures
Specification of core cloud services | Service Level Agreements
New community data standards
Recommendations - specifications | Knowledge exchange
User requirements | Systems interoperability
Data, workflow and systems integrity
FAIR principles

Linking Biodiversity Data & Core data resources
Species names: Catalogue of Life Plus project. Joint development of a practical, community-based approach to rapid completion of a global taxonomic backbone: (re-)connects taxonomic research with specimen data; quality control and enhanced linkages; contribution of taxonomic expertise through a clearinghouse.
DNA barcoding: iBOL, the International Barcode of Life project, is the largest biodiversity genomics initiative ever undertaken, to create a digital identification system for life.
Specimen identifiers: CETAF Identifiers initiative, a joint Linked Open Data (LOD) compliant identifier system developed by the CETAF Information Science and Technology Committee (ISTC), providing mechanisms for consistently referencing individual specimens.
Literature references: BHL, the Biodiversity Heritage Library, collaboratively makes biodiversity literature openly available to the world.

FAIRification process adopted by GO FAIR
Steps:
1. Retrieve non-FAIR data
2. Analyse the retrieved data
3. Define the semantic model
4. Make data linkable
5. Assign license
6. Define metadata for the dataset
7. Deploy FAIR data resource
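
The seven steps above can be sketched roughly in code. The following Python is a minimal illustration and not the GO FAIR tooling: the input columns (`name`, `cat_no`, `country`, `note`), the base URI `https://example.org/specimen/` and the helper names are all hypothetical, and the semantic model is assumed to be a simple mapping from local columns to Darwin Core term IRIs.

```python
import json

# Step 3 (assumed): semantic model as a mapping from local column names
# to Darwin Core term IRIs. The column names on the left are hypothetical.
SEMANTIC_MODEL = {
    "name": "http://rs.tdwg.org/dwc/terms/scientificName",
    "cat_no": "http://rs.tdwg.org/dwc/terms/catalogNumber",
    "country": "http://rs.tdwg.org/dwc/terms/country",
}
BASE_URI = "https://example.org/specimen/"  # hypothetical identifier namespace
LICENSE = "https://creativecommons.org/publicdomain/zero/1.0/"  # CC0, chosen for illustration


def analyse(rows):
    """Step 2: list every column that occurs in the retrieved rows."""
    return sorted({column for row in rows for column in row})


def make_linkable(rows, model, base_uri):
    """Step 4: give each row a URI and replace mapped columns by term IRIs."""
    linked = []
    for row in rows:
        record = {"@id": base_uri + str(row["cat_no"])}
        for column, value in row.items():
            if column in model:
                record[model[column]] = value
        linked.append(record)
    return linked


def fairify(rows, title):
    """Run steps 2 to 6 on rows that were already retrieved (step 1)."""
    columns = analyse(rows)
    unmapped = [c for c in columns if c not in SEMANTIC_MODEL]
    records = make_linkable(rows, SEMANTIC_MODEL, BASE_URI)
    return {
        "title": title,                # step 6: dataset-level metadata
        "license": LICENSE,            # step 5: explicit license
        "recordCount": len(records),
        "unmappedColumns": unmapped,   # columns the model does not cover yet
        "records": records,
    }


# Step 1: a single hypothetical non-FAIR row.
raw = [{"name": "Tarenaya spinosa", "cat_no": "61643", "country": "Brasil", "note": "x"}]
dataset = fairify(raw, "Example dataset")

# Step 7 is represented here only by serialising the result.
print(json.dumps(dataset, indent=2, ensure_ascii=False))
```

Reporting the unmapped columns rather than silently dropping them keeps the gap between the local data and the semantic model visible, which is where most of the analysis effort in step 2 tends to go.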

Some issues for FAIRification of Biodiversity Data
No infrastructure yet for sensitive biodiversity data
No standard ontologies
Semantic Web and Linked Data technologies not widely used in community
No common standard for metadata, and current standards incomplete for giving attribution for the maintenance, curation, and digitization of collections (RDA / TDWG Metadata Standards WG is working on this)

The need for taxon concept identifiers
From: D. Remsen, The use and limits of scientific names in biological informatics, http://zookeys.pensoft.net/articles.php?id=6234

Data classes in the biodiversity data domain
Occurrence; Specimen; Taxon Concept; Interaction; Taxon Name; Publication; Trait; Collection; Sequence; Gene

Meta-model interpretation: Relations in occurrence data
Record <Class=Occurrence> BRA:UFPB:JPB:0000061643
Nodes: Occurrence BRA:UFPB:JPB:0000061643; Event <Unnamed> 20 Jan 2016; Observer Soares Neto, RL; Place Campus I da UFPB (7.1375 S, 34.84586 W); Place João Pessoa; Place Paraíba; Place Brasil; Specimen 61643; TaxonConcept <Species>; TaxonName Tarenaya spinosa; TaxonConcept <Genus>; TaxonName Tarenaya; TaxonConcept <Family>; TaxonName Cleomaceae; Collection JPB; Institution UFPB
Relations: fromEvent; observedBy; atLocality; includedIn; hasEvidence; identifiedAs; hasName; inCollection; hasOwner; hasCustodian
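
One way to read the diagram above is as a small graph of subject-relation-object triples. The Python below is a sketch under that assumption: the node and relation names come from the slide, but which node each relation starts and ends at is an assumed reading of the flattened figure (for example, that `observedBy` hangs off the Event and that both `hasOwner` and `hasCustodian` point from the Collection to the Institution). The helper names `build_index` and `follow` are hypothetical.

```python
from collections import defaultdict

OCC = "Occurrence BRA:UFPB:JPB:0000061643"
EVENT = "Event 20 Jan 2016"

# Assumed edges: endpoints are reconstructed, not confirmed by the source.
TRIPLES = [
    (OCC, "fromEvent", EVENT),
    (EVENT, "observedBy", "Observer Soares Neto, RL"),
    (EVENT, "atLocality", "Place Campus I da UFPB"),
    ("Place Campus I da UFPB", "includedIn", "Place João Pessoa"),
    ("Place João Pessoa", "includedIn", "Place Paraíba"),
    ("Place Paraíba", "includedIn", "Place Brasil"),
    (OCC, "hasEvidence", "Specimen 61643"),
    ("Specimen 61643", "identifiedAs", "TaxonConcept <Species>"),
    ("TaxonConcept <Species>", "hasName", "TaxonName Tarenaya spinosa"),
    ("TaxonConcept <Species>", "includedIn", "TaxonConcept <Genus>"),
    ("TaxonConcept <Genus>", "hasName", "TaxonName Tarenaya"),
    ("TaxonConcept <Genus>", "includedIn", "TaxonConcept <Family>"),
    ("TaxonConcept <Family>", "hasName", "TaxonName Cleomaceae"),
    ("Specimen 61643", "inCollection", "Collection JPB"),
    ("Collection JPB", "hasOwner", "Institution UFPB"),
    ("Collection JPB", "hasCustodian", "Institution UFPB"),
]


def build_index(triples):
    """Index objects by (subject, relation) for direct lookup."""
    index = defaultdict(list)
    for subject, relation, obj in triples:
        index[(subject, relation)].append(obj)
    return index


def follow(index, start, relation):
    """Follow one relation repeatedly from start and return the nodes reached."""
    chain, node, seen = [], start, {start}
    while True:
        targets = index.get((node, relation), [])
        if not targets or targets[0] in seen:
            return chain
        node = targets[0]
        seen.add(node)
        chain.append(node)


INDEX = build_index(TRIPLES)
place_chain = follow(INDEX, "Place Campus I da UFPB", "includedIn")
taxon_chain = follow(INDEX, "TaxonConcept <Species>", "includedIn")
print(place_chain)
print(taxon_chain)
```

Keeping taxon concepts and taxon names as separate nodes, as the slide does, means the `includedIn` hierarchy is expressed between concepts while names stay attached through `hasName`.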