BIS TDWG Conference 29 October 2014, Jönköping, Sweden Publishing sample-based data using Darwin Core Archives Éamonn Ó Tuama, Markus Döring, Kyle Braak,

Presentation transcript:

BIS TDWG Conference 29 October 2014, Jönköping, Sweden Publishing sample-based data using Darwin Core Archives Éamonn Ó Tuama, Markus Döring, Kyle Braak, Tim Robertson, Olaf Bánki Global Biodiversity Information Facility (GBIF)

Why do this? A long-perceived need by GBIF to enable publishing of abundance (sample) data; a requirement of the EU project EU BON; meeting the needs of the GEO Biodiversity Observation Network (GEO BON).

Sample-based data: output of monitoring programmes; quantitative, calibrated; using standard protocols; repeatable, comparable. Used to detect changes and trends in populations.

Constraints: be available for testing in 2015; build on existing, widely used standards (Darwin Core); work within the existing tools ecosystem (IPT); … while acknowledging the promise of ontologies (BCO, OBOE, …).

Caveat Aim: demonstrate one way data can be exposed to maximize discoverability and reuse. Not in scope: establishing how data should be captured or modelled.

A use case: enabling the flow of sample-based data in support of GEO BON Essential Biodiversity Variables (EBVs).

Essential Biodiversity Variables: an EBV is a measurement required for study, reporting and management of biodiversity change; EBVs form an intermediate layer between raw data and indicators; GEO BON has identified six EBV classes.

EBV Class: Species populations

Building on the Darwin Core vocabulary

Darwin Core: a glossary of terms
Example terms: taxonRank, higherClassification, taxonConceptID, collectionCode, geodeticDatum, specificEpithet, coordinatePosition
collectionCode: The name, acronym, coden, or initialism identifying the collection or data set from which the record was derived. Examples: "Mammals", "Hildebrandt", "eBird".

7 essential terms for encoding sample data
1. eventID
2. projectID (new)
3. samplingProtocol
4. sampleSize (new)
5. sampleSizeUnit (new)
6. quantity (new)
7. quantityType (new)

New terms required
eventID: an identifier for the set of information associated with an Event; may be a global unique identifier or an identifier specific to the data set.
projectID: an identifier for a project with which the data is associated; use to link related data sets, e.g., a monitoring series; may be a global unique identifier or an identifier specific to the series.

New terms required
sampleSize: a numeric value for the time duration, length, area or volume involved in the sampling.
sampleSizeUnit: the unit of measurement used for sampling, e.g., minute, hour, day, metre, metre^2, metre^3.
Examples (sampleSize and sampleSizeUnit): 2 hour; 3 m2; 17 km; 1 litre

Unit of measurement vocabulary

Unit of measurement vocabulary: used in the IPT as the controlled list for sampleSizeUnit.
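As a rough illustration of what a controlled list for sampleSizeUnit implies, the sketch below checks a sampleSize and sampleSizeUnit pair before it is published. The unit set is only the handful of examples named in the slides, not the actual vocabulary used by the IPT, and the names `ALLOWED_UNITS`, `SampleSize` and `parse_sample_size` are hypothetical.

```python
from dataclasses import dataclass

# Assumption: a small subset of units taken from the slide examples,
# standing in for the real controlled vocabulary.
ALLOWED_UNITS = {"minute", "hour", "day", "metre", "metre^2", "metre^3"}


@dataclass(frozen=True)
class SampleSize:
    """A sampleSize value paired with its sampleSizeUnit."""
    value: float
    unit: str

    def __post_init__(self):
        # The slides define sampleSize as a numeric duration, length,
        # area or volume, so a non-positive value is treated as an error here.
        if self.value <= 0:
            raise ValueError(f"sampleSize must be positive, got {self.value}")
        if self.unit not in ALLOWED_UNITS:
            raise ValueError(f"sampleSizeUnit {self.unit!r} is not in the controlled list")


def parse_sample_size(size: str, unit: str) -> SampleSize:
    """Build a validated SampleSize from the raw text of the two columns."""
    return SampleSize(float(size), unit.strip())
```

For example, `parse_sample_size("2", "hour")` succeeds, while a unit outside the list raises `ValueError`.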

New terms required
quantity: the number or enumeration value of the entity or category being quantified in the sample. As such it is paired with quantityType.
quantityType: the entity being referred to by quantity, e.g., individuals, a percentage (e.g., of species, biomass, biovolume), or a scale type.
Examples (quantity and quantityType): 14 Individuals; r BraunBlanquetScale; 0.4 %Species; 31 %Biomass
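Because quantity only has meaning together with quantityType, a publisher might want to check the pairing before export. The following is a minimal sketch under stated assumptions: the function name `check_quantity_pair` is hypothetical, and the rule that a quantityType beginning with "%" must carry a number between 0 and 100 is inferred from the slide examples rather than taken from any specification.

```python
from typing import List, Optional


def check_quantity_pair(quantity: Optional[str], quantity_type: Optional[str]) -> List[str]:
    """Return a list of problems found; an empty list means the pair looks usable."""
    problems = []
    has_quantity = bool(quantity and quantity.strip())
    has_type = bool(quantity_type and quantity_type.strip())

    # The two terms are defined as a pair, so one without the other is flagged.
    if has_quantity != has_type:
        problems.append("quantity and quantityType must be supplied together")

    # Assumption: types such as "%Species" or "%Biomass" denote percentages.
    # Enumeration values (e.g. "r" on a Braun-Blanquet scale) are left unchecked.
    if has_quantity and has_type and quantity_type.strip().startswith("%"):
        try:
            value = float(quantity)
        except ValueError:
            problems.append("a percentage quantity must be numeric")
        else:
            if not 0 <= value <= 100:
                problems.append("a percentage quantity must lie between 0 and 100")
    return problems
```

With the slide examples, `check_quantity_pair("14", "Individuals")` and `check_quantity_pair("r", "BraunBlanquetScale")` both return an empty list.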

Publishing sample data using the IPT

Event Core: an event core is the logical way of organising a sampling event; related environmental measurements can be included in an extension; vegetation plot data (coverages) can be included separately from “occurrences”.

Darwin Core Archive components: Event core + Occurrence ext + Measurement-or-fact ext + Relevé ext + … + meta.xml + EML.xml + … = DwC Archive

Placing the terms in a Darwin Core Archive
Event Core (Event, Location, Geological Context): eventID, projectID (n), samplingProtocol, sampleSize (n), sampleSizeUnit (n)
Occurrence Extension (Occurrence, Taxon, Identification): eventID, quantity (n), quantityType (n)
(n) = proposed new term
For term definitions, see
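To make the placement of terms concrete, the sketch below generates a meta.xml descriptor with an Event core and an Occurrence extension linked on eventID. It is an illustration, not the IPT's output: the file names `event.txt` and `occurrence.txt` are assumed, and because the terms marked (n) were only proposals at the time, they are given a placeholder namespace (`http://example.org/proposed-terms/`) rather than a real term URI.

```python
import xml.etree.ElementTree as ET

ARCHIVE_NS = "http://rs.tdwg.org/dwc/text/"
DWC = "http://rs.tdwg.org/dwc/terms/"
# Assumption: placeholder namespace for the proposed (n) terms.
PROPOSED = "http://example.org/proposed-terms/"

# Column 0 of each file holds eventID and is declared through <id>/<coreid>,
# so only the remaining columns are listed here.
EVENT_TERMS = [PROPOSED + "projectID", DWC + "samplingProtocol",
               PROPOSED + "sampleSize", PROPOSED + "sampleSizeUnit"]
OCCURRENCE_TERMS = [DWC + "scientificName", PROPOSED + "quantity",
                    PROPOSED + "quantityType"]


def add_table(archive, tag, row_type, filename, id_tag, terms):
    """Append a <core> or <extension> element describing one tab-delimited file."""
    table = ET.SubElement(archive, tag, {
        "encoding": "UTF-8",
        "fieldsTerminatedBy": "\\t",   # written literally as \t in the XML
        "linesTerminatedBy": "\\n",
        "ignoreHeaderLines": "1",
        "rowType": row_type,
    })
    files = ET.SubElement(table, "files")
    ET.SubElement(files, "location").text = filename
    ET.SubElement(table, id_tag, {"index": "0"})
    for index, term in enumerate(terms, start=1):
        ET.SubElement(table, "field", {"index": str(index), "term": term})
    return table


def build_meta_xml() -> str:
    """Return the descriptor for an Event core plus an Occurrence extension."""
    archive = ET.Element("archive", {"xmlns": ARCHIVE_NS})
    add_table(archive, "core", DWC + "Event", "event.txt", "id", EVENT_TERMS)
    add_table(archive, "extension", DWC + "Occurrence", "occurrence.txt",
              "coreid", OCCURRENCE_TERMS)
    return ET.tostring(archive, encoding="unicode")


if __name__ == "__main__":
    print(build_meta_xml())
```

The `coreid` of the extension points at the column carrying eventID, which is what ties each occurrence row back to its sampling event.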

Event core

eventID | projectID | samplingProtocol | sampleSize | sampleSizeUnit | eventDate | location | decimalLatitude | decimalLongitude | …
C_1428 | RM065 | AQEM | 1.25 | m2 | | m Kinzig O3 Rothenbergen | | | …
C_1538 | RM065 | AQEM | 1.25 | m2 | | m Kinzig W1 Bulau | | | …

Occurrence extension

eventID | scientificName | quantity | quantityType | …
C_1428 | Baetis rhodani | 14 | individuals | …
C_1428 | Ephemera danica | 15 | individuals | …
C_1428 | Gyraulus albus | 2 | individuals | …
C_1538 | Serratella ignita | 318 | individuals | …

A sampling event uses a particular samplingProtocol with sampleSize and sampleSizeUnit, etc., and can record one or more taxa, each of which has a measurement (quantity and quantityType) associated with it.
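One reason to keep sampleSize on the event and quantity on the occurrence is that the two can be joined on eventID to derive comparable densities. The sketch below does this for the example rows, assuming the flattened table reads as sampleSize 1.25 with sampleSizeUnit m2 and that the files are tab-delimited; the function name `densities` and the inline data strings are illustrative only.

```python
import csv
import io

# Assumption: example rows transcribed from the slide, tab-delimited.
EVENTS_TSV = (
    "eventID\tprojectID\tsamplingProtocol\tsampleSize\tsampleSizeUnit\n"
    "C_1428\tRM065\tAQEM\t1.25\tm2\n"
    "C_1538\tRM065\tAQEM\t1.25\tm2\n"
)
OCCURRENCES_TSV = (
    "eventID\tscientificName\tquantity\tquantityType\n"
    "C_1428\tBaetis rhodani\t14\tindividuals\n"
    "C_1428\tEphemera danica\t15\tindividuals\n"
    "C_1428\tGyraulus albus\t2\tindividuals\n"
    "C_1538\tSerratella ignita\t318\tindividuals\n"
)


def densities(events_tsv: str, occurrences_tsv: str):
    """Join occurrences to their event and return (eventID, taxon, density, unit) tuples."""
    events = {
        row["eventID"]: row
        for row in csv.DictReader(io.StringIO(events_tsv), delimiter="\t")
    }
    results = []
    for occ in csv.DictReader(io.StringIO(occurrences_tsv), delimiter="\t"):
        # Only counts of individuals can be divided by the sample size;
        # percentages and scale values are skipped.
        if occ["quantityType"] != "individuals":
            continue
        event = events[occ["eventID"]]
        density = float(occ["quantity"]) / float(event["sampleSize"])
        unit = "individuals per " + event["sampleSizeUnit"]
        results.append((occ["eventID"], occ["scientificName"], density, unit))
    return results


if __name__ == "__main__":
    for event_id, taxon, density, unit in densities(EVENTS_TSV, OCCURRENCES_TSV):
        print(f"{event_id}\t{taxon}\t{density:.1f} {unit}")
```

Occurrence-only publishing would lose the sampleSize needed for this division, which is the practical argument for the event core.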

Adapting the IPT: now with Event Core

This project has received funding from the European Union’s Seventh Programme for research, technological development and demonstration under grant agreement No

Acknowledgement: EU BON and GEO BON partners, TDWG mailing list contributors and GBIF sample data workshop participants informed this work and are gratefully acknowledged.

Thank you. GBIF Secretariat, Universitetsparken 15, DK-2100 Copenhagen Ø, Denmark