OBIS Data flows Dave Watts 8 March 2017 Data Centre, O&A.

Slides:



Advertisements
Similar presentations
BIS TDWG Conference, New Orleans, 2011 GBIF: Issues in providing federated access to digital information related to biological specimens David Remsen Senior.
Advertisements

Entomological Collections Network Meeting, Indianapolis, IN 13 December 2009 Darwin Core Ratified in the Year of Darwin Gail E. Kampmeier Illinois Natural.
OBIS Australia – Regional Node for the Ocean Biogeographic Information System (OBIS) OBIS Australia is an operational component of the Census of Marine.
Ocean Biogeographic Information System Edward Vanden Berghe
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen ECAT Program Officer September G A Darwin-Core Archive solution to publishing and.
MEDIN Partners Meeting Sept 2010 DASSH – The Archive for Marine Species and Habitats Dan Lear DASSH Project Co-ordinator Marine.
II Course on GBIF Node Management Arusha, Tanzania 31 st October and 1 st November 2008 Tim ROBERTSON Systems Architect GBIF Secretariat Data Publishing.
MEDIN Data Guidelines. Data Guidelines Documents with tables and Excel versions of tables which are organised on a thematic basis which consider the actual.
GLOBAL BIODIVERSITY INFORMATION FACILITY The Global Biodiversity Information Facility (GBIF ): The distributed architecture Samy Gaiji Head of Informatics.
Controlled Vocabularies (Term Lists). Controlled Vocabs Literally - A list of terms to choose from Aim is to promote the use of common vocabularies so.
Introduction to OBIS-USA Biological Data, Applications, & Relationships March 14, 2011.
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen ECAT Program Officer October DarwinCore Archives – Simplified Format for publishing.
Online Data Flanders Marine Data & Information Centre InnovOcean site SeadataNet Annual Meeting, Madrid 2009.
Oceans Portal Workshop 30 th March 2004 Healthy oceans: cared for, understood and used wisely for the benefit of all, now and in the future healthy oceans:
GLOBAL BIODIVERSITY INFORMATION FACILITY TDWG 2009, Montpelier, November 12, 2009 Dag Endresen (NordGen)Samy Gaiji (GBIF) Dag Endresen (NordGen) & Samy.
Standards and tools for publishing biodiversity data Yu-Huang Wang June 25, 2012.
Zope/Plone/Python for Research Ben Best OBISSEAMAP mapping marine megavertebrates
Science Environment for Ecological Knowledge: EcoGrid Matthew B. Jones National Center for.
GLOBAL BIODIVERSITY INFORMATION FACILITY Éamonn Ó Tuama Senior Programme Officer, IDA 21 June Metadata publishing with the IPT.
1 GBIF and Ocean Biodiversity, OBI'07 Conference, Oct 2-4, 2007, Dartmouth, Nova Scotia GBIF and Ocean Biodiversity Building the data web with OBIS Éamonn.
Knowledge base for growth and innovation in ocean economy: assembly and dissemination of marine data for seabed mapping LOT NO: 5 – BIOLOGY Simon Claus.
BIEN Confederated DB (S) Analytical DB(s) Heterogeneous source database(s) of Plots/Specimens/Occurrences Synonymy Names Reference taxonomy *** *** Feedback.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
CSIRO Marine Research Data Centre linked databases - CAAB, MarLIN and Divisional Data Warehouse.
Experts Workshop on the IPT, v. 2, Copenhagen, Denmark The Pathway to the Integrated Publishing Toolkit version 2 Tim Robertson Systems Architect Global.
Exploring Spatial Data Infrastructure in an Open Source World Jacqueline Lowe UNC-Asheville National Environmental Modeling and Analysis Center Jacqueline.
Definition of an Observation In general, an observation represents the measurement of some attribute, of some thing, at a particular time and place. Observations.
Challenge Grant Update: Linking the Network of Natural Heritage Biodiversity Data to the Environmental Information Exchange Network.
LTER Data Management Margaret O’Brien Santa Barbara Coastal Long Term Ecological Research (LTER) Project Santa Barbara Channel Biodiversity Observation.
Google Refine for Data Quality / Integrity. Context BioVeL Data Refinement Workflow Synonym Expansion / Occurrence Retrieval Data Selection Data Quality.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
IDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF ).
Laura Russell Programmer VertNet Buenos Aires (Argentina) 28 September 2011 Training course on biodiversity data publishing and.
Dag Endresen Knowledge Systems Engineer GBIF New Orleans (Louisiana, USA) 20 October 2011 Biodiversity Information Standards, TDWG.
Canadensys update. Canadensys: what is it? A Canadian network of 11 universities, 5 botanical gardens and 2 museums. Over 25 biological collections and.
Fábio Lang da Silveira – This talk on behalf of OBIS International Committee and OBIS North & South America Nodes USP – Zoology.
© 2006 University of Kansas An LSID resolver for specimens and a digression into issues raised by the use of GUIDs Steve Perry
Hellenic Centre for Marine Research (HCMR) MedOBIS - Ocean Biogeographic Information System for the Eastern Mediterranean and Black Sea.
P088; Presented in Canberra, 27 th March, 2008 GR000: Presented in Fremantle on 20 th October, 2008 GAIA RESOURCES Experiences in mobilizing biodiversity.
1 openModeller Presentation Plan: Overview of openModeller OMWS: an open standard for distributed ecological niche modelling openModeller in relation to.
The New GBIF Data Portal Web Services and Tools Donald Hobern GBIF Deputy Director for Informatics October 2006.
Lifewatch tools. Software 2 data Species observations > 40 M records Tracking data birds : 1pos / 10min Taxonomy > names Environmental data….
TapirLink: Enabling the transition to TAPIR Renato De Giovanni TDWG 2007.
GLOBAL BIODIVERSITY INFORMATION FACILITY Vishwas Chavan Senior Programme Officer for DIGIT 10 th Meeting of the GBIF Participant Node Managers Committee.
Laura Russell VertNet Meherzad Romer NatureServe Canada John Wieczorek
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen Senior Programme Officer, ECAT 3 Oct th Nodes Meeting.
IPT + Darwin Core OBIS XML Schema OBIS Database Schema Explained Mike Flavell OBIS Data Manager OBIS Nodes Training Course, Oostende, Belgium, 6 May 2014.
Getting Data into AfrOBIS. AfrOBIS is part of the International OBIS Network OBIS is a strategic alliance of hundreds of scientists and organisations.
OBIS IODE PO OBIS INCOIS OBIS- SEAMAP Separate files OBIS Nodes Data providers Separate files GBIFLifeWatchGEOSSEOL,…CBDFAOISA Fail-over mirrorGeo-load.
GBIF NODES Committee Meeting Copenhagen, Denmark 4 th October 2009 The GBIF Integrated Publishing Toolkit Alberto GONZÁLEZ-TALAVÁN Programme Officer for.
GB22 TRAINING EVENT FOR NODES – 4 OCTOBER 2015 Session 02: 2015 Data Publishing Landscape Laura Russell.
COINAtlantic Expanding OBIS Canada partnerships and Visualizing OBIS Canada IPT Resources SG-OBIS-V May 25 – 27, 2016 UNESCO/IOC Project Office for IODE.
Python Driven Sensor Observation Service Benjamin Welton NASA USRP.
TRIG: Truckee River Info Gateway Dave Waetjen Graduate Student in Geography Information Center for the Environement (ICE) University of California, Davis.
3.2) Data sharing and dissemination Data Sharing between OBIS-SEAMAP, OBIS and GBIF.
MIKADO – Generation of ISO – SeaDataNet metadata files
New features in KE EMu 3.1 and beyond
Flanders Marine Institute (VLIZ)
Relational Databases.
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
GLOBAL BIODIVERSITY INFORMATION FACILITY
Populating a Data Warehouse
The Open Fiscal Data Package
Overview EMODnet Biology Portal Standards used Web services available
21 November Data Science Capabilities
Challenge Grant Update
HOW (and why?) DO WE DESCRIBE ?
Presentation transcript:

OBIS Data flows Dave Watts 8 March 2017 Data Centre, O&A

Outline The OBIS network Tools to publish data Data gaps Interaction with other networks Future bits ODIP II Workshop3 - OBIS Data Flows | Dave Watts

The network ODIP II Workshop3 - OBIS Data Flows | Dave Watts

The data flow structure OBIS-AU ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Biological data (first attempt) Distributed Generic Information Retrieval (DiGIR) Very old application built circa 2003 in PHP. To deliver species occurrence data from COML to OBIS/GBIF Delivers data in DwC xml format, via query Performance fine up to 50,000 records but awful after that Six million records from Poland to Copenhagen took 24 hours To be fair, servers etc not as fast as now ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Biological data (second attempt) Integrated Publishing Toolkit (IPT) Started prototype circa 2008 Delivers data in DwC tagged csv via a datafile download Performance fine - upto millions of records. GBIF has 715 million. Connects to any database or a csv import file. Crossmatch to DwC vocabs for export. ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Data standards Darwin core DwC – http://rs.tdwg.org/dwc/terms/ vocabs with definitions, examples, suggested values EML - Ecological Metadata Language (EML) is a metadata specification developed by the ecology discipline and for the ecology discipline. Developed into IPT circa 2009 very human readable! ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Key elements in DwC To publish to OBIS, the following are expected scientificnameId – should hold WoRMS LSID of taxa – allows verification of data providers species name e.g Wandering Albatross urn:lsid:marinespecies.org:taxname:212583 Other LSIDS can be used e.g. from Australian Faunal Directory occurrenceStatus – values of ‘present’ or ‘absent’ occurrenceId – unique value within an IPT resource and needed in links to the EventCore data. ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Biological data -IPT ODIP II Workshop3 - OBIS Data Flows | Dave Watts

IPT – Matching to TDWG DwC vocabs ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Biological data -IPT Pros Cons scalable - limited only by file size matches to vocabs in a very robust and friendly manner if using a database, can support SQL filter on table – reduce use of views single zip containing all data, metadata (EML) data versioning For OBIS, backbone taxonomy is WoRMS Limited impact of data provider’s servers Extensible by downloading new schemas Cons Custodian must actively ‘publish’ if new data or revisions Only CSV data ODIP II Workshop3 - OBIS Data Flows | Dave Watts

OBIS-ENV-DATA project Purpose: to add environmental and other context data to DwC data Designed to deal with CTD casts, trawl events and related catch composition, existing species occurrence records with environmental measurements, e.t.c. ODIP II Workshop3 - OBIS Data Flows | Dave Watts

OBIS-ENV-DATA project ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Existing OBIS services OGC Geoserver instance http://www.iobis.org/geoserver Two layers - OBIS:drs_with_woa, OBIS:points_ex R packages https://github.com/iobis/robis - occurrence records and mapping - species checklist ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Current data – by year ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Current data – by depth Number of sampling days per depth volume ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Why an aggregator? Queensland Museum Porifera (aka sponges) ODIP II Workshop3 - OBIS Data Flows | Dave Watts

GBIF the elephant in the room marine data marked as 'marine, harvested by iOBIS' OBIS Tier 2 OBISAU IPT all data if registered Data exchange by csv upload Data providers (mainly OZCAM) ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Where to for OBIS Near real-time data loading and data quality feedback Ability to handle the ENV data model Active API development Perhaps fossil records (land-based data, sediments - forams) Perhaps private data (e.g. sensitive) Need deep water records Need BNJ records Need contemporary records ODIP II Workshop3 - OBIS Data Flows | Dave Watts

Questions Oceans and Atmosphere / Data Centre Dave Watts Node manager OBIS Australia t +61 3 6232 5062 e dave.watts@csiro.au w www.obis.org.au O&A Data Centre