Harmonizing Measurements for Marine Biodiversity Observation Networks

Slides:



Advertisements
Similar presentations
Chapter 8: Capacity Building Presented by Co Chair Mr. John Briceño CCAD Central American Countries 11/29/2003.
Advertisements

SONet (Scientific Observations Network) and OBOE (Extensible Observation Ontology): Mark Schildhauer, Director of Computing National Center for Ecological.
Review of approach 24 March 2015
Building the LTER Network Information System. NIS History, Then and Now YearMilestone 1993 – 1996NIS vision formed by Information Managers (IMs) and LTER.
Codex Guidelines for the Application of HACCP
IDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF ).
SONet: Scientific Observations Network Semtools: Semantic Enhancements for Ecological Data Management Mark Schildhauer, Matt Jones, Shawn Bowers, Huiping.
Controlled Vocabulary Working Group PRESENTED BY JOHN PORTER.
Standards and tools for publishing biodiversity data Yu-Huang Wang June 25, 2012.
Growing challenges for biodiversity informatics Utility of observational data models Multiple communities within the earth and biological sciences are.
Definition of an Observation In general, an observation represents the measurement of some attribute, of some thing, at a particular time and place. Observations.
Suggestions for the GEOSS Workplan C. Waldmann and R. Huber MARUM, Germany.
DataONE: Preserving Data and Enabling Data-Intensive Biological and Environmental Research Bob Cook Environmental Sciences Division Oak Ridge National.
LTER Data Management Margaret O’Brien Santa Barbara Coastal Long Term Ecological Research (LTER) Project Santa Barbara Channel Biodiversity Observation.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Slide: 1 CEOS SIT Technical Workshop |Caltech, Pasadena, California, USA| September 2013 CEOS Work Plan Section 6.1 G Dyke CEOS ad hoc Working Group.
The US Long Term Ecological Research (LTER) Network: Site and Network Level Information Management Kristin Vanderbilt Department of Biology University.
Acronym Soup GBIF, TDWG & GUIDs Jerry Cooper. Global Biodiversity Information Facility (GBIF) Established in 2000 through non-binding MOU (25 countries.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal Natasha Noy Stanford University.
GEOBON Report of the breakout session. Challenge to GEO The global biodiversity community is coming together because of GEO bringing global tasks and.
IABIN Standards & Protocols Presented by: Mike Frame, USGS NBII Developed by Darrell McClarty IABIN Regional Coordinator.
The International Ocean Colour Coordinating Group International Network for Sensor Inter- comparison and Uncertainty assessment for Ocean Color Radiometry.
EcoGrid in SEEK A Data Grid System for Ecology Bertram Ludaescher University of California, Davis Arcot Rajasekar San Diego Supercomputer Center, University.
Data Foundations And Terminology (DFT) IG Virtual Meeting July 6 th 2016 Co-Chairs DFT IG :Gary Berg-Cross & Raphael Ritz P8 Sessions DFT IG Breakout Session.
Introduction to Data Management Arllet M. Portugal Integrated Breeding Platform Breeding Management System Intensive Workshop on Data Management Jan. 26,
Committee on Earth Observation Satellites
Strategies for NIS Development
Model Discovery through Metalearning
EMODnet Biology Work Package 2
CEOS Carbon Strategy – WGClimate Actions
GEO, Blue Planet and MBON
Network Information System Advisory Committee (NISAC)
WG Research Data Collections RDA P10 Montréal – September 2017
DataNet Collaboration
Essential Biodiversity Variables: towards an agreement on a common approach for biodiversity Rob Jongman, Wageningen UR Henrique Pereira, University of.
Persistent Identifiers Implementation in EOSDIS
Tomas Kliment Junior Researcher Italian National Research Council
Flanders Marine Institute (VLIZ)
Mapping the Network Landscape Ivette Serral
Programme Board 6th Meeting May 2017 Craig Larlee
Capacity Building Enhance the coordination of efforts to strengthen individual, institutional and infrastructure capacities, particularly in developing.
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,
DA Task Report Data Integration and Analysis System
GLOBAL BIODIVERSITY INFORMATION FACILITY
Citizen Science’s contribution to GEO BON
Christian Ansorge Arona, 09/04/2014
Regional experiences, case of the Mediterranean Sea
RDA/TDWG Metadata Standards for Attribution of Physical and Digital Collections Stewardship Anne E Thessen, Matt Woodburn, Dimitris Koureas 21 Sept, 2017/Montreal,
Oceans and Society: Blue Planet
Status of Carbon Action Items
Data Management: Documentation & Metadata
Site classifications, definitions, and updates to Landnet
Cyber-Infrastructure for Marine Biodiversity Data
Carbon Actions for WGCV
An ecosystem of contributions
WG Research Data Collections An overview of the recommendation
OBI – Standard Semantic
LSI-VC Work Plan Updates
Existing Designs and Prototypes at RPI
GODAE Quality Control Pilot Project
Open Science: the crucial importance of metadata
Benthic systems: Unvegetated Sediments
Bird of Feather Session
Biodiversity break out session 7th June 2017
EMEP Monitoring strategy
Framework for Ocean Observing
EMEP Monitoring strategy
Committee on Earth Observation Satellites
Presentation transcript:

Harmonizing Measurements for Marine Biodiversity Observation Networks Margaret O’Brien Santa Barbara Channel MBON http://sbc.marinebon.org ESIP, Winter 2017

MBON Goals

MBON Data Processing

MBON Data Processing

GOOS Timeline Timeline from Group on Ocean Observing.

MBON Partnerships - example Source: Jennifer Brown - MBNMS, CINMS

CINMS Data Needs - Draft Source: Jennifer Brown - MBNMS, CINMS

Data in Preparation

DM Workshop, 2016 UCSB GOALS Understand the needs of potential MBON data users Initiate a coordinated approach for the three demonstration MBONs

DM Workshop, 2016 UCSB If vocabularies are complete, structured, well known and broadly used, then recommendations for their adoption can be handled. Current vocabulary efforts are conceptual and need to be fully operationalized Accommodate primary observations (e.g., organism spatial abundance) and derived variables (e.g., indices of evenness, dominance, diversity) Unambiguous Meet needs of all MBON data Structure, content Basic approach for the MBON community should be to adopt one or more existing vocabularies as is possible, and to augment those with missing terms or contribute definitions using established.

Adopt one or more existing vocabularies as is possible DM Workshop, 2016 UCSB Basic approach: Adopt one or more existing vocabularies as is possible Augment those with missing terms or contribute definitions using established. Process is likely to be complex because the biological data to which these variables must apply are often hand-collected, ad hoc and idiosyncratic Basic approach for the MBON community should be to adopt one or more existing vocabularies as is possible, and to augment those with missing terms or contribute definitions using established.

DM Workshop, 2016 UCSB Assemble existing vocabularies which could be applied to MBON data Examine and evaluate candidate vocabularies for MBON use Adopt groups of terms deemed appropriate and adequate Suggest additions to vocabularies which are incomplete Outline additional work and funding needs, if appropriate

Biodiversity Variables Essential Biodiversity Variables http://geobon.org/essential-biodiversity-variables/ebv-classes-2/ GOOS http://goosocean.org/index.php?option=com_content&view=article&id=79&Itemid=273 IOOC http://www.iooc.us/activities/biological-integration-observation-task-team/

Measurement Vocabularies Darwin Core (DwC) IndividualCount, OrganismQuantity, OrganismQuantityType (Occurrence) Extensions, e.g., “Fish Abundance” http://rs.gbif.org/sandbox/extension/mbg-fish-abundance.xml CF Conventions Mostly physical, a few related to biomass BODC Parameter codes Taxonomic Database Working Group (TDWG) http://www.tdwg.org/activities/osr/ The status of vocabularies for diversity-related variables can be compared to that for physical measurements from instrument data, where a considerable number of fairly formal descriptions are available (e.g., CF Conventions) and communities have well-established recommendations and processes for new contributions. biodiversity measurements will require formal measurement descriptions at least as complex as are used by the CF Conventions community.

Ontologies Population and Communities Ontology (PCO) http://www.ontobee.org/ontology/PCO Phenotypic Traits Ontology (PATO) http://www.ontobee.org/ontology/PATO Ecosystem Ontology (ECSO) http://bioportal.bioontology.org/ontologies/ECSO Ontlolgies are a step more complex than controlled lists, with structure that allows machine processing. But along with that, you get a class and CONTEXT structure that means there are other terms that can now be associated with that dataset, not just the measurement. That advantage means they are something we should explore using. There are some ontolgies out there now that may be able to help us: An ontology called “PCO”…. PATO …. ECSO is one that I am working on, with several from a much broader ontology and repository community. Introduce that.

Ecosystem Ontology (ECSO) Imports: PATO – phenotypic traits ENVO – environment, context CHEBI – elements, chemicals UO – units, dimensions Using OBO Foundry recommendations and practices ecso supported by d1, for data discovery at the measurement level (other efforts working at the higher levels, eg, GeoLink). examined many existing ontologies before proceeding. IMPORTS: components of several stable ontlogies. Created an ID system that can be permanent, plan for updates. Complex process. So started with one class of data, Carbon Cycling. Limited number of datasets (several 100s), but still a corpus that was complex enough to expose potential problems. LTER data was a major use case. Also modeling data from the MSTIMIP (model intercomparison project)

Data Diversity http://portal.lternet.edu knb-lter-hfr.103.27 knb-lter-sbc.37.4 Methods vary widely scope (organism, community, ecosystem), scale (temporal and spatial). Why would we go to all this trouble? Above, a satellite image depicting NPP values from the Harvard Forest (image from ORNL DAAC Below, a chamber for measuring in situ NPP in a benthic algal community at the Santa Barbara Coastal LTER. These would both use the same STANDARD NAME – that is, net_primary_production. But is that enough for a user to be able to tell them apart? Both datasets have values for “net_primary_production”, with rich metadata & units of “mass per area per time”

Observation Model https://github.com/NCEAS/oboe/ Measurement based Entity ENVO, CHEBI Characteristic PATO, OBOE Standard UO Protocol Precision Built on a basic observational model. Extensions. Model is compatible with other high level observation models like O&M The imported ontologies

Local Dictionary -> EML Metadata Cut to the chase: can put the ECSO measurement ID directly into metadata. See poster for more on dataset annotations, testing and implementation

ECSO - Biomass Some ECSO classes are ready for use in population studies Plus, it is mature enough to add another group of measurements. Looking for a candidate. https://github.com/DataONEorg/sem-prov-ontologies/

Input, Discussion Assemble existing vocabularies which could be applied to MBON data Examine and evaluate candidate vocabularies for MBON use Adopt groups of terms deemed appropriate and adequate Suggest additions to vocabularies which are incomplete Outline additional work and funding needs, if appropriate Back to the list of tasks.