SONet (Scientific Observations Network) and OBOE (Extensible Observation Ontology): Mark Schildhauer, Director of Computing National Center for Ecological.

Presentation transcript:

SONet (Scientific Observations Network) and OBOE (Extensible Observation Ontology): Facilitating data interoperability within the environmental and ecological sciences through advanced semantic approaches
Mark Schildhauer, Director of Computing, National Center for Ecological Analysis and Synthesis, Univ. of California, Santa Barbara
TDWG 2008, Fremantle AU, Oct

Motivation: An oncoming deluge of ecological data…

Motivation: And locating desired information is already quite difficult… Why is this?

Motivation
Ecological data are highly heterogeneous:
- Variable syntax (csv, xls), structures (tables, rasters, hierarchical), and semantics (terminology, units, methods)
- Derived from many disciplines: genomic, cellular, physiology, morphology, biodiversity, populations, communities, ecosystems
- Need for abiotic data too: hydrology, geospatial, climatology

Our Semantic Approach
- Climbing the semantic ladder: Ontologies, Semantic Annotations, Metadata, Data

Our Semantic Approach
- Method for linking elements of data objects (e.g., columns in a table) to consistent and potentially rich sets of concepts
- Semantic annotations link EML attributes to concepts defined in a formal ontology
- Store and retrieve annotations and ontologies in Metacat
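The linking idea on this slide can be illustrated with a minimal Python sketch. It assumes an annotation is simply a record tying one data attribute (a table column) to one ontology concept. The dataset identifiers, column names, and `ex:` concept names are hypothetical, and the in-memory list stands in for a real store; it does not represent the Metacat or EML interfaces.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    dataset_id: str  # hypothetical identifier of a metadata-described dataset
    attribute: str   # column name as it appears in the data table
    concept: str     # hypothetical ontology concept identifier


# Illustrative annotations: two datasets name the same concept differently.
ANNOTATIONS = [
    Annotation("plots-2007", "ht_cm", "ex:TreeHeight"),
    Annotation("plots-2007", "spp", "ex:TaxonName"),
    Annotation("survey-a", "height", "ex:TreeHeight"),
]


def attributes_for_concept(annotations, concept):
    """Return (dataset_id, attribute) pairs annotated with the given concept."""
    return [(a.dataset_id, a.attribute) for a in annotations if a.concept == concept]


matches = attributes_for_concept(ANNOTATIONS, "ex:TreeHeight")
```

Because both columns point at the same concept, a lookup by concept finds them even though their column names (`ht_cm`, `height`) differ.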

Document Relationships (semantic annotation)

OBOE Quick Overview
- Extensible Observation Ontology (OBOE)
- Based on the assumption that much of scientific data consists of observations
- OBOE provides a high-level abstraction of scientific observations and measurements
- Enables data (or metadata) structures to be linked to domain-specific ontology concepts

OBOE: Extensible Observation Ontology (slide from Josh Madin)

Observation-Based Structured Query
- Both datasets contain “tree lengths”
- An annotation search for “tree length” would return both datasets
- A structured search allows the search to be limited by the observed entity (e.g., a tree or a tree branch)
- Increases precision and recall
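The difference between the two kinds of search can be sketched in Python. This is a minimal illustration, assuming each annotated column records the observed entity and the measured characteristic separately. The catalog entries, field names, and function names are hypothetical and are not taken from the OBOE or Metacat software.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AnnotatedColumn:
    dataset: str         # hypothetical dataset name
    column: str          # column name in the data table
    entity: str          # what was observed (assumed field)
    characteristic: str  # what was measured about it (assumed field)
    label: str           # free-text label used by the annotation search


# Two illustrative datasets that both describe a column as "tree length".
CATALOG = [
    AnnotatedColumn("dataset-1", "len", "Tree", "Length", "tree length"),
    AnnotatedColumn("dataset-2", "len", "TreeBranch", "Length", "tree length"),
]


def annotation_search(catalog, phrase):
    """Match on the label text only; the observed entity is ignored."""
    return sorted({c.dataset for c in catalog if phrase.lower() in c.label.lower()})


def structured_search(catalog, entity, characteristic):
    """Match only columns measuring the characteristic of the given entity."""
    return sorted(
        {
            c.dataset
            for c in catalog
            if c.entity == entity and c.characteristic == characteristic
        }
    )


both = annotation_search(CATALOG, "tree length")
branches_only = structured_search(CATALOG, "TreeBranch", "Length")
```

Here the label search returns both datasets, while the structured search for the length of a tree branch returns only the second one.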

Emerging Observational Data Models

SONet: A Community-Driven Scientific Observations Network to achieve Semantic Interoperability of Environmental and Ecological Data
Project Organizers: Mark Schildhauer (1), Shawn Bowers (2), Corinna Gries (3), Deborah McGuinness (4), Philip Dibner (5), Josh Madin (6), Matt Jones (1), Luis Bermudez (7), John Graybeal (7)
Affiliations: (1) NCEAS, UC Santa Barbara; (2) UC Davis Genome Center; (3) CAP/LTER and Univ. of Arizona; (4) McGuinness Associates; (5) OGC Interoperability Institute; (6) Macquarie University; (7) Monterey Bay Aquarium Research Institute

Motivation
- MANY different “semantic” efforts are underway in the earth/biodiversity/environmental sciences, all converging on use of an OBSERVATIONAL data construct
- SPECIALIZED needs and concerns of different domains may drive semantic technology solutions to be diverse and incompatible
- OPPORTUNITY exists for communicating and coordinating among different domains to achieve greater interoperability of emerging semantic technology solutions
- BENEFIT is providing cross-disciplinary scientists with more seamless and powerful access to a broad range of relevant data and information

Objectives of SONet
Broad Objectives
- Address semantic interoperability issues in environmental and ecological data [sharing, discovery, integration]
- Build a network of practitioners (SONet), including domain scientists, computer scientists, and information managers
- Build generic, cross-disciplinary data interoperability solutions
Immediate Goals to Develop
- An extensible and open observations data model to unify existing domain-specific approaches
- A semantic (ontology) framework for scientific terminology, and corresponding domain extensions
- Demonstration prototypes using these to address current interoperability issues

Working Groups
Subgroup 1: Core Data Model for Observations
- Collect interoperability requirements
- Define common, unified data model
- Engage tool & data providers, data consumers
Subgroup 2: Catalog of Common Field Observations
- Identify and catalog common observation types (semantics)
- Engage data providers and information managers
Subgroup 3: Scientist-Oriented Term Organization
- Define general extension ontologies of scientific terms
- Focus work on outputs of group 2
- Engage range of domain scientists
Subgroup 4: Demonstration Projects
- Define and prototype demonstration projects
- Ensure compatibility of subgroups
Core SONet Team
- Each group has two team leads
- Postdoc funded to work on demonstration projects & help ensure compatibility across subgroups

Workshops & Outreach
Community workshops
… to bring together project members, data managers, domain scientists, computer scientists, and members of the larger environmental informatics community
- Workshop 1: Collect detailed requirements and use cases for each SONet subgroup
- Workshop 2: Refine and extend use cases; discuss and evaluate proposed data models and representations
- Workshop 3: Present and discuss refined data models and representations; early evaluation and feedback
- Workshop 4: Training; discuss and plan SONet sustainability
… continue from prior NSF workshop on observation data models
… approximately participants at each workshop

Initial Project Timeline
Project has just recently officially started
Workshops and meetings:
- Year 1: first community workshop, project meeting
- Year 2: second community workshop, project meeting
- Year 3: last two community workshops, including training
Timeline chart (Year 1, Year 2, Year 3), events:
- Project Leaders Meeting (1): orientation & planning
- Project Leaders Meeting (2): evaluation & planning
- Community Workshop (1): requirements & use cases
- Community Workshop (2): use cases & modeling
- Community Workshop (3): modeling & refinement
- Community Workshop (4): training, sustainability
Timeline chart, activities in order:
- set up project mgmt. infrastructure, postdoc hiring
- finalize community participants, meeting preparation
- document results, begin implementation & interoperability tests, set up network website
- document results, continue impl. & interop. tests
- continue impl. & interop. tests, meeting preparation
- finalize impl. & interop. tests, sustainability planning
- document results, execute plan for sustainability

Observation standards for review

Opportunity for Collaboration
TDWG community interests and SONet?
- Observations and Specimen Records Interest Group (OSR); Observations Task Group
- Contact: Steve Kelling (OSR); Matt Jones (Observations Task Group)
- Biological Descriptions Interest Group (SDD)