Biological and Chemical Oceanography Data Management Office slide 1 of 21 Interoperability ~ An Introduction Cyndy Chandler Biological and Chemical Oceanography.

Slides:



Advertisements
Similar presentations
Data Provenance and Attribution for Published Datasets The Challenge and the reality check April 9-10, 2009 National Academy of Sciences, Woods Hole, MA.
Advertisements

Rolling Deck to Repository: Transforming the United States Academic Fleet Into an Integrated Global Observing System Suzanne M. Carbotte, Robert Arko,
EInfrastructures (Internet and Grids) US Resource Centers Perspective: implementation and execution challenges Alan Blatecky Executive Director SDSC.
1 Publishing Linked Sensor Data Semantic Sensor Networks Workshop 2010 In conjunction with the 9th International Semantic Web Conference (ISWC 2010), 7-11.
Jennifer A. Dunne Santa Fe Institute Pacific Ecoinformatics & Computational Ecology Lab Rich William, Neo Martinez, et al. Challenges.
Evolving the BCO-DMO search interface - experience with semantic and smart search Cyndy Chandler (WHOI) Peter Fox (RPI and WHOI) Robert Groman, Dicky Allison.
McGuinness – Microsoft eScience – December 8, Semantically-Enabled Science Informatics: With Supporting Knowledge Provenance and Evolution Infrastructure.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Biological and Chemical Oceanography Data Management Office 1 of 12 An Introduction to the Biological and Chemical Oceanography Data Management Office.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Ontology Semantic Mediation in the Big Picture MMI Workshop - August 2005.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
1 Shifting the Burden from the User to the Data Provider Peter Fox High Altitude Observatory, NCAR (***) With thanks to eGY and various NSF, DoE and NASA.
SCIENCE-DRIVEN INFORMATICS FOR PCORI PPRN Kristen Anton UNC Chapel Hill/ White River Computing Dan Crichton White River Computing February 3, 2014.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Peter Fox CSCI Week 9, October 27, 2008.
Data Management Practices: BCO-DMO’s Successes and Challenges Bob Groman BCO-DMO Woods Hole Oceanographic Institution NERACOOS/NeCODP Data Management Workshop.
Metadata Guides for Smarties Marine Metadata Initiative URL:
1 Class Exercise I: Use Cases Deborah McGuinness and Peter Fox (NCAR) CSCI Week 4 (part II), 2008.
An approach to Intelligent Information Fusion in Sensor Saturated Urban Environments Charalampos Doulaverakis Centre for Research and Technology Hellas.
The Marine Metadata Interoperability Project A Model for Community Collaboration September 23, 2010 Nan Galbraith WHOI.
Fox OOS meeting 1 Ontologies and Semantic Applications in Earth Sciences Peter Fox (TWC/RPI; formerly HAO/NCAR) Thanks to many. Projects funded.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Joanne Luciano With Peter Fox and Li Ding CSCI Week 10, November.
What has been lacking, until recently, is a successful method to develop, implement and sustain informatics solutions to modern application problems, such.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
U.S. Department of the Interior U.S. Geological Survey A vision for a global community Linda Gundersen Director Science Quality and Integrity US Geological.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Semantically-Enabled Science Data Integration (SESDI) and The Virtual Solar-Terrestrial Observatory (VSTO) Semantically-enabled (large-scale) Scientific.
Local global disambiguation of terms and concepts The BCO-DMO metadata database uses controlled vocabularies to record many of the important pieces of.
SIF Status to ADC Co-Chairs Siri Jodha S. Khalsa Steve Browdy.
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
1. Data providers deliver metadata records that describe their datasets through OGC catalogue services for the web. Each metadata record uses keyword concepts.
Biological and Chemical Oceanography Data Management Office slide 1 of 19 CAMEO Data Management Bob Groman Biological and Chemical Oceanography Data Management.
A Prototype Ontology Tool and Interface for Coastal Atlas Interoperability Dawn J. Wright 1, Luiz Bermudez 2 (presenter), Liz O’Dea 3, Yassine Lassoued.
The VIRTUAL SOLAR-TERRESTRIAL OBSERVATORY - Exploring paradigms for interdisciplinary data-driven science Peter Fox 1 Don Middleton 2,
Marine Metadata Interoperability - Web Services Marine scientists face an opportunity and a challenge in the volume of data available from various ocean.
1 Implementing QA/QC Standards for In Situ Ocean Sensors Using OGC-Sensor Web Enablement a.k.a. QARTOD to OGC a.k.a. Q2O Project Status Brief to NOAA IOOS.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Public Access and Spatial Metadata Values: Semantic Network Services Response to EU Directives Maria Rüther Federal Environment Agency,
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
 Key integrating concepts  Groups  Formal Community Groups  Ad-hoc special purpose/ interest groups  Fine-grained access control and membership 
Marine Metadata Interoperability Acknowledgements Ongoing funding for this project is provided by the National Science Foundation.
Djc -1 Daniel J. Crichton NASA/JPL 9 May 2006 CCSDS Information Architecture Working Group.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
OOI Cyberinfrastructure and Semantics OOI CI Architecture & Design Team UCSD/Calit2 Ocean Observing Systems Semantic Interoperability Workshop, November.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Project number: ENVRI and the Grid Wouter Los 20/02/20161.
Social and Personal Factors in Semantic Infusion Projects Patrick West 1 Peter Fox 1 Deborah McGuinness 1,2
ISWG / SIF / GEOSS OOSSIW - November, 2008 GEOSS “Interoperability” Steven F. Browdy (ISWG, SIF, SCC)
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
The AstroGrid-D Information Service Stellaris A central grid component to store, manage and transform metadata - and connect to the VO!
Biological and Chemical Oceanography Data Management Office slide 1 of 10 U.S. GEOTRACES Data Management Cyndy Chandler BCO-DMO ~ WHOI 23 September 2008.
Biological and Chemical Oceanography Data Management Office slide 1 of 22 Introduction to Data Management for Ocean Science Research Cyndy Chandler Biological.
ISWG / SIF / GEOSS OOS - August, 2008 GEOSS Interoperability Steven F. Browdy (ISWG, SIF, SCC)
Biological and Chemical Oceanography Data Management Office slide 1 of 10 The Biological and Chemical Oceanography Data Management Office (BCO-DMO) Cyndy.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
The Semantic eScience Framework AGU FM10 IN22A-02 Deborah McGuinness and Peter Fox (RPI) Tetherless World Constellation.
Ontology and Application for Reusable Search Interface Design Plans for Advanced Semantic Technologies Final Project Eric Rozell, Tetherless World Constellation.
EarthCube Integrative Activities: Cross-Domain Observational Metadata Environmental Sensing Network (X-DOMES) September 2015 – August 2017 Focusing on.
Botts – August 2004 Sensor Web Enablement Sensor Web Enablement WG (SWE-WG)
Geospatial and Problem Specific Semantics Danielle Forsyth, CEO and Co-Founder Thetus Corporation 20 June, 2006.
CMSP / OCM Vocabulary Services rpi
OPeNDAP: Accessing Data in a Distributed, Heterogeneous Environment
Statistical Information Technology
Presentation transcript:

Biological and Chemical Oceanography Data Management Office slide 1 of 21 Interoperability ~ An Introduction Cyndy Chandler Biological and Chemical Oceanography Data Management Office (BCO-DMO) Woods Hole Oceanographic Institution 26 July 2008 OOS Interoperability Planning Workshop National Space Science and Technology Center University of Alabama at Huntsville, AL USA

Biological and Chemical Oceanography Data Management Office slide 2 of 21 Discussion Points What do we mean by “interoperability”? Why does it matter? How do we achieve interoperability?  What are some of the expected challenges and effective strategies?

Biological and Chemical Oceanography Data Management Office slide 3 of 21

Biological and Chemical Oceanography Data Management Office slide 4 of 21 What do we mean by “interoperability”? the ability to exchange data and information between two or more systems (separated by a recognized boundary) with minimal loss of information  clients of interoperable systems may be machines syntactic interoperability  file formats  data structures  exchange protocols semantic interoperability  term definitions (controlled vocabularies)

Biological and Chemical Oceanography Data Management Office slide 5 of 21 Interoperability... a wee bit more from MMI a client should not be required to possess in depth understanding of the data in order to access them interoperable systems should be designed to support machine access (not just people clients) interoperable metadata systems should be designed to support automated, accurate, lossless, machine-to- machine exchange of information

Biological and Chemical Oceanography Data Management Office slide 6 of 21 Why does interoperability matter? improved open access to public data  enabling data mining, cataloging systems, portals, mash-ups ecosystem researchers are asking global questions model developers need access to data from a variety of domains (biology, chemistry, etc and social and economic data too)  we must rise to the challenge of designing interoperable data systems that are up to these tasks

Biological and Chemical Oceanography Data Management Office slide 7 of 21 How do we achieve interoperability? What are some of the expected challenges? varied funding sources and governing agencies projects with different priorities and leadership distributed data systems large volume, heterogeneous data expectation of near real-time availability

Biological and Chemical Oceanography Data Management Office slide 8 of 21 adopt standards use controlled vocabularies publish metadata databases use Semantic Web technologies build community What are some of the effective strategies? *

Biological and Chemical Oceanography Data Management Office slide 9 of 21 This is going to be difficult... best solutions will be found through community recognition of local implementations Baker & Chandler, in press. publication expected in DSR II before end of Title: Enabling Long-Term Oceanographic Research: Changing Data Practices, Information Management Strategies and Informatics online: final draft complete: 9-JUL-2008http://dx.doi.org/ /j.dsr

Biological and Chemical Oceanography Data Management Office slide 10 of 21 The data … CMarZ GEOTRACES Iron Synthesis NACP OCB US GLOBEC US JGOFS US SOLAS controlled vocabularies community building metadata standards *

Biological and Chemical Oceanography Data Management Office slide 11 of 21 Ocean Observatory Data … expecting large volume, heterogeneous data to be made available in near real time in the past, large volume data sets tended to be more homogeneous (e.g. fewer columns); while data sets with many columns tended to be smaller in size HomogeneousHeterogeneous Sensor dataManual observations PhysicalBiology, Chemistry Data from ocean observatory sensor arrays are/will be a continuous stream of large volume, heterogeneous data.

Biological and Chemical Oceanography Data Management Office slide 12 of 21 Ocean Observatory Data … attempting to reduce this challenge by  identifying 20 IOOS core variables  recognizing the importance of metadata  identifying common use cases  adopting common standards (OGC and SWE)  fostering community-wide involvement and communications  sponsoring mysterious, after-dark gatherings in Huntsville, AL providing little more information beyond street address and suggested footwear... *

Biological and Chemical Oceanography Data Management Office slide 13 of 21 Semantic Web Semantic Web technologies offer one solution set next four slides are courtesy of Peter Fox (HAO/ESSL/NCAR) and are from an April 2008 presentation

Biological and Chemical Oceanography Data Management Office slide 14 of 21 Semantic Web Basics The triple: {subject-predicate-object} Interferometer is-a optical instrument Optical instrument has focal length An ontology is a representation of this knowledge W3C is the primary (but not sole) governing organization for languages, specifications, best practices, etc.  RDF - Resource Description Framework  OWL Ontology Web Language (OWL 1.1 on the way) Encode the knowledge in triples, in a triple-store, software is built to traverse the semantic network, it can be queried or reasoned upon Put semantics between/ in your interfaces, i.e. between layers and components in your architecture, i.e. between ‘users’ and ‘information’ to mediate the exchange (P. Fox, 2008)

Biological and Chemical Oceanography Data Management Office slide 15 of 21 (P.Fox, 2008) … … VO Portal Web Serv. VO API DB 2 DB 3 DB n DB 1 Semantic mediation layer - VSTO - low level Semantic mediation layer - mid-upper-level Education, clearinghouses, other services, disciplines, etc. Metadata, schema, data Query, access and use of data Semantic query, hypothesis and inference Semantic interoperability Added value Mediation Layer Ontology - capturing concepts of Parameters, Instruments, Date/Time, Data Product (and associated classes, properties) and Service Classes Maps queries to underlying data Generates access requests for metadata, data Allows queries, reasoning, analysis, new hypothesis generation, testing, explanation, etc. *

Biological and Chemical Oceanography Data Management Office slide 16 of 21 Semantic Web Methodology and Technology Development Process Establish and improve a well-defined methodology vision for Semantic Technology based application development Leverage controlled vocabularies, etc. Use Case Small Team, mixed skills Analysis Adopt Technology Approach Leverage Technology Infrastructure Rapid Prototype Open World: Evolve, Iterate, Redesign, Redeploy Use Tools Science/Expert Review & Iteration Develop model/ontology (P. Fox, 2008)

Biological and Chemical Oceanography Data Management Office slide 17 of 21 (P. Fox, 2008) The Information Era: Interoperability managing and accessing large data sets higher space/time resolution capabilities rapid response requirements data assimilation into models crossing disciplinary boundaries. Modern information and communications technologies are creating an “interoperable” information era in which ready access to data and information can be truly universal. Open access to data and services enables us to meet the new challenges of understanding the Earth and its space environment as a complex system:

Biological and Chemical Oceanography Data Management Office slide 18 of 21 Semantic Web technologies being used in some of Peter Fox’s informatics projects: Semantically-Enabled Science Data Integration (SESDI): Semantic Knowledge Integration Framework (SKIF): Semantic Web for Earth and Environmental Terminology (SWEET): SWEET is a JPL project to provide a common semantic framework for a variety of Earth science initiatives.

Biological and Chemical Oceanography Data Management Office slide 19 of 21 more challenges... reality check: insufficient metadata … researcher aboard R/V Oceanus, August 2008, talking about metadata for shipboard sampling Data originators recognize that recording metadata is a good idea, and they will even record metadata ~ if we make it easy for them to do so. * "I feel like I'm so busy with all the other little things in a cruise that I don't have time to worry about making up log sheets. But if someone puts an already made log sheet in front of me, I'll use it." "When I see projects with good metadata, it just makes my heart go pitter patter."

Biological and Chemical Oceanography Data Management Office slide 20 of 21 more challenges... reality check: heterogeneous data sets … The BCO-DMO database includes many data sets in which depth and temperature are reported and more than 25 different definitions of each.

Biological and Chemical Oceanography Data Management Office slide 21 of 21 * Community building Share knowledge gained doing implementation at the local level – participate in workshops and publish Fall 2008 AGU Meeting in San Francisco, CA 8-12 December; Earth and Space Science Informatics IN12: Strategies for Improved Marine and Synergistic Data Access and Interoperability IN19: From Data to Synthesis: Next-Generation Science Applications Abstract Submission Deadline: 10 September 2359 UT Thank you...