Evolving the BCO-DMO search interface - experience with semantic and smart search Cyndy Chandler (WHOI) Peter Fox (RPI and WHOI) Robert Groman, Dicky Allison.

Slides:



Advertisements
Similar presentations
Visualizing Fitness for Purpose Bob Groman and Dicky Allison Biological and Chemical Oceanography Data Management Office Woods Hole Oceanographic Institution.
Advertisements

Complexity must become Linear or Decrease Smart data infrastructure: The sixth generation of mediation for data science Peter Fox 1
A Framework for Earth Science Search Interface Development Designing and Implementing S2S Eric Rozell, Tetherless World Constellation, RPI.
DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( ), Yu Chen 1 Patrick West.
Ontology and Application for Reusable Search Interface Design Plans for Advanced Semantic Technologies Final Project Eric Rozell, Tetherless World Constellation.
McGuinness – Microsoft eScience – December 8, Semantically-Enabled Science Informatics: With Supporting Knowledge Provenance and Evolution Infrastructure.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Biological and Chemical Oceanography Data Management Office 1 of 12 An Introduction to the Biological and Chemical Oceanography Data Management Office.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Citation and Recognition of contributions using Semantic Provenance Knowledge Captured in the OPeNDAP Software Framework Patrick West 1
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
Scientific Knowledge Discovery in Complex Semantic Networks of Geophysical Systems (no pressure…) EGU2012, NP2.6 April 25, 2012, Vienna, Austria Peter.
ToolMatch: Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Patrick West 1 Nancy Hoebelheinrich.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Bringing Data Science, Xinformatics and Semantic eScience into the Graduate Curriculum (solicited) EGU (EOS 6/ ESSI2.3) April 25, 2012, Vienna.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Peter Fox CSCI Week 9, October 27, 2008.
Data Management Practices: BCO-DMO’s Successes and Challenges Bob Groman BCO-DMO Woods Hole Oceanographic Institution NERACOOS/NeCODP Data Management Workshop.
Provenance-Aware Faceted Search Deborah L. McGuinness 1,2 Peter Fox 1 Cynthia Chang 1 Li Ding 1.
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,
Configurable User Interface Framework for Cross-Disciplinary and Citizen Science Presented by: Peter Fox Authors: Eric Rozell, Han Wang, Patrick West,
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
SWWG PROJECT OVERVIEW Semantic Technologies for Integrating USGS Data.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Joanne Luciano With Peter Fox and Li Ding CSCI Week 10, November.
What has been lacking, until recently, is a successful method to develop, implement and sustain informatics solutions to modern application problems, such.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Semantic Cyberinfrastructure for Knowledge and Information Discovery (SCiKID) Proposal Principle Investigator: Eric Rozell Tetherless World Constellation.
Discovering accessibility, display, and manipulation of data in a data portal Nancy Hoebelheinrich Patrick West 2
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Janice Gordon September 5, 2012 Semantic Technologies for Integrating.
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
Local global disambiguation of terms and concepts The BCO-DMO metadata database uses controlled vocabularies to record many of the important pieces of.
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
1 Practical aspects of creating semantic web applications Peter Fox (RPI) ESIP Summer Meeting Knoxville, TN, July 21, 2010, 15:30pm Slides at:
Citation and Recognition of contributions using Semantic Provenance Knowledge Captured in the OPeNDAP Software Framework Patrick West 1
Transparency, applications, and ab- stuff – effect on tools for e-science: it’s all about Informatics June 21, 2010, IATUL 2010 Peter Fox (RPI and WHOI)
Applying Provenance Extensions to OPeNDAP Framework Patrick West, James Michaelis, Tim Lebo, Deborah L. McGuinness Rensselaer Polytechnic Institute Tetherless.
ToolMatch Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Products Patrick West 1 Nancy Hoebelheinrich.
Resource Discovery for Extreme Scale Collaboration Benno Lee Patrick West 1 William Smith 2
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Semantics and analytics = making the data and the decisions smarter? Digital Antiquity CI Feb 7-8, 2013, Arlington VA Peter Fox (RPI and WHOI)
DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( ), Yu Chen 1 Patrick West.
VIVO Conference 2013 Panel on VIVO Use-Cases for Collaborative Science: From Researcher Networks to Semantic User Interfaces for Data Patrick West – Tetherless.
Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
 Key integrating concepts  Groups  Formal Community Groups  Ad-hoc special purpose/ interest groups  Fine-grained access control and membership 
Determining Fitness-For-Use of Ontologies through Change Management, Versioning and Publication Best Practices Patrick West 1 Stephan.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Lessons learned from Semantic Wiki Jie Bao and Li Ding June 19, 2008.
How Environmental Informatics is Preparing Us for the Era of Big Data AGU FM 2013 GC11F-01 December 09, 2013, MW 3001 Peter
NMFS Use Case 1 review/ evaluation and next steps April 19, 2012 Woods Hole, MA Peter Fox (RPI* and WHOI**) and Andrew Maffei (WHOI) *Tetherless World.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Information Model Driven Semantic Framework Architecture and Design for Distributed Data Repositories AGU 2011, IN51D-04 December 9, 2011 Peter Fox (RPI)
Social and Personal Factors in Semantic Infusion Projects Patrick West 1 Peter Fox 1 Deborah McGuinness 1,2
TWC Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 U.S. GEOTRACES Data Management Cyndy Chandler BCO-DMO ~ WHOI 23 September 2008.
A Framework for Earth Science Search Interface Development Design and Implementation of S2S Presented by: Stephan Zednik, Tetherless World Constellation.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 The Biological and Chemical Oceanography Data Management Office (BCO-DMO) Cyndy.
The Semantic eScience Framework AGU FM10 IN22A-02 Deborah McGuinness and Peter Fox (RPI) Tetherless World Constellation.
Ontology and Application for Reusable Search Interface Design Plans for Advanced Semantic Technologies Final Project Eric Rozell, Tetherless World Constellation.
Bit.ly/2c3XMgd.
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,
Informatics underlying Data Science (ists)
Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation
Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation
CMSP / OCM Vocabulary Services rpi
NMFS Use Case 1 review/ evaluation and next steps
Modeling Data Set Versioning Operations
Adoption of RDA DTR and PIT in the Deep Carbon Observatory Data Portal
Modeling Data Set Versioning Operations
Presentation transcript:

Evolving the BCO-DMO search interface - experience with semantic and smart search Cyndy Chandler (WHOI) Peter Fox (RPI and WHOI) Robert Groman, Dicky Allison Andy Maffei (WHOI) Patrick West, Stephan Zednik (RPI) EGU 2010 Ocean Informatics

Basis of effort Staff and graduate students from the Tetherless World Constellation at Rensselaer Polytechnic Institute (RPI) have been collaborating with the Biological and Chemical Oceanography Data Management Office (BCO-DMO) -- a project operating out of the Woods Hole Oceanographic Institution and funded by the National Science Foundation. RPI staff and BCO-DMO team-members have been working with oceanographers, data managers, ontology modelers, software engineers and other experts to iteratively design and develop a semantically enabled prototype showing how domain scientists are able to perform better and smarter searches for data, access and manipulate more data sets, and begin to keep track of data provenance. There are plans for the features demonstrated in this prototype to be incorporated into BCO-DMO’s production website. If time: image informatics.. New results Tetherless World Constellation 2

3

4

5

6

7

Modern informatics enables a new scale-free** framework approach Use cases Stakeholders Distributed authority Access control Ontologies Maintaining Identity

Team… Collaboration: Small team of mixed skills created in order to provide a scientific infrastructure that is usable and extensible, providing semantic integration, and knowledge representation while requiring depth in each of the science areas. Facilitator - knows iterative methodology, guides the exercise Domain experts – knows resources, data, applications, tools Ontology modelers – to extract objects/relations from use cases and discussion Data Managers – understands the storage, organization and access to datasets Software engineers – responsible for architecture and technology aspects Scribe – capturing everything discussed Social Scientist – optional, as process is as much a social exercise as it is a technical and methodical activity Tetherless World Constellation 9

Tools Omni Graffle – Creation of Faceted-Browse Mockups CmapTools COE – Creation of Ontology Models, Causality graphs for provenance Protégé – Creation of Ongology and Individuals Skype (IM and VOIP), Dimdim (Web Conferencing), MediaWiki – Collaboration tools Google Web Toolkit + SmartGWT – Rapid UI Prototyping Jena/TDB and Joseki – triple store and SPARQL endpoint server – can be extended to perform reasoning and the execution of semantic rules. Tetherless World Constellation 10

Use cases 1.Do you have any data online from Hutchins from award number OCE ? 2.I want to download (temperature, biological,...) data in the following areas (N. Atlantic, bounding box, where JGOFs survey was done,...) 3.What new data has been added since last year (and organize it by project) 4.Show me all the places where the surface temperature in the North Atlantic is 25 degrees during June. Tetherless World Constellation 11

Quick prototype of use case 1 Tetherless World Constellation 12

Evolving the ontology model Tetherless World Constellation 13

To… Example where the iterative process helped to develop an understanding by WHOI domain experts ontologies and translating their concepts into an ontology and the ontology developers to understand the specific domain vocabulary. Successive iterations helped to expand and simplify concepts and incorporate already existing ontologies. Similar in instrument, platform, parameter ontology development. Tetherless World Constellation 14 Includes all of the foaf concepts for name, contact information, interests

Current version Tetherless World Constellation 15

Current version Tetherless World Constellation 16

Summary Migrated a database driven, highly programmed implementation into an ontology and smart query driven search with modest effort (okay, a few brain cells died along the way) – Use case driven – Ontology driven at many levels – Application oriented, rapid prototyping All along the way, we evaluated our semantic developments (ontologies) and implementation to gauge their benefits or deficiencies Continuing to add functions based on new use cases Tetherless World Constellation 17

HABCAM Image Informatics Color and Illumination Prof. Chuck Steward (RPI) Students: Ryan Leary and Zack Schilling Problems addressed: – Illumination Across images Within image – Color Differing attenuation in water for red, green and blue – Demosaicing is noisy Approach: – Combined physical and empirical model

Color Correction Based on Beer’s Law Before After

Illumination Correction Based on Light-Field Map Before After Difference

Further Information Contacts: – – Tetherless World Constellation 21