Existing Designs and Prototypes at RPI


Show and Tell

Earth Science Ontology Repository (ESOR)

What is ESOR? [1]
The Earth Science Ontology Repository (ESOR) provides an entity matching service as a backend knowledge base for multiple applications. It serves a similar function to BioPortal [2], but with more of a focus on the Earth Science domain.

How to use ESOR
Links:
keyword search: http://orion.tw.rpi.edu/~zhengj3/wod/earthsearch.php
matched entity with score by keyword: dataonetwc.tw.rpi.edu/linkipedia/search?query=
ontology by entity: dataonetwc.tw.rpi.edu/linkipedia/read?url=
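The two linkipedia endpoints listed above take their input as a query-string parameter. A minimal Python sketch of building those request URLs follows. The http:// scheme, the helper names, and the form-style percent-encoding are assumptions, since the slides give only the host, path, and parameter name; the response format is not described here, so the sketch stops at constructing the URL.

```python
from urllib.parse import urlencode

# Host and paths come from the slide; the "http://" scheme is an assumption.
LINKIPEDIA_BASE = "http://dataonetwc.tw.rpi.edu/linkipedia"


def search_url(keywords: str) -> str:
    """URL for 'matched entity with score by keyword' (hypothetical helper)."""
    return f"{LINKIPEDIA_BASE}/search?{urlencode({'query': keywords})}"


def read_url(entity_uri: str) -> str:
    """URL for 'ontology by entity' (hypothetical helper)."""
    return f"{LINKIPEDIA_BASE}/read?{urlencode({'url': entity_uri})}"


if __name__ == "__main__":
    # Keyword input taken from Example #1 (MsTMIP #28).
    print(search_url("snow, snow depth"))
    # Entity URI taken from the BioPortal comparison slide.
    print(read_url("http://sweet.jpl.nasa.gov/2.3/propSpaceThickness.owl#SnowCover"))
```

Encoding the entity URI matters because it contains "#", which would otherwise be treated as a fragment and never reach the server.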

Example #1 input: “snow, snow depth” (MsTMIP #28); output of keyword search:

Example #2 output of matched entity with score by keyword:

Example #3 output of ontology by entity:

Comparison with BioPortal
Given “snow depth” or “snow, snow depth” as input, BioPortal returns entities that match either of the keywords “snow” or “depth”. As a result, the better match “http://sweet.jpl.nasa.gov/2.3/propSpaceThickness.owl#SnowCover” is missed. A similar problem occurs with other MsTMIP variables (#10 Heterotrophic Respiration, #11 Leaf Area Index, #22 Near surface specific humidity, #23 Sensible heat, #24 Latent Heat, ...).
BioPortal offers much richer functionality than ESOR. However, when used only as a backend knowledge base, ESOR performs better in terms of recall and precision.
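To make the recall and precision comparison concrete, here is a small Python sketch of how the two measures are computed for one query. The result lists and the "ex:" identifiers are made up for illustration; they are not actual BioPortal or ESOR output.

```python
from typing import Iterable, Tuple


def precision_recall(returned: Iterable[str], relevant: Iterable[str]) -> Tuple[float, float]:
    """Precision = hits / returned, recall = hits / relevant (0.0 if a set is empty)."""
    returned_set, relevant_set = set(returned), set(relevant)
    hits = returned_set & relevant_set
    precision = len(hits) / len(returned_set) if returned_set else 0.0
    recall = len(hits) / len(relevant_set) if relevant_set else 0.0
    return precision, recall


if __name__ == "__main__":
    # Hypothetical gold standard for the query "snow depth".
    relevant = {"ex:SnowCover"}
    # Hypothetical results when each keyword is matched independently.
    per_keyword = ["ex:Snow", "ex:Depth", "ex:WaterDepth", "ex:SnowStorm"]
    # Hypothetical results when the whole phrase is matched by content.
    by_content = ["ex:SnowCover", "ex:Snow"]
    print(precision_recall(per_keyword, relevant))
    print(precision_recall(by_content, relevant))
```

With these invented lists, the per-keyword results contain no relevant entity, while the content-based results contain the one relevant entity among two returned.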

List of ontologies

Version 1 (the current version):
ChEBI (Chemical Entities of Biological Interest)
OBI (Ontology for Biomedical Investigations)
OBOE (Extensible Observation Ontology)
PROV-O (The Provenance Ontology)
SWEET Chemical Properties
SWEET Human Research
SWEET Units
SemantEco Water Ontology
SemantEco Pollution Ontology
Time Ontology
UO (Units of Measurement Ontology)
dcterms (Dublin Core)
foaf (Friend of a Friend)
geonames
wgs (Basic Geo Vocabulary)
...

Version 2 (coming soon):
KB_Bio_101 (AURA)
Santa Barbara Coastal Observation Ontology (OBOE-SBC)
...

* For the complete list, see section 3 of the document: https://docs.google.com/document/d/1Hs3k0RrfUoQkxKEJBJtU9trFdqC-NdhtwFKSz5wcHXM/edit#

Underlying techniques

How can we benefit from ESOR in D1?
Help the user choose the right entity with which to annotate their dataset
Serve as a backend knowledge base for automatic semantic annotation
Match entities based on content instead of single keywords

SemantEco Annotator

SemantEco: Semantic Environmental Monitoring

Goal, approach, and questions we try to answer:
Enable/empower communities (citizens and scientists) to explore pollution sites, facilities, regulations, and health impacts, along with provenance
Connections to USGS, Lake George, and IBM, expanding to discussions of predictions and intervention suggestions
Where are pollution events happening?
What are the health impacts?
How does pollution correlate with population changes (wildlife, invasives, etc.)?

Explanation of pollution limits
Graphing thresholds and trends
Possible health effect of contaminant (EPA)
Filtering by facet to select type of data
Link for reporting problems
Extended with input from USGS, with population counts for birds and fish

Tools for Semantic Annotation of Measurements
Annotator demo: https://www.youtube.com/watch?v=pKO5NwgWnyc
Annotation of CSVs
OBOE design pattern
D1 Phase I product, and other ongoing work at RPI...
GitHub repository and web interface available

Focus of D1 Semantics in Phase II
Semantics of measurements:
...binding raw data values to concepts drawn from ontologies
...often through metadata
...using W3C standards for annotation (PROV, OA)
...to facilitate resource discovery and interpretation through enhanced precision and recall of searches
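As an illustration of what binding raw data values to ontology concepts can look like for a CSV file, the following Python sketch attaches a concept URI to every value in an annotated column. The class and function names, the sample CSV, and the column name are hypothetical; the concept URI is the SWEET entity quoted earlier in these slides. The sketch does not emit PROV or OA records and is not the SemantEco Annotator's implementation.

```python
import csv
import io
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator


@dataclass(frozen=True)
class ColumnAnnotation:
    """Hypothetical record linking one CSV column to one ontology concept."""
    column: str
    concept_uri: str


def annotate_rows(csv_text: str, annotations: Iterable[ColumnAnnotation]) -> Iterator[Dict[str, object]]:
    """Yield one binding per annotated cell: row number, column, raw value, concept."""
    concept_by_column = {a.column: a.concept_uri for a in annotations}
    reader = csv.DictReader(io.StringIO(csv_text))
    for row_number, row in enumerate(reader, start=1):
        for column, raw_value in row.items():
            if column in concept_by_column:
                yield {
                    "row": row_number,
                    "column": column,
                    "value": raw_value,
                    "concept": concept_by_column[column],
                }


if __name__ == "__main__":
    sample = "site,snow_depth\nA,0.42\nB,0.10\n"  # made-up data
    snow = ColumnAnnotation(
        column="snow_depth",
        concept_uri="http://sweet.jpl.nasa.gov/2.3/propSpaceThickness.owl#SnowCover",
    )
    for binding in annotate_rows(sample, [snow]):
        print(binding)
```

Keeping the annotation at the column level means the concept is stated once in metadata and applied to every value, which matches the "often through metadata" point above.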

What’s new: ontology search
Features:
Weighted ranking
Entity linking
Similarity check
User preference
The Semantic Annotator's OWL search takes advantage of the Earth Science ontology knowledge base and the linkipedia tool from the Tetherless World Constellation at RPI.

What’s new: user management
The User interface can be implemented for a variety of storage / representation methods
The UserStore interface handles read-write for Users (to-from database, file, etc.)
The Permission interface is a simple representation of source and level that can be translated to a URI
Interfaces: Permission, User, UserStore
Misc interface updates:
Integrated ontology search
Other bug fixes
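The three interfaces described above can be sketched as follows. This is an illustrative Python rendering under stated assumptions, not the Annotator's actual code: the method names (read, write, to_uri), the in-memory store, and the example.org URI base are all hypothetical, and the slides do not say how a Permission is translated to a URI.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass, field
from typing import Dict, List, Optional
from urllib.parse import quote


@dataclass(frozen=True)
class Permission:
    """A source (what is protected) and a level (what access is granted)."""
    source: str
    level: str

    def to_uri(self, base: str = "http://example.org/permission") -> str:
        # Hypothetical URI layout: <base>/<source>/<level>, percent-encoded.
        return f"{base}/{quote(self.source, safe='')}/{quote(self.level, safe='')}"


@dataclass
class User:
    name: str
    permissions: List[Permission] = field(default_factory=list)


class UserStore(ABC):
    """Read-write access to Users, independent of the storage backend."""

    @abstractmethod
    def read(self, name: str) -> Optional[User]: ...

    @abstractmethod
    def write(self, user: User) -> None: ...


class InMemoryUserStore(UserStore):
    """Simplest possible backend; a database or file store would follow the same interface."""

    def __init__(self) -> None:
        self._users: Dict[str, User] = {}

    def read(self, name: str) -> Optional[User]:
        return self._users.get(name)

    def write(self, user: User) -> None:
        self._users[user.name] = user


if __name__ == "__main__":
    store = InMemoryUserStore()
    store.write(User("alice", [Permission(source="dataset-42", level="write")]))
    print(store.read("alice").permissions[0].to_uri())
```

Separating UserStore from User is what lets the same user model sit on top of a database, a file, or another backend, as the slide describes.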

How can we benefit from SemantEco Annotator in D1?
The Annotator can be integrated into the DataONE annotator, especially for use with CSV-formatted datasets
Reuse the design pattern
Facilitate resource discovery and interpretation through enhanced precision and recall of searches

Related projects: Jefferson Project

The Jefferson Project
Fundamental goal: understand, predict, and enable a healthy Lake George ecosystem
Using cutting-edge science to enable smarter solutions:
Multi-beam SONAR, bathymetric LiDAR, terrestrial LiDAR
Sensor network composed of 30+ instruments streaming data 24/7/365
High-resolution, high-accuracy, high-data-density seamless data set
~70 Tb/year total raw data expected, 7-8 Tb/year of “product data”
The semantic approach contributes in two knowledge representation and reasoning areas:
1. Human interventions in the deployment and maintenance of local sensor networks, including the scientific knowledge used to decide how and where sensors are deployed
2. Knowledge about simulation results, including parameters, interpretation of results, and comparison of results against external data
Data integration through the use of the Human-Aware Sensor Network Ontology (HASNetO), which is based on OBOE, W3C PROV, and VSTO

References
[1] ESOR link: http://orion.tw.rpi.edu/~zhengj3/wod/earthsearch.php
[2] BioPortal: http://bioportal.bioontology.org/
[3] The Earth Science Ontology Repository: https://docs.google.com/document/d/1Hs3k0RrfUoQkxKEJBJtU9trFdqC-NdhtwFKSz5wcHXM/edit#
[4] Wikipedia/dbpedia mapping for MsTMIP variables: https://docs.google.com/document/d/18tKNwyonw2sFzbFE0BzPt8WtPA1gVS6EPaGYxeh9dd4/edit#heading=h.fj17u9rk42u
[5] MsTMIP use case: https://docs.google.com/document/d/1hS7j-TCLtbA2x0ztZZhXI-1NNqE5DEi585MnOq8fhpo/edit#heading=h.u4s2g4klu51s
[6] MsTMIP 45 variables mapping: https://docs.google.com/document/d/1y2ieWXIhmE6-vz3SesCf50tfO46DWcUBVbWpCWdlBWk/edit
[7] Annotator: http://tw.rpi.edu/web/project/SemantEcoAnnotator

Thanks!