Information Fusion: Moving from domain independent to domain literate approaches Professor Deborah L. McGuinness Tetherless World Constellation, Rensselaer.

Slides:



Advertisements
Similar presentations
Ontologies (What they are; Why you should care; What you should know) Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge.
Advertisements

Intelligent Technologies Module: Ontologies and their use in Information Systems Revision lecture Alex Poulovassilis November/December 2009.
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Tools for DAML-Based Services, Document Templates, and Query Answering Richard Fikes Deborah McGuinness Sheila McIlraith Tran Cao Son Honglei Zeng Steve.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
McGuinness – Microsoft eScience – December 8, Semantically-Enabled Science Informatics: With Supporting Knowledge Provenance and Evolution Infrastructure.
Semantic Web Tools for Authoring and Using Analysis Results Richard Fikes Robert McCool Deborah McGuinness Sheila McIlraith Jessica Jenkins Knowledge Systems.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya Fridman Noy and Mark A. Musen.
How can Computer Science contribute to Research Publishing?
An Environment for Merging and Testing Large Ontologies Deborah McGuinness, Richard Fikes, James Rice*, Steve Wilder Associate Director and Senior Research.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Tools for Developing and Using DAML-Based Ontologies and Documents Richard Fikes Deborah McGuinness Sheila McIlraith Jessica Jenkins Son Cao Tran Gleb.
ICS (072)Database Systems Background Review 1 Database Systems Background Review Dr. Muhammad Shafique.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy and Mark A. Musen.
Jiao Tao, Li Ding, Deborah L. McGuinness Tetherless World Constellation Rensselaer Polytechnic Institute Troy, NY, USA Instance Data Evaluation on the.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Scientific Knowledge Discovery in Complex Semantic Networks of Geophysical Systems (no pressure…) EGU2012, NP2.6 April 25, 2012, Vienna, Austria Peter.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Linking Disparate Datasets of the Earth Sciences with the SemantEco Annotator Session: Managing Ecological Data for Effective Use and Reuse Patrice Seyed.
Overview of the Database Development Process
Get More Value from Your Reference Data—Make it Meaningful with TopBraid RDM Bob DuCharme Data Governance and Information Quality Conference June 9.
Provenance-Aware Faceted Search Deborah L. McGuinness 1,2 Peter Fox 1 Cynthia Chang 1 Li Ding 1.
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
Fox OOS meeting 1 Ontologies and Semantic Applications in Earth Sciences Peter Fox (TWC/RPI; formerly HAO/NCAR) Thanks to many. Projects funded.
ITEC224 Database Programming
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
SemantAqua: A Semantically-Enabled Provenance-Aware Water Quality Portal Evan W. Patton, Ping Wang, Jin Guang Zheng, Timothy Lebo, Li Ding, Joanne Luciano,
1 Foundations IV: Ontology Evolution and Knowledge Management Class Session 6 Deborah McGuinness and Peter Fox (NCAR) CSCI Week 6 – October 6,
Integrated Development Environment for Policies Anjali B Shah Department of Computer Science and Electrical Engineering University of Maryland Baltimore.
Scalable Metadata Definition Frameworks Raymond Plante NCSA/NVO Toward an International Virtual Observatory How do we encourage a smooth evolution of metadata.
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
References: [1] Branch, B.D., Fosmire, M., The role of interdisciplinary GIS and data curation librarians in enhancing authentic scientific research.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
Semantically-Enabled Science Data Integration (SESDI) and The Virtual Solar-Terrestrial Observatory (VSTO) Semantically-enabled (large-scale) Scientific.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Semantic Web - an introduction By Daniel Wu (danielwujr)
1 Advanced Semantic Technologies Prof. Deborah McGuinness and Dr. Patrice Seyed CSCI CSCI ITWS ITWS TA: Justin.
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
What to Send First? A Study of Utility in the Semantic Web Mike Dean 1, Prithwish Basu 1, Ben Carterette 2, Craig Partridge 1, and James Hendler 3 1 Raytheon.
123 Jiao Tao 1, Li Ding 2, Deborah L. McGuinness 3 Tetherless World Constellation Rensselaer Polytechnic Institute Troy, NY, USA 1 PhD Student 2 Postdoctoral.
Ch- 8. Class Diagrams Class diagrams are the most common diagram found in modeling object- oriented systems. Class diagrams are important not only for.
Aim Ability to automate the detection of financial inconsistency and irregularity Problem Need to create a unified and logically rigorous terminology.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
ISO/IEC JTC 1/SC 32 Plenary and WGs Meetings Jeju, Korea, June 25, 2009 Jeong-Dong Kim, Doo-Kwon Baik, Dongwon Jeong {kjd4u,
A Semantic Web Approach for the Third Provenance Challenge Tetherless World Rensselaer Polytechnic Institute James Michaelis, Li Ding,
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Approach to building ontologies A high-level view Chris Wroe.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Characterizing Knowledge on the Semantic Web with Watson Mathieu d’Aquin, Claudio Baldassarre, Laurian Gridinoc, Sofia Angeletou, Marta Sabou, Enrico Motta.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Class Diagrams. Terms and Concepts A class diagram is a diagram that shows a set of classes, interfaces, and collaborations and their relationships.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Social and Personal Factors in Semantic Infusion Projects Patrick West 1 Peter Fox 1 Deborah McGuinness 1,2
Ontologies (What they are; Why you should care; What you should know) Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge.
Scaling the Wall: Experiences adapting a Semantic Web application to utilize social networks on mobile devices Evan W. Patton 1 ( ) &
SysML 2.0 Formalism Requirements and Potential Language Architectures
CCNT Lab of Zhejiang University
Creating, Maintaining, and Integrating Understandable Knowledge Bases
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Ontology Evolution: A Methodological Overview
Analyzing and Securing Social Networks
An Environment for Merging and Testing Large Ontologies
Ontologies (What they are; Why you should care; What you should know)
Ontology-Based Approaches to Data Integration
Presentation transcript:

Information Fusion: Moving from domain independent to domain literate approaches Professor Deborah L. McGuinness Tetherless World Constellation, Rensselaer Polytechnic Institute Troy, NY USA AGU 2008 Fall Meeting 15–19 December 2008, San Francisco, California

Previous (CS) Work General toolkit work exists to support many aspects of analysis and evolution of knowledge encodings Issues diagnosis and support for: –Collaboration (with distributed teams) –Diverse training levels –Interconnectivity with many systems/standards –Scale –Ontology mapping and merging Ontology (schema) diagnostics Instance level registration and analysis

Approaches Review structure and encoding of schema for logical and possible problems Review structure and encoding of ground data for logical and possible problems Review (and automatically or semi- automatically gather) existing ontologies and data Incorporate domain knowledge into analysis and mapping/merging process Expose selected resources to help domain experts find, encode, and analyze

Chimaera: An Ontology Evolution Environment An interactive web-based tool aimed at supporting: Merging (later mapping) of ontological terms from varied sources Diagnosis of coverage and correctness of ontologies Maintaining ontologies over time Features: multiple I/O languages, loading and merging into multiple namespaces, collaborative distributed environment support, integrated browsing/editing environment, extensible diagnostic rule language Built by computer scientists uses domain independent approach. Has been extended to leverage selected portions of domain dependent info. (

The Analysis Task Review KBs that: – Were developed using differing standards – May be syntactically but not semantically validated – May use differing modeling representations Produce KB logs (in interactive environments) –Identify provable problems –Suggest possible problems in style and/or modeling –Are extensible by being user programmable

Loads in logic encoding: Integrity/ logical checks (numbers outside ranges, missing values, values of wrong type, “Bad” form checks cycles in structure referenced but not defined, redundant super classes …

The KB Merging -> Mapping Task Work with Knowledge bases that: – Were developed independently by multiple authors – Express overlapping knowledge in a common domain – Use differing representations and vocabularies Produce merged KB with –Non-redundant –Coherent –Unified vocabulary, content, and representation Later emphasis (by this and other work) on creating mapping relationships rather than merging commands

Next Generation Update to current languages (OWL, SPARQL, …) Leverage general (domain independent) foundations Focus more on instance level data (studies show more than 90% of RDF data is instance data) Focus more on mapping rather than merging

Semantic e-Science Data Evaluation

Ontology-Enabled Data Registration: SEDRE (Sinha Rezgui): a system that enables scientists to semantically register data sets for optimal querying and semantic integration SEDRE enables mapping of heterogeneous data to concepts in domain ontologies Uses an ontology for the registration procedure

13 How to find the data? Include background knowledge of the form data providers typically do. One example from using terms and relationships from volcano chemistry, atmospheric chemistry, thermal profiles, solar irradiance data, etc.

14 Registration of Volcanic Data SO 2 Emission from Kilauea east rift zone - vehicle-based (Source: HVO) Abreviations: t/d=metric tonne (1000 kg)/day, SD=standard deviation, WS=wind speed, WD=wind direction east of true north, N=number of traverses Location Codes: U - Above the 180° turn at Holei Pali (upper Chain of Craters Road) L - Below Holei Pali (lower Chain of Craters Road) UL - Individual traverses were made both above and below the 180° turn at Holei Pali H - Highway 11

15 Registering Volcanic Data (2) Uses background knowledge to provide connections – for example by linking volcanoes to their lat long location. Exposes chemistry information in a typical structured form

Directions Include background ontologies in domain areas Focus on instance data and schema data (e.g. TW-OIE) Add domain-specific checks that should be performed Update interfaces to aim at broader (not just CS) audience Integrate more with existing (often domain-specific) environments (e.g., SEDRE) Focus on known issues such as unit conversion support Leverage extensive acronym expansion options (e.g., Chimaera’s extension to include other vocabularies) Smart search (e.g. Noesis) Use results of learners /crawlers Scale