An Environment for Merging and Testing Large Ontologies Deborah McGuinness, Richard Fikes, James Rice*, Steve Wilder Associate Director and Senior Research.

Slides:



Advertisements
Similar presentations
DAML Queries/Life Cycle SRI International. Parts of Ontologies (used in the examples to follow) Assumptions Researcher String lastName firstName Publication-ref.
Advertisements

April 23, 2007McGuinness NIST Interoperability Week One Ontology Spectrum Perspective Deborah L. McGuinness Acting Director & Senior Research Scientist.
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Jim Hendler Chief Scientist - Information Systems Office DARPA.
Ontologies (What they are; Why you should care; What you should know) Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge.
Ontology-enhanced retrieval (and Ontology-enhanced applications) Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge Systems.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Chapter 11 user support. Issues –different types of support at different times –implementation and presentation both important –all need careful design.
Tools for DAML-Based Services, Document Templates, and Query Answering Richard Fikes Deborah McGuinness Sheila McIlraith Tran Cao Son Honglei Zeng Steve.
Chapter 9: Ontology Management Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Chapter 9: Ontology Management Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Building Enterprise Applications Using Visual Studio ®.NET Enterprise Architect.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Semantic Web Tools for Authoring and Using Analysis Results Richard Fikes Robert McCool Deborah McGuinness Sheila McIlraith Jessica Jenkins Knowledge Systems.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya Fridman Noy and Mark A. Musen.
Accelerate Business Success With CRM CRM Interoperability.
Ontologies Come of Age: The Next Generation OCAS October 24, 2011 Bonn, Germany Deborah L. McGuinness Tetherless World Senior Constellation Chair Professor.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 1: Introduction to Decision Support Systems Decision Support.
Tools for Developing and Using DAML-Based Ontologies and Documents Richard Fikes Deborah McGuinness Sheila McIlraith Jessica Jenkins Son Cao Tran Gleb.
Web Web 3.0 = Web 5.0? The HSFBCY + CIHR + Microsoft Research SADI and CardioSHARE Projects Mark Wilkinson & Bruce McManus Heart + Lung Institute.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy and Mark A. Musen.
Information Fusion: Moving from domain independent to domain literate approaches Professor Deborah L. McGuinness Tetherless World Constellation, Rensselaer.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
Ontology-enhanced Search for Primary Care Literature Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge Systems Laboratory.
Clément Troprès - Damien Coppéré1 Semantic Web Based on: -The semantic web -Ontologies Come of Age.
1 Foundations IV: Ontology Evolution and Knowledge Management Class Session 6 Deborah McGuinness and Peter Fox (NCAR) CSCI Week 6 – October 6,
The Semantic Web Deborah McGuinness Associate Director and Senior Research Scientist Knowledge Systems Laboratory Stanford University Stanford, CA USA.
Ontologies Come of Age Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge Systems Laboratory Stanford University Stanford,
CHAPTER TEN AUTHORING.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
1 5 Nov 2002 Risto Pohjonen, Juha-Pekka Tolvanen MetaCase Consulting AUTOMATED PRODUCTION OF FAMILY MEMBERS: LESSONS LEARNED.
Advanced topics in software engineering (Semantic web)
Ontologies Come of Age Deborah L. McGuinness Stanford University “The Semantic Web: Why, What, and How, MIT Press, 2001” Presented by Jungyeon, Yang.
1 Foundations IV: Ontology Evolution and Knowledge Management Class Session 8 Deborah McGuinness and Joanne Luciano With Peter Fox and Li Ding CSCI
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Chap#11 What is User Support?
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS Instructor Ms. Arwa Binsaleh.
Creating Creating, Maintaining Integrating Maintaining, and IntegratingUnderstandable Knowledge Bases Richard FikesDeborah McGuinnessSheila McIlraith Jessica.
Working with Ontologies Introduction to DOGMA and related research.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Ontologies and the Semantic Web Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge Systems Laboratory Stanford University.
Introduction to Interactive Media Interactive Media Tools: Authoring Applications.
Faculty Faculty Richard Fikes Edward Feigenbaum (Director) (Emeritus) (Director) (Emeritus) Knowledge Systems Laboratory Stanford University “In the knowledge.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
 Programming - the process of creating computer programs.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
17 April 2005Sharif University of Tech Page 1 Ontologies Come of Age Amir Hossein Assiaee
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Ontology Properties The following properties are necessary for something in order to be considered as an ontology (specifications possessing these properties.
International Workshop 28 Jan – 2 Feb 2011 Phoenix, AZ, USA Ontology in Model-Based Systems Engineering Henson Graves 29 January 2011.
Ontologies (What they are; Why you should care; What you should know) Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge.
Modeling with UML – Class Diagrams
Building Enterprise Applications Using Visual Studio®
CCNT Lab of Zhejiang University
Creating, Maintaining, and Integrating Understandable Knowledge Bases
Analyzing and Securing Social Networks
Service-Oriented Computing: Semantics, Processes, Agents
An Environment for Merging and Testing Large Ontologies
Ontologies (What they are; Why you should care; What you should know)
Chapter 11 user support.
Service-Oriented Computing: Semantics, Processes, Agents
ONTOMERGE Ontology translations by merging ontologies Paper: Ontology Translation on the Semantic Web by Dejing Dou, Drew McDermott and Peishen Qi 2003.
Presentation transcript:

An Environment for Merging and Testing Large Ontologies Deborah McGuinness, Richard Fikes, James Rice*, Steve Wilder Associate Director and Senior Research Scientist Knowledge Systems Laboratory Stanford University Stanford, CA *CommerceOne, Mountain View, CA

Motivation: Ontology Integration Trends u Integrated in most search applications (Yahoo, Lycos, Xift, …) u Core component of E-Commerce applications (Amazon, eBay, Virtual Vineyards, REI, VerticalNet, CommerceOne, etc.) u Integrated in configuration applications (Dell, PROSE, etc.)

Motivation: Ontology Evolution u Controlled vocabularies abound (SIC-codes, UN/SPSC, RosettaNet, OpenDirectory,…) u Distributed ownership/maintenance u Larger scale (Open Directory >23.5K editors, ~250K categories, 1.65M sites) u Becoming more complicated - Moving to classes and slots (and value restrictions, enumerated sets, cardinality)

Chimaera – A Merging and Diagnostic Ontology Environment Web-based tool utilizing the KSL Ontolingua platform that supports: u merging multiple ontologies found in distributed environments u analysis of single or multiple ontologies u attention focus in problematic areas u simple browsing and mixed initiative editing

The Need For KB Merging u Large-scale knowledge repositories will contain KBs produced by multiple authors in multiple settings u KBs for applications will be built by assembling and extending multiple modular KBs from repositories u KBs developed by multiple authors will frequently u Express overlapping knowledge in a common domain u Use differing representations and vocabularies u For such KBs to be used together as building blocks - Their representational differences must be reconciled

The KB Merging Task u Combine KBs that: u Were developed independently (by multiple authors) u Express overlapping knowledge in a common domain u Use differing representations and vocabularies u Produce merged KB with u Non-redundant u Coherent u Unified vocabulary, content, and representation

How KB Merging Tools Can Help u Combine input KBs with name clashes u Treat each input KB as a separate name space u Support merging of classes and relations u Replace all occurrences by the merged class or relation u Test for logical consistency of merge (e.g. instances/subclasses of multiple disjoint classes) u Actively look for inconsistent extensions u Match vocabulary u Find name clashes, subsumed names, synonyms,... u Focus attention u Portions of KB where new relationships are likely to be needed E.g., sibling subclasses from multiple input KBs u Derive relationships among classes and relations u Disjointness, equivalence, subsumption, inconsistency,...

Merging Tools Merging Tools u Merging can be arbitrarily difficult u KBs can differ in basic representational design u May require extensive negotiation among authors u Tools can significantly accelerate major steps u KB merging using conventional editing tools is u Difficult  Labor intensive  Error prone u Hypothesis: tools specifically designed to support KB merging can significantly u Speed up the merging process u Make broader user set productive u Improve the quality of the resulting KB

Our KB Analysis Task u Review KBs that: u Were developed using differing standards u May be syntactically but not semantically validated u May use differing modeling representations u May have different purposes u Produce KB logs (in interactive environments) u Identify provable problems u Suggest possible problems in style and/or modeling u Are extensible by being user programmable

Chimaera Usage u HPKB program – analyze diverse KBs, support KR novices as well as experts u Cleaning semi-automatically generated KBs u Browsing and merging multiple controlled vocabularies (e.g., internal vocabularies and UN/SPSC (std products and services codes)) u Reviewing internal vocabularies

Discussion/Conclusion Ontologies are becoming more central to applications, they are larger, more distributed, and longer-livedOntologies are becoming more central to applications, they are larger, more distributed, and longer-lived Environmental support (in particular merging and diagnostic support) is more critical for the broader user baseEnvironmental support (in particular merging and diagnostic support) is more critical for the broader user base Chimaera provides merging and diagnostic support for ontologies in many formatsChimaera provides merging and diagnostic support for ontologies in many formats It improves performance over existing toolsIt improves performance over existing tools It has been used by people of various training backgrounds in government and commercial applications and is available for use.It has been used by people of various training backgrounds in government and commercial applications and is available for use. - movie, tutorial, papers, link to live system, etc. - movie, tutorial, papers, link to live system, etc.

Extras

The Need For KB Analysis u Large-scale knowledge repositories will contain KBs produced by multiple authors in multiple settings u KBs for applications will be built by assembling and extending multiple modular KBs from repositories that may not be consistent u KBs developed by multiple authors will frequently u Express overlapping knowledge in different, possibly contradictory ways u Use differing assumptions and styles u Have different purposes u KBs must be reviewed for appropriateness and “correctness”

What is an Ontology? Catalog/ ID General Logical constraints Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal is-a Formal instance Value Restrs. Disjointness, Inverse, part- of…

Ontologies and importance to E-Commerce Simple ontologies provide: u Controlled shared vocabulary (search engines, authors, users, databases, programs all speak same language) u Organization (and navigation support) u Expectation setting (left side of many web pages) u Browsing support (tagged structures such as Yahoo!) u Search support (query expansion approaches such as FindUR, e-Cyc) u Sense disambiguation

Ontologies and importance to E-Commerce II u Foundation for expansion and leverage u Conflict detection u Completion u Regression testing/validation/verification support foundation u Configuration support u Structured, comparative search u Generalization/ Specialization u …

E-Commerce Search (starting point Forrester modified by McGuinness) u Ask Queries - multiple search interfaces (surgical shoppers, advice seekers, window shoppers) - set user expectations (interactive query refinement) - anticipate anomalies u Get Answers - basic information (multiple sorts, filtering, structuring) - modify results (user defined parameters for refining, user profile info, narrow query, broaden query, disambiguate query) - suggest alternatives (suggest other comparable products even from competitor’s sites) u Make Decisions - manipulate results (enable side by side comparison) - dive deeper (provide additional info, multimedia, other views) - take action (buy)

A Few Observations about Ontologies u Simple ontologies can be built by non-experts u Consider Verity’s Topic Editor, Collaborative Topic Builder, GFP interface, Chimaera, etc. u Ontologies can be semi-automatically generated u from crawls of site such as yahoo!, amazon, excite, etc. u Semi-structured sites can provide starting points u Ontologies are exploding (business pull instead of technology push) u most e-commerce sites are using them - MySimon, Affinia, Amazon, Yahoo! Shopping,, etc. u Controlled vocabularies (for the web) abound - SIC codes, UMLS, UN/SPSC, Open Directory, Rosetta Net, … u Business ontologies are including roles u DTDs are making more ontology information available u Businesses have ontology directors u “Real” ontologies are becoming more central to applications

Implications and Needs u Ontology Language Syntax and Semantics u Environments for Creation and Maintenance of Ontologies u Training (Conceptual Modeling, reasoning implications, …) u Issues: u Collaboration among distributed teams u Diverse training levels u Interconnectivity with many systems/standards u Analysis and Diagnosis u Scale