Knowledge Base Diagnostics Richard Fikes (Stanford KSL) Adam Pease (Teknowledge) Mala Mehrotra (Pragati Synergetic Research Inc.) Yolanda Gil (USC ISI)

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Ontologies (What they are; Why you should care; What you should know) Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge.
Languages & Inference Appropriate layering Do we need a logic? Do we need Description Logic? Legacy data; database storage vs inference Tolerant/anytime.
Ontology-enhanced retrieval (and Ontology-enhanced applications) Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge Systems.
Modelling with expert systems. Expert systems Modelling with expert systems Coaching modelling with expert systems Advantages and limitations of modelling.
Tools for DAML-Based Services, Document Templates, and Query Answering Richard Fikes Deborah McGuinness Sheila McIlraith Tran Cao Son Honglei Zeng Steve.
1 Ontology Language Comparisons doug foxvog 16 September 2004.
K S L W i n e A g e n t : Testbed Application for Semantic Web Technologies Deborah McGuinness Eric Hsu Jessica Jenkins Rob McCool Sheila McIlraith Paulo.
Chapter 6: Design of Expert Systems
Dynamic Ontologies on the Web Jeff Heflin, James Hendler.
Semantic Web Tools for Authoring and Using Analysis Results Richard Fikes Robert McCool Deborah McGuinness Sheila McIlraith Jessica Jenkins Knowledge Systems.
CSE 111: Object Oriented Design. Design “To program is human but to design is divine” (WEH)
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya Fridman Noy and Mark A. Musen.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy Stanford Medical Informatics Stanford University.
An Environment for Merging and Testing Large Ontologies Deborah McGuinness, Richard Fikes, James Rice*, Steve Wilder Associate Director and Senior Research.
PowerPoint Presentation for Dennis, Wixom & Tegarden Systems Analysis and Design Copyright 2001 © John Wiley & Sons, Inc. All rights reserved. Slide 1.
Tools for Developing and Using DAML-Based Ontologies and Documents Richard Fikes Deborah McGuinness Sheila McIlraith Jessica Jenkins Son Cao Tran Gleb.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy and Mark A. Musen.
Protégé An Environment for Knowledge- Based Systems Development Haishan Liu.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya Fridman Noy and Mark A. Musen.
Information Fusion: Moving from domain independent to domain literate approaches Professor Deborah L. McGuinness Tetherless World Constellation, Rensselaer.
Methodologies, tools and languages for building ontologies. Where is their meeting point? Oscar Corcho Mariano Fernandez-Lopez Asuncion Gomez-Perez Presenter:
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
Evaluating Ontology-Mapping Tools: Requirements and Experience Natalya F. Noy Mark A. Musen Stanford Medical Informatics Stanford University.
Database Systems: Design, Implementation, and Management Ninth Edition
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
 Knowledge Acquisition  Machine Learning. The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
1 Foundations IV: Ontology Evolution and Knowledge Management Class Session 6 Deborah McGuinness and Peter Fox (NCAR) CSCI Week 6 – October 6,
WebODE and its Ontology Management APIs. April 8th © Ontology Engineering Group WebODE and its Ontology Management APIs Ontology Engineering Group.
Of 33 lecture 10: ontology – evolution. of 33 ece 720, winter ‘122 ontology evolution introduction - ontologies enable knowledge to be made explicit and.
ISURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains Prof. Dr. Asuman Dogac METU-SRDC Turkey METU.
CoFM: An Environment for Collaborative Feature Modeling Li Yi Institute of Software, School of EECS, Peking University Key Laboratory of High Confidence.
 Architecture and Description Of Module Architecture and Description Of Module  KNOWLEDGE BASE KNOWLEDGE BASE  PRODUCTION RULES PRODUCTION RULES 
Proactive Acquisition Dialogues Jihie Kim Yolanda Gil
Sheila McIlraith, Knowledge Systems Lab DAML Kickoff 08/14/00 Mobilizing the Web with DAML-Enabled Web Services Services Team Sheila McIlraith (Technical.
Presented by: Ashgan Fararooy Referenced Papers and Related Work on:
Machine Learning Chapter 5. Artificial IntelligenceChapter 52 Learning 1. Rote learning rote( โรท ) n. วิถีทาง, ทางเดิน, วิธีการตามปกติ, (by rote จากความทรงจำ.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
Creating Creating, Maintaining Integrating Maintaining, and IntegratingUnderstandable Knowledge Bases Richard FikesDeborah McGuinnessSheila McIlraith Jessica.
OilEd An Introduction to OilEd Sean Bechhofer. Topics we will discuss Basic OilEd use –Defining Classes, Properties and Individuals in an Ontology –This.
Faculty Faculty Richard Fikes Edward Feigenbaum (Director) (Emeritus) (Director) (Emeritus) Knowledge Systems Laboratory Stanford University “In the knowledge.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
© 2006 Pearson Addison-Wesley. All rights reserved 2-1 Chapter 2 Principles of Programming & Software Engineering.
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
G.Govi CERN/IT-DB 1 September 26, 2003 POOL Integration, Testing and Release Procedure Integration  Packages structure  External dependencies  Configuration.
Approach to building ontologies A high-level view Chris Wroe.
17 April 2005Sharif University of Tech Page 1 Ontologies Come of Age Amir Hossein Assiaee
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
Ontologies (What they are; Why you should care; What you should know) Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge.
Tutoring & Help Systems Deepthi Bollu for CSE495 10/31/2003.
CIS 375 Bruce R. Maxim UM-Dearborn
OKBC (Open Knowledge Base Connectivity) An API For Knowledge Servers
KANAL: Knowledge ANALysis
Action Editor Storyboard
Creating, Maintaining, and Integrating Understandable Knowledge Bases
Knowledge Evolution Tools
KANAL: Knowledge ANALysis
Chapter 6: Design of Expert Systems
Semantic Web - Ontologies
Tools for DAML-Based Services, Query Answering, and
Chapter 5 Designing the Architecture Shari L. Pfleeger Joanne M. Atlee
ece 627 intelligent web: ontology and beyond
An Environment for Merging and Testing Large Ontologies
Ontologies (What they are; Why you should care; What you should know)
Test-Driven Ontology Development in Protégé
Yolanda Gil Jihie Kim Jim Blythe Surya Ramachandran
ONTOMERGE Ontology translations by merging ontologies Paper: Ontology Translation on the Semantic Web by Dejing Dou, Drew McDermott and Peishen Qi 2003.
Tools for DAML-Based Services, Query Answering, and
Presentation transcript:

Knowledge Base Diagnostics Richard Fikes (Stanford KSL) Adam Pease (Teknowledge) Mala Mehrotra (Pragati Synergetic Research Inc.) Yolanda Gil (USC ISI) Deborah McGuinness (Stanford KSL) 10/18/01

Knowledge Systems Laboratory, Stanford University2 Knowledge Evolution Tools  KB development requires knowledge evolution Debugging, refining, structuring, modularizing, …  Power tools are needed to support KB evolution  KB diagnosis >Bugs, omissions, heuristic warnings, architectural advice  KB partitioning >To enable effective reasoning >To produce reusable KB building blocks  KB merging >To enable interoperation of KBs with overlapping content  KSL is developing knowledge evolution tools

Knowledge Systems Laboratory, Stanford University3Chimaera  A Knowledge Evolution Tool Environment  Tools for KB diagnosis and merging  Available as a Web service or an OKBC client   Usable from a Web browser  Online user manual, tutorial, and demonstration movie  Performs KB diagnostics in batch mode  Uploads and analyzes user’s KB  Accepts KBs in OKBC, KIF, MELD, RDF, DAML, …  Provides results as HTML pages linked to frames and axioms  Provides user selectable set of diagnostic tests  Analyzes both the structure and content of a KB  Uses reasoners to analyze content

Knowledge Systems Laboratory, Stanford University4 Classification of Diagnostic Results  Errors  Logical inconsistencies E.g., contradictory type constraints  Content structure errors E.g., terms used but not defined  Anomalies  Missing information E.g., type constraints  Redundancies E.g., redundant superclass and type links  Extraneous structure or content E.g., terms defined but not used  Summaries E.g., counts of term references  Suggestions E.g., use consistent naming conventions

Knowledge Systems Laboratory, Stanford University5 “Background” Reasoning Analysis  Reasoning diagnostics that may take substantial time  Performed in background  Results incrementally posted on Web page  Completion notification sent to user via  Example reasoning diagnostics  Redundant axioms that are inferred by the KB (anomaly)  Inconsistent axioms whose negations are inferred by the KB (error)  Determine which relations in KB are primitive and non-primitive (summary) >Show relations on which each non-primitive relation depend  Determine classes that are disjoint (suggest adding results to KB)  Derive subclass and instance links (suggest adding links to KB) I.e., classification and recognition  Suggest reordering of an implication’s antecedents based on number of inferable instances of each antecedent (suggestion)

Knowledge Systems Laboratory, Stanford University6 Integration Into SHAKEN  Chimaera is a KB diagnostics tool in the SHAKEN system  Used to diagnose both pump priming and SME KBs  OKBC was used to do the integration  Chimaera is an OKBC client >Interacts with any OKBC server using the OKBC API >The Chimaera Web service uses Ontolingua as its OKBC server  SRI added an OKBC wrapper to the KM system >Enabled KM to be an OKBC server usable by OKBC clients >Enabled Chimaera’s diagnostics to run directly on KM KBs

Knowledge Systems Laboratory, Stanford University7 Chimaera Useful To SRI Team “Overall, we found that Chimaera was quite useful. It found 2 concepts (Indole and Imidazole) that were corrupted, several occurrences of redundant superclasses, and several incorrect domain and range constraints (due to our poor representation of "Information"). … We're currently fixing the bugs it revealed. It would be helpful if we could run Chimera on the component library frequently.” – Bruce Porter

Knowledge Systems Laboratory, Stanford University8 Next Steps: SME-Oriented Support  Provide interactive repair oriented follow-up to diagnostics  Identify KB content on which diagnosis result is based  Suggest repairs or repair strategies  Guide user through repair procedure  Examples  Class is a direct subclass of “THING” >Provide direct subclasses of THING as candidate superclasses >Step down through the class hierarchy  Class has redundant superclass links >Suggest removal of link(s) to most general classes  Type, cardinality, or bounds conflict >Suggest changing local conflicting constraint(s)  Missing information >Initiate acquisition dialogues for missing information

Knowledge Systems Laboratory, Stanford University9 Next Steps: Architectural Analysis  Summarize architectural features of a KB  Percentage of >Relations that are functions >Axioms that are propositional, first order, higher order >Axioms that are not horn clauses  Distribution of >Axioms by type (using the HPKB, RKF types) >Axiom lengths by number of literals >Functions by number of arguments >Relations by number of arguments >Direct subclasses per class >Direct subproperties per property >Restrictions per object >Property values per object

Knowledge Systems Laboratory, Stanford University10 Next Steps: Partitioning and Beyond  Integration of KB partitioning tools into Chimaera  Provide automatic KB partitioning to enhance usability  Automatic running of test cases E.g., queries and expected answers  Support regression testing of evolving KB  Provide result summaries from failed tests  Help with typographical errors  Spelling correction for undefined names E.g., classes, slots, relations, functions, constants  Spelling correction for anomalously occurring variables >Suggest is the same as another variable in the sentence

Knowledge Systems Laboratory, Stanford University11Summary  KSL is developing Chimaera to support KB evolution  Chimaera was integrated into the SHAKEN Y1 system Using OKBC(!)  Incrementally adding diagnostics E.g., “background” diagnostics that use sophisticated reasoning  Next steps  KB partitioning tools  Repair dialogues for SMEs  KB architectural analysis  Regression testing

Knowledge Systems Laboratory, Stanford University12 Role of Diagnostics in Systems  KE support  SME support  Increase productivity (“lightly trained”)  Step in managing KB development  Focus attention (e.g., redundant links)  Evaluation support  Diagnose KBs produced during evaluation  Batch mode  Foreground  Background  Changes in “patterns” in the KB between versions

Knowledge Systems Laboratory, Stanford University13 Sharing Diagnostics Information  Diagnostic specifications  Logical specifications  English specifications  Test cases  Diagnostic classifications  Learnings  Tricks of the trade  Sharing facilitators:  Working group  Mailing list  Findings data  Author, group, or team specific  Repair strategies  Alignments during collaborative development

Knowledge Systems Laboratory, Stanford University14 Developer Needs and Desires  Reasoner-specific diagnostics  Highly informative diagnostic results  Reporting architectural bias in a KB  Binary versus higher order relations  First order versus higher order axioms >Weakly versus strongly higher order  Disjunctions or conjunctions  Existential versus universal quantifiers  Frames to axioms ratios  Horn clauses  Axiom lengths  Functions  Confusion of existential and universal quantifiers  Type restrictions too general  Misspelling of variables

Knowledge Systems Laboratory, Stanford University15 Developer Needs and Desires  Domain-specific tests  Semantic tests  Maintainability measures  Recognizing typographical errors  Spell check undefined or unused terms  Redefining (e.g., breaking up) a predicate  Large scale modification techniques  Prioritizing diagnostics

Knowledge Systems Laboratory, Stanford University16 Integration Issues  Architecture  Use hosted services (like KSL)  Integrate special code  Take specifications from library  API  Interaction Mode - Batch versus Interactive/Repair  Translation issues  One major use of diagnostics is also in testing translators  Certain translations need to be done to do better analysis  Output integration

Knowledge Systems Laboratory, Stanford University17Evaluation  Record types and numbers of errors  Comparing KBs produced by SMEs versus KEs  Record use of repair strategies  Evaluate during testing  Feedback from SMEs about diagnostics

Knowledge Systems Laboratory, Stanford University18 Classification of Diagnostic Results  Errors  Logical inconsistencies  Content structure errors  (See Randy Davis thesis)  Anomalies  Missing information >Missing portions of descriptions  Redundancies  Extraneous structure or content  Summaries  Architectural biases  Suggestions  Stylistic suggestions  Static versus operational tests  Use of expertise about KR paradigms

Knowledge Systems Laboratory, Stanford University19 Diagnostic Issues/Goals  Role of Diagnostics in Systems  KE support, SME support  Evaluators of KBs  How to Share Diagnostics  Working Group?  Logical specification, English descriptions, tests, …  Know the Main Contributors  Possible Diagnostics  What do users want?  What can tool builders provide?  Integration Issues  Developer Needs/Desires  Evaluation

Knowledge Systems Laboratory, Stanford University20 The Role of KB Diagnostics  KE support  SME support  Increase productivity (“lightly trained”)  Mgmt of kb  Inference dependent quality improvement  Focus attention (ex. Redundant links)  Evaluation support  Abstract patterns – average fanout of specialization, statistics of number of uses of a predicate – big picture view  Version comparison  Regression testing

Knowledge Systems Laboratory, Stanford University21 Diagnostic Sharing  Diagnostic specifications  Logical specifications  English specifications  Test cases  Diagnostic classifications  Taxonomy of errors – bottlenecks,  Quantification  Alignments across systems – inconsistencies among smes  Repair strategies  How informative a system is (core dump vs. useful explanation)  Learnings  Tricks of the trade  Sharing Facilitators:  Working Group  Mailing list

Knowledge Systems Laboratory, Stanford University22 Sharing facilities  Working group  Mailing list  Posting of papers  Utilize Teknowledge

Knowledge Systems Laboratory, Stanford University23biases  Binary vs. higher arity  First order vs higher order  Weakly vs strongly higher order  Universal over existential  Disjunction vs. conjunction  Frame-ism  Horn clauses  Lisp style  Relations -> functions  Depth vs. breadth in hierarchy  …. Maybe report in summarizations..  At least document biases

Knowledge Systems Laboratory, Stanford University24Organizations/People  Cycorp – many special purpose - Kahlert  ISI – Why Not? – Chalupsky – KANAL – Gil - expect - Gil  Pragati – Clustering - Mehrotra  Stanford FRG/KSL – Partitioning – McCarthy, Amir, McIlraith  Stanford KSL – Chimaera - Fikes, McGuinness

Knowledge Systems Laboratory, Stanford University25Diagnostics  Errors – provable logical inconsistencies  Anomalies – redundancies, cycles,…  Summaries – word counts, …  Suggestions – naming conventions  Incompletenesses – explicit salient assertions or statistics  Stylistics - length of rule, … bad factoring, Randy davis – errors – incompleteness, inconsistent  Get this - Top ten list of things people do wrong in cyc - goolsbey Perspectives/units: Frame-like content vs. axioms vs. problem solving technology vs. learning to correct components

Knowledge Systems Laboratory, Stanford University26style  Static  Reasoner  Simulation / execution  Using examples  Summarization/improvements/critiquer

Knowledge Systems Laboratory, Stanford University27 Integration Issues  Architecuture  Use hosted services (like KSL)  Integrate special code  Take specifications from library  API  Interaction Mode – Batch vs. Interactive/Repair  Translation issues  one major use of diagnostics is also in testing translators  Certain translations need to be done to do better analysis  Background ontologies – meld starter ontology  Output integration

Knowledge Systems Laboratory, Stanford University28 Developer Needs/Desires Missing existentials Too high a type specification Variable name mismatch Semantic requests: Wrong semantic paradigm? Typos Spell check Large scale modification tools and their integration example removal/ fixing top level priotizing Diagnostics to minimize cost, ease maintenance

Knowledge Systems Laboratory, Stanford University29Evaluation  Record types of errors  Fine granularity  Kb differences across sme vs. ke developed ontologies across team  Record use of repair strategies…  Evaluate during testing…  Feedback from smes on features, usefulness, etc.  Attempt to keep extremely complete audit trails for future analysis  Important to be careful with diagnostic reporting

Knowledge Systems Laboratory, Stanford University30 Action Items  Working Group  Diagnostics repository  Web site  Follow up briefing  Mailing list

Knowledge Systems Laboratory, Stanford University31Chimaera  A Knowledge Evolution Environment  Tools for KB diagnosis and merging  Available as a Web service  www-ksl-svc.stanford.edu  Usable from a Web browser  Online user manual, tutorial, and demonstration movie  Provides user selectable set of diagnostic tests  Performs kb diagnostics in batch mode  Uploads and analyzes user’s KB  Accepts KBs in MELD, KIF, OKBC, DAML, RDF, XML, …  Provides results as HTML pages linked to frames and axioms  Analyzes both the structure and content of a KB  Uses hybrid reasoners to analyze content  Currently runs 28 diagnostic tests

Knowledge Systems Laboratory, Stanford University32Collection/Specification  Logical Specification of diagnostic  English Specification  Example kb that triggers diagnostic output

Knowledge Systems Laboratory, Stanford University33 Classification of Diagnostic Results II  Axiom Analysis  Axiom Syntax Problems E.g., no consequent to a implications  Axiom Redundancy E.g., 1. A =>B 2. A=>C 3. C =>B means 1 is redundant  Axiom Variable Usage E.g., Variable used in antecedent but not in consequent  Axiom Consistency E.g., A => not A  Axiom Tautology E.g., consequent repeats (portion of) antecedent