Explanation: The Next Phase in Question Answering
Deborah L. McGuinness
Knowledge Systems Laboratory
Stanford University

Outline
- Motivation: Question Answering Systems Need to Provide Justifiable Answers
- Explanation is a Necessary Component for Trust
- Explanation requirements as gathered from DARPA, ARDA, academic, and commercial needs
- Inference Web introduction: An Explanation Infrastructure for the Semantic Web (work with Pinheiro da Silva)
  - Registry
  - Portable Proofs for Interoperability
  - Explainer
  - Browser
- Conclusion

Motivation - TRUST
If users (humans and agents) are to use and integrate system answers, they must trust them. System transparency supports understanding and trust. Even simple “lookup” systems should be able to provide information about their sources. As Question Answering systems become more complex, they may incorporate multiple hybrid information sources, multiple information manipulation techniques, integration of reasoners, conflict resolution strategies, prioritization, assumptions, etc., all of which may need explanation. Thus, systems should be able to explain their actions, sources, and beliefs.

Requirements: Knowledge Provenance
- Source name (CIA World Fact Book)
- Author of original information
- Date of original information and any updates
- Authoritativeness of source (is this considered reliable or certified reliable by some third party?)
- Degree of belief
- Degree of completeness (can the closed world assumption be made for inference?)
- Term or phrase meaning (in natural language or formal language)
- Term inter-relationships (ontological relations including subclass, superclass, part-of, etc.)
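One way to picture these provenance requirements is as a small record attached to each source. The sketch below is only illustrative: the class name, field names, and the example dates are assumptions and are not part of the Inference Web specification.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List, Optional


@dataclass
class ProvenanceRecord:
    """Hypothetical container for the provenance items listed on the slide."""
    source_name: str
    author: Optional[str] = None
    original_date: Optional[date] = None
    updates: List[date] = field(default_factory=list)
    authoritative: Optional[bool] = None       # None means "not yet assessed"
    degree_of_belief: Optional[float] = None   # e.g. a value in [0, 1]
    complete: bool = False                     # may the closed world assumption be made?

    def last_modified(self) -> Optional[date]:
        """Return the most recent known date (original or update), if any."""
        known = [d for d in [self.original_date, *self.updates] if d is not None]
        return max(known) if known else None


# Example with made-up dates, for illustration only.
record = ProvenanceRecord(
    source_name="CIA World Fact Book",
    original_date=date(2002, 1, 1),
    updates=[date(2003, 6, 1)],
)
```

A reasoner could consult `complete` before applying closed-world inference to facts from this source.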

Requirements: Reasoning Information
- Reasoner used, authors, version #, etc.
- Reasoning method (tableaux, model elimination, etc.)
- Inference rules supported by reasoner
- Reasoner soundness and completeness
- Reasoner assumptions (closed world, open, unique names, etc.)
- Detailed trace of inference rules applied (with appropriate variable bindings to provide conclusion)
- Term coherence
- Assumptions used in derivation
- Source consistency (is there support for A and not A?)
- Support for alternative reasoning paths to a single conclusion
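The source consistency item (support for both A and not A) can be sketched as a simple check over sourced assertions. The tuple layout and the function name below are assumptions made for illustration; they are not drawn from the Inference Web tools.

```python
from collections import defaultdict


def find_conflicts(assertions):
    """Return statements that have both positive and negative support.

    assertions: iterable of (source, statement, is_positive) tuples.
    Result maps each conflicting statement to
    (sources asserting it, sources denying it).
    """
    support = defaultdict(lambda: {True: set(), False: set()})
    for source, statement, is_positive in assertions:
        support[statement][is_positive].add(source)
    return {
        statement: (sorted(sides[True]), sorted(sides[False]))
        for statement, sides in support.items()
        if sides[True] and sides[False]
    }


conflicts = find_conflicts([
    ("S1", "A", True),    # source S1 supports A
    ("S2", "A", False),   # source S2 supports not A
    ("S1", "B", True),    # B is uncontested
])
```

Here `conflicts` reports only statement `A`, together with the sources on each side, which is the kind of information an explanation would need to surface.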

Requirements: Presentation
- Presentation needs to be manageable (thus stand-alone fragments are required)
- Fragments need to be stand-alone
- Proofs need to be pruned
- Support for proof and explanation navigation
- Web browser compatibility
- Follow-up question support
- Alternative justifications should be available

Requirements: Distribution and Interoperability
- Explanations must work in heterogeneous environments
- Must interoperate on the web
- Representations must be portable, shareable, and combinable
- Proof interlingua required
- Proof/explanation presentation: presentation should have manageable (small) portions that are meaningful alone (without the context of an entire proof); users should be supported in asking for explanations and follow-up questions; users should get automatic and customized proof pruning, a web browsing option, multiple formats, customization, etc.

Requirements: Explanation Generation
- Provide abbreviated description of information manipulation path
- Provide machine and user understandable descriptions (may require use of a formal language such as DAML+OIL, OWL, RDF)
- Machine understandable representation of information manipulations (axioms such as the FOL Semantics for DAML+OIL (Fikes & McGuinness))
- Description of rewrite rules for abstraction

Inference Web
Framework for explaining reasoning tasks by storing, exchanging, combining, annotating, filtering, segmenting, comparing, and rendering proofs and proof fragments provided by reasoners.
- DAML+OIL/OWL specification of proofs is an interlingua for proof interchange
- Proof browser for displaying IW proofs and their explanations (possibly from multiple inference engines)
- Registration for inference engines/rules/languages
- Proof explainer for abstracting proofs into more understandable formats
- Proof generation service to facilitate the creation of IW proofs by inference engines
- Prototype implementation with Stanford’s JTP reasoner and SRI’s SNARK reasoner
- Integrated with DQL and JTP in a few web agents for demonstrations
- Discussions with Boeing, Cycorp, Fetch, ISI, Northwestern, SRI, UT, UW, W3C, …
Collaborative work with Pinheiro da Silva

IW Registry and Registrar
- IW Registry has meta-data useful for disclosing data provenance and reasoning information, such as descriptions of
  - Inference engines along with their supported inference rules
  - Information sources such as organizations, publications and ontologies
  - Languages along with their axioms
- The Registry is managed by the IW Registrar

Inference Engine Registration (1)
- An entry for SRI’s SNARK engine
- An entry for SNARK’s Binary Resolution inference rule
- Engine registration involves the creation of an engine entry and its association with entries of inference rules
- Rule entries can be either reused or added to the registry

Inference Engine Registration (2)
- Otter’s binary resolution, hyper-resolution and paramodulation rules were reused for the registration of SNARK
- Assumption and negated conclusion rules were added for SNARK
[Figure labels: rule reuse, rule addition]

Inference Engine Registration (3)
Summarizing the Inference Engine Registration process:
- Use the registry to include meta-information about the engine and its rules
  - Add an entry for the new inference engine
  - Identify the core inference rules supported by the engine
  - Add unregistered core inference rules, if any
  - Associate the core rules with the inference engine
- Prepare the engine to dump proofs in the IW format
  - Implement a routine for calling the proof generator service
    - Example routines in Java and Lisp can be provided
  - Publish successful results of the proof generator services in portable proof format (OWL/DAML/RDF/XML compliant files)
- Browse your proofs in the IW Browser
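The registration steps above can be sketched with a toy in-memory registry. This is a minimal illustration under stated assumptions: the `Registry` class and its method are hypothetical and do not reflect the actual IW Registrar interface. The rule names follow the SNARK registration described on the previous slides.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, Set, Tuple


@dataclass
class Registry:
    """Toy stand-in for the IW Registry: known rules and engine entries."""
    rules: Set[str] = field(default_factory=set)
    engines: Dict[str, Set[str]] = field(default_factory=dict)

    def register_engine(self, engine: str,
                        core_rules: Iterable[str]) -> Tuple[Set[str], Set[str]]:
        """Add an engine entry and associate its core rules with it.

        Returns (reused, added): rules already registered versus rules
        newly added to the registry.
        """
        core = set(core_rules)
        reused = core & self.rules      # rule entries that can be reused
        added = core - self.rules       # unregistered rules to add
        self.rules |= added
        self.engines[engine] = core
        return reused, added


# Rules assumed to be registered already (from Otter, per the slides).
registry = Registry(rules={"binary resolution", "hyper-resolution", "paramodulation"})
reused, added = registry.register_engine(
    "SNARK",
    ["binary resolution", "hyper-resolution", "paramodulation",
     "assumption", "negated conclusion"],
)
```

After this call, `reused` holds the three Otter rules and `added` holds the assumption and negated conclusion rules.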

Inference Web Architecture
[Figure: architecture diagram. Components: IW browsers, registrars, registry entries, proof fragments, non-IW documents, inference engines, World Wide Web. Caption (legend): web agent, reasoner agent, web document, URL reference, agent dependency, document maintenance.]

Generation of IW proofs
[Figure: reasoner, proof generator service, registry, registrar, proof fragments, WWW]
(1) Send node information: reasoner ID, labeling sentence in KIF, rule ID, antecedent URIs, bindings, and source ID
(2) Verify information
(3) Return proof fragments
(4) Publish proof fragments
(can collect statistics, provide feedback, …)
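Steps (1) to (3) might look roughly like the sketch below. It is not the actual proof generator service: the `NodeInfo` class, the `generate_fragment` function, the dictionary output (standing in for a DAML+OIL proof fragment), and the example URIs are all assumptions for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Set


@dataclass
class NodeInfo:
    """Step (1): the node information a reasoner sends."""
    reasoner_id: str
    sentence_kif: str
    rule_id: str
    antecedent_uris: List[str]
    bindings: Dict[str, str]
    source_id: str


def generate_fragment(node: NodeInfo, registry: Dict[str, Set[str]]) -> dict:
    """Steps (2) and (3): verify against the registry, then return a fragment.

    registry maps each registered reasoner ID to its registered rule IDs.
    """
    if node.reasoner_id not in registry:
        raise ValueError(f"unregistered reasoner: {node.reasoner_id}")
    if node.rule_id not in registry[node.reasoner_id]:
        raise ValueError(f"rule {node.rule_id} not registered for {node.reasoner_id}")
    return {
        "conclusion": node.sentence_kif,
        "inferenceEngine": node.reasoner_id,
        "inferenceRule": node.rule_id,
        "antecedents": list(node.antecedent_uris),
        "bindings": dict(node.bindings),
        "source": node.source_id,
    }


registry = {"SNARK": {"binary resolution", "negated conclusion"}}
node = NodeInfo(
    reasoner_id="SNARK",
    sentence_kif="(nuke)",
    rule_id="binary resolution",
    antecedent_uris=["http://example.org/proofs/n1", "http://example.org/proofs/n2"],
    bindings={},
    source_id="http://example.org/kb/threats",
)
fragment = generate_fragment(node, registry)
```

Step (4), publishing, would then amount to the reasoner serializing `fragment` to a web-accessible location, so that other fragments can reference it by URI.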

Portable Proofs
- Proof Interlingua
- Written in DAML+OIL (soon to be OWL)
- Question Answering systems dump proofs in this format

Proofs and Explanations
- Proofs can be displayed using the browser
- Rewriting rules may be used to abstract proofs into more manageable explanations
- Rewriting rules may leverage information about language axioms such as the DAML+OIL axiom set
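As a rough illustration of abstracting a proof into a shorter explanation, the sketch below hides intermediate steps produced by selected rules and lifts their premises to the parent step. This is a simplified structural abstraction chosen for illustration, not the Inference Web rewriting rules themselves; the `ProofNode` class, the `abstract` function, and the rule names are assumptions.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class ProofNode:
    conclusion: str
    rule: str
    premises: List["ProofNode"] = field(default_factory=list)


def abstract(node: ProofNode, hidden_rules: Set[str]) -> ProofNode:
    """Return a copy of the proof with hidden-rule steps collapsed.

    A premise derived by a hidden rule is replaced by its own premises.
    Leaves and the root are always kept.
    """
    lifted: List[ProofNode] = []
    for premise in node.premises:
        simplified = abstract(premise, hidden_rules)
        if simplified.rule in hidden_rules and simplified.premises:
            lifted.extend(simplified.premises)
        else:
            lifted.append(simplified)
    return ProofNode(node.conclusion, node.rule, lifted)


proof = ProofNode("C", "modus ponens", [
    ProofNode("B", "subclass transitivity", [
        ProofNode("A1", "told"),
        ProofNode("A2", "told"),
    ]),
    ProofNode("D", "told"),
])
explanation = abstract(proof, {"subclass transitivity"})
premise_names = [p.conclusion for p in explanation.premises]
```

In the abstracted proof, `C` is shown as following directly from `A1`, `A2`, and `D`, with the transitivity step hidden.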

Wine Agent Example

Conclusion
- Proof specification ready for feedback/use
- Proof browser prototype operational and expanding
  - Recent: ground axiom collection, source doc/ontology collection, aggregation view
  - Current: multiple formats, simplification, pruning, …
- Registration service expansion: integration with XML database, use in PAL, registration of services (with Fetch)
- Inference engine integration work: JTP functional, SNARK mostly done, KM under investigation
- Integration with web services: current: KSL Wine Agent, KSL DQL client (NIMD implementation); beginning with registration of web services (Fetch)
- Documentation: more examples, etc. More comments solicited (thanks to date to some for comments, including Berners-Lee, Chalupsky, Chaudhri, Clark, Connolly, Forbus, Hawke, Hayes, Lenat, Murray, Porter, Reed, Waldinger, …)

Technical Infrastructure Requirements
- Provenance information: explain where source information came from (source name, date and author of last update, author(s) of original information, trustworthiness rating, etc.)
- Reasoning information: explain where derived information came from (the reasoner used, reasoning method, inference rules, assumptions, etc.)
- Explanation generation: provide abbreviated descriptions of the proof; may include reliance on a description of the representation language (e.g., DAML+OIL, OWL, RDF, …), axioms capturing the semantics, rewriting rules based on axioms, other abstraction techniques, etc.
- Distributed web-based deployment of proofs: build proofs that are portable, sharable, and combinable and that may be published on multiple clients; registry is web available and potentially distributed, …
- Proof/explanation presentation: presentation should have manageable (small) portions that are meaningful alone (without the context of an entire proof); users should be supported in asking for explanations and follow-up questions; users should get automatic and customized proof pruning, a web browsing option, multiple formats, customization, etc.

Architecture

Integration with SNARK
- Done by non-SNARK author to test strategies for integration
- Tests alternative reasoning strategy: proof by contradiction
- No special modifications made as a test of leverage
- Learned some new requirements (CNF processing, reasoning modes may be useful, …)
- Initial integration fairly easy
- More complete integration in process

SNARK Example: nuclear threats
(1) ore ∧ refiner → material
(2) black-mkt → material
(3) black-mkt ∨ ore
(4) ¬black-mkt ∨ ¬ore
(5) material ∧ detonator ∧ casing → warhead
(6) material ← warhead
(7) detonator ← warhead
(8) casing ← warhead
(9) warhead ∧ missile → nuke
(10) warhead ∧ truck → nuke
(11) missile ∨ truck
“Weapons-grade nuclear material may be derived from uranium ore if refining technology is available, or it may be acquired from a black market source. Foobarstan is known to have either uranium ore or a black market source, but not both. Foobarstan will build a nuclear warhead if and only if it can obtain nuclear material, a detonator, and the bomb casing. A warhead and a missile, or a warhead and a truck, constitute a nuclear threat. Foobarstan has either a missile or a truck.”
QUESTION: Is Foobarstan a nuclear threat?
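The refutation idea behind this example (add the negated conclusion and look for a contradiction) can be sketched in a few lines. Two assumptions are made here. First, the eleven axioms are read as the propositional clauses suggested by the English description. Second, a brute-force satisfiability check over all truth assignments stands in for SNARK's resolution-based search. The extra facts about a refiner, detonator, and casing are hypothetical and are added only to show the effect of additional knowledge.

```python
from itertools import product

ATOMS = ["ore", "refiner", "material", "black_mkt", "detonator",
         "casing", "warhead", "missile", "truck", "nuke"]

# Each clause is a disjunction of (atom, polarity) literals.
KB = [
    [("ore", False), ("refiner", False), ("material", True)],      # (1)
    [("black_mkt", False), ("material", True)],                    # (2)
    [("black_mkt", True), ("ore", True)],                          # (3)
    [("black_mkt", False), ("ore", False)],                        # (4)
    [("material", False), ("detonator", False),
     ("casing", False), ("warhead", True)],                        # (5)
    [("warhead", False), ("material", True)],                      # (6)
    [("warhead", False), ("detonator", True)],                     # (7)
    [("warhead", False), ("casing", True)],                        # (8)
    [("warhead", False), ("missile", False), ("nuke", True)],      # (9)
    [("warhead", False), ("truck", False), ("nuke", True)],        # (10)
    [("missile", True), ("truck", True)],                          # (11)
]

# Hypothetical additional facts, not taken from the slide.
EXTRA_FACTS = [[("refiner", True)], [("detonator", True)], [("casing", True)]]


def satisfiable(clauses):
    """Brute-force check: does any truth assignment satisfy every clause?"""
    for values in product([False, True], repeat=len(ATOMS)):
        model = dict(zip(ATOMS, values))
        if all(any(model[a] == pol for a, pol in clause) for clause in clauses):
            return True
    return False


def entails(kb, atom):
    """Proof by contradiction: kb entails atom iff kb plus (not atom) is unsatisfiable."""
    return not satisfiable(kb + [[(atom, False)]])


from_axioms_alone = entails(KB, "nuke")
with_extra_facts = entails(KB + EXTRA_FACTS, "nuke")
```

Under this reading, the eleven clauses alone do not entail `nuke`, because nothing establishes a refiner, detonator, or casing. Once those hypothetical facts are added, the negated conclusion leads to a contradiction.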

Example: proof by contradiction

Example: a proof tree

An example in FOL

Registering SNARK: next steps
- Add support for ‘source’ and ‘author’ fields
  - Match with IW-registered ontologies where possible
- Standardize treatment of SNARK rewrites
  - When do rewrites correspond to resolution, hyperresolution, paramodulation?
  - Utilize SNARK rewrites for IW abstraction strategies
  - Consider tableaux approaches for explanation
- Implement correct handling of SNARK procedural attachments
  - SNARK includes procedural attachments for math, lists
  - User can define new procedural attachments on the fly
  - This constitutes an inference rule with an open-ended definition
- Track variable bindings through course of proof
- Integrate IW interface into SNARK standard release

Extra

Proof browsing: an example (1)
- Tools can be used for browsing IW proofs. The following example demonstrates the use of the IW Browser to visualize, navigate and ask follow-up questions.
- Let’s assume a Wines ontology:
- Determination of the type of a concept or instance is a typical problem on the Semantic Web. A reasoner may ask about the type of an object, or may ask whether an object is of a particular type.
Example Query : (rdf:type TonysSoftShell ?X)
Example DAML KB : <rdf:RDF xmlns:rdf =“ xmlns:rdfs="
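The two kinds of type question can be sketched over a tiny set of triples. The knowledge base on the slide is truncated, so the class names below are placeholders rather than the actual Wines ontology; only `TonysSoftShell`, `rdf:type`, and the query shape come from the slide. The function names are also hypothetical.

```python
RDF_TYPE = "rdf:type"
SUBCLASS_OF = "rdfs:subClassOf"

# Placeholder triples; the class names are invented for illustration.
KB = {
    ("TonysSoftShell", RDF_TYPE, "ExampleDishClass"),
    ("ExampleDishClass", SUBCLASS_OF, "ExampleParentClass"),
    ("ExampleParentClass", SUBCLASS_OF, "ExampleTopClass"),
}


def types_of(instance, kb):
    """Answer (rdf:type instance ?X): direct types plus all their superclasses."""
    found = set()
    frontier = [o for s, p, o in kb if s == instance and p == RDF_TYPE]
    while frontier:
        cls = frontier.pop()
        if cls in found:
            continue
        found.add(cls)
        # An instance of a class is also an instance of each superclass.
        frontier.extend(o for s, p, o in kb if s == cls and p == SUBCLASS_OF)
    return found


def is_of_type(instance, cls, kb):
    """Answer the yes/no form: is the instance of the given type?"""
    return cls in types_of(instance, kb)


answers = types_of("TonysSoftShell", KB)
```

Each answer beyond the direct type is justified by a chain of subclass statements, which is the kind of derivation a proof browser would let the user navigate.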

Proof browsing: an example (2)
- Browsers can display portions of proofs.
- By selecting premises, users can navigate throughout proof trees.

Trust Disclosure
- IW proofs can be used:
  - to provide provenance for “lookup” information
  - to display (distributed) deduction justifications
  - to display inference rule static information

Technical Requirements
- Annotate information with meta information such as source, date, author, … at appropriate granularity level (per KB, per term, …)
- Explain where source information is from
- Explain where derived information came from
- Prune information and explanations for presentation (utilizing user context and information context for presentation)
- Provide a query language capable of expressing user requests along with filtering restrictions
- Provide a ubiquitous source annotation language
- Provide a ubiquitous proof language for interchange
- Compare answers
- Propagate meta information appropriately (if I got something from a source I consider trusted and you consider me a trusted source, you may want to consider my source trusted as well)
- Identify multiple (or unknown) truth values
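The trust propagation item can be illustrated as a reachability computation over "trusts" links. This is only one possible policy, with full transitivity and no decay or weighting. The slide says an agent "may want to" adopt such trust, so a real system would likely make this configurable. The function name and data layout are assumptions.

```python
def trusted_sources(agent, trusts):
    """Return every source the agent trusts directly or through trusted sources.

    trusts: dict mapping an agent or source name to the set of names it
    directly trusts.
    """
    seen = set()
    stack = list(trusts.get(agent, ()))
    while stack:
        source = stack.pop()
        if source in seen or source == agent:
            continue
        seen.add(source)
        # Propagate: whatever a trusted source trusts is considered trusted too.
        stack.extend(trusts.get(source, ()))
    return seen


trusts = {
    "you": {"me"},                      # you consider me a trusted source
    "me": {"CIA World Fact Book"},      # I consider this source trusted
}
your_trusted = trusted_sources("you", trusts)
```

Here `your_trusted` contains both `"me"` and `"CIA World Fact Book"`, mirroring the example in the last-but-one bullet.
<test>
assert your_trusted == {"me", "CIA World Fact Book"}
assert trusted_sources("me", trusts) == {"CIA World Fact Book"}
assert trusted_sources("stranger", trusts) == set()
assert trusted_sources("a", {"a": {"b"}, "b": {"a"}}) == {"b"}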