Bigscholar 2014, April 8, Seoul, South Korea1 Trust and Hybrid Reasoning for Ontological Knowledge Bases Hui Shi, Kurt Maly, and Steven Zeil Contact:

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Some Prolog Prolog is a logic programming language
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Chronos: A Tool for Handling Temporal Ontologies in Protégé
WIMS 2014, June 2-4Thessaloniki, Greece1 Optimized Backward Chaining Reasoning System for a Semantic Web Hui Shi, Kurt Maly, and Steven Zeil Contact:
Propositional Logic Reading: C , C Logic: Outline Propositional Logic Inference in Propositional Logic First-order logic.
Logic.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
WIMS 2011, Sogndal, Norway1 Comparison of Ontology Reasoning Systems Using Custom Rules Hui Shi, Kurt Maly, Steven Zeil, and Mohammad Zubair Contact:
Analyzing Minerva1 AUTORI: Antonello Ercoli Alessandro Pezzullo CORSO: Seminari di Ingegneria del SW DOCENTE: Prof. Giuseppe De Giacomo.
Ameet N Chitnis, Abir Qasem and Jeff Heflin 11 November 2007.
CPSC 322, Lecture 23Slide 1 Logic: TD as search, Datalog (variables) Computer Science cpsc322, Lecture 23 (Textbook Chpt 5.2 & some basic concepts from.
A Probabilistic Framework for Information Integration and Retrieval on the Semantic Web by Livia Predoiu, Heiner Stuckenschmidt Institute of Computer Science,
Logic in general Logics are formal languages for representing information such that conclusions can be drawn Syntax defines the sentences in the language.
CPSC 322, Lecture 23Slide 1 Logic: TD as search, Datalog (variables) Computer Science cpsc322, Lecture 23 (Textbook Chpt 5.2 & some basic concepts from.
Logical Agents Chapter 7. Why Do We Need Logic? Problem-solving agents were very inflexible: hard code every possible state. Search is almost always exponential.
Production Rules Rule-Based Systems. 2 Production Rules Specify what you should do or what you could conclude in different situations. Specify what you.
Logical Agents Chapter 7 Feb 26, Knowledge and Reasoning Knowledge of action outcome enables problem solving –a reflex agent can only find way from.
CPSC 322, Lecture 22Slide 1 Logic: Domain Modeling /Proofs + Top-Down Proofs Computer Science cpsc322, Lecture 22 (Textbook Chpt 5.2) March, 8, 2010.
Department of Computer Science, University of Maryland, College Park 1 Sharath Srinivas - CMSC 818Z, Spring 2007 Semantic Web and Knowledge Representation.
CPSC 433 Artificial Intelligence CPSC 433 : Artificial Intelligence Tutorials T01 & T02 Andrew “M” Kuipers note: please include.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
RDF Triple Stores Nipun Bhatia Department of Computer Science. Stanford University.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Empirical Explorations with The Logical Theory Machine: A Case Study in Heuristics by Allen Newell, J. C. Shaw, & H. A. Simon by Allen Newell, J. C. Shaw,
Database Support for Semantic Web Masoud Taghinezhad Omran Sharif University of Technology Computer Engineering Department Fall.
Ming Fang 6/12/2009. Outlines  Classical logics  Introduction to DL  Syntax of DL  Semantics of DL  KR in DL  Reasoning in DL  Applications.
CPSC 322, Lecture 22Slide 1 Logic: Domain Modeling /Proofs + Top-Down Proofs Computer Science cpsc322, Lecture 22 (Textbook Chpt 5.2) Oct, 26, 2010.
IDB, SNU Dong-Hyuk Im Efficient Computing Deltas between RDF Models using RDFS Entailment Rules (working title)
Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce Mohammad Farhan Husain, Pankil Doshi, Latifur Khan, Bhavani Thuraisingham University.
1 Inference Rules and Proofs (Z); Program Specification and Verification Inference Rules and Proofs (Z); Program Specification and Verification.
Logical Agents Logic Propositional Logic Summary
Comparison of BaseVISor, Jena and Jess Rule Engines Jakub Moskal, Northeastern University Chris Matheus, Vistology, Inc.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
Export experiments in Corese. October 10th Export experiments in Corese Olivier Corby October 10th, 2005 Interoperability Working Days October 10th-11th,
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Logical Agents Chapter 7. Knowledge bases Knowledge base (KB): set of sentences in a formal language Inference: deriving new sentences from the KB. E.g.:
Automated Reasoning Early AI explored how to automated several reasoning tasks – these were solved by what we might call weak problem solving methods as.
Logical Agents Chapter 7. Outline Knowledge-based agents Logic in general Propositional (Boolean) logic Equivalence, validity, satisfiability.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
From Hoare Logic to Matching Logic Reachability Grigore Rosu and Andrei Stefanescu University of Illinois, USA.
© Copyright 2008 STI INNSBRUCK Intelligent Systems Propositional Logic.
Of 38 lecture 6: rdf – axiomatic semantics and query.
DEDUCTION PRINCIPLES AND STRATEGIES FOR SEMANTIC WEB Chain resolution and its fuzzyfication Dr. Hashim Habiballa University of Ostrava.
Inference in First Order Logic. Outline Reducing first order inference to propositional inference Unification Generalized Modus Ponens Forward and backward.
1 Propositional Logic Limits The expressive power of propositional logic is limited. The assumption is that everything can be expressed by simple facts.
Logical Agents Chapter 7. Outline Knowledge-based agents Propositional (Boolean) logic Equivalence, validity, satisfiability Inference rules and theorem.
Expert System Seyed Hashem Davarpanah University of Science and Culture.
1 Instance Store Database Support for Reasoning over Individuals S Bechhofer, I Horrocks, D Turi. Instance Store - Database Support for Reasoning over.
Some Thoughts to Consider 5 Take a look at some of the sophisticated toys being offered in stores, in catalogs, or in Sunday newspaper ads. Which ones.
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Service Computation 2013, Valencia, Spain1 Query Optimization in Cooperation with an Ontological Reasoning Service Hui Shi, Kurt Maly, and Steven Zeil.
Logical Agents. Outline Knowledge-based agents Logic in general - models and entailment Propositional (Boolean) logic Equivalence, validity, satisfiability.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Rule-based Reasoning in Semantic Text Analysis
EA C461 Artificial Intelligence
EA C461 – Artificial Intelligence Logical Agent
Using Rules with Ontologies in the Semantic Web
ece 720 intelligent web: ontology and beyond
Logical Agents Chapter 7.
Artificial Intelligence
Logical Agents Chapter 7.
CPSC 433 : Artificial Intelligence Tutorials T01 & T02
Logic: Domain Modeling /Proofs + Computer Science cpsc322, Lecture 22
Knowledge Representation I (Propositional Logic)
Deniz Beser A Fundamental Tradeoff in Knowledge Representation and Reasoning Hector J. Levesque and Ronald J. Brachman.
ONTOMERGE Ontology translations by merging ontologies Paper: Ontology Translation on the Semantic Web by Dejing Dou, Drew McDermott and Peishen Qi 2003.
Representations & Reasoning Systems (RRS) (2.2)
Presentation transcript:

Bigscholar 2014, April 8, Seoul, South Korea1 Trust and Hybrid Reasoning for Ontological Knowledge Bases Hui Shi, Kurt Maly, and Steven Zeil Contact:

2 Outline Problem –Semantic web subject to changes –How to scale a reasoner to big data? Background –Knowledge base using ontologies –Inference strategies –Benchmarks –Optimized backward chaining Hybrid Reasoner –Materialization with search and retrieval –Change and marking trusted/untrusted areas –Reason over untrusted goals Conservative trust assessment –Property based trust –Pattern based trust Evaluation Conclusions Bigscholar 2014, April 8, Seoul, South Korea

3 Problem Efficiency of reasoning in the face of large scale and frequent changes within a question/answer system over a semantic web Issue –Forward chaining scales well for fixed knowledge bases –Backward chaining can handle changes in knowledge base but does not scale Bigscholar 2014, April 8, Seoul, South Korea

Background Existing semantic application: question/answer systems –Libra, Cimple, Arnetminer Semantic Web –Resource Description Framework(RDF) –Web Ontology Language (OWL) for specific knowledge domains –SPARQL query language for RDF –SWRL rule language Reasoning systems –Jena proprietary Jena rules –Pellet and KANON –ORACLE 11g –OWLIM Bigscholar 2014, April 8, Seoul, South Korea 4

5 Background Knowledge base (KB) –Ontologies –Representation formalism: Description Logic (DL) Inference methods for First Order Logic –Materialization and forward chaining pre-computes inferred truths and starts with the known data suitable for frequent computation of answers with data that are relatively static Owlim and Oracle –Query-rewriting and backward chaining expands the queries and starts with goals suitable for efficient computation of answers with data that are dynamic and infrequent queries Virtuoso Bigscholar 2014, April 8, Seoul, South Korea

Background Benchmarks evaluate and compare the performances of different reasoning systems –The Lehigh University Benchmark (LUBM) –The University Ontology Benchmark (UOBM) 6Bigscholar 2014, April 8, Seoul, South Korea

Background Optimized backward-chaining algorithm –generate a query response for a given query pattern based on a specific rule set (RDFS, Horst, custom) Ordered Selection Function Switching between Binding Propagation and Free Variable Resolution Avoid Repetition and Non-Termination (OLDT) owl:sameAs Optimization Bigscholar 2014, April 8, Seoul, South Korea 7

Hybrid reasoner Motivation example Assume fully materialized KB Harvester adds new fact: student0 enrolled course0 Query ‘Who is enrolled in course 0?’ ok Assume fact Porf0 teaches course0 in KB Query “Who is being taught by Prof0?” not ok as simple lookup; needs reasoning with rule such as: enrolledIn(?Student,?Course?), teaches(?Faculty,?Course) :- isTaughtBy(?Student,?faculty) Bigscholar 2014, April 8, Seoul, South Korea 8

Hybrid reasoner Mark region of KB ‘trusted’ that is not affected by change Hybrid algorithm: –If a goal is in trusted region then return substitutions from KB –else for each rule R and substitution σ1 such that the head of R σ1 matches goal proveTheRuleBody (R.body, σ1) proveTheRuleBody: prove each goal in the rule body one by one recursively 9Bigscholar 2014, April 8, Seoul, South Korea

Trustworthy Goals proof goal p(?X,?Y) is trustworthy if all instances of that goal derivable from facts and rules in the knowledge base are present in that knowledge base as instances –In practice, we will need to approximate set of trustworthy goals A partition into trusted and untrusted sets is called conservative if no untrustworthy goals are trusted 10Bigscholar 2014, April 8, Seoul, South Korea

Approximation 1: Trusted Properties property-based trust: –assume that any property P that was involved in a change is itself untrusted –take the closure of the “is used as a premise of” relation, P occurs in the body of a rule used to prove R …, P(x,y), … :- R(w,z) –then R is also untrusted. Bigscholar 2014, April 8, Seoul, South Korea 11

Approximation 1: Trusted Properties property-based trust breaks down in the face of “meta- rules” in the knowledge base, rules that permit reasoning about properties themselves, e.g., inverse rule special handling of the meta-rules common to RDF and OWL result in significant fractions of the knowledge base being marked as untrusted unnecessarily 12Bigscholar 2014, April 8, Seoul, South Korea

Approximation 2: Trusted Patterns –Pattern-based trust: a pattern P(X,Y) (where X and Y could be ground instances or free variables) is untrusted if it matches a change to the knowledge base or if it can be derived from a rule with an untrusted pattern as a premise –Offers finer discrimination than property-based Bigscholar 2014, April 8, Seoul, South Korea 13

Computing Untrusted Pattern Marking algorithm Add change Check each rule in the rule set to see if we can propagate the “untrust” forward by a limited, specialized analogue of forward chaining Add untrusted set produced from the above one change to the existing untrusted set, discarding any patterns that are specializations of other elements 14Bigscholar 2014, April 8, Seoul, South Korea

Untrusted Pattern Example –Harvester adds ‘worksFor(Fullprofessor0, University0)’ –Marking algorithm discovers as untrusted worksFor (Fullprofessor0, University0) member (University0, Fullprofessor0) memberOf (Fullprofessor0, University0) –Query: “Who are members of University0?” needs reasoning –Query: “Who are members of University1” ok for direct retrieval as memberOf(?x, University1) is trusted Bigscholar 2014, April 8, Seoul, South Korea 15

Evaluation: property-based 16 Bigscholar 2014, April 8, Seoul, South Korea Changes Actual # new properties Actual # new facts # untrusted properties Adding a new class2312 Add a subclass relationship between two new classes 2612 Add new Class as subClass of existing class 2512 Adding a new Property2212 Add a new Property as subPropertyOf of another new Property 2412 Add new Property as subPropertyOf of existing Property 2312 Add new Class as domain to a new Property 3513 Add new Class as range to a new Property 3513

Evaluation: pattern-based Produces the same number of properties as the ‘actual’ columns show comparison of performance of our hybrid pattern-based proof algorithm against our regular, optimized backward chaining algorithm and against the OWLIM using LUBM1, LUBM10, and LUBM40, of size 100,839, 1,272,871, and 5,307,754 objects Query response time (ms) after adding student 17Bigscholar 2014, April 8, Seoul, South Korea

Evaluation Query response time (ms) after adding undergraduate student percentage of untrusted facts in KB ranges 0 to a high of 10% percentage of untrusted patterns in KB ranges 0 to a high of 5% 18Bigscholar 2014, April 8, Seoul, South Korea

19 Conclusions We reported on our efforts to use ‘ trust’ in backward- chaining reasoners to accommodate the changing knowledge base. We have shown that a pattern-based marking algorithm errs on the conservative side at an acceptable level and We show that compared to a forward chaining algorithm and a pure backward chaining algorithm that our hybrid algorithm is better in almost all cases tested Bigscholar 2014, April 8, Seoul, South Korea

Future Work Explore the performance of the trust marking algorithm and of the hybrid reasoner as a function of the fraction of the knowledge base that is untrusted Explore the impact of long sequences of individual changes on the marking algorithm time and subsequently on the hybrid reasoner Explore performance of the hybrid reasoner as a function of the overall degree of inter-connection within the knowledge base semantics as a loosely connected network will lead to faster termination of the trust marking algorithm 20Bigscholar 2014, April 8, Seoul, South Korea