An Optimization Technique for RDFS Inference using the Application Order of RDFS Entailment Rules Kisung Kim, Taewhi Lee 2005. 7. 11.

Slides:



Advertisements
Similar presentations
RDF Schemata (with apologies to the W3C, the plural is not ‘schemas’) CSCI 7818 – Web Technologies 14 November 2001 Van Lepthien.
Advertisements

Shelley Powers, O’Reilly SNU IDB Lab. Hyewon Kim
RDF Schema – Syntax and Intuition
RDF Databases By: Chris Halaschek. Outline Motivation / Requirements Storage Issues Sesame General Introduction Architecture Scalability RQL Introduction.
Dr. Jim Bowring Computer Science Department College of Charleston CSIS 690 (633) May Evening 2009 Semantic Web Principles and Practice Class 5: 27 May.
Knowledge Technologies Manolis Koubarakis 1 An Introduction to RDF Schema.
More RDF CS 431 – Carl Lagoze – Cornell University Acknowledgements: Eric Miller Dieter Fensel.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
RDF, RDFS making the Web a better place Ir. Richard Vdovjak, MTD.
Cornell CS 502 Resource Description Framework Building the Semantic Web CS 502 – Carl Lagoze – Cornell University Acknowledgements: Eric Miller.
VLDB 2005 An Efficient SQL-based RDF Querying Scheme Eugene Inseok Chong Souripriya Das George Eadon Jagannathan Srinivasan New England Development Center.
Nancy Ide Vassar College USA Resource Definition Framework A Tutorial EUROLAN 2003 July 28 - August 8 Bucharest - Romania.
TAPP-09 23/02/2009Giorgos Flouris1 On Explicit Provenance Management in RDF/S Graphs Institute of Computer Science Foundation for Research and Technology.
RDF Semantics by Patrick Hayes W3C Recommendation Presented by Jie Bao RPI Sept 4, 2008 Part 1 of RDF/OWL Semantics Tutorial.
Chapter 3 RDF Schema.
-By Mohamed Ershad Junaid UTD ID :
Reducing Search Space Scheme using RDF-Schema Domain and Range Information for Efficient RDF Query Processing Sungtae Kim SNU OOPSLA Lab. December 3, 2004.
1 CENTRIA, Dept. Informática da Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Caparica, Portugal. 2 Institute of Computer Science,
Logics for Data and Knowledge Representation
Database Support for Semantic Web Masoud Taghinezhad Omran Sharif University of Technology Computer Engineering Department Fall.
SQL Databases are a Moving Target Juan F. Sequeda – Syed Hamid Tirmizi –
IDB, SNU Dong-Hyuk Im Efficient Computing Deltas between RDF Models using RDFS Entailment Rules (working title)
© Copyright 2008 STI INNSBRUCK RDF and RDF Schema Semantic Web Lecture Lecture III – xx 2009 Dieter Fensel Slides.
Chapter 9. 9 RDFS (RDF Schema) RDFS Part of the Ontological Primitive layer Adds features to RDF Provides standard vocabulary for describing concepts.
Chapter 3 RDF Schema. Introduction RDF has a very simple data model RDF Schema (RDFS) enriches the data model, adding vocabulary and associated semantics.
Part I: Set Constructs. RDF Schema (RDFS) RDF does not provide mechanisms to define domain classes and properties RDFS is a vocabulary that provides many.
Chapter 3 RDF and RDFS Semantics. Introduction RDF has a very simple data model But it is quite liberal in what you can say Semantics can be given using.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
RDF & RDF Schema Machine Understandable Metadata for the Web Semantic Web - Spring 2006 Computer Engineering Department Sharif University of Technology.
Part II. Property domain and range restriction RDFS allows us to put restriction on the properties by specifying their domain and range, providing the.
1 Artificial Intelligence Applications Institute Centre for Intelligent Systems and their Applications Stuart Aitken Artificial Intelligence Applications.
RDF languages and storages part 1 - expressivness Maciej Janik Conrad Ibanez CSCI 8350, Fall 2004.
Of 35 lecture 5: rdf schema. of 35 RDF and RDF Schema basic ideas ece 627, winter ‘132 RDF is about graphs – it creates a graph structure to represent.
RDF Schema (RDFS) RDF user communities need to define the vocabularies (terms) to indicate that they  are describing specific kinds or classes of resources.
Scalable Distributed Reasoning Using MapReduce Jacopo Urbani, Spyros Kotoulas, Eyal Oren, and Frank van Harmelen Department of Computer Science, Vrije.
Introduction The Resource Description Framework (RDF) is recommended by the World Wide Web Consortium (W3C) to model meta-data about the resources of the.
Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema Jeen Broekstra, Arjohn Kampman, and Frank van Harmelen 정홍석
Practical RDF Chapter 10. Querying RDF: RDF as Data Shelley Powers, O’Reilly SNU IDB Lab. Hyewon Lim.
OIL and DAML+OIL: Ontology Languages for the Semantic Web Sungshin Lim TOWARDS THE SEMANTIC WEB: Ontology-driven Knowledge.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
Practical RDF Ch.10 Querying RDF: RDF as Data Taewhi Lee SNU OOPSLA Lab. Shelley Powers, O’Reilly August 27, 2004.
ELW2009, Jenova SWCLOS: A Semantic Web Processor on CLOS Advanced Discussion Seiji Koide National Institute of Informatics.
Ch 7: RDF schema 현근수, 김영욱, 백상윤, 이용현 Team C. Introduction Semantic web modeling In RDF: simply creates graph structure to represent data In RDFS: about.
Important Concepts from the W3C RDF Vocabulary/Schema Sungtae Kim SNU OOPSLA Lab. August 19, 2004.
Description of Information Resources: RDF/RDFS (an Introduction)
R Store Angelique Moscicki Oshani Seneviratne Sergio Herrero-Lopez.
Sesame: An Architecture for Storing and Querying RDF Data and Schema Inf. Yasser Ganji Saffar When they were out of sight Ali Baba.
Of 38 lecture 6: rdf – axiomatic semantics and query.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall RDF & RDF Schema Machine Understandable Metadata for the.
Knowledge Technologies Manolis Koubarakis 1 Some Other Useful Features of RDF.
Sesame A generic architecture for storing and querying RDF and RDFs Written by Jeen Broekstra, Arjohn Kampman Summarized by Gihyun Gong.
Describing Web Resources in RDF Semantic Web. Knowledge Technologies Manolis Koubarakis 2 The Semantic Web “Layer Cake”
Linked Data & Semantic Web Technology The Semantic Web Part 7. RDF Semantics Dr. Myungjin Lee.
Semantic Web in Depth RDF Schema Dr Nicholas Gibbins –
Semantics of RDF(S) and OWL Ivan Herman, W3C (Last update: 28 Sep 2007) Ivan Herman.
Charlie Abela Department of Intelligent Computer Systems
Rya Query Inference.
An Introduction to RDF Schema
The Semantic Web Part 6. RDF Vocabularies: RDF Schema
Chapter 3 RDF Schema.
Introduction to the Semantic Web (tutorial) 2009 Semantic Technology Conference San Jose, California, USA June 15, 2009 Ivan Herman, W3C
Service-Oriented Computing: Semantics, Processes, Agents
Tutorial on Semantic Web
Tools for DAML-Based Services, Query Answering, and
Introduction to RDF and RDFS Editor: MR3
RDF 1.1 Concepts and Abstract Syntax
ece 720 intelligent web: ontology and beyond
Chapter 3 RDF and RDFS Semantics
Semantic Web Basics (cont.)
Tools for DAML-Based Services, Query Answering, and
Presentation transcript:

An Optimization Technique for RDFS Inference using the Application Order of RDFS Entailment Rules Kisung Kim, Taewhi Lee

Contents  Introduction  Related Work  Background & Motivation  Our Approaches  Application Order of RDFS Entailment Rules  Avoiding Producing Redundant Results  Experiments  Appendix

 RDF Schema  Provides additional expressive power and semantics to RDF model  Gives a mechanism to declare classes, properties, domain and range of a property  RDFS inference  From RDF Schema information, infer another RDF triples  Class/Property hierarchy  Resource type  RDF entailment rules gives a way for complete inference Introduction Class hierarchy Property hierarchy Domain/Range of property RDF Model RDF Schema

Related Work  RDF Semantics  Propose the RDF Model Theory, a semantic theory for RDF and RDFS  Provide the RDFS entailment rules  Patrick Hayes, RDFS Semantics, 2004, W3C Recommendation  WILBUR  Claim that exhaustive, iterative application of RDFS entailment rules is not a realistic way  Propose a lazy evaluation strategy  Ora Lassila, Taking the RDF Model Theory out for a Spin, ISWC, 2002  Sesame  Use a practical exhaustive forward chaining algorithm  Jeen Broekstra, Arjohn Kampman, Inferencing and Truth Maintenance in RDF Schema, Practical and Scalable Semantic System, 2003  Jena  Use a hybrid approach(forward chaining + backward chaining)  But doesn’t provide RDBMS-based inferencer

Sesame Inference Strategy publication SubjectPredicateObjectExplicit writerdfs:rangearticle1 rdfs:subClassOfpublication1 Jimwritewriting011 Triples Table RDQL {?X} rdf:type publication SQL Select t.subject from triples t where t.predicate = rdf:type and t.object = publication Easy to translate queries No semantic interpretation article Forward chaining Beginning with facts, chaining through rules, and finally establishing the goal subClassOf rdfs3 writing01 rdf:type article 0 rdfs9 writing01 rdf:type publication 0

RDFS Entailment Rules  Consist of 13 rules  Give a way for the complete inference  Infer new RDF statements based on the presence of other statements Example> rdfs3 : type inference through property range information If Repository contains: aaa rdfs:range xxx uuu aaa yyy Then add: yyy rdf:type xxx

rdf1 rdfs4a, 4b rdfs3_1, rdfs3_2 rdfs2_1, rdfs2_2 rdfs5_1, rdfs5_2 rdfs6 rdfs7_1, rdfs7_2 rdfs8 rdfs9_1, rdfs9_2 rdfs10 rdfs11_1, rdfs11_2 rdfs12 rdfs13 Sesame RDFMTInferencer New Triples Table Inferred Triples Table Sesame Inference Strategy Triples Table

Dependencies between RDFS Entailment Rules  Shows which rules must be triggered at the next iteration  Sesame uses the dependency table to eliminate redundant inferencing steps Rule dependency table SubjectPredicateObject writerdfs:rangearticle rdfs:subClassOfpaper Jimwritewriting01 Triples Table rdfs3 writing01 rdf:type article 0 rdfs9 writing01 rdf:type paper 0

Motivation(1/2)  Using the dependency table cannot remove inefficiency completely  Useless application of rule Example> rdfs8 triggers rdfs7 Need to apply only when there is superproperty of ‘rdfs:subClassof’ Rule Name If E contains:then add: rdfs7 aaa rdfs:subPropertyOf bbb uuu aaa yyy uuu bbb yyy rdfs8 uuu rdf:type rdfs:Class uuu rdfs:subClassOf rdfs:Resource

Motivation(2/2)  Redundant result Example> Rule 2, Rule 4 may create same results uuu rdf:type rdfs:Resource Rule Name If E contains:then add: rdfs2 aaa rdfs:domain xxx uuu aaa yyy uuu rdf:type xxx rdfs4auuu aaa xxx uuu rdf:type rdfs:Resource

Our approaches(1/6) Application Order of RDFS Entailment Rules  To minimize the useless application of rule  Assumption  There are no superclass or superproperty of pre-defined RDFS constructs  Order of the inference  Inference for new RDF data with pre-stored RDF Schema information  Inference for new RDF Schema information with pre- stored RDF Schema information  Inference for new and old RDF data with new RDF Schema information

Our approaches(2/6) Application Order of RDFS Entailment Rules  Iteration occurs when the inferred result contains subproperty or subclass of RDFS constructs  These are the information about the RDF schema itself  Starting point of repetition is different according the inferred results

Our approaches(3/6) Application Order of RDFS Entailment Rules rdf1 rdfs4a, 4b rdfs7_1 rdfs2, 3 rdfs9_1 rdfs13 rdfs8 rdfs10 rdfs11_1, rdfs11_2 rdfs6 rdfs12 rdfs5_1, rdfs5_2 rdfs7_2 rdfs9_2 Type inference with pre- defined RDF Schema Build Class Hierarchy Build Property Hierarchy Type inference with newly- defined RDF Schema Subclass of RDFS class Subproperty of RDF property

Our approaches(4/6) Application Order of RDFS Entailment Rules  Does this ordering guarantee complete inference?  We can show this with the dependency table 12_12_23_13_24a4b5_15_267_17_289_19_21011_111_ X X XXXX 12_12_23_13_24a4b5_15_267_17_289_19_21011_111_ XX 12_12_23_13_24a4b5_15_267_17_289_19_21011_111_ Remove the rules which are applied after rule 8 Assume that there is no subclass/subproperty of RDF Schema constructs

Our approaches(5/6) Avoiding Producing Redundant Results  Inferred triples must be checked whether already exist in triple table before insertion  Avoiding production of same results can improve performance  Add join predicates to the rule application SQL  Do not consider results that must be inferred by previous rules  Optimize constructing the transitive closure (subClassOf, subPropertyOf)

Our approaches(6/6) Avoiding Producing Redundant Results  rdfs2, rdfs3  Do not consider the property whose domain/range is ‘rdfs:Resource’  rdfs4a, rdfs4b infer triples which asserts that type of a resource is ‘rdfs:Resource’  rdfs7  Do not consider triples such as aaa subPropertyOf aaa  rdfs9  Do not consider triples such as aaa subClassOf aaa  rdfs5, rdfs11  Select distinct triples before checking n1 n2 If N nodes exists between two node, n1, n2, the application of the rule make n same results subClassOf

Experiment(1/3)  Environment  Pentium M GHz  1GB Ram  Windows XP Professional  Java SDK  Sesame  MySQL  Datasets Size(MB) # of statements # of inferred statements SesameOur approach Wordnet ,62699,690 NCI ,373966,827 GO2756,653,5922,055,383

Experiment(2/3)  # of rule application and inference time Our approach reduces # of rule application and improves the inference performance # of rule applicationInference time SesameOur approach Improvement(%) Sesame(s)Our approach(s) Improvement(%) Wordnet NCI GO

Experiment(3/3)  Scalability for data loading

Appendix(1/3) RDFS Entailment Rules Rule NameIf E contains:then add: rdfs1 uuu aaa lll. where lll is a plain literal (with or without a language tag). _: nnn rdf:type rdfs:Literal. where _: nnn identifies a blank node allocated to lll by rule rule lg. rdfs2 aaa rdfs:domain xxx. uuu aaa yyy. uuu rdf:type xxx. rdfs3 aaa rdfs:range xxx. uuu aaa vvv. vvv rdf:type xxx. rdfs4a uuu aaa xxx. uuu rdf:type rdfs:Resource. rdfs4b uuu aaa vvv. vvv rdf:type rdfs:Resource. rdfs5 uuu rdfs:subPropertyOf vvv. vvv rdfs:subPropertyOf xxx. uuu rdfs:subPropertyOf xxx. rdfs6 uuu rdf:type rdf:Property. uuu rdfs:subPropertyOf uuu. rdfs7 aaa rdfs:subPropertyOf bbb. uuu aaa yyy. uuu bbb yyy. rdfs8 uuu rdf:type rdfs:Class. uuu rdfs:subClassOf rdfs:Resource. rdfs9 uuu rdfs:subClassOf xxx. vvv rdf:type uuu. vvv rdf:type xxx. rdfs10 uuu rdf:type rdfs:Class. uuu rdfs:subClassOf uuu. rdfs11 uuu rdfs:subClassOf vvv. vvv rdfs:subClassOf xxx. uuu rdfs:subClassOf xxx. rdfs12 uuu rdf:type rdfs:ContainerMembershipProperty. uuu rdfs:subPropertyOf rdfs:member. rdfs13 uuu rdf:type rdfs:Datatype. uuu rdfs:subClassOf rdfs:Literal.

Appendix(2/3) Application of the RDFS entailment rules  Rules with one premise triple Example> rdfs8 RULE) uuu rdf:type rdfs:Class  uuu rdfs:subClassOf rdfs:Resource SQL) SELECT nt.subj,, FROM newtriples WHERE pred = and obj =  Rules with two premise triples  Need two SQL Example> rdfs2 RULE) aaa xxx & uuu aaa yyy  uuu rdf:type xxx SQL1) SELECT nt.subj,, t.obj FROM newtriples nt LEFT JOIN triples t ON t.subj = nt.pred WHERE t.pred = AND t.subj IS NOT NULL

Appendix(3/3) Application of the RDFS entailment rules  SQL of rdfs11_2 SELECT t.sub, 19, t.super FROM (SELECT DISTINCT t1.subj AS sub, 19, nt.obj AS super FROM triples nt LEFT JOIN triples t1 ON nt.subj = t1.obj AND t1.pred = 19 WHERE nt.id > 141 AND nt.id <= AND (t1.id <= ) AND nt.pred = 19 AND nt.obj > 0 AND t1.subj IS NOT NULL AND nt.subj != nt.obj AND t1.subj != t1.obj) t WHERE (t.sub, 19, t.super) NOT IN (SELECT subj, pred, obj FROM triples)