 -Queries: Enabling Querying for Semantic Associations on the Semantic Web WWW2003 (Budapest, May 23, 2003) Paper Presentation Kemafor Anyanwu Amit Sheth.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
1 ICS-FORTH EU-NSF Semantic Web Workshop 3-5 Oct Christophides Vassilis Database Technology for the Semantic Web Vassilis Christophides Dimitris Plexousakis.
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
By Daniela Floresu Donald Kossmann
Semantic Web Thanks to folks at LAIT lab Sources include :
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
Knowledge Graph: Connecting Big Data Semantics
Interoperability of Distributed Component Systems Bryan Bentz, Jason Hayden, Upsorn Praphamontripong, Paul Vandal.
RDF Databases By: Chris Halaschek. Outline Motivation / Requirements Storage Issues Sesame General Introduction Architecture Scalability RQL Introduction.
Reducing the Cost of Validating Mapping Compositions by Exploiting Semantic Relationships Eduard C. Dragut Ramon Lawrence Eduard C. Dragut Ramon Lawrence.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
The Semantic Web Week 1 Module Content + Assessment Lee McCluskey, room 2/07 Department of Computing And Mathematical Sciences Module.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Page 1 Multidatabase Querying by Context Ramon Lawrence, Ken Barker Multidatabase Querying by Context.
Swoogle Swoogle Semantic Search Engine Web-enhanced Information Management Bin Wang.
Indexing XML Data Stored in a Relational Database VLDB`2004 Shankar Pal, Istvan Cseri, Gideon Schaller, Oliver Seeliger, Leo Giakoumakis, Vasili Vasili.
Semantic Web Technology Evaluation Ontology (SWETO): A test bed for evaluating tools and benchmarking semantic applications WWW2004 (New York, May 22,
Information Retrieval in Practice
Predicting Missing Provenance Using Semantic Associations in Reservoir Engineering Jing Zhao University of Southern California Sep 19 th,
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
IS432: Semi-Structured Data Dr. Azeddine Chikh. 1. Semi Structured Data Object Exchange Model.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Semantic Web. Course Content
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Practical RDF Chapter 1. RDF: An Introduction
Ranking Relationships on the Semantic Web Budak Arpinar This work is funded by NSF-ITR-IDM Award# titled '‘SemDIS: Discovering Complex Relationships.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Peer-to-Peer Data Integration Using Distributed Bridges Neal Arthorne B. Eng. Computer Systems (2002) Supervisor: Babak Esfandiari April 12, 2005 Candidate.
Logics for Data and Knowledge Representation
SWETO: Large-Scale Semantic Web Test-bed Ontology In Action Workshop (Banff Alberta, Canada June 21 st 2004) Boanerges Aleman-MezaBoanerges Aleman-Meza,
Extracting Semantic Constraint from Description Text for Semantic Web Service Discovery Dengping Wei, Ting Wang, Ji Wang, and Yaodong Chen Reporter: Ting.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
Ontologies and Lexical Semantic Networks, Their Editing and Browsing Pavel Smrž and Martin Povolný Faculty of Informatics,
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
SemRank: Ranking Complex Relationship Search Results on the Semantic Web Kemafor Anyanwu, Angela Maduko, Amit Sheth LSDIS labLSDIS lab, University of Georgia.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
updated ’08CmpE 583 Fall 2008Introduction- 1 CmpE 583- Web Semantics: Theory and Practice Atilla ELÇİ Computer Engineering Department Eastern.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #22 Secure Web Information.
Peer-to-Peer Discovery of Semantic Associations Matthew Perry, Maciej Janik, Cartic Ramakrishnan, Conrad Ibanez, Budak Arpinar, Amit Sheth 2 nd International.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Searching and Ranking Documents based on Semantic Relationships PaperPaper presentation ICDE Ph.D. Workshop 2006 April 3rd, 2006, Atlanta, GA, USA This.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Context Aware Semantic Association Ranking SWDB Workshop Berlin, September 7, 2003 Boanerges Aleman-MezaBoanerges Aleman-Meza, Chris Halaschek, I. Budak.
Ontology Quality by Detection of Conflicts in Metadata Budak I. Arpinar Karthikeyan Giriloganathan Boanerges Aleman-Meza LSDIS lab Computer Science University.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
CONTEXTUAL SEARCH AND NAME DISAMBIGUATION IN USING GRAPHS EINAT MINKOV, WILLIAM W. COHEN, ANDREW Y. NG SIGIR’06 Date: 2008/7/17 Advisor: Dr. Koh,
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
An Ontological Approach to Financial Analysis and Monitoring.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Discovering and Ranking Semantic Associations over a Large RDF Metabase Chris Halaschek, Boanerges Aleman- Meza, I. Budak Arpinar, Amit P. Sheth 30th International.
Of 24 lecture 11: ontology – mediation, merging & aligning.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
The Semantic Web By: Maulik Parikh.
Peer-to-Peer Discovery of Semantic Associations
Associative Query Answering via Query Feature Similarity
ece 627 intelligent web: ontology and beyond
LOD reference architecture
Topic: Semantic Text Mining
Presentation transcript:

 -Queries: Enabling Querying for Semantic Associations on the Semantic Web WWW2003 (Budapest, May 23, 2003) Paper Presentation Kemafor Anyanwu Amit Sheth Large Scale Distributed Information Systems Lab University of Georgia This material is based upon work supported by the National Science Foundation under Grant No

“Finding out about” [ Belew00 ] relationships! Finding things From ….. to…..

Outline Semantic Associations: Introduction A Formal Framework for Semantic Associations on the Semantic Web  -Queries For Discovering Semantic Associations  Implementation Strategies & Issues Related Work Conclusion & Future Work

Web Search/Query Techniques are “Entity-Centric” Search Engines Query Languages documents Entities: XML elements, RDF Resources, etc Meta-Search Engines Entities + metadata

But….. “ An object by itself is intensely uninteresting ”. Grady Booch, Object Oriented Design with Applications, 1991

 Mechanisms for querying about and retrieving complex relationships between entities. We need A B C 1. A is related to B by x.y.z x y z z’ y’ 2. A is related to C by i. x.y’.z’ u v ii. u.v (undirected path)  3. A is “related similarly” to B as it is to C (y’  y and z’  z  x.y.z  x.y’.z’) So are B and C related? ?

Why do we need this? Very useful in information analytics  national security  business intelligence Avoids the task of familiarizing oneself with schemas in order to formulate queries  especially when multiple schemas are involved !

Example in 9-11 context What are relationships between Khalid Al- Midhar and Majed Moqed ?  Connections Bought tickets using same frequent flier number  Similarities Both purchased tickets originating from Washington DC paidby cash and picked up their tickets at the Baltimore-Washington Int'l Airport Both have seats in Row 12 “What relationships exist (if any) between Osama bin Laden and the 9-11 attackers”

&r4 &r8 &r2 Bank Account no FFlyer Payment paidby typeOf(instance) subClassOf(isA) subPropertyOf paidby “Marwan” “Al-Shehhi” &r7 &r1 fname lname purchased “M’mmed” “Atta” Ticket Flight forflight String number purchased Client fname lname String Passenger FFNo String Customer Cash fflierno creditedto fflierno creditedto paidby holder &r9 float amount purchased for &r5&r6 purchased for CCard fname lname “Abdulaziz ” “Alomari ” fname lname “XYZ123” &r3 String ffid &r11 holder paidby holder

A Foundation for Semantic Associations on the Semantic Web

Complex Relationships? Traditional notions of relationships are captured by single n-ary relations  e.g. RDF:Property, UML Association, E-R:relationship, etc. Complex relationships can be viewed as specific compositions of multiple single n-ary relations  e.g. Sequence composition of binary relations allows us to capture paths Relation Sequences + certain operations allow us to detect very interesting relationships  Connectivity  Similarity

Semantic Web RDF is the current W3C standard for metadata representation on the Semantic Web Other proposals include OWL, DAML+OIL, UML, Topic Maps, etc. In RDF, the basic unit of relationship is a Property

(Karvounarakis et al 2002) gives a formalization of RDF/RDFS which forms the basis for a typed RDF query language – RQL.  It provides a type system for RDF Schemas  For each type e.g. class type  c, property type  p, there is a mapping [[ ]] to its members  e.g. for a property type p, [[p]] is defined as {[v1, v2] | v1  [[ p.domain ]], v2  [[ p.range ]] }  { [[ p’ ]] | ’  p} Formal Data Model for RDF

We add The notion of an RDFS Schema Set. Basically, a union of a set of RDF Schemas supplying the context for a query  In the example, Flight + Banking Schemas The notion of a Property Sequence, which is the sequential composition of RDF Properties and define relations on Property Sequences A formalization for Semantic Associations based on Property Sequences and their relations

Property SequenceFinite sequence of properties PS = [P 1, P 2, P 3, … P n ], P i is a property defined in an RDF Schema RS j of a schema set RSS. e.g. [purchased, paidby]. ps  [[PS]] implies i. ps[i]  [[P i ]] for 1  i  n ii. ps[i][1] = ps[i+1][0] Joined Property Sequences ( ) PS 1 PS 2   c  (PS 1.NodesOfPS()  PS 2.NodesOfPS()). c is called join node  -Isomorphic Property Sequences (   ) PS 1   PS 2  i. PS 1 = [P 1, P 2, P 3, … P m ], PS 2 = [Q 1, Q 2, Q 3, … Q m ] ii. for all i, 1  i  m: P i = Q i or P i  Q i or Q i  P i (  means subpropertyOf ) Note that the Property Sequences need not be exact to be  -Isomorphic, just similar.   A sequence such as “ awarded.paidby ” which means that a passenger was awarded a ticket, paid for by frequent miles is considered  -Isomorphic to “ purchased.paidby ”.

Semantic Associations

 -pathAssociation Let PS be a Property Sequence and ps  [[PS]]. If x and y are the origin/terminus and terminus/origin of ps respectively,   -pathAssociated (x, y) &r1 &r5 &r6 purchased for “M’mmed” “Atta” fname lname “Abdulaziz” “Alomari ” fname lname  -pathAssociated

&r4 &r8 purchased fflierno creditedto  -joinAssociated  -joinAssociation  -joinAssociated (x, y)  a)  PS 1,PS 2 : PS 1 PS 2 b)  ps 1,ps 2 : ps1  [[ PS 1 ]], ps2  [[ PS 2 ]] i.x is the origin of ps1 and y is the origin of ps2 or ii.x is the terminus of ps1 and y is the terminus of ps2. “Marwan” “Al-Shehhi” &r7 &r1 fname lname “M’mmed” “Atta” fname lname purchased &r5 &r11 holder paidby  -joinAssociated &r6 for “Abdulaziz” “Alomari ” fname lname join nodes 

 -IsoAssociation  -IsoAssociated (x, y)  a)  PS 1,PS 2 : PS 1  PS 2 b)  ps 1,ps 2 : ps 1  [[ PS 1 ]], ps 2  [[PS 2 ]] i. x is the origin/terminus of ps 1 and y is the origin/terminus of ps 2.  -IsoAssociated &r8 &r2 paidby “Marwan” “Al-Shehhi” &r7 &r1 fname lname purchased “M’mmed” “Atta” paidby &r9 fname lname &r3 CashTicketPassenger

 -Queries for Discovering Semantic Associations

Let  U (2) = { {x, y} : x, y   U and x  y}, PS = {PS : PS is a Property Sequence}, PS (2) = {{PS 1, PS 2 } : PS 1, PS 2 are Property Sequences} A  -Query Q maps from a pair of keys to the PS and PS (2) in the following manner:   :  U (2)  2 PS   :  U (2)  2 PS(2)     :  U (2)  2 PS(2)   -Queries

Implementation Approaches for  - Operators Exploit existing RDF storage & query infrastructure:  Persistent Stores  Translations to query expressions at data store layer, guided by index structures  Memory-Resident Stores  Employ graph traversal algorithms Alternative Representation with complimentary indexes and algorithms i.e. search-engine type Strategy

Evaluation Testbed Ontology RDF Description Base wrt to this schema is populated from 30+ sources

Knowledge Agents SEMANTIC ENHANCEMENT SERVER Content Agents ONTOLOGY 3 rd Party Applications Knowledge API Bespoke Application SEMANTIC QUERY SERVER Semantic Query API Use of Semagix Freedom for automatic ontology-driven metadata extraction to create large RDF description-base from many sources Semagix Freedom Semagix Freedom is based on prior research at the LSDIS Lab-> resulting SCORE technology.SCORE technology

 -PathAssociated(Transfer1, Iraq) Transfer1 → Account2 → IraqInternationalBank → Iraq Transfer1 → Account2 → SaddamHussein → Iraq Transfer1 → Account2 → SaddamHussein → IraqGovernment → Iraq

ρ-joinAssociated(Account2, 1) Account2 → IraqInternationalBank → Iraq 1 → SaddamHussein → Iraq Account2 → IraqInternationalBank → Iraq 1 → SaddamHussein → IraqGovernment → Iraq Account2 → SaddamHussein 1 → SaddamHussein

ρ-IsoAssociated(Account2, Account1) Account2 → at → IraqInternationalBank → locatedIn → Iraq Account1 → at → PakistanInternationalBank → locatedIn → Pakistan Account2 → p_holder → SaddamHussein → fromLocation → Iraq Account1 → p_holder → OsamaBinLaden → fromLocation → SaudiArabia Account2 → p_holder→ SaddamHussein → leaderOf → IraqGovernment → locatedIn → Iraq Account1→ p_holder→ OsamaBinLaden → leaderOf → AlQeada → locatedIn → Afghanistan

Current & Future Work Data Preprocessing and Serialization Context  Specification & Representation  Streamline Query Processing  Ranking Query Processing Optimizations  Index structures  Heuristics  Complexity =  (n-1) (l=1) (# paths of length l) (probability of keeping path of length l). Result Presentation Spatio-Temporal constraints

Related Work IR over XML, Relational Databases  [Hristidis et al 02,03], [Theobald et al 02],[Guha et al 03] Support for Path Expressions in Semi- Structured and Object-Oriented models [Christophides et al 94], [Abiteboul et al 97], [Buneman et al 00], etc. Graph Databases  [Mendelzon, Wood 89]

More info.  Project description, papers, presentations