Interoperation of Information Sources via Articulation of Ontologies Prasenjit Mitra, Gio Wiederhold Stanford University Supported by AFOSR- New World.

Slides:



Advertisements
Similar presentations
MediaView -- Towards a “Semantic” Multimedia Database Model
Advertisements

eClassifier: Tool for Taxonomies
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
TU/e technische universiteit eindhoven Hera: Development of Semantic Web Information Systems Geert-Jan Houben Peter Barna Flavius Frasincar Richard Vdovjak.
Lukas Blunschi Claudio Jossen Donald Kossmann Magdalini Mori Kurt Stockinger.
Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme Presented by Smitashree Choudhury.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
Copyright Irwin/McGraw-Hill Data Modeling Prepared by Kevin C. Dittman for Systems Analysis & Design Methods 4ed by J. L. Whitten & L. D. Bentley.
By Murat Şensoy Ontology Alignment by Murat Şensoy
SRDC Ltd. 1. Problem  Solutions  Various standardization efforts ◦ Document models addressing a broad range of requirements vs Industry Specific Document.
IR & Metadata. Metadata Didn’t we already talk about this? We discussed what metadata is and its types –Data about data –Descriptive metadata is external.
Systems Analysis Requirements structuring Process Modeling Logic Modeling Data Modeling  Represents the contents and structure of the DFD’s data flows.
By Helen Howell. 2 Outline ] What is interoperation? ] Why? ] Problems ] Approach to Interoperations ] ONION ] Cyc.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
TOSS: An Extension of TAX with Ontologies and Similarity Queries Edward Hung, Yu Deng, V.S. Subrahmanian Department of Computer Science University of Maryland,
March 2000 Gio XIT 1 Increasing the Precision when Obtaining Information from the Web Gio Wiederhold Stanford University 4 April 2000 related report: www-db.stanford.edu/pub/gio/1999/miti.htm.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy Stanford Medical Informatics Stanford University.
By ANDREW ZITZELBERGER A Framework for Extraction Ontology Based Information Management.
XML on Semantic Web. Outline The Semantic Web Ontology XML Probabilistic DTD References.
August 2000 Gio vdR 1 Increasing the Precision of Semantic Interoperation Gio Wiederhold Stanford University August 2000 Reind van de Riet celebration.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy and Mark A. Musen.
Ontology Maintenance with an Algebraic Methodology: a Case Study Jan Jannink, Gio Wiederhold Presented by: Lei Lei.
Semi-Automatic Generation of Mini-Ontologies from Canonicalized Relational Tables Chris Hathaway Supported by NSF.
Semi-Automatic Generation of Mini-Ontologies from Canonicalized Relational Tables Chris Hathaway.
March 2000 Gio XIT 1 Increasing the Precision of Semantic Interoperation Gio Wiederhold Stanford University March 2000 report: www-db.stanford.edu/pub/gio/1999/miti.htm.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
1 The BT Digital Library A case study in intelligent content management Paul Warren
Developing an Ontology for Irrigation Information Resources *Cornejo, C., H.W. Beck, D.Z. Haman, F.S. Zazueta. University of Florida Gainesville, FL. USA.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Intelligent Database Systems Lab Presenter: WU, JHEN-WEI Authors: Rodrigo RizziStarr, Jose´ Maria Parente de Oliveira IS Concept maps as the first.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering Jeongwoo Ko, Luo Si, Eric Nyberg (SIGIR ’ 07) Speaker: Cho, Chin Wei Advisor:
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Theory and Application of Database Systems A Hybrid Approach for Extending Ontology from Text He Wei.
Dimitrios Skoutas Alkis Simitsis
Mining fuzzy domain ontology based on concept Vector from wikipedia category network.
LOD for the Rest of Us Tim Finin, Anupam Joshi, Varish Mulwad and Lushan Han University of Maryland, Baltimore County 15 March 2012
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Progress Report (Concept Extraction) Presented by: Mohsen Kamyar.
S calable K nowledge C omposition Ontology Interoperation January 19, 1999 Jan Jannink, Prasenjit Mitra, Srinivasan Pichai, Danladi Verheijen, Gio Wiederhold.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Information Retrieval
1 Data Mining: Text Mining. 2 Information Retrieval Techniques Index Terms (Attribute) Selection: Stop list Word stem Index terms weighting methods Terms.
University of the Aegean AI – LAB ESWC 2008 From Conceptual to Instance Matching George A. Vouros AI Lab Department of Information and Communication Systems.
Object storage and object interoperability
AIFB Ontology Mapping I3CON Workshop PerMIS August 24-26, 2004 Washington D.C., USA Marc Ehrig Institute AIFB, University of Karlsruhe.
1 Ontology Evolution within Ontology Editors Presentation at EKAW, Sigüenza, October 2002 L. Stojanovic, B. Motik FZI Research Center for Information Technologies.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Topical Analysis and Visualization of (Network) Data Using Sci2 Ted Polley Research & Editorial Assistant Cyberinfrastructure for Network Science Center.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Mapping the NCI Thesaurus and the Collaborative Inter-Lingual Index Amanda Hicks University of Florida HealthInsight Workshop, Oslo, Norway.
Composing semantic Web services under constraints E.Karakoc, P.Senkul Journal: Expert Systems with Applications 36 (2009)
February 2001 Gio Wiederhold Stanford University
Associative Query Answering via Query Feature Similarity
Tools For Resolving Heterogeneity Computer Science Department
Obtaining Precision when Integrating Information
Semantic Precision for Web-based Interoperation
Extracting Semantic Concept Relations
Presented by: Prof. Ali Jaoua
Stanford University, CSD
Tantan Liu, Fan Wang, Gagan Agrawal The Ohio State University
Presentation transcript:

Interoperation of Information Sources via Articulation of Ontologies Prasenjit Mitra, Gio Wiederhold Stanford University Supported by AFOSR- New World Vistas Program

Prasenjit Mitra2 Outline zIntroduction zPreliminaries zArticulation Generation Toolkit zOntology Algebra zConclusion

Prasenjit Mitra3 Interoperation of Information Sources zCompose information zMultiple independent, heterogeneous sources zReliability, scalability zSemantic heterogeneity - same term different semantics - different term same semantics

Prasenjit Mitra4 Preliminaries: Ontology zOntology - hierarchy of terms and specification of their properties. zModeled as a directed labeled graph + set of rules. zOnt = ( V, E, R) V - set of nodes(concepts) E - set of edges(properties) R - set of rules involving V,E

Prasenjit Mitra5 Example Car H123 Instance List_Price VID PMitra Name Unit $ 40kCA Address Amount Vehicle LuxuryCa r PMitra Buyer Instance SubClass Instance VIN H123 OwnerRetail_ Price CA NameAddr $35k SubClass Buyer Owner Attribute Equ

Prasenjit Mitra6 Articulation Rules zArticulation rules - logic-based rules that relate concepts in two ontologies: -Binary Relationships (O1.Car SubClassOf O2.Vehicle) (O1.Buyer Equ O2.Owner) (O2.LuxuryCar SubClassOf O1.Car)

Prasenjit Mitra7 Articulation Rules (contd.) zHorn Clauses - (O1.Car O1.Instance X), (X O1.Price Y), (Y > $30000) => (O2.LuxuryCar Instance X) - (V O2.Retail_Price P), (C O1.List_Price L), (L Unit U), (L Amount A), (V Equ C) => (P Equ concat(U,A))

Prasenjit Mitra8 Contributions zArticulation Generation Toolkit - produce translation rules semi-automatically - a library of reusable heuristic methods - a GUI to display ontologies and interact with the expert zOntology Algebra - query rewriting and planning

Prasenjit Mitra9 Articulation Generator Driver Ont1Ont2 Phrase Relator Thesaurus Structural Matcher Semantic Network Human Expert Context-based Word Relator OntA

Prasenjit Mitra10 Articulation Generation Methods zNon-iterative Methods - Lexical Matcher - Thesaurus-based Matcher - Corpus-based Matcher - Instance-based Matcher zIterative Methods - Structural Matcher - Inference-based Matcher

Prasenjit Mitra11 Lexical Methods zPreprocessing rules. -Expert-generated seed rules. e.g., (O1.List_Price Equ O2.Retail_Price) -Context-based preprocessing directives. e.g., (O1.UK_Govt Equ O2.US_Govt) -Stop-word Removal & Stemming -Word match (full or partial) -Phrase match e.g., (O1.Ministry_Of_Defence Equ O2.Defense_Ministry) 0.6

Prasenjit Mitra12 Thesaurus-based methods zConsult a dictionary/thesaurus to find synonyms, related words zGenerate a similarity measure or relatedness measure - words that have similar words in their definitions are similar zGet more semantically meaningful relationships from WordNet (syn, hyper)

Prasenjit Mitra13 Candidate Match Repository Term linkages automatically extracted from 1912 Webster’s dictionary * * free, other sources. being processed. Based on processing headwords ý definitions Notice presence of 2 domains: chemistry, transport

Prasenjit Mitra14 Corpus-based zCollect a set of text documents preferably from same domain - search using keywords in google zBuild a context vector (1000-character neighbourhood) for each word zCompute word-pair similarity based on the cosine of the vectors zUse word-pair similarity to find similarity among labels of nodes/edges

Prasenjit Mitra15 Structural Methods zUses results of lexical match zIf x% of parent nodes match & y% of children nodes match zSpecial relations (AttributeOf) match

Prasenjit Mitra16 Tools to create articulations Graph matcher for Articulation- creating Expert Vehicle ontology Transport ontology Suggestions for articulations

Prasenjit Mitra17 continue from initial point Also suggest similar terms for further articulation: by spelling similarity, by graph position by term match repository Expert response: 1. Okay 2. False 3. Irrelevant to this articulation All results are recorded Okay ’s are converted into articulation rules

Prasenjit Mitra18 An Ontology Algebra Operations can be composed Operations can be rearranged Alternate arrangements can be evaluated Optimization is enabled The record of past operations can be kept and reused when sources change

Prasenjit Mitra19 Binary Operators A knowledge-based algebra for ontologies The Articulation Function (ArtGen), given two ontologies, supplies articulation rules between them. Intersection create a subset ontology keep sharable entries Union create a joint ontology merge entries Difference create a distinct ontology remove shared entries

Prasenjit Mitra20 Intersection: Definition zO1 = (V1, E1, R1) O2 = (V2, E2, R2) zOI = ( O1 Int ArtGen O2) = (VI, EI, RI) zArules = ArtGen( O1, O2 ) zVI = Nodes( Arules ) zEI = Edges( Arules ) + Edges(E1, VI.V1) + Edges( E2, VI.V2) zRI = Arules + Rules( R1, VI.V1) + Rules( R2, VI.V2)

Prasenjit Mitra21 Intersection: Example z ARules = { (O1.Car SubClass O2.LuxuryCar), (O1.MSRP Equ O2.Price)} z NI = ( O1.Car, O1.MSRP, O2.LuxuryCar, O2.Price ) z EI = Edges(ARules) + {(O1.Car Attribute O1.MSRP), (O2.LuxuryCar Attribute Price)} InexpCar Car LuxuryCar MSRPPrice LuxuryTax SubClass Equ O1 O2 OI

Prasenjit Mitra22 Intersection: Properties zCommutative? OI 12 =(O1 Int ArtGen O2) = (O2 Int ArtGen O1)=OI 21 VI 12 = VI 21, EI 12 = EI 21, RI 12 =RI 21 ARules 12 = ARules 21 ArtGen(O1, O2) = ArtGen(O2, O1) zInt ArtGen is commutative iff ArtGen is commutative

Prasenjit Mitra23 Semantically Commutative Example: zArtGen(O1,O2) : ( O1.Car SubClassOf O2.Vehicle) zArtGen(O2,O1) : ( O2.Vehicle SuperClassOf O1.Car) zDefn: ArtGen is Semantically Commutative iff ArtGen(O1,O2) ArtGen(O2,O1) Operands to intersection can be rearranged if ArtGen is semantically commutative

Prasenjit Mitra24 Associativity Example zExample: zArtRules(O1, O3) : (O1.Car SubClassOf O3.Vehicle) zArtRules(O2, O3) : (O2.Truck SubClassOf O3.Vehicle) zArtRules(O1, O2) : null zIntersection is not associative!

Prasenjit Mitra25 Associativity z(O1.O2).O3 = O1.(O2.O3) iff ArtGen is consistent and transitively connective zconsistent - given two nodes it generates the same relationship irrespective of relationships between other nodes

Prasenjit Mitra26 Associativity (contd.) zArtGen relates A and B: (A Rel B) = (A R1 B) or (B R2 A) zTransitively connective: If ArtGen generates (A Rel B), (B Rel C) then it also generates (A Rel C’)  where A  O1, B  O2, C,C’  O3

Prasenjit Mitra27 Transitive Connectivity A B C E D O1 O2 O3

Prasenjit Mitra28 Exploiting the result. Processing & query evaluation is best performed within Source Domains & by their engines Result has links to source Avoid n 2 problem of interpreter mapping [Swartout HPKB year 1]

Prasenjit Mitra29 Using the Articulation Rules zQ(X) :- (LuxuryCar Instance X), (X Owner Y), (Y Addr `CA’) zTranslated using Articulation Rules Q(X) :- (Car Instance X),(X List_Price Y) (Y Unit ‘$’),(Y Amount A), (A > 30000), (X Buyer Z), (Z Address ‘CA’)

Prasenjit Mitra30 Related Work zTsimmis zGarlic zInfomaster zInformation Manifold zClio zLSD

Prasenjit Mitra31 Conclusion zIntroduced an Ontology Interoperation System. zInteroperation based on articulations that bridge the semantic gap. zGraph-oriented data model, logical rules zFounded on Ontology Algebra Thanks to Stefan Decker, Alex Carobus, Jan Jannink, Sergey Melnik, Shrish Agarwal