Logics for Data and Knowledge Representation

Slides:



Advertisements
Similar presentations
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Advertisements

OWL - DL. DL System A knowledge base (KB) comprises two components, the TBox and the ABox The TBox introduces the terminology, i.e., the vocabulary of.
S-Match: an Algorithm and an Implementation of Semantic Matching Pavel Shvaiko 1 st European Semantic Web Symposium, 11 May 2004, Crete, Greece paper with.
INTRODUCTION TO ARTIFICIAL INTELLIGENCE Massimo Poesio LECTURE 4: Semantic Networks and Description Logics.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
A Framework for Ontology-Based Knowledge Management System
CMPT 354, Simon Fraser University, Fall 2008, Martin Ester 52 Database Systems I Relational Algebra.
The Semantic Web – WEEK 5: RDF Schema + Ontologies The “Layer Cake” Model – [From Rector & Horrocks Semantic Web cuurse]
TOSS: An Extension of TAX with Ontologies and Similarity Queries Edward Hung, Yu Deng, V.S. Subrahmanian Department of Computer Science University of Maryland,
Representation of Web Data in a Web Warehouse Ragini A.S. & Shipra Dutta November 20 th, 2001.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Partners Using NLP Techniques for Meaning Negotiation Bernardo Magnini, Luciano Serafini and Manuela Speranza ITC-irst, via Sommarive 18, I Trento-Povo,
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Developing facets in UDC for online retrieval Claudio Gnoli (University of Pavia) Aida Slavic (UDC Consortium) 8th NKOS Workshop, Corfu, 1 Oct 2009.
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
LDK R Logics for Data and Knowledge Representation Semantic Matching.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Logics for Data and Knowledge Representation Semantic Matching.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
An Introduction to Description Logics. What Are Description Logics? A family of logic based Knowledge Representation formalisms –Descendants of semantic.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Ontologies for the Integration of Geospatial Data Michael Lutz Workshop: Semantics and Ontologies for GI Services, 2006 Paper: Lutz et al., Overcoming.
LDK R Logics for Data and Knowledge Representation Lightweight Ontologies.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
LDK R Logics for Data and Knowledge Representation Application of (Ground) ClassL.
ISURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains Prof. Dr. Asuman Dogac METU-SRDC Turkey METU.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Dimitrios Skoutas Alkis Simitsis
LDK R Logics for Data and Knowledge Representation ClassL (part 3): Reasoning with an ABox 1.
Logics for Data and Knowledge Representation Applications of ClassL: Lightweight Ontologies.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
LDK R Logics for Data and Knowledge Representation ClassL (Propositional Description Logic with Individuals) 1.
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
Working with XML. Markup Languages Text-based languages based on SGML Text-based languages based on SGML SGML = Standard Generalized Markup Language SGML.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Of 29 lecture 15: description logic - introduction.
LDK R Logics for Data and Knowledge Representation Description Logics: family of languages.
Ontology Technology applied to Catalogues Paul Kopp.
Distributed Instance Retrieval over Heterogeneous Ontologies Andrei Tamilin (1,2) & Luciano Serafini (1) (1) ITC-IRST (2) DIT - University of Trento Trento,
Logics for Data and Knowledge Representation ClassL (part 1): syntax and semantics.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
The Semantic Web By: Maulik Parikh.
COMP6215 Semantic Web Technologies
ece 720 intelligent web: ontology and beyond
ece 627 intelligent web: ontology and beyond
Element Level Semantic Matching
Ontology.
Logics for Data and Knowledge Representation
Information Retrieval
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
Information Networks: State of the Art
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
CIS Monthly Seminar – Software Engineering and Knowledge Management IS Enterprise Modeling Ontologies Presenter : Dr. S. Vasanthapriyan Senior Lecturer.
Presentation transcript:

Logics for Data and Knowledge Representation Applications of ClassL: Lightweight Ontologies

Outline Lightweight Ontologies Descriptive and classification ontologies Real world and classification semantics Lightweight Ontologies Converting classifications into Lightweight Ontologies Applications on Lightweight Ontologies Document Classification Query-answering Semantic Matching 2

Ontologies ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Animal Bird Head Mammal Predator Herbivore Goat Tiger Chicken Cat Is-a Eats Part-of Body Ontologies are explicit specifications of conceptualizations [Gruber, 1993] They are often thought of as directed graphs whose nodes represent concepts and whose edges represent relations between concepts

Concepts and Relations between them ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES CONCEPT: it represents a set of objects or individuals EXTENSION: the set above is called the concept extension or the concept interpretation Concepts are often lexically defined, i.e. they have natural language names which are used to describe the concept extensions (e.g. Animal, Lion, Rome), often with an additional description (gloss) RELATION: a link from the source concept to the target concept The backbone structure of an ontology graph is a taxonomy in which the relations are ‘is-a’, ‘part-of’ and ‘instance-of’, whereas the remaining structure of the graph supplies auxiliary information about the modeled domain and may include relations like ‘located-in’, ‘eats’, ‘ant’, etc. They are respectively called hierarchical (BT/NT) and associative (RT) relations in Library Science.

Ontology as a graph: a mathematical definition ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES An ontology is an ordered pair O = <V, E> V is the set of vertices describing the concepts E is the set of edges describing relations Animal Bird Head Mammal Predator Herbivore Goat Tiger Chicken Cat Is-a Eats Part-of Body

Tree-like Ontologies ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Animal Bird Head Mammal Predator Herbivore Goat Tiger Chicken Cat Is-a Part-of Body Take the ontology in the previous slide and remove those auxiliary relations… … we get a tree-like ontology consisting of its backbone structure with ‘is-a’ and ‘part-of’ relations (*), that is an informal lightweight ontology. (*) Notice that in some cases we can obtain more complex structures like DAGs or even with cycles

Descriptive VS. Classification Ontologies ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Descriptive ontologies They are used to describe a piece of world, such as the Gene ontology, Industry ontology, etc. The purpose is to make a clear description of the world. This is usually the first idea to mind when people talk about ontologies. Classification ontologies They are used to classify things, such as books, documents, web pages, etc. The aim is to provide a domain specific category to organize individuals accordingly. Such ontologies usually take the form of classifications with or without explicit meaningful links. From paper: F. Giunchiglia, B. Dutta, V. Maltese. Faceted lightweight ontologies. In “Conceptual Modeling: Foundations and Applications”, A. Borgida, V. Chaudhri, P. Giorgini, Eric Yu (Eds.) LNCS 5600 Springer, 2009.

Real world and Classification semantics ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Real world semantics In descriptive ontologies, concepts represent real world entities. For example, the extension of the concept animal is the set of real world animals, which can be connected via relations of the proper kind. Classification semantics In classification ontologies, the extension of each concept (label of a node) is the set of documents about the entities or individual objects described by the label of the concept. For example, the extension of the concept animal is “the set of documents about animals” of any kind. From paper: F. Giunchiglia, B. Dutta, V. Maltese. Faceted lightweight ontologies. In “Conceptual Modeling: Foundations and Applications”, A. Borgida, V. Chaudhri, P. Giorgini, Eric Yu (Eds.) LNCS 5600 Springer, 2009. 8

Why ‘Lightweight’ Ontologies? ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES The majority of existing ontologies are ‘simple’ taxonomies or classifications, i.e., hierarchically organized categories used to classify resources. Ontologies with arbitrary relations do exist, but no intuitive and efficient reasoning techniques support such ontologies in general. … so we need ‘lightweight’ ontologies.

Lightweight Ontologies ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES A (formal) lightweight ontology is a triple O = <N,E,C> where: N is a finite set of nodes, E is a set of edges on N, such that <N,E> is a rooted tree, C is a finite set of concepts expressed in a formal language F, such that for any node ni ∈ N, there is one and only one concept ci ∈ C, and, if ni is the parent node for nj, then cj ⊑ ci. NOTE: lightweight ontologies are in classification semantics From paper: F. Giunchiglia, M. Marchese, I. Zaihrayeu. “Encoding Classifications into Lightweight Ontologies.” J. of Data Semantics VIII, Springer-Verlag LNCS 4380, pp 57-81, 2007.

Converting tree-like structures into LOs ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES For a descriptive ontology, the backbone taxonomy of ‘is-a’ and ‘instance-of’ is intuitively coincident with the subsumption (‘⊑’) relation in LOs. NOTE: ‘part-of’ relations correspond to subsumption only if transitive. For instance the following chain cannot be translated: handle part-of door part-of school part-of school system For a classification ontology, the extension of each node is the set of documents (books, websites, etc.) that should be classified under the node. Therefore, the links has to be interpreted as ‘subset’ relations and can be transformed directly into subsumption in the target LOs. From paper: F. Giunchiglia, B. Dutta, V. Maltese. Faceted lightweight ontologies. In “Conceptual Modeling: Foundations and Applications”, A. Borgida, V. Chaudhri, P. Giorgini, Eric Yu (Eds.) LNCS 5600 Springer, 2009.

Descriptive and classification ontologies ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Animal Vertebrate Mammal A B D Invertebrate C E Bird is-a (a) World Europe France A B D Asia C E Italy part-of F Rome (b) (a) and (b) are two descriptive ontologies. The corresponding classification ontologies are obtained by substituting all the relations with ‘subset’. (a) and (b) can be converted into lightweight ontologies by substituting the relations into subsumptions. However, the semantics changes from real world to classification semantics. From paper: F. Giunchiglia, B. Dutta, V. Maltese. Faceted lightweight ontologies. In “Conceptual Modeling: Foundations and Applications”, A. Borgida, V. Chaudhri, P. Giorgini, Eric Yu (Eds.) LNCS 5600 Springer, 2009. 12

Populated (Lightweight) Ontologies ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES In Information Retrieval, the term classification is seen as the process of arranging a set of objects (e.g., documents) into a set of categories or classes. A classification ontology is said populated if a set of objects has been classified under ‘proper’ nodes. Thus a populated (lightweight) ontology includes (explicit or implicit) ‘instance-of’ relations

Example of a Populated Ontology ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Animal ⊑ ⊑ ⊑ ⊑ Bird Mammal Head Body ⊑ ⊑ ⊑ Chicken Predator Herbivore Instance-of ⊑ ⊑ ⊑ ‘Chicken Soup’ Instance-of Cat Tiger Goat ‘How to Raise Chicken’ Instance-of Instance-of Instance-of ‘Tom and Jerry’ ‘www.protectTiger.org’ …

Lightweight Ontologies in ClassL: TBox ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Subsumption terminologies. Recall: ‘… C is a finite set of concepts expressed in a formal language F, such that for any node ni∈N, there is one and only one concept ci∈C, and, if ni is the parent node for nj ,then cj ⊑ ci.’ Bird ⊑ Animal Mammal ⊑ Animal Chicken ⊑ Bird Cat ⊑ Predator … NOTE: a tree-like ontology can be transformed into a lightweight ontology, but not vice versa. This is because we loose information during the translation.

Populated LOs in ClassL: TBox + ABox ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES ‘instance-of’ links are encoded into ‘concept assertions’: Chicken(ChickenSoup) Cat(TomAndJerry) … Instances are the elements of the domain, namely the documents classified in the categories.

Classifications are: Easy to use for humans ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Easy to use for humans Pervasive (Google, Yahoo, Amazon, our PC directories, email folders, address book, etc.). Largely used in commercial applications (Google, Yahoo, eBay, Amazon, BBC, CNN, libraries, etc.). Have been studied for very long time (e.g., Dewey Decimal Classification system - DDC, Library of Congress Classification system - LCC, etc.). From paper: F. Giunchiglia, M. Marchese, I. Zaihrayeu. “Encoding Classifications into Lightweight Ontologies.” J. of Data Semantics VIII, Springer-Verlag LNCS 4380, pp 57-81, 2007.

Classification Example: Yahoo! Directory ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES

Classification Example: Email Folders ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES

Classification Example: E-Commerce Category ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES

Computers and Internet Label Semantics ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Level Natural language words are often ambiguous E.g. Java (an island, a beverage, a programming language) When used with other words in a label, improper senses can be pruned E.g., “Java Language” – only the 3rd sense of Java is preserved We translate node labels into unambiguous propositions in ClassL in classification semantics This can be done by using NLP (Natural Language Processing) techniques 1 2 3 … (1) (3) (5) (7) (8) Programming Subjects Computers and Internet From paper: F. Giunchiglia, M. Marchese, I. Zaihrayeu. “Encoding Classifications into Lightweight Ontologies.” J. of Data Semantics VIII, Springer-Verlag LNCS 4380, pp 57-81, 2007. Java Language Java Beans 4

Link semantics A B ? A B C (b) (a) ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Get-specific principle: Child nodes in a classification are always considered in the context of their parent nodes. As a consequence they specialize the meaning of the parent nodes. Subsumption relation (a): the extension of the child node is a proper subset of the parent node. The meaning of node 2 is B. General intersection relation (b): the extension of the child node is a subset of the parent node. The meaning of node 2 is C = A ⊓ B. We generalize to (b). The meaning of the node is what we call the concept at node. 1 2 A B ? A B C (b) (a) From paper: F. Giunchiglia, M. Marchese, I. Zaihrayeu. “Encoding Classifications into Lightweight Ontologies.” J. of Data Semantics VIII, Springer-Verlag LNCS 4380, pp 57-81, 2007.

In ClassL: C4 = Ceurope ⊓ Cpictures ⊓ Citaly Concept at node ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Wine and Cheese Italy Europe Austria Pictures 1 2 3 4 5 From paper: F. Giunchiglia, M. Marchese, I. Zaihrayeu. “Encoding Classifications into Lightweight Ontologies.” J. of Data Semantics VIII, Springer-Verlag LNCS 4380, pp 57-81, 2007. In ClassL: C4 = Ceurope ⊓ Cpictures ⊓ Citaly

Document Classification ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Document concept: each document d in a classification is assigned a proposition Cd in ClassL, build from d in two steps: keywords are retrieved from d by using standard text mining techniques. keywords are converted into propositions by using the methodology discussed above to translate node labels. Automatic classification: For any given document d and its concept Cd we classify d in each node ni such that: ⊨ Cd ⊑ Ci, and there is no node nj (j ≠ i), for which ⊨ Cj ⊑ Ci and ⊨ Cd ⊑ Cj. In other words we always classify in the node with the most specific concept. From paper: F. Giunchiglia, M. Marchese, I. Zaihrayeu. “Encoding Classifications into Lightweight Ontologies.” J. of Data Semantics VIII, Springer-Verlag LNCS 4380, pp 57-81, 2007.

Query-answering ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Query-answering on a hierarchy of documents based on a query q as a set of keywords is defined in two steps: The ClassL proposition Cq is build from q by converting q’s keywords as said above. The set of answers (retrieval set) to q is defined as a set of subsumption checking problems in ClassL: Aq = {d ∈ document | T ⊨ Cd ⊑ Cq} From paper: F. Giunchiglia, M. Marchese, I. Zaihrayeu. “Encoding Classifications into Lightweight Ontologies.” J. of Data Semantics VIII, Springer-Verlag LNCS 4380, pp 57-81, 2007.

Semantic Matching: Why? ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Most popular knowledge can be represented as graphs. The heterogeneity between knowledge graphs demands the exposition of relations, such as semantically equivalent. Some popular situations that can be modeled as a matching problem are: Concept matching in semantic networks. Schema matching in distributed databases. Ontology matching (ontology “alignment”) in the Semantic Web. 26

The Matching Problem ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES Matching Problem: given two finite graphs, finds all nodes in the two graphs that syntactically or semantically correspond to each other. Given two graph-like structures (e.g., classifications, XML and database schemas, ontologies), a matching operator produces a mapping between the nodes of the graphs. Solution: A possible solution [Giunchiglia & Shvaiko, 2003], consists in the conversion of the two graphs in input into lightweight ontologies and then matching them semantically. 27

A Matching Problem ONTOLOGIES :: LIGHTWEIGHT ONTOLOGIES :: CLASSIFICATIONS :: APPLICATIONS ON LIGHTWEIGHT ONTOLOGIES ? 28