Ontology-Driven Information Retrieval Nicola Guarino Laboratory for Applied Ontology Institute for Cognitive Sciences and Technology (ISTC-CNR) Trento-Roma,

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Towards a Reference Library of Upper Ontologies the DOLCE point of view Nicola Guarino Head, Laboratory for Applied Ontology Institute for Cognitive Sciences.
Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
Ontological Resources and Top-Level Ontologies Nicola Guarino LADSEB-CNR, Padova, Italy
KR-2002 Panel/Debate Are Upper-Level Ontologies worth the effort? Chris Welty, IBM Research.
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. The Future is Now JeromeDL A Digital Library on Social Semantic.
Advanced Information Systems Laboratory Department of Computer Science and Systems Engineering GI-DAYS MÜNSTER A software tool.
Chapter 6: Modeling and Representation Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Ontology-Driven Conceptual Modelling Speaker:. Harmonization Meeting What is Ontology?  A discipline of Philosophy  Meta-physics dates.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Ontology From Wikipedia, the free encyclopedia In philosophy, ontology (from the Greek oν, genitive oντος: of being (part. of εiναι: to be) and –λογία:
Basics of Knowledge Management ICOM5047 – Design Project in Computer Engineering ECE Department J. Fernando Vega Riveros, Ph.D.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Information and Business Work
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Formal Ontology and Information Systems Nicola Guarino (FOIS’98) Presenter: Yihong Ding CS652 Spring 2004.
Infomaster: An information Integration Tool O. M. Duschka and M. R. Genesereth Presentation by Cui Tao.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Classifying (Medical) Ontologies Stefano Borgo Laboratory for Applied Ontology (LOA) Institute for Cognitive Sciences and Technology (ISTC-CNR) Trento-Roma,
The role of ontologies for the Semantic Web (and beyond)
Some facets of knowledge management in mathematics Wolfram Sperber (Zentralblatt Math) Patrick Ion (Math Reviews) Facets of Knowledge Organization A tribute.
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.
Jon Atle GullaSpråkteknologi og innovasjon1 Språkteknologi i industrielle anvendelser Or: How we have commercialized linguistic technologies 1. Linguistics.
Query Expansion.
Ontology Development in the Sciences Some Fundamental Considerations Ontolytics LLC Topics:  Possible uses of ontologies  Ontologies vs. terminologies.
Ontologies and Classifications
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
The Cognitive Perspective in Information Science Research Anthony Hughes Kristina Spurgin.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
110/19/2015CS360 AI & Robotics AI Application Areas  Neural Networks and Genetic Algorithms  These model the structure of neurons in the brain  Humans.
Taken from Schulze-Kremer Steffen Ontologies - What, why and how? Cartic Ramakrishnan LSDIS lab University of Georgia.
1 What is an Ontology? n No exact definition n A tool to help organize knowledge n Or a way to convey a theory on how to represent a class of things n.
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
02. November 2007 Florian Probst Data and Knowledge Modelling for the Geosciences - Chris Date Seminar - e-Science Institute, Edinburgh Semantic Reference.
Wordnet - A lexical database for the English Language.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Text Analytics Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
DOLCE and its comparison with SUMO
Public Access and Spatial Metadata Values: Semantic Network Services Response to EU Directives Maria Rüther Federal Environment Agency,
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
What Does the User Really Want ? Relevance, Precision and Recall.
Towards a Reference Library of Upper Ontologies the DOLCE point of view Nicola Guarino Head, Laboratory for Applied Ontology Institute for Cognitive Sciences.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Ontologies, Conceptualizations, and Possible Worlds Revisiting “Formal Ontologies and Information Systems” 10 years later Nicola Guarino CNR Institute.
Chapter 6. Inference beyond the index 2007 년 1 월 30 일 부산대학교 인공지능연구실 김민호 Text : FINDING OUT ABOUT Page. 182 ~ 251.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Selecting Relevant Documents Assume: –we already have a corpus of documents defined. –goal is to return a subset of those documents. –Individual documents.
The Agricultural Ontology Server (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Food and Agriculture Organization.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
Theoretical Perspectives: Information, Language and Cognition Week 14 Lecture notes INF 380E: Perspectives on Information Spring
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
COMP6215 Semantic Web Technologies
Knowledge Representation Part I Ontology
ece 627 intelligent web: ontology and beyond
Formalizations of Commonsense Psychology
Artificial Intelligence and Lisp Lecture 13 Additional Topics in Artificial Intelligence LiU Course TDDC65 Autumn Semester,
Federated & Meta Search
Presentation transcript:

Ontology-Driven Information Retrieval Nicola Guarino Laboratory for Applied Ontology Institute for Cognitive Sciences and Technology (ISTC-CNR) Trento-Roma, Italy ICDL 2004, New Dehli, India

Summary The role of ontologies in (generalized) IR Semantic Matching Increasing precision Increasing recall Problems with WordNet The role of Foundational Ontologies

The key problem Semantic matching

Simple queries: need more knowledge about what the user wants Search for “Washington” (the person) –Google: 26,000,000 hits –45th entry is the first relevant –Noise: places Search for “George Washington” –Google: 2,200,00 hits –3rd entry is relevant –Noise: institutions, other people, places

Solution: ontology+semantic markup Ontology –Person George Washington George Washington Carver –Place Washington, D.C. –Artifact George Washington Bridge –Organization George Washington University Semantic disambiguation/markup of questions and corpora –What Washington are you talking about?

The role of taxonomy and lexical knowledge Search for “Artificial Intelligence Research” –Misses subfields of the general field –Misses references to “AI” and “Machine Intelligence” (synonyms) –Noise: non-research pages, other fields…

Solution: Extra knowledge –Ontology: Sub-fields (of AI) Knowledge Representation Machine Vision etc. Neural networks –Lexicon: Synonyms (for AI) Artificial Intelligence Machine Intelligence Techniques –Query Expansion Add disjuncted sub-fields to search Add disjuncted synonyms to search –Semantic Markup of question and corpora Add “general terms” (categories) Add “synonyms”

Ontology-driven search engines Idealized view –Ontology-driven search engines act as virtual librarians Determine what you “really mean” Discover relevant sources Find what you “really want” Requires common knowledge on all ends –Semantic linkage between questioning agent, answering agent and knowledge sources Hence the “Semantic Web”?

Two main roles of ontologies in IT Semantic Interoperability –Generalized database integration –Virtual Enterprises, Concurrent Engineering –e-commerce –Web services Information Retrieval –Documents –Facts (query answering) –Products –Services [Not mentioning IS analysis and design…]

The role of linguistic ontologies coupled with structured representation formalisms Why not just simple thesauri based on fixed keyword hierarchies? –Data’s intrinsic dynamics needs to keep track of new terms –Need of understanding a rigid set of terms –Heterogeneous descriptions need a broad-coverage vocabulary Linguistic ontologies with structured representation formalisms –Decouple user vocabulary from data vocabulary –Increase recall (synonyms, generalizations) –Increase precision (disambiguation, ontology navigation) –Further increase precision (by capturing the structure of queries and data)

Using WordNet as an ontology Unclear semantic interpretation of hyperonimy Instantiation vs. subsumption Object-level vs. meta-level Hyperonymy used to account for polysemy (law both a document and a rule) Unclear taxonomic structure Glosses not consistent with taxonomic structure Heterogeneous leves of generality Formal constraints violations (especially concerning roles) Polysemous use of antonymy (child/parent vs. daughter/son) Poor ontology of adjectives and qualities Shallow taxonomy of verbs

When subtle distinctions are important “Trying to engage with too many partners too fast is one of the main reasons that so many online market makers have foundered. The transactions they had viewed as simple and routine actually involved many subtle distinctions in terminology and meaning” Harvard Business Review, October 2001

Ontologies and intended meaning Models M D (L) Language L Commitment: K = Conceptualization Ontology Intended models I K (L) Interpretation

Levels of Ontological Precision Ontological precision Axiomatized theory Glossary Thesaurus Taxonomy DB/OO scheme tennis football game field game court game athletic game outdoor game Catalog game athletic game court game tennis outdoor game field game football game NT athletic game NT court game RT court NT tennis RT double fault game(x)  activity(x) athletic game(x)  game(x) court game(x)  athletic game(x)   y. played_in(x,y)  court(y) tennis(x)  court game(x) double fault(x)  fault(x)   y. part_of(x,y)  tennis(y)

Ontology Quality: Precision and Coverage Low precision, max coverage Less good Good High precision, max coverage BAD Max precision, low coverage WORSE Low precision and coverage

I A (L) M D (L) I B (L) Why precision is important False agreement!

DOLCE a Descriptive Ontology for Linguistic and Cognitive Engineering Strong cognitive bias: descriptive (as opposite to prescriptive) attitude Emphasis on cognitive invariants Categories as conceptual containers: no “deep” metaphysical implications wrt “true” reality Clear branching points to allow easy comparison with different ontological options Rich axiomatization

DOLCE’s basic taxonomy Endurant Physical Amount of matter Physical object Feature Non-Physical Mental object Social object … Perdurant Static State Process Dynamic Achievement Accomplishment Quality Physical Spatial location … Temporal Temporal location … Abstract Quality region Time region Space region Color region …

Foundational ontologies and ontological analysis Domain ontologies –Physical objects –Information and information processing –Social interaction –Ontology of legal and financial entities Ontology, language, cognition Ontology-driven information systems –Ontology-driven conceptual modeling –Ontology-driven information access –Ontology-driven information integration Research priorities at LOA-CNR