Ontology Quality and the Semantic Web Chris Welty IBM Watson Research Center.

Slides:



Advertisements
Similar presentations
May 23, 2004OWL-S straw proposal for SWSL1 OWL-S Straw Proposal Presentation to SWSL Committee May 23, 2004 David Martin Mark Burstein Drew McDermott Deb.
Advertisements

Ontology-Driven Conceptual Modeling Chris Welty IBM Watson Research Center.
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Re-learning Ontology Management for the Web Chris Welty IBM Research.
Jim Hendler Chief Scientist - Information Systems Office DARPA.
The Internet and the Web
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 1.1 Chapter 1 : Introduction The World-Wide-Web.
Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 4 Shieber 1993; van Deemter 2002.
Configuration management
KR-2002 Panel/Debate Are Upper-Level Ontologies worth the effort? Chris Welty, IBM Research.
Ontologies: Dynamic Networks of Formally Represented Meaning Dieter Fensel: Ontologies: Dynamic Networks of Formally Represented Meaning, 2001 SW Portal.
Structure of The World Wide Web From “Networks, Crowds and Markets” Chapter 13 Eyal Feder Nov, 14.
Semantic Web Thanks to folks at LAIT lab Sources include :
CS570 Artificial Intelligence Semantic Web & Ontology 2
XHTML Basics.
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
Ontological Analysis of Taxonomic Relationships Nicola Guarino, LADSEB-CNR,Italy Chris Welty, Vassar College, USA Thanks to: Bill Andersen, Pierdaniele.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
NIST Foundations of Ontological Analysis Chris Welty, Vassar College.
IBM T.J. Watson Research Center Foundations of Ontological Analysis Chris Welty, Vassar College.
How I was right, even when I was wrong Chris Welty IBM Research.
The Architecture Design Process
Four Dark Corners of Requirements Engineering
Formal Ontology and Information Systems Nicola Guarino (FOIS’98) Presenter: Yihong Ding CS652 Spring 2004.
* The basic components of a web site are: * Content – information displayed or accepted from users * Static – content that doesn’t change for different.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Query Relevance Feedback and Ontologies How to Make Queries Better.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Chapter 6 Understanding Each Other CSE 431 – Intelligent Agents.
Ontologies and Classifications
Programming the Web Web = Computer Network + Hypertext.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Applying Rigidity to Standardizing OBO Foundry Candidate Ontologies A.Patrice Seyed and Stuart C. Shapiro Department of Computer Science Center for Cognitive.
Chapter 6 Understanding Each Other CSE 431 – Intelligent Agents.
The INTERNET how it works. the internet: defined So, what is it?
Evaluating Ontological Decisions with OntoClean Chris Welty, Vassar College, USA Nicola Guarino, LADSEB-CNR, Italy.
1 Creating Web Pages Part 1. 2 OVERVIEW: HTML-What is it? HyperText Markup Language, the authoring language used to create documents on the World Wide.
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
Semantic Web - an introduction By Daniel Wu (danielwujr)
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
SWAP-07 Ontology Engineering with OntoClean Chris Welty IBM Watson Research Center.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
OWL Representing Information Using the Web Ontology Language.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Mariano Fernández López &Asunción Gómez Pérez The integration of OntoClean in WebODE Mariano Fernández-López Asunción Gómez-Pérez Artificial Intelligence.
Some Thoughts to Consider 8 How difficult is it to get a group of people, or a group of companies, or a group of nations to agree on a particular ontology?
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
Hypertext. Hypertext History (1) Many early attempts to organize human knowledge Many early attempts to organize human knowledge Thesaurus (Roget) Thesaurus.
Ontologies, Conceptualizations, and Possible Worlds Revisiting “Formal Ontologies and Information Systems” 10 years later Nicola Guarino CNR Institute.
Ontology and the lexicon Nicola Guarino and Christopher A. Welty(2004). An Overview of OntoClean Weber ( 張澄清 ) 2014/04/23 1.
Web Design Principles 5 th Edition Chapter 3 Writing HTML for the Modern Web.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
Ontology Evaluation Outline Motivation Evaluation Criteria Evaluation Measures Evaluation Approaches.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
Computer and Internet Basics
The Semantic Web By: Maulik Parikh.
COMP6215 Semantic Web Technologies
Lecture 1: Introduction and Multimedia Data Representations
Knowledge Representation Part II Description Logic & Introduction to Protégé Jan Pettersen Nytun.
Building the Semantic Web
Warm Handshake with Websites, Servers and Web Servers:
Conceptual Modeling and Ontological Analysis
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Introduction to World Wide Web
WEB & HTML Background Info.
An Introduction to HTML Pages
Presentation transcript:

Ontology Quality and the Semantic Web Chris Welty IBM Watson Research Center

Outline Welcome, opening joke History of web and hypertext Semantic Web overview Ontology Engineering and Quality Summary and Closing joke

History of Hypertext 1945: Vannevar Bushs Memex –Associative Indexing and links 1965: Ted Nelson coins hypertext –Nonsequential writing 1967: Andries van Dams Hypertext Editing System (sponsored by IBM). 1985: Janet Walkers Symbolics Document Examiner 1987: Bill Atkinsons Hypercard on the Mac 1991: Tim Berners-Lee proposes HTTP, HTML, & URL –Genesis c : Mark Andreesen releases Mosaic for Mac, Unix, Windows…

Hypertext Research Dating back at least to the late 60s Many foci –Technology (mouse, software, protocols) –User interaction –Aesthetic –Post-modern –Engineering Largely ignored by web developers –Especially in the early days of the web (93-96)

Grassroots to the Web Early web dominated by what it looks like in Mosaic Focus on spreading the word, not doing it right Many early web pages didnt have links in text at all –Catalog pages with lists of links –Text pages with few or no links –Embedded images more interesting than links Just do it rather than do it right But… –When the web became serious, the research started to matter

Semantic Web Defined, to date, by RDF and OWL Genesis c Still in the early days –Faster adoption (so far) than early web –FOAF the most widely used SW Ontology Agent Person Organization Group DocumentImage

Ontology Research Dating back… Multiple foci –Technology (logics, reasoners…) –Meta-physics (what there is) –Knowledge Acquisition –NLP –Engineering Largely ignored by SW developers –Web 2.0, groundswell –Specifically criticized by some SW pundits

A little semantics… The SW catchphrase –A little semantics goes a long way Sometimes strengthened –A lot of semantics is too much –80/20 rule Double-edged sword –FOAF doesnt look like even 1% –The simplicity of FOAF hides any serious value proposition for SW –SW not for people, for data –Important to get it right?

Some evidence Does quality matter? Good quality ontologies cost more –Required for some applications Improvements in quality can improve performance [Welty, et al, 2004] –18% f-improvement in search –Cleanup cost ~1mw/3000 classes –BUT … low quality ontology still improved base

Dimensions of Quality Coverage, correctness, richness, commitment [Kashyap, 2003] Organization, modularity [Rector, 2002] Relation to reality [Smith & Welty, 2001] Making meaning clear [Guarino, 1998] Meta-level consistency [Guarino & Welty, 2000] Captures the invariant structure of the domain [Welty & Guarino, 2001]

Making Meaning Clear Part-of relates parts to their wholes –E.g. part-of(engine,car) Part-of is irreflexive Part-of is anti-symmetric Nothing can have only one part

Reduction of unintended models Generally, involves more axioms Typically requires negation –Disjointness Positive axioms –Also makes meaning clear, e.g. Clear significance for ontology alignment Mammal Horse Chess Piece Horse

Meta-Level Consistency with OntoClean Identity Unity Rigidity Dependence Actuality Permanence Note on terminology: property is a unary relation (aka class), meta-property is a property of a class

Identity The foundation of ontology, conceptual analysis, etc The criteria under which equivalence is determined –Or under which difference is determined Already accepted practice in RDBs, OOP When you conceive of a class, ask What makes each instance unique? –Note for SW: uniqueness not assumed Meta-property –Is there an identity criterion for this class (+I) –Not always productive to specify the precise condition Esp. if this results in artificial attributes –-I +I

Unity Criteria An object x is a whole under iff is an equivalence relation that binds together all the parts of x, such that P(y,x) (P(z,x) y,z)) but not y,z) x(P(y,x) P(z,x)) P is the part-of relation can be seen as a generalized indirect connection

Unity Meta-Properties If all instances of a property are wholes under the same relation it carries unity (+U) When at least one instance of a property is not a whole, or when two instances are wholes under different relations, it does not carry unity (-U) When no instance of a property is a whole, it carries anti- unity (~U) -U +U +U ~U

Rigidity An essential property of an entity is a property that must necessarily (always) hold A rigid property is a property that is essential to all possible instances (+R) A non-rigid property is a property that is not rigid (-R) An anti-rigid property is a property that is not essential to all possible instances (~R) +R ~R

Formal Rigidity is rigid (+R): x (x) (x) –e.g. Person, Apple is non-rigid (-R): x (x) ¬ (x) –e.g. Red, Male is anti-rigid (~R): x (x) ¬ (x) –e.g. Student, Agent (what about time?)

Rigidity Constraint +R ~R Why? x P(x) Q(x) Q ~R P +R O10

Which one is better? Computer has-part Memory Disk Drive Computer Part Memory Part Disk Part Computer Part Disk DriveMemory Computer has-part Due to: Guizzardi, et al, I~R-U +I+R+U+I+R~U -I~R-U +I+R+U +I~R~U+I~R-U +I+R~U

Ontology Alignment Most automatic alignment tools would say yes Lets take a closer look Food Apple Food Apple Caterpillar Are these the same?

Ontology Alignment Different meta-properties for Food Different intended meaning Should not be aligned Meta-level analysis helps make meaning more clear Food Apple Food Apple Caterpillar +I~U+D~R +I+U-D+R

A formal ontology of properties Property Non-sortal -I Role ~R+D Sortal +I Formal Role Attribution -R-D Category +R Mixin -D Type +O Quasi-type -O Non-rigid -R Rigid +R Material role Anti-rigid ~R Phased sortal -D +L

The Backbone Taxonomy Assumption: no entity without identity Quine, 1969 Since identity is supplied by types, every entity must instantiate a type The taxonomy of types spans the whole domain Together with categories, types form the backbone taxonomy, which represents the invariant structure of a domain (rigid properties spanning the whole domain)

Entity Physical object Amount of matter Group Organization Location Living being Person Animal Social entity Agent Apple Fruit Food Legal agent Group of people Red apple Red Vertebrate CaterpillarButterfly Country Geographical Region Lepidopteran

Entity Physical object Amount of matter Group Organization Location Living being Person Animal Social entity Apple Fruit Group of people Vertebrate Country Geographical Region Lepidopteran

Upper-Level Backbone The upper level backbone accounts for 5% of an ontology and spans the domain In empirical work, this is the most important layer [Fan et al, 2003] Some value in providing upper level ontologies to establish the basic distinctions

Backbone of quality Conjecture: the primary purpose of an ontology is to specify the backbone taxonomy, which is the invariant structure of the domain Bad ontologies: –folksonomies, –Subject hierarchies –Thesauri

Summary Good ontologies should: –Clarify meaning Add constraints to eliminate unintended models –Have clear identity criteria –Have consistent meta-level properties –Specify the invariant structure of a domain

Use OntoClean for all your ontology cleaning needs!