Ontologies and the Semantic Web

Slides:



Advertisements
Similar presentations
Hybrid Logics and Ontology Languages
Advertisements

Charting the Potential of Description Logic for the Generation of Referring Expression SELLC, Guangzhou, Dec Yuan Ren, Kees van Deemter and Jeff.
Mitsunori Ogihara Center for Computational Science
The Semantic Web: Ontologies and OWL Ian Horrocks and Alan Rector Summary.
Description Logic Based Ontology Languages Ian Horrocks Information Systems Group Oxford University Computing Laboratory.
Ontologies and Databases Ian Horrocks Information Systems Group Oxford University Computing Laboratory.
1 ISWC-2003 Sanibel Island, FL IMG, University of Manchester Jeff Z. Pan 1 and Ian Horrocks 1,2 {pan | 1 Information Management.
Three Theses of Representation in the Semantic Web
Knowledge Representation and Reasoning using Description Logic Presenter Shamima Mithun.
An Introduction to Description Logics
CS570 Artificial Intelligence Semantic Web & Ontology 2
Computing & Information Sciences Kansas State University Lecture 16 of 42 CIS 530 / 730 Artificial Intelligence Lecture 16 of 42 Knowledge Engineering.
SIG2: Ontology Language Standards WebOnt Briefing Ian Horrocks University of Manchester, UK.
1 Semantic Web Technologies: The foundation for future enterprise systems Okech Odhiambo Knowledge Systems Research Group Strathmore University.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Combining the strengths of UMIST and The Victoria University of Manchester A Tableaux Decision Procedure for SHOIQ Ian Horrocks and Ulrike Sattler University.
OWL: A Description Logic Based Ontology Language Ian Horrocks Information Management Group School of Computer Science University of Manchester.
Description Logic Reasoning
Ontology and Ontology-Based Applications C. Farkas Some of the slides were obtained from presentations of Ian Horrocks.
Semantic Web Tools for Authoring and Using Analysis Results Richard Fikes Robert McCool Deborah McGuinness Sheila McIlraith Jessica Jenkins Knowledge Systems.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Semantic Web and its Logical Foundations Serguei Krivov, Ecoinformatics Collaboratory Gund Institute for Ecological Economics, UVM.
How can Computer Science contribute to Research Publishing?
Semantic Web The Story So Far Ian Horrocks Oxford University Computing Laboratory.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
Semantic Web The Story So Far
OntoWeb SIG 2: Ontology Language Standards Heiner Stuckenschmidt Vrije Universiteit Amsterdam With contributions from: Ian Horrocks and Frank van Harmelen.
DL systems DL and the Web Ilie Savga
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
1 Semantic Web Mining Presented by: Chittampally Vasanth Raja 10IT05F M.Tech (Information Technology)
Ontology Reasoning: the Why and the How
Combining the strengths of UMIST and The Victoria University of Manchester Applications of Description Logics State of the Art and Research Challenges.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
DLs and Ontology Languages. ’s OWL (like OIL & DAML+OIL) based on a DL –OWL DL effectively a “Web-friendly” syntax for SHOIN i.e., ALC extended with transitive.
Semantic Web The Story So Far Ian Horrocks Oxford University Computing Laboratory.
Aidministrator nederland b.v. Adding formal semantics to the Web Jeen Broekstra, Michel Klein, Stefan Decker, Dieter Fensel,
Practical RDF Chapter 1. RDF: An Introduction
Ian Horrocks and Alan Rector
An Introduction to Description Logics. What Are Description Logics? A family of logic based Knowledge Representation formalisms –Descendants of semantic.
Knowledge representation
Applying Belief Change to Ontology Evolution PhD Student Computer Science Department University of Crete Giorgos Flouris Research Assistant.
Reasoning with Expressive Description Logics Ian Horrocks University of Manchester Manchester, UK Logical Foundations for the Semantic Web.
Ontologies for the Integration of Geospatial Data Michael Lutz Workshop: Semantics and Ontologies for GI Services, 2006 Paper: Lutz et al., Overcoming.
Dept. Computer Science, Korea Univ. Intelligent Information System Lab. 1 Sohn Jong-Soo Intelligent Information System lab. Department of Computer Science.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Presented by:- Somya Gupta( ) Akshat Malu ( ) Swapnil Ghuge ( ) Franz Baader, Ian Horrocks, and Ulrike Sattler.
An Introduction to Description Logics (chapter 2 of DLHB)
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Description Logics: Logic foundation of Semantic Web Semantic.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Strategies for Realizing the Semantic Web Ian Horrocks.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Artificial Intelligence 2004 Ontology
DAML+OIL: an Ontology Language for the Semantic Web.
OWL Representing Information Using the Web Ontology Language.
Organization of the Lab Three meetings:  today: general introduction, first steps in Protégé OWL  November 19: second part of tutorial  December 3:
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
Description Logics Dr. Alexandra I. Cristea. Description Logics Description Logics allow formal concept definitions that can be reasoned about to be expressed.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST Project Review Meeting, 11 th March, WP2: Tools Raphael Volz Universität.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Ontology Technology applied to Catalogues Paul Kopp.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Description Logic Reasoning
Ontology Engineering: Tools and Methodologies
Ontologies and Databases
CIS Monthly Seminar – Software Engineering and Knowledge Management IS Enterprise Modeling Ontologies Presenter : Dr. S. Vasanthapriyan Senior Lecturer.
Presentation transcript:

Ontologies and the Semantic Web Ian Horrocks <horrocks@cs.man.ac.uk> Information Management Group School of Computer Science University of Manchester

The Semantic Web

Today’s Web Distributed hypertext/hypermedia Information accessed via (keyword based) search and browse Browser tools render information for human consumption

What is the Semantic Web? Web was “invented” by Tim Berners-Lee (amongst others), a physicist working at CERN His vision of the Web was much more ambitious than the reality of the existing (syntactic) Web: This vision of the Web has become known as the Semantic Web “… a set of connected applications … forming a consistent logical web of data …” - Even if he didn’t invent it, TBL was a key figure in the development of the Web. - This is what he originally intended, and what he would like to see the web become. “… an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation …”

Hard Work using “Syntactic Web” Find images of Peter Patel-Schneider, Frank van Harmelen and Alan Rector… Concrete motivation for why we want the semantic web - some tasks are hard work using existing web. Rev. Alan M. Gates, Associate Rector of the Church of the Holy Spirit, Lake Forest, Illinois

Impossible (?) using “Syntactic Web” Complex queries involving background knowledge Find information about “animals that use sonar but are neither bats nor dolphins” Locating information in data repositories Travel enquiries Prices of goods and services Results of human genome experiments Finding and using “web services” Given a DNA sequence, identify its genes, determine the proteins they can produce, and hence the biological processes they control Delegating complex tasks to web “agents” Book me a holiday next weekend somewhere warm, not too far away, and where they speak either French or English , e.g., Barn Owl Workflows widely used in eScience; hard to (automatically) find relevant services Can think of final bullet as a sort of grand challenge; web equivelent of robot football

What is the Problem? Consider a typical web page: Markup consists of: rendering information (e.g., font size and colour) Hyper-links to related content Semantic content is accessible to humans, but not (easily) to computers… - Information in natural language, and even embedded in images. - And if this is a problem for web pages, what about hypermedia, databases and other web resources?

What is the (Proposed) Solution? Add semantic annotations to web resources NL annotations (possibly with rendering annotation) already associated with images (only way google can find them) Augment NL with semantic annotation. Dr. Alan Rector, Professor of Computer Science, University of Manchester Dr. <Person>Alan Rector</Person>, <Job>Professor of Computer Science</Job>, University of Manchester Rev. <Person>Alan M. Gates</Person>, <Job>Associate Rector</Job> of the Church of the Holy Spirit, Lake Forest, Illinois Rev. Alan M. Gates, Associate Rector of the Church of the Holy Spirit, Lake Forest, Illinois

What is the (Proposed) Solution? Now... that should clear up a few things around here NL annotations (possibly with rendering annotation) already associated with images (only way google can find them) Augment NL with semantic annotation.

Giving Semantics to Annotations External agreement on meaning of annotations Agree on meaning of a set of annotation tags E.g., Dublin Core Limited flexibility and extensibility Limited number of things can be expressed Use Ontologies to specify meaning of annotations Agree on language used to describe meaning Meanings of vocabularies of terms given by ontologies New terms can be formed by combining existing ones Meaning (semantics) of such terms is formally specified Can combine/relate terms in multiple ontologies Big Apple is a large fruit or a city?

Ontologies

Ontology: Origins and History In Philosophy, fundamental branch of metaphysics Studies “being” or “existence” and their basic categories Aims to find out what entities and types of entities exist “Categories”: Aristotle, 350 BC. The oldest known tree diagram was drawn in the 3rd century AD by the Greek philosopher Porphyry in his commentary on Aristotle's categories. Figure 2 shows a version of the Tree of Porphyry, as it was drawn by the 13th century logician Peter of Spain. It illustrates the subcategories under Substance, which is called the supreme genus or the most general supertype.

Ontology in Information Science An ontology is an engineering artefact consisting of: A vocabulary used to describe (a particular view of) some domain An explicit specification of the intended meaning of the vocabulary. Often includes classification based information Constraints capturing background knowledge about the domain Ideally, an ontology should: Capture a shared understanding of a domain of interest Provide a formal and machine manipulable model In Information Science, an ontology is the product of an attempt to formulate an exhaustive and rigorous conceptual schema about a domain. This domain does not have to be the complete knowledge of that topic, but purely a domain of interest decided upon by the creator of the ontology. An ontology is typically a hierarchical data structure containing all the relevant entities and their relationships and rules within that domain (e.g., a domain ontology). However, computational ontology does not have to be hierarchical at all. The computer science usage of the term ontology is derived from the much older usage of the term ontology in philosophy. An ontology which is not tied to a particular problem domain but attempts to describe general entities is known as a foundation ontology or upper ontology. Typically, more specialized domain specific schemata must be created to make the data useful for real world decisions.

Example Ontology (Protégé)

Applications of Ontologies e-Science, e.g., Bioinformatics Open Biomedical Ontologies Consortium (GO, MGED) Used e.g., for “in silico” investigations relating theory and data E.g., relating data on phosphatases to (model of) biological knowledge Graphic showing protein functional domains (sequences of amino acids?); the identifying characteristics of different proteins.

Applications of Ontologies Medicine Building/maintaining terminologies such as Snomed, NCI & Galen - Example from project to (semi-) automate the annotation of MRI images of the brain - FMA derived ontology used to capture knowledge of brain anatomy Frontal Lobe Temporal Lobe Parietal Lobe Occipital Lobe Central Sulcus Lateral Sulcus

Applications of Ontologies Organising complex and semi-structured information UN-FAO, NASA, Ordnance Survey, General Motors, Lockheed Martin, …

Applications of Ontologies Military/Government DARPA, NSA, NIST, SAIC, MoD, Department of Homeland Security, … The Semantic Web and so-called Semantic Grid

Ontology Languages Development and standardisation of (Semantic) Web ontology languages has been responsible for huge increase in the development and application of ontologies.

Ontology Languages for the Web Semantic Web effort led to development of “resource description” language(s) E.g., RDF, and later RDF Schema (RDFS) RDFS is recognisable as an ontology language Classes and properties Sub/super-classes (and properties) Range and domain (of properties) But RDFS too weak to describe resources in sufficient detail, e.g.: No existence/cardinality constraints No transitive, inverse or symmetrical properties No localised range and domain constraints … And RDF(S) has “higher order flavour” with non-standard semantics Difficult to provide reasoning support

From RDFS to OWL Two languages developed to address deficiencies & problems of RDFS: OIL: developed by group of (largely) European researchers DAML-ONT: developed by group of (largely) US researchers Efforts merged to produce DAML+OIL Development carried out by “Joint EU/US Committee on Agent Markup Languages” DAML+OIL submitted to as basis for standardisation Web-Ontology (WebOnt) Working Group formed WebOnt developed OWL language based on DAML+OIL OWL now a W3C recommendation (i.e., a standard) OIL, DAML+OIL and OWL based on Description Logics OWL is effectively a “Web-friendly” syntax for SHOIN

What Are Description Logics? A family of logic based Knowledge Representation formalisms Descendants of semantic networks and KL-ONE Describe domain in terms of concepts (classes), roles (properties, relationships) and individuals Operators allow for composition of complex concepts Names can be given to complex concepts, e.g.: Object oriented style of modelling HappyParent ´ Parent u 8hasChild.(Intelligent t Athletic) HappyParent ´ Parent u 8hasChild.(Intelligent t Athletic) HappyParent ´ Parent u 8hasChild.(Intelligent t Athletic) HappyParent ´ Parent u 8hasChild.(Intelligent t Athletic) HappyParent ´ Parent u 8hasChild.(Intelligent t Athletic)

Semantics and Reasoning Distinguished by: Formal semantics (typically model theoretic) Decidable fragments of FOL (often contained in C2) Closely related to Propositional Modal & Dynamic Logics, and to Guarded Fragment development of DLs motivated by weak semantics of existing systems, e.g., semantic networks Cat Animal IS-A has-color Black Felix Mat sits-on [Quillian, 1967]

Semantics and Reasoning Distinguished by: Formal semantics (typically model theoretic) Decidable fragments of FOL (often contained in C2) Closely related to Propositional Modal & Dynamic Logics, and to Guarded Fragment Provision of reasoning services Decision procedures for key problems (satisfiability, subsumption, etc) Implemented systems (highly optimised)

Why Description Logic? OWL exploits results of 15+ years of DL research Well defined (model theoretic) semantics

Why Description Logic? OWL exploits results of 15+ years of DL research Well defined (model theoretic) semantics Formal properties well understood (complexity, decidability) I can’t find an efficient algorithm, but neither can all these famous people. [Garey & Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. Freeman, 1979.]

Why Description Logic? OWL exploits results of 15+ years of DL research Well defined (model theoretic) semantics Formal properties well understood (complexity, decidability) Known reasoning algorithms

Pellet Why Description Logic? OWL exploits results of 15+ years of DL research Well defined (model theoretic) semantics Formal properties well understood (complexity, decidability) Known reasoning algorithms Implemented systems (highly optimised) Pellet

Why Description Logic? Foundational research was crucial to design of OWL Informed Working Group decisions at every stage, e.g.: “Why not extend the language with feature x, which is clearly harmless?” “Adding x would lead to undecidability - see proof in […]” Heath-Robinson approach to language design; big tree is undecidability.

Why the Strange Names? Description Logics are a family of KR formalisms Mainly distinguished by available operators Available operators indicated by letters in name, e.g., S : basic DL (ALC) plus transitive roles (e.g., ancestor  R+) H : role hierarchy (e.g., hasDaughter v hasChild) O : nominals/singleton classes (e.g., {Italy}) I : inverse roles (e.g., isChildOf ´ hasChild–) N : number restrictions (e.g., >2hasChild, 63hasChild) Basic DL + role hierarchy + nominals + inverse + NR = SHOIN SHOIN is the basis for W3C’s OWL Web Ontology Language SHOIN is very expressive, but still decidable (just)

Class/Concept Constructors Language is very elegant: compact, compositional, easy to understand (!?) C is a concept (class); P is a role (property); x is an individual name

Knowledge Base / Ontology A TBox is a set of “schema” axioms (sentences), e.g.: {Parent v Person u >1hasChild, HappyParent ´ Parent u 8hasChild.(Intelligent t Athletic)} An ABox is a set of “data” axioms (ground facts), e.g.: {John:HappyParent, John hasChild Mary} An OWL ontology is just a SHOIN KB

OWL RDF/XML Exchange Syntax E.g., Parent u 8hasChild.(Intelligent t Athletic): <owl:Class> <owl:intersectionOf rdf:parseType=" collection"> <owl:Class rdf:about="#Parent"/> <owl:Restriction> <owl:onProperty rdf:resource="#hasChild"/> <owl:allValuesFrom> <owl:unionOf rdf:parseType=" collection"> <owl:Class rdf:about="#Intelligent"/> <owl:Class rdf:about="#Athletic"/> </owl:unionOf> </owl:allValuesFrom> </owl:Restriction> </owl:intersectionOf> </owl:Class>

Why Ontology Reasoning? Given key role of ontologies in many applications, it is essential to provide tools and services to help users: Design and maintain high quality ontologies, e.g.: Meaningful — all named classes can have instances

Why Ontology Reasoning? Given key role of ontologies in many applications, it is essential to provide tools and services to help users: Design and maintain high quality ontologies, e.g.: Meaningful — all named classes can have instances Correct — captures intuitions of domain experts Banana split is a kind of fruit sundae?

Why Ontology Reasoning? Given key role of ontologies in many applications, it is essential to provide tools and services to help users: Design and maintain high quality ontologies, e.g.: Meaningful — all named classes can have instances Correct — captures intuitions of domain experts Minimally redundant — no unintended synonyms  Banana split Banana sundae

Why Ontology Reasoning? Given key role of ontologies in many applications, it is essential to provide tools and services to help users: Design and maintain high quality ontologies, e.g.: Meaningful — all named classes can have instances Correct — captures intuitions of domain experts Minimally redundant — no unintended synonyms Answer queries over ontology classes and instances, e.g.: Find more general/specific classes Retrieve individuals/tuples matching a given query

Research Challenges

Increasing Expressive Power Complex role inclusion axioms [Horrocks&Sattler, IJCAI-03] E.g., hasLocation ± partOf v hasLocation Concrete domains/datatypes, e.g., [Lutz, IJCAI-99; Pan et al, ISWC-03] E.g., value comparison (income > expenditure) Database style keys [Lutz et al, JAIR 2004] E.g., make + model + chassis-number is a key for Vehicles Rule language extensions First order extensions (e.g., SWRL) [Horrocks et al, JWS, 2005] Hybrid language extensions, e.g., [Eiter et al, KR-04; Motik et al, ISWC-04] LP/F-Logic/Common Logic [Chen et al, JLP, 1993; de Bruijn et al, WWW-05] Clear that OWL isn’t expressive enough for all applications; users always want/need more

Improving Scalability Optimisation techniques Improve performance of DL reasoners, e.g., [Sirin et al, KR-06] Reduction to disjunctive Datalog [Motik et at, KR-04] Transform DL ontology to DatalogÇ rules Use LP techniques to deal with large numbers of ground facts Hybrid DL-DB systems [Horrocks et al, CADE-05] Use DB to store “Abox” (individual) axioms Cache inferences and use DB queries to answer/scope logical queries Polynomial time algorithms for sub-ALC logics [Baader et al, IJCAI-05] Graph based techniques for subsumption computation

Tools and Infrastructure Editors/environments Oiled, Protégé, Swoop, Construct, Ontotrack, …

Tools and Infrastructure Editors/environments Oiled, Protégé, Swoop, Construct, Ontotrack, … Reasoning systems Cerebra, FaCT++, Kaon2, Pellet, Racer, … Pellet

Tools and Infrastructure Editors/environments Oiled, Protégé, Swoop, Construct, Ontotrack, … Reasoning systems Cerebra, FaCT++, Kaon2, Pellet, Racer, … Non-standard inferences Explanation, matching, least common subsumer, …

Tools and Infrastructure Editors/environments Oiled, Protégé, Swoop, Construct, Ontotrack, … Reasoning systems Cerebra, FaCT++, Kaon2, Pellet, Racer, … Non-standard inferences Explanation, matching, least common subsumer, … Design methodologies Foundational ontologies, modularisation, etc. Entity Substantial Quality Event Achievement Stative Accomplishment Perdurant Endurant

Summary Semantic Web aims to make web content more accessible to automated processes Adds semantic annotations to web resources Ontologies provide vocabulary for annotations Terms have well defined meaning OWL ontology language based on (description) logic Exploits results of basic research on complexity, reasoning, etc. Many research challenges remain Including expressive power, scalability and tools

Acknowledgements Thanks to my many friends in the DL and Semantic Web communities, in particular: Alan Rector Franz Baader Uli Sattler

Resources FaCT++ system (open source) Protégé http://owl.man.ac.uk/factplusplus/ Protégé http://protege.stanford.edu/plugins/owl/ W3C Web-Ontology (WebOnt) working group (OWL) http://www.w3.org/2001/sw/WebOnt/ DL Handbook, Cambridge University Press http://books.cambridge.org/0521781760.htm

Thank you for listening Any questions?