Advanced OWL tutorial 2005 Ontology Normalisation, Pre- and Post- Coordination Alan Rector BioHealth Informatics Group.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
OWL - DL. DL System A knowledge base (KB) comprises two components, the TBox and the ABox The TBox introduces the terminology, i.e., the vocabulary of.
BioHealth Informatics Group Feb 2005Ontology tutorial, © 2005 Univ. of Manchester1 Formal Modelling Alan Rector.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
CO-ODE/HyOntUse JISC/EPSRC 1 Why I need both OWL/DLs & Frames Alan Rector Medical Informatics Group Bio Health Informatics Forum Department of Computer.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Chapter 8: Web Ontology Language (OWL) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
The Semantic Web – WEEK 5: RDF Schema + Ontologies The “Layer Cake” Model – [From Rector & Horrocks Semantic Web cuurse]
Software Issues Derived from Dr. Fawcett’s Slides Phil Pratt-Szeliga Fall 2009.
Business Domain Modelling Principles Theory and Practice HYPERCUBE Ltd 7 CURTAIN RD, LONDON EC2A 3LT Mike Bennett, Hypercube Ltd.
1 Foundations of the Semantic Web: Ontology Engineering Building Ontologies 1 Alan Rector & colleagues.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
1 Ontologies, Clinical and Genomic Information How to say what we mean and mean what we say Opportunities & Pitfalls Alan Rector, Jeremy Rogers, Chris.
Aidministrator nederland b.v. Adding formal semantics to the Web Jeen Broekstra, Michel Klein, Stefan Decker, Dieter Fensel,
Knowledge Representation Ontology are best delivered in some computable representation Variety of choices with different: –Expressiveness The range of.
The Mapping Problem: How do experimental biological models relate to each other, and how can dynamic computational models be used to link them? Gary An,
Primary funding is provided by the JISC and ESRC. Based at Manchester Computing, The University of Manchester. 1 ‘The Famous 5’ Worked Examples from MIMAS.
Developing Biomedical Ontologies in OWL Alan Rector School of Computer Science / Northwest Institute of Bio-Health Informatics with.
Developing Biomedical Ontologies in OWL Alan Rector School of Computer Science / Northwest Institute of Bio-Health Informatics with.
Protege OWL Plugin Short Tutorial. OWL Usage The world wide web is a natural application area of ontologies, because ontologies could be used to describe.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Ontologies for the Integration of Geospatial Data Michael Lutz Workshop: Semantics and Ontologies for GI Services, 2006 Paper: Lutz et al., Overcoming.
BioHealth Informatics Group Advanced OWL Tutorial 2005 Ontology Engineering in OWL Alan Rector & Jeremy Rogers BioHealth Informatics Group.
OWL and SDD Dave Thau University of Kansas
© University of Manchester 1 Ontology Normalisation, Pre- and Post- Coordination Alan Rector & CO-ODE/NIBHI University of Manchester
Manchester Medical Informatics Group OpenGALEN 1 Linking Formal Ontologies: Scale, Granularity and Context Alan Rector Medical Informatics Group, University.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
OWL 2 in use. OWL 2 OWL 2 is a knowledge representation language, designed to formulate, exchange and reason with knowledge about a domain of interest.
11 Chapter 11 Object-Oriented Databases Database Systems: Design, Implementation, and Management 4th Edition Peter Rob & Carlos Coronel.
BioHealth Informatics Group Ontology Tutorial, © 2005 Univ. of Manchester1 Informal Modelling Robert Stevens.
Review of OWL for Biomedicine Alan Rector & CO-ODE/NIBHI University of Manchester OpenGALEN BioHealth Informatics Group © University.
Developing Biomedical Ontologies in OWL Alan Rector School of Computer Science / Northwest Institute of Bio-Health Informatics with.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
BioHealth Informatics Group A Practical Introduction to Ontologies & OWL Session 2: Defined Classes and Additional Modelling Constructs in OWL Nick Drummond.
Sommerville 2004,Mejia-Alvarez 2009Software Engineering, 7th edition. Chapter 8 Slide 1 System models.
BioHealth Informatics Group Ontology tutorial, © 2005 Univ. of Manchester1 Formal Modelling Alan Rector.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
Advanced topics in software engineering (Semantic web)
Authoring: In and Out of the Real World MesMuses 2003 Andy Dingley Codesmiths
Primary funding is provided by the JISC and ESRC. Based at Manchester Computing, The University of Manchester. 1 1 Creating a Metadatabase for MIMAS Services.
Based on “A Practical Introduction to Ontologies & OWL” © 2005, The University of Manchester A Practical Introduction to Ontologies & OWL Session 2: Defined.
OilEd An Introduction to OilEd Sean Bechhofer. Topics we will discuss Basic OilEd use –Defining Classes, Properties and Individuals in an Ontology –This.
Melanie Feinberg, Spring 2010 Organizing Information 7 statements.
Ontology Engineering Lab #3 – September 16, 2013.
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Lexically Suggest, Logically Define: QA of Qualifiers &
Ontologies for the Integration of Geospatial DataTU Wien, April 24-28, 2006 Ontologies for the Integration of Geospatial Data Michael Lutz Semantics and.
ONTOLOGY ENGINEERING Lab #3 – September 15,
Of 38 lecture 6: rdf – axiomatic semantics and query.
Description Logics Dr. Alexandra I. Cristea. Description Logics Description Logics allow formal concept definitions that can be reasoned about to be expressed.
zoo.ox.ac.uk © David Shotton, 2007 David Shotton Image BioInformatics Research Group Oxford e-Research Centre and Department of.
ece 627 intelligent web: ontology and beyond
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Constructing an Argument Definitions Distinctions Conceptual Analyses Thought Experiments.
Ontology Technology applied to Catalogues Paul Kopp.
Of 24 lecture 11: ontology – mediation, merging & aligning.
1 Letting the classifier check your intuitions Existentials, Universals, & other logical variants Some, Only, Not, And, Or, etc. Lab exercise - 3b Alan.
16 April 2011 Alan, Edison, etc, Saturday.. Knowledge, Planning and Robotics 1.Knowledge 2.Types of knowledge 3.Representation of knowledge 4.Planning.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Hierarchical Clustering
ece 627 intelligent web: ontology and beyond
Lab exercise - 3a Alan Rector & colleagues
Measuring Social Life: How Many? How Much? What Type?
ece 720 intelligent web: ontology and beyond
Constructing an Argument
The Gene Ontology: an evolution
Ontology-Based Approaches to Data Integration
Information Networks: State of the Art
University of Manchester
Presentation transcript:

Advanced OWL tutorial 2005 Ontology Normalisation, Pre- and Post- Coordination Alan Rector BioHealth Informatics Group

Advanced OWL tutorial 2005 When to use a classifier 1.At author time: As a compiler – Pre-coordination ►Ontologies will be delivered as “pre-coordinated” ontologies to be used without a reasoner ►To make extensions and additions quick, easy, and responsive, distri but developments, empower users to make changes ►Part of an ontology life cycle 2.At delivery time: As a service: - Post-coordination ►Many fixed ontologies are too big and too small ►Too big to find things; too small to contain what you need ►Create them on the fly ►Part of an ontology service 3.At application time: as a reasoner - Post- coordination & inference ►Decision support, query optimisation, schema integration, …, …, … ►Part of a reasoning service

Advanced OWL tutorial 2005 When to use a classifier 1: Pre-coordinated delivery classifier as compiler ►The life cycle ►Gather requirements, sketch, experiment ►Establish patterns – design a “language” ►Criteria for success: What a subject domain expert can learn in a few days ►Bulk authoring ►Classification ►Quality assurance ►Commit classifier results to a pre-coordinated ontology & deliver ►Polyhierarchies (Protégé, DAG-Edit, OWL-Lite, RDF(S), Topic Maps, … ►Query and use with you favourite tool Development & evolution

Advanced OWL tutorial 2005 Commit Results to a Pre- Coordinated Ontology Assert (“Commit”) changes inferred by classifier

Advanced OWL tutorial 2005 Precoordinated Ontologies ►Author ►Classify ►Export to OWL-Lite or RDF ►Query with RDQL, SparQL, or your favourite RDF/RDF(S) query tool ►Use as a vocabulary in your database

Advanced OWL tutorial 2005 The problem But we want to ►Build ontologies cooperatively with different groups ►Extend ontologies smoothly ►Re-use pieces of ontologies ►Build new ontologies on top of old ►Quit starting from scratch Knowledge is Big, Fractal & Changeable!

Advanced OWL tutorial 2005 Assertion: ►Let the ontology authors ►create discrete modules ►describe the links between modules ►Let the logic reasoner ►Organise the result The arrival of logic-based ontologies/OWL gives new opportunities to make ontologies more manable and modular

Advanced OWL tutorial 2005 Logic-based Ontologies: Conceptual Lego hand extremity body acute chronic abnormal normal ischaemic deletion bacterial polymorphism cell protein gene infection inflammation Lung expression

Advanced OWL tutorial 2005 Logic-based Ontologies: Conceptual Lego “ SNPolymorphism of CFTRGene causing Defect in MembraneTransport of Chloride Ion causing Increase in Viscosity of Mucus in CysticFibrosis …” “Hand which is anatomically normal”

Advanced OWL tutorial 2005 Logical Constructs build complex concepts from modularised primitives Genes Species Protein Function Disease Protein coded by gene in humans Function of Protein coded by gene in humans Disease caused by abnormality in Function of Protein coded by gene in humans Gene in humans

Advanced OWL tutorial 2005 Normalising (untangling) Ontologies Structure Function Part-whole Structure Function Part-whole

Advanced OWL tutorial 2005 Rationale for Normalisation ►Explicit distinctions between modules ►Primitives are opaque to the reasoner ►Information implicit in primitive names cannot contribute to modularisation ►Primitives are indivisible to both human and reasoner ►Each primitive should represent a single notion ►Therefore, each primitive must belong to exactly one module ►If a primitive belongs to two modules, they are not modular. ►If a primitive belongs to two modules, it probably conflates two notions ►Therefore concentrate on the “primitive skeleton” of the domain ontology

Advanced OWL tutorial 2005 Normalisation and Untangling Let the reasoner do multiple classification ►Tree ►Everything has just one parent ►A ‘strict hierarchy’ ►Directed Acyclic Graph (DAG) ►Things can have multiple parents ►A ‘Polyhierarchy’ ►Normalisation ►Separate primitives into disjoint trees ►Link the trees with restrictions ►Fill in the values

Advanced OWL tutorial 2005 Tables are easier to manage than DAGs / Polyhierarchies …and get the benefit of inference: Grass and Leafy_plants are both kinds of Plant

Advanced OWL tutorial 2005 Remember to add any closure axioms Closure Axiom Then let the reasoner do the work

Advanced OWL tutorial 2005 Normalisation: From Trees to DAGs Before classification A tree After classification A DAG Directed Acyclic Graph

Advanced OWL tutorial 2005 Untangling and Enrichment Using a classifier to make life easier Substance - Protein - - ProteinHormone Insulin - Steroid - - SteroidHormone Cortisol - Hormone - -ProteinHormone Insulin - - SteroidHormone Cortisol - Catalyst - - Enzyme ATPase - PhsioloicRole - - HormoneRole - - CatalystRole - Substance - - Protein Insulin ATPase - Steroid - - Cortisol Hormone  Substance & playsRole - someValuesFrom HormoneRole ProteinHormone  Protein & playsRole someValuesFrom HormoneRole SteroidHomone  Steroid & playsRole someValuesFrom HormoneRole Catalyst  Substance & playsRole someValuesFrom CatalystRole Enzyme  Protein & playsRole someValuesFrom CatalystRole Insulin  playsRole someValuesFrom HormoneRole Cortisol  playsRole someValuesFrom HormoneRole ATPase  playsRole someValuesFrom CatalystRole Substance - Protein - - ProteinHormone Insulin - - Enzyme ATPase - Steroid - - SteroidHomone^ Cortisol -Hormone - - ProteinHormone^ Insulin^ - - SteroidHormone^ Cortisol^ - Catalyst - - Enzyme^ ATPase^

Advanced OWL tutorial 2005 The original tangled ontology

Advanced OWL tutorial 2005 Modularised into structure and function ontologies (all primitive)

Advanced OWL tutorial 2005 Unified by an ontology that imports both structure and function ontologies and adds linking definitions Before classification - primitives in yellow - defined in orange; prefixes indicate imported ontology

Advanced OWL tutorial 2005 Definition of Hormone

Advanced OWL tutorial 2005 Unified ontology after classification

Advanced OWL tutorial 2005 Normalisation: Criterion 1 The skeleton should consist of disjoint trees ►Every primitive concept should have exactly one primitive parent ►All multiple hierarchies the result of inference by reasoner

Advanced OWL tutorial 2005 Normalisation Criterion 2: No hidden changes of meaning ►Each branch should be homogeneous and logical (“Aristotelian”) ►Hierarchical principle should be subsumption ►Otherwise we are “lying to the logic” ►The criteria for differentiation should follow consistent principles in each branch eg. structure XOR function XOR cause

Advanced OWL tutorial 2005 A Non-homogeneous taxonomy “ On those remote pages it is written that animals are divided into: a. those that belong to the Emperor b. embalmed ones c. those that are trained d. suckling pigs e. mermaids f. fabulous ones g. stray dogs h. those that are included in this classification i. those that tremble as if they were mad j. innumerable ones k. those drawn with a very fine camel's hair brush l. others m. those that have just broken a flower vase n. those that resemble flies from a distance" From The Celestial Emporium of Benevolent Knowledge, Borges

Advanced OWL tutorial 2005 Normalisation Criterion 3 Distinguish “Self-standing” and “Refining” Concepts ►Self-standing concepts ►Roughly Welty & Guarino’s “sortals” ►person, idea, plant, committee, belief,… ►Refining concepts – depend on self-standing concepts ►mild|moderate|severe, hot|cold, left|right,… ►Roughly Welty & Guarino’s non-sortals ►Closely related to Smith’s “fiat partitions” ►Usefully thought of as Value Types by engineers ►For us an engineering distinction…

Advanced OWL tutorial 2005 Normalisation Criterion 3a Self-standing primitives should be globally disjoint & open ►Primitives are atomic ►If primitives overlap, the overlap conceals implicit information ►A list of self-standing primitives can never be guaranteed complete ►How many kinds of person? of plant? of committee? of belief? ►Can’t infer: Parent & ¬sub 1 &…& ¬sub n-1  sub n ►Heuristic: ►Diagnosis by exclusion about self-standing concepts should NOT be part of ‘standard’ ontological reasoning

Advanced OWL tutorial 2005 Normalisation Criterion 3b Refining primitives should be locally disjoint & closed ►I ndividual values must be disjoint ►but can be hierarchical ►e.g. “very hot”, “moderately severe” ►Each list can be guaranteed to be complete ►Can infer Parent & ¬sub 1 &…& ¬sub n-1  sub n ►Value types themselves need not be disjoint ►“being hot” is not disjoint from “being severe” ►Allowing Valuetypes to overlap is a useful trick, e.g. ►restriction has_state someValuesFrom (severe and hot)

Advanced OWL tutorial 2005 Normalisation Criterion 4 Axioms ►No axiom should denormalise the ontology No axiom should imply that a primitive is part of more than one branch of primitive skeleton ►If all primitives are disjoint, any such axioms will make that primitive unsatisfiable ►A partial test for normalisation: ►Create random conjunctions of primitives which do not subsume each other. ►If any are satisfiable, the ontology is not normalised

Advanced OWL tutorial 2005 Consequences 1 ►All self-standing primitives are disjoint ►All multiple classification is inferred ►For any two primitive self-standing classes, either one subsumes the other or they are disjoint ►Every self standing concept is part of exactly one primitive branch of the skeleton ►Every self-standing concept has exactly one most specific primitive ancestor

Advanced OWL tutorial 2005 Consequences 2 ►Primitives introduced by a conjunction of one class and a boolean combination of zero or more restrictions ►Tree subclass-of Plant and restriction isMadeOf someValuesFrom Wood ►Resort subclass-of Accommodation restriction isIntendedFor someValueFrom Holidays

Advanced OWL tutorial 2005 Consequences 3 ►Use of axioms limited (outside of skeleton construction) The following are a safe but not exhaustive guide: ►The right side of subclass axioms limited to restrictions ►Both sides of disjointness axioms limited to restrictions ►No equivalence axioms with primitives on either side

Advanced OWL tutorial 2005 A real example: Build a simple treee easy to maintain

Advanced OWL tutorial 2005 Let the classifier organise it

Advanced OWL tutorial 2005 If you want more abstractions, just add new definitions (re-use existing data) “Diseases linked to abnormal proteins”

Advanced OWL tutorial 2005 And let the classifier work again

Advanced OWL tutorial 2005 And again – even for a quite different category “Diseases linked genes described in the mouse”

Advanced OWL tutorial 2005 And let classifier check consistency (My first try wasn’t)

Advanced OWL tutorial 2005 Summary: Why Normalise? Why use a Classifier? ►To compose concepts ►Allow conceptual lego ►To manage polyhierarchies ►Adding abstractions (“axes”) as needed ►Normalisation ►Untangling ►labelling of “kinds of is-a” ►To avoid combinatorial explosions ►Keep bicycles from exploding ►To manage context ►Cross species, Cross disciplines, Cross studies ►To check consistency and help users find errors

Advanced OWL tutorial 2005 But remember the ‘ontology’ is just one part of an infomration resource ►The ontology ►Naming - Definitions & necessary conditions ►Classification ►Indexing ►Knowledge bases ►What we know about those entities – what is true in general ►Databases ►What we know about individuals ►Instance stores – specialised databases that link to ontologies ►Plus ►Lexicons ►Metadata ►Mappings