1 Building and Using Ontologies Robert Stevens Department of Computer Science University of Manchester Manchester UK.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
GOAT: The Gene Ontology Annotation Tool Dr. Mike Bada Department of Computer Science University of Manchester
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Object-Oriented Analysis and Design
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Description Logics. Outline Knowledge Representation Knowledge Representation Ontology Language Ontology Language Description Logics Description Logics.
Software Requirements
Project Life Cycle Jon Ivins DMU. Introduction n Projects consist of many separate components n Constraints include: time, costs, staff, equipment n Assets.
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
1/31 CS 426 Senior Projects Chapter 1: What is UML? Chapter 2: What is UP? [Arlow and Neustadt, 2005] January 22, 2009.
Supplement 02CASE Tools1 Supplement 02 - Case Tools And Franchise Colleges By MANSHA NAWAZ.
1 CS 426 Senior Projects Chapter 1: What is UML? Chapter 2: What is UP? [Arlow and Neustadt, 2002] January 26, 2006.
BIS310: Week 7 BIS310: Structured Analysis and Design Data Modeling and Database Design.
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 10 Structuring.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Building and Using Ontologies Dr. Robert Stevens Department of Computer Science University of Manchester
Computer System Analysis Chapter 10 Structuring System Requirements: Conceptual Data Modeling Dr. Sana’a Wafa Al-Sayegh 1 st quadmaster University of Palestine.
FRE 2672 Urban Ontologies : the Towntology prototype towards case studies Chantal BERDIER (EDU), Catherine ROUSSEY (LIRIS)
-Ontologies: Bio-Ontologies: Their Creation and Design Dr. Peter Karp SRI, Dr. Robert Stevens & Professor Carole Goble University.
What is UML? What is UP? [Arlow and Neustadt, 2005] January 23, 2014
Aidministrator nederland b.v. Adding formal semantics to the Web Jeen Broekstra, Michel Klein, Stefan Decker, Dieter Fensel,
Knowledge Representation Ontology are best delivered in some computable representation Variety of choices with different: –Expressiveness The range of.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
System models Abstract descriptions of systems whose requirements are being analysed Abstract descriptions of systems whose requirements are being analysed.
Knowledge representation
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 20 Object-Oriented.
School of Computing FACULTY OF ENGINEERING Developing a methodology for building small scale domain ontologies: HISO case study Ilaria Corda PhD student.
Jessica Chen-Burger A Framework for Knowledge Sharing and Integrity Checking for Multi-Perspective Models Yun-Heh (Jessica) Chen-Burger Artificial Intelligence.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Dimitrios Skoutas Alkis Simitsis
Chapter 7 System models.
MGED Ontology Working Group MGED4 Boston, MA Feb. 15, 2002 Chris Stoeckert, Center for Bioinformatics, U. Penn Helen Parkinson, EBI.
Lecture 7: Requirements Engineering
System models l Abstract descriptions of systems whose requirements are being analysed.
Modified by Juan M. Gomez Software Engineering, 6th edition. Chapter 7 Slide 1 Chapter 7 System Models.
Information Systems Engineering. Lecture Outline Information Systems Architecture Information System Architecture components Information Engineering Phases.
February 24, 2006 ONTOLOGIES Helena Sofia Pinto ( )
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Software Engineering Prof. Ing. Ivo Vondrak, CSc. Dept. of Computer Science Technical University of Ostrava
Andreas Abecker Knowledge Management Research Group From Hypermedia Information Retrieval to Knowledge Management in Enterprises Andreas Abecker, Michael.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Logical view –show classes and objects Process view –models the executables Implementation view –Files, configuration and versions Deployment view –Physical.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Copyright © 1997 Pangea Systems, Inc. All rights reserved. Building Ontologies.
© 2006 Pearson Addison-Wesley. All rights reserved 2-1 Chapter 2 Principles of Programming & Software Engineering.
Approach to building ontologies A high-level view Chris Wroe.
APPLICATION OF ONTOLOGIES IN CANCER NANOTECHNOLOGY RESEARCH Faculty of Engineering in Foreign Languages 1 Student: Andreea Buga Group: 1241E – FILS Coordinating.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 10 Structuring.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
1 Software Requirements Descriptions and specifications of a system.
Lisbon, 30 th March 2016 Gianluca Luraschi Gonçalo Cadete “Towards a Methodology for Building.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
Chapter 10 Structuring System Requirements: Conceptual Data Modeling
What is UML? What is UP? [Arlow and Neustadt, 2005] October 5, 2017
ece 627 intelligent web: ontology and beyond
Methontology: From Ontological art to Ontological Engineering
Chapter 10 Structuring System Requirements: Conceptual Data Modeling
Chapter 20 Object-Oriented Analysis and Design
Metadata Framework as the basis for Metadata-driven Architecture
Ontology-Based Approaches to Data Integration
Ontologies in Bioinformatics
Chapter 10 Structuring System Requirements: Conceptual Data Modeling
Lecture 10 Structuring System Requirements: Conceptual Data Modeling
Presentation transcript:

1 Building and Using Ontologies Robert Stevens Department of Computer Science University of Manchester Manchester UK

2 Introduction The nature of bioinformatics resources What is knowledge? What is an ontology? What are the uses of ontologies? Components of an ontology Building an ontology (in brief)

3 The Nature of Bioinformatics Resources Over 500 databanks and analysis tools that work over resources Repositories of knowledge and data and generation of new knowledge Knowledge often held as free text; some use made of controlled vocabularies Enormous amount of semantic heterogeneity and poor query facilities Knowledge about services not always apparent

4 What is Knowledge? Knowledge – all information and an understanding to carry out tasks and to infer new information Information -- data equipped with meaning Data -- un-interpreted signals that reach our senses PATRICIAGRACEKENNEDY SAIDMINEISAPINT Patricia Grace Kennedy said mine is a pint name noun verb Pat Baker is a Manchester bioinformatician who drinks beer. …CEKENN… Single letter amino acid codes C – cysteine K - lysine Protein that acts as a tyrosine kinase in the liver of primates.

5 Capturing Knowledge Capturing knowledge for both humans an computer applications A set of vocabulary definitions that capture a community’s knowledge of a domain `An ontology may take a variety of forms, but necessarily it will include a vocabulary of terms, and some specification of their meaning. This includes definitions and an indication of how concepts are inter- related which collectively impose a structure on the domain and constrain the possible interpretations of terms.'

6 What Does an Ontology Do? Captures knowledge Creates a shared understanding – between humans and for computers Makes knowledge machine processable Makes meaning explicit – by definition and context

7 What is an Ontology? Catalog/ ID General Logical constraints Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal is-a Formal instance Value Restrs.Disjointness, Inverse, part- of…

8 Roles of Ontologies in Bioinformatics We can divide ontology use into three types: Domain-oriented, which are either domain specific (e.g. E. coli ) or domain generalisations (e.g. gene function or ribosomes); Task-oriented, which are either task specific (e.g. annotation analysis) or task generalisations (e.g. problem solving); Generic, which capture common high level concepts, such as Physical, Abstract and Substance. Important in ontology management and language applications.

9 Uses of Ontology Community reference -- neutral authoring. Either defining database schema or defining a common vocabulary for database annotation -- ontology as specification. Providing common access to information. Ontology- based search by forming queries over databases. Understanding database annotation and technical literature. Guiding and interpreting analyses and hypothesis generation

10 Components of an Ontology Concepts: Class of individuals – The concept Protein and the individual `human cytochrome C’ Relationships between concepts Is a kind of relationship forms a taxonomy Other relationships give further structure – is a part of Axioms – Disjointness, covering, equivalence,…

11 Knowledge Representation Ontology are best delivered in some computable representation Variety of choices with different: –Expressiveness The range of constructs that can be used to formally, flexibly, explicitly and accurately describe the ontology –Ease of use –Computational complexity Is the language computable in real time? Rigour -- Satisfiability and consistency of the representation Systematic enforcement mechanisms –Unambiguous, clear and well defined semantics

12 Languages Vocabularies using natural language –Hand crafted, flexible but difficult to evolve, maintain and keep consistent, with weak semantics –Gene Ontology Object-based KR: frames –Extensively used, good structuring, intuitive. Semantics defined by OKBC standard –EcoCyc (uses Ocelot) and RiboWeb (uses Ontolingua) Logic-based: Description Logics –Very expressive, model is a set of theories, well defined semantics –Automatic derived classification taxonomies –Concepts are defined and primitive

13 Building Ontologies No field of Ontological Engineering equivalent to Knowledge or Software Engineering; No standard methodologies for building ontologies; Such a methodology would include: –a set of stages that occur when building ontologies; –guidelines and principles to assist in the different stages; –an ontology life-cycle which indicates the relationships among stages.

14 The Development Lifecycle Two kinds of complementary methodologies emerged: –Stage-based, e.g. TOVE [Uschold96] –Iterative evolving prototypes, e.g. MethOntology [Gomez Perez94]. Most have TWO stages: 1.Informal stage ontology is sketched out using either natural language descriptions or some diagram technique 2.Formal stage ontology is encoded in a formal knowledge representation language, that is machine computable –the informal representation helps the former –the formal representation helps the latter.

15 A Provisional Methodology A skeletal methodology and life-cycle for building ontologies; Inspired by the software engineering V-process model; The overall process moves through a life-cycle. The left side charts the processes in building an ontology The right side charts the guidelines, principles and evaluation used to ‘quality assure’ the ontology

16 The V-model Methodology Conceptualisation Integrating existing ontologies Encoding Representation Identify purpose and scope Knowledge acquisition Evaluation: coverage, verification, granularity Conceptualisation Principles: commitment, conciseness, clarity, extensibility, coherency Encoding/Representation principles: encoding bias, consistency, house styles and standards, reasoning system exploitation Ontology in Use User Model Conceptualisation Model Implementation Model

17 The ontology building life- cycle Identify purpose and scope Knowledge acquisition Evaluation Language and representation Available development tools Conceptualisation Integrating existing ontologies Encoding Building

18 Starting Concept List Chemicals – atom, ion, molecule, compound, element; Molecular-compound, ionic-compound, ionic-molecular- compound, …; Ionic-macromolecular-compound and ionic-small- macromolecular-compound; Protein, peptide, polyprotein, enzyme, holoprotein, apoprotein,… Nucleic acid – DNA, RNA, tRNA, mRna, snRNA, …

19 Conceptualisation Sketch Chemical AtomElementCompoundMoleculeIon MetalNon-Metal Metaloid Molecular Compound Molecular Element Ionic Compound Ionic Molecule Ionic Molecular Compound

20 Molecule Conceptualisation Sketch Nucleic Acid ProteinPolysaccharide DNARNAEnzyme MacromoleculeSmall Molecule Ionic Macromolecular Compound StarchGlycogen mRNAtRNArRNAsnRNA Peptide

21 Initial Encoding class-def chemical subclass-of substance class-def molecule subclass-of chemical class-def compound subclass-of chemical class-def molecular-compound subclass-of molecule and compound

22 Molecules Revisited Nucleic Acid ProteinPolysaccharide DNARNAEnzyme MacromoleculeSmall Molecule Ionic Macromolecular Compound StarchGlycogen mRNAtRNArRNAsnRNA Peptide Non-Ionic Macromolecular Compound

23 More Encoding class-def chemical subclass-of substance class-def defined molecule subclass-of chemical Slot-constraint contains-bond min-cardinality 1 has-value covalent-bond class-def defined compound subclass-of chemical Slot-constraint has-atom-types greater-than 1 class-def defined molecular-compound subclass-of molecule and compound

24 Expansion Sketch and encode in cycles Build a taxonomy of a small portion Then build links to other portions Add more detail Document sources, author, date and argumentation.

25 Summary An ontology captures knowledge for a shared understanding The important question is not whether an artefact is an ontology, but whether it does any good Making our understanding of domain explicit, consistent and processable Bioinformatics resources are knowledge resources – needs to be both human and machine understandable