Building and Using Ontologies Dr. Robert Stevens Department of Computer Science University of Manchester

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Database Design: ER Modelling (Continued)
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
An Overview of Ontologies and their Practical Applications Gianluca Correndo
RDF Schemata (with apologies to the W3C, the plural is not ‘schemas’) CSCI 7818 – Web Technologies 14 November 2001 Van Lepthien.
CS570 Artificial Intelligence Semantic Web & Ontology 2
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
Knowledge Representation
Ontology-Driven Conceptual Modelling Speaker:. Harmonization Meeting What is Ontology?  A discipline of Philosophy  Meta-physics dates.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Semiotics and Ontologies. Ontologies contain categories, lexicons contain word senses, terminologies contain terms, directories contain addresses, catalogs.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
1 Ontology Language Comparisons doug foxvog 16 September 2004.
GOAT: The Gene Ontology Annotation Tool Dr. Mike Bada Department of Computer Science University of Manchester
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
1 Building and Using Ontologies Robert Stevens Department of Computer Science University of Manchester Manchester UK.
COG and GO tutorial.
The Semantic Web – WEEK 5: RDF Schema + Ontologies The “Layer Cake” Model – [From Rector & Horrocks Semantic Web cuurse]
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Foundations This chapter lays down the fundamental ideas and choices on which our approach is based. First, it identifies the needs of architects in the.
ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.
Knowledge Representation Ontology are best delivered in some computable representation Variety of choices with different: –Expressiveness The range of.
Managing Information Quality in e-Science using Semantic Web technology Alun Preece, Binling Jin, Edoardo Pignotti Department of Computing Science, University.
An Introduction to Description Logics. What Are Description Logics? A family of logic based Knowledge Representation formalisms –Descendants of semantic.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
School of Computing FACULTY OF ENGINEERING Developing a methodology for building small scale domain ontologies: HISO case study Ilaria Corda PhD student.
Building an Ontology of Semantic Web Techniques Utilizing RDF Schema and OWL 2.0 in Protégé 4.0 Presented by: Naveed Javed Nimat Umar Syed.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
Confidential 111 Financial Industry Business Ontology (FIBO) [FIBO– Business Entities] Understanding the Business Conceptual Ontology For FIBO-Business.
Dimitrios Skoutas Alkis Simitsis
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
Taken from Schulze-Kremer Steffen Ontologies - What, why and how? Cartic Ramakrishnan LSDIS lab University of Georgia.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Semantic Data & Ontologies CMPT 455/826 - Week 5, Day 2 Sept-Dec 2009 – w5d21.
Advanced topics in software engineering (Semantic web)
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Copyright © 1997 Pangea Systems, Inc. All rights reserved. Building Ontologies.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Mining the Biomedical Research Literature Ken Baclawski.
Some Thoughts to Consider 8 How difficult is it to get a group of people, or a group of companies, or a group of nations to agree on a particular ontology?
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
Using DAML+OIL Ontologies for Service Discovery in myGrid Chris Wroe, Robert Stevens, Carole Goble, Angus Roberts, Mark Greenwood
ece 627 intelligent web: ontology and beyond
Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation Bioinformatics, July 2003 P.W.Load,
Modeling Security-Relevant Data Semantics Xue Ying Chen Department of Computer Science.
1 First Order Logic CS 171/271 (Chapter 8) Some text and images in these slides were drawn from Russel & Norvig’s published material.
Lecture 8-2CS250: Intro to AI/Lisp What do you mean, “What do I mean?” Lecture 8-2 November 18 th, 1999 CS250.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Ccs.  Ontologies are used to capture knowledge about some domain of interest. ◦ An ontology describes the concepts in the domain and also the relationships.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
COMP6215 Semantic Web Technologies
DOMAIN ONTOLOGY DESIGN
Information Organization
ece 627 intelligent web: ontology and beyond
Ontology From Wikipedia, the free encyclopedia
Chapter 2 Database Environment.
Ontology.
ece 720 intelligent web: ontology and beyond
Methontology: From Ontological art to Ontological Engineering
Ontology-Based Approaches to Data Integration
Ontology.
Ontologies in Bioinformatics
CIS Monthly Seminar – Software Engineering and Knowledge Management IS Enterprise Modeling Ontologies Presenter : Dr. S. Vasanthapriyan Senior Lecturer.
Presentation transcript:

Building and Using Ontologies Dr. Robert Stevens Department of Computer Science University of Manchester

Introduction Knowledge & metadata The nature of bioinformatics resources A shared understanding Terminologies and ontologies Building an ontology Using an ontology

What is Knowledge? Knowledge – all information and an understanding to carry out tasks and to infer new information Information -- data equipped with meaning Data -- un-interpreted signals that reach our senses Michael Ashburner Professor University of Cambridge UK ISMBISMB Name Job Institution Country ConfConf man academic, senior ancient university, 5 rated European important figure in biology BIOLOGYBIOLOGY

What is Metadata? Metadata is data about data (information about information) A schema is a DBs metadata; as is the administrator's name; the creator, date of creation, documentation The label on an Ependorf tube in a freezer is metadata A DBs entry’s annotation is metadata on the sequence data

Syntax & Semantics Infix2 + 3 = 5 Prefix= Postfix = Binary = 101 RomanII + III = V = 42

Types of Semantics –An operational semantics for a language is defined by what a sentence in that language will do. –Denotational semantics is a precise mathematical definition of the objects and relations of language in which each sentence of the language names, or denotes, a mathematical object, such as a function. –Natural semantics are the loose ordinary language sense, in which the semantics of a statement is its "meaning". –The term logistic semantics refers to formal models that attempt to represent the natural semantics of some external domain.

Knowledge in Bioinformatics

A Shared Understanding Synonyms and homonyms are rife Need to know that terms in one resource mean the same in another resource Means comparisons are much easier: Can ask questions over many resources Structure enables discovery and query abstractions Useful for both humans and computers The Gene Ontology allows queries outside one model organism

London Bills of Mortality

Aggregated Stats

What is an Ontology? A means of capturing knowledge in a computationally amenable form A shared understanding for humans and computers A set of vocabulary terms that represents a community’s understanding of a domain A set of definitions for those terms The relations between those terms A formal semantics A conceptual model whose labels provide a vocabulary Nucleic acid DNARNA tRNArRNA Ribosome

The art of ranking things in genera and species is of no small importance and very much assists our judgment as well as our memory. You know how much it matters in botany, not to mention animals and other substances, or again moral and notional entities as some call them. Order largely depends on it, and many good authors write in such a way that their whole account could be divided and subdivided according to a procedure related to genera and species. This helps one not merely to retain things, but also to find them. And those who have laid out all sorts of notions under certain headings or categories have done something very useful. Gottfried Wilhelm Leibniz, New Essays on Human Understanding

Components of an Ontology: Concepts Concepts: A unit of thought –AKA: Class, Set, Type, Predicate –Gene, Reaction, Macromolecule Terms are labels of concepts Taxonomy of concepts –Generalization ordering among concepts –Concept A is a parent of concept B iff every instance of B is also an instance of A –Superset / subset –“A kind of” vs. “a part of” Nucleic acid DNARNA tRNArRNA Ribosome

Components of an Ontology: Relations Relations and Attributes –AKA: Slots, properties, roles –Product of Gene, Map-Position of Gene –Reactants of Reaction, K eq of Reaction Meta information about relations –Cardinality, optionality, type restrictions on filler –Transitive, symmetric, functional role properties –Role hierarchies Slot: Expresses Range: Polypeptide or RNA Domain: Genes Cardinality: At-least-1 General Axioms (constraints) –Nucleic acids < 20 residues are oligonucleiotides

Gene Ontology “a dynamic controlled vocabulary that can be applied to all eukaryotes” Built by the community for the community. Three organising principles: Molecular function, Biological process, Cellular component Is-a and Part of taxonomy ~15,000 concepts

Components of an Ontology: Instances Instances –AKA: objects, individuals, set members –trpA Gene, Reaction , Death-receptor-3 –Strictly speaking, an ontology with instances is a knowledge base –The distinction between an instance and a concept is difficult. –Lard-binding-proteins are all those that bind Death-receptor-3.

Components of an Ontology: Properties Primitive: properties are necessary –Globular protein must have hydrophobic core, but a protein with a hydrophobic core need not be a globular protein Defined: properties are necessary + sufficient –Eukaryotic cells must have a nucleus. Every cell that contains a nucleus must be Eukaryotic.

An Ontology Building Life-cycle Identify purpose and scope Knowledge acquisition Evaluation Language and representation Available development tools Conceptualisation Integrating existing ontologies Encoding Building Ontology Learning Consistency Checking

How to do it Collect terms: MacroMolecule, Protein, Enzyme, Holoprotein, Holoenzyme. Arrange into a Polyhierarchy (by hand) Write a definition for each term Encode in some representation Carry on Test against scope, requirements and competency questions

How to do it Enzyme: is-a MacroMolecule polymerOf AminoAcid Catalyses Reaction HoloEnzyme: is-a MacroMolecule  polymerOf AminoAcid  binds ProstheticGroup  Catalyses Reaction  HoloProtein: is-a MacroMolecule polymerOf AminoAcid binds ProstheticGroup Protein: is-a MacroMolecule polymer of AminoAcid

Tips for Building your Terminology Choose a narrow,but useful area Build using domain experts Regard computer scientists as a service You’ll never be complete or correct: Publish early Be practical: Truth and beauty is a bonus Be open A large commitment and a never ending process Start simple and migrate to expressivity and “correctness” as you develop OWL can do this migratory path