Document Navigation: Ontologies or Knowledge Organisation Systems Simon Jupp Bio-Health Informatics Group University of Manchester, UK Bio-ontologies SIG,

Slides:



Advertisements
Similar presentations
Annotation of Gene Function …and how thats useful to you.
Advertisements

The use of Ontology in Organising and Managing Protein Family Resources Katy Wolstencroft, University Of Manchester.
Experiences from the NCBO OBO-to-OWL Mapping Effort Dilvan A. Moreira, University of São Paulo Mark A. Musen, Stanford University.
Mitsunori Ogihara Center for Computational Science
Semantic Similarity Measures Across The Gene Ontology. Relating Sequence to Annotation. P.W. Lord, R.D. Stevens, A.Brass, and C. Goble Department of Computer.
Consistent and standardized common model to support large-scale vocabulary use and adoption Robust, scalable, and common API to reduce variation in clinical.
1 Welcome to the Protein Database Tutorial This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
GOAT: The Gene Ontology Annotation Tool Dr. Mike Bada Department of Computer Science University of Manchester
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
Storing and Retrieving Biological Instances with the Instance Store Daniele Turi, Phillip Lord, Michael Bada, Robert Stevens.
1 An Ontology of Relations for Biomedical Informatics Barry Smith 10 January 2005.
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
Biological Databases Notes adapted from lecture notes of Dr. Larry Hunter at the University of Colorado.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Introduction to Protégé AmphibiaTree 2006 Workshop Sunday 8:45–9:15 J. Leopold & A. Maglia.
1 CIS607, Fall 2006 Semantic Information Integration Instructor: Dejing Dou Week 10 (Nov. 29)
1 The Future of Clinical Bioinformatics: Overcoming Obstacles to Information Integration Barry Smith Brussells, Eurorec Ontology Workshop, 25 November.
COHSE Informed WWW Link Navigation Using Ontologies Prof. Carole Goble, Sean Bechhofer Dr. Leslie Carr, Prof. Wendy Hall, Prof. David De Roure, Steve Harris,
Development Principles PHIN advances the use of standard vocabularies by working with Standards Development Organizations to ensure that public health.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Unified Medical Language System® (UMLS®) NLM Presentation Theater MLA 2007 National Library of Medicine National Institutes of Health U.S. Dept. of Health.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Limning the CTS Ontology Landscape Barry Smith 1.
A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases SeaLife Simon Jupp.
HL7 HL7  Health Level Seven (HL7) is a non-profit organization involved in the development of international healthcare.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Experiences in visualizing and navigating biomedical.
NCBI’s Bioinformatics Resources Michele R. Tennant, Ph.D., M.L.I.S. Health Science Center Libraries U.F. Genetics Institute January 2015.
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
Networks and Interactions Boo Virk v1.0.
Multilingual Information Exchange APAN, Bangkok 27 January 2005
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
Resurrecting SOWG BS, Baltimore, CTS Ontology Workshop April
Intralab Workshop - Reactome CMAP Chang-Feng Quo June 29 th, 2006.
March 24, Integrating genomic knowledge sources through an anatomy ontology Gennari JH, Silberfein A, and Wiley JC Pac Symp Biocomputing 2005:
Gene Expression Data Annotation – an application of the cell type ontology Helen Parkinson, PhD 19 May 2010.
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Open Terminology Portal (TOP) Frank Hartel, Ph.D. Associate Director, Enterprise Vocabulary Services National Cancer Institute, Center for Biomedical Informatics.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Ontological Foundations of Biological Continuants Stefan Schulz, Udo Hahn Text Knowledge Engineering Lab University of Jena (Germany) Department of Medical.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Using DAML+OIL Ontologies for Service Discovery in myGrid Chris Wroe, Robert Stevens, Carole Goble, Angus Roberts, Mark Greenwood
Trait ontology approach Marie-Angélique LAPORTE NCEAS June 7 th 2010.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Japan Consortium for Glycobiology and Glycotechnology DataBase 日本糖鎖科学統合データベース PACDB - Pathogen Adherence to Carbohydrate Database The Pathogen Adherence.
Ontologies for Terminologies, Knowledge Representation & Software: Benefits & Gaps (“Don’t make the tea”) (Only a part of Knowledge Representation) Alan.
Screen Readers Cannot See (Ontology Based Semantic Annotation for Visually impaired Web users) Yeliz Yesilada, Simon Harper, Carole Goble and Robert Stevens.
Supporting Collaborative Ontology Development in Protégé International Semantic Web Conference 2008 Tania Tudorache, Natalya F. Noy, Mark A. Musen Stanford.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Joined up ontologies: incorporating the Gene Ontology into the UMLS.
TDM in the Life Sciences Application to Drug Repositioning *
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
The UMLS and the Semantic Web
CCNT Lab of Zhejiang University
Annotation: linking literature to gene products
CCO: concept & current status
Gramene’s Ontologies Tutorial
Presentation transcript:

Document Navigation: Ontologies or Knowledge Organisation Systems Simon Jupp Bio-Health Informatics Group University of Manchester, UK Bio-ontologies SIG, ISMB 2007 Vienna, Austria A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

Document navigation over a Semantic Web Introduction Ontologies are being developed as background knowledge to drive the Semantic Web. Message: Formal ontologies are not the only knowledge artefact needed, artefacts with weaker semantics have their role and are the best solution in some circumstances

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases COHSE (Conceptual Open Hypermedia SErvice) Navigation via Hypertext is a mainstay of WWW Problem: Links are typically embedded to Web pages; hard-coding, format restrictions, ownership, legacy resources, maintenance, Unary targets etc.

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases

HTML Document in Linked HTML Document out DLS Agent Knowledge Service Resource Service Ontology SKOS Search Engine Annotation DB COHSE Architecture

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Sealife project A realisation of a Semantic Grid Browser, which links the current Web to the emerging eScience infrastructure Built on existing tools developed by partners: COHSE, but also GoPubMed and others. Applications: from cells via tissue to patients Evidence-based medicine Patent and literature mining Molecular biology

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases What is a Sealife browser? A Sealife browser is a browser that identifies concepts in texts and offers appropriate services based on these concepts. Three main components: 1) an underlying ontology or set of ontologies 2) a text mining component 3) a service composition module

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Sealife use case - study of disease NeLI: National electronic Library of Infection portal. Range of users but few links. Given a document about Tuberculosis, where would users want to navigate to next?

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Background Knowledge What style of knowledge model is best suited for navigation? Do we need strict semantics, are they a help or hindrance? All I want to say is this concept has something to do with this concept - this doesnt fit well with formal ontology e.g Polio has something to do with polio virus / polio disease / polio treatment - get me documents about all of them!

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Sealife background knowledge To cover molecular biology through to medicine we need a large knowledge artefact to serve as background knowledge for SeaLife. This artefact must support sensible navigation between documents on the web. Luckily…

Phenotype Sequence Proteins Gene products Transcript Pathways Cell type BRENDA tissue / enzyme source Development Anatomy Phenotype Plasmodium life cycle -Sequence types and features -Genetic Context - Molecule role - Molecular Function - Biological process - Cellular component -Protein covalent bond -Protein domain -UniProt taxonomy -Pathway ontology -Event (INOH pathway ontology) -Systems Biology -Protein-protein interaction -Arabidopsis development -Cereal plant development -Plant growth and developmental stage -C. elegans development -Drosophila development FBdv fly development.obo OBO yes yes -Human developmental anatomy, abstract version -Human developmental anatomy, timed version -Mosquito gross anatomy -Mouse adult gross anatomy -Mouse gross anatomy and development -C. elegans gross anatomy -Arabidopsis gross anatomy -Cereal plant gross anatomy -Drosophila gross anatomy -Dictyostelium discoideum anatomy -Fungal gross anatomy FAO -Plant structure -Maize gross anatomy -Medaka fish anatomy and development -Zebrafish anatomy and development -NCI Thesaurus -Mouse pathology -Human disease -Cereal plant trait -PATO PATO attribute and value.obo -Mammalian phenotype -Habronattus courtship -Loggerhead nesting -Animal natural history and life history eVOC (Expressed Sequence Annotation for Humans)

ICD ICD Synopsis Nosologiae Methodicae OPCS SNOP CPT MESH ICD MeSH OPCS EmTree SNOP CPT 1700 OPCS3 CTV3 READ ICPC SNOMED-2 SNOMED International SNOMED-RT SNOMED-CT GALENDM&D UMLS FMA OPCS4OPCS4.3 History of Medical Vocabularies

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases What do we need for navigation? The bio-medical domain is rich in vocabularies and ontologies. Large lexical resource including textual definitions and synonyms There is a varying degree of semantics, expressivity and formality in these vocabularies (e.g. MeSH) and ontologies (e.g FMA). Most include some form of hierarchy. Hierarchies are well suited for driving navigation. Question: Do we want strict sub/super class relationships? Or, do we want looser notations such as broader/narrower?

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Ontology or Vocabulary? Initial approach to COHSE and SeaLife was to represent everything in OWL The strict semantics of OWL do not always lend themselves to sensible navigation, conversion from vocabularies to OWL are difficult. Its hard to model some things in OWL… MeSHOBO/OWL Head <-- Ear <-- Nose Accident <-- Traffic Accident <-- Accident Prevention Nucleus part_of Cell Cell has_part Nucleus - Not always True PolioDisease causedby PolioVirus

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases SKOS (Simple Knowledge Organisation System) Purpose: Subject Metadata and information retrieval RDF/XML representation Model ideally suited for our task e.g. This document is about tuberculosis e.g. prefLabe, altLabel, broader, narrower, related. Model for representing concept schemes, thesauri, classification system, taxonomies etc… SKOS Concept is an individual of the class skos:concept, relationships are just asserted facts between (not restrictions!)

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Conversion to SKOS relationship:part_of --> skos:broader ( e.g. finger part_of hand) relationship:contains --> skos:narrower ( e.g. skull contains brain) relationship:causes --> skos:related ( e.g. PolioDisease causes PolioVirus) Sub properties: skos:broader skos:narrower inverse obo:part_ofobo:has_part Leaves us open to migration back to OWL

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Conversion to SKOS SKOS MeSH OBO ontologies OWL ontologies SNOMED NeLI Taxonomies Concept Schemas Thesauri Other..

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Advantage of this approach For a given concept e.g. Polio Virus, we can query multiple resources and bring related concepts together. Rapid (and cheap!) generation of knowledge artefact Take advantage efforts in multiple biomedical communities We dont have to make any strong ontological distinctions

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Disadvantage of this approach Trade off: Lose the inferential power when querying a knowledge resource Inability to do inconsistency checking Potentially large redundancy in our knowledge base Maintenance and scalability (> concepts) - especially for dynamic hyper-linking. Unwanted concepts & relationship - especially from OWL conversion e.g. Physical Entity, Continuant etc…. Linking overload!

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Plug for Manchesters SKOS plug-in - Protégé 4 Instance hierarchy viewer OBO or OWL --> SKOS wizards Various rendering options

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Conclusion We need a large knowledge artefact that support navigation between related web resources Rapid generation and reuse of existing terminologies (cheap) Loosening the semantics of our model enables this with acceptable trade off SKOS is a suitable model to represent our knowledge artefact We have strong use cases from the life sciences - demos available Still some open issues ….

A Semantic Grid Browser for the Life Sciences Applied to the Study of Infectious Diseases Acknowledgments Manchester Robert Stevens Sean Bechhofer Yeliz Yesilada NeLI Patty Kostkova Other COHSE developers Sealife project Thank you.