Don’t know much about philosophy: The confusion over bio-ontologies

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

The National Center for Biomedical Ontology One of three National Centers for Biomedical Computing launched by NIH in 2005 Collaboration of Stanford, Berkeley,
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Public Health Ontology 101 Mark A. Musen, M.D., Ph.D. Stanford Center for Biomedical Informatics Research Stanford University School of Medicine Die Seuche.
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
BioPortal: A Web Repository and Services for Biomedical Ontologies and Data Resources Natasha Noy and the BioPortal team Stanford Center for Biomedical.
BioPortal as (the only functional) OOR SandBox (so far) Natasha Noy, Michael Dorf Stanford University.
1 The Business Case for Large-Scale Ontology Projects: Are we at a tipping point? Mark A. Musen, M.D., Ph.D. Stanford Medical Informatics Stanford University.
Mark A. Musen Stanford University
Experiences from the NCBO OBO-to-OWL Mapping Effort Dilvan A. Moreira, University of São Paulo Mark A. Musen, Stanford University.
CS570 Artificial Intelligence Semantic Web & Ontology 2
The National Center for Biomedical Ontology Online Knowledge Resources for the Industrial Age Mark A. Musen Stanford University
National center for ontological research University at Buffalo The Center for the Arts October 27, 2005.
1 Building Ontologies from the Ground Up When users set out to model their professional activity Mark A. Musen Professor of Medicine and Computer Science.
Ontology Notes are from:
National center for ontological research. Part One: The History of NCOR and ECOR Part Two: How to Establish JCOR: The Japanese Consortium.
Semantics Overview Sharon L. Bolding, PhD Jan 26, 2008.
Biological Ontologies Neocles Leontis April 20, 2005.
Ontology Evolution Mark A. Musen Stanford University.
Ontologies in Biomedicine Mark A. Musen Stanford University.
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
Business Domain Modelling Principles Theory and Practice HYPERCUBE Ltd 7 CURTAIN RD, LONDON EC2A 3LT Mike Bennett, Hypercube Ltd.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Knowledge representation
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Why we need the OBO Core Michael Ashburner, Suzanna Lewis and Barry Smith.
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
Semantic Web - an introduction By Daniel Wu (danielwujr)
1 What is an Ontology? n No exact definition n A tool to help organize knowledge n Or a way to convey a theory on how to represent a class of things n.
Ontologies in Biomedicine What is the “right” amount of semantics? Mark A. Musen Stanford University.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Data Integration and Management A PDB Perspective.
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
1 Ontolog OOR-BioPortal Comparative Analysis Todd Schneider 15 October 2009.
Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal Natasha Noy Stanford University.
Approach to building ontologies A high-level view Chris Wroe.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Semantic Web. If you can't explain it simply, you don't understand it well enough Everything should be made as simple as possible, but not simpler Great.
National center for ontological research.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
Engineering, 7th edition. Chapter 8 Slide 1 System models.
Fundamentals of Object Oriented Modeling
The UMLS and the Semantic Web
The Semantic Web By: Maulik Parikh.
The National Center for Biomedical Ontology
BioPortal as (the only functional) OOR SandBox (so far)
ece 627 intelligent web: ontology and beyond
Independent Study of Ontologies
Ontology From Wikipedia, the free encyclopedia
Stanford Medical Informatics
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
RDF For Semantic Web Dhaval Patel 2nd Year Student School of IT
ece 720 intelligent web: ontology and beyond
Ontology Reuse In MBSE Henson Graves Abstract January 2011
Methontology: From Ontological art to Ontological Engineering
NSDL Data Repository (NDR)
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Ontology-Based Approaches to Data Integration
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Collaborative RO1 with NCBO
The National Center for Biomedical Ontology
Semantic Wikis Expedition #52 Conor Shankey CEO July 18, 2006
Building Ontologies with Protégé-2000
Presentation transcript:

Don’t know much about philosophy: The confusion over bio-ontologies Mark A. Musen Stanford University

Ontologies are essential to make sense of online data repositories Semantic Glue In partnership with its users. Built specialist tools for annotation and ontology creation and management. Consensus outweighs complexity It doesn’t matter how simple/scruffy the ontology is, what counts is if everyone uses it. It doesn’t matter how smart and complete the ontology is, it doesn’t count if no-one uses it. Its so boriiinnnnnnggggggggg Hendler Principle – a little semantics goes a long way. SAEL - SOFG Anatomy Entry List. Plenty of complexity if it can managed.

Porphyry’s depiction of Aristotle’s Categories Supreme genus: SUBSTANCE Differentiae: material immaterial Subordinate genera: BODY SPIRIT Differentiae: animate inanimate Subordinate genera: LIVING MINERAL Differentiae: sensitive insensitive Proximate genera: ANIMAL PLANT Differentiae: rational irrational Species: HUMAN BEAST Individuals: Socrates Plato Aristotle …

What Is An Ontology? The study of being A discipline co-opted by computer science to enable the explicit specification of Entities Properties and attributes of entities Relations among entities A theory that provides a common vocabulary for an application domain

A Portion of the OBO Library

Why develop an ontology? To share a common understanding of the entities in a given domain among people among software agents between people and software To enable reuse of data and information to avoid re-inventing the wheel to introduce standards to allow interoperability and automatic reasoning To create communities of researchers

Ontologies are just the beginning Databases Declare structure Ontologies Knowledge bases Enumerate domain terms Provide domain descriptions Software agents Problem-solving methods Annotated Data

Why are people confused? There are multiple languages for developing ontologies There are multiple reasons for building ontologies There is no way to know by inspection which ontologies are well formed, useful, or correct

There are many languages, each with its own advocates DAG-Edit: Original langauge for the GO OBO-Edit: DAG-Edit on steroids Protégé frames: Based on CLIPS and JESS UML: From software engineering XML: Great for marking up documents and data sets RDF(S): XML syntax, with classes and relations OWL: A W3C “recommendation”

The Web Ontology Language (OWL) Comes in three flavors: OWL Lite (frame-based, like OBO-Edit) OWL DL (adds decription logic, permitting automatic classification) OWL Full (first-order logic and then some) Is rapidly being adopted for use in biomedical ontologies, including: NCI Thesaurus (cancer biology and oncology) MGED Ontology (DNA micro-array experiments) FuGO (functional genomics in the broadest sense) BioPAX (metabolic pathways) Constitutes the only real standard language for building ontologies at this point

Types of Ontologies “Light” versus “heavy” “Upper” versus “domain specific” “Reference” versus “application-specific”

A Small Portion of ICD9-CM 724 Unspecified disorders of the back 724.0 Spinal stenosis, other than cervical 724.00 Spinal stenosis, unspecified region 724.01 Spinal stenosis, thoracic region 724.02 Spinal stenosis, lumbar region 724.09 Spinal stenosis, other 724.1 Pain in thoracic spine 724.2 Lumbago 724.3 Sciatica 724.4 Thoracic or lumbosacral neuritis 724.5 Backache, unspecified 724.6 Disorders of sacrum 724.7 Disorders of coccyx 724.70 Unspecified disorder of coccyx 724.71 Hypermobility of coccyx 724.71 Coccygodynia 724.8 Other symptoms referable to back 724.9 Other unspecified back disorders

FMA Is-a Parts of the heart Part-of Anatomical Spatial Entity Structure Organ Organ Part Body Space Anatomical Feature Viscus Organ Subdivision Organ Component Organ Cavity Internal Feature Hollow Viscus Cardiac Chamber Heart Parts of the heart Organ Cavity Subdivision Cavity of Heart Right Atrium Wall of Heart Cavity of Right Atrium Fossa Ovalis Wall of Right Atrium Myocardium Is-a Part-of Sinus Venarum Myocardium of Right Atrium SA Node

Classes and attributes in the FMA

Attributes of a class (here, “Esophagus”)

Types of Ontologies “Light” versus “heavy” “Upper” versus “domain specific” “Reference” versus “application-specific”

Suggested Upper Merged Ontology (SUMO)

Why are people confused? There are multiple languages for developing ontologies There are multiple reasons for building ontologies There is no way to know by inspection which ontologies are well formed, useful, or correct

The problem: There are infinite distinctions that you can make about the world The distinctions that matter are the ones that relate to the task to be performed

Goals of Biomedical Ontologies To provide a classification of biomedical entities To summarize and annotate data To simplify the engineering of complex software systems To provide a formal specification of biomedical knowledge

A Small Portion of ICD9-CM 724 Unspecified disorders of the back 724.0 Spinal stenosis, other than cervical 724.00 Spinal stenosis, unspecified region 724.01 Spinal stenosis, thoracic region 724.02 Spinal stenosis, lumbar region 724.09 Spinal stenosis, other 724.1 Pain in thoracic spine 724.2 Lumbago 724.3 Sciatica 724.4 Thoracic or lumbosacral neuritis 724.5 Backache, unspecified 724.6 Disorders of sacrum 724.7 Disorders of coccyx 724.70 Unspecified disorder of coccyx 724.71 Hypermobility of coccyx 724.71 Coccygodynia 724.8 Other symptoms referable to back 724.9 Other unspecified back disorders

Goals of Biomedical Ontologies To provide a classification of biomedical entities To summarize and annotate data To simplify the engineering of complex software systems To provide a formal specification of biomedical knowledge

Ontologies are essential to make sense of online data repositories Semantic Glue In partnership with its users. Built specialist tools for annotation and ontology creation and management. Consensus outweighs complexity It doesn’t matter how simple/scruffy the ontology is, what counts is if everyone uses it. It doesn’t matter how smart and complete the ontology is, it doesn’t count if no-one uses it. Its so boriiinnnnnnggggggggg Hendler Principle – a little semantics goes a long way. SAEL - SOFG Anatomy Entry List. Plenty of complexity if it can managed.

Summarization and annotation of data Biologists generally don’t care about modeling reality beyond their data Biologists care about Making sense of terabytes of data Accessing and indexing data Comparing data sets with one another The goal is to create annotations that make distinctions about the data, not about the world

Goals of Biomedical Ontologies To provide a classification of biomedical entities To summarize and annotate data To simplify the engineering of complex software systems To provide a formal specification of biomedical knowledge

Engineering of complex software systems Object-oriented design and programming are well entrenched in current software-engineering practices OOP owes considerable legacy to frame-based knowledge-representation systems developed in AI in the 1970s Ontologies are now at the core of advanced software engineering

Model-Directed Architecture Platform-independent UML model describes domain ontology Computer-Assisted Software Engineering tools perform transformation of UML model into program code Platform-independent models have potential to outlive particular software implementations

Goals of Biomedical Ontologies To provide a classification of biomedical entities To summarize and annotate data To simplify the engineering of complex software systems To provide a formal specification of biomedical knowledge

The Foundational Model of Anatomy

Formal specification of biomedical knowledge In the “information society,” there will be increasing motivation for representing human knowledge in machine-processable form Ontologies of professional knowledge are being seen as having value even for their own sake

Goals of Biomedical Ontologies To provide a classification of biomedical entities To summarize and annotate data To simplify the engineering of complex software systems To provide a formal specification of biomedical knowledge

Why are people confused? There are multiple languages for developing ontologies There are multiple reasons for building ontologies There is no way to know by inspection which ontologies are well formed, useful, or correct

Ontologies are not like journal articles It is difficult to judge methodological soundness simply by inspection We may wish to use an ontology even though some portions Are not well designed Make distinctions that are different from those that we might want

Ontologies are not like journal articles II The utility of ontologies Depends on the task May be highly subjective Users should want the opinions of more than 2–3 hand-selected reviewers Peer review needs to scale to the entire user community

Solution Snapshot

An experiment to offer standards that define minimal criteria for “good” ontologies Textual definitions of all terms Unambiguously defined relations A plurality of independent users Commitment to collaborate with other developers Open content Recognized language Unique namespaces Appropriate versioning Clearly specified content

The National Center for Biomedical Ontology One of three National Centers for Biomedical Computing launched by NIH in 2005 Collaboration of Stanford, Berkeley, Mayo, Buffalo, Victoria, UCSF, Oregon, and Cambridge Primary goal is to make ontologies accessible and usable Research will develop technologies for ontology indexing, alignment, and peer review

Goals for BioPortal Web accessible repository of ontologies for the biomedical community Support for ontology Peer review Annotation (marginalia) Versioning Alignment Search

Other Center Activities Biological Driving Projects that will use BioPortal ontologies to annotate biomedical data Collaborating projects that will use BioPortal ontologies for natural-language processing information integration data and knowledge visualization Outreach activities to help different communities to build better ontologies and to utilize the Center’s technology

Ontologies are just the beginning Databases Declare structure Ontologies Knowledge bases Enumerate domain terms Provide domain descriptions Software agents Problem-solving methods Annotated Data

There is hope for the confused! There are multiple languages for developing ontologies—but standards are emerging! There are multiple reasons for building ontologies—we simply need to embrace diversity There is no way to know by inspection which ontologies are well formed, useful, or correct—but there are peer-review methods on the horizon