Ontology: A Guide for the Intelligence Analyst

Slides:



Advertisements
Similar presentations
Willing to spend the time! Self motivated! Self responsibility! (If you need something Ask For IT!!!!!) Ability to communicate! (Vocabulary) Write,
Advertisements

Upper Ontology Summit Tuesday March 14 The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National Center.
1 Towards a Reference Terminology for Talking about Ontologies and Related Artifacts Barry Smith with thanks to Werner.
Knowledge Representation
National center for ontological research University at Buffalo The Center for the Arts October 27, 2005.
Ontology in Buffalo Barry Smith. 2 Ontology (phil.) The science of being Ontologies (tech.) Standardized classification systems which enable data from.
Ontology Notes are from:
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U.
Third-generation information architecture November 4, 2008.
1 Doing Ontology Over Images Barry Smith. What ontologies are for.
1 Introduction to Ontology: Terminology Barry Smith with thanks to Werner Ceusters, Waclaw Kusnierczyk, Daniel Schober.
1 Introduction to (Geo)Ontology Barry Smith
National center for ontological research. Part One: The History of NCOR and ECOR Part Two: How to Establish JCOR: The Japanese Consortium.
Pathways and Networks for Realists Barry Smith 1.
1 Ontologie als konkretisierte Darstellung der Wirklichkeit Barry Smith.
Philosophy and Computer Science: New Perspectives of Collaboration
Why, in the future, all sciences will be computer sciences Barry Smith.
Foundations This chapter lays down the fundamental ideas and choices on which our approach is based. First, it identifies the needs of architects in the.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Copyright © 2013 Curt Hill The Zachman Framework What is it all about?
9/14/2012ISC329 Isabelle Bichindaritz1 Database System Life Cycle.
3rd Country Training, K.Subieta: System Engineering and Databases. Lecture 3, Slide 1 February 20, 2004 Lecture 3: Introduction to Software Analysis and.
Welcome to AP Biology Mr. Levine Ext. # 2317.
Core 6 (University at Buffalo) Dissemination of Ontology Best Practices Barry Smith (PI) Fabian Neuhaus (Post-Doc) Werner.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Confidential 111 Financial Industry Business Ontology (FIBO) [FIBO– Business Entities] Understanding the Business Conceptual Ontology For FIBO-Business.
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Leo Obrst, Fabian Neuhaus MITRE, NIST An Open Ontology Repository: Rationale, Expectations & Requirements Session.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
1 How to build an ontology Barry Smith
High throughput biology data management and data intensive computing drivers George Michaels.
Ontology in MBSE How ontologies fit into MBSE The benefits and challenges.
Of 24 lecture 11: ontology – mediation, merging & aligning.
1 Why computer science needs philosophy Barry Smith National Center for Ontological Research.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U.
National center for ontological research.
1 Standards and Ontology Barry Smith
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
IDEAS Model for Coalition Architecture Interoperability
Philosophy and Computer Science: New Perspectives of Collaboration
The Semantic Web By: Maulik Parikh.
Object Management Group Information Management Metamodel
Course Outcomes of Object Oriented Modeling Design (17630,C604)
Guilford County SciVis V102.03
The Semiotic Engineering of Human-Computer Interaction Section I Foundation Chapter 1 Introduction.
Object-Oriented Analysis and Design
Knowledge Representation Part I Ontology
ece 627 intelligent web: ontology and beyond
The Role of Ontologies for Mapping the Domain of Landscape Architecture An introduction.
Ontology From Wikipedia, the free encyclopedia
Ontology Measurement and Evaluation Ontolog mini-series Co-sponsored by NIST, Ontolog and NCOR Steve Ray, NIST October 19, 2006.
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment.
Ontology Reuse In MBSE Henson Graves Abstract January 2011
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
2. An overview of SDMX (What is SDMX? Part I)
CSSSPEC6 SOFTWARE DEVELOPMENT WITH QUALITY ASSURANCE
KNOWLEDGE REPRESENTATION
Metadata Framework as the basis for Metadata-driven Architecture
Mike Timms and Cathleen Kennedy University of California, Berkeley
Bird of Feather Session
Chapter 2 Database Environment Pearson Education © 2009.
IDEAS Chris Partridge 6/27/2019.
Presentation transcript:

Ontology: A Guide for the Intelligence Analyst Barry Smith http://ontology.buffalo.edu/smith

Problem of ensuring sensible cooperation in a massively interdisciplinary community concept type instance model representation data

What do these mean? ‘conceptual data model’ ‘semantic knowledge model’ ‘reference information model’ ‘an ontology is a specification of a conceptualization’

help is on the way ...

national center for ontological research

ECOR Partner Institutions Laboratory for Applied Ontology, Trento/Rome Center for Theoretical and Applied Ontology, Turin Foundational Ontology Group, University of Leeds JCOR – Japanese Center for Ontological Research

Ontologies (tech.) Ontology (phil.) Standardized classification systems which enable data from different sources to be combined Ontology (phil.) The theory of being

The need strong general purpose classification hierarchies created by domain specialists thoroughly tested in real use cases to help us navigate through oceans of data

Good ontologies should be intelligible to human beings computationally useful capable of being glued together

The actuality (too often) myriad special purpose ‘light’ ontologies, prepared by ontology engineers and deposited in internet ‘repositories’ or ‘registries’ which only create NEW oceans of data

Schemaweb ontologies (http://www.w3.org/) MusicBrainz Metadata Vocabulary Musical Baton Vocabulary Beer Ontology Kissology

‘Lite’ ontologies often do not generalize … repeat work already done by others are not gluable together no roadmap for progressive improvement reproduce the very problems of communication which ontology was designed to solve

Ontology (science) The empirical study of how to build humanly useful and computationally tractable representations of entities and of the relations between them Evidence-based terminology research

Why NCOR? Why NCOR? NCOR will advance ontology as science develop measures of quality for ontologies to establish best practices

Why NCOR? NCOR will provide coordination and support for investigators working in ontology and its applications engage in outreach endeavors designed to foster the goals of high quality ontology in both theory and practice advance ontology education

National Center for Biomedical Ontology $18.8 mill. NIH Roadmap Center Stanford Medical Informatics University of San Francisco Medical Center Berkeley Drosophila Genome Project Cambridge University Department of Genetics The Mayo Clinic University at Buffalo Department of Philosophy

From chromosome to disease

… legacy of Human Genome Project genomics transcriptomics proteomics reactomics metabonomics phenomics behavioromics connectomics toxicopharmacogenomics bibliomics … legacy of Human Genome Project

 need for semantic annotation of data where in the body ? what kind of disease process ?  need for semantic annotation of data dir.niehs.nih.gov/ microarray/datamining/

Woops: 54M already ! Compare with 3M Dec 2004, and 12 M june 2005 when I did this.

natural language labels to make the data cognitively accessible to human beings dir.niehs.nih.gov/ microarray/datamining/

compare: legends for maps http://www.ags.gov.ab.ca/GRAPHICS/uranium/athabasca_group_map_with_legend.jpg

ontologies are legends for data dir.niehs.nih.gov/ microarray/datamining/

compare: legends for cartoons

legends help human beings use and understand complex representations of reality help human beings create useful complex representations of reality help computers process complex representations of reality

computationally tractable legends help human beings find things in very large complex representations of reality

ontologies are legends for images

what lesion ? what brain function ?

which period? which architectural style? which type of building?

ontologies are legends for mathematical equations xi = vector of measurements of gene i k = the state of the gene ( as “on” or “off”) θi = set of parameters of the Gaussian model ...

ontologies are legends for word lists ...and the Computer's View name education CV private work © 2006 Adam Pease, Articulate Software Slide inspired by Frank von Harmelan Slide inspired by Frank von Harmelan

The Idea GlyProt MouseEcotope DiabetInGene GluChem sphingolipid transporter activity DiabetInGene GluChem

annotation using common ontologies yields integration of databases MouseEcotope GlyProt Holliday junction helicase complex DiabetInGene GluChem

Glue-ability / integration rests on the existence of a common benchmark called ‘reality’ the ontologies we want to glue together are representations of what exists in the world not of what exists in the heads of different groups of people

truth is correspondence to reality

simple representations can be true

there are true cartoons

a cartoon can be a veridical representation of reality

a network diagram can be a veridical representation of reality

http://www.psb.ugent.be/cbd/images/research/networks.jpg

pathway maps are representations of complexes of types http://www.nature.com/msb/journal/v1/n1/images/msb4100014-f1.jpg

maps may be correct by reflecting topology, rather than geometry

an image can be a veridical representation of reality a labeled image can be a more useful veridical representation of reality

an image labelled with computationally tractable labels can be an even more useful veridical representation of reality

annotations help us to find images

annotations using common ontologies can yield integration of image data

and link image databases together Gazetteer GlyProt ruins of Hadrami mosque CIA Factbook GluChem

if you’re going to semantically annotate piles of data, better work out how to do it right from the start

two kinds of annotations

names of types

names of instances

instances vs. types dir.niehs.nih.gov/ microarray/datamining/

instances vs. types types dir.niehs.nih.gov/ microarray/datamining/

instances

molecular images and radiographic images are representations of instances

First basic distinction type vs. instance (science text vs. diary) (human being vs. Tom Cruise)

For ontologies it is generalizations that are important = ontologies are about types, kinds

Inventory vs. Catalog: Two kinds of representational artifact Databases represent instances Ontologies represent types

Catalog vs. inventory A 515287 DC3300 Dust Collector Fan B 521683 Gilmer Belt C 521682 Motor Drive Belt

Catalog vs. inventory

Catalog of types/Types

Ontology types Instances

Ontology = A Representation of types

An ontology is a representation of types We learn about types in reality from looking at the results of scientific experiments in the form of scientific theories experiments relate to what is particular science describes what is general

object types organism animal cat mammal siamese frog instances

Ontologies are here

or here

ontologies represent general structures in reality (leg)

Ontologies do not represent concepts in people’s heads

They represent types in reality

which provide the benchmark for integration

My job here Not tools: Leo Obrst, Chris Welty Not instances: Werner Ceusters Ontology content : the types in reality

How to build an ontology create an initial top-level classification of your domain = ~50 most common types of entities arrange the corresponding expressions terms into an informal is_a hierarchy according to this universality principle A is_a B  every instance of A is an instance of B fill in missing terms to give a complete hierarchy move on to populate the lower levels of the hierarchy) annotate your data

Example domain: threat, vulnerability Eric Little

Example domain: The ontology of documents Hernando de Soto

valuable work on ‘documents’ in the context of XML, etc. e.g. Bob Glushko: “A document  is a purposeful and self-contained collection of information.” focuses on information content, but there is more than information here

transactional documents passport contract tax form bill of lading shipping authorization plane ticket visa

the legal powers of documents the social interactions in which they play a role the institutional systems to which they belong provenance (original, copy, forgery ...)

document vs. attachments signatures, seals, stamps ...

http://www.chazj.com/indent/en/1664qa.jpg

anchoring documents to reality

Countersignatures

document template vs. filled-in document document vs. the piece of paper (or other physical carrier) upon which it is written/printed, ...

Standardized documents filled in completely/partially correctly/incorrectly validly/invalidly

from the Shiprock Navajo fair New Mexico, September 30-October 1, 2005

Standardized documents allow networking across time across space (individuals linked through document systems) improve the flow of communications allow standardized transactions

Documents are artifacts analogous to organizations, rules, prices, debts, claims and obligations ...

John Searle The Construction of Social Reality claims and obligations are brought into existence by the performance of speech acts

appointings, marryings, promisings change the world We perform a speech act ... the world changes, instantaneously

The de Soto thesis document systems are mechanisms for creating the institutional orders of modern societies

stock and share certificates create capital marriage licenses create bonds of matrimony statutes of incorporation create companies title deeds create property rights and property owners insurance certificates create insurance coverage

Identity documents create identity and thereby create the possibility of identity theft what is the ontology of identity? what is the epistemology of identity (the technologies of identification)?

What you can do with a document sign it stamp it witness it fill it in revise it nullify it deliver it (de facto, de jure) ...

types of document systems types of document acts types of document systems types of document pathways

if you’re going to semantically annotate piles of data, better work out how to do it right from the start

Tomorrow: The problems, and a strategy for the future