Presentation is loading. Please wait.

Presentation is loading. Please wait.

What is an Ontology? A representation of knowledge in a domain In theory Thomas Gruber (1993) “An ontology is a formal, explicit specification of a shared.

Similar presentations


Presentation on theme: "What is an Ontology? A representation of knowledge in a domain In theory Thomas Gruber (1993) “An ontology is a formal, explicit specification of a shared."— Presentation transcript:

1 What is an Ontology? A representation of knowledge in a domain In theory Thomas Gruber (1993) “An ontology is a formal, explicit specification of a shared conceptualization.” In English An ontology makes explicit… What things are we talking about? How are those things related to each other? Why? Defining a common vocabulary in which knowledge can be formally represented can support sharing and reuse among software systems but perhaps more importantly an ontology can support sharing and reuse of application domain knowledge. How? Applied to datasets

2 1.Conceptualization refers to an abstract model of some phenomenon in the world that you want to represent in an ontology. 2.Specification is a description of what you want an ontology to do. 3.Explicit means that the type of concepts used, and the constraints on their use are explicitly defined. 4.Formal refers to the fact that the ontology should be machine- readable. 5.Shared reflects the notion that an ontology captures consensual knowledge, that is, it is not private of some individual, but accepted by a group. R. Studer, V. R. Benjamins, and D. Fensel: Knowledge engineering: Principles and methods. Data and Knowledge Engineering (DKE), 25(1-2):161-197, 1998. “An ontology is a formal, explicit specification of a shared conceptualization.”

3 Debate between ‘purists’ (philosophers/AI), who want an ontology to be absolutely correct, and ‘practitioners’ (e.g. biologists), who want to get things done, has become -- over the last few years -- a fruitful collaboration.

4 Ontologies are often represented as hierarchies (graphs, trees) Nodes are the entities and edges are the relationships For example, the Ice_Cream_Ontology (ICO) Entities are various ice cream treats, relationship “is_a” subtype Ontologies can be more complicated that this taxonomy tree. For example, the GO is represented as a DAG, directed acyclic graph, since a term (entity) can have more than one parent

5 Different sorts of ontologies Top level/Upper – captures fundamental aspects of reality regardless of the domain Continuant vs. occurrent; (cat, cat chasing a mouse) Type vs. instance (cat, “Fluffy”) Domain Reference/Core/Foundational – (complete) representation of a domain Canonical – defining ‘normal’ Application ontology – Integration of different types of entities for a specific purpose For example, Biomedical domain OBO (Open Biomedical Ontologies) – a suite of orthogonal interoperable reference ontologies Mappings between ontologies: e.g. lung (anatomy), lung development (biological process), abnormal lung development (phenotype) Using OBO-Relations – also an ontology

6 Cell cycle Essential highly conserved process that is important in understanding cancer. The Cell Cycle Ontology (CCO) is an application ontology that integrates knowledge about the cell cycle from a diverse set of already existing resources. http://www.cellcycleontology.org/ organism protein-protein interactions biological process protein gene

7 Contains two types of biological entities Continuant – Object Occurrent – Process

8 But… How to represent ‘cell cycle’ in DAG (acyclic) format OBO Temporal relations

9 Cell cycle ontology currently exists for Homo sapiens (human) Arabidopsis thaliana (thale cress) Saccharomyces cerevisiae (brewer’s yeast) Schizosaccharomyces pombe (fission yeast) None for mouse… yet!

10 CCO human CCO mouse Parse cross-references to GO extract human genes MGI Mouse GO Human-Mouse orthologs Phenotype, OMIM Expression Pathways Perl – regular expressions Database – SQL, joins ?

11 [Term] id: CCO:B0000111 name: SWI4_YEAST def: "Regulatory protein SWI4" [UniProtKB:P25302] synonym: "Cell-cycle box factor subunit SWI4" EXACT [UniProtKB:P25302] synonym: "Protein ART1" EXACT [UniProtKB:P25302] xref: EMBL:AAC03209.1 xref: EMBL:CAA35949.1 xref: UniProtKB:P25302 is_a: CCO:B0000000 ! core cell cycle protein relationship: encoded_by CCO:G0002318 ! SWI4_yeast relationship: has_function CCO:F0000274 ! DNA binding relationship: has_function CCO:F0000289 ! transcription factor activity relationship: has_function CCO:F0001646 ! protein binding relationship: has_function CCO:F0005713 ! identical protein binding relationship: has_source CCO:T0000016 ! Saccharomyces cerevisiae organism relationship: located_in CCO:C0000252 ! nucleus relationship: participates_in CCO:I0002887 ! swi4-ssa1 physical interaction relationship: participates_in CCO:I0003305 ! swi6-mpg1 physical interaction relationship: participates_in CCO:I0005128 ! ho-491 physical interaction relationship: participates_in CCO:I0005527 ! swi4-2 physical interaction relationship: participates_in CCO:P0000012 ! G1/S transition of mitotic cell cycle relationship: transforms_into CCO:B0009551 ! SWI4_YEAST-Phosphoserine159 relationship: transforms_into CCO:B0009552 ! SWI4_YEAST-Phosphoserine806 relationship: transforms_into CCO:B0009553 ! SWI4_YEAST-Phosphoserine1003 relationship: transforms_into CCO:B0009554 ! SWI4_YEAST-Phosphoserine1007


Download ppt "What is an Ontology? A representation of knowledge in a domain In theory Thomas Gruber (1993) “An ontology is a formal, explicit specification of a shared."

Similar presentations


Ads by Google