Ontology for Biomedical Investigation

Slides:



Advertisements
Similar presentations
The MGED Ontology: Providing Descriptors for Microarray Data Trish Whetzel Department of Genetics Center for Bioinformatics University of Pennsylvania.
Advertisements

Developing an application ontology for biomedical resource annotation and retrieval: challenges and lessons learned C. Torniai, M. Brush, N. Vasilevsky,
Representing the Immune Epitope Database in OWL Jason A. Greenbaum 1, Randi Vita 1, Laura Zarebski 1, Hussein Emami 2, Alessandro Sette 1, Alan Ruttenberg.
Species-Neutral vs. Multi-Species Ontologies Barry Smith.
Standardizing Metadata Associated with NIAID Genome Sequencing Center Projects Richard H. Scheuermann, Ph.D. Department of Pathology Division of Biomedical.
Goal and Status of the OBO Foundry Barry Smith. 2 Semantic Web, Moby, wikis, crowd sourcing, NLP, etc.  let a million flowers (and weeds) bloom  to.
Functional Genomics Ontology FuGO and Metabolomics Society Ontology group Susanna-Assunta Sansone Nutr/Toxicogenomics Projects Coordinator EMBL-EBI Metabolomics.
 Goals Unambiguous description of how the investigation was performed Consistent annotation, powerful queries and data integration  Details NOT model.
Metadata For CARMEN Phillip Lord and Frank Gibson.
FuGO: Development of a Functional Genomics Ontology (FuGO) Patricia L. Whetzel 1, Helen Parkinson 2, Assunta-Susanna Sansone 2,Chris Taylor 2, and Christian.
He Lab Overview: Systems Biology Studies of Host-pathogen Interactions and Vaccines (Workshop on Data, Text, Web & Social Network Mining) Yongqun “Oliver”
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
National center for ontological research. Part One: The History of NCOR and ECOR Part Two: How to Establish JCOR: The Japanese Consortium.
How to Organize the World of Ontologies Barry Smith 1.
New York State Center of Excellence in Bioinformatics & Life Sciences Biomedical Ontology in Buffalo Part I: The Gene Ontology Barry Smith and Werner Ceusters.
The OBO Foundry approach to ontologies and standards with special reference to cytokines Barry Smith ImmPort Science Talk / Discussion June 17, 2014.
Richard H. Scheuermann, Ph.D. Department of Pathology Division of Biomedical Informatics U.T. Southwestern Medical Center Standardizing Metadata Associated.
Enriching the Ontology for Biomedical Investigations (OBI) to Improve Its Suitability for Web Service Annotations Chaitanya Guttula, Alok Dhamanaskar,
Limning the CTS Ontology Landscape Barry Smith 1.
Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.
The MGED Society Facilitating Data Sharing and Integration with Standards CTSA Omics Data Standards Working Group Chris Stoeckert Dept. of Genetics and.
The Functional Genomics Experiment Model (FuGE) Andy Jones School of Computer Science and Faculty of Life Sciences, University of Manchester.
OBI Tutorial Overview 8:30 AM Introduction To OBI – Presenter: Christian Stoeckert – Content: Overview of.
Cell Ontology 2.0 Elimination of multiple is_a inheritance through instantiation of relationships to terms in outside ontologies, such as the GO cellular.
The Chronious Ontology Suite: Methodology and Design Principles Luc Schneider[1], Mathias Brochhausen[1,2] [1] Institute for Formal Ontology and Medical.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Leveraging Ontologies for Human Immunology Research Barry Smith, Alexander Diehl, Anna- Maria Masci Presented at Leveraging Standards and Ontologies to.
EMBL- EBI Wellcome Trust Genome Campus Hinxton, Cambridge, CB10 1SD, UK Standards and infrastructure for managing experimental metadata Philippe Rocca-Serra,
Alan Ruttenberg PONS R&D Task force Alan Ruttenberg Science Commons.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
The Functional Genomics Experiment Object Model (FuGE) Andrew Jones, School of Computer Science, University of Manchester MGED Society.
BFO-aligned Ontologies for Clinical and Translational Research: OGMS, IDO, and VO (Orlando Presentation, 2/8/2013)
Data Integration and Management A PDB Perspective.
Representing Flow Cytometry Experiments within FuGE Josef Spidlen 1, Peter Wilkinson 2, and Ryan Brinkman 1 1 BC Cancer Research Centre, Vancouver, BC,
To Boldly GO… Amelia Ireland GO Curator EBI, Hinxton, UK.
The MGED Ontology W3C Workshop on Semantic Web for life Sciences October 27, 2004 Presented by Liju Fan MGED Ontology Working Group Senior Scientist, KEVRIC.
Master headline RDFizing the EBI Gene Expression Atlas James Malone, Electra Tapanari
The Ontology for Biobanking Chris Stoeckert, Jie Zheng, and Mathias Brochhausen University of Pennsylvania and University of Arkansas for Medical Sciences.
PRO and the NIF / ImmPort Antibody Registries Alexander Diehl Protein Ontology Workshop 6/18/14.
Sharing the knowledge of electrophysiology data Phillip Lord, Frank Gibson and the CARMEN Consortium.
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
OBO Foundry Workshop 2009 Cell Ontology (CL) Preliminary review.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Describing Bioinformatic Metadata at EBI James Malone
TRANSITION FROM BFO 1.1 TO BFO 2.0 (OWL FORMAT) Jie Zheng Department of Genetics University of Pennsylvania May 13 th, 2013.
Immunology Ontology Rho Meeting October 10, 2013.
Ontology Driven Data Collection for EuPathDB Jie Zheng, Omar Harb, Chris Stoeckert Center for Bioinformatics, University of Pennsylvania.
Immunology Ontology Workshop Buffalo, NY June 11-13, 2012.
Contributions to mouse BIRN tools and resources Maryann Martone and Mark Ellisman University of California, San Diego 2008.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
1 LS DAM Overview August 7, 2012 Current Core Team: Ian Fore, D.Phil., NCI CBIIT, Robert Freimuth, Ph.D., Mayo Clinic, Mervi Heiskanen, NCI-CBIIT, Joyce.
Ukpmc.ac.uk As a result of the mandates Research in the open How mandates work in practice 29 th May, 2009 Paul Davey, UK PubMed Central Engagement Manager,
Immune Epitope Database assays. Standard immune epitope definition Classical (textbook) definition: An epitope, also known as antigenic determinant, is.
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
Randi Vita, M.D. Better living through ontologies at the Immune Epitope Database La Jolla Institute for Allergy & Immunology Division of Vaccine Discovery.
The National Center for Biomedical Ontology
Metadata mapping in bioCADDIE: challenging cases
Building a community for genome and proteome annotation
VO: Vaccine Ontology Abstract Discussion Statistics of Current VO
Mental Functioning and the Gene Ontology
Data challenges in the pharmaceutical industry
Functional Annotation of the Horse Genome
OBO Foundry Principles
Ontology of biomedical investigations (OBI)
CCO: concept & current status
OBI – Standard Semantic
NCBO CTSA workshop, Baltimore
OBO Foundry Update: April 2010
The National Center for Biomedical Ontology
Presentation transcript:

Ontology for Biomedical Investigation 3/16/2016, ACS, San Diego Bjoern Peters La Jolla Institute for Allergy and Immunology

My hats High throughput wetlab-studies mandated to submit data to repositories Bioinformatics core; data management and analysis (at institute + for multi-project grants) Immune Epitope Database Manual literature curation Receiving data submissions from Development of machine learning tools Ontology development (OBI) Principles for ontology development (OBO foundry)  Complete lack of focus and inability to say no

Acronym similarity and beyond: OBI and OBO The Open Biomedical Ontologies (OBO) project has two groups of member ontologies OBO foundry OBO library The Ontology of Biomedical Investigations (OBI) is a member ontology of the OBO foundry project  Start with the OBO foundry

OBO foundry history: Gene Ontology Model organism databases wanted to use the same vocabulary to annotate gene products  Gene Ontology Consortium 16,000 citations (!)

OBO foundry history: Gene Ontology Killer app 1 (design goal): Predicting function of a newly discovered gene Compare sequence of gene with genes of known function GO ensures same vocabulary is used across model organisms Hierarchy of terms enables dealing with different level of details: Hit 1: GO:0005262 = ‘calcium channel activity’ Hit 2: GO:0005245 = ‘voltage-gated calcium channel activity’  Both hits consistent (GO:0005245 is_a GO:0005262) Killer app 2 (bonus): Gene set enrichment analysis for microarrays Measure gene expression in presence or absence of a chemical Determine which genes are expressed differently ( typically >100) Determine if there is an enrichment of GO annotations in the differentially expressed genes (e.g. DNA repair  chemical causes damage)

OBO foundry history: Beyond GO The practical success of GO made ontologies a thing in biomedical science Knowledge representation community became engaged  more formal ontology development beyond controlled vocabulary + acyclic graph Sibling ontologies emerged that wanted to ‘plug in’ to GO + expand beyond genomics Transcriptomics Proteomics Metabolomics Proliferation of incompatible ontologies with overlapping scope Rather than mimicking the attributes that made GO successful, this resembled the status quo ante of incompatible model organism DBs

Principles include (among others): Mission: Develop a family of interoperable ontologies that are both logically well-formed and scientifically accurate Approach: Participants voluntarily adhere (and contribute) to ontology development principles that facilitate the foundry mission Principles include (among others): licensing that facilitates re-use common machine readable syntax standardized naming conventions non-overlapping content and re-use Ontologies that want to participate in the Foundry are peer-reviewed for principle compliance http://www.obofoundry.org/

Eight OBO Foundry ontologies Over hundred of ontologies in OBO Library OBO member ontologies Eight OBO Foundry ontologies + Disease Ontology = 9 (2016) Over hundred of ontologies in OBO Library http://www.obofoundry.org/

Gene Ontology http://www.obofoundry.org/ Mailing list, contact developers For issues and new term request http://www.obofoundry.org/

OBO foundry principles Since 2014: Review of Review process  Ongoing process to clarify the wording of the principles and criteria to be used to evaluate ontologies for compliance Notify us of problems via issue tracker https://github.com/OBOFoundry/OBOFoundry.github.io/issues Common sense principles open – licensing that enables re-use format – OWL / OBO uris - identifiers versioning – deal with it documented – tell others how to use it locus of authority – contact person users – have more than 1 Metadata principles textual definitions naming conventions Foundry specific principles delineated content relations collaboration maintenance http://www.obofoundry.org/

Original Focus of OBO foundry: ‘Biological Reality’ Gene ontology (GO, the start of it all) Molecular function Biological process Cellular component Relationship Ontology (RO, evolved with gene ontology) Ontologies describing complementary entities to GO (Anatomy (XFO, ZFA), Chemical entities (ChEBI), Proteins (PRO), Phenotypes (PATO)) Focus on representing assertions like: “chemical X inhibits protein Y” Problem: Lack of context/evidence/provenance/metadata to go along with these assertions We would like to know: Who said that? Based on what? Crystal structure of chemical binding to active site? Protein expression level in animal administered with chemical?  OBI goal: represent the experiment / investigation

OBI Workshop San Diego Jan. OBI Origins MO/ MAGE FuGO FuGE 2nd FuGO Workshop Hinxton July OBI 2004 2005 2006 2007 MAGE Jamboree Hinxton Dec 1st FuGO Workshop Philadelphia Feb. Cancer Genomics Polypmorphisms Genome Sequences Crop Sciences OBI Workshop San Diego Jan. MGED 8 Bergen Sept. SOFG Philadelphia Oct MAGE Jamboree Stanford March Transcriptomics (MGED) Proteomics (PSI) PSI Siena April Toxicogenomics Environmental Genomics Nutrigenomics (MGED RSBI) Metabolomics Flow Cytometry Cellular Assays Immport IEDB Neuroinformatics

OBI – a user driven project Established 2007 (along with OBO foundry) First stable release in 2009 Current members typically have one or more applications that drive OBI development. Funding comes from these projects 9 year running effort, 1-2 phone calls per week, 1-2 face-to-face meetings per year  Open project with constant addition of new communities, please consider joining!

>30 projects involved with OBI Adverse Event Reporting Ontology (AERO) eNanoMapper NCBO Resource Index An Ontology for Drug Discovery Investigations (DDI) EuPathDB Neuroscience Information Framework (NIF) Antibody Registry Evidence Ontology (ECO) NIAID GSC/BRC project Beta Cell Genomics Ontology (BCGO) Experimental Factor Ontology (EFO) Ontology for Biological and Clinical Statistics (OBCS) BioSharing Functional Genomics Data (FGED) Society Ontology of Core Data Mining Entities (OntoDM) Brucellosis Ontology (IDOBRU) Immune Epitope Database and Analysis Resource (IEDB) Statistics Ontology (STATO) Cardiac Electrophysiology (CEP) Ontology Influenza Ontology The Encyclopedia of DNA Elements (ENCODE) Cell Line Ontology Informed Consent Ontology (ICO) The Investigation / Study / Assay (ISA) tab-delimited (TAB) Chemical Methods Ontology (CHMO) miRNA and Aging Ontology (MIAGO) Vaccination Informed Consent Ontology (VICO) Cognitive Paradigm Ontology (CogPO) NanoParticle Ontology Vaccine Ontology eagle-i National Center for Biomedical Ontology (NCBO) Annotator

Partial high level class overview of OBI BFO IAO OBO OBI is a entity continuant occurrent information content entity planned process data item investigation specimen collection material component separation plan specification material combination assay document material processing material entity biological process (GO) material maintenance processed material specimen gross anatomical part (CARO) organism (NCBI taxonomy) molecular entity (ChEBI) organization device processed specimen study design specifically dependent independent generically function role human subject enrollment Upper level ontology (ignore) OBI imports foundry ontologies

Example metadata for OBI term OWL annotation property  Example Preferred Term* glucometer Definition* A measurement device with the function to measure and record the level/amount of glucose in a blood sample Term Editor* PERSON:Frank Gibson PERSON:Helen Parkinson Definition Source* http://en.wikipedia.org/wiki/Glucose_meter Curation Status* Ready for release Example of usage Diabetic patients use glucometers to determine their glucose levels Alternative Term glucose meter * required

Example logical definition Asserted single inheritance hierarchy. Inferred multiple inheritance

Modeling an experiment in OBI Measure Function is realized by Process: Collecting specimen from organism inheres in Process: Analyte assay Glucometer Measurement Datum 1.2 mg/ml has specified input has specified output has specified input has specified input has specified input has specified output inheres in located in is realized by Specimen Role is realized by is realized by Analyte Role Glucose molecules Blood Specimen Evaluant Role inheres in part of inheres in

Applications of OBI Adding semantic expressivity to data stored in the IEDB Designing smart, standardized submission forms for EuPathDB Harmonize the annotation across different functional genomics resources using ISA-TAB

What is the OBI killer app? For us (=data resource providers): Standardize metadata capture at data generation Formulate curation rules + consistency checks Drive query interfaces Inference of missing data For the broader community Aggregation of experimental evidence, summarize and resolve differences Query across resources

Pain Points: Using community standards in a production system They don’t change (fast enough to keep up with knowledge) They change (and make existing data invalid) Scope limitations We are now formalizing our workflow to address this

Internal source of truth: Nomenclature standards + homemade extensions Community standards Internal source of truth: Nomenclature standards + homemade extensions In House Human subject enrollment system (HLA typing) Immune Epitope database (IEDB) curation external DB MHC binding prediction tools In House MHC Binding Assay Database

What should we be doing now? How should we be doing it? Ontology developers: Adhere to and contribute to OBO foundry principles Consider submitting ontology for foundry membership Do not duplicate efforts and limit benefits; rather contribute to existing efforts to make them better Data generators Use OBO foundry ontologies for data capture Contribute to ontologies rather than complaining that they are incomplete!!! Include personnel costs in your grants for this Resist the temptation to build your own ontology island Where are things going in the future? Depends. Data scientists could be drivers of knowledge discovery, or forever deal with mapping identifiers

Acknowledgements OBO Foundry: http://www.obofoundry.org/ Colin Batchelor (E), Royal Society of Chemistry, UK Mathias Brochhausen (O) Melanie Courtot (O, T), Simon Fraser University, Vancouver, Canada Melissa Haendel (O), Oregon Health & Science University, Portland, OR (OBO coordinator) Janna Hastings (E, O) Suzi Lewis (OBO coordinator) Chris Mungall, Lawrence Berkeley Laboratory, Berkeley, USA (OBO coordinator) (T) Darren Natale (E) James A. Overton (T), Knocean.com, Toronto, Canada Bjoern Peters (E) Philippe Rocca-Serra (E, O) Alan Ruttenberg (O, T), University at Buffalo, Buffalo, USA (OBO coordinator) Richard Scheuermann (OBO coordinator) (O), J. Craig Venter Institute, La Jolla, CA, USA Lynn Schriml (E,O) (OBO coordinator) Barry Smith (OBO coordinator) (O) Chris Stoeckert (O), University of Pennsylvania, Philadelphia, PA, USA Ramona Walls (E, O), The iPlant Collaborative, University of Arizona, Tucson, AZ, USA Jie Zheng (T), University of Pennsylvania, Philadelphia, USA OBI consortium: http://obi-ontology.org/ OBI: Anita Bandrowski1, Ryan Brinkman2, Mathias Brochhausen3, Matthew H. Brush4, Bill Bug†, Marcus C. Chibucos5, Kevin Clancy6, Mélanie Courtot7, Dirk Derom8, Michel Dumontier9, Liju Fan10, Jennifer Fostel11, Gilberto Fragoso12, Frank Gibson13, Alejandra Gonzalez-Beltran14, Melissa A. Haendel4, Yongqun He15, Mervi Heiskanen16, Tina Hernandez-Boussard9, Mark Jensen17, Yu Lin15, Allyson L. Lister14, Phillip Lord18, James Malone19, Elisabetta Manduchi20, Monnie McGee21, Norman Morrison22, James A. Overton23, Helen Parkinson19, Bjoern Peters23*, Philippe Rocca-Serra14, Alan Ruttenberg17, Susanna-Assunta Sansone14, Richard H. Scheuermann24, Daniel Schober25, Barry Smith17, Larisa N. Soldatova26, Christian J. Stoeckert Jr.20, Chris F Taylor19, Carlo Torniai4, Jessica A. Turner27, Randi Vita23, Patricia L. Whetzel1, Jie Zheng20 1University of California, San Diego, La Jolla, California, USA 2British Columbia Cancer Research Centre, Vancouver, British Columbia, Canada 3University of Arkansas for Medical Sciences, Little Rock, Arkansas, USA 4Oregon Health & Science University, Portland, Oregon, USA †Deceased 5University of Maryland School of Medicine, Baltimore, Maryland, USA 6Thermo Fisher Scientific, Carlsbad, California, USA  7Simon Fraser University, Burnaby, British Columbia, Canada 8The Vrije Universiteit Brussel, Ixelles, Brussels, Belgium 9Stanford University, Stanford, California, USA 10Ontology Workshop, LLC, Columbia, Maryland, USA 11National Toxicology Program, NIEHS, National Institutes of Health, Research Triangle Park, North Carolina, USA 12Center for Biomedical Informatics and Information Technology, National Institutes of Health, Rockville, Maryland, USA 13Royal Society of Chemistry, Cambridge, Cambridgeshire, United Kingdom 14University of Oxford, Oxford, Oxfordshire, United Kingdom 15University of Michigan Medical School, Ann Arbor, Michigan, USA 16National Cancer Institute, Rockville, Maryland, USA 17University at Buffalo, Buffalo, New York, USA 18Newcastle University, Newcastle-upon-Tyne, Tyne and Wear, United Kingdom 19European Molecular Biology Laboratory- European Bioinformatics Institute, Hinxton, Cambridgeshire, United Kingdom 20University of Pennsylvania, Philadelphia, Pennsylvania, USA 21Southern Methodist University, Dallas, Texas, USA 22The University of Manchester, Manchester, Greater Manchester, United Kingdom 23La Jolla Institute for Allergy and Immunology, La Jolla, California, USA 24J. Craig Venter Institute, La Jolla, California, USA 25Leibniz Institute of Plant Biochemistry, Halle, Saxony-Anhalt, Germany 26Brunel University London, Uxbridge, Middlesex, United Kingdom  27Georgia State University, Atlanta, Georgia, USA  Need to add industry disclosure