Download presentation
Presentation is loading. Please wait.
Published byKya Pinkerman Modified over 9 years ago
1
Representing the Immune Epitope Database in OWL Jason A. Greenbaum 1, Randi Vita 1, Laura Zarebski 1, Hussein Emami 2, Alessandro Sette 1, Alan Ruttenberg 3, and Bjoern Peters 1 1 La Jolla Institute for Allergy and Immunology 2 Science Applications International Corporation 3 Science Commons
2
Overview Background –Immune epitopes –Epitope mapping experiments –The Immune Epitope Database (IEDB) IEDB development cycle –Ontology development –Database design –Content curation Database export into OWL
3
Mouse Virus cell CD8 + T cell epitopes in viral infection MHC-I
4
cell MHC-I T Cytokine Release Cytotoxicity T T Proliferation Mouse TCR CD8 CD8 + T cell epitopes in viral infection Virus epitope role: the role of a material entity that is realized when it binds to an adaptive immune receptor. Context is key – What immune receptor? What host? What happened to the host previously (infections? vaccinations? diseases?)… adaptive immune response: a GO:immune response resulting from epitope binding by adaptive immune receptor
5
Entities in a epitope mapping experiment T APC T Data items –spot count –spot forming cells per million Processes –Administering substance in vivo –Take sample from organism –Perform ELISPOT assay –Transform data Material entities –Cell –Organism –Peptide Roles and Functions –Immunogen –Antigen –Antigen presenting cell –Effector cell 42 SFC/10^6
6
Goal: To catalog and make accessible immune epitope characterizing experiments IEDB www.immuneepitope.org Literature curation Epitope discovery contract submission The Immune Epitope Database (IEDB) 10 full time curators Content >6,500 journal articles >50,000 epitopes >300,000 experiments Completed: 98% infectious disease 95% allergy Next: autoimmunity (25%)
7
Example curated experiment: typically 100 – 300 fields
10
Summary I Immune epitopes are the molecular entities recognized by adaptive immune receptors The IEDB catalogs experiments defining immune epitopes Large amounts of complex data, which poses challenges for data consistency
11
Overview Background –Immune epitopes –The Immune Epitope Database (IEDB) IEDB development cycle –Ontology development –Database design –Content curation Database export into OWL
12
Development cycle Ontology development identify entities and relations Database design table structure lookup table values validation rules Content curation add new content recurate invalid content
13
Ontology development (ONTIE) Re-use terms from OBO foundry candidate ontologies Native ONTIE terms for entities specific for epitopes Goal is to find a good home for them Imports from: Gene Ontology Cell Ontology ChEBI, NCBI Taxonomy OBI Protein Ontology Information Artifact Ontology Partial high-level ‘is a’ hierarchy Available: http://ontology.iedb.org/http://ontology.iedb.org/
14
Database design / implementation Ontology terms | Database tables History: initial design (to get started) iterative updates (to fix things) redesign from scratch for 2.0 because we (still) can Tables aligned with ontology Improved understanding between software engineers and domain experts ‘ontologic normalization’
15
Content migration and re-curation IEDB 1.0 Rule based validation first pass: 693,133 inconsistencies 1.conditional field-to-field mapping 2.script based re-curation (SQL) IEDB 2.0 3. manual recuration (web interface)
16
Summary II Application specific ontology (ONTIE) developed based on OBO foundry principles, and relying heavily on OBI Database re-designed and structure aligned with the ontology Data migrated and consistency enforced by rule based validation engine
17
Overview Background –Immune epitopes –The Immune Epitope Database (IEDB) IEDB development cycle –Ontology development –Database design –Content curation Database export into OWL
18
Subset of IEDB 2.0 Database export into OWL
22
Advantages of OWL export Allows to directly use ontology and OWL reasoner to perform consistency checks Provides expressive query language within the IEDB Enables query across integrated biomedical databases.
23
Future Work Provide IEDB in triple store / access through SPARQL queries Complete ontology development and OWL export for all data in the IEDB Overcome technical challenges (Pellet takes 1 minute to classify 100 assays; 300,000 in IEDB…) Overcome ontological challenges (cells, peptides, negative data, …)
24
IEDB Team - www.iedb.org SAIC Scott Stewart Tom Carolan Hussein Emami San Diego Supercomputer Center Phil Bourne Julia Ponomarenko Zhanyang Zhu Technical University of Denmark Ole Lund Morten Nielsen University of Copenhagen Søren Buus La Jolla Institute for Allergy & Immunology THANKS! OBI Consortium - http://obi-ontology.org Alan Ruttenberg – Science Commons
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.