Ontology, RDF, SW for Chemical Structures

Slides:



Advertisements
Similar presentations
Re-learning Ontology Management for the Web Chris Welty IBM Research.
Advertisements

Terminologies: An e-Science perspective Nicholas Gibbins Intelligence, Agents, Multimedia University of Southampton.
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
Towards Adaptive Web-Based Learning Systems Katerina Georgouli, MSc, PhD Associate Professor T.E.I. of Athens Dept. of Informatics Tempus.
Information and Business Work
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Intelligent Systems Semantic Web. Aims of the session To introduce the basic concepts of semantic web ontologies.
CSE 428 Semantic Web Topics Introduction Jeff Heflin Lehigh University.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Peter Fox CSCI Week 9, October 27, 2008.
Paul Groth VU University Amsterdam Convergence Meeting: Semantic Interoperability for Clinical Research & Patient.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Serving society Stimulating innovation Supporting legislation Workshop on the INSPIRE registry and registers Martin Tuchyňa, Tomáš.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
SATuRN HIV-1 Genotyping System Justen Manasa Africa Centre for Health and Population Studies Virology laboratory based at the Nelson R. Mandela School.
Linking Tasks, Data, and Architecture Doug Nebert AR-09-01A May 2010.
IST Programme - Key Action III Semantic Web Technologies in IST Key Action III (Multimedia Content and Tools) Hans-Georg Stork CEC DG INFSO/D5
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Strategies for Realizing the Semantic Web Ian Horrocks.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
ChEBI, text mining and ontological best practice Colin Batchelor Royal Society of Chemistry
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
6 Dec Rev. 14 Dec CmpE 583 Fall 2008OWL Intro 1 OWL Intro Notes off Lacy Ch. 4 Atilla Elçi.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
APPLICATION OF ONTOLOGIES IN CANCER NANOTECHNOLOGY RESEARCH Faculty of Engineering in Foreign Languages 1 Student: Andreea Buga Group: 1241E – FILS Coordinating.
Class Diagrams. Terms and Concepts A class diagram is a diagram that shows a set of classes, interfaces, and collaborations and their relationships.
Review assessment two and three Design and develop an interactive, multimedia application to meet the documented requirements of an identified client.
Semantic Web unleashes your data! The Semantic Web will transform the use of content. Semantic Web – is an extension of the current web. Semantic Web.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Requirements Specification
Discover What’s Been Missing Vicky Hampshire 18th January 2017
RSC Strategy Gordon Dunsire, Chair, RDA Steering Committee
Chemical structure search in PATENTSCOPE
Classifying Chemistry: Current Efforts in Canada
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Agenda Federated Enterprise Architecture Vision
Linked Data Web that can be processed by machines
Towards a framework for architectural design decision support
Ilya Zaslavsky Jeffrey Grethe amarnath Gupta burak Ozyurt
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
Development of the Amphibian Anatomical Ontology
Chemical structure search in PATENTSCOPE
Data Exchange & Public Reference Data
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
SMART GROUND platform overview
Analyzing and Securing Social Networks
HIV Integrase Therapeutics
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
PREMIS Tools and Services
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Database Development Cycle
Chapter 1 Introduction to Chemistry:
Semantic Markup for Semantic Web Tools:
Zach Wahl and Tatiana Baquero Project Performance Corporation (PPC)
Web archives as a research subject
Semantic Web Towards a Web of Knowledge - Outline
An Automated Registration System
Metadata supported full-text search in a web archive
Presentation transcript:

Ontology, RDF, SW for Chemical Structures T N Bhat & J. Barkley NIST Bhat@nist.gov Query tool Use Case Publications

Major Features, Goal – to Reduce User Frustration We have established a use case at the HCLS Website - Chemical taxonomies Combining of Rule-based terms with Vocabulary-based terms to define elements of RDF Organization of the elements of RDF into predictable ontology using concepts from use cases Developing tools and techniques to present the information using familiar database environments Allows easier portability and implementation of the information by the community Illustrating the concept using high profile data such as for AIDS inhibitors and Protein Data Bank contents

Combining of Rule-based with Vocabulary-based elements to define RDF Chemical structures are definable by atomic connectivity – thus structures are suitable for identification using graph theory – InChI Suitable for machine reasoning Graphs are hard to digest for humans – therefore proposal is to combine InChI with familiar vocabularies such as Ala, Phenyl, Adenine Also include synonyms in the vocabulary for greater coverage among diverse users Vocabularies make it easier for humans to recognize the information

InChI – a Scalable URI InChI is generated using a software that decodes the chemical connectivity information in certain layers such as chirality, ring structure, atom type and then re-codes them to form a text string InChI is a naming standard for chemicals recommended by IUPAC

InChI – a rule-based URI _1_2FC10H11NO2_2Fc11-10_2812_2913-9-5-7-3-1-2-4-8_287_296-9_2Fh1-4_2C9H_2C5-6H2_2C_28H2_2C11_2C12_29

Vocabulary-based Definitions For decades scientists have been developing names to identify structures and their images Simple names His Ala DNA ATP Semi-rule-based IUPAC names 2-amino-3-methylpentanamide 4-amino-3-hydroxy-6-methylheptanoic_acid 1-[(Benzenesulfonyl-methyl-amino)-phenyl-butyl]-piperidin-4-yl}-propyl-carbamic acid, naphthalen-1-ylmethyl ester Names facilitate text-based queries of desired components Names when used together with InChI provide a smoother integration of machine and human needs

Use-Case for SW; Treatment for AIDS is a work in progress Treatments for AIDS are of two types Prevention – the most effective Containment Drugs to contain, and reduce the viral load Majority of the drugs ( ~17) target either HIV protease or RT Complete suppression of either of these viral enzymes could cure AIDS But drug resistance leads only to partial suppression of the enzymes All the drug design efforts for AIDS are based on structures Data needed for drug-design is scattered over many Web resources and users often wean through the data manually Therefore AIDS drug design is an ideal target for Semantic Web and novel new database related technologies SW connection between NIST and NIAID AIDS database Choose the problem that matters Website

Annotation Technique/Developing Structural Ontology Define compounds using chemical features of interest to use cases Fragment, subgroup, class 000503 030798 000505 1A8K

Modeling with Protégé – Suitable for Text-based Ontology

Web tools Structures are different from text based info Structures are not amenable to text-based query/rendering techniques Majority of the structural users never heard (nor want to hear!) about SPARQL – query language for RDF Commonly preferred/expected way to query is by ‘click’ Semantic Web for Structures needs new Web tools that allow navigation by clicking on structural features

Chem-BLAST for Structural Semantic Web http://bioinfo.nist.gov/SemanticWeb_pr3d/chemblast.do Prasanna et al. PROTEINS 60, 1-4 (2005). Prasanna et al. PROTEINS 63(4), 907-917(2006). Download publications

Future Plans Extend the work to chemical structures from Protein Data Bank If interest exists hold a workshop at NIST Proposed dates - last two weeks of March 2008 Workshop will be in conjunction with the NIST wide Ontology week Possible collaboration with IUPAC (International Union of Pure and Applied Chemistry ) and ChEBI Contact: Colin Batchelor BatchelorC@rsc.org RSC Publishing, Royal Society of Chemistry Community participation is essential for further development Contact bhat@nist.gov 301 975 5448 (US)