Service-enabling Biomedical Research Enterprise

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Using DAML format for representation and integration of complex gene networks: implications in novel drug discovery K. Baclawski Northeastern University.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Jennifer A. Dunne Santa Fe Institute Pacific Ecoinformatics & Computational Ecology Lab Rich William, Neo Martinez, et al. Challenges.
Ontology Notes are from:
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
1 CIS607, Fall 2006 Semantic Information Integration Instructor: Dejing Dou Week 10 (Nov. 29)
1 Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies.
Medical Informatics Basics
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Managing & Integrating Enterprise Data with Semantic Technologies Susie Stephens Principal Product Manager, Oracle
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
Digital Enterprise Research Institute HADA – An Access Controlled Application for Publishing and Discovering Linked Government Data Owen Sacco.
Advancing translational research with the Semantic Web Ruttenberg, Clark, Bug, Samwald, Bodenreider, Chen, Doherty, Forsberg, Gao, Kashyap, Kinoshita,
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Semantic Web - an introduction By Daniel Wu (danielwujr)
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
BBN Technologies Copyright 2009 Slide 1 The S*QL Plugin for Cytoscape Visual Analytics on the Web of Linked Data Rusty (Robert J.) Bobrow Jeff Berliner,
12/7/2015Page 1 Service-enabling Biomedical Research Enterprise Chapter 5 B. Ramamurthy.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Mining the Biomedical Research Literature Ken Baclawski.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
1 Integration of data sources Patrick Lambrix Department of Computer and Information Science Linköpings universitet.
International Workshop 28 Jan – 2 Feb 2011 Phoenix, AZ, USA Ontology in Model-Based Systems Engineering Henson Graves 29 January 2011.
RDF based on Integration of Pathway Database and Gene Ontology SNU OOPSLA LAB DongHyuk Im.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Genomic Medicine Grid Juan Pedro Sánchez Merino Instituto de Salud Carlos III
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
1 2. Knowledge Management. 2  Structuring of knowledge enables effective and efficient problem solving dynamic learning strategic planning decision making.
Database Principles: Fundamentals of Design, Implementation, and Management Chapter 1 The Database Approach.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
‘’ DUBNA, 2016, July, 07 Yury Akatkin, Elena Yasinovskaya
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Conceptualizing the research world
Biological Databases By: Komal Arora.
KnowEnG: A SCALABLE KNOWLEDGE ENGINE FOR LARGE SCALE GENOMIC DATA
CCNT Lab of Zhejiang University
The Role of Ontologies for Mapping the Domain of Landscape Architecture An introduction.
Knowledge Management Systems
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
knowledge organization for a food secure world
Sponsored by the University of Southampton
Taxonomies, Lexicons and Organizing Knowledge
Department of Genetics • Stanford University School of Medicine
Functional Annotation of the Horse Genome
File Systems and Databases
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Bioinformatics Vicki & Joe.
LOD reference architecture
IDEAS Core Model Concept
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Semantic Interoperability in Digital Library Systems
AI Discovery Template IBM Cloud Architecture Center
Database Systems: Design, Implementation, and Management Tenth Edition
Presentation transcript:

Service-enabling Biomedical Research Enterprise Chapter 5 B. Ramamurthy 5/1/2019

Introduction Life sciences have witnessed a flurry of innovations triggered by sequencing of human genome as well as genomes of other genomes. Area of transformational medicine aims to improve communication between basic and clinical science to allow more therapeutic and diagnostic insights. 5/1/2019

Translational medicine From bench to bedside Exchange ideas, information and knowledge across organizational, governance, socio-cultural, political and national boundaries. Currently mediated by the internet and exponentially-increasing resources Digital resources: scientific literature, experimental data, curated annotation (metadata) human and machine generated. Ex: Blast Searches NCBI taxonomy 5/1/2019

Driving principles Key requirements: large volume of data to be managed. How? Transform to Digital Machine readable Capable of being filtered Aggregated Transformed automatically Context information: use and meaning along with content Knowledge integration: combines data from research in mouse genetics, cell bilogy, animal neuropsychology, protein biology, neuropathology, and other areas. Attention to drug discovery, systems bilogy and personalized medicine that rely heavily on integrating and interpreting data produced by experiments. Heterogenious data 5/1/2019

BioSem Enterprise Architecture Transform results Ex: integrate, generate metadata Dissemination Of results Clinical experiments Ex: drug discovery Diagnostic tools Research Knowledge Ex: Blast Clinical data Ex: JNI ontology Academic Knowledge Ex: cell, psychology molecular Treatment methods 5/1/2019

Use case Parkinson’s disease (PD): System physiology perspective Cellular and molecular biology perspective Pharmacology relating to chemical compounds that bind to receptors Example query: show me the neuronal components that bind to a ligand which is a therapeutic agent in Parkinson’s disease in reach of the dopaminergic neurons in the substania nigra. Domain specific shared semantics and classifications Ontologies can help map among the domains and support seamless integration and interoperation. 5/1/2019

Development of Ontologies Manual interaction between ontologists in experts Textual descriptions are used for adding to this base Link pre-existing ontologies for extensive coverage 5/1/2019

Ontology design and creation Approach (fig. 5.1) Subject matter Knowledge (Text) Identify core terms And phrases Map phrases to Relationship between classes Model terms using ontological Constructs: classes, properties Arrange classes and relationships in subsumption hierarchies Information queries Identify new classes and relationships Refine subsumption hierarchies Pre-existing classifications And ontologies Re-use classes and relationships Extenf subsumption hierarchies 5/1/2019

Identifying concepts and hierarchies Text describing PD in p.105 Study the analysis Based on the analysis identify important ontological concepts relevant to PD: Genes Proteins Genetic mutations Diseases See fig. 5.2 Next step is to identify relationship among concepts 5/1/2019

Identifying and extracting relationships 5/1/2019

Extending the ontology based on information queries Consider various queries and identify concepts and relationships needed to be part of PD ontology. These concepts are needed to retrieve information and knowledge from the system. This lead to additional new concepts. See fig.5.4 5/1/2019

PD: adding concepts to support information queries 5/1/2019

Ontology Re-use It is desirable to re-use the ontology and vocabulary developed in the healthcare and life-sciences fields. Diseases: PD information can be used in Huntington’s and Alzeimer’s. PD can reuse information from International classification of diseases ICD and its subset SNOMED. Genes: more genes and genomic concepts such as proteins, pathways are added to ontologies. Consider connecting to Gene Ontology. Neurological concepts: Consider using Neuro names 2007. Enzymes: concepts related to enzymes and other chemicals may be required; you may use Enzyme Nomenclature 2007 Be aware of inconsistencies and circularities. Multiple models may emerge; choice should be based on use cases and functional requirements. 5/1/2019

Data sources Now answering the question that we posted in slide#6, three data sources need to be integrated: Neuron database, PDSP KI database, PubChem 5/1/2019

Data Integration A centralized approach where data available through web based interfaces is converted into RDF and stored in a centralized repository A federated approach where data continues to reside in the existing repositories. RDF mediator converts underlying data into RDF format. RDF allows for focus on logical structures of information in contrast to only representational format (XML) or storage format (relational). 5/1/2019

Mapping ontological concepts to RDF graphs Sample query discussed earlier results in these concepts: Compartment located_on Neuron Receptor located_in Compartment Ligand binds_to Receptor Ligand associated_with Disease Next task to map these into RDF maps in the underlying data sources. Using ontological definitions, data sources, SPARQL queries, and name space, RDF graphs are extracted. 5/1/2019

Generation and merging of RDF graphs UR14 Parkinson’s disease UR16 D_Neuron UR12 Neuron Database type binds_to associated_with Neuron UR12 D1 UR14 5-H Tryptamine UR15 5-H Tryptamine UR15 Located_in D_Dendrite UR12 Located_in PDSPKI Database PubChem database 5/1/2019

Integrated RDF graph 5/1/2019 Parkinson’s disease UR16 D_Neuron UR12 type associated_with Neuron UR12 5-H Tryptamine UR15 Located_in binds_to D1 UR14 D_Dendrite UR12 Located_in 5/1/2019

Assignment 2 Consider the PD case study that used ontological approach to querying distributed databases. Discuss 10 reasons of using this approach as opposed to common SQL query and relational database approach. Why is Google, Yahoo or MSN search not good enough for searching biological database? Discuss centralized and federated approach to data integration in the context of this case study. Submit a softcopy of the document in the digital drop box. How to do this? Read Chapter 5, read it again. The answers can be formed from the information provided there and from your experience with relational database systems. 5/1/2019

Summary Semantic web technologies provide an attractive technological informatics foundation for enabling the Bench to Bedside Vision. Many areas of biomedical research including drug discovery, systems biology, personalized medicine rely heavily on integrating and interpreting heterogeneous data set. This is part of ongoing work in the framework of the work being performed in the Healthcare and Life Sciences Interest Group of W3C. 5/1/2019