Advancing translational research with the Semantic Web Ruttenberg, Clark, Bug, Samwald, Bodenreider, Chen, Doherty, Forsberg, Gao, Kashyap, Kinoshita,

Slides:



Advertisements
Similar presentations
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
Advertisements

Using DAML format for representation and integration of complex gene networks: implications in novel drug discovery K. Baclawski Northeastern University.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
CS570 Artificial Intelligence Semantic Web & Ontology 2
Linked Data for Libraries, Archives, Museums. Learning objectives Define the concept of linked data State 3 benefits of creating linked data and making.
Adopt HIMSS Definition of Interoperability Become a Champion for Interoperability.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
A Secure Interoperable Infrastructure For Healthcare Information System Ehsan ul Haq Abrar Ahmed Sair
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
1 CIS607, Fall 2005 Semantic Information Integration Instructor/Organizer: Dejing Dou Week 1 (Sept. 28)
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
What is “Biomedical Informatics”?. Biomedical Informatics Biomedical informatics (BMI) is the interdisciplinary field that studies and pursues.
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Development Principles PHIN advances the use of standard vocabularies by working with Standards Development Organizations to ensure that public health.
Linked Data Visualizations for Eurostat Linked Data Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Practical RDF Chapter 1. RDF: An Introduction
Linked data the next network?. The Web of documents is for people The Web of data is for computers The Web of documents is difficult for computers to.
From Bench to Bedside: Applications to Drug Discovery and Development Eric Neumann W3C HCLSIG co-chair Teranode Corporation HCLSIG F2F Cambridge MA.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Schema Interoperability Liam Magee Global Cities Institute RMIT University Melbourne, Australia.
RDF and triplestores CMSC 461 Michael Wilson. Reasoning  Relational databases allow us to reason about data that is organized in a specific way  Data.
Semantic Web - an introduction By Daniel Wu (danielwujr)
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
SSO: THE SYNDROMIC SURVEILLANCE ONTOLOGY Okhmatovskaia A, Chapman WW, Collier N, Espino J, Conway M, Buckeridge DL Ontology Description The SSO was developed.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
You sexy beast. Ok, inappropriate. How about: Web of links to Web of Meaning Hello Semantic Web!
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
OWL Representing Information Using the Web Ontology Language.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
BBN Technologies Copyright 2009 Slide 1 The S*QL Plugin for Cytoscape Visual Analytics on the Web of Linked Data Rusty (Robert J.) Bobrow Jeff Berliner,
12/7/2015Page 1 Service-enabling Biomedical Research Enterprise Chapter 5 B. Ramamurthy.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
The Neuroscience information framework A User’s Guide.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Chapter 7 K NOWLEDGE R EPRESENTATION, O NTOLOGICAL E NGINEERING, AND T OPIC M APS L EO O BRST AND H OWARD L IU.
Craig Kuziemsky April 24, The need for a MDS for HPEOLC 2. Existing work – Canada 3. Existing work - International 4. Discussion 5. Questions.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
Syntax and semantics >AMYLASEE1 TGCATNGY A very simple FASTA file.
How Semantic Web Technologies are Enabling the Bench to Bedside Vision Susie Stephens Principal Product Manager, Life Sciences Oracle
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
W3C Semantic Web for Health Care and Life Sciences Interest Group
Harnessing the Semantic Web to Answer Scientific Questions:
The Semantic Web By: Maulik Parikh.
Biological Databases By: Komal Arora.
HCLS Scientific Discourse C-SHALS 2009
Harnessing the Semantic Web to Answer Scientific Questions:
Middleware independent Information Service
Knowledge Management Systems
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
RDF For Semantic Web Dhaval Patel 2nd Year Student School of IT
Cataloging the Internet
What is “Biomedical Informatics”?
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Service-enabling Biomedical Research Enterprise
BioRDF Task Force.
Harnessing the Semantic Web to Answer Scientific Questions:
Presentation transcript:

Advancing translational research with the Semantic Web Ruttenberg, Clark, Bug, Samwald, Bodenreider, Chen, Doherty, Forsberg, Gao, Kashyap, Kinoshita, Luciano, Marshall, Ogbuji, Rees, Stephens, Wong, Wu, Zaccagnini, Hongsermeier, Neumann, Herman, Cheung

Author Background Many authors with diverse backgrounds: Senior scientist at pharmaceutical company (Ruttenberg) Director of Bioinformatics at Harvard Univ. (Clark) Neuroscientist and informaticist at Drexel Univ. (Bug) Developer for Science Commons at MIT (Samwald) Scientist at National Library of Medicine (Bodenreider) Principal investigator at healthcare company (Chen) Developer of ‘Semantic Wb’ iPhone app (Doherty) Informatics scientist at pharmaceutical company (Forsberg) Clinical instructor in neurology (Gao) Senior Medical Informatician at healthcare company (Kashyap) Alzheimer researcher (Kinoshita)

Author Background (Cont’d) Lecturer on Genetics at Harvard Univ. (Luciano) Co-chair of HCLSIG (we’ll get to that later) (Marshall) Computer engineer at Case Western Univ. (Ogbuji) Lead software engineer at pharmaceutical company (Rees) Director of Bioinformatics at Johnson & Johnson (Stephens) Alzheimer researcher (Wong) Alzheimer researcher (Wu) VP of healthcare technology company (Zaccagnini) Principle Informatician at healthcare company (Hongsermeier) Senior Director of scientific data management co. (Neumann) Semantic Web Activity Lead at W3C (Herman) Associate Prof. of Computer Science & Genetics (Cheung)

Definitions Translational Research – the movement of discoveries in basic research (the Bench) to application at the clinical level (the Bedside) Evidence-based medicine → solutions for public health problems Information ecosystem – scientific literature, experimental data, summaries of knowledge of gene products, diseases, and compounds, and informal scientific discourse and commentary in a variety of forums The Semantic Web – an extension of the current Web that enables navigation and meaningful use of digital resources by automatic processes Based on common formats supporting aggregation and integration of data from diverse sources

Problems with the existing Information Ecosystem Lack of uniformly structured data Often not interoperable; data can’t be used in combination with data in other formats “Data silos” concept – disconnected databases not used in conjunction with one another Data synthesis is necessary to cure and prevent diseases Example: Parkinson’s Disease, Huntington’s Disease, and Alzheimer’s Disease share clinical, neural, cellular, and molecular traits—but specialists in each disease are often unaware of important literature on the other diseases which could be related

Criteria for Solutions Web-based Queries across experimental data from different communities of interest Practicality Linking and discovery of data interpretations need to be supported Tied to the scientific publication process and culture Tied to both current and evolving processes

The Semantic Web Triples – consist of subject, predicate, and object Subject: human TP53 gene Page describing the gene Predicate: hasGeneProduct Page describing the relationship between them with a human- readable definition Object: human TP53 MRNA Page describing the object Uniform Resource Identifier (URI) – string of characters used to identify a name or a resource Web URLs (Universal Resource Locators) are URIs Networked computers are able to follow these, locate identical (related) data references, and integrate knowledge

Semantic Web Technologies Uniform naming of elements of discourse by URIs Resource Description Framework (RDF) – gives formal specification of the syntax and semantics of statements (triples) Shared standards and technologies around these methods of organization SPARQL – standard query language for retrieving RDF-format answers Gleaning Resource Descriptions from Dialects of Language (GRDDL) – tool used to turn XML data into RDF Shared practices in using these standards and technologies

Applications to Biomedical Research Faster movement of innovation from research laboratory to clinic or hospital Utilizes the World Wide Web, the most successful information dissemination and sharing apparatus in existence URIs are global in scope – less effort will be spent developing services to map gene identifiers to information about the gene Common schema languages simplify management and comprehension of complicated and rapidly-evolving relationships among data The subject, predicate, object structure itself can guide users towards correct use of the data Flexible, extendable, decentralized databases allow room for future expansion of scientific knowledge Ability to extend others’ work and build upon their data Ability to check for consistency, classify, and inference Less errors in diagnosis/treatment (clinical) and data analysis (research)

Health Care and Life Sciences Interest Group (HCLSIG)

How did you conceptualize the medical examples? How does HCLSIG hope to address informaticians’ potential lack of specialized medical knowledge?

How did you conceptualize the database syntax? Again, I’m no expert—does anyone care to elaborate about the parts of (SQL) syntax such as: Will HCLSIG be able to utilize the resources of non-informatics medical practitioners?