Semantics for eScience Susie Stephens, Principal Research Scientist, Eli Lilly.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

The Integration of Biological Data Using Semantic Web Technologies Susie Stephens Principal Product Manager, Life Sciences Oracle
Convergence Workshop, March 2013 The goals and expected outputs of the convergence initiative Dipak Kalra EuroRec.
Improving the sharing of NICE content via syndication: what the future could hold Andrew Fenton CIO NICE 20 March 2014.
“Service Framework” workgroup
Scientific RDF Databases Michael Mertens K.U.Leuven.
The Open Innovation Center Susie Stephens, Principal Research Scientist, Eli Lilly.
Coordinating data interoperability – a W3C perspective M. Scott Marshall, Ph.D. W3C HCLS IG co-chair Leiden University Medical Center University of Amsterdam.
1 Publishing Linked Sensor Data Semantic Sensor Networks Workshop 2010 In conjunction with the 9th International Semantic Web Conference (ISWC 2010), 7-11.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Data Intensive Techniques to Boost the Real-time Performance of Global Agricultural Data Infrastructures SEMAGROW U SING A POWDER T RIPLE S TORE FOR BOOSTING.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
A Secure Interoperable Infrastructure For Healthcare Information System Ehsan ul Haq Abrar Ahmed Sair
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
@Interontology08, February 27, 2008 The Semantic Web for Scientific Research: A ‘perfect storm’ for the development of Ontology Alan Ruttenberg Principal.
1 Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies.
Networking Session: Global Information Structures for Science & Cultural Heritage - The Interoperability Challenge «INTEROPERABILITY FROM THE CULTURAL.
Managing & Integrating Enterprise Data with Semantic Technologies Susie Stephens Principal Product Manager, Oracle
Linked Open Data: a new resource for eResearch Dr Anne Cregan eResearch Analyst, Intersect and ANDS
Linked TCM and Drug Datasets Background  Traditional Chinese Medicine (TCM), which is a type of alternative medicine, is receiving growing attention from.
Introduction to Pharmacoinformatics
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Information Management for the Life Sciences M. Scott Marshall Marco Roos Adaptive Information Disclosure University of Amsterdam.
Advancing translational research with the Semantic Web Ruttenberg, Clark, Bug, Samwald, Bodenreider, Chen, Doherty, Forsberg, Gao, Kashyap, Kinoshita,
Applying the Semantic Web at UCHSC - Center for Computational Pharmacology Ian Wilson.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Overview of Discovery & Development Informatics at Lilly Rick Bishop, Manager, DDIT Phil Brooks, Information Consultant, DDIT Hans Constandt, Senior Business.
Teranode Tools and Platform for Pathway Analysis Michael Kellen, Solution Manager June 16, 2006.
Phase II Additions to LSG Search capability to Gene Browser –Though GUI in Gene Browser BLAST plugin that invokes remote EBI BLAST service Working set.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Semantic Web, Web Services and Museums: Mapping the Road to Implementation John Perkins “MESMUSES Workshop” Florence, June 16-17, 2003.
ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
BBN Technologies Copyright 2009 Slide 1 The S*QL Plugin for Cytoscape Visual Analytics on the Web of Linked Data Rusty (Robert J.) Bobrow Jeff Berliner,
12/7/2015Page 1 Service-enabling Biomedical Research Enterprise Chapter 5 B. Ramamurthy.
Data Management Support for Life Sciences or What can we do for the Life Sciences? Mourad Ouzzani
The Web-Enabled Research Commons: Applications, Goals, and Trends Thinh Nguyen October 2009.
Semantic Web Portal: A Platform for Better Browsing and Visualizing Semantic Data Ying Ding et al. Jin Guang Zheng, Tetherless World Constellation.
TMO Review Jin Guang Zheng, Tetherless World Constellation.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
Clinical research data interoperbility Shared names meeting, Boston, Bosse Andersson (AstraZeneca R&D Lund) Kerstin Forsberg (AstraZeneca R&D.
Alan Ruttenberg School of Dental Medicine Applications Alan Ruttenberg Oral Diagnostic Sciences Clinical and Translational Data Exchange.
Prizms for Data Publication and Management Katie Chastain May 9, 2014.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
E-SI Theme: Exploiting Diverse Sources of Scientific Data Re-use or Re-invention - a Roadmap for Data Integration 27 th -28th November 2006 Prof. Jessie.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
W3C Semantic Web for Health Care and Life Sciences Interest Group
Linking Open Drug Data (HCLSIG LODD)
W3C Semantic Web for Health Care and Life Sciences Interest Group
Harnessing the Semantic Web to Answer Scientific Questions:
HCLS Scientific Discourse C-SHALS 2009
Harnessing the Semantic Web to Answer Scientific Questions:
W3C Semantic Web for Health Care and Life Sciences Interest Group
Overview Linked Data Principals Linking Open Drug Data.
Sponsored by the University of Southampton
BioRDF Task: Building a Knowledgebase for Neuroscience
HCLS Scientific Discourse Progress Report
WikiNeuron: Semantic Neuro-Mashup
Linking Open Drug Data (HCLSIG LODD)
HCLS Tutorial: The W3C Health Care and Life Sciences Interest Group
An ecosystem of contributions
Geospatial and Problem Specific Semantics Danielle Forsyth, CEO and Co-Founder Thetus Corporation 20 June, 2006.
The Linked Data Cloud Source: Chris Bizer. Linking Open Drug Data Susie Stephens, Principal Research Scientist, Eli Lilly.
W3C Semantic Web for Health Care and Life Sciences Interest Group
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Service-enabling Biomedical Research Enterprise
BioRDF Task Force.
Linking Open Drug Data (HCLSIG LODD)
Harnessing the Semantic Web to Answer Scientific Questions:
Presentation transcript:

Semantics for eScience Susie Stephens, Principal Research Scientist, Eli Lilly

Outline Introduction to the Semantic Web W3Cs Semantic Web for Health Care and Life Sciences Interest Group Semantic Web Solutions at Lilly

Introduction to the Semantic Web

Drivers for the Semantic Web Business models develop rapidly these days, so infrastructure that supports change is needed Organizations are increasingly forming and disbanding collaborations so need to be able to better share data Increasing need in pharma to be able to query across data silos Data is growing so quickly that it is no longer possible for individuals to identify patterns in their heads Increasing recognition of the benefits of collective intelligence

Characterizing the Semantic Web Semantic Web is an interoperability technology An architecture for interconnected communities and vocabularies A set of interoperable standards for knowledge exchange

Creating a Web of Data Source: Ivan Herman Graph representation Data in various formats Applications

Mashing Data Source: W3C

W3Cs Semantic Web for Health Care and Life Sciences Interest Group

Task Forces Terminology – Semantic Web representation of existing resources Task lead - John Madden Scientific Discourse – building communities through networking Task leads - Tim Clark, John Breslin Clinical Observations Interoperability – patient recruitment in trials Task lead - Vipul Kashyap BioRDF – integrated neuroscience knowledge base Task lead - Kei Cheung Linking Open Drug Data – aggregation of Web-based drug data Task lead - Chris Bizer Other Projects: Clinical Decision Support, URI Workshop, Collaborations with CDISC & HL7

BioRDF: Integrating Heterogeneous Data Integration and analysis of heterogeneous data sets Hypothesis, Genome, Pathways, Molecular Properties, Disease, etc. NeuronDB BAMS NC Annotations Homologene SWAN Entrez Gene Gene Ontology Mammalian Phenotype PDSPki BrainPharm AlzGene Antibodies PubChem MESH Reactome Allen Brain Atlas Publications

BioRDF: Looking for Targets for Alzheimers Signal transduction pathways are considered to be rich in druggable targets CA1 Pyramidal Neurons are known to be particularly damaged in Alzheimers disease Casting a wide net, can we find candidate genes known to be involved in signal transduction and active in Pyramidal Neurons? Source: Alan Ruttenberg

BioRDF: SPARQL Query Source: Alan Ruttenberg

BioRDF: Results: Genes, Processes DRD1, 1812adenylate cyclase activation ADRB2, 154adenylate cyclase activation ADRB2, 154arrestin mediated desensitization of G-protein coupled receptor protein signaling pathway DRD1IP, 50632dopamine receptor signaling pathway DRD1, 1812dopamine receptor, adenylate cyclase activating pathway DRD2, 1813dopamine receptor, adenylate cyclase inhibiting pathway GRM7, 2917G-protein coupled receptor protein signaling pathway GNG3, 2785G-protein coupled receptor protein signaling pathway GNG12, 55970G-protein coupled receptor protein signaling pathway DRD2, 1813G-protein coupled receptor protein signaling pathway ADRB2, 154G-protein coupled receptor protein signaling pathway CALM3, 808G-protein coupled receptor protein signaling pathway HTR2A, 3356G-protein coupled receptor protein signaling pathway DRD1, 1812G-protein signaling, coupled to cyclic nucleotide second messenger SSTR5, 6755G-protein signaling, coupled to cyclic nucleotide second messenger MTNR1A, 4543G-protein signaling, coupled to cyclic nucleotide second messenger CNR2, 1269G-protein signaling, coupled to cyclic nucleotide second messenger HTR6, 3362G-protein signaling, coupled to cyclic nucleotide second messenger GRIK2, 2898glutamate signaling pathway GRIN1, 2902glutamate signaling pathway GRIN2A, 2903glutamate signaling pathway GRIN2B, 2904glutamate signaling pathway ADAM10, 102integrin-mediated signaling pathway GRM7, 2917negative regulation of adenylate cyclase activity LRP1, 4035negative regulation of Wnt receptor signaling pathway ADAM10, 102Notch receptor processing ASCL1, 429Notch signaling pathway HTR2A, 3356serotonin receptor signaling pathway ADRB2, 154transmembrane receptor protein tyrosine kinase activation (dimerization) PTPRG, 5793ransmembrane receptor protein tyrosine kinase signaling pathway EPHA4, 2043transmembrane receptor protein tyrosine kinase signaling pathway NRTN, 4902transmembrane receptor protein tyrosine kinase signaling pathway CTNND1, 1500Wnt receptor signaling pathway Many of the genes are related to AD through gamma secretase (presenilin) activity Source: Alan Ruttenberg

LODD: Introduction B C Thing typed links A D E Thing Search Engines Linked Data Mashups Linked Data Browsers Use Semantic Web technologies to 1. publish structured data on the Web 2. set links between data from one data source to data within other data sources Source: Chris Bizer

LODD: Potential Links between Data Sets Source: Chris Bizer

LODD: Potential questions to answer Physicians and Pharmacists What are alternative drugs for a given indication (disease)? What are equivalent drugs (generic version of a brand name, or the chemical name of a active ingredient)? Are there ongoing clinical trials for a drug? Patients What background information is available about a drug? What are the contraindications of a drug? Which alternative drugs are available? What are the results of clinical trials for a drug? Pharmaceutical Companies What are other companies with drugs in similar areas? Which companies have a similar therapeutic focus? Source: Chris Bizer

LODD: Linked Version of ClinicalTrials.gov Total number of triples: 6,998,851 Number of Trials: 61,920 RDF links to other data sources: 177,975 Links to: DBpedia and YAGO (from intervention and conditions) GeoNames (from locations) Bio2RDF.org's PubMed (from references) Source: Chris Bizer

Semantic Web Solutions at Lilly

Implementations at Lilly Integration of Clinical and Pathways Data Competitive Intelligence Experimental Metadata Discovery Metadata

Discovery Metadata: Goals Integrate master data throughout the discovery process to enable information sharing/integration for the scientific community Model key relationships between master data classes Provide ability to integrate disparate data sets quicker than the normal warehouse paradigm typically allows Create a re-usable and sustainable semantic implementation Allow for user-driven, manual curation of key data relationships Source: Phil Brooks

Discovery Metadata: Ontology Source: Phil Brooks

Discovery Metadata: Architecture Application 1Application 2Application 3 … SOA Layer/Enterprise Service Bus (WebServices, Visualizers, DataAccess Components ) Authentication SOASOA DATADATA APPSAPPS SQLSPARQL Source Model 1 Source Model 2 Source Model 3 Source Model 4 Local Assertions Top Level Ontology Provenance Other Sources Other Sources Source … ETL Other Tools Spreadsheets Rdbms Source: Phil Brooks

External Collaborations RDF Access to Relational Databases - Chris Bizer, Eric Prud'hommeaux Scalability testing of relational to RDF mapping approaches End User Semantic Web Authoring - David Karger Enhancing the scalability and robustness of the Exhibit and Potluck tools Scientist-Driven Semantic Integration of Knowledge in Alzheimer's Disease - Tim Clark, June Kinoshita Project to develop an integrated knowledge infrastructure for the neuromedical research community, pairing rich digital semantic context with the ever-growing digital scientific content on the web Provenance Collection and Management - Carole Goble, Beth Plale Project to develop a metadata taxonomy for global data at Lilly which enables the rapid integration of data and mining/analysis algorithms into dataflows which support clinical and discovery decisions W3Cs Health Care and Life Sciences Interest Group

Conclusion Many Semantic Web solutions are being explored within the health care and life sciences community Lilly is seeing tangible benefits in multiple projects from Semantic Web Semantic Web provides a flexible framework for data integration Incremental adoption of technology Flexibility to integrate unanticipated data sets Link existing silos together Lilly is setting up open collaborations in this space Try out LSG