An integrative approach to drug repositioning: a use case for semantic web technologies Paul Rigor Institute for Genomics and Bioinformatics Donald Bren.

Slides:



Advertisements
Similar presentations
Numedicus (est. 2008) provides collaborative & consultative services to companies seeking expert advice in the area novel uses for existing drugswww.numedicus.co.uk.
Advertisements

RDB2RDF: Incorporating Domain Semantics in Structured Data Satya S. Sahoo Kno.e.sis CenterKno.e.sis Center, Computer Science and Engineering Department,
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
©2013 MFMER | slide-1 Building A Knowledge Base of Severe Adverse Drug Events Based On AERS Reporting Data Using Semantic Web Technologies Guoqian Jiang,
From Chemoinformatics to Systems Chemical Biology Irene Kouskoumvekaki, Associate Professor, Computational Chemical Biology, CBS, DTU-Systems Biology #27803,
Jennifer A. Dunne Santa Fe Institute Pacific Ecoinformatics & Computational Ecology Lab Rich William, Neo Martinez, et al. Challenges.
1 Enriching UK PubMed Central SPIDER launch meeting, Wolfson College, Oxford Paul Davey, UK PubMed Central Engagement Manager.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Strategies for the Lean Years Ruth Farrell, Associate VP for Research Administration.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Workshop in Bioinformatics 2010 What is it ? The goals of the class… How we do it… What’s in the class Why should I take the class..
Social Pharmacy and Pharmacoepidemiology Lister Hill National Center for Biomedical Communications Text-based Discovery in Biomedicine The Architecture.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Molecular Library and Imaging Francis Collins, NHGRI Tom Insel, NIMH Rod Pettigrew, NIBIB Building Blocks and Pathways Francis Collins,NHGRI Richard Hodes,
The NIH Roadmap for Medical Research
NLM-Semantic Medline Data Science Data Publication Commons Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Formal Empirical Applied Mathematical and technical methods and theories Cognitive, behavioral, and organizational techniques and theories ImagingBioInformaticsClinical.
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Genome-scale Metabolic Reconstruction and Modeling of Microbial Life Aaron Best, Biology Matthew DeJongh, Computer Science Nathan Tintle, Mathematics Hope.
Paul Groth VU University Amsterdam Convergence Meeting: Semantic Interoperability for Clinical Research & Patient.
Information Systems Basic Core Specialization Clinical Imaging BioInformatics Public Health Computer Science Methods (formal models) Biomedical Decision.
Chapter 13. The Impact of Genomics on Antimicrobial Drug Discovery and Toxicology CBBL - Young-sik Sohn-
Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Experiences in visualizing and navigating biomedical.
Indiana Institute For Personalized Medicine David A Flockhart MD, PhD Professor of Medicine, Genetics and Pharmacology Indiana University.
Integrated Biomedical Information for Better Health Workprogramme Call 4 IST Conference- Networking Session.
Biomedical Databases & Tools Rolando Garcia-Milian Biomedical & Health Information Services Department Health Sciences Center Library.
Discovery from Linking Open Data (LOD) Annotated Datasets Louiqa Raschid University of Maryland PAnG/PSL/ANAPSID/Manjal.
NT CVD : Development and Validation of new Diagnostic, Preventive and Therapeutic Tools for the Prevention of Cardiovascular Disease in Chronic Kidney.
NIH Council of Councils Meeting November 21, 2008 LINCS Library of Integrated Network-based Cellular Signatures.
Agent-based methods for translational cancer multilevel modelling Sylvia Nagl PhD Cancer Systems Science & Biomedical Informatics UCL Cancer Institute.
PyMine – A PyMol Plugin to Integrate and Visualize Chemical and Biological Data for Drug Discovery Zhijun Li, Ph.D. Departments of Chemistry & Biochemistry.
Page 1 SCAI Dr. Marc Zimmermann Department of Bioinformatics Fraunhofer Institute for Algorithms and Scientific Computing (SCAI) Grid-enabled drug discovery.
1 How Informatics Can Drive Your Research Barry Smith
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Bringing Genomics Home Your DNA: A Blueprint for Better Health Dr. Brad Popovich Chief Scientific Officer Genome British Columbia March 24, 2015 / Vancouver,
TEACHING PHARMACOLOGY IN BRASIL Roberto Soares de Moura Department of Pharmacology State University of Rio de Janeiro - Brasil.
ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH.
Enabling complex queries to drug information sources through functional composition Olivier Bodenreider Lister Hill National Center for Biomedical Communications.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,
EB3233 Bioinformatics Introduction to Bioinformatics.
Pathway: a collection of genes, proteins, and /or small molecules that modulate a cellular process or disease state Growing demand in biological sciences.
BBN Technologies Copyright 2009 Slide 1 The S*QL Plugin for Cytoscape Visual Analytics on the Web of Linked Data Rusty (Robert J.) Bobrow Jeff Berliner,
ECCR Overview/MLSCN. NIH Roadmap Series of initiatives designed to pursue major opportunities in biomedical research and gaps in current knowledge that.
Why Write A Grant? Elaine M. Hylek, MD, MPH Professor of Medicine Associate Director, Education and Training Division BU CTSI Section of General Internal.
Joe Pekny, Professor Chemical Engineering Director, e-Enterprise Center Discovery Park Marietta Harrison, Professor Medicinal Chemistry & Molecular Pharmacology.
NIH and the Clinical Research Enterprise Third Annual Medical Research Summit March 6, 2003 Mary S. McCabe National Institute of Health.
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
TMO Review Jin Guang Zheng, Tetherless World Constellation.
Clinical research data interoperbility Shared names meeting, Boston, Bosse Andersson (AstraZeneca R&D Lund) Kerstin Forsberg (AstraZeneca R&D.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Clinical Research Informatics [CRI]. Informatics, defined generally as the intersection of information and computer science with a health-related discipline,
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
Improving compound–protein interaction prediction by building up highly credible negative samples Toward more realistic drug-target interaction predictions.
Understanding Epidemiology Introduction to Epidemiology and Epidemiological Concepts.
Computational Biology Signaling networks and drug repositioning Lars Juhl Jensen.
TDM in the Life Sciences Application to Drug Repositioning *
Constructing a Predictor to Identify Drug and Adverse Event Pairs
The UMLS and the Semantic Web
Scientific Reproducibility using the Provenance for Healthcare and Clinical Research Framework Satya S. Sahoo Collaborators/Co-Authors: Joshua Valdez,
What is Biomedical Research?
DReNIn_O “A high-level ontology for drug repositioning” Joseph Mullen
Sponsored by the University of Southampton
A. Dix, S. Vlaic, R. Guthke, J. Linde 
Collaborative RO1 with NCBO
12th – 13th November GRANADA
Presentation transcript:

An integrative approach to drug repositioning: a use case for semantic web technologies Paul Rigor Institute for Genomics and Bioinformatics Donald Bren School for Information and Computer Science University of California in Irvine Mentor: Dr. Olivier Bodenreider Lister Hill National Center for Biomedical Communications National Library of Medicine National Institutes of Health

Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH A little history

Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH A little history

Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH The tragic history of thalidomide

 1964 – Serendipitous discovery as a novel treatment for symptoms of leprosy Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Paving a new history

Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Repositioning: novel applications of Thalidomide

 Discovering novel targets for existing drugs  Big pharma finds this approach economically attractive compared to de novo drug discovery  Three R’s: Reprofiling, repurposing, and repositioning What is drug repositioning? Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH

Drug development pipelines Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Ashburn and Thor 2004

 Drug repositioning has paved the way for rational drug design  Instead of phenotypic assays (trial-and-error), we are delving deeper into biological processes (and components) that underlie disease Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Rational drug design

 (1) Side effects (or adverse events; Campillos, et. al. 2008)  Clinical  in silico  (2) Ligand-based (chemical similarity)  (3) Structure-based (simulations)  (4) Network-based (network pharmacology; Keiser, et.al. 2009)  Polypharmacology  Target promiscuity  Genomics (and next generation sequencing)  (5) Serendipitous (clinical/experimental)! Drug repositioning methods Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH

 Campillos, et. al., 2008  Examples:  Donepezil (original indications to treat dementia in Alzheimer’s Disease)  Has similar side effects as Venlafaxine (an anti- depressant, 5-HTT)  Predicted and experimentally validated Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Drug Repositioning: Side-effect

 Our proof of concept hinges on the similarity of side- effects among drugs (Campillos, et. al. 2008)  We focus on existing FDA-approved drugs which are annotated in DrugBank  We also focus on the database of side effects called SIDER Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH What is our strategy?

 In some instances, the molecular targets of drugs are known and annotated  However, not all FDA-approved drugs have known and experimentally validated targets  How do we infer targets of drugs with no known targets? Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH What is our hypothesis?

Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Hypothesis (graphical) Shared Side effects Shared Side effects Drug1 Drug2 Target1 ??

 1) The ‘unknown’ target of one drug can be inferred from the ‘known’ target of another drug by the similarity between their side effects  2) We can also infer additional targets for drugs with known targets. Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Hypotheses

Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH What information do we need? SIDER database DrugBank database Diseasome Uniprot

What islands of knowledge are we navigating? Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH

 We have deployed SPARQL databases in-house which include  1) DrugBank  2) Side effect database Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Which data?

 Technical:  Linked Open Drug Data  Virtuoso Instances with SIDER and Drug Bank  Integration:  SIDER for drug side-effects  DrugBank for drug targets  Data Analysis:  Calculate pair-wise drug-drug Jaccard similarity measure on shared side-effects Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Method

Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Preliminary results: side effect similarity Side-effects ASide-effects B Nausea Vomitting Death 0.3

Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Preliminary results: significance of side effect similarity? At t= 0.5, p-value=5.797e-209

Name1Name2Side effect similarity score CARBINOXAMINEDexchlorpheniramine Maleate CARBINOXAMINEDiphenhydramine CARBINOXAMINEClemastine Dexchlorpheniramine MaleateDiphenhydramine Chlorthalidonehydroflumethiazide ClemastineDiphenhydramine PENTOBARBITALsecobarbital Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Preliminary results: validate high similarity score with shared targets

Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Can we now address hypothesis #1? Shared Side effects Shared Side effects Drug1 Drug2 Target1 ??

Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Can we now address hypothesis #1: Yes? pair-iddrug1drug2TSSS 1 berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/ berlin.de/sider/resource/drugs/

 Bendroflumethiazide:  For the treatment of high blood pressure and management of edema  Chlorthalidone:  For management of hypertension either as the sole therapeutic agent or to enhance the effect of other antihypertensive drugs in the more severe forms of hypertension Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Let’s take the first pair:

 Validated side effect similarity as an indicator for drugs sharing similar targets  We have verified the utility of drug data from the semantic web Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Initial conclusion

 Time: 6 weeks, new world  Technological hurdles  The state of the technologies underlying the semantic web is still evolving (many options and resources)  The state of the datasets themselves are in flux  Provenance  Trust  Currency Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Discussion: what are the challenges?

 Integrate database of regulatory motifs (MotifMap) for network-based pharmacology  Explore several semantic web technologies  Ontologies to use and/or extend (BioRDF/LODD)  Endpoint for linked data  Back-end storage Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Moving forward: future work

 Gained new perspective on drug discovery  Exciting new technologies!  Exposed to international community of data integrators in the health care and life sciences  Discussions with W3C HCLS/LODD  Exposed to resources at the NLM!  UMLS/NDFRT Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Experience

 NLM Mentor: Dr. Olivier Bodenreider  Ph.D. Advisor: Dr. Pierre Baldi  Jonathan Mortensen  May Cheh, Summer Rotation Program  My colleagues!  Lister Hill National Center for Biomedical Communications  NLM’s BIT Program  ORISE and NIH Paul Rigor - Lister Hill National Center for Biomedial Communications - National Library of Medicine - NIH Acknowledgements