From biomedical informatics to translational research

Slides:



Advertisements
Similar presentations
“It is the responsibility of those of us involved in today’s biomedical research enterprise to translate the remarkable scientific innovations we are witnessing.
Advertisements

Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Wrapup. NHGRI strategic plan What does the NIH think genomics should be for the next 10 years? [Nature, Feb. 2011]
©2013 MFMER | slide-1 Building A Knowledge Base of Severe Adverse Drug Events Based On AERS Reporting Data Using Semantic Web Technologies Guoqian Jiang,
Transformed (transforming) Health Care System Connie White Delaney, PhD, RN, FAAN, FACMI School of Nursing Professor & Dean Academic Health Center Director,
Global Alignment and Collaboration Jo
1 Knowledge Management for Disease Coding (KMDC): Background & Introduction Timothy Hays, Ph.D. Project Manager, Knowledge Management for Disease Coding.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Overview of Biomedical Informatics Rakesh Nagarajan.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Future Trends: Translational Informatics James J. Cimino Chief, Laboratory for Informatics Development Mark O. Hatfield Clinical Research Center National.
What is “Biomedical Informatics”?. Biomedical Informatics Biomedical informatics (BMI) is the interdisciplinary field that studies and pursues.
National Institute on Aging Richard J. Hodes, M.D. Director,NIA/NIH/DHHS ADC Meeting – NIH Roadmap and Budget October 2003.
Measurable Interoperability for Archival Data Lewis J. Frey, PhD
Clinical and Translational Science Award (CTSA) John F. Ennever Medical Director, CTO Director, RKSR.
RESEARCH TEAMS OF THE FUTURE Working Groups and Co-Chairs  Interdisciplinary Research Patricia Grady, NINR Ken Olden, NIEHS Larry Tabak, NIDCR  High-risk.
The NIH Roadmap for Medical Research
Introduction The goal of translational bioinformatics is to enable the transformation of increasingly voluminous genomic and biological data into diagnostics.
Medical Informatics Basics
Srihari-CSE730-Spring 2003 CSE 730 Information Retrieval of Biomedical Text and Data Inroduction.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
1 Betsy L. Humphreys, MLS Betsy L. Humphreys, MLS National Library of Medicine National Library of Medicine National Institutes of Health National Institutes.
Johns Hopkins Technology Transfer 1 Translational Biomedical Research: Moving Discovery from Academic Centers to the Community Translational Biomedical.
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.
Linking Diseases and Genes through Informatics Knowledge Bases and Ontologies Joyce A. Mitchell, Ph.D. National Library of Medicine University of Missouri.
Bioinformatics and medicine: Are we meeting the challenge?
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Data Analysis Summary. Elephant in the room General Comments General understanding that informatics is integral in medical sequencing and other –omics.
Integrated Biomedical Information for Better Health Workprogramme Call 4 IST Conference- Networking Session.
Survey of Medical Informatics CS 493 – Fall 2004 September 27, 2004.
From Bench to Bedside: Applications to Drug Discovery and Development Eric Neumann W3C HCLSIG co-chair Teranode Corporation HCLSIG F2F Cambridge MA.
Ontologies and data integration in biomedicine Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Kno.e.sis.
8 October 2009Microbial Research Commons1 Toward a biomedical research commons: A view from NLM-NIH Jerry Sheehan Assistant Director for Policy Development.
Agent-based methods for translational cancer multilevel modelling Sylvia Nagl PhD Cancer Systems Science & Biomedical Informatics UCL Cancer Institute.
Facilitate Scientific Data Sharing by Sharing Informatics Tools and Standards Belinda Seto and James Luo National Institute of Biomedical Imaging and Bioengineering.
Harbin Institute of Technology Computer Science and Bioinformatics Wang Yadong Second US-China Computer Science Leadership Summit.
ADVANCED DB SYSTEMS BIOMEDICAL ENGINEERING. Index INTRODUCTION  BIOMEDICAL ENGINEERING  B.E. DATASETS APPLICATIONS  DATA MINING ON FDA DATABASE  ONTOLOGY-BASED.
NIH ROADMAP FOR MEDICAL RESEARCH RESEARCH TEAMS OF THE FUTURE.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
National Center for Research Resources G. Iris Obrams, M.D., M.P.H., Ph.D. NCRR Update 5 August 2006.
Enabling complex queries to drug information sources through functional composition Olivier Bodenreider Lister Hill National Center for Biomedical Communications.
Bioinformatics and Computational Biology
University of Kentucky Center for Clinical and Translational Science (CCTS) November 2015 Stephen W. Wyatt, DMD, MPH Senior Associate Director Center for.
D4FF55A0-6B6F BF422A9BA9 Present by: Xiao Chen On December 7, 2015.
NIH and the Clinical Research Enterprise Third Annual Medical Research Summit March 6, 2003 Mary S. McCabe National Institute of Health.
Kathleen Giacomini, Mark J. Ratain, Michiaki Kubo, Naoyuki Kamatani, and Yusuke Nakamura NIH Pharmacogenomics Research Network III & RIKEN Center for Genomic.
HeC PMT Meeting Tallinn 10 July 2008 Brainstorming session: what interest can there be in thinking about a possible new HeC 2 project to follow-up the.
Oncologic Pathology in Biomedical Terminologies Challenges for Data Integration Olivier Bodenreider National Library of Medicine Bethesda, Maryland -
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
1 LS DAM Overview August 7, 2012 Current Core Team: Ian Fore, D.Phil., NCI CBIIT, Robert Freimuth, Ph.D., Mayo Clinic, Mervi Heiskanen, NCI-CBIIT, Joyce.
Biomedical Informatics and Health. What is “Biomedical Informatics”?
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
TDM in the Life Sciences Application to Drug Repositioning *
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
The UMLS and the Semantic Web
National Healthcare Science Week 2017
Applications of the Interspace Analysis for Community Repositories
KnowEnG: A SCALABLE KNOWLEDGE ENGINE FOR LARGE SCALE GENOMIC DATA
Making “Open Data” Work: Challenges for Data Integration in Genomics Research
Sponsored by the University of Southampton
Functional Annotation of the Horse Genome
What is “Biomedical Informatics”?
Clinical and Translational Science Awards Program
Trial Funding and Engagement: The NIH Sponsored CTSA Program
Presentation transcript:

From biomedical informatics to translational research Kno.e.sis Wright State University, Dayton, Ohio May 27, 2009 From biomedical informatics to translational research Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA

Outline Translational research Enabling translational research Anatomy of a translational research experiment Promising results Challenging issues

Translational research (Translational medicine)

Translational medicine/research Definition Effective transformation of information gained from biomedical research into knowledge that can improve the state of human health and disease Goals Turn basic discoveries into clinical applications more rapidly (“bench to bedside”) Provide clinical feedback to basic researchers [Butte, JAMIA 2008]

Combining clinical informatics and bioinformatics Associates Clinical informatics Electronic medical records Clinical knowledge bases Bioinformatics Sequence databases Gene expression Model organism databases Common computational resources Biomedical natural language processing Biomedical knowledge engineering

Translational bioinformatics “… the development of storage, analytic, and interpretive methods to optimize the transformation of increasingly voluminous biomedical data into proactive, predictive, preventative, and participatory health. Translational bioinformatics includes research on the development of novel techniques for the integration of biological and clinical data and the evolution of clinical informatics methodology to encompass biological observations. The end product of translational bioinformatics is newly found knowledge from these integrative efforts that can be disseminated to a variety of stakeholders, including biomedical scientists, clinicians, and patients.” AMIA strategic plan http://www.amia.org/inside/stratplan

Aspects of translational research Huge volumes of data Publicly available repositories Publicly available tools Data-driven research

Huge volumes of data Affordable, high-throughput technologies DNA sequencing Whole genomes Multiple genomes Single nucleotide polymorphism (SNPs) genotyping Millions of allelic variants between individuals Gene expression data from micro-array experiments Text mining Full-text articles Whole MEDLINE Electronic medical records Genome-wide association studies

Publicly available repositories DNA sequences GenBank / EMBL / DDBJ Gene Expression data GEO, ArrayExpress Biomedical literature MEDLINE, PubMedCentral Biomedical knowledge OBO ontologies Clinical data (genotype and phenotype) dbGaP

Publicly available tools DNA sequences BLAST Gene Expression data GenePattern, … Biomedical literature Entrez, MetaMap Biomedical knowledge Protégé Culture of sharing encouraged by the funding agencies Grants for tools and resource development Mandatory sharing plan in large NIH grants Mandatory sharing of manuscripts in PMC for NIH-funded research

Data-driven research Paradigm shift Hypothesis-driven Data-driven Start from hypothesis Run a specific experiment Collect and analyze data Validate hypothesis (or not) Data-driven Integrate large amounts of data Identify patterns Generate hypothesis Validate hypothesis (or not) through specific experiments Biomedical informatics as a supporting discipline for biology and clinical medicine Biomedical informatics as a discipline in its own right, addressing important questions in medicine

Translational bioinformatics as a discipline “The availability of substantial public data enables bioinformaticians’ roles to change. Instead of just facilitating the questions of biologists, the bioinformatician, adequately prepared in both clinical science and bioinformatics, can ask new and interesting questions that could never have been asked before. […] There is a role for the translational bioinformatician as question-asker, not just as infrastructure-builder or assistant to a biologist.” [Butte, JAMIA 2008]

Enabling translational research Clinical Translational Research Awards (CTSA)

Translational research NIH Roadmap http://nihroadmap.nih.gov/

Clinical and Translational Science Awards The purpose of the CTSA Program is to assist institutions to forge a uniquely transformative, novel, and integrative academic home for Clinical and Translational Science that has the consolidated resources to: 1) captivate, advance, and nurture a cadre of well-trained multi- and inter-disciplinary investigators and research teams; 2) create an incubator for innovative research tools and information technologies; and 3) synergize multi-disciplinary and inter-disciplinary clinical and translational research and researchers to catalyze the application of new knowledge and techniques to clinical practice at the front lines of patient care. http://nihroadmap.nih.gov/

CTSA program (NCRR) 38 academic health centers in 23 states 14 centers added in 2008 60 centers upon completion Funding provided for 5 years Total annual cost: $500 M Annual funding per center: $4-23 M Depending on previous funding http://www.ncrr.nih.gov/clinical_research_resources/clinical_and_translational_science_awards/

Clinical and Translational Science Awards http://www.ctsaweb.org/

Other related programs National Centers for Biomedical Computing “networked national effort to build the computational infrastructure for biomedical computing in the nation” http://www.ncbcs.org/

Other related programs Cancer Biomedical Informatics Grid (caBIG) Key elements Bioinformatics and Biomedical Informatics Community Standards for Semantic Interoperability Grid Computing 1000 participants from 200 organizations Funding: $60 M in the first 3 years (pilot) “an information network enabling all constituencies in the cancer community – researchers, physicians, and patients – to share data and knowledge.” https://cabig.nci.nih.gov/

Translational research and data integration

Genotype and phenotype [Goh, PNAS 2007] OMIM [HPO]

Genotype and phenotype [Goh, PNAS 2007] Publicly available data OMIM 1284 disorders 1777 genes No ontology Manual classification of the diseases into 22 classes based on physiological systems Analyses supported Genes associated with the same disorders share the same functional annotations

Genes and environmental factors [Liu, BMC Bioinf. 2008] MEDLINE (MeSH index terms) Genetic Association Database

Genes and environmental factors [Liu, BMC Bioinf. 2008] Publicly available data MEDLINE 3342 environmental factors 3159 diseases Genetic Association Database 1100 genes 1034 complex diseases 863 diseases with both Genetic factors Environmental factors Analyses supported Proof-of-concept study

Integrating drugs and targets [Yildirim, Nature Biot. 2007] DrugBank ATC Gene Ontology

Genes and environmental factors [Yildirim, Nature Biot. 2007] Publicly available data DrugBank 4252 drugs 808 experimental drugs associated with at least one protein target ATC Aggregate drugs into classes Gene ontology Aggregate gene products by functional annotations OMIM Gene-disease associations … Analyses supported Industry trends Properties of drug targets in the context of cellular networks Relations between drug targets and disease-gene products

Anatomy of a translational research experiment

Integrating genomic and clinical data Genomic data Clinical data No genomic data available for most patients No precise clinical data available associated with most genomic data (GWAS excepted)

Integrating genomic and clinical data Genomic data

Integrating genomic and clinical data Genomic data Upregulated genes Diseases (extracted from text + MeSH terms)

Integrating genomic and clinical data Genomic data Clinical data Coded discharge summaries Laboratory data Upregulated genes Diseases (extracted from text + MeSH terms)

The Butte approach Methods Courtesy of David Chen, Butte Lab

The Butte approach Results Courtesy of David Chen, Butte Lab

The Butte approach Extremely rough methods No pairing between genomic and clinical data Text mining Mapping between SNOMED CT and ICD 9-CM through UMLS Reuse of ICD 9-CM codes assigned for billing purposes Extremely preliminary results Rediscovery more than discovery Extremely promising nonetheless

The Butte approach References Dudley J, Butte AJ "Enabling integrative genomic analysis of high-impact human diseases through text mining." Pac Symp Biocomput 2008; 580-91 Chen DP, Weber SC, Constantinou PS, Ferris TA, Lowe HJ, Butte AJ "Novel integration of hospital electronic medical records and gene expression measurements to identify genetic markers of maturation." Pac Symp Biocomput 2008; 243-54 Butte AJ, "Medicine. The ultimate model organism." Science 2008; 320: 5874: 325-7

Promising results

Pharmacogenomics of warfarin Narrow therapeutic range Large interindividual variations in dose requirements Polymorphism involving two genes CYP2C9 VKORC1 Genetic test available Development of models integrating variants of CYP2C9 and VKORC1 for predicting initial dose requirements (ongoing RCTs) Step towards personalized medicine

Integration of existing studies/datasets 49 experiments in the domain of obesity Rediscovery of known genes Identification of potential new genes Analysis of genes potentially associated with nicotine dependence Rediscovery of known findings Identification of networks of genes associated with type II diabetes mellitus [English, Bioinformatics 2007] [Sahoo, JBI 2008] [Liu, PLoS 2007; Rasche, MBC Gen. 2008]

Challenging issues

Challenging issues Datasets Ontologies Tools Other issues

Challenging issues Datasets Lack of annotated datasets Largely text-based (need for text mining) Limited availability of clinical data (EHRs, PHRs) Need for deidentification Heterogeneous formats Need for conversion Lack of metadata Limited discoverability, limited reuse

Challenging issues Ontologies Lack of universal identifiers for biomedical entities Need for normalization through terminology integration systems (e.g., UMLS) Lack of standard for identifiers Need for bridging across formats Lack of universal formalism Need for conversion between formalisms Limited availability of some ontologies Delay in adopting standards e.g., SNOMED CT

Challenging issues Tools Lack of semantic interoperability Difficult to combine tools/services Limited scalability of automatic reasoners Difficult to process large datasets

Other challenging issues Limited number of researchers “adequately prepared in both clinical science and bioinformatics” Need for validation of potential in silico discoveries through specific experiments Collaboration with (wet lab) biologists Must be factored in in grants

Conclusions

Conclusions Translational medicine is an emerging discipline We live in partially unchartered territory Biomedical informatics is at the core of translational medicine Strong informatics component to translational medicine We live in exciting times New possibilities for biomedical informaticians From service providers… …to biomedical researchers

Medical Ontology Research Contact: Web: olivier@nlm.nih.gov mor.nlm.nih.gov Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA