2007 CDISC International Interchange Ontologies in Clinical Research: Representation of clinical research data in the framework of formal biomedical ontologies.

Slides:



Advertisements
Similar presentations
UC BRAID: Co-creating and evaluating performance in a regional laboratory for conducting translational science UC BRAID Executive Committee: Steven Dubinett.
Advertisements

Nursing Research Opportunities in the USPHS CAPT. Victoria L. Anderson, RN, CRNP, MSN.
Confessions/Disclaimers Ontologies and REDfly CARO SO OBO Foundry.
Species-Neutral vs. Multi-Species Ontologies Barry Smith.
On the Future of the NeuroBehavior Ontology and Its Relation to the Mental Functioning Ontology Barry Smith
Elements of a clinical trial research protocol
Standardizing Metadata Associated with NIAID Genome Sequencing Center Projects Richard H. Scheuermann, Ph.D. Department of Pathology Division of Biomedical.
South Carolina Clinical & Translational Research Institute Royce R. Sampson, MSN, RN, CRA SUCCESS Center Director Stephanie.
Overview of Biomedical Informatics Rakesh Nagarajan.
University of Pittsburgh Department of Biomedical Informatics Healthcare institutions have established local clinical data research repositories to enable.
1 Introduction to Biomedical Ontology Barry Smith University at Buffalo
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
The Problem of Reusability of Biomedical Data OBO Foundry & HL7 RIM Barry Smith.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Room for Lunch: Arlington Room Room for Evening Reception: Grand Prairie Room.
CTO - Clinical Trials/Research in the Ontology of Biomedical Investigation Richard H. Scheuermann U.T. Southwestern Medical Center.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
1 BIOLOGICAL DOMAIN ONTOLOGIES & BASIC FORMAL ONTOLOGY Barry Smith.
How to Organize the World of Ontologies Barry Smith 1.
New York State Center of Excellence in Bioinformatics & Life Sciences Biomedical Ontology in Buffalo Part I: The Gene Ontology Barry Smith and Werner Ceusters.
What is “Biomedical Informatics”?. Biomedical Informatics Biomedical informatics (BMI) is the interdisciplinary field that studies and pursues.
The NIH Roadmap for Medical Research
Richard H. Scheuermann, Ph.D. Department of Pathology Division of Biomedical Informatics U.T. Southwestern Medical Center Standardizing Metadata Associated.
Medical Informatics Basics
Formulating objectives, general and specific
Standardizing Metadata Associated with NIAID Genome Sequencing Center Projects and their Implementation in NIAID Bioinformatics Resource Centers Richard.
McGraw-Hill © 2006 The McGraw-Hill Companies, Inc. All rights reserved. The Nature of Research Chapter One.
Limning the CTS Ontology Landscape Barry Smith 1.
Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Ontological realism as a strategy for integrating ontologies Ontology Summit February 7, 2013 Barry Smith 1.
OBI – Communities and Structure 1. Coordination Committee (CC): Representatives of the communities -> Monthly conferences 2. Developers WG: CC and other.
Resurrecting SOWG BS, Baltimore, CTS Ontology Workshop April
From Bench to Bedside: Applications to Drug Discovery and Development Eric Neumann W3C HCLSIG co-chair Teranode Corporation HCLSIG F2F Cambridge MA.
Dave Iberson-Hurst CDISC VP Technical Strategy
Experimentation in Computer Science (Part 1). Outline  Empirical Strategies  Measurement  Experiment Process.
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Alan Ruttenberg PONS R&D Task force Alan Ruttenberg Science Commons.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
1 Copyright © 2011 by Saunders, an imprint of Elsevier Inc. Chapter 8 Clarifying Quantitative Research Designs.
EXPERIMENTAL EPIDEMIOLOGY
How to integrate data Barry Smith. The problem: many, many silos DoD spends more than $6B annually developing a portfolio of more than 2,000 business.
National Center for Research Resources G. Iris Obrams, M.D., M.P.H., Ph.D. NCRR Update 5 August 2006.
2 3 where in the body ? where in the cell ?
Mining the Biomedical Research Literature Ken Baclawski.
How To Design a Clinical Trial
Need for common standard upper ontology
NCI Enterprise Services (aka COPPA) CTRP and the Suite March 19, 2009.
Introduction to Research. Purpose of Research Evidence-based practice Validate clinical practice through scientific inquiry Scientific rational must exist.
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Clinical Research Informatics [CRI]. Informatics, defined generally as the intersection of information and computer science with a health-related discipline,
Immunology Ontology Rho Meeting October 10, 2013.
Basic Formal Ontology Barry Smith August 26, 2013.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Structured Protocol Representation for the Cancer Biomedical Informatics Grid: caSPR and caPRI.
Biomedical Informatics and Health. What is “Biomedical Informatics”?
Clinical and Translational Science Awards (CTSAs) Future Directions and Opportunities for ACNP investigators.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
GCP (GOOD CLINICAL PRACTISE)
How To Design a Clinical Trial
Dave Iberson-Hurst CDISC VP Technical Strategy
NIH Clinical Trial Requirements
Assign membership into BFO/OBI branches
What is “Biomedical Informatics”?
OBI – Standard Semantic
Clinical and Translational Science Awards Program
What is “Biomedical Informatics”?
Presentation transcript:

2007 CDISC International Interchange Ontologies in Clinical Research: Representation of clinical research data in the framework of formal biomedical ontologies Richard H. Scheuermann Chief, Division of Biomedical Informatics U.T. Southwestern Medical Center

Outline Motivation - US NIH Clinical and Translational Science Award (CTSA) Ontologies and the Open Biomedical Ontologies (OBO) Foundry Ontology for Biomedical Investigations (OBI) Ontology for Clinical Investigations (OCI) –Approach –Current status –Future direction

Clinical and Translational Science Award (CTSA)  Implementing biomedical discoveries made in the last 10 years demands an evolution of clinical science.  New prevention strategies and treatments must be developed, tested, and brought into medical practice more rapidly.  CTSA awards will help to lower barriers between disciplines, and encourage creative, innovative approaches to solving complex medical problems.  These awards will catalyze change -- breaking silos, breaking barriers, and breaking conventions.

Building a National CTSA Consortium

Trial Design Advanced Degree-Granting Programs Participant & Community Involvement Regulatory Support Biostatistics Clinical Resources Biomedical Informatics Clinical Research Ethics CTSAHOME NIH & other government agencies Healthcare organizations Industry Each academic health center will create a home for clinical and translational science

Data management - to develop a comprehensive controlled information system infrastructure to capture and manage clinical and translational research data Data integration - to integrate clinical and translational research data with data and knowledge from external public database resources Data analysis - to support clinical and translational research data analysis by providing state-of-the-art software analytical tools Support - to provide training and support for CRIS use Clinical Research Information System

Clinical Research Information Systems Protocol Design Protocol Design Statistical Endpoint Analysis Case Report Form Development Visit Management Subject Enrollment & Consenting Clinical Data Capture Laboratory Experimentation Specification Phase Implementation Phase Analysis Phase Sample Procurement & Processing Sample Procurement & Processing Consent Form Development Consent Form Development IRB Submission & Approval Grant Proposal Development Grant Proposal Development Enrollment Criteria Specification Subject Identification & Recruitment Agency & Scientific Reporting Laboratory Results Analysis Integrative Data Analysis Workflow Stakeholders Principal Investigator Principal Investigator IRB Grants Management Grants Management Research Coordinator Research Coordinator Study Subject Study Subject Lab Personnel Lab Personnel Principal Investigator Principal Investigator Study Monitor Study Monitor Data & Statistics Analyst Data & Statistics Analyst Database Analyst Database Analyst Study Sponsor Study Sponsor Functions Feasibility Study

Requirements Accurate Representation –therapeutic drug as a design variable vs. medical history –DNA as a therapeutic agent vs. analysis specimen Interoperability –unambiguous data exchange between research sites –effective data exchange between software applications Customization –support of study-specific details Dynamics –role changes throughout and between studies –eligibility criteria to relevant clinical phenotype Inference –semantic queries (e.g. patients with autoimmune disease) Meta-analysis –studies with common features (e.g. all studies where flu vaccine was evaluated as a conditional variable)

Constraints Essential to build upon and extend, or map to, existing and emerging data standards (e.g. HL7, CDISC) and relevant vocabularies (e.g. ICD-9/10, NCI Thesaurus, SNOMED-CT) Recognize the difference between medical (hospital) IT and biological (science) IT Support wide variety of different clinical and translational study types - reduce complexity by modeling commonalities Support needs of multiple stakeholders - different uses of same data Standards should be easy to implement and use Standards need to be easily and logically extensible Support clinical research data use cases

Need for standard representations Data standards + Common vocabularies + Extensible data model = Data interoperability Description Framework

Data Standards and Interoperability: Minimum Information Sets - CDISC Codelists, MIBBI Vocabularies & Ontologies - ICD-9/10, SNOMED, LOINC, NCI Thesaurus, OBO Foundry Object Models - CDISC, HL7 RIM, BRIDG, FuGE Exchange Syntaxes - HL7, XML, RDF

Definition of “Ontology” Philosophical “The study of that which exists” (ISMB 2005) “The science of what is: of the kinds and structures of the objects, and their properties and relations in every area of reality” (ISMB 2005) Information/computer scientists “A shared, common, backbone taxonomy of relevant entities, and the relationships between them, within an application domain” (ISMB 2005) “A computable representation of biological reality” (ISMB 2005) “A structured vocabulary” “A formal way of representing knowledge in which concepts are described both by their meaning and their relationship to each other” (Bard 2004) “A data model that represents a domain and is used to reason about the objects in that domain and the relations between them” (Wikipedia)

Provide clear thinking about how to structure information Support data integration, modeling, query processing, user interface development, data exchange/export To enforce data correctness To be able to map to database management systems To enables a computer to reason over the data To provide the capability to infer relationships that have not been explicitly defined Ontology Goals

The OBO Foundry The OBO foundry is a set of interoperable ontologies that adhere to a growing set of principles set forth for best practices in ontology development

The OBO Foundry

16 RELATION TO TIME GRANULARITY CONTINUANTOCCURRENT INDEPENDENTDEPENDENT ORGAN AND ORGANISM Organism (NCBI Taxonomy / placeholder) Anatomical Entity (FMA, CARO) Organ Function (placeholder) Phenotypic Quality (PaTO) Biological Process (GO) CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Process (GO) Initial OBO Foundry Ontologies building out from the original GO

17 Mature OBO Foundry ontologies (now undergoing reform) Cell Ontology (CL) Chemical Entities of Biological Interest (ChEBI) Foundational Model of Anatomy (FMA) Gene Ontology (GO) Phenotypic Quality Ontology (PaTO) Relation Ontology (RO) Sequence Ontology (SO)

18 Ontologies being built to satisfy Foundry principles ab initio Ontology for Clinical Investigations (OCI) Common Anatomy Reference Ontology (CARO) Environment Ontology (EnvO) Ontology for Biomedical Investigations (OBI) Protein Ontology (PRO) RNA Ontology (RnaO) Subcellular Anatomy Ontology (SAO) Disease Ontology (DO)

Disease Ontology

OBO Foundry provides a suite of basic science Reference Ontologies designed to serve as modules for re-use in Application Ontologies such as: Infectious Disease Ontology Immunology Ontology Multiple Sclerosis Ontology Mammalian Adult Neurogenesis Ontology 20

Ontology for Biomedical Investigations - Overview  International collaboration (since 2006) Communities developing ontologies/terminologies - Unambiguous description of how the investigation was performed - Consistent annotation, powerful queries and data integration  Describe the laboratory workflow Set of universal terms - Investigation (organization, intent, design etc) - Material (biological and chemical, manipulation and transformation) - Protocols and instrumentations - Data generated and types of analysis performed on it Set of biological and technological domain-specific terms - To meet the annotation requirements of any given community (e.g. clinical research)  Part of the Open Biomedical Ontology (OBO) Foundry Orthogonality and x-referencing with existing bio-ontologies 'Interoperable by construction' with those under the Foundry -Including Unit, Quality (PATO), Environment and Chemical (ChEBI) ontologies Agree on an initial structure (trunk) with is_a relationship - Rely on Relation Ontology (RO)

OBI – Communities and Structure 1. Coordination Committee (CC): Representatives of the communities -> Monthly conferences 2. Developers WG: CC and other communities’ members Weekly conferences calls 3. Advisors: -> cBiO will oversee the Open BioMedical Ontology (OBO) initiative

OBI – Top Level Classes  Continuant: an entity that endure/remains the same through time Independent Continuant: stands on its own E.g. All physical entities (instrument, technology platform, document etc.) E.g. Biological material (organism, population etc.) Dependent Continuant: inheres from another entity E.g. Environment (depend on the set of ranges of conditions, e.g. geographic location) E.g. Characteristics (entity that can be measured, e.g. temperature, unit) - Realizable: an entity that is realized through a process (executed/run) E.g. Software (a set of machine instructions) E.g. Design (the plan that can be realized in a process) E.g. Role (the part played by an entity within the context of a process)  Occurrent: an entity that occurs/unfolds in time E.g. Temporal Regions, Spatio-Temporal Regions (single actions or Event) Process E.g. Investigation (the entire ‘experimental’ process) E.g. Assay (process of performing some tests and recording the results)

Ontology for Clinical Investigations Approach Transparency and inclusivity ( ge; Google “OCI wiki”) ge Combined top down/bottom up approach (prospective standardization) –Assembled term lists –Combine terms –Separate homonyms –Combine synonyms –Assigned membership into BFO/OBI branches –Position terms within branches –Define terms Testing

OCI Wiki

Term lists

Homonyms sample size: 1. A subset of a larger population, selected for investigation to draw conclusions or make estimates about the larger population. 2. The number of subjects in a clinical trial. 3. Number of subjects required for primary analysis.

Study Design Descriptive research – research in which the investigator attempts to describe a group of individuals based on a set of variable in order to document their characteristics –Case study – description of one or more patients –Developmental research – description of pattern of change over time –Normative research – establishing normal values –Qualitative research – gathering data through interview or observation –Evaluation research – objectively assess a program or policy by describing the needs for the services or policy, often using surveys or questionnaires Exploratory research –Cohort or case-control studies – establish associations through epidemiological studies –Methodological studies – establish reliability and validity of a new method –Secondary analysis – exploring new relationships in old data –Historical research – reconstructing the past through an assessment of archives or other records Experimental research –Randomized clinical trial – controlled comparison of an experimental intervention allowing the assessment of the causes of outcomes Single-subject design Sequential clinical trial Evaluation research – assessment of the success of a program or policy –Quasi-experimental research –Meta-analysis – statistically combining findings from several different studies to obtain a summary analysis

Assign membership into BFO/OBI branches

Biological marker (CDISC) Study populations (CDISC) Trial coordinator (CDISC) Study variable (CDISC) Drug (RCT) Subject (MUSC)

Case report form (CDISC) Patient file (CDISC) Consent form (CDISC) New drug application (MUSC) Investigational new drug application (MUSC)

Meta-analysis (CDISC) Quality assurance (CDISC) Quality control (CDISC) Baseline assessment (CDISC) Validation (CDISC) Coding (MUSC) Permuted block randomization (MUSC) Secondary-study-protocol (RCT) Intervention-step (RCT) Blinding-method (RCT) Study design Development plan (CDISC) Standard operating procedures (CDISC) Statistical analysis plan (CDISC)

Future directions Engage more stakeholders Direct collaboration with organizations such as CDISC Continue development Evaluation approaches and metrics –Based on scientific use cases –Categories of use cases Interoperability –Data exchange –Accuracy of representation –Homonyms and context; ontology helps us do that Reasoning and inference –Test with CTSA IT Project (trial registration)

OCI Working Group Jennifer Fostel Richard Scheuermann Cristian Cocos W. Jim Zheng Wenle Zhao Herb Hagler Jamie Lee Matthias Brochhausen Amar K. Das Dave Parrish Barry Smith Trish Whetzel