How Philosophy of Science Can Help Biomedical Research Barry Smith

Presentation transcript:

1 How Philosophy of Science Can Help Biomedical Research Barry Smith

2 How to Do Biology across the Genome?

3 Sequence of X chromosome in baker’s yeast
MKVSDRRKFEKANFDEFESALNNKNDLVHCPSITLFES IPTEVRSFYEDEKSGLIKVVKFRTGAMDRKRSFEKVVIS VMVGKNVKKFLTFVEDEPDFQGGPISKYLIPKKINLMVY TLFQVHTLKFNRKDYDTLSLFYLNRGYYNELSFRVLER CHEIASARPNDSSTMRTFTDFVSGAPIVRSLQKSTIRKY GYNLAPYMFLLLHVDELSIFSAYQASLPGEKKVDTERL KRDLCPRKPIEIKYFSQICNDMMNKKDRLGDILHIILRAC ALNFGAGPRGGAGDEEDRSITNEEPIIPSVDEHGLKVC KLRSPNTPRRLRKTLDAVKALLVSSCACTARDLDIFDD NNGVAMWKWIKILYHEVAQETTLKDSYRITLVPSSDGI SLLAFAGPQRNVYVDDTTRRIQLYTDYNKNGSSEPRLK TLDGLTSDYVFYFVTVLRQMQICALGNSYDAFNHDPW MDVVGFEDPNQVTNRDISRIVLYSYMFLNTAKGCLVEY ATFRQYMRELPKNAPQKLNFREMRQGLIALGRHCVGS RFETDLYESATSELMANHSVQTGRNIYGVDFSLTSVSG TTATLLQERASERWIQWLGLESDYHCSFSSTRNAEDV

MKVSDRRKFEKANFDEFESALNNKNDLVHCPSITLFESIPTEVRSFYEDEKSGLIKVVKFRTGAMDR KRSFEKVVISVMVGKNVKKFLTFVEDEPDFQGGPIPSKYLIPKKINLMVYTLFQVHTLKFNRKDYDTL SLFYLNRGYYNELSFRVLERCHEIASARPNDSSTMRTFTDFVSGAPIVRSLQKSTIRKYGYNLAPYM FLLLHVDELSIFSAYQASLPGEKKVDTERLKRDLCPRKPIEIKYFSQICNDMMNKKDRLGDILHIILRA CALNFGAGPRGGAGDEEDRSITNEEPIIPSVDEHGLKVCKLRSPNTPRRLRKTLDAVKALLVSSCAC TARDLDIFDDNNGVAMWKWIKILYHEVAQETTLKDSYRITLVPSSDGISLLAFAGPQRNVYVDDTTR RIQLYTDYNKNGSSEPRLKTLDGLTSDYVFYFVTVLRQMQICALGNSYDAFNHDPWMDVVGFEDP NQVTNRDISRIVLYSYMFLNTAKGCLVEYATFRQYMRELPKNAPQKLNFREMRQGLIALGRHCVGS RFETDLYESATSELMANHSVQTGRNIYGVDSFSLTSVSGTTATLLQERASERWIQWLGLESDYHCS FSSTRNAEDVVAGEAASSNHHQKISRVTRKRPREPKSTNDILVAGQKLFGSSFEFRDLHQLRLCYEI YMADTPSVAVQAPPGYGKTELFHLPLIALASKGDVEYVSFLFVPYTVLLANCMIRLGRRGCLNVAPV RNFIEEGYDGVTDLYVGIYDDLASTNFTDRIAAWENIVECTFRTNNVKLGYLIVDEFHNFETEVYRQS QFGGITNLDFDAFEKAIFLSGTAPEAVADAALQRIGLTGLAKKSMDINELKRSEDLSRGLSSYPTRMF NLIKEKSEVPLGHVHKIRKKVESQPEEALKLLLALFESEPESKAIVVASTTNEVEELACSWRKYFRVV WIHGKLGAAEKVSRTKEFVTDGSMQVLIGTKLVTEGIDIKQLMMVIMLDNRLNIIELIQGVGRLRDGG LCYLLSRKNSWAARNRKGELPPKEGCITEQVREFYGLESKKGKKGQHVGCCGSRTDLSADTVELIE RMDRLAEKQATASMSIVALPSSFQESNSSDRYRKYCSSDEDSNTCIHGSANASTNASTNAITTAST NVRTNATTNASTNATTNASTNASTNATTNASTNATTNSSTNATTTASTNVRTSATTTASINVRTSATT TESTNSSTNATTTESTNSSTNATTTESTNSNTSATTTASINVRTSATTTESTNSSTSATTTASINVRTS ATTTKSINSSTNATTTESTNSNTNATTTESTNSSTNATTTESTNSSTNATTTESTNSNTSAATTESTN SNTSATTTESTNASAKEDANKDGNAEDNRFHPVTDINKESYKRKGSQMVLLERKKLKAQFPNTSEN MNVLQFLGFRSDEIKHLFLYGIDIYFCPEGVFTQYGLCKGCQKMFELCVCWAGQKVSYRRIAWEAL AVERMLRNDEEYKEYLEDIEPYHGDPVGYLKYFSVKRREIYSQIQRNYAWYLAITRRRETISVLDSTR GKQGSQVFRMSGRQIKELYFKVWSNLRESKTEVLQYFLNWDEKKCQEEWEAKDDTVVVEALEKG GVFQRLRSMTSAGLQGPQYVKLQFSRHHRQLRSRYELSLGMHLRDQIALGVTPSKVPHWTAFLSM LIGLFYNKTFRQKLEYLLEQISEVWLLPHWLDLANVEVLAADDTRVPLYMLMVAVHKELDSDDVPDG RFDILLCRDSSREVGE 4

5

6 Stelzl et al., Cell, 2005

network of gene interactions in E. coli

8

9

10 what cellular component? what molecular function? what biological process?

11

12

13 The Idea of Common Controlled Vocabularies
MouseEcotope, GlyProt, DiabetInGene, GluChem
sphingolipid transporter activity

14 The Idea of Common Controlled Vocabularies
MouseEcotope, GlyProt, DiabetInGene, GluChem
Holliday junction helicase complex
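The idea on these two slides can be sketched in code: when independently maintained databases annotate their records with the same controlled term, one query retrieves data from all of them. The following is a minimal Python sketch. The database names and the two terms are taken from the slides; the record identifiers and the function names are invented for illustration.

```python
from collections import defaultdict

# Hypothetical records: (database, record id, controlled-vocabulary term).
# Database names and terms come from the slides; record ids are invented.
ANNOTATIONS = [
    ("MouseEcotope", "ME:0001", "sphingolipid transporter activity"),
    ("GlyProt", "GP:0417", "sphingolipid transporter activity"),
    ("DiabetInGene", "DG:0093", "Holliday junction helicase complex"),
    ("GluChem", "GC:0222", "sphingolipid transporter activity"),
    ("GlyProt", "GP:0500", "Holliday junction helicase complex"),
]


def index_by_term(annotations):
    """Group (database, record id) pairs under the term they share."""
    index = defaultdict(list)
    for database, record_id, term in annotations:
        index[term].append((database, record_id))
    return dict(index)


def databases_for(term, index):
    """Return the sorted names of databases holding data annotated with term."""
    return sorted({database for database, _ in index.get(term, [])})


INDEX = index_by_term(ANNOTATIONS)
print(databases_for("sphingolipid transporter activity", INDEX))
# → ['GluChem', 'GlyProt', 'MouseEcotope']
```

Because the term string is the only join key, the lookup works only if every database uses exactly the same vocabulary entry, which is the point of a common controlled vocabulary.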

15 Gene Ontology
male courtship behavior, orientation prior to leg tapping and wing vibration

16 Benefits of GO
1. based in biological science
2. links data to biological reality
3. links people to software
4. links data together
– across species (human, mouse, yeast, fly...)
– across granularities (molecule, cell, organ, organism, population)

17 The goal: all biological (biomedical) research data should cumulate to form a single, algorithmically processible whole

Ontologies already being applied to achieve this goal
Sjöblom T, et al. analyzed 13,023 genes in 11 breast and 11 colorectal cancers.
GO tells you what is standard functional information for these genes.
By tracking deviations from this standard, 189 genes could be identified as being mutated at significant frequency and thus as providing targets for diagnostic and therapeutic intervention.
Science Oct 13;314(5797):

19 Towards Empirical Philosophy
– processualist vs. 3-dimensionalist
– reductionist vs. non-reductionist
– realist vs. nominalist
If ontologies based on different philosophical principles are tested for their utility in support of scientific research, which types of ontologies will prove most useful?

20 Some sample ontologies
– Cell Ontology (CL)
– Foundational Model of Anatomy (FMA)
– Environment Ontology (EnvO)
– Gene Ontology (GO)
– Infectious Disease Ontology
– Phenotypic Quality Ontology (PATO)
– Protein Ontology (PRO)
– RNA Ontology (RnaO)
– Sequence Ontology (SO)

21

22

23

24

25 The problem: high-throughput experimental data are meaningless unless the researcher is provided with detailed information concerning how they were obtained

26 To make experimental data computationally accessible we need ontologies to describe the data
(1) from the point of view of their relation to reality
(2) from the point of view of their relation to experiments
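One way to read the two requirements above is that each data item carries two kinds of annotation: terms saying what in reality the data are about, and terms saying how the data were obtained. A minimal Python sketch follows. The class, its field names, and the example values are assumptions made for illustration and are not drawn from any particular ontology.

```python
from dataclasses import dataclass, field


@dataclass
class AnnotatedDatum:
    """A measurement plus the two kinds of annotation named above."""

    value: float
    about: list = field(default_factory=list)        # (1) relation to reality
    obtained_by: list = field(default_factory=list)  # (2) relation to experiments

    def is_interpretable(self):
        # Treated as usable only when both kinds of annotation are present.
        return bool(self.about) and bool(self.obtained_by)


# Example values are hypothetical.
bare = AnnotatedDatum(value=2.7)
annotated = AnnotatedDatum(
    value=2.7,
    about=["gene expression level"],
    obtained_by=["microarray assay", "log2 ratio normalization"],
)
print(bare.is_interpretable(), annotated.is_interpretable())
# → False True
```

Keeping the two annotation lists separate mirrors the split between describing the data's subject matter and describing the investigation that produced them.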

27 Three solutions
– The MGED Ontology
– OBI: The Ontology for Biomedical Investigations
– EXPO: The Experiment Ontology

28 MGED (Microarray Gene Expression Data) Ontology

29 MGED Ontology
Individual =def. name of the individual organism from which the biomaterial was derived
Experiment =def. The complete set of bioassays and their descriptions performed as an experiment for a common purpose.... An experiment will be often equivalent to a publication.

30 MGED Ontology
Chromosome =def. An abstraction used for annotation
Chromosome =def. A biological sequence that can be placed on an array

31 OBI: The Ontology for Biomedical Investigations
with thanks to Trish Whetzel and Richard Scheuermann

32 Purpose of OBI
To provide a resource for the unambiguous description of the components of biomedical investigations, such as the design, protocols and instrumentation, material, data, and types of analysis and statistical tools applied to the data
NOT designed to model biology

33 Hypothesis
That it is possible to create ontology resources of genuine utility by drawing on logical and philosophical principles, e.g. pertaining to consistency of definitions and avoidance of use-mention confusions.

34 OBI Collaborating Communities
– Crop sciences Generation Challenge Programme (GCP)
– Environmental genomics MGED RSBI Group
– Genomic Standards Consortium (GSC)
– HUPO Proteomics Standards Initiative (PSI), psidev.sourceforge.net
– Immunology Database and Analysis Portal
– Immune Epitope Database and Analysis Resource (IEDB)
– International Society for Analytical Cytology
– Metabolomics Standards Initiative (MSI)
– Neurogenetics, Biomedical Informatics Research Network (BIRN)
– Nutrigenomics MGED RSBI Group
– Polymorphism
– Toxicogenomics MGED RSBI Group
– Transcriptomics MGED Ontology Group

OBI – Tools and Documentation
Open source, standards compliant and version managed
Web Ontology Language (OWL) using Protégé editor
OBI.owl files are available from the OBI SVN Repository

36 The Problem of Clinical Investigations
Regulatory bodies such as the FDA need to assess the evidentiary value of enormous volumes of data collected, e.g. in trials on specific drug formulations.
For this, they need to impose standardization of the terminologies used to express these data, e.g. as developed by the Clinical Data Interchange Standards Consortium (CDISC).

37

Clinical Investigations terminologies

“Study Design”
Descriptive research
– Case study: description of one or more patients
– Developmental research: description of pattern of change over time
– Qualitative research: gathering data through interview or observation
Exploratory research
– Secondary analysis: exploring new relationships in old data
– Historical research: reconstructing the past through an assessment of archives or other records
Experimental research
– Randomized clinical trial
– Meta-analysis: statistically combining findings from several different studies to obtain a summary analysis

“Population”
Recruited population
– Randomized population
– Eligible population
– Screened population
– Premature termination population
Excluded population
– Excluded post-randomization population
– Not-eligible-population
Analyzed population
– Study arm population
– Crossover population
– Subgroup population
– Intent-to-treat population: based on randomization
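The grouping on this slide can be written down as a small term hierarchy and queried. The sketch below is a minimal Python illustration under two assumptions that the slide does not state: that the grouping expresses an is_a (subtype) relation, and that the slide title "Population" is the root. The function names are invented for illustration.

```python
# Child -> parent links read off the slide's grouping.
# Assumptions: the grouping is an is_a hierarchy, and "Population" is the root.
IS_A = {
    "Recruited population": "Population",
    "Excluded population": "Population",
    "Analyzed population": "Population",
    "Randomized population": "Recruited population",
    "Eligible population": "Recruited population",
    "Screened population": "Recruited population",
    "Premature termination population": "Recruited population",
    "Excluded post-randomization population": "Excluded population",
    "Not-eligible-population": "Excluded population",
    "Study arm population": "Analyzed population",
    "Crossover population": "Analyzed population",
    "Subgroup population": "Analyzed population",
    "Intent-to-treat population": "Analyzed population",
}


def ancestors(term, is_a=IS_A):
    """Return the chain of parents from term up to the root."""
    chain = []
    while term in is_a:
        term = is_a[term]
        chain.append(term)
    return chain


def is_subtype(term, candidate, is_a=IS_A):
    """True if term is candidate itself or lies below it in the hierarchy."""
    return term == candidate or candidate in ancestors(term, is_a)


print(ancestors("Crossover population"))
# → ['Analyzed population', 'Population']
```

A query such as `is_subtype("Not-eligible-population", "Excluded population")` then lets data labelled with a specific term be retrieved under the broader one.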

Overview of OCI

Meta-analysis (CDISC)
Quality assurance (CDISC)
Quality control (CDISC)
Baseline assessment (CDISC)
Validation (CDISC)
Coding (MUSC)
Permuted block randomization (MUSC)
Secondary-study-protocol (RCT)
Intervention-step (RCT)
Blinding-method (RCT)
Study design
Development plan (CDISC)
Standard operating procedures (CDISC)
Statistical analysis plan (CDISC)

Negative findings (MUSC)
Positive findings (MUSC)
Primary-outcome (RCT)
Secondary-outcome (RCT)

46 EXPO: The Ontology of Experiments
L. Soldatova, R. King
Department of Computer Science, The University of Wales, Aberystwyth

47 EXPO: Experiment Ontology

48 EXPO: Experiment Ontology

49 EXPO: Experiment Ontology

50 experimental actions part_of experimental design
subject of experiment part_of experimental design
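The two part_of assertions on this slide can be stored as subject-relation-object triples and queried. The following minimal Python sketch treats part_of as transitive. The third triple is a hypothetical addition, included only so that the transitive step has something to follow; it is not taken from the slide, and the function name is invented.

```python
TRIPLES = [
    ("experimental actions", "part_of", "experimental design"),
    ("subject of experiment", "part_of", "experimental design"),
    # Hypothetical extra assertion, added only to show transitivity.
    ("experimental design", "part_of", "experiment"),
]


def wholes_of(part, triples=TRIPLES):
    """Return every whole that part is part_of, following part_of transitively."""
    found = []
    frontier = [part]
    while frontier:
        current = frontier.pop()
        for subject, relation, obj in triples:
            if relation == "part_of" and subject == current and obj not in found:
                found.append(obj)
                frontier.append(obj)
    return found


print(wholes_of("experimental actions"))
# → ['experimental design', 'experiment']
```

The `obj not in found` check keeps the traversal from looping if the triples ever contain a cycle.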

51 Role of Philosophy of Science EXPO: Experiment Ontology

52 Towards Empirical Philosophy of Science
– rational statistical models of induction
– case-based / domain-based reasoning
– falsificationism
– Humeanism vs. laws
– logical, relative frequency, Bayesian, objective (chance) and epistemic theories of probability
These generate different ontologies of scientific evidence. Which one is correct?

53 Environment Ontology + Phenotypic Quality Ontology + Ontology for Personalized and Community Medicine
‘Racial’ Phenotypes: Social, Phylogenetic, Essentialistic...

54 Ontology for Personalized and Community Medicine
to support studies of differential effects on health
1. of environmental qualities of different neighborhoods and
2. of different community behavior phenotypes