Tim Clark Harvard Medical School & Massachusetts General Hospital RPI Tetherless World Constellation May 3, 2011 Copyright 2010 Massachusetts General.

Presentation transcript:

Tim Clark Harvard Medical School & Massachusetts General Hospital RPI Tetherless World Constellation May 3, 2011 Copyright 2010 Massachusetts General Hospital. All rights reserved.

- Biomedical web data integration challenges
- Requirements to cure complex disorders
- Catch-22 for semantic data in medicine
- Web 3.0 and semantic metadata
- Injecting semantics into the existing ecosystem
- Integrating ontologies, documents & data
- Annotation Ontology & Annotation Framework
- Hypothesis management (vs. KM)

- Alzheimer Disease
- Huntington’s Disease
- Nicotine Addiction
- Schizophrenia
- Bipolar Disorder
- Alcohol addiction
- Autism
- Parkinson’s Disease
- ALS
- Neuropathic Pain
- Major Depressive Disorder

- Yearly mortality (U.S.) = 642,00 people
- Yearly costs (U.S.) = $676 B / 4.7% GDP
- Prevalence = 5.3 M + 76 M M = 95.7 M people

create hypothesis, design experiment, run experiment, collect data, interpret data, share interpretations, synthesize knowledge

MCI progressors / non-progressors
PET imaging of PIB (radiolabelled compound that binds amyloid beta A4 protein)
MRI imaging of brain structure showing loss of hippocampal volume
Brain Nov;133(Pt 11): = 218 subjects +

Alzheimer Disease Parkinson’s Disease Schizophrenia Autism Bipolar Disorder Drug Addiction Huntington’s Disease ALS Depression

dopaminergic pathway α-synuclein, β-amyloid α-synuclein, Tau chr 16p11.2 CNV CRF, glutamatergic system, dopamine, amygdala … Alzheimer Disease Parkinson’s Disease Schizophrenia Autism Bipolar Disorder Drug Addiction Huntington’s Disease ALS Depression SIRT2

1. We want to organize all the known facts in neurobiology so we can mash them up.
2. There are no “facts” in neurobiology, except uninteresting ones.
3. All we have are assertions supported by evidence, of varying quality.

Printing Press / Web

We scientists do not attend professional meetings to present our findings ex cathedra, but in order to argue. John Polanyi, FRS, Nobel Laureate University of Manchester

- Social Web (Web 2.0, read/write)
- Shared annotation with controlled terminology systems (Sem Web)
+

- Information sharing within communities or tasks via Social Web (Web 2.0), wikis and forums
- Information “permeability” across pharma R&D projects / domains / pipeline stages via shared metadata (semantic annotation)
- Web 3.0 improves cross-domain signal-to-noise, institutional memory & data “findability”

Genes Proteins Biological Processes Chemical Compounds Antibodies Cells Brain anatomy …

- Annotation Ontology (AO) is a domain-independent Web ontology.
- Links document fragments to ontology terms.
- Metadata separate from annotated documents.
- SWAN AF manages document annotation.
- Interfaces to text-mining services & supports curation.
- Collaborating with NCBO, UCSD, Elsevier, USC, Manchester, EMBL, Colorado, EBI, etc.
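
The points above (document fragments linked to ontology terms, with the metadata kept apart from the annotated document) can be sketched as a small stand-off annotation structure. This is only an illustration: the class names, field names, and example.org identifiers are assumptions made here and are not the Annotation Ontology's actual vocabulary or the SWAN AF interface.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextSelector:
    """Identifies a document fragment by character offsets (hypothetical design)."""
    document_uri: str
    start: int
    end: int


@dataclass(frozen=True)
class Annotation:
    """Stand-off annotation: stored apart from the document it describes."""
    selector: TextSelector
    term_uri: str    # ontology term the fragment is linked to
    created_by: str  # curator or text-mining service that proposed the link


def fragment_text(document_text: str, selector: TextSelector) -> str:
    """Resolve a selector against the unmodified document text."""
    return document_text[selector.start:selector.end]


# Hypothetical document and identifiers, for illustration only.
DOC_URI = "http://example.org/papers/1"
DOC_TEXT = "BACE1 cleaves amyloid precursor protein."

annotation = Annotation(
    selector=TextSelector(DOC_URI, 0, 5),
    term_uri="http://example.org/ontology/BACE1",
    created_by="http://example.org/people/curator-1",
)

print(fragment_text(DOC_TEXT, annotation.selector))  # prints "BACE1"
```

Because the annotation only points into the text, the document itself is never modified, and several annotators can attach different terms to the same fragment.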

Text Shared metadata

2) Automatic annotation Dr. Paolo Ciccarese – Oct 8, 2010

- Semantics on documents (SESL)
- Vocabulary standards & terminology development
- Document & data management
- Collaboratories & web communities
- Hypothesis management (SWAN)
- Nanopublications (OpenPHACTS)

- Model the thinking behind your research
- Database it, web-ify it, RDF-ize it, share it
- Link the Models / Hypotheses to
  - Claims / Interpretations
  - Evidence (publications, experiments, data)
  - Supporting and contradictory claims from others
  - Evidence for these other claims
- Web 3.0: share, compare and discuss
- Manage knowledge while creating it
- Can be public, private, or semi-private
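
One way to picture the linking described above is a small object model in which a hypothesis holds claims, and each claim carries its evidence plus supporting and contradictory claims from others. This is a rough sketch and not the SWAN data model: all class and field names are assumptions, and the evidence references and the contradictory claim are placeholders rather than real citations.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Evidence:
    kind: str       # e.g. "publication", "experiment", "data"
    reference: str  # identifier or citation string


@dataclass
class Claim:
    statement: str
    author: str
    evidence: List[Evidence] = field(default_factory=list)
    supported_by: List["Claim"] = field(default_factory=list)
    contradicted_by: List["Claim"] = field(default_factory=list)


@dataclass
class Hypothesis:
    title: str
    claims: List[Claim] = field(default_factory=list)
    visibility: str = "private"  # "public", "private" or "semi-private"


def all_evidence(hypothesis: Hypothesis) -> List[Evidence]:
    """Collect evidence for each claim and for related claims made by others."""
    collected: List[Evidence] = []
    for claim in hypothesis.claims:
        collected.extend(claim.evidence)
        for other in claim.supported_by + claim.contradicted_by:
            collected.extend(other.evidence)
    return collected


# The claim text and author appear later in the slides; everything else is a placeholder.
claim = Claim(
    statement="Intramembranous Aβ behaves as chaperones of other membrane proteins",
    author="Vincent Marchesi",
    evidence=[Evidence("publication", "placeholder-publication-1")],
)
counter = Claim(
    statement="Placeholder contradictory claim",
    author="placeholder-author",
    evidence=[Evidence("data", "placeholder-dataset-1")],
)
claim.contradicted_by.append(counter)

hypothesis = Hypothesis("Placeholder hypothesis", claims=[claim], visibility="semi-private")
print([e.reference for e in all_evidence(hypothesis)])
# prints ['placeholder-publication-1', 'placeholder-dataset-1']
```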


Mons / Groth model of a nanopublication: Cognitive Deficits (S), Relate to (p), BACE1 (O), plus provenance and context. With thanks to Barend Mons and Paul Groth…
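
The nanopublication on this slide pairs a subject / predicate / object assertion with provenance and context. A minimal sketch of that shape, assuming simple string-keyed dictionaries for provenance and context: the type names, dictionary keys, and placeholder values are illustrative and are not taken from any nanopublication specification.

```python
from typing import Dict, NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


class Nanopublication(NamedTuple):
    assertion: Triple
    provenance: Dict[str, str]  # who asserted it and how (placeholder keys)
    context: Dict[str, str]     # conditions under which it holds (placeholder keys)


def describe(nanopub: Nanopublication) -> str:
    """Render the assertion as a plain sentence-like string."""
    s, p, o = nanopub.assertion
    return f"{s} {p} {o}"


nanopub = Nanopublication(
    assertion=Triple("Cognitive Deficits", "relate to", "BACE1"),
    provenance={"asserted_by": "placeholder-author"},
    context={"scope": "placeholder-context"},
)

print(describe(nanopub))  # prints "Cognitive Deficits relate to BACE1"
```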

swande:Claim Intramembranous Aβ behaves as chaperones of other membrane proteins rdf:type dct:title G1 pav:authoredBy Vincent Marchesi foaf:name foaf:Person rdf:type pav: foaf: G2

swande:Claim Intramembranous Aβ behaves as chaperones of other membrane proteins rdf:type dct:title G1 pav:authoredBy G2 pav:curatedBy G4 Gwen Wong foaf:name foaf:Person rdf:type

swande:Claim Intramembranous Aβ behaves as chaperones of other membrane proteins rdf:type dct:title G1 pav:contributedBy swanrel:referencesAsSupportiveEvidence G5 G6

G8 rdf:type Event of type GO "chaperone binding" rdfs:label rdf:type rdfs:label “Beta amyloid” rdfs:label “Membrane protein” rdfs:label “Plasma membrane” With many thanks to Nigam Shah, Stanford University

Hyque triples G8 pav:contributedBy Nigam Shah foaf:name foaf:Person rdf:type G9

swande:Claim Intramembranous Aβ behaves as chaperones of other membrane proteins rdf:type dct:title G1 Hyque triples G8 swanrel:derivedFrom
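
The diagrams above appear to place the claim in one named graph (G1) and to attach authorship and curation to it from separate graphs (G2, G4). A minimal sketch of that pattern using plain tuples as quads: the layout is one reading of the slides, the `ex:` identifiers are made up for illustration, and no RDF library is used.

```python
# Each quad is (graph, subject, predicate, object).
# Assumption: provenance statements take the claim's graph name (G1) as their subject.
quads = [
    ("G1", "ex:claim", "rdf:type", "swande:Claim"),
    ("G1", "ex:claim", "dct:title",
     "Intramembranous Aβ behaves as chaperones of other membrane proteins"),
    ("G2", "G1", "pav:authoredBy", "ex:marchesi"),
    ("G2", "ex:marchesi", "rdf:type", "foaf:Person"),
    ("G2", "ex:marchesi", "foaf:name", "Vincent Marchesi"),
    ("G4", "G1", "pav:curatedBy", "ex:wong"),
    ("G4", "ex:wong", "rdf:type", "foaf:Person"),
    ("G4", "ex:wong", "foaf:name", "Gwen Wong"),
]


def statements_about(subject, quads):
    """Return (graph, predicate, object) for every quad with this subject."""
    return [(g, p, o) for g, s, p, o in quads if s == subject]


def name_of(person, quads):
    """Look up a person's foaf:name, or None if it is not recorded."""
    for _, s, p, o in quads:
        if s == person and p == "foaf:name":
            return o
    return None


def contributors(graph, quads):
    """Map each pav: predicate stated about `graph` to the contributor's name."""
    return {
        p: name_of(o, quads)
        for _, p, o in statements_about(graph, quads)
        if p.startswith("pav:")
    }


print(contributors("G1", quads))
# prints {'pav:authoredBy': 'Vincent Marchesi', 'pav:curatedBy': 'Gwen Wong'}
```

Keeping provenance in its own graphs means authorship or curation records can be added or revised without touching the statements of the claim itself.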

- Target / pathway hypotheses will be linked to:
  - Pathway & target relation to disease,
  - Target selection criteria,
  - Validation assays and criteria,
  - Experiment (assay) provenance,
  - Experimental data and computations,
  - Scientist remarks, findings and discussion.
- Start as a relatively simple model and extend

- Hypotheses of therapeutic action for compounds and scaffolds will be linked to
  - Hypothesis / results for individual assays,
  - Experiment (assay) provenance,
  - Experimental data,
  - Group annotation,
  - Internal databases etc.
- Start as a relatively simple model and extend

Information ecosystem

- Research reproducibility
- Linking data to documents at time of publication
- Citation of reagents, instruments, code, protocols
- Bibliographies and citation networks
- Bibliographic records and citations are metadata
- Personal annotations
- Selective sharing and virtual communities
- Database annotation
- Biomedical ontology database curation projects

- What is NASA ADS?
  - Web database comprising over 8 million astronomy and physics papers
  - Full-text for over 880K articles, including all major astronomy journals
- NASA ADS semantic annotation requirements
  - Astronomical objects by catalog ID
  - Specific telescope, type of telescope, wavelength
  - Investigators
  - Grant funding sources

- Curing complex medical disorders goes hand in hand with next-gen biomedical communications
- Web 3.0 provides the technology framework
- Semantic annotation, hypothesis management, nanopubs: tools for next-gen biomed comms.
- Requires / enables international collaborations of biomedical researchers and informaticians.
- Open enterprise model with semantic metadata.

- People
  - Paolo Ciccarese (Harvard)
  - Maryann Martone (UCSD)
  - Anita DeWaard & Tony Scerri (Elsevier)
  - Karen Verspoor & Larry Hunter (Colorado)
  - Adam West & Ernst Dow (Eli Lilly)
  - Carole Goble (Manchester)
  - Nigam Shah (Stanford / NCBO)
  - Paul Groth (VU Amsterdam)
- Funding: Elsevier, NIH, Eli Lilly, & EMD Serono

Whereas King Ptolemy, living forever, the Manifest God whose excellence is fine, son of King Ptolemy and Queen Arsinoe, the Father-loving Gods, is wont to do many favours for the temples of Egypt and for all those who are subject to his kingship, he being a god…
English translation by R.S. Simpson