Mark A. Musen, M.D., Ph.D. Stanford University

Slides:



Advertisements
Similar presentations
16/11/ IRS-II: A Framework and Infrastructure for Semantic Web Services Motta, Domingue, Cabral, Gaspari Presenter: Emilia Cimpian.
Advertisements

The National Center for Biomedical Ontology Online Knowledge Resources for the Industrial Age Mark A. Musen Stanford University
George A. Komatsoulis, Ph.D. National Center for Biotechnology Information National Library of Medicine National Institutes of Health U.S. Department of.
Copyright 2002 Prentice-Hall, Inc. Chapter 4 Automated Tools for Systems Development 4.1 Modern Systems Analysis and Design Third Edition.
Adaptive Book: A Platform for teaching, learning and student modeling Ananda Gunawardena School of Computer Science Carnegie Mellon University.
Copyright 2002 Prentice-Hall, Inc. Chapter 4 Automated Tools for Systems Development 4.1 Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Data Curation and Management activities within the UCT Computational Biology Group Dr Nicky Mulder.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
A bad case of content reuse Validator Website to Validate License Violations Validator – Only requires the URI of the site to check for a license violation.
Leveraging Ontologies for Human Immunology Research Barry Smith, Alexander Diehl, Anna- Maria Masci Presented at Leveraging Standards and Ontologies to.
Clinical Data Interchange Standards Consortium (CDISC) uses NCIt for its Study Data Tabulation Model (SDTM) and other global data standards for medical.
Andrew S. Budarevsky Adaptive Application Data Management Overview.
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Adaptive Book: Teaching and Learning Environment for Programming Education Ananda Gunawardena & Victor Adamchik.
IBM Software Group ® Managing Reusable Assets Using Rational Suite Shimon Nir.
Harokopio University of Athens – Department of Informatics and Telematics HAROKOPIOUNIVERSITY A Distributed Architecture for Building Federated Digital.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
NIH: DATA SCIENCE & BD2K Jennie Larkin, PhD Senior Advisor, Extramural Programs and Strategic Planning Office of the Associate Director for Data Science,
Web 2.0 IS530 Fall 2009 Dr. Dania Bilal. Web 2.0 Is the Web that is being transformed into a computing platform for delivering web applications to end.
Supporting Collaborative Ontology Development in Protégé International Semantic Web Conference 2008 Tania Tudorache, Natalya F. Noy, Mark A. Musen Stanford.
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
Linking Ontologies and ISO/IEC in the CEDAR Metadata Repository Martin J. O’Connor Technical Lead, CEDAR Project Stanford University.
1 CASE Computer Aided Software Engineering. 2 What is CASE ? A good workshop for any craftsperson has three primary characteristics 1.A collection of.
ONTOLOGY LIBRARIES: A STUDY FROM ONTOFIER AND ONTOLOGIST PERSPECTIVES Debashis Naskar 1 and Biswanath Dutta 2 DSIC, Universitat Politècnica de València.
Towards a unified MOD resource: An Overview
Eclipse Vorto Alexander Edelmann.
Making Metadata More Comprehensive and More Searchable with CEDAR
Jennie Larkin, PhD Senior Advisor
4.01 How Web Pages Work.
EOSC Services for Scientists
The National Center for Biomedical Ontology
Stanford University, Stanford, CA, USA
CIIT-Human Computer Interaction-CSC456-Fall-2015-Mr
IS301 – Software Engineering V:
Scientific Reproducibility using the Provenance for Healthcare and Clinical Research Framework Satya S. Sahoo Collaborators/Co-Authors: Joshua Valdez,
Modern Systems Analysis and Design Third Edition
Modern Systems Analysis and Design Third Edition
Using ArrayExpress.
knowledge organization for a food secure world
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Web Services CO5027.
Online Communication and Marketing (Day 2, )
X-DOMES: Cross Domain Observational Metadata for Environmental Sensing
Chapter 4 Automated Tools for Systems Development
RichAnnotator: Annotating rich (XML-like) documents
Knowledge Based Workflow Building Architecture
Modern Systems Analysis and Design Third Edition
Browser Engine How it works…..
Health Ingenuity Exchange - HingX
An ecosystem of contributions
2. An overview of SDMX (What is SDMX? Part I)
BlackBoard 5 A Definitive e-Learning Software Platform Ozgur Balsoy,
Modern Systems Analysis and Design Third Edition
An EUDAT-based FAIR Data Approach for Data Interoperability
Requirement Analysis using
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
HP Quality Center 10.0 The Test Plan Module
The National Center for Biomedical Ontology
Chapter 5 Architectural Design.
Web APIs In computer programming, an application programming interface (API) is a set of subroutine definitions, protocols, and tools for building application.
Web services for online assessment in mathematics
Building Ontologies with Protégé-2000
W3C WoT Standardization
Modern Systems Analysis and Design Third Edition
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Toward an Ontology-Driven Architectural Framework for B2B E. Kajan, L
SDMX IT Tools SDMX Registry
Cultivating Semantics for Data in Agriculture and Nutrition
Presentation transcript:

Mark A. Musen, M.D., Ph.D. Stanford University musen@Stanford.EDU Helping Researchers to Make Their Data More Accessible and Reusable: The Center for Expanded Data Annotation and Retrieval Mark A. Musen, M.D., Ph.D. Stanford University musen@Stanford.EDU

The OMICs people—whose actual data are much simpler—have focused on the metadata directly

How can we make sense of microarray data? What was the substrate of the experiment? What array platform was used? What were the experimental conditions? DNA Microarray

BioSharing lists more than 100 guidelines like MIAME for reporting experimental data 85 Guidelines

The Good News: Minimal information checklists, such as MIAME, are being advanced from all sectors of the biomedical community The Bad News: Investigators view requests for even “minimal” information as burdensome; there is still the pervasive problem of “What’s in it for me?”

CEDAR Partners Stanford University School of Medicine University of Oxford e-Science Centre Yale University School of Medicine Northrup Grumman Corporation Stanford University Libraries

HIPC centers submit data directly to the NIAID ImmPort data repository using detailed, experiment- specific templates Example entity–relationship diagram for describing metadata for annotating multiplex bead array assays (e.g., Luminex)

A Metadata Ecosystem HIPC investigators perform experiments in human immunology HIPC Standards Working Group creates metadata templates to annotate experimental data in a uniform manner ImmPort stores HIPC data (and metadata) in its public repository CEDAR will ease Template creation and management The use of templates to author metadata for ImmPort Analysis of existing metadata to inform the authoring of new metadata

The CEDAR Approach to Metadata

Some key features of CEDAR All semantic components—template elements, templates, ontologies, and value sets—are managed as first-class entities User interfaces and drop-down menus are not hardcoded, but are generated on the fly from CEDAR’s semantic content All software components have well defined APIs, facilitating reuse of software by a variety of clients CEDAR generates all metadata in JSON-LD, a widely adopted Web standard that can be translated into other representations

The CEDAR Approach to Metadata

The CEDAR Approach to Metadata Highlight the metadata repository in the third panel.

The CEDAR Ecosystem Renders guidelines for experimental metadata in a form that computers can reason about them Allows developers to create metadata templates, and to link those templates to standard ontologies Allows investigators to fill out those templates and to upload data to repositories May offer the ability for investigators to create metadata that are More comprehensive More complete More standardized Should make the corresponding data sets more “FAIR”

http://metadatacenter.org