Biomedical and healthCAre Data Discovery Index Ecosystem NIH Core Team Ron Margolis (Lead) Ian Fore (Science Officer) Dawei Lin & Alison Yao (Program Officers)

Presentation transcript:

biomedical and healthCAre Data Discovery Index Ecosystem

NIH Core Team
Ron Margolis (Lead)
Ian Fore (Science Officer)
Dawei Lin & Alison Yao (Program Officers)
Jennie Larkin (ADDS office liaison)

RDA BOF on Data Search, 3/1/16
NIH grant 1U24 AI to the University of California, San Diego

Aims – “PubMed” for Data
1. Help users find shared data
2. Build a prototype data discovery index
3. Evaluate requirements for next phase

FAIR: Findability, Accessibility, Interoperability, Reusability
White Paper – finalized 3/8/2015 (can be found at biocaddie.org)

The Concepts of DDI (from Lucila Ohno-Machado)
[Figure: organizing framework and portal for data. Labels: Data, Digital objects]
dashed lines: mapping of metadata, standards, links to aggregators
aggregators: various indices whose metadata are or can be mapped into Commons metadata

Open Community Participation
Working Groups
Pilot Projects/RFAs to the Community
Supplements
[Diagram labels: Identifiers, Metadata]

Data Indexing Pipeline (from Jeff Grethe)
1. Configuration file developed by curator
2. Extraction of metadata/data from data resource or dataset via ingestion module
   Cache information for further processing
3. Process metadata/data via sequential set of processing modules
   e.g. ID conversion, keyword extraction, data normalization
4. Mapping of metadata/data to metadata model(s)
5. Export to target endpoint(s) via export modules
6. Search via ElasticSearch APIs
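The six pipeline steps can be illustrated with a minimal Python sketch. It is not the bioCADDIE implementation: every name here (CONFIG, ingest, normalize_id, normalize_keywords, map_to_model, export, build_search_body) and the configuration layout are hypothetical, the "endpoint" is an in-memory list, and the last function only builds a standard Elasticsearch match-query body without contacting a server.

```python
from typing import Callable, Dict, List

Record = Dict[str, object]

# Step 1: curator-written configuration (layout is an assumption).
CONFIG = {
    "source": "PDB",
    "field_map": {"ID": "identifier", "title": "title", "keywords": "keywords"},
}

# Step 2: ingestion module; a real one would read from the data resource.
def ingest(raw_records: List[Record], cache: List[Record]) -> List[Record]:
    cache.extend(dict(r) for r in raw_records)  # cache copies for later steps
    return cache

# Step 3: processing modules, applied one after another.
def normalize_id(rec: Record) -> Record:
    out = dict(rec)
    out["ID"] = str(out["ID"]).strip().upper()
    return out

def normalize_keywords(rec: Record) -> Record:
    out = dict(rec)
    out["keywords"] = sorted({str(k).strip().lower() for k in out.get("keywords", [])})
    return out

PROCESSORS: List[Callable[[Record], Record]] = [normalize_id, normalize_keywords]

def process(rec: Record, processors: List[Callable[[Record], Record]]) -> Record:
    for step in processors:
        rec = step(rec)
    return rec

# Step 4: map source fields onto the target metadata model.
def map_to_model(rec: Record, field_map: Dict[str, str]) -> Record:
    return {target: rec[source] for source, target in field_map.items() if source in rec}

# Step 5: export module; here the endpoint is just a list.
def export(records: List[Record], endpoint: List[Record]) -> None:
    endpoint.extend(records)

# Step 6: body of a full-text match query in Elasticsearch's query DSL.
def build_search_body(text: str) -> Dict[str, object]:
    return {"query": {"match": {"title": text}}}

def run_pipeline(raw_records: List[Record], config: Dict[str, object]) -> List[Record]:
    cache: List[Record] = []
    index: List[Record] = []
    ingested = ingest(raw_records, cache)
    mapped = [map_to_model(process(r, PROCESSORS), config["field_map"]) for r in ingested]
    export(mapped, index)
    return index

indexed = run_pipeline(
    [{"ID": " 4iaq ", "title": "5-HT1B-BRIL chimeric protein",
      "keywords": ["GPCR Dock", "Signaling Protein"]}],
    CONFIG,
)
```

Keeping each processing module as a plain function of one record makes the "sequential set of processing modules" in step 3 a simple ordered list that a curator could reorder or extend per data resource.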

datamed.biocaddie.org (v0.5)

Acknowledgements
Lucila Ohno-Machado (PI)
The bioCADDIE team
NIH Staff
The bioCADDIE ESP
Working Group experts
Collaborators

A DDI Example for PDB

{
  "dataItem": {
    "ID": "4IAQ",
    "title": "Crystal structure of the chimeric protein of 5-HT1B-BRIL in complex with dihydroergotamine (PSI Community Target)",
    "description": "5-hydroxytryptamine receptor 1B",
    "keywords": [ "Signaling Protein", "GPCR Dock" ],
    "dataTypes": [ "dataItem", "citation", "materialEntity", "organism", "identifiers" ]
  },
  "materialEntity": [
    {
      "name": "TYROSINE",
      "role": "chemical component",
      "formula": "C9 H11 N O3",
      "weight": " ",
      "type": "L-peptide linking"
    },
    …….

[Slide callouts: Identifier, Metadata]
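A short Python sketch shows how a client might read a record shaped like the fragment above. The slide truncates the record, so the JSON below is closed off by hand and holds only the fields that are visible (the blank "weight" value is left out). The summarize helper is hypothetical, and treating "dataTypes" as a list of the sections a record may carry is an assumption, not something the slide states.

```python
import json

# Hand-closed copy of the visible part of the slide's record.
RECORD_JSON = """
{
  "dataItem": {
    "ID": "4IAQ",
    "title": "Crystal structure of the chimeric protein of 5-HT1B-BRIL in complex with dihydroergotamine (PSI Community Target)",
    "description": "5-hydroxytryptamine receptor 1B",
    "keywords": ["Signaling Protein", "GPCR Dock"],
    "dataTypes": ["dataItem", "citation", "materialEntity", "organism", "identifiers"]
  },
  "materialEntity": [
    {"name": "TYROSINE", "role": "chemical component",
     "formula": "C9 H11 N O3", "type": "L-peptide linking"}
  ]
}
"""

def summarize(record: dict) -> dict:
    """Pull out the identifier, keywords, and which declared sections are present."""
    item = record["dataItem"]
    # Assumption: each dataTypes entry names a top-level section of the record.
    present = [t for t in item["dataTypes"] if t in record]
    return {
        "id": item["ID"],
        "keywords": item["keywords"],
        "sections_present": present,
        "materials": [m["name"] for m in record.get("materialEntity", [])],
    }

summary = summarize(json.loads(RECORD_JSON))
```

Here summary["sections_present"] lists only the sections that appear in this hand-closed fragment; a complete record would presumably also carry the citation, organism, and identifiers sections.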