Data Structures | Data Elements. Finding Disease Data: The Autism Example. Finding the Needle in the Haystack. February 26, 2013. Greg Farber, Ph.D., Director.

Presentation transcript:

1. Data Structures | Data Elements
Finding Disease Data: The Autism Example
Finding the Needle in the Haystack
February 26, 2013
Greg Farber, Ph.D.
Director, Office of Technology Development and Coordination
National Institute of Mental Health
National Institutes of Health

2. Talk Overview
Following the presentation by Mike Huerta, how are we making autism data:
A) Discoverable
B) Useful to others
C) Citable
D) Linked to the literature

3. NDAR Overview
Joint initiative supported by NIMH, NICHD, NINDS, and NIEHS
Federal data repository
Contains data from human subjects related to autism (and control subjects)
Data are available to the research community through a not too difficult application process
Summary data are available to everyone with a browser
Begun in late 2006; the first data were received in 2008
Data types include demographic data, clinical assessments, imaging data, and -omic data
Currently has data available from over 37,000 subjects
150 TB of imaging and -omic data are stored in the cloud

4. Data Dictionary
The NDAR data dictionary is one of the key building blocks for this repository. It provides a flexible and extensible framework for data definition by the research community.
Instruments, freely available to anyone
35,000+ unique data elements and growing
A research community platform for defining the complex language characterizing autism research:
  - Clinical
  - Genomics/Proteomics
  - Imaging modalities
Accommodates any data type and data structure
Extended and enhanced by the ASD research community
Curated by NDAR
Allows investigators to quickly perform quality control tests of their data without submitting data anywhere.
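
As an illustration of the local quality-control idea on this slide, here is a minimal sketch of checking one record against a set of element definitions before anything is submitted. It is not the NDAR validation tool: the `ElementDef` class, the `DEMO_STRUCTURE` definitions, and every element name below are hypothetical and chosen only for the example.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Optional


@dataclass(frozen=True)
class ElementDef:
    """One data element definition. All field names here are hypothetical."""
    name: str
    dtype: type
    required: bool = False
    minimum: Optional[float] = None
    maximum: Optional[float] = None
    allowed: Optional[frozenset] = None


# Hypothetical data structure: a handful of element definitions keyed by name.
DEMO_STRUCTURE: Dict[str, ElementDef] = {
    d.name: d
    for d in (
        ElementDef("subject_id", str, required=True),
        ElementDef("age_months", int, required=True, minimum=0, maximum=1200),
        ElementDef("sex", str, required=True, allowed=frozenset({"M", "F"})),
        ElementDef("score", float, minimum=0.0, maximum=100.0),
    )
}


def validate_record(record: Dict[str, Any], structure: Dict[str, ElementDef]) -> List[str]:
    """Return a list of problems; an empty list means the record passes."""
    problems: List[str] = []

    # Flag elements that the data structure does not define.
    for name in record:
        if name not in structure:
            problems.append(f"{name}: not defined in the data structure")

    # Check every defined element for presence, type, range, and allowed values.
    for name, spec in structure.items():
        value = record.get(name)
        if value is None:
            if spec.required:
                problems.append(f"{name}: required element is missing")
            continue
        if not isinstance(value, spec.dtype):
            problems.append(
                f"{name}: expected {spec.dtype.__name__}, got {type(value).__name__}"
            )
            continue
        if spec.minimum is not None and value < spec.minimum:
            problems.append(f"{name}: {value} is below the minimum {spec.minimum}")
        if spec.maximum is not None and value > spec.maximum:
            problems.append(f"{name}: {value} is above the maximum {spec.maximum}")
        if spec.allowed is not None and value not in spec.allowed:
            problems.append(f"{name}: {value!r} is not an allowed value")
    return problems


if __name__ == "__main__":
    record = {"subject_id": "S002", "age_months": -3, "sex": "X", "iq": 100}
    for problem in validate_record(record, DEMO_STRUCTURE):
        print(problem)
```

Because the check needs only the definitions and the record, it can run entirely on the investigator's machine, which is the property the slide highlights.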

5. Data Dictionary: Data Definition (200+)

6. NIH Data Dictionary Efforts
Many ICs at NIH have initiated data dictionary/controlled vocabulary efforts, and a few are even mandating use of those data dictionaries for all awardees.
NINDS (CDEs for neuroscience clinical research and form builder)
PhenX (standards and measures related to complex diseases)
NIH Toolbox (an integrated set of tools for measuring cognitive, emotional, motor, and sensory function)
A summary of CDE efforts (both NIH and other) is available at (

7. Global Unique Identifier – the Other Building Block
The NDAR GUID software allows any researcher to generate a unique identifier using some information from a birth certificate. If the same information is entered in different laboratories, the same GUID will be generated. This strategy allows NDAR to aggregate data on the same subject collected in multiple laboratories without holding any of the personally identifiable information about that subject. The GUID is now being used in other research communities and can be made available to you. We have created a video to help with informed consent issues.
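
The property described here (the same inputs give the same identifier in every laboratory, and the identifier does not contain the inputs) can be illustrated with a one-way hash over normalized fields. This sketch is an assumption-laden illustration and not the NDAR GUID algorithm: the choice of fields, the normalization rules, the `DEMO_` prefix, and the function names are all hypothetical.

```python
import hashlib
import unicodedata


def normalize(value: str) -> str:
    """Canonicalize a field so trivial typing differences do not change the ID."""
    decomposed = unicodedata.normalize("NFKD", value)
    # Drop combining marks (accents), remove all whitespace, and upper-case.
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return "".join(no_accents.split()).upper()


def pseudo_guid(first_name: str, last_name: str, birth_date: str,
                birth_city: str, sex: str) -> str:
    """Derive a deterministic identifier from hypothetical birth-certificate fields.

    Illustration only: this is NOT how NDAR GUIDs are actually produced.
    """
    fields = [normalize(f) for f in (first_name, last_name, birth_date, birth_city, sex)]
    # A separator that cannot appear in the normalized fields keeps
    # ("AB", "C") and ("A", "BC") from hashing to the same value.
    digest = hashlib.sha256("\x1f".join(fields).encode("utf-8")).hexdigest()
    return "DEMO_" + digest[:12].upper()


if __name__ == "__main__":
    lab_a = pseudo_guid("José", "  smith ", "2005-03-14", "Bethesda", "m")
    lab_b = pseudo_guid("Jose", "SMITH", "2005-03-14", "bethesda", "M")
    print(lab_a == lab_b)
```

Two laboratories that enter the same information obtain the same identifier without exchanging the underlying personal details. A plain hash of low-entropy fields can be guessed by brute force, so a real system would need additional protections that are outside this sketch.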

8. Data Federation
There are several other data repositories with data from human subjects related to autism. NDAR has a deep federation with those repositories to allow queries and data downloads from multiple repositories simultaneously.
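
A rough sketch of what a federated query could look like from the client side: the same request is sent to each repository and the results are grouped by a shared subject identifier. The repository names, the record layout, and the `federated_query` function are hypothetical stand-ins, not NDAR's actual interfaces.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List

Record = Dict[str, object]
# A "source" is any callable that returns records for a requested data type.
Source = Callable[[str], Iterable[Record]]


def federated_query(sources: Dict[str, Source], data_type: str) -> Dict[str, List[Record]]:
    """Ask every repository for one data type and group the results by GUID."""
    merged: Dict[str, List[Record]] = defaultdict(list)
    for repo_name, fetch in sources.items():
        for record in fetch(data_type):
            # Tag each record with its origin so provenance survives the merge.
            tagged = dict(record, repository=repo_name)
            merged[str(record["guid"])].append(tagged)
    return dict(merged)


def make_in_memory_source(records: List[Record]) -> Source:
    """Build a stand-in repository backed by a list (hypothetical test double)."""
    def fetch(data_type: str) -> Iterable[Record]:
        return [r for r in records if r["type"] == data_type]
    return fetch


if __name__ == "__main__":
    sources = {
        "repo_a": make_in_memory_source([
            {"guid": "DEMO_1", "type": "imaging", "file": "scan1"},
            {"guid": "DEMO_2", "type": "clinical", "file": "form7"},
        ]),
        "repo_b": make_in_memory_source([
            {"guid": "DEMO_1", "type": "imaging", "file": "scan9"},
        ]),
    }
    for guid, records in federated_query(sources, "imaging").items():
        print(guid, [r["repository"] for r in records])
```

Grouping on the shared identifier is what lets data on the same subject, held in different repositories, appear together in one result.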

12. An Example of Data Associated with a Particular Laboratory

13. An Example of Data Associated with a Particular Paper

14. Summary
NDAR, making autism data:
A) Discoverable: federation, useful queries, XML web services
B) Useful to others: data access, data QC, data analysis pipelines (soon)
C) Citable: data from labs, data from papers
D) Linked to the literature: data link in PubMed