Data! Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health.

Presentation transcript:

Data! Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health

Some Context: NIH Data Science History
Timeline: 6/12, 2/14, 3/14
Findings:
• Sharing data & software through catalogs
• Support methods and applications development
• Need more training
• Need campus-wide IT strategy
• Hire CSIO
• Continued support throughout the lifecycle

My Bias
• Still a scientist
• A funder who still thinks like a PI
• Not yet attuned to the federal system
• Big supporter of OA via PLOS and others

Data – A Few Observations …
• We talk about the promise of big data, but we don’t even know the value of little data (aka could “Big Data” be the new “AI”?)
• Good data is expensive in terms of time and money
• Looking at data retroactively is really expensive
• Good data begets trust; trust begets community; community is God
• The way we currently support scientific data is not sustainable
• There is currently no workable business model for scientific data

Data – A Few NIH Observations …
1. We have little idea how much we spend on data – estimated over $1bn per year
2. We have even less idea how much we should be spending
• Point 2 is part of a culture clash between the more observational history of biomedicine and the new analytical approach to discovery

ADDS Mission Statement To foster an ecosystem that enables biomedical research to be conducted as a digital enterprise that enhances health, lengthens life and reduces illness and disability

What Problems Are We Trying to Solve? Possible Solutions
• Sustainability – 50% business model
• Efficiency – sharing best practices in longitudinal clinical studies
• Collaboration – identification of collaborators at the point of data collection not publication
• Reproducibility – data accessible with publication
• Integration – phenotype homogenization
• Accessibility – clinical trials registration
• Quality – sharing CDEs across institutes
• Training – keeping trainees in the ecosystem

The Data Ecosystem Community Policy Infrastructure Sustainable business model Collaboration Training

Raw Materials to Seed the Ecosystem
• NIH mandate & support
• ADDS team of 8 people
• Intramural participation of over 100 team members across ICs
• Funding through BD2K:
  – ~$30M in FY14
  – ~$80M in FY15
  – ....

Example Communities
– NIH: 20/27 ICs
– Agencies: NSF, DOE, DARPA, NIST
– Government: OSTP, HHS, HDI, ONC, CDC, FDA
– Private sector: PhRMA, Google, Amazon
– Organizations: PCORI, RDA, ELIXIR, CCC, CATS, FASEB, ISCB, Biophysical Society, Sloan Foundation, Moore Foundation

Example Policies
– Clinical data harmonization
– Data citation
– Machine readable data sharing plans on all grants
– New review models, audiences etc.
Open review; Micro funding; Standing data committees to explore best practices; Crowd sourcing

Example Infrastructure: The Commons
[Diagram labels: Data; The Long Tail; Core Facilities/HS Centers; Clinical/Patient; The Why: Data Sharing Plans; The Commons; Government; The How: Data Discovery Index; Sustainable Storage; Quality; Scientific Discovery; Usability; Security/Privacy; The End Game: Knowledge; NIH Awardees; Private Sector; Metrics/Standards; Rest of Academia; Software Standards Index; BD2K Centers; Cloud, Research Objects, Business Models]

What Does the Commons Enable?
• Dropbox-like storage
• The opportunity to apply quality metrics
• Bring compute to the data
• A place to collaborate
• A place to discover

[Adapted from George Komatsoulis] One Possible Commons Business Model HPC, Institution …

Pilots Around A Virtuous Cycle Expect a Funding Call

Training & Diversity
• Training & Diversity Goals:
  – Develop a sufficient cadre of diverse researchers skilled in the science of Big Data
  – Elevate general competencies in data usage and analysis across the biomedical research workforce
  – Combat the Google bus
• How:
  – Traditional training grants
  – Work with ICs on a needs assessment
  – Standards for course descriptions with EU
  – Work with institutions on raising awareness
  – Partner with minority institutions
  – Virtual/physical training center(s)?

What Can Open Access Publishers Do?
• Work with NIH on supporting data citation
• Experiment with the idea of micropublication
• Other?

NIH … Turning Discovery Into Health