Data NIH
  Philip E. Bourne, PhD
  Associate Director for Data Science, National Institutes of Health
  Big Data Symposium, Lincoln, Nebraska
  November 7, 2014
Where We Have Come From
Where We Are Today
Where We Are Going?
  Evidence:
  –Google car
  –3D printers
  –Waze
  –Robotics
  From: The Second Machine Age: Work, Progress, and Prosperity in a Time of Brilliant Technologies by Erik Brynjolfsson & Andrew McAfee
Challenges (Source: Michael Bell)
Mission Statement
  To foster an ecosystem that enables biomedical* research to be conducted as a digital enterprise that enhances health, lengthens life and reduces illness and disability
  * Includes biological, biomedical, behavioral, social, environmental, and clinical studies that relate to understanding health and disease.
Elements of The Ecosystem Community Policy Infrastructure Sustainability Collaboration Training
Elements of The Ecosystem Community Policy Infrastructure Sustainability Collaboration Training Virtuous Research Cycle
The Virtuous Cycle September 3, 2014 Workshop
Policies – Now & Forthcoming
  Data Sharing
  –Genomic data sharing announced
  –Data sharing plans on all research awards
  –Data sharing plan enforcement
    Machine readable plan
    Repository requirements to include grant numbers
Policies - Forthcoming
  Data Citation
  –Goal: legitimize data as a form of scholarship
  –Process:
    Machine readable standard for data citation (done)
    Endorsement of data citation for inclusion in NIH biosketch, grants, reports, etc.
    Example formats for human readable data citations
    Slowly work into NLM/NCBI workflow
Infrastructure - The Commons
What Is The Commons
  A concept that will be stood up in conjunction with the $32m associated with BD2K
  A collection of physical compute and storage resources – public/private/hybrid clouds, NPC, institutional etc.
  Agreements – Commons compliance
  An agile environment that will be evaluated as small steps are made
Community – BD2K Awards
Community: BD2K Awards Governance
  November 3 Kick-off PI Meeting
  –Emphasize the need to build the ecosystem
  –Emphasize sharing from day 1
  –Incentivize work in the Commons
Community Short Term Interactions
  NSF Workshops and Dear Colleague letter
  Workshop with NOAA on public-private partnerships
  ELIXIR Workshops - Standards
  Workshop Inspiring the Game Developer Community to Engage in and Enhance Biomedical Research, Dec 2014
  Sustainability of Data Resources 2015
Community: Training
  Data Science Training Goals
  1) Build a digital framework for data science training: NIH Data Science Workforce Development Center
  2) Develop short-term training opportunities: courses, educational resources, etc.
  3) Develop the discipline of biomedical data science and support cross-training
  Goals expanded from recommendations in the June 2012 DIWG and Aug 2013 Training workshop reports.
Grants - Forthcoming Grant Mechanisms
  –Support for communities, e.g. GA4GH
  –Matchmaking biomedical researchers &
    –Computer scientists
    –Statisticians
    –Data scientists from other fields, e.g. astronomers, earth scientists
    –Gamers
Heads Up on What is Coming in FY15 Re Funding
  Funding calls for using the Commons
  Funding call for standards framework development
  Funding calls for software development
  Funding calls to stimulate interactions between communities (diversity, rotations, library)
  Funding calls for high risk, high return projects
  Your ideas here…
NIH … Turning Discovery Into Health