BD2K @ NIH – A Vision Through 2020 Philip E. Bourne, PhD, FACMI Associate Director for Data Science philip.bourne@nih.gov.

Slides:



Advertisements
Similar presentations
Archived File The file below has been archived for historical reference purposes only. The content and links are no longer maintained and may be outdated.
Advertisements

UC BRAID: Co-creating and evaluating performance in a regional laboratory for conducting translational science UC BRAID Executive Committee: Steven Dubinett.
Data the NIH What is Happening & What is Coming A Conversation Philip E. Bourne, PhD, FACMI Associate Director for Data Science National Institutes.
George A. Komatsoulis, Ph.D. National Center for Biotechnology Information National Library of Medicine National Institutes of Health U.S. Department of.
Western Regional Biomedical Collaboratory Creating a culture for collaboration.
THE JOINED UP WORLD OF E-RESEARCH Professor Neil McLean National Technical Standards Adviser to the Department of Education Science and Training (DEST)
Vivien Bonazzi Ph.D. Program Director: Computational Biology (NHGRI) Co Chair Software Methods & Systems (BD2K) Biomedical Big Data Initiative (BD2K)
Data! Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health.
Session Chair: Peter Doorn Director, Data Archiving and Networked Services (DANS), The Netherlands.
3 June 2010National Academies - BRDI1 Research Data and Information: Recent Developments and Continuing NIH Interests Jerry Sheehan Assistant Director.
Big Data to Knowledge (BD2K) Jennie Larkin, Ph.D. NIH RDA P5 March 10,2015.
Data Science at the NIH Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health.
NIH Big Data to Knowledge (BD2K) March 4, 2014 Peter Lyster National Institute of General Medical Sciences (NIGMS) NIH.
NIH Activities Related to Big Data Jerry Sheehan Assistant Director for Policy Development National Library of Medicine Board on Research Data and Information.
1 The Federal Shared Youth Vision Partnership A Federal Partnership between the Corporation for National community Service;
ADBC: Background, Broader Impacts and Opportunity Anne Maglia Program Director, Division of Biological Infrastructure National Science Foundation
1 The Federal Shared Youth Vision Partnership A Federal Partnership between the United States Departments of Education, Health.
1 Judy Hewitt, PhD On Detail to Office of Extramural Research National Institutes of Health May 18, 2015 Center for Scientific Review Advisory Council.
Midwest Big Data Hub Letters of Intent for NSF Edward Seidel Director, NCSA Founder Prof. of Physics, Prof of Astronomy On behalf of the Midwest.
1 Investing in America’s Future The National Science Foundation Strategic Plan for FY OPP Advisory Committee 10/26/06.
MPS Workshop 1: Gauging the Impact of Requirements for Public Access to Data November 19, 2015 Jennie Larkin, Ph.D. Office of the Associate Director for.
NIH BioCADDIE / Force11 Data Citation Pilot Kickoff Meeting Nine Zero Hotel, Boston MA, 3 February 2016 Introduction: Tim Clark, Maryann Martone and Joan.
NIH: DATA SCIENCE & BD2K Jennie Larkin, PhD Senior Advisor, Extramural Programs and Strategic Planning Office of the Associate Director for Data Science,
FROM PRINCIPLE TO PRACTICE: Implementing the Principles for Digital Development Perspectives and Recommendations from the Practitioner Community.
Data NIH Philip E. Bourne, PhD Associate Director for Data Science National Institutes of Health Big Data Symposium, Lincoln,
Center for Nursing Informatics Connie White Delaney, PhD, RN, FAAN, FACMI Dean and Professor Co-Director of the Center for Nursing Informatics September.
For Information only From distributor to Intelligent Funder.
Peer Review and Grant Mechanisms at NIH What is Changing? May 2016 Richard Nakamura, Ph.D., Director Center for Scientific Review.
Critical Program Movement: Integration of STD Prevention with Other Programs Kevin Fenton, MD, PhD, FFPH Director National Center for HIV/AIDS, Viral Hepatitis,
David M. Murray, Ph.D. Associate Director for Prevention Director, Office of Disease Prevention Multilevel Intervention Research Methodology September.
Examining Federal Expert Networking and the Economies of Scale: Moving the “HHS Profiles” Pilot Towards “Experts.gov” James King, Jessica N. Berrellez,
The NIH Data Commons: A Cloud-based Training Environment Philip E. Bourne, Ph.D. FACMI Associate Director for Data Science National Institutes of Health.
NASA Model-Based Systems Engineering Pathfinder 2016 Summary and Path Forward Karen J. Weiland, Ph.D. Jon Holladay, NASA Systems Engineering Technical.
A Funder's Perspective on Sustainability of Digital Data Repositories
Central Oregon Research Coalition
Data Management Program Introduction
To develop the scientific evidence base that will lessen the burden of cancer in the United States and around the world. NCI Mission Key message:
NSF INCLUDES “NSF should implement a bold new initiative, focused on broadening participation of underrepresented groups in STEM, similar in concept.
American Evaluation Association
Jennie Larkin, PhD Senior Advisor
Structure The organizational structure is designed to accelerate a new workforce development and student success strategy: integrating makerspaces into.
Auditing of Trustworthy Data Repositories – Speakers
GISELA & CHAIN Workshop Digital Cultural Heritage Network
NLM: Meeting Challenges & Seizing Opportunities in & with Big Data
Electronic Case Reporting Update
Responding to Times of Challenge ATMCH Meeting March 5, 2006 Jeffrey G
Policy & Advocacy Platform April 24, 2017
Using metrics to change the narrative
Betsy Wilson Environmental Update October 29, 2007
Long Term Impacts of Research Capacity Building in Global Health
Summit 2017 Breakout Group 2: Data Management (DM)
Care Act – Strategic Partner Engagement
NSF INCLUDES – DESIGN AND DEVELOPMENT LAUNCH PILOTS
Who’s on Today’s Call Patty O’Connor Jenn Goodwin Daniel Paré
Preprints and Other Interim Research Products NIH perspectives
Workshop on Cyberinfrastructure National Science Foundation
Background to The Conference
Maximizing the value and the impact of health research in Europe
National HIT Resource Center
Clinical and Translational Science Awards Program
A Funders Perspective Maria Uhle Co-Chair, Belmont Forum Directorates for Geosciences, US National Science Foundation.
Joint NHDS & CRKN Workshop on Documentary Heritage
Quality in Evaluation: the international development experience
Gpsc Resource team modalities
Rachel Sturke, PhD Deputy Director and Senior Scientist
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Privacy in Nationwide Health IT
Summary for the use of UN Habitat staff
MODULE 11: Creating a TSMO Program Plan
Health Impact Assessment in NSW
Presentation transcript:

BD2K @ NIH – A Vision Through 2020 Philip E. Bourne, PhD, FACMI Associate Director for Data Science philip.bourne@nih.gov

Yes these are uncertain times, but … First and foremost you should see this meeting as a celebration of the hard work of the past two years Yes these are uncertain times, but … There is a commitment to the BD2K program through 2020

BD2K cannot be viewed in isolation, but rather as part of a broader view of data science @ NIH … Particularly as funding is increasingly from the IC’s

A View Which Includes: A vibrant research program of: Fundamental developments in data science Application of those fundamental developments Flagship projects to which developments are applied: PMI, Brain, Moonshot, ECHO A sustainable data ecosystem Commons and the FAIR Principles adoption Cross-cutting activities Increased workforce training A changing governance model

A Strategic Response can be Modeled on Three Axes: Research Resources Outcomes

A Strategic Response Research Resources Outcomes Fundamental Machine learning Data mining Indexing Predictive modeling … Applied Sustainability, governance, economics of data Privacy and security Effective use of clouds … Research Resources Outcomes

A Strategic Response Research Resources Outcomes Fundamental Machine learning Data mining Indexing Predictive modeling … Applied Sustainability, governance, economics of data Privacy and security Effective use of clouds … Research Resources Standards Commons APIs Reference data sets Workflows Access & Authentication Workforce Outcomes

A Strategic Response Research Resources Outcomes Fundamental Machine learning Data mining Indexing Predictive modeling … Applied Sustainability, governance, economics of data Privacy and security Effective use of clouds … Research Evaluated pilots FAIR data Trained workforce Best practices Policies Effective use of clouds On-ramps for all IC’s Resources Standards Commons APIs Reference data sets Workflows Access & Authentication Workforce Outcomes

A View Which Includes: A vibrant research program of: Fundamental developments in data science Application of those fundamental developments Flagship projects to which developments are applied: PMI, Brain, Moonshot, ECHO A sustainable data ecosystem Commons and the FAIR Principles adoption Cross-cutting activities Increased workforce training A changing governance model

The Current Situation NIH Funded Data Dark Data Cost Total data from NIH-funded research currently estimated at 650 PB* 20 PB of that is in NCBI/NLM (3%) and it is expected to grow by 10 PB this year Dark Data Only 12% of data described in published papers is in recognized archives – 88% is dark data^ Cost 2007-2014: NIH spent ~$1.2Bn extramurally on maintaining data archives $1.25bn per year to capture all data. After a significant effort at reduction, intramurally data is spread across > 60 data centers; imagine the extramural situation. * In 2012 Library of Congress was 3 PB ^ http://www.ncbi.nlm.nih.gov/pubmed/26207759

The Commons - Status Commons and FAIR principles* adopted across NIH Development and public release of a prototype Data Discovery Index DataMed Feb. v 1.0 Nov v 1.5 Cloud credits being issued for work in the Commons FOA’s for Commons Framework being issued Commons pilots under way * https://www.ncbi.nlm.nih.gov/pubmed/26978244

Sustainability – Sample Other Activities Request for Information: Metrics to Assess Value of Biomedical Digital Repositories (NOT-OD-16-133) To be discussed at Sustainability Session, Wed 1pm RFA to support community based standards work was released in the fall for May 2017 award, session today 1pm Funding opportunity announcement: (BD2K) Enhancing the Efficiency and Effectiveness of Digital Curation for Biomedical Big Data (RFA-LM-17-001) Applications due Dec 15

Sustainability – Looking Forward International collaboration on business models for sustainable data repositories Sustainable Business Models for Data Repositories (OECD Global Science Forum) Future of Life Sciences and Biomedical Databases (International Human Science Frontiers Program) NIH long-term data repository support Federal interagency Workshop on Measuring the Impact of Data Repositories, 2017 Recommend mechanism(s), review criteria, implementation plan

Example Cross-cutting Activities International partnerships Count everything – Secure count query framework California centers regional meetings GA4GH – Beacon project

A View Which Includes: A vibrant research program of: Fundamental developments in data science Application of those fundamental developments Flagship projects to which developments are applied: PMI, Brain, Moonshot, ECHO A sustainable data ecosystem Commons and the FAIR Principles adoption Cross-cutting activities Increased workforce training A changing governance model

NLM Working Group Report Patti Brennan – New NLM director http://acd.od.nih.gov/reports/Report-NLM-06112015-ACD.pdf Recommendation – NLM should become the programmatic epicenter for data science at NIH … Patti Brennan – New NLM director

What We Hope to See in 2020 New innovations bought about by large and complex data Evidence of translation i.e. real application at the point of care Broad Commons adoption leading to Improved sharing, reuse and hence cost effectiveness and reproducibility A balance between what is spent on data vs what is gained from that data Policies that are supportive of the above

… for your hard work and to the NIH staff from the ADDS office and from across the IC’s who have toiled to make BD2K a success