The NSF Cyberinfrastructure for the 21st Century Program (CIF21). Rob Pennington, Program Director, Office of Cyberinfrastructure, National Science Foundation.

Presentation transcript:

The NSF Cyberinfrastructure for the 21st Century Program (CIF21)
Rob Pennington, Program Director, Office of Cyberinfrastructure, National Science Foundation

The Shift Towards a “Sea of Data”: Implications
- All science is becoming data-dominated
  - Experiment, computation, theory
  - Fourth paradigm
- Classes of data
  - Collections, observations, experiments, simulations
  - Software
  - Publications
- Totally new methodologies
  - Algorithms, mathematics, culture
- Data become the medium for
  - Multidisciplinarity, communication, publication … science

Fundamental questions become focused around data:
- How to remove boundaries?
- How to incentivize sharing?
- How do we attribute credit for this new publication form?
- How are data peer reviewed?
- What is a publication in the modern data-rich world?

Scientific Data Challenges
[Figure: data sources placed on a scale of bytes per day, from gigabytes through terabytes and petabytes to exabytes. Labeled sources: Genomics; LHC; TeraGrid, Blue Waters; Square Kilometer Array; Climate, Environment; LSST; "Many smaller datasets…"; DataNet. Labeled dimensions: Volume, Useful Lifetime, Distribution, Data Access.]

CIF21 and Transforming Research
[Figure: CIF21 and the "Sea of Data" surrounded by Software; Analytic Tools; Compute, Modeling; Communities; Expertise, research; Networks.]
- Outcomes: science, innovation, discovery, economic competitiveness
- Grand Challenges: EarthCube, Understanding the Phenome, Clean Energy, Climate prediction, Social networking, Complex networks, Health records, cybersecurity, Matter-by-design, disaster recovery, etc.
- Multi-disciplinary and multi-scale integration

NSF CIF21 Major Areas
[Figure: Discovery, Collaboration, and Education at the center, surrounded by the areas below. Additional labels: Advanced Computational Infrastructure; Data Infrastructure Program.]
- Organizations
  - Universities, schools
  - Government labs, agencies
  - Research and Medical Centers
  - Libraries, Museums
  - Virtual Organizations
  - Communities
- Expertise
  - Research and Scholarship
  - Education
  - Learning and Workforce Development
  - Interoperability and operations
  - Cyberscience
- Networking
  - Campus, national, international networks
  - Research and experimental networks
  - End-to-end throughput
  - Cybersecurity
- Computational Resources
  - Supercomputers
  - Clouds, Grids, Clusters
  - Visualization
  - Compute services
  - Data Centers
- Data
  - Databases, Data repositories
  - Collections and Libraries
  - Data Access: storage, navigation, management, mining tools, curation, privacy
- Scientific Instruments
  - Large Facilities, MREFCs, telescopes
  - Colliders, shake tables
  - Sensor Arrays: ocean, environment, weather, buildings, climate, etc.
- Software
  - Applications, middleware
  - Software development and support
  - Cybersecurity: access, authorization, authentication

Broad Principles to Lead CIF21
- Builds national infrastructure for S&E
- Leverages common methods, approaches, and applications, with a focus on interoperability
- Catalyzes other CI investments across NSF
- Provides focus and is a vehicle for coordinating efforts and programs
- Based upon a shared governance model involving all parts of NSF
- Managed as a coherent program by OCI
- Spiral development methodology

Evolution of CIF21 and NSF Data Programs
[Figure: ACCI Task Force, NSB, DataNet Awards, and Community Input feed into NSF CIF21 Data Programs, with on-going input from Science & Engineering Research + Cyberinfrastructure.]

Data Related Context
- National Science and Technology Council (NSTC)
  - …comments-access-federally-funded-scientific-research-results
- Networking and Information Technology Research and Development (NITRD)
- National Science Board Data Policies Task Force
- Advisory Committee for Cyberinfrastructure (ACCI)

NSTC RFIs for Public Comment: Context
- Two Requests for Information (RFIs), Nov 2011
  - Public Access to Digital Data Resulting from Federally Funded Scientific Research
    - Preservation, Discovery and Access
    - Standards for Interoperability, Re-Use and Re-Purposing
  - RFI for Scholarly Publications
  - …information-public-access-digital-data-and-scientific-publications
- Comment period closed on 12 Jan 2012
  - Digital Data: 118 responses
  - Scholarly Publications: 377 responses
  - Individual and institutional responses

NSB Data Policy Task Force: Context
- Dec 2011: NSB Recommendations
  - #1: Provide leadership … in the development and implementation of digital research data policies …
  - #2: … require grantees to make both the data and the methods and techniques used in the creation and analysis of the data accessible … Data should be shared using persistent electronic identifiers …
  - #3: Continue to expand the support of computational and data-enabled science and engineering …
  - #4: Convene a panel … to explore and develop a range of viable long-term business models …
  - #5: Further the expansion of sustainable data management, including preservation and curation of pre-existing and newly generated long-lived data …

NSF Advisory Committee for Cyberinfrastructure (ACCI) Task Force: Context
- Task forces: Grand Challenges, HPC (High Performance Computing), Data/Viz, Software, Campus Bridging, Cyberlearning
- More than 25 workshops and Birds of a Feather sessions, with more than 1300 people involved
- Final reports: …orces/

ACCI Data Task Force Recommendations
- Recognize data infrastructure and services as essential research assets fundamental to today’s science and as long-term investments in national prosperity
- Create new citation models in which data and software tool providers are credited with their data contributions
- Develop and publish realistic cost models to underpin institutional/national business plans for research repositories/data services
- Identify and share best practices for the critical areas of data management

CIF21 and Data Enabled Science
- Provide critical tools and services for data mining, integration, analysis, modeling and visualization.
- Overcome barriers to scaling, synthesis, and interoperability to promote effective use of large-scale, shared data resources.
- Strategic investments that concentrate tools, resources and expertise in support of compelling grand challenge science questions.

Data Infrastructure: A Multi-tiered and Multi-Disciplinary Landscape
[Figure: tiers labeled Data-enabled Science, Data Content, and Data Storage, spanning Observational Communities; Modeling and Simulation Communities; and Population, Climate, Environment Communities. Additional label: DataNet supported.]

CIF21: Data-Enabled Science
- Data-intensive Science Program (knowledge)
  - Intensive disciplinary efforts, multi-disciplinary discovery and innovation
- Data Analysis and Tools Program (information)
  - Data mining, manipulation, modeling, visualization, decision-making systems
- Data Services Program (data)
  - Provide reliable digital preservation, access, integration, and analysis capabilities for science and/or engineering data over a decades-long timeline

"Dumped On by Data: Scientists Say a Deluge Is Drowning Research"

Data Curation
- Sustainable, community-based networks for management of critical scientific data resources in a life-cycle context.
- Overcome challenges of culture change, policy development and implementation, sustainable operations, quality and usability control.
- Strategic awards that address heterogeneity in formats, complexity, semantics of data collections that are valued by science communities of significant breadth.
- Operate as a network of data services that promote interoperability, multidisciplinarity, and scalability.

Data Storage
- National storage infrastructure for scientific data
- Accommodate scale and heterogeneity through robust, open, and broadly accepted standards
- Business model implemented with governmental, academic, non-profit, and commercial stakeholders
- Make strategic investments that:
  - Leverage existing resources in XSEDE, commercial clouds, federal data centers
  - Meet growing capacity needs at optimum cost
  - Provide coordinating and integrative functions for integrity, access control, availability, persistence
  - Catalyze a national data infrastructure

Cross Cutting Challenges
- Balancing research into next-generation infrastructure with operation and maintenance of current capacity
- Sustainability through technical design, development of business models, and integration with the research cycle
- Integration
  - Vertical: linking low-level bit storage infrastructure to data collections, and to applications
  - Horizontal: achieving connectivity and interoperability between activities that vary in scale, disciplinarity, and funding source

Summary
- CIF21 is focused on effective ways to approach and respond to the challenges
- Critical concepts and goals
- Realistic and innovative
- Spiral process with strong, on-going feedback
- Structure for longevity
- Scalable, open, inclusive governance
- Long-term business models
- International collaborations and programs