1 Cyberinfrastructure for the 21 st Century (CIF21) NSF Data Strategy and EarthCube 9 th e-Infrastructure Concertation Meeting Sept 23, 2011 Rob Pennington.

Slides:



Advertisements
Similar presentations
Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University.
Advertisements

Supporting Research on Campus - Using Cyberinfrastructure (CI) Public research use of ICT has rapidly increased in the past decade, requiring high performance.
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
ACCI TASK FORCES Update CASC September 22, Task Force Introduction Timeline months or less from June 2009 Led by NSF Advisory Committee on.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation,
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
The NSF Cyberinfrastructure for the 21 st Century Program CIF21 Rob Pennington Program Director Office of Cyberinfrastructure National Science Foundation.
The "Earth Cube” Towards a National Data Infrastructure for Earth System Science Presentation at WebEx Meeting July 11, 2011.
The Vision, Process, and Requirements for Creating EarthCube Presentation at Second EarthCube WebEx Aug 22, 2011.
Oceans Observations Environmental Obs Satellites Earth System Modeling Cyberinfrastructure in an Era of Observation and Simulation EarthScopeWater Eva.
NSF and Environmental Cyberinfrastructure Margaret Leinen Environmental Cyberinfrastructure Workshop, NCAR 2002.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
Transforming Data-Driven Publications and Decision Support Joan L. Aron, Ph.D. Consultant Federal Big Data Working Group COM.BigData 2014.
Advances in Cyberinfrastructure with a Focus on Data: a U.S. National Science Foundation Overview Alliance for Permanent Access to Records of Science in.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation,
Unidata Policy Committee Meeting Bernard M. Grant, Assistant Program Coordinator for the Atmospheric and Geospace Sciences Division May 2012 NSF.
Designing the Microbial Research Commons: An International Symposium Overview National Academy of Sciences Washington, DC October 8-9, 2009 Cathy H. Wu.
Cyberinfrastructure for the 21st Century (CIF21): Data MRI and STCI
EarthCube Vision An alternative approach to respond to daunting science and CI challenges An alternative approach to respond to daunting science and CI.
Imagine a World…. With easy, unlimited access to scientific data from any field Where you can easily plot data of interest and display it any way you want.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
Sharing Research Data Globally Alan Blatecky National Science Foundation Board on Research Data and Information.
Transformation of Research and Education in the 21 st Century Edward Seidel Director, Office of Cyberinfrastructure National Science Foundation
National Science Foundation Experimental Program to Stimulate Competitive Research (NSF EPSCoR) May 24, 2012 National Academies 1.
1 Investing in America’s Future The National Science Foundation Strategic Plan for FY Advisory Committee for Cyberinfrastructure 10/31/06 Craig.
DOE 2000, March 8, 1999 The IT 2 Initiative and NSF Stephen Elbert program director NSF/CISE/ACIR/PACI.
Perspectives on Cyberinfrastructure Daniel E. Atkins Professor, University of Michigan School of Information & Dept. of EECS October 2002.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
HPC Centres and Strategies for Advancing Computational Science in Academic Institutions Organisers: Dan Katz – University of Chicago Gabrielle Allen –
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
1 Investing in America’s Future The National Science Foundation Strategic Plan for FY OPP Advisory Committee 10/26/06.
ARL Workshop on New Collaborative Relationships: The Role of Academic Libraries in the Digital Data Universe September 26-27, 2006 ARL Prue.
Implementing a National Data Infrastructure: Opportunities for the BIO Community Peter McCartney Program Director Division of Biological Infrastructure.
Chaitan Baru Senior Advisor for Data Science CISE Directorate National Science Foundation NIEHS Webinar October 27, 2015 Image Credit: Exploratorium. Integrating.
A 10 YEAR OUTLOOK A REPORT BY THE NSF ADVISORY COMMITTEE FOR ENVIRONMENTAL RESEARCH & EDUCATION SPONSORED BY THE NATIONAL SCIENCE FOUNDATION SEPTEMBER.
CYBERINFRASTRUCTURE FRAMEWORK FOR 21st CENTURY SCIENCE AND ENGINEERING (CIF21) Goal Develop and deploy comprehensive, integrated, sustainable, and secure.
Digital Data Collections ARL, CNI, CLIR, and DLF Forum October 28, 2005 Washington DC Chris Greer Program Director National Science Foundation.
The LTER Network Planning Grant Barbara Benson NTL-LTER.
1 Why is Digital Curation Important for Workforce and Economic Development? Alan Blatecky Office of Cyberinfrastructure Symposium on Digital Curation in.
Preliminary Findings Baseline Assessment of Scientists’ Data Sharing Practices Carol Tenopir, University of Tennessee
Forging the eXtremeDigital (XD) Program Barry I. Schneider Program Director, Office of CyberInfrastructure January 20, 2011.
EarthCube Sustaining the Geosciences for 21 st Century Challenges Credits: from top to bottom: NOAA Okeanos Explorer Program (CC BY-SA 2.0), NASA/Kathryn.
PAA on Scientific Data and Information Roberta Balstad Chair, PAA Panel.
1 NCSA 2015 Strategic Planning Process April 21, 2010 José L. Muñoz (Acting) Director, OCI (thanks to Blatecky, Parashar and Pennington) 1.
Priorities for International Development of e-Infrastructure and Data Management in Global Change Research Presentation by Robert Gurney, University of.
Joslynn Lee – Data Science Educator
Summit 2017 Breakout Group 2: Data Management (DM)
Capacity Building Enhance the coordination of efforts to strengthen individual, institutional and infrastructure capacities, particularly in developing.
Campus Cyberinfrastructure
NSF INCLUDES – DESIGN AND DEVELOPMENT LAUNCH PILOTS
Cognitus: A Science Case for HPC in the Nordic Region
VCU Strategic Plan 2025: Fall Town Halls
Workshop on Cyberinfrastructure National Science Foundation
State of XSEDE: XSEDE14 John Towns PI and Project Director, XSEDE
NSF Advisory Committee for Environmental Research and Education (AC ERE) Purpose: To provide advice and recommendations concerning support of the NSF’s.
Overview of working draft v. 29 January 2018
The Curry School of Education October 3, 2018
A Funders Perspective Maria Uhle Co-Chair, Belmont Forum Directorates for Geosciences, US National Science Foundation.
Three Uses for a Technology Roadmap
Briefing to ARL Membership
White House Office of Science and Technology Policy
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Bird of Feather Session
Wrap-Up – NSF Site Visit 8 February 2010
Brokering as a Core Element of EarthCube’s Cyberinfrastructure
MODULE 11: Creating a TSMO Program Plan
Urban Infrastructure: Analysis and Modeling for Their Optimal Management and Operation NSF Workshop NSF Award #: Nada Marie Anid, Ph.D. Professor.
BCoN Data Integration Workshop, University of Kansas, Feb 13-14, 2018
Presentation transcript:

1 Cyberinfrastructure for the 21 st Century (CIF21) NSF Data Strategy and EarthCube 9 th e-Infrastructure Concertation Meeting Sept 23, 2011 Rob Pennington Office of Cyberinfrastructure (OCI) National Science Foundation 1

Framing the Challenge: Science and Society Transformed by Data  Modern science  Data- and compute- intensive  Integrative, multiscale  Multi-disciplinary Collaborations for Complexity  Individuals, groups, teams, communities  Sea of Data  Age of Observation  Distributed, central repositories, sensor- driven, diverse, etc 2

Advisory Committee for Cyberinfrastructure Task Force Reports Grand Challenges Campus Bridging Data and Viz Cyberlearning HPC HIGH P ERFORMANCE COMPUTING Software  More than 25 workshops and Birds of a Feather sessions and more than 1300 people involved  Final recommendations presented to the NSF Advisory Committee on Cyberinfrastructure (ACCI) Dec 2010  Final reports on-line at: s/ 3

Broad Principles to Lead CIF21  Builds national infrastructure for S&E  Leverages common methods, approaches, and applications – focus on interoperability  Catalyzes other CI investments across NSF  Provides focus and is a vehicle for coordinating efforts and programs  Acts as a “research multiplier” across NSF  Based upon a shared governance model involving every directorate and office  Managed as a coherent program 4

New Computational Infrastructure  Computational and Data-enabled resources  HPC, Clouds, Clusters, Data Centers  Computation capabilities  Modeling, simulation, visualization  Long-term software for science and engineering  Sustained software development and support  Discipline-specific activities  Services, tools, compute, simulation environments that serve specific research efforts and communities 5

Scientific Software Elements: Small groups, individuals Scientific Software Integration: Research Communities Scientific Software Innovation Institutes: Large Multidisciplinary Groups Multi-year Creating Scalable Software Development Environments  Create a software ecosystem that scales from individual or small groups of software innovators to large hubs of software excellence Focus on innovation Focus on sustainability 6

Evolution of Cyberinfrastructure for the 21 st Century (CIF21) and Data 7 ACCI Data Task Force National Science Board (NSB) DataNet Program Community Input NSF CIF21 Data Programs On-going input Science & Engineering Research + Cyberinfrastructure

Data Task Force Recommendations Infrastructure: Recognize data infrastructure and services (including visualization) as essential long term research assets fundamental to today’s science Economic sustainability: Develop realistic cost models to underpin institutional/national business plans for research repositories/data services Culture Change: Emphasize expectations for data sharing; support the establishment of new citation models in which data and software tool providers and developers are recognized and credited with their contributions Data Management Guidelines : Identify and share best-practices for the critical areas of data management Ethics and IP : Train researchers in privacy-preserving data access 8

Scientific Data Challenges 9 Bytes per day Genomics LHC XSEDE, HPC Square Kilometer Array Genomics LHC Climate, Environment LSST Exa Bytes Peta Bytes Tera Bytes Giga Bytes Climate, Environment Volume Useful Lifetime Distribution Data Access Many smaller datasets… DataNet

DataNet: A Multi-tiered and Multi- Disciplinary Landscape 10 Genomics Communities Modeling and Simulation Communities Population, Climate, Environment Communities Data Curation Data Storage Data-enabled Science DataNet supported

Data-Enabled Science  Data Services Program (data)  Provide reliable digital preservation, access, integration, and analysis capabilities for science and/or engineering data over a decades-long timeline  Data Analysis and Tools Program (information)  Data mining, manipulation, modeling, visualization, decision-making systems  Data-intensive Science Program (knowledge)  Intensive disciplinary efforts  Simulation, modeling  Multi-disciplinary S&E 11

Cross Cutting Challenges  Balancing research into next generations of infrastructure with operation & maintenance of current capacity.  Stimulate innovation and manage transitions  Sustainable, long term programs  Technical design, development of sustainability models, and integration with the research cycle.  Integration  Vertical – Linking low-level bit storage infrastructure to data collections, and finally to applications  Horizontal– Achieving connectivity and interoperability between activities that vary in scale, disciplinarity, and funding source. 12

 Life cycle perspective covering the use of the data  Research, development, implementation, operations, sustainability, close-out  Incorporate project management methodologies  Work breakdown structure, risk management, change control, schedule, milestones, deliverables  Standardized process:  Evaluate science merit, conceptual design  Develop initial project plans, design and reporting metrics  Finalize baseline, prototype, evaluate  Implementation & operations  Operational evaluations and continuance DataNet Program Management 13

WHAT IS EARTHCUBE?

A Call to Action Transitions and Tipping Points in Complex Environmental Systems, NSF AC for Environmental Research and Education, 2009 Earth Science and Applications from Space: National Imperatives for the Next Decade and Beyond, 2007 High-Performance Computing Requirements for the Computational Solid Earth Sciences, 2005

Goal of EarthCube To transform the conduct of research in geosciences by supporting community-based cyberinfrastructure to integrate data and information for knowledge management across the Geosciences.

What Needs To Be Done?  Integrate data, tools and communities through cyberinfrastructure  Establish a governance mechanism that is inclusive and adopted by the community  Utilize current and emerging technologies to create transparent infrastructure for the geosciences community

Modes of Support Convergence to a Unifying Architecture

EarthCube: Assumptions  The geosciences community is ready to take on the EarthCube challenge  Community will start self-organizing prior to EarthCube activities, such as the Nov 1-4 Charrette  Current and emerging technology will help achieve the convergence envisioned for EarthCube  A broad range of expertise and resources must be engaged to shape EarthCube

Framework Should  Create infrastructure of integrated and interactive services  transcend fields and accelerate discovery of a complex, multi-scale Earth System  Create an interoperable digital access infrastructure  Provide a network that is open, extensible and sustainable  Include Observations, Simulations, Collaborations, and Sharing of information  Facilitate data and metadata transfer from the field into data systems and applications  Integrate research and education  Build a knowledgeable and broadly engaged workforce

Jun 2011 Jul-Sept 2011 Nov Nov/11-Apr/12 May 2012 DCL Released Two WebEx events Charrette Proposed Framework Approaches Developed through EAGERs Sandpit/ IdeasLab to determine 18 mo. prototype award(s)

22 Questions?