Data- and Compute-Driven Transformation of Modern Science Update on the NSF Cyberinfrastructure Vision People, Sustainability, Innovation, Integration.

Slides:



Advertisements
Similar presentations
21 st Century Science and Education for Global Economic Competition William Y.B. Chang Director, NSF Beijing Office NATIONAL SCIENCE FOUNDATION.
Advertisements

Supporting Research on Campus - Using Cyberinfrastructure (CI) Public research use of ICT has rapidly increased in the past decade, requiring high performance.
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
U.S. Department of Energy’s Office of Science Basic Energy Sciences Advisory Committee Dr. Daniel A. Hitchcock October 21, 2003
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
ACCI TASK FORCES Update CASC September 22, Task Force Introduction Timeline months or less from June 2009 Led by NSF Advisory Committee on.
Funding Opportunities at NSF Jane Silverthorne International Arabidopsis Consortium Workshop January 15, 2011.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation,
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
The "Earth Cube” Towards a National Data Infrastructure for Earth System Science Presentation at WebEx Meeting July 11, 2011.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation, Integration.
NSF EPSCoR and the Role of Cyberinfrastructure Dr. Jennifer M. Schopf National Science Foundation EPSCoR Office October 6, 2010.
Stimulating and Supporting (Sustained) Collaborations NSF Workshop on Effective Engagement and Collaboration of US CISE - China Researchers Peter Arzberger.
Social and behavioral scientists building cyberinfrastructure David W. Lightfoot Assistant Director, National Science Foundation Social, Behavior & Economic.
NSF Research Day University of Vermont - June 6, 2008 Directorate for Geosciences Margaret Cavanaugh Deputy Assistant Director.
Office of Science Office of Biological and Environmental Research J Michael Kuperberg, Ph.D. Dan Stover, Ph.D. Terrestrial Ecosystem Science AmeriFlux.
Cyberinfrastructure: Initiatives at the US National Science Foundation Stephen Nash Program Director, Operations Research U.S. National Science Foundation.
1 CASC September Meeting Planning for CIF21 New Computational Infrastructure: CDS&E Software HPC Gabrielle Allen, Eduardo Misawa, Manish Parashar Irene.
Data- and Compute-Driven Transformation of Modern Science Edward Seidel Assistant Director, Mathematical and Physical Sciences, NSF (Director, Office of.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
Edward Seidel Acting Assistant Director Directorate for Mathematical & Physical Sciences Mathematical and Physical Sciences Advisory Committee 1 April.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation,
Open Science Grid For CI-Days Internet2: Fall Member Meeting, 2007 John McGee – OSG Engagement Manager Renaissance Computing Institute.
Unidata Policy Committee Meeting Bernard M. Grant, Assistant Program Coordinator for the Atmospheric and Geospace Sciences Division May 2012 NSF.
CyberInfrastructure and GIS at the National Science Foundation Dr. Jennifer M. Schopf Office of CyberInfrastructure National Science Foundation April 16,
Sept 29-30, 2005 Cambridge, MA 1 Grand Challenges Workshop for Computer Systems Software Brett D. Fleisch Program Director National Science Foundation.
The FY 2009 Budget Thomas N. Cooley, NSF Council of Colleges of Arts and Sciences March 13, 2008.
Edward Seidel, Assistant Director Directorate for Mathematical and Physical Sciences.
Advancing Computational Science in Academic Institutions Organisers: Dan Katz – University of Chicago Gabrielle Allen – Louisiana State University Rob.
NSF ACCI (Advisory Committee for CyberInfrastructure) Taskforce Update - CASC Meeting 23 March 2010 Craig Stewart – Executive Director,
Biomedical Science and Engineering Funding Opportunities at NSF Semahat Demir Program Director Biomedical Engineering Program National Science Foundation.
Transformation of Research and Education in the 21 st Century Edward Seidel Director, Office of Cyberinfrastructure National Science Foundation
Changing Science and Engineering: the impact of HPC Sept 23, 2009 Edward Seidel Assistant Director, Mathematical and Physical Sciences, NSF (Director,
Directorate for Social, Behavioral, and Economic Sciences Amber L. Story Deputy Division Director Directorate for Social, Behavioral, and Economic Sciences.
Campus Cyberinfrastructure – Network Infrastructure and Engineering (CC-NIE) Kevin Thompson NSF Office of CyberInfrastructure April 25, 2012.
Cyberinfrastructure Planning at NSF Deborah L. Crawford Acting Director, Office of Cyberinfrastructure HPC Acquisition Models September 9, 2005.
NSF-funded Research Collaborations with SubSaharan Africa Presented at a Workshop on “Enhancing Research and Education Network Connectivity to and within.
National Science Foundation Experimental Program to Stimulate Competitive Research (NSF EPSCoR) May 24, 2012 National Academies 1.
Cyberinfrastructure A Status Report Deborah Crawford, Ph.D. Interim Director, Office of Cyberinfrastructure National Science Foundation.
Data-Model Assimilation: Collaboration, Integration, & Transformation GLOBAL CARBON CYCLE LAND-USE & LAND-COVER CHANGE HUMAN CONTRIBUTIONS & RESPONSES/DECISION.
Judith E. Skog Biological Sciences Directorate Emerging Frontiers Division H. Richard Lane Geological Sciences Directorate Earth Systems Science.
National Ecological Observatory Network
Russ Hobby Program Manager Internet2 Cyberinfrastructure Architect UC Davis.
Office of Science Office of Biological and Environmental Research DOE Workshop on Community Modeling and Long-term Predictions of the Integrated Water.
Geosciences - Observations (Bob Wilhelmson) The geosciences in NSF’s world consists of atmospheric science, ocean science, and earth science Many of the.
Soil and Water Conservation Modeling: MODELING SUMMIT SUMMARY COMMENTS Dennis Ojima Natural Resource Ecology Laboratory COLORADO STATE UNIVERSITY 31 MARCH.
HPC Centres and Strategies for Advancing Computational Science in Academic Institutions Organisers: Dan Katz – University of Chicago Gabrielle Allen –
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Biocomplexity Teacher Workshop May 31 – June 2, 2008 University of Puerto Rico.
Midwest Big Data Hub Letters of Intent for NSF Edward Seidel Director, NCSA Founder Prof. of Physics, Prof of Astronomy On behalf of the Midwest.
National Strategic Computing Initiative
O C I October 31, 2006Office of CyberInfrastructure Implementing the Strategic Vision for Digital Data NSF Data Group ACCI Meeting October 31, 2006.
Implementing a National Data Infrastructure: Opportunities for the BIO Community Peter McCartney Program Director Division of Biological Infrastructure.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
1 Cyber-Enabled Discovery and Innovation Michael Foster May 11, 2007.
Digital Data Collections ARL, CNI, CLIR, and DLF Forum October 28, 2005 Washington DC Chris Greer Program Director National Science Foundation.
Understanding Collaboration Using Social Network Analysis Diana Rhoten Office of Cyberinfrastructure National Science Foundation.
Forging the eXtremeDigital (XD) Program Barry I. Schneider Program Director, Office of CyberInfrastructure January 20, 2011.
NSF Organization National Science Board Director & Deputy Director Computer & Information Sci & Eng Engineering Geo- Sciences Mathematical & Physical Sciences.
NASA Earth Exchange (NEX) A collaborative supercomputing environment for global change science Earth Science Division/NASA Advanced Supercomputing (NAS)
NSF INCLUDES Inclusion Across the Nation of Learners of Underrepresented Discoverers in Engineering and Science AISL PI Meeting, March 1, 2016 Sylvia M.
Data Infrastructure Building Blocks (DIBBS) NSF Solicitation Webinar -- March 3, 2016 Amy Walton, Program Director Advanced Cyberinfrastructure.
CyberGIS Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
1 NCSA 2015 Strategic Planning Process April 21, 2010 José L. Muñoz (Acting) Director, OCI (thanks to Blatecky, Parashar and Pennington) 1.
Engineering (Richard D. Braatz and Umberto Ravaioli)
Matthew Hawkins Head, NSF Large Facilities Office
Unidata Policy Committee Meeting
BCoN Data Integration Workshop, University of Kansas, Feb 13-14, 2018
Presentation transcript:

Data- and Compute-Driven Transformation of Modern Science Update on the NSF Cyberinfrastructure Vision People, Sustainability, Innovation, Integration Edward Seidel Acting Assistant Director, Mathematical and Physical Sciences, NSF (Director, Office of Cyberinfrastructure) 1

2 Profound Transformation of Science Gravitational Physics  Galileo, Newton usher in birth of modern science: c  Problem: single “particle” (apple) in gravitational field (General 2 body- problem already too hard)  Methods  Data: notebooks (Kbytes)  Theory: driven by data  Computation: calculus by hand (1 Flop/s)  Collaboration  1 brilliant scientist, 1-2 student

3 3 3D Collision Science Result Year: 1998 Team size ~ 15 Data produced ~ 50Gbytes 3D Collision Science Result Year: 1998 Team size ~ 15 Data produced ~ 50Gbytes Profound Transformation of Science Collision of Two Black Holes Science Result The “Pair of Pants” Year: 1994 Team size ~ 10 Data produced ~ 50Mbytes Impact of HPC taking root Science Result The “Pair of Pants” Year: 1994 Team size ~ 10 Data produced ~ 50Mbytes Impact of HPC taking root  Science Result  The “Pair of Pants”  Year: 1972  Team size  1 person (S. Hawking)  Computation  Flop/s  Data produced  ~ Kbytes (text, hand- drawn sketch)  400 years later…same!

4 Now: Complexity of Universe LHC, Gamma-ray bursts!  Gamma-ray bursts! GR now soluble: complex problems in relativistic astro can now be attacked All energy emitted in lifetime of sun bursts out in a few seconds: what are they?! Colliding BH-NS? SN? GR, hydrodynamics, nuclear physics, radiation transport, neutrinos, magnetic fields: globally distributed collab! Scalable algorithms, complex AMR codes, viz, PFlops*week, PB output!  LHC: What is the nature of mass? Higgs particle? ~10K scientists, 33+ countries, 25PB data, distributed! Planetary lab for scientific discovery! Remote Instrument

5 Grand Challenge Communities Combine it All... Where is it going to go? 5 Same CI useful for black holes, hurricanes

6 Grand Challenge Communities  Complex problems require many disciplines, all scales of collaborations, advanced CI  Individuals, groups, teams, communities  Multiscale Collaborations: Beyond teams  Grand Challenge Communities assemble dynamically  Emergency forecasting: flu, hurricane, tornado...  Gamma-ray bursts, supernovae,  They can only work by sharing data  Place requirements on  CI: software, networks, collaborative environments, data, sharing, computing, etc  Scientific culture, reproducibility, access, university structures New social networking technologies will be needed for collaborations at this scale. Allen, Schnetter, et al 6

NSF Vision and National CI Blueprint 7 Track 1 Track 2 CampusCampusCampusCampusCampusCampus CampusCampus CampusCampusCampusCampusCampusCampusCampusCampus DataNetDataNet DataNetDataNet SoftwareSoftware NetsNets DataNetDataNet DataNetDataNet DataNetDataNet Learning & Work Force Needs & Opportunities Virtual Organizations for Distributed Communities High Performance Computing Data & Visualization/ Interaction Education Crisis: I need all of this to start to solve my problem!

What is Needed? 8 NSF-wide CI Framework for 21 st Century Science & Engineering

CF21: Cyberinfrastructure Framework for 21 st Century Science & Engineering  High-end computation, data, visualization for transformative science; sustainability, extensibility  Facilities/centers as hubs of innovation  MREFCs and collaborations including large-scale NSF collaborative facilities, international partners  Software, tools, science applications, and VOs critical to science, integrally connected to hardware  Campuses fundamentally linked; grids, clouds, loosely coupled campus services, policy to support  People. Comprehensive approach workforce development for 21st century science and engineering 9 Comprehensive, balanced, integrated, national high performance CI; Dear Colleague Letter released December, 2009 by all units

10 ACCI Task Forces Campus Bridging: Craig Stewart, IU (BIO) Computing: Thomas Zacharia, ORNL/UTK (DOE) Grand Challenge Communities/VOs: Tinsley Oden, Austin (ENG) Education & Workforce: Alex Ramirez, CEOSE Software: David Keyes, Columbia/KAUS T (MPS) Data & Viz: Shenda Baker, Harvey Mudd (MPS); Tony Hey, (CISE)  Timelines: months  Advising NSF  Workshop(s)  Recommendations  Input to NSF informs CF21 programs, 2012 CI Vision Plan 10

Preliminary Task Force (TF) Results  Computing TF Workshop Interim Report  Rec: Address sustainability, people, innovation Developing CF21-oriented HPC program  Software TF Interim Report  Rec: Address sustainability, create long term, multi- directorate, multi-level software program Developing CF21-oriented integrated program  GCC/VO TF Interim Report  Rec: Address sustainability, OCI to nurture computational science across NSF units Concept paper coming to NSF PITAC: “inadequate structures within the Federal government and the academy today do not effectively support computational science” 11

Roadmap and Timelines 12 DataNetDataNet DataNetDataNet Track Task Force Reports and Workshops NSF CF21 Strategic Plan Integration Stronger interagency interaction New science activities enabled DataNetDataNetDataNetDataNetDataNetDataNet National Petascale Facility CF21Computing program; hubs of innovation CF21 Software People, VOs Better campus integration Major facilities CI planning

OCI Special Role in CF21  Driver for integrative CI activity via CF21  Working with all units, community Develop vision and implementation plan OCI budget ¼ NSF CI; other units critical!  Catalyst for coordinated, linked investments  CI in all forms: campus, centers, MREFC Leadership in R&D for prototypes, pilots, best practices Looking for coherence, re-use of CI  Science applications enabled by CI  People: supporting next generation of CI researchers  Steward for NSF-wide computational science  Working with all NSF units to provide sustainable home 13

2009 PetaApps, CDI, CI-Reuse 70% OCI ARRA: Innovations in software, apps, people  PetaApps: OCI led, NSF-wide  Partners: MPS, CISE, ENG, GEO and SBE  2009: $16M from OCI, matched for total of $35M!  Total: 42 awards, ~200 proposals, $60M  Equivalent to entire Track-2 award (including O&M)  CDI: CISE led, NSF-wide  OCI a “Big 4” contributor in FY09! ( CISE, ENG, OCI, MPS…), $63M total  OCI contributed to 22 awards, more than $10M  CI Re-Use: Internal OCI-led NSF program  OCI venture fund of $4M to catalyze  CISE, GEO, OPP, BIO and MPS  13 awards, > $20M investments catalyzed by OCI 14

MREFC Projects: NEON, and Cyber-GIS 15

James Collins, Assistant Director Biological Sciences Directorate, NSF Office of Management and Budget Briefing October 5, 2009 National Ecological Observatory Network New horizons for large-scale biology

How does the effect of climate change on biosphere processes vary along regional and continental gradients? What is the effect of the biosphere on regional climate? How will land use change affect the dispersion of invasive species through a region and across the continent? How do large scale physical processes produce regional to continental ecological responses? In theory all life is interconnected…. What is NEON? NEON is an integrated sensing system to detect, understand, and forecast the consequences of climate and landuse change and the effects of invasive species on the biosphere of the U.S. at the regional and continental scales. Enables research to address ….

Cyberinfrastructure Decision Support Education Research Distributed Centralized Operational and Support Systems (OSS) Data Services Data Management Airborne Remote Sensing Archive Data Products Raw Data Acquisition Data Process Management Portals Future Sources In situ Sensors Satellite Remote Sensing Biological Monitoring and Measurements

Landuse Analysis Package  On land use, land cover and land management -- drivers of change  Across multiple spatial scales (local to the continental) for the entire NEON realm  Across multiple temporal scales (days to decades to centuries) to help understand legacy effects of prior land use on ecosystem function and performance  For use by ecological modelers and forecasters to extend models to a continental scale The NEON Land Use Analysis Package (LUAP) provides information: (ISEP, NOD) Goal to “… collate existing data … on past and current land use practices as well as economic and social data that are useful for prediction of future land use processes”

Landuse Analysis Package  NEON must scale from site to region to continent  Remote sensing, aircraft borne, satellite.  Spectral and LiDAR data converted into 3D biogeochemical fingerprints of earth surface including vegetation and human structures.  GIS critical to convert sensor data to spatial data  USGS will provide satellite data from MODIS, Landsat, etc.  NEON will ingest other spatial data from and convert them into spatial data using GIS 20

New Approaches with CF21 21

Emerging CF21 Concepts  CF21 HPC program  Sustainability, hubs of innovation + experimental  Looking to develop new program in FY10  CF21 Software Institutes and Innovators  Transform innovation into sustainable software  Significant multiscale, long-term program Connected institutes, teams, investigators Integrated into CF21 framework w/Directorates 22 Hierarchical structures that link innovation and sustainability, integrate with national and campus activities

Concept for NSF-wide Fellowships for Transformative Computational Science  Goal: People! Build innovative researchers in computational science by supporting outstanding postdocs  Emphasize central role of cyberscience in all sciences (physical, biological, geological, mathematical, social, behavioral, economic, computer, information and data)  Support cyberscience research and education: CI- based, cross disciplinary boundaries Use CI to make revolutionary advances in their disciplines Research and develop CI that enables innovative computational practices 23

Summary  Science is being revolutionized through CI  Compute, data, networking advance suddenly 9-12 orders of magnitude after 4 centuries  All forms of CI—including GIS—needed for science  NSF responsive: developing much more comprehensive, integrated CF21 initiative  All units involved; OCI, CISE play important roles  Community deeply engaged in planning  Activities ramp up in FY11-12 and beyond  People, sustainability, innovation, integration  Longer term programs, better linked, hubs of innovation  Support computational scientists who develop and/or use advanced CI 24