Scalable Systems Software for Terascale Computer Centers www.scidac.org/ScalableSystems Problem Resource Management Computer centers use incompatible, ad hoc set of systems tools Present tools are not designed to scale to multi-Teraflop systems Accounting & user mgmt Solution Collectively (with industry) define standard interfaces between systems components for interoperability Create scalable, standardized management tools for efficiently running our large computing centers System Monitoring System Build & Configure Impact Revolutionize the way system software is designed and used. Job management
Participating Organizations ORNL ANL LBNL PNNL SNL LANL Ames NCSA PSC SDSC IBM HP / Compaq SGI Scyld Intel Unlimited Scale Main Web Site www.scidac.org/ScalableSystems
The Architecture
Modular Architecture Framework
Project Working Groups Main project notebook http://www.scidac.org/ScalableSystems Project subgroups “chapters” http://www.scidac.org/ScalableSystems/chapters.html Meeting notes and overall design ideas Build and Configure Building and configuring systems and information services Resource Management Resource management and accounting Process Management Process management, job accounting, checkpoint-restart and monitoring Integration Validation and integration of all components