Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting September 11-12, 2003 Washington D.C.

Slides:



Advertisements
Similar presentations
Implementing Tableau Server in an Enterprise Environment
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Pharos Uniprint 8.3.
NGAS – The Next Generation Archive System Jens Knudstrup NGAS The Next Generation Archive System.
Chapter 20 Oracle Secure Backup.
Accounting Manager Taking resource usage into your own hands Scott Jackson
Accounting Manager Taking resource usage into your own hands Scott Jackson Pacific Northwest National Laboratory
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
IWay Service Manager 6.1 Product Update Scott Hathaway iWay Software Copyright 2010, Information Builders. Slide 1.
Module 10: Troubleshooting AD DS, DNS, and Replication Issues.
Bookshelf.EXE - BX A dynamic version of Bookshelf –Automatic submission of algorithm implementations, data and benchmarks into database Distributed computing.
Environmental Council of States Network Authentication and Authorization Services The Shared Security Component February 28, 2005.
Presented by Scalable Systems Software Project Al Geist Computer Science Research Group Computer Science and Mathematics Division Research supported by.
6/4/2015Page 1 Enterprise Service Bus (ESB) B. Ramamurthy.
Security SIG: Introduction to Tripwire Chris Harwood John Ives.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 10: Server Administration.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
Tripwire Enterprise Server – Getting Started Doreen Meyer and Vincent Fox UC Davis, Information and Education Technology June 6, 2006.
Understanding and Managing WebSphere V5
Enterprise Reporting with Reporting Services SQL Server 2005 Donald Farmer Group Program Manager Microsoft Corporation.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting February 24-25, 2003.
Windows Server MIS 424 Professor Sandvig. Overview Role of servers Performance Requirements Server Hardware Software Windows Server IIS.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Building service testbeds on FIRE D5.2.5 Virtual Cluster on Federated Cloud Demonstration Kit August 2012 Version 1.0 Copyright © 2012 CESGA. All rights.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Remote Desktop Services Remote Desktop Connection Remote Desktop Protocol Remote Assistance Remote Server Administration T0ols.
C Copyright © 2009, Oracle. All rights reserved. Appendix C: Service-Oriented Architectures.
TeraPaths: A QoS Collaborative Data Sharing Infrastructure for Petascale Computing Research Bruce Gibbard & Dantong Yu High-Performance Network Research.
Resource Management and Accounting Working Group Working Group Scope and Components Progress made Current issues being worked Next steps Discussions involving.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting Aug 26-27, 2004 Argonne, IL.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting June 5-6, 2003.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting June 13-14, 2002.
The Pipeline Processing Framework LSST Applications Meeting IPAC Feb. 19, 2008 Raymond Plante National Center for Supercomputing Applications.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting Jan 25-26, 2005 Washington D.C.
第十四章 J2EE 入门 Introduction What is J2EE ?
SUSE Linux Enterprise Desktop Administration Chapter 12 Administer Printing.
1 Apache. 2 Module - Apache ♦ Overview This module focuses on configuring and customizing Apache web server. Apache is a commonly used Hypertext Transfer.
Resource Management Working Group SSS Quarterly Meeting November 28, 2001 Dallas, Tx.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting October 10-11, 2002.
GRAM5 - A sustainable, scalable, reliable GRAM service Stuart Martin - UC/ANL.
Installation and Development Tools National Center for Supercomputing Applications University of Illinois at Urbana-Champaign The SEASR project and its.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
Crystal Ball Panel ORNL Heterogeneous Distributed Computing Research Al Geist ORNL March 6, 2003 SOS 7.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting January 15-16, 2004 Argonne, IL.
Hands-On Microsoft Windows Server Implementing Microsoft Internet Information Services Microsoft Internet Information Services (IIS) –Software included.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting May 10-11, 2005 Argonne, IL.
7. CBM collaboration meetingXDAQ evaluation - J.Adamczewski1.
Secure Systems Research Group - FAU SW Development methodology using patterns and model checking 8/13/2009 Maha B Abbey PhD Candidate.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
National Center for Supercomputing ApplicationsNational Computational Science Grid Packaging Technology Technical Talk University of Wisconsin Condor/GPT.
Creating SmartArt 1.Create a slide and select Insert > SmartArt. 2.Choose a SmartArt design and type your text. (Choose any format to start. You can change.
ClearQuest XML Server with ClearCase Integration Northwest Rational User’s Group February 22, 2007 Frank Scholz Casey Stewart
TOPIC 7.0 LINUX SERVICES AND CONFIGURATION. ROOT USER Root user is called “super user” because it has power far beyond those of mortal user. As root,
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Process Manager Specification Rusty Lusk 1/15/04.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
Accounting in DataGrid HLR software demo Andrea Guarise Milano, September 11, 2001.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
GridWay Overview John-Paul Robinson University of Alabama at Birmingham SURAgrid All-Hands Meeting Washington, D.C. March 15, 2007.
Allocation Management Solutions for High Performance Computing Scott M. Jackson Workshop on Scheduling and Resource Management for Parallel and Distributed.
Linux Systems Administration
ArcGIS for Server Security: Advanced
Architecture Review 10/11/2004
Fundamental of Databases
Jean-Philippe Baud, IT-GD, CERN November 2007
April Webinar: Advanced Configuration of Order Forms in Workflow
Leigh Grundhoefer Indiana University
Wide Area Workload Management Work Package DATAGRID project
Condor-G: An Update.
Presentation transcript:

Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting September 11-12, 2003 Washington D.C.

Resource Management and Accounting Working Group Working group scope Progress over last quarter Next steps Topics for group consideration

Working Group Scope The Resource Management Working Group is involved in the areas of resource management, scheduling and accounting. This working group will focus on the following software components: Queue Manager Scheduler Accounting and Allocation Manager Meta Scheduler Other critical resource management components are being developed in the Process Management and Monitoring Working Group: Process Manager Cluster Monitor

Resource Management Component Architecture Queue Manager Allocation Manager Node Monitor Meta Scheduler Local Scheduler Node Manager Process Manager Security System Information Service Discovery Service Color Key Working Group Resource Management and Accounting Execution Management and Monitoring Node Configuration and Infrastructure Infrastructure Services Event Manager

Resource Management Prototype Demonstration Queue Manager Allocation Manager Node Monitor Local Scheduler Process Manager Discovery Service Color Key Working Group Resource Management and Accounting Execution Management and Monitoring Node Configuration and Infrastructure Job Submission Client 1 Submit-Job 3 Query-Node 6 Exec-Process 4 Create-Reservation 2 Query-Job 5 Run-Job 8 Delete-Job 0 Service-Lookup 7 Query-Job 9 Withdraw-Allocation This demo runs a simple end-to-end test with a job being submitted running past it’s wallclock limit

General Progress SSSRMAP v2.0 interface specification has been implemented and tested by most RMWG components [all except Silver Meta-Scheduler] –Includes both HTTP Wire Protocol and XML Message Format –Includes implementation of SSS Job Object definition v2.0 Implemented default SSSRMAP v2.0 security authentication for most components [all except Silver Meta-Scheduler] –HMAC-SHA1 digital signature with shared secret key –Canonicalizes the XML System testing nearly complete for SSSRMAP v2 (on xtorc)

General Progress Created Node Object Specification version 1.0 –Differentiates between configured, available and utilized node properties Proposed set of response/status codes drafted –Success, warning and error response codes –Allows for component-specific and application-specific error- codes –Supports multiple levels of specificity Portability testing is underway (for initial release components) –Cobbed together a bunch of machines to test on

Portability Testing Progress

General Progress Re-release of v1.0 Initial SSS Resource Management Suite –OpenPBS-SSS sss_pbs_svr –Maui Scheduler –QBank sss_qbank_svr Webpage for RMWG recreated –Relevant documentation –Software downloads (tarballs and rpms) –Mailing lists –Links –Bug tracking

Scheduler Progress Implemented SSSRMAP v2 Wire Protocol (all clients, Resource Manager and Allocation Manager interfaces) Implemented SSSRMAP v2 Message Format (75% of clients, job object for Resource Manager and Allocation Manager) Implemented SSSRMAP authentication via shared secret keys Added 64-bit support for HMAC algorithm

Scheduler Progress Added limited support for SSSRMAP error codes Improved multi-taskgroup support Added man pages and command-line usage documentation Added resource manager, allocation manager and grid scheduler diagnostics to assist in configuration and troubleshooting

Queue Manager Progress Settled on a name: "Bamboo" Implemented SSSRMAP v2 wire protocol (including authentication) Support for SSSRMAP v2 XML support for single step jobs Data storage implemented via flat files or ODBC compliant database. Many improvements in build system, run-time configuration, and logging.

Accounting and Allocation Manager Progress QBank –Portability testing has begun Linux, HP-UX, AIX and IRIX completed Gold –Implemented SSSRMAP v2.0 encryption Using 3DES & session key generated from shared secret key Uses compression algorithm –Tested and verified SSSRMAP v2.0 authentication –Support added for Role Based Access Control (fine- grained command authorization)

Accounting and Allocation Manager Progress Gold –Object interface in web-based GUI has been implemented (gives you powerful low-level access to allocation manager objects) –Reimplemented the Gold client in Perl to overcome latency issues inherent in java startup overhead –Created a suite of full-featured Perl command-line clients Manages Users, Projects, Machines, Allocations (deposits, withdrawals, refunds, transfers, balance,…), Reservations, Quotations, Jobs, Usage, Transactions –Installed Gold on PNNL 11.8TF Linux cluster in transparent mode to test coherency and stability –Slow progress on open source front Blanket approval letter came out June 24th (CC ) Decision not to commercialize Public domain vs. open source (copyright issues)

Meta-Scheduler Progress Documentation for installation, configuration, and troubleshooting

Future Work User Oriented Problem Response System Complete portability testing for initial release components –(at least Linux, AIX, +other_UNIX) Release alpha versions of new components –(Bamboo, Silver, Gold) Begin portability testing for new components Create per-component interface specification documents (binding to SSSRMAP) Complete Design Specification documents for new components

Future Work Local Scheduler Complete integration of SSSRMAP v2 for queue and node objects Support full suite of allocation manager interface calls Full support for error codes Enhance dynamic job support with queue/task manager

Future Work Local Scheduler Support multi-source resource management interface Continued progress in resource limit enforcement and tracking Full resource limit enforcement and tracking configuration Integrate with checkpoint/restart capability (when available)

Future Work Queue manager Finish prologue/epilogue support once exit codes are available from process manager Interface with Node Monitor (probably after initial release) IO staging (may need API from process manager) Full multi step job support Package code for distribution. Add support for optional site job submission verification script.

Future Work Accounting and Allocation manager User and Admin interface for Gold Web-based GUI will be developed Integration with Directory Service Open source gold (BSD license) SSL over web gui and password authentication Production testing of Gold on 11.8TF Linux cluster (side-by-side with QBank)

Future Work Meta Scheduler Implement SSSRMAP v2 Wire Protocol and Message Format Add allocation manager interface support Add threaded support for cluster (local) scheduler interface

Issues requiring inter-group discussion Response Codes SC03 User Oriented Problem Response System Need process exit codes from process manager Cluster Monitor Open source