Resource Management and Accounting Working Group Working Group Scope and Components Progress made Current issues being worked Next steps Discussions involving.

Slides:



Advertisements
Similar presentations
Copyright Hub Software Engineering Ltd 2010All rights reserved Hub Document Exchange Product Overview Secure Transmission for Transaction-based Documents.
Advertisements

INFSO-RI Enabling Grids for E-sciencE Agreement Service for Resource Reservation and Allocation: Overview Tiziana Ferrari, Elisabetta.
Accounting Manager Taking resource usage into your own hands Scott Jackson Pacific Northwest National Laboratory
CSF4, SGE and Gfarm Integration Zhaohui Ding Jilin University.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 16 – Intro. to Transactions.
Presented by Scalable Systems Software Project Al Geist Computer Science Research Group Computer Science and Mathematics Division Research supported by.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 1, pp For educational use only.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Application architectures
Batch VIP — A backend system of video processing VIEW Technologies The Chinese University of Hong Kong.
Application architectures
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
CVSQL 2 The Design. System Overview System Components CVSQL Server –Three network interfaces –Modular data source provider framework –Decoupled SQL parsing.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting February 24-25, 2003.
GRID job tracking and monitoring Dmitry Rogozin Laboratory of Particle Physics, JINR 07/08/ /09/2006.
RUP Fundamentals - Instructor Notes
MAHI Research Database Data Validation System Software Prototype Demonstration September 18, 2001
Rational Unified Process Fundamentals Module 4: Disciplines II.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting Aug 26-27, 2004 Argonne, IL.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting June 5-6, 2003.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
CS 360 Lecture 3.  The software process is a structured set of activities required to develop a software system.  Fundamental Assumption:  Good software.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting June 13-14, 2002.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting Jan 25-26, 2005 Washington D.C.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Process Management Working Group Process Management “Meatball” Dallas November 28, 2001.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Resource Management Working Group SSS Quarterly Meeting November 28, 2001 Dallas, Tx.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting October 10-11, 2002.
National Center for Supercomputing Applications NCSA OPIE Presentation November 2000.
SSS Test Results Scalability, Durability, Anomalies Todd Kordenbrock Technology Consultant Scalable Computing Division Sandia is a multiprogram.
CVSQL 2 The Revenge of the SQL. The present Read-only access to CVS repository logs Language is a subset of SQL XML interface for returning results Built-in.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting January 15-16, 2004 Argonne, IL.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting September 11-12, 2003 Washington D.C.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting May 10-11, 2005 Argonne, IL.
Scalable Systems Software for Terascale Computer Centers Coordinator: Al Geist Participating Organizations ORNL ANL LBNL.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
CSC480 Software Engineering Lecture 10 September 25, 2002.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
Presented by: Tony Rimovsky TeraGrid Account Management Tony Rimovsky, Area Director for Network Operations and Security
A View from the Top Al Geist June Houston TX.
Objective What is RFT ? How does it work Architecture of RFT RFT and OGSA Issues Demo Questions.
System/SDWG Update Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
A Technical Overview Bill Branan DuraCloud Technical Lead.
Service Proforma Middleware Workshop. Notes Please complete as much of this proforma as possible – it will help make the workshop more informative & productive.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
K. Harrison CERN, 22nd September 2004 GANGA: ADA USER INTERFACE - Ganga release status - Job-Options Editor - Python support for AJDL - Job Builder - Python.
Hyperion Artifact Life Cycle Management Agenda  Overview  Demo  Tips & Tricks  Takeaways  Queries.
Software Development Process CS 360 Lecture 3. Software Process The software process is a structured set of activities required to develop a software.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 16 – Intro. to Transactions.
TRIUMF HLA Development High Level Applications Perform tasks of accelerator and beam control at control- room level, directly interfacing with operators.
Accounting in DataGrid HLR software demo Andrea Guarise Milano, September 11, 2001.
DDM Central Catalogs and Central Database Pedro Salgado.
Application architectures Advisor : Dr. Moneer Al_Mekhlafi By : Ahmed AbdAllah Al_Homaidi.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Enabling Grids for E-sciencE Agreement-based Workload and Resource Management Tiziana Ferrari, Elisabetta Ronchieri Mar 30-31, 2006.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Architecture Review 10/11/2004
LCGAA nightlies infrastructure
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
Wide Area Workload Management Work Package DATAGRID project
A Scripting Server for Domain Automation Tasks
Final Review 27th March Final Review 27th March 2019.
Presentation transcript:

Resource Management and Accounting Working Group Working Group Scope and Components Progress made Current issues being worked Next steps Discussions involving larger group

Working Group Scope The Resource Management Working Group encompasses the areas of resource management, scheduling and accounting. This working group will focus on the following software components: Job Manager(/Queue Manager) Scheduler Allocation Manager (and accounting) Meta Scheduler

Proposed Component Architecture Job/Queue Manager Allocation Manager Collector Meta Scheduler Node Manager Process Manager Security System Information Service Discovery Service Color Key Working Group Resource Management and Accounting Execution Management and Monitoring Node Config and Infrastructure

Proposed Component Architecture Scheduler PBS server PBS Mom Queue Manager Process Manager Collector Node Monitor Job Manager Job ManagementNode Management b a

Component Interaction Diagram Job submitted to Queue Manager User Interface Node Manager Meta Scheduler Job Manager Allocation Manager SchedulerProcess Manager

Component Interaction Trace Job submitted to Queue Manager 1.A user submits a job to the Queue Manager 2.The Queue Manager does a sanity balance check with the Bank 3.The Queue Manager notifies the Scheduler that a new job has arrived 4.The Scheduler queries node and job status until job can run 5.A bank reservation is made with the Allocation Manager 6.The Scheduler requests the Queue Manager to run the job 7.The Queue Manager passes job control to the Process Manager 8.The Process Manager notifies Queue Manager of job completion 9.The Queue Manager notifies Scheduler of job completion 10.A bank withdrawal is made with the Allocation Manager 11.The user is notified of job completion

General Progress Creation of XML marshaller/unmarshaller Establishment of CVS repository Prototype demonstration: Scheduler makes a deposit to allocation manager using XML interface

Scheduler Progress Creation of SSS Resource Manager interface (RMType SSS – half-open sockets) Creation of SSS Allocation Manager interface Creation of allocation manager and resource manager objects for management of arbitrary attributes Integration of XML marshaller/unmarshaller Maui enhancements to link with C++ libs (Xerxes) Additional regression tests

Meta Scheduler Progress Added support for data-staging interface Added support for network proximity optimization Initial support for checkpoint/restart –Checkpoint aware statistics –Checkpoint aware preemption optimizations Sqsub client created allowing PBS-style jobs to be submitted and metascheduled Initial work on translation library (PBS->silver & silver->RS2) Stability enhancements

Job Manager Progress Initial job manager specification defined Interacted with process manager working group and drafted specification proposals for task manager and node manager and how they will interact with RMWG components Initial study on PBS to determine viability of dissection possibilities and functionality enhancements

Allocation Manager Progress Draft requirements document underway XML schema version 0.3 reworked to have explicit request & response elements From scratch allocation manager being used as prototype to test XML interface Implemented create, query, modify and delete for user, account and membership objects (interacting with database over JDBC)

Allocation Manager Progress (contd) Stubbed in dummy withdrawal and successfully demo’d XML interface with scheduler (validating against schema) Logging, config files, error handling General purpose dcecp-like client allows output formatting by utilizing metadata from queries

Current Issues Job Manager/Queue Manager as separate or unified components How to split up PBS (if at all) and at what levels (if any) to refit with XML interface Working with Software Engineering Working Group to decide on test framework

Next Work All components under CVS Establish initial resource management interface specifications for release Scheduler demos by next face-to-face: –Scheduler to process manager (over XML) –Scheduler to node manager (over XML) –Scheduler to job manager (over XML) –Drive an end-to-end checkpoint request –Scheduler talks to registry and discovery service

Next Work Job manager/queue manager milestones –Submission client submits job to queue manager and queue manager reports status to user client –Scheduler implements query to obtain job info from queue manager –Scheduler starts a job (requires implementation of task manager interface) – also cancel job –No prolog, epilog initially. Batch only. Simple single- step jobs. Supports polling mode only. No data-staging.

Next Work Allocation manager –Completion of XML schema for remaining objects/services –Review of requirements (SDSC, NCSA …) –Complete (1 st draft of) initial requirements –Implement machine class, allocations, reservations, withdrawals, transaction register, simple charging algorithm

Issues requiring inter-group coordination Need to solidify SSS-wide standards for packaging, revision control, documentation, problem tracking, online project schedule… and establish mechanisms and places to home them.