Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting June 13-14, 2002.

Slides:



Advertisements
Similar presentations
Tom Sugden EPCC OGSA-DAI Future Directions OGSA-DAI User's Forum GridWorld 2006, Washington DC 14 September 2006.
Advertisements

TeraGrid Deployment Test of Grid Software JP Navarro TeraGrid Software Integration University of Chicago OGF 21 October 19, 2007.
Remote Visualisation System (RVS) By: Anil Chandra.
Accounting Manager Taking resource usage into your own hands Scott Jackson Pacific Northwest National Laboratory
CSF4, SGE and Gfarm Integration Zhaohui Ding Jilin University.
Presented by Scalable Systems Software Project Al Geist Computer Science Research Group Computer Science and Mathematics Division Research supported by.
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
6th Biennial Ptolemy Miniconference Berkeley, CA May 12, 2005 Distributed Computing in Kepler Ilkay Altintas Lead, Scientific Workflow Automation Technologies.
Managing Agent Platforms with the Simple Network Management Protocol Brian Remick Thesis Defense June 26, 2015.
Chapter 9: Moving to Design
System Center 2012 R2 Windows Azure Pack Service Management Automation 101.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting February 24-25, 2003.
26-28 th April 2004BioXHIT Kick-off Meeting: WP 5.2Slide 1 WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
STRATEGIES INVOLVED IN REMOTE COMPUTATION
GRID job tracking and monitoring Dmitry Rogozin Laboratory of Particle Physics, JINR 07/08/ /09/2006.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
Software Configuration Management
Institute of Computer and Communication Network Engineering OFC/NFOEC, 6-10 March 2011, Los Angeles, CA Lessons Learned From Implementing a Path Computation.
Resource Management and Accounting Working Group Working Group Scope and Components Progress made Current issues being worked Next steps Discussions involving.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting Aug 26-27, 2004 Argonne, IL.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting June 5-6, 2003.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
COMP 410 & Sky.NET May 2 nd, What is COMP 410? Forming an independent company The customer The planning Learning teamwork.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting Jan 25-26, 2005 Washington D.C.
Process Management Working Group Process Management “Meatball” Dallas November 28, 2001.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Resource Management Working Group SSS Quarterly Meeting November 28, 2001 Dallas, Tx.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting October 10-11, 2002.
The Network Performance Advisor J. W. Ferguson NLANR/DAST & NCSA.
SSS Test Results Scalability, Durability, Anomalies Todd Kordenbrock Technology Consultant Scalable Computing Division Sandia is a multiprogram.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting January 15-16, 2004 Argonne, IL.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting September 11-12, 2003 Washington D.C.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Giuseppe Codispoti INFN - Bologna Egee User ForumMarch 2th BOSS: the CMS interface for job summission, monitoring and bookkeeping W. Bacchi, P.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting May 10-11, 2005 Argonne, IL.
Framework for MDO Studies Amitay Isaacs Center for Aerospace System Design and Engineering IIT Bombay.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
Scalable Systems Software for Terascale Computer Centers Coordinator: Al Geist Participating Organizations ORNL ANL LBNL.
CASTOR evolution Presentation to HEPiX 2003, Vancouver 20/10/2003 Jean-Damien Durand, CERN-IT.
AliEn AliEn at OSC The ALICE distributed computing environment by Bjørn S. Nilsen The Ohio State University.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 – The Ganga Evolution Andrew Maier.
© FPT SOFTWARE – TRAINING MATERIAL – Internal use 04e-BM/NS/HDCV/FSOFT v2/3 JSP Application Models.
CMS Luigi Zangrando, Cern, 16/4/ Run Control Prototype Status M. Gulmini, M. Gaetano, N. Toniolo, S. Ventura, L. Zangrando INFN – Laboratori Nazionali.
ClearQuest XML Server with ClearCase Integration Northwest Rational User’s Group February 22, 2007 Frank Scholz Casey Stewart
System/SDWG Update Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
Slide 1 Service-centric Software Engineering. Slide 2 Objectives To explain the notion of a reusable service, based on web service standards, that provides.
ISG We build general capability Introduction to Olympus Shawn T. Brown, PhD ISG MISSION 2.0 Lead Director of Public Health Applications Pittsburgh Supercomputing.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid2Win : gLite for Microsoft Windows Roberto.
Service Proforma Middleware Workshop. Notes Please complete as much of this proforma as possible – it will help make the workshop more informative & productive.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
K. Harrison CERN, 22nd September 2004 GANGA: ADA USER INTERFACE - Ganga release status - Job-Options Editor - Python support for AJDL - Job Builder - Python.
ATLAS Database Access Library Local Area LCG3D Meeting Fermilab, Batavia, USA October 21, 2004 Alexandre Vaniachine (ANL)
Process Manager Specification Rusty Lusk 1/15/04.
CATI Pitié-Salpêtrière CATI: A national platform for advanced Neuroimaging In Alzheimer’s Disease Standardized MRI and PET acquisitions Across a wide network.
IPS Infrastructure Technological Overview of Work Done.
Daniele Spiga PerugiaCMS Italia 14 Feb ’07 Napoli1 CRAB status and next evolution Daniele Spiga University & INFN Perugia On behalf of CRAB Team.
Accounting in DataGrid HLR software demo Andrea Guarise Milano, September 11, 2001.
CMS Luigi Zangrando, Cern, 16/4/ Run Control Prototype Status M. Gulmini, M. Gaetano, N. Toniolo, S. Ventura, L. Zangrando INFN – Laboratori Nazionali.
Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA Grid2Win : gLite for Microsoft Windows Elisa Ingrà - INFN.
© SCRIBE SOFTWARE CORPORATION 2008 Tips and Tricks for Working with Scribe Insight Trace Files.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Architecture Review 10/11/2004
LCGAA nightlies infrastructure
Integration of Network Services Interface version 2 with the JUNOS Space SDK
Wide Area Workload Management Work Package DATAGRID project
Presentation transcript:

Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting June 13-14, 2002

Resource Management and Accounting Working Group Working Group Scope and Components Progress over last quarter Current issues being worked Next steps Discussions involving larger group

Working Group Scope The Resource Management Working Group is involved in the areas of resource management, scheduling and accounting. This working group will focus on the following software components: Queue Manager Scheduler Allocation Manager (and accounting) Meta Scheduler Other critical resource management components are being developed in the Process Management and Monitoring Working Group: Process Manager Node Monitor

Proposed Component Architecture Queue Manager Allocation Manager Node Monitor Meta Scheduler Local Scheduler Node Manager Process Manager Security System Information Service Discovery Service Color Key Working Group Resource Management and Accounting Execution Management and Monitoring Node Configuration and Infrastructure Infrastructure Services

Resource Management Prototype Demonstration Queue Manager Allocation Manager Node Monitor Local Scheduler Process Manager Discovery Service Color Key Working Group Resource Management and Accounting Execution Management and Monitoring Node Configuration and Infrastructure Job Submission Client 1 Submit-Job 3 Query-Node 6 Exec-Process 4 Create-Reservation 2 Query-Job 5 Run-Job 8 Delete-Job 0 Service-Lookup 7 Query-Job 9 Withdraw-Allocation This demo runs a simple end-to-end test with a job being submitted running past it’s wallclock limit

General Progress Prototype components (Queue Manager and Allocation Manager) advanced to stage of responding to basic requests over XML protocol Existing components (Maui, PBS) partially modified to communicate to SSS components over XML We can run a job now completely in SSS protocol! Initial Requirements documents for Allocation Manager & Queue Manager drafted Began initial draft of Scalable Systems Software Resource Management and Accounting Protocol (SSSRMAP)

Scheduler Progress Developed own XML parser/builder Converted to internal use of XML (job checkpointing etc.) Logically separated Node Monitor & Queue Manager Iface Implemented and tested XML interface to Allocation Mgr to create reservations and make allocation withdrawals Implemented and tested XML interface to Queue Manager to query, start and cancel jobs Implemented and tested XML interface to Node Monitor to query nodes Modified scheduler clients to allow SSS-0.1 socket protocol interface modify checkjob output to display machine readable AVP data Progress on log-based job (resourceXduration and node-mapping) GUI

Meta Scheduler Progress Call Dave in AM or get from Brett

Queue Manager Progress Initial Queue-Manager server and clients supporting: job submission, job query, job deletion and job startup Queue manager and clients use XML over basic protocol Queue Manager supports challenge protocol for communications with the Process Manager Submission client submits job to queue manager and queue manager reports status to user client Test interaction with scheduler to return job information, start a job and cancel a job Job startup is supported via create-process commands with the process manager

Allocation Manager Progress Completed first draft of initial requirements Reviewed requirements/design of other existing project management software Implemented audit log Preservation of historical state (distinct from audit log – allows statement creation and time travel) Support for operators and conjunctions in queries Reworked class structure and schema to support dynamic extensibility of objects and attributes Implemented cached metadata dictionary (for dynamic web-GUIs and generic proxy handling of objects) Lot’s of work on the protocol

Current Issues How best to provide XML interface for PBS Working with Software Engineering Working Group to decide on test framework Seeking to clarify interaction with node manager Determining which component best suited to handle arbitrary batch-specific node features

Next Work Release initial resource management interface specification Incorporate security in RMA components All components under CVS Testing framework installed and first tests created for each component

Next Work Local Scheduler Test interaction with checkpoint/restart mechanisms when interfaces ready Lot’s of testing and write-up of new capabilities Certification of milestones (20% of bullet items ready to be checked off) Security integration Progress on graphical interfaces

Next Work Queue manager Documentation and packing for easy site configuration (nearly done) Implementation of a backside database connection to provide job queue persistence across restarts of the Queue manager Full challenge protocol support in clients and server QM Support for more advanced jobs and job prologue/epilogue, stdout/stderr handling.

Next Work Allocation manager Focus on getting QBank ready for bundling with SSS (security, use key, improved installation procedure) Focus effort on open source of new Allocation Manager (gold) Implement simple pricing engine Develop XML schema for external pricing Implementation of functional allocation, reservation mechanisms Security integration (gold)

Issues requiring inter-group discussion Framing mechanism Security protocol Need to solidify SSS-wide standards for packaging, testing, revision control, documentation standards, problem tracking, etc.