Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting June 5-6, 2003.

Slides:



Advertisements
Similar presentations
TeraGrid Deployment Test of Grid Software JP Navarro TeraGrid Software Integration University of Chicago OGF 21 October 19, 2007.
Advertisements

Accounting Manager Taking resource usage into your own hands Scott Jackson
Accounting Manager Taking resource usage into your own hands Scott Jackson Pacific Northwest National Laboratory
Using DSVM to Implement a Distributed File System Ramon Lawrence Dept. of Computer Science
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
Presented by: Priti Lohani
LUNARC, Lund UniversityLSCS 2002 Transparent access to finite element applications using grid and web technology J. Lindemann P.A. Wernberg and G. Sandberg.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Presented by Scalable Systems Software Project Al Geist Computer Science Research Group Computer Science and Mathematics Division Research supported by.
6/4/2015Page 1 Enterprise Service Bus (ESB) B. Ramamurthy.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Components and Architecture CS 543 – Data Warehousing.
Security SIG: Introduction to Tripwire Chris Harwood John Ives.
Chapter 8: Network Operating Systems and Windows Server 2003-Based Networking Network+ Guide to Networks Third Edition.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting February 24-25, 2003.
Presented by: Alvaro Llanos E.  Motivation and Overview  Frangipani Architecture overview  Similar DFS  PETAL: Distributed virtual disks ◦ Overview.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
Resource Management and Accounting Working Group Working Group Scope and Components Progress made Current issues being worked Next steps Discussions involving.
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting Aug 26-27, 2004 Argonne, IL.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Towards a Javascript CoG Kit Gregor von Laszewski Fugang Wang Marlon Pierce Gerald Guo
1 Designing a Data Exchange - Best Practices Data Exchange Scenarios –Sender vs. Receiver-initiated exchanges –Node Design Best Practices: –Handling Large.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting June 13-14, 2002.
Robert Fourer, Jun Ma, Kipp Martin Copyright 2006 An Enterprise Computational System Built on the Optimization Services (OS) Framework and Standards Jun.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting Jan 25-26, 2005 Washington D.C.
Process Management Working Group Process Management “Meatball” Dallas November 28, 2001.
Resource Management Working Group SSS Quarterly Meeting November 28, 2001 Dallas, Tx.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting October 10-11, 2002.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
Crystal Ball Panel ORNL Heterogeneous Distributed Computing Research Al Geist ORNL March 6, 2003 SOS 7.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting January 15-16, 2004 Argonne, IL.
MCTS Guide to Microsoft Windows Server 2008 Applications Infrastructure Configuration (Exam # ) Chapter Five Windows Server 2008 Remote Desktop Services,
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting September 11-12, 2003 Washington D.C.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting May 10-11, 2005 Argonne, IL.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Scalable Systems Software for Terascale Computer Centers Coordinator: Al Geist Participating Organizations ORNL ANL LBNL.
GCRC Meeting 2004 BIRN Coordinating Center Software Development Vicky Rowley.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
ATLAS Grid Requirements A First Draft Rich Baker Brookhaven National Laboratory.
ClearQuest XML Server with ClearCase Integration Northwest Rational User’s Group February 22, 2007 Frank Scholz Casey Stewart
Service Proforma Middleware Workshop. Notes Please complete as much of this proforma as possible – it will help make the workshop more informative & productive.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Process Manager Specification Rusty Lusk 1/15/04.
AFS/OSD Project R.Belloni, L.Giammarino, A.Maslennikov, G.Palumbo, H.Reuter, R.Toebbicke.
The overview How the open market works. Players and Bodies  The main players are –The component supplier  Document  Binary –The authorized supplier.
MGRID Architecture Andy Adamson Center for Information Technology Integration University of Michigan, USA.
Accounting in DataGrid HLR software demo Andrea Guarise Milano, September 11, 2001.
Installing VERITAS Cluster Server. Topic 1: Using the VERITAS Product Installer After completing this topic, you will be able to install VCS using the.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
Allocation Management Solutions for High Performance Computing Scott M. Jackson Workshop on Scheduling and Resource Management for Parallel and Distributed.
XtreemOS IP project is funded by the European Commission under contract IST-FP Scientific coordinator Christine Morin, INRIA Presented by Ana.
ArcGIS for Server Security: Advanced
OpenPBS – Distributed Workload Management System
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
Wide Area Workload Management Work Package DATAGRID project
Presentation transcript:

Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting June 5-6, 2003

Resource Management and Accounting Working Group Working group scope Progress over last quarter Next steps Topics for group consideration

Working Group Scope The Resource Management Working Group is involved in the areas of resource management, scheduling and accounting. This working group will focus on the following software components: Queue Manager Scheduler Accounting and Allocation Manager Meta Scheduler Other critical resource management components are being developed in the Process Management and Monitoring Working Group: Process Manager Cluster Monitor

Proposed Component Architecture Queue Manager Allocation Manager Node Monitor Meta Scheduler Local Scheduler Node Manager Process Manager Security System Information Service Discovery Service Color Key Working Group Resource Management and Accounting Execution Management and Monitoring Node Configuration and Infrastructure Infrastructure Services Event Manager

Resource Management Prototype Demonstration Queue Manager Allocation Manager Node Monitor Local Scheduler Process Manager Discovery Service Color Key Working Group Resource Management and Accounting Execution Management and Monitoring Node Configuration and Infrastructure Job Submission Client 1 Submit-Job 3 Query-Node 6 Exec-Process 4 Create-Reservation 2 Query-Job 5 Run-Job 8 Delete-Job 0 Service-Lookup 7 Query-Job 9 Withdraw-Allocation This demo runs a simple end-to-end test with a job being submitted running past it’s wallclock limit

General Progress SSS front-end created for QBank Ready for re-release of v1.0 Initial SSS Resource Management Suite –OpenPBS-SSS sss_xml front-end –Maui Scheduler –QBank sss_xml front-end Created Job Object Specification version 2.0 –Takes into account all stages of a job’s lifecycle –Support for job steps, preferences, request choices, charging, meta- scheduling, dynamic jobs, multi-task jobs, awareness policy –Distinguishes between requested, utilized and dedicated properties

General Progress Completed version 2.0 of the SSSRMAP resource management interface specification –Includes specifications for authentication and encryption –Has been implemented by Gold Accounting and Allocation Manager (as a proof of design) Beginning to see adoption of SSSRMAP specification –Commitment from SLURM (LLNL) to write interface to SSSRMAP –Commitment from Cluxterworx (Linux Networx) to write interface to SSSRMAP –Interest from bproc-based scheduler (Clemson University) to interface to queue manager via SSSRMAP –CLUBMask resource manager (Penn State) to interface with scheduler via SSSRMAP –Interest from CERN for a data manager interface to scheduler using SSSRMAP

Scheduler Progress Implemented XML client-server interface (40% of clients now using SSSRMAP) New interfaces to support generic resource loads (paging space, I/o, processor load, etc) for resource limit enforcement and tracking Documentation on resource limit enforcement and tracking Added support for multi-task group jobs Support for dynamic reservations (growing and shrinking to support MPI dynamic jobs)

Scheduler Progress Security -- support for a user specified keyfile containing the security token Performance -- continued efforts in memory- footprint reduction Fault tolerance – implemented a fallback server Ease of use -- Initial web-GUI developed (communicates directly with Maui server)

Queue Manager Progress Updated service directory and event manager interfaces Implemented caching of service directory lookups and prioritizing the wire protocol types returned for fault tolerance and performance. Beginning implementation of SSSRMAP v2 wire protocol and xml specification.

Accounting and Allocation Manager Progress Gold –Added support for 95% of functionality from QBank –Allocation design enhancements allocations shareable by users, projects and machines (also supports exclusions) Special wildcard types (ANY, NONE, MEMBER, DEFINED) Enhanced support for activation and expiration times (& active state) –Support added for Deposits Use of deposit shares for non-interactive deposit defaults –Support added for Hierarchical accounts (projects) Affected withdrawals, deposits, reservations, balance checks, etc. Support for recursive trickle up withdrawals and trickle down deposits

Accounting and Allocation Manager Progress Gold –Support added for Refunds –Implemented Guaranteed Quotes –Implemented Transfers –Support added for debit vs. credit allocations –Support for operations (aggregate functions) on returned query fields (sort, sum, max, unique, count, group by, etc) –Negation of options –Association metadata added to aid in GUI object navigation –Enhanced support for transaction logging, journaling, undo, redo –Implemented more flexible charging algorithm

Accounting and Allocation Manager Progress Gold –Implemented SSSRMAP version 2.0 –Implemented SSS Job Object version 2.0 –Infrastructure added for Role-Based Access Control –Support added for method overriding and method scope resolution –Progress on open source front (Gold and sss_xml front-ends) obtained approval from PNNL IP to apply a BSD open source license Sent letter to Fred requesting DOE approval to assert copyright –Created Accounting and Allocation Manager Binding document describing use of SSSRMAP protocol –Beginning effort to develop Web-based GUI (JSP) –Implemented SSSRMAP v2 authentication –Almost completed implementing SSSRMAP v2 encryption

Meta-Scheduler Progress Added basic data scheduling! (tested with Globus) Created interface for data-cache scheduling Fault tolerance improvements –job queue is persistent –Will recover from network failure, system failure, loss of checkpoint files Major documentation in all areas

Future Work Implement v2 SSS Resource Management and Accounting interface specification (all components) Implement v2 Job Object Specification Implement default SSSRMAP v2 security authentication and encryption for all components Release v1.0 Initial SSS Resource Management Suite and improve download and documentation webpages Release Portability enhancements (AIX, Tru64, possibly Cray) Create per-component interface specification documents (binding to SSSRMAP) Draft Design Specification documents

Future Work Local Scheduler Test interaction with checkpoint/restart mechanisms when interfaces ready Continued work on resource limit enforcement and tracking quality of service support for completion time guarantees Implement SSSRMAP v2.0 Security integration (authentication and encryption) Support for maleable jobs (pre-execution) Abstracting resource manager interfaces to accept multiple sources of input data and control Enable simulation to live submission translator

Future Work Queue manager Implement persistence via database (replacing flat files) Add Epilogue/Prologue support and job submission verification script Interface with Node Monitor Full PBS qsub compatibility Implement full input/output handling (need to define PM interfaces, if any) Implement SSSRMAP v2.0 (including security)

Future Work Accounting and Allocation manager Implement SSSRMAP v2 encryption (and test authentication) Implement Role-Based Access Control (fine- grained command authorization) Integration with Directory Service Open source gold (BSD license) Progress on Web-based JSP GUI

Future Work Meta Scheduler Continued effort in allocation management, credential management, data management, Enablement of grid level prioritization and fairness policies

Issues requiring inter-group discussion