Download presentation
Presentation is loading. Please wait.
Published byKristopher Tucker Modified over 8 years ago
1
OSG Area Coordinator’s Report: Workload Management Maxim Potekhin BNL 631-344-3621 potekhin@bnl.gov May 8 th, 2008
2
2 Overview Workload Management Accomplishments Since Last Report Code changes to glexec-enabled Panda Pilot committed to SVN and tested on both BNL and Fermilab sites Understood the issues of the environment set-up when using glexec, which includes both the OS environment variables and the dynamic change of the working directory Code enhancements made to the Panda Pilot to accommodate specifics of MPI clusters, with test done at Purdue and NERSC Finalized the Panda Pilot Factory EGEE interoperability: had consultations and met in person with EGEE/WLCG personnel regarding the status of LCMAPS/LCAS deployment, which is a pre-requisite to using glexec-enabled pilot jobs in setuid mode (note: in OSG, we are using the GUMS plugin, whose setup has been largely understood prior to that). Developed testing plan for EGEE sites, pending additional development of the LCMAPS network API.
3
3 Overview Workload Management Current Initiatives: user support With Panda Pilot code adapted to MPI running mode and initial testing done at Purdue and NERSC, will shortly contact the CHARMM team to coordinate pre-production validation Security Continue to work out configuration and other issues related to glexec integration. Will work to expand testing to WLCG/EGEE sites increasing robustness of the Panda job aggregation and submission service Working on Panda Server code review, refactoring, versioning and improvements to installation and configuration procedures With a new set of hardware available at RACF as a dedicated test platform, we are working towards a comprehensive stress test of Panda service – work in progress
4
4 Overview Workload Management Issues / Concerns Current priorities in the OSG Workload Management effort continue to be scalability and security EGEE interoperability re: glexec? (Need to keep up the effort and cooperation with EGEE) While Panda already features a comprehensive set of monitoring tools, we need to move towards more user- friendly, efficient Panda interface for VO’s and individual users to increase the OSG’s ability to engage new organizations and researchers
5
5 WMS in WBS WBS Task InformationIn ChargeFinish DateComment 4.1.2.1 Deliver phase 1 improvements into OSG 1.0Wenaus12/07/07 4.1.9. Support security effort in facility (including GUMS)Wenaus, Potekhin09/30/08Well under way 4.2.1 Support OSG VOs in building, deploying and operating Workload Management Systems (WMS) that are based on just-in time job scheduling and the integration of tools used by these WMS in to the VDT Potekhin09/30/08MPI integration work under way 4.2.1.1 Deliver phase 1 into OSG 1.0Potekhin, Chiu17/03/08?Testing in production to commence 4.2.1.2 Deliver phase 2 into OSG 1.2Potekhin, Caballero06/07/08In progress 4.2.2. Manage the allocation of compute and storage resources allocated to the OSG-ET by OSG sites and/or external resource providers. Potekhin09/30/08Planning phase 4.2.3 Operate and support the hardware upon which the WMS service for OSG VO is instantiated. Ernst09/30/08In progress 4.2.4. Operate and support the WMS service for the OSG VOPotekhin, Caballero09/30/08In progress 4.3.3.1 Job submission, execution and managementGreen09/30/08
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.