Published by Edmund Bell. Modified over 9 years ago.
1
Automated Grid Monitoring for LHCb Experiment through HammerCloud
Bradley Dice
Valentina Mancinelli
2
Project Overview
Use HammerCloud to…
- Test LHCb data storage access
- Ensure that new releases of user analysis programs function successfully
Why?
- Temporarily disable sites with unreliable storage
- Prioritize bug-fixing by most common problems
- Keep the science moving!
3
Work falls into three categories:
- Front End
- Back End
- Grid Tests
4
Front End (User Interface)
Shows list of current and past tests and offers management tools
Progress:
- Added data visualizations to categorize errors and the sites they affect (right)
- Cleaned menu structures
- Made job colors more easily understandable
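The error-by-site categorization behind such a visualization could be sketched roughly as follows. This is only an illustration: the record fields (`site`, `error`), the function name `sites_by_error`, and the sample site and error names are hypothetical and are not taken from the HammerCloud code.

```python
from collections import defaultdict


def sites_by_error(failed_jobs):
    """Map each error category to the sorted list of sites where it occurred."""
    grouped = defaultdict(set)
    for job in failed_jobs:
        grouped[job["error"]].add(job["site"])
    return {error: sorted(sites) for error, sites in grouped.items()}


# Hypothetical failed-job records; site and error names are illustrative only.
failed_jobs = [
    {"site": "SITE_A", "error": "input data access"},
    {"site": "SITE_B", "error": "input data access"},
    {"site": "SITE_A", "error": "application crash"},
]

for error, sites in sites_by_error(failed_jobs).items():
    print(f"{error}: {', '.join(sites)}")
```

A mapping of this shape can be handed to any charting layer, which then needs no knowledge of how individual jobs were recorded.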
5
Back End (Test Manager)
Interfaces between Ganga (to submit grid jobs) and Django (to display data)
Progress:
- HammerCloud sites automatically update to match the WLCG topology
- Ganga jobs report back detailed information for analysis
- The backend produces plots showing jobs by status: complete, running, scheduled, or failed (right)
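The tally that feeds a jobs-by-status plot might look like the minimal sketch below. It assumes each job is available as a dictionary with a `status` field; that layout and the helper name `count_by_status` are assumptions for illustration, not the actual backend data model.

```python
from collections import Counter

# The four states named on the slide, in a fixed display order.
STATUSES = ("complete", "running", "scheduled", "failed")


def count_by_status(jobs):
    """Return a count for every known status, including statuses with zero jobs."""
    counts = Counter(job["status"] for job in jobs)
    return {status: counts.get(status, 0) for status in STATUSES}


# Hypothetical job records for illustration.
jobs = [
    {"id": 1, "status": "complete"},
    {"id": 2, "status": "failed"},
    {"id": 3, "status": "complete"},
    {"id": 4, "status": "running"},
]

print(count_by_status(jobs))
```

Reporting zero counts explicitly keeps the plot categories stable from one test to the next, even when no job is in a given state.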
6
Grid Tests (Getting Results)
Detecting and classifying data access failures is the key purpose of HammerCloud
Progress:
- A postprocessor has to detect whether files were accessed locally or pulled from another site (failover)
- Failover detection is presently difficult. Current collaboration with the developers of Ganga will help resolve this challenge.
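Once a job can report where its input was actually read from, the classification step itself is simple. The sketch below assumes the postprocessor receives both the site where the job ran and the site that served the file. Obtaining the second value is exactly the part described as difficult, so `replica_site` may be missing. The function and parameter names are hypothetical.

```python
def classify_access(job_site, replica_site):
    """Classify a file access as 'local', 'failover', or 'unknown'.

    job_site:     site where the job ran (assumed to be known)
    replica_site: site that served the file, or None if it could not be determined
    """
    if replica_site is None:
        return "unknown"
    return "local" if replica_site == job_site else "failover"


# Hypothetical examples; site names are illustrative only.
print(classify_access("SITE_A", "SITE_A"))
print(classify_access("SITE_A", "SITE_B"))
print(classify_access("SITE_A", None))
```

Keeping an explicit "unknown" outcome avoids counting undetermined accesses as either local reads or failovers.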
7
Future Steps
- Retrieve more job information (metrics on CPU time, etc.)
- Provide grid site status information to RSS (Resource Status System)
- Create data visualizations requested by LHCb
- Document code in TWiki for future developers