Download presentation
Presentation is loading. Please wait.
Published byHayden Fowler Modified over 11 years ago
1
The Grid Job Monitoring Service Luděk Matyska et al. CESNET, z.s.p.o. Prague Czech Republic
2
Motivation n Job tracking –Too complex environment –Responsibility delegation –Independent decision by components –Security issues (only delegated contact) n Parallel and multipart jobs –Too many sub-tasks –View aggregation
3
Job Movement
4
The Logging and Bookkeeping Service n Collects events associated with job life, e.g. –Job submitted –Resource found –Job started on a CE (Computing Element) –Job finished its computation n Stores them in bookkeeping and logging databases n Provides the job state to end users
5
Job Life Cycle
6
LB service architecture n Two APIs –logging API –server API n Local logger service n The database servers
7
Architecture Schema
8
ArchitectureComments n Message format: –ULM based (NetLogger) –Semantic rules prescribed n Local logger service –locallogger daemon –interlogger daemon –local persistency (local disk file) n Data transfer to database servers –Bookkeeping server: persistent during the job life time –Logging server: eternally persistent
9
Logging API n Simple n Just one function dg_log_event() n Always stores date/time, event producer, jobID n Authenticated
10
Server API n State computed on-demand n Three core functions: –List of users jobs –Job status for a given job –List of events related to a given job n Authenticated
11
Job Identification n GRID-wide (global) identifier n Used to identify the appropriate bookkeeping server –Currently wired in –In the future probably via Information service n URL-like syntax: https://hostname:port/unique_string?... n unique_string to distinguish individual jobs n Bookkeeping server speaks https protocol
12
Security Considerations n Authentication –Both for logging and database queries –Certificate based (user and/or host/service) –User associated with jobID on first authenticated event n Secure channels n Storage (database) access
13
R-GMA Integration n Work in progress n The goals: –To lower database load –To provide notification service –To allow better integration with other information services
14
R-GMAFirst Extension
15
LB Service Extensions n User defined attributes –To store additional information associated with a job –To retrieve job collections n Synchronous API n Job checkpointing (at the application level) –Information stored in Bookkeeping server
16
Job Partitionning n Group ID –Job collections –Hierarchical n Aggregate queries
17
Conclusion n LB service provides –Job tracking –Persistent event storage –Job state provision n Future work –(R-)GMA integration –Authorization –Collective operations
18
Thank you for your interest
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.