Current status of gLite

Slides:



Advertisements
Similar presentations
INFSO-RI Enabling Grids for E-sciencE EGEE and gLite Slides by: Erwin Laure EGEE Deputy Middleware Manager.
Advertisements

21 Sep 2005LCG's R-GMA Applications R-GMA and LCG Steve Fisher & Antony Wilson.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Middleware Claudio Grandi (INFN – Bologna) Workshop Commissione.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
INFSO-RI Enabling Grids for E-sciencE LCG-2 and gLite Architecture and components Author E.Slabospitskaya.
Plateforme de Calcul pour les Sciences du Vivant SRB & gLite V. Breton.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
GLite, the next generation middleware for Grid computing Oxana Smirnova (Lund/CERN) Nordic Grid Neighborhood Meeting Linköping, October 20, 2004 Uses material.
INFSO-RI Enabling Grids for E-sciencE gLite: Short Summary Anar Manafov, GSI Material on EGEE 3 rd Conference April 18-22, 2005.
INFSO-RI Enabling Grids for E-sciencE Comparison of LCG-2 and gLite Author E.Slabospitskaya Location IHEP.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.
OSG Middleware Roadmap Rob Gardner University of Chicago OSG / EGEE Operations Workshop CERN June 19-20, 2006.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
INFSO-RI Enabling Grids for E-sciencE Status and Plans of gLite Middleware Erwin Laure 4 th ARDA Workshop 7-8 March 2005.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Security and Job Management.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
LCG EGEE is a project funded by the European Union under contract IST LCG PEB, 7 th June 2004 Prototype Middleware Status Update Frédéric Hemmer.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Overview Roberto Barbera Univ. of.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management and Interoperability Peter Kunszt (JRA1 DM Cluster) 2 nd EGEE Conference,
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
INFSO-RI Enabling Grids for E-sciencE The gLite File Transfer Service: Middleware Lessons Learned form Service Challenges Paolo.
Enabling Grids for E-sciencE gLite for ATLAS Production Simone Campana, CERN/INFN ATLAS production meeting May 2, 2005.
INFSO-RI Enabling Grids for E-sciencE gLite Middleware Status Frédéric Hemmer, JRA1 Manager, CERN On behalf of JRA1.
The CMS Top 5 Issues/Concerns wrt. WLCG services WLCG-MB April 3, 2007 Matthias Kasemann CERN/DESY.
INFSO-RI Enabling Grids for E-sciencE /10/20054th EGEE Conference - Pisa1 gLite Configuration and Deployment Models JRA1 Integration.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
Segundo Taller Latino Americano de Computación GRID – Primer Taller Latino Americano de EELA – Primer Tutorial Latino Americano de EELA
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
13th EELA Tutorial, La Antigua, 18-19, October E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA
Bob Jones – Project Architecture - 1 March n° 1 Project Architecture, Middleware and Delivery Schedule Bob Jones Technical Coordinator, WP12, CERN.
INFSO-RI Enabling Grids for E-sciencE gLite Overview Riccardo Bruno, Salvatore Scifo gLite - Tutorial Catania, dd.mm.yyyy.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Overveiw of the gLite middleware Yaodong Cheng
ATLAS DDM Developing a Data Management System for the ATLAS Experiment September 20, 2005 Miguel Branco
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
EGEE Data Management Services
Resource access in the EGEE project Massimo Sgaravatto INFN Padova
Jean-Philippe Baud, IT-GD, CERN November 2007
Gri2Win: Porting gLite to run under Windows XP Platform
Grid2Win: Porting of gLite middleware to Windows platform
Erwin Laure EGEE-I Deputy Middleware Manager
Grid2Win Porting of gLite middleware to Windows XP platform
gLite Basic APIs Christos Filippidis
Grid Computing: Running your Jobs around the World
OGF PGI – EDGI Security Use Case and Requirements
StoRM: a SRM solution for disk based storage systems
gLite Grid Services Salma Saber
GDB 8th March 2006 Flavia Donno IT/GD, CERN
Comparison of LCG-2 and gLite v1.0
gLite Middleware Status
Slides contributed by EGEE Team
Introduction to Data Management in EGI
Grid2Win: Porting of gLite middleware to Windows XP platform
Introduction to Grid Technology
Grid Services Ouafa Bentaleb CERIST, Algeria
Short update on the latest gLite status
Gri2Win: Porting gLite to run under Windows XP Platform
LCG middleware and LHC experiments ARDA project
Data Management cluster summary
R-GMA (Relational Grid Monitoring Architecture) for monitoring applications “s” gLite and LCG.
From Prototype to Production Grid
The GENIUS portal and the GILDA t-Infrastructure
Grid Introduction and gLite Overview
Overview of gLite Middleware
Based on material by Sergio Andreozzi INFN-CNAF
INFNGRID Workshop – Bari, Italy, October 2004
gLite The EGEE Middleware Distribution
Presentation transcript:

Current status of gLite Erwin Laure, CERN 3rd EGEE Conference, Athens, Greece April 18-22, 2005

Outline Contents of gLite release 1.0 Major differences to LCG-2 Components, architecture, and service interplay Major differences to LCG-2 Major open issues Future plans 3rd EGEE Conference

gLite Services for Release 1 Grid Access Service API Access Services Job Provenance Job Management Services Computing Element Workload Management Package Manager Metadata Catalog Data Services Storage Element Data Management File & Replica Catalog Authorization Security Services Authentication Auditing Information & Monitoring Information & Monitoring Services Application Monitoring Site Proxy Accounting JRA3 UK CERN IT/CZ Focus on key services Released on April 5th 2005 3rd EGEE Conference

gLite Services for Release 1 Software stack and origin (simplified) Computing Element Gatekeeper, WSS (Globus) Condor-C (Condor) CE Monitor (EGEE) Local batch system (PBS, LSF, Condor) Workload Management WMS (EDG) Logging and bookkeeping (EDG) Storage Element File Transfer/Placement (EGEE) glite-I/O (AliEn) GridFTP (Globus) SRM: Castor (CERN), dCache (FNAL, DESY), other SRMs Catalog File and Replica Catalog (EGEE) Metadata Catalog (EGEE) Information and Monitoring R-GMA (EDG) Security VOMS (DataTAG, EDG) GSI (Globus) Authentication and authorization for C and Java based (web) services (EDG) 3rd EGEE Conference

WMS Interaction Overview 3rd EGEE Conference

CE Interaction Overview Collaboration of JRA1 (INFN, Univ. of Chicago, Univ. of Wisconsin-Madison), and JRA3 Submit job Grid Gatekeeper LCAS LCMAPS WSS CEMon Notifications Launch Condor-C Blahpd Condor-C CE Should evolve into a VO scheduler Local batch system LSF PBS/ Torque Condor 3rd EGEE Conference

DM Interaction Overview Storage Element WSDL VOMS Storage API Get credential File I/O gLite I/O SRM gridFTP File namespace and Metadata mgmt File and Replica Catalog StorageIndex Fireman Database Store credential File replication File Transfer and Placement Service FTS FPS Transfer Agent Database Proxy renewal Replica Location MyProxy WMS 3rd EGEE Conference

WMS Started with LCG-2 Workload Management System Support partitioned jobs and jobs with dependencies Task Queue Persistent queue for submitted jobs Information Supermarket Read only information system cache Updated by Information systems (CE in push mode) Notification of available resources (CE in pull mode) Combination of both Push and pull mode Interface to Data Management EDG-RLS StorageIndex DLI Condor-C Reliable job submission between the WM and the CE CE moving towards a VO based scheduler 3rd EGEE Conference

WMS Cont’d Major problems Failure rate ~12% (retrycount = 0), otherwise 100% success Several reasons being investigated (e.g. race conditions) Shallow re-submission (i.e. retry of submission, not execution) might help Matchmaking is being blocked sometimes Fix provided for Release 1.1 (end of April) Condor as backend not yet working Not yet final architecture of CE: One Schedd per local user id Need setuid services and head node monitoring (Globus+JRA3) Not a lot of experience tuning the CE Monitor Need some examples 3rd EGEE Conference

Data Management Services Mostly new developments based on (and re-using some) AliEn services Storage Element Storage Resource Manager rely on existing implementations POSIX-I/O gLite-I/O Access protocols gsiftp, https, rfio, … Catalogs File Catalog Replica Catalog File Authorization Service Metadata Catalog File Transfer Data Scheduler planned for Release 2 File Transfer Service gLite FTS and glite-url-copy File Placement Service gLite FPS Tech preview in release 1.0 Modifications for LCG SCs in Release 1.1 gLite FiReMan Catalog (MySQL and Oracle) gLite Metadata Catalog 3rd EGEE Conference

Data Management Cont’d Addressing shortcomings of LCG-2 data management RLS performance A non distributed catalog Although only single catalog supported in release 1 Lack of consistent grid-storage interfaces Unreliable data transfer layer Fireman Catalog Hierarchical Name Space Bulk Operations ACLs Web Services Interface POOL Interface Performance/scalability gLite I/O Support of ACL’s Support of Fireman catalog in addition to RLS File Transfer Service Did not exist on LCG-2 Differences to LCG-2 explained in http://egee-jra1-dm.web.cern.ch/egee-jra1-dm/lcg-utils-glite.htm 3rd EGEE Conference

Data Management Cont’d Major problems (Catalog) No ROOT interface available yet Will be done in next weeks Performance tuning Problems with security configuration Major problems (glite-I/O) Sensitive to SRM failures Error codes not always explanatory Interfaced to FireMan and RLS/RMC catalog Interface to AliEn FC should be available in Release 1.1 Not yet interfaced to ROOT 3rd EGEE Conference

Information and Monitoring Services R-GMA (Relational Grid Monitoring Architecture) Implements GGF GMA standard Development started in EDG, deployed on the production infrastructure for accounting Producer Service Registry Consumer API Mediator Schema application Publish Tuples Send Query Receive Tuples Register Locate Query Tuples SQL “CREATE TABLE” SQL “INSERT” SQL “SELECT” 3rd EGEE Conference

Information and Monitoring Services Cont’d R-GMA Producer, Consumer, Registry and Schema services with supporting tools Registry replication Simpler API – matching the next (WS) release Provides smooth transition between old API and WS coping with life on the Grid: poorly configured networks, firewalls, MySQL corruptions etc Generic Service Discovery API Major problems Difficult to deploy Performance problems observed by LCG-2 Differences between lcg 2.4.0 and gLite essentially schema changes for service discovery and security turned on in gLite 3rd EGEE Conference

VOMS Derived from DataTag and EDG Used for VO mgmt RFC compliance VOMS certificates understood by WMS and Data Mgmt (as of release 1.1) RFC compliance Major problems Incompatibility with previous VOMS versions Due to RFC compliance Limited deployment options (only single VO) Due to lack of communication among the involved parties Fixed in Release 1.1 3rd EGEE Conference

UI Clients for WMS, LB, Catalog+gLiteI/O, R-GMA, and VOMS Fully re-locatable Can install with un-privileged accounts 3rd EGEE Conference

Future Plans Defect fixing has highest priority Monthly release cycles Done in the release 1 branch Monthly release cycles Defect fixes New functionality according to planning with SA1 and NA4 Quick fixes Patches that cannot wait for the monthly cycle Release 1.1 due end of April FTS/FPS Metadata catalog VOMS with multi-VO configuration Defect fixes (WMS matchmaking hanging problem, DM security configuration…) Planning document for Release 2 distributed https://edms.cern.ch/document/573493/4 3rd EGEE Conference

Future Plans - WMS Move CE to final architecture WS interface to WMS One scheduler per VO (can be provided by VO) Head-node monitor and fork/set-uid service WS interface to WMS With better support for bulk job-submission Support for pilot jobs Integration of network information (JRA4) Closer integration with Data Mgmt Common job and data transfer DAGs Data matchmaking for ranking “shallow” job-resubmission CE history used for ranking Use information from R-GMA in the information supermarket 3rd EGEE Conference

Future Plans - DM Security in DM chain Delegation Work out security model (local vs. Grid) Support SRMs with native ACL support VOMS roles for ACL’s Distributed/Partitioned Catalogs Need to define model Integration of network information (JRA4) Explore xrootd Harmonize metadata interface (ARDA, PTF) Data Scheduler (equivalent to WMS for data transfer requests) 3rd EGEE Conference

Future Plans – I&M R-GMA WS enabled and re-factored Should help with performance problems observed on LCG-2 (partial results are provided more quickly) R-GMA Schema replication Remove single point of failure Usage of service discovery for WMS and data mgmt. R-GMA using VOMS based authentication 3rd EGEE Conference

Future Plans - Others Common configuration, instrumentation, and logging framework for gLite Services More information in L&B More error information Basic job statistics (depending on information available from the resource) Status of data transfer requests Accounting Agreement service E.g. for advance reservation of networks (jointly with JRA4) Package mgr GAS 3rd EGEE Conference

http://www.glite.org glite-announce@cern.ch http://cern.ch/egee-jra1 More information http://www.glite.org glite-announce@cern.ch http://cern.ch/egee-jra1 3rd EGEE Conference