Installation and evaluation of the Globus toolkit WP 1 INFN-GRID Workload management WP 1 DATAGRID WP 2.1 INFN-GRID Massimo Sgaravatto INFN Padova.

Slides:



Advertisements
Similar presentations
WP1 Grid Workload Management Massimo Sgaravatto INFN Padova
Advertisements

INFN & Globus activities Massimo Sgaravatto INFN Padova.
Grid Workload Management (WP 1) Report to INFN-GRID TB Massimo Sgaravatto INFN Padova.
WP 1 (Globus) Status Report Massimo Sgaravatto INFN Padova for the INFN Globus group
Work Package 1 Installation and Evaluation of the Globus Toolkit Massimo Sgaravatto INFN Padova.
Deployment Team. Deployment –Central Management Team Takes care of the deployment of the release, certificates the sites and manages the grid services.
Evaluation of the Globus Toolkit: Status Roberto Cucchi – INFN Cnaf Antonia Ghiselli – INFN Cnaf Giuseppe Lo Biondo – INFN Milano Francesco Prelz – INFN.
A Computation Management Agent for Multi-Institutional Grids
WP 1 Grid Workload Management Massimo Sgaravatto INFN Padova.
CMS HLT production using Grid tools Flavia Donno (INFN Pisa) Claudio Grandi (INFN Bologna) Ivano Lippi (INFN Padova) Francesco Prelz (INFN Milano) Andrea.
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
EU-GRID Work Program Massimo Sgaravatto – INFN Padova Cristina Vistoli – INFN Cnaf as INFN members of the EU-GRID technical team.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
GRID workload management system and CMS fall production Massimo Sgaravatto INFN Padova.
Status of Globus activities within INFN Massimo Sgaravatto INFN Padova for the INFN Globus group
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
Globus activities within INFN Massimo Sgaravatto INFN Padova for the INFN Globus group
INFN-GRID Globus evaluation Massimo Sgaravatto INFN Padova for the INFN Globus group
Report on the INFN-GRID Globus evaluation Massimo Sgaravatto INFN Padova for the INFN Globus group
GRID Workload Management System Massimo Sgaravatto INFN Padova.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Globus activities within INFN Massimo Sgaravatto INFN Padova for the INFN Globus group
Workload Management Massimo Sgaravatto INFN Padova.
First steps implementing a High Throughput workload management system Massimo Sgaravatto INFN Padova
Status of Globus activities within INFN (update) Massimo Sgaravatto INFN Padova for the INFN Globus group
First ideas for a Resource Management Architecture for Productions Massimo Sgaravatto INFN Padova.
Evaluation of the Globus GRAM Service Massimo Sgaravatto INFN Padova.
INFN-GRID Globus evaluation (WP 1) Massimo Sgaravatto INFN Padova for the INFN Globus group
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
GRID The GRID distribution toolkit at INFN Flavia Donno (INFN Pisa) Andrea Sciaba` (INFN Pisa) Zhen Xie (INFN Pisa) presented by Massimo Sgaravatto (INFN.
DATAGRID ConferenceTestbed0 - resources in Italy Luciano Gaido 1 DATAGRID WP6 Testbed0 resources in Italy Amsterdam March,
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
DataGrid WP1 Massimo Sgaravatto INFN Padova. WP1 (Grid Workload Management) Objective of the first DataGrid workpackage is (according to the project "Technical.
11 December 2000 Paolo Capiluppi - DataGrid Testbed Workshop CMS Applications Requirements DataGrid Testbed Workshop Milano, 11 December 2000 Paolo Capiluppi,
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
A monitoring tool for a GRID operation center Sergio Andreozzi (INFN CNAF), Sergio Fantinel (INFN Padova), David Rebatto (INFN Milano), Gennaro Tortone.
Grid Workload Management Massimo Sgaravatto INFN Padova.
The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.
DataGrid Workshop Oxford, July 2-5 INFN Testbed status report Luciano Gaido 1 DataGrid Workshop INFN Testbed status report L. Gaido Oxford July,
Report from USA Massimo Sgaravatto INFN Padova. Introduction Workload management system for productions Monte Carlo productions, data reconstructions.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Globus Toolkit Massimo Sgaravatto INFN Padova. Massimo Sgaravatto Introduction Grid Services: LHC regional centres need distributed computing Analyze.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
GRID Zhen Xie, INFN-Pisa, on DataGrid WP6 meeting1 Globus Installation Toolkit Zhen Xie On behalf of grid-release team INFN-Pisa.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Proposal for a IS schema Massimo Sgaravatto INFN Padova.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Report on the INFN-GRID Globus evaluation Massimo Sgaravatto INFN Padova for the INFN Globus group
GRID The GRID distribution toolkit at INFN Flavia Donno (INFN Pisa) Andrea Sciaba` (INFN Pisa) Zhen Xie (INFN Pisa) presented by Massimo Sgaravatto (INFN.
Condor on WAN D. Bortolotti - INFN Bologna T. Ferrari - INFN Cnaf A.Ghiselli - INFN Cnaf P.Mazzanti - INFN Bologna F. Prelz - INFN Milano F.Semeria - INFN.
6 march Building the INFN Grid Proposal outline a.ghiselli,l.luminari,m.sgaravatto,c.vistoli INFN Grid meeting, milano.
Summary from WP 1 Parallel Section Massimo Sgaravatto INFN Padova.
4/9/ 2000 I Datagrid Workshop- Marseille C.Vistoli Wide Area Workload Management Work Package DATAGRID project Parallel session report Cristina Vistoli.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
Grid Workload Management (WP 1) Massimo Sgaravatto INFN Padova.
INFN “Grid Information Service” evaluation Giuseppe Lo Biondo - INFN Sez. Di Milano Giulietta Vita Finzi - INFN CNAF Padova June
WP1 Status and plans Francesco Prelz, Massimo Sgaravatto 4 th EDG Project Conference Paris, March 6 th, 2002.
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
First evaluation of the Globus GRAM service Massimo Sgaravatto INFN Padova.
Workload Management Workpackage
Report on GLUE activities 5th EU-DataGRID Conference
Wide Area Workload Management Work Package DATAGRID project
I Datagrid Workshop- Marseille C.Vistoli
GRID Workload Management System for CMS fall production
Presentation transcript:

Installation and evaluation of the Globus toolkit WP 1 INFN-GRID Workload management WP 1 DATAGRID WP 2.1 INFN-GRID Massimo Sgaravatto INFN Padova

Globus To make productions and analyses on a Grid some basic services (security, information service, resource management, …) must be implemented as first step Globus identified as possible Grid framework providing these services … but it has been developed mainly for traditional computing, different from computing in HEP High throughput vs. High performance PC farms vs Supercomputers Distributed data intensive computing Need to assess what can be used for HEP environment (PC clusters, distributed PB of data, …) WP1 Installation and Evaluation of the Globus Toolkit of the INFN- GRID Project Goal: evaluation of the Globus toolkit Which services can be useful ? What is necessary to integrate/modify ? What is missing ? Duration: 6 months Results of this first evaluation used to plan future activities

Globus: Tasks Globus deployment Reduce complexity and manpower for Globus installation and maintenance Security To access GRID resources mechanisms for user authentication needed Evaluation of GSI service (based on certificates) Information Service To discover the GRID resources (CPU, storage, network, …) mechanisms to publish them must be defined Analysis of GIS service to publish information using a uniform and standard interface Resource Management Necessary a uniform interface to submit jobs on GRID resources Uniform standard interface to different resource management systems Uniform standard language for task management Assessment of Globus GRAM service for resource allocation and process management

Globus: Tasks Data Access and Migration High performance and reliable tools needed to manage data (data transfers, wide area replica between regional centers, …) Assessment of Globus tools for data management (GASS, Globus ftp) Fault Monitoring Faults in a GRID environment must be promptly detected and recovery mechanisms must be implemented Evaluation of HBM service for fault detection Execution Environment Management Code migration (moving the application where the job will actually be executed) as a possible implementation strategy Evaluation of GEM service to support code migration

Globus: Deliverables & Milestones Deliverables Tools, documentation and operational procedures for Globus deployment (6 Months) Final report on suitability of the Globus toolkit as basic Grid infrastructure (6 Months) Milestones Basic deployment Grid infrastructure for the INFN GRID (6 months)

Globus: Personnel Contribution (FTE) Bari M. DAmato (0.2), XXX (0.5) Bologna F. Semeria (0.2) Catania R. Barbera(0.1), E. Cangiano (0.1), C. Rocca (0.1) CNAF L. Fonti (0.2), F. Giacomini (0.1), A. Italiano(0.1), G. Vita Finzi (0.4) Lecce M. Cafaro (0.1), L. Depaolis (0.1) LNL M. Biasotto (0.1) Milano G. Lo Biondo (0.1), F. Prelz (0.1) Padova M. Sgaravatto (0.2) Pisa G. Bagliesi (0.3), A. Controzzi (0.5), F. Donno (0.3), A. Sciaba` (0.3), Z. Xie (0.3) Roma 1 D. Anzellotti (0.1), M. De Rossi (0.1), E. Majorana (0.1), C. Palomba (0.1), D. Rossetti (0.1), A. Spanu (0.1), E. Valente (0.1) Torino C. Anglano (0.1), A. Forte (0.1), L. Gaido (0.1) Total: 5.4 FTE

Globus: First Activities and Results Globus deployment Globus installed on ~ 30 machines in 11 different sites Bari, Bologna, Catania, CNAF, Ferrara, LNL, Milano, Padova, Napoli, Pisa, Torino First tools to quasi-automatically install and deploy Globus Bug fixes and INFN customizations included Released for DataGrid Interest from Globus team on these activities Information Service INFN MDS server (for Globus and installations, that use a centralized model) Definition and on-going implementation of test architecture for Grid Information Service (for Globus installations, that consider a distributed model) Web interface for browsing Preliminary activities on integrating the default schema with other info

Dc=bo, Dc=infn, dc=it,o=grid Bologna GIIS INFN CMS GIIS GIIS Dc=pd,Dc=infn, dc=it,o=grid Exp=cms, o=grid Top Level INFN GIIS Dc=infn,dc=it, o=grid Padova INFN GIS Architecture (test phase)

Globus: First Activities and Results Security Globus architecture for authentication evaluated Reliable, but some improvements needed … Use of INFN Certification Authority to issue Globus certificates: ok Resource Management (activities strictly related with workload management WP) Job submission tests on remote GRID resources considering different underlying resource management systems (Condor, LSF, PBS) Evaluation of Globus Resource Specification Language as uniform language to specify resources Cooperation with Grid Information Service Information on characteristics and status of local resources must be published

Workload Management in the DataGrid project (WP 1) Goal: define and implement a suitable architecture for distributed scheduling and resource management in a GRID environment Large heterogeneous environment Large numbers (thousands) of independent users Many challenging issues : Optimizing the choice of execution location based on the availability of data, computation and network resources Optimal co-allocation and advance reservation of CPU, data, network Uniform interface to possible different local resource management systems Priorities, policies on resource usage Reliability Fault tolerance Scalability … INFN responsibility in DataGrid

DataGrid Workload Mgmt: Effort breakdown (mm) FundedUnfunded INFN DATAMAT1080 CESnet PPARC

Workload Management in the INFN-GRID project (WP 2.1) Integration, adaptation and deployment of middleware developed within the DataGrid project GRID software must enable physicists to run their jobs using all the available GRID resources in a transparent way HEP applications classified in 3 different classes, with incremental level of complexity Workload management system for Monte Carlo productions Goal: throughput maximization Implementation strategy: code migration (moving the application where the processing will be performed) Workload management system for data reconstruction and production analysis Goal: throughput maximization Implementation strategy: code migration + data migration (moving the data where the processing will be performed, and collecting the outputs in a central repository) Workload management system for individual physics analysis Chaotic processing Goal: latency minimization Implementation strategy: code migration + data migration + remote data access (accessing data remotely) for client/server applications (dynamical decision)

Workload Mgmt: Personnel Contribution (FTE) Bologna F. Semeria (0.5) Catania F. Barbanera (0.2), S. Cavalieri (0.4), E. Commis (0.4), L. Lo Bello (0.4), O. Mirabella (0.4), S. Monforte (0.4), V. Sassone (0.2), L. Vita (0.4) CNAF P. Ciancarini (0.5), T. Ferrari (0.2), L. Fonti (0.3), F. Giacomini (1), A. Ghiselli (0.3), C. Vistoli (0.5) Ferrara M. Gambetti (0.2), A. Gianoli (0.2), E. Luppi (0.1) Lecce G. Aloisio (0.2), M. Cafaro (0.4), S. Campeggio (0.2), L. Depaolis (0.1). E. Fasanelli (0.2), XXX (1) Milano F. Prelz (0.3) Padova S. Orlando (0.3), M. Sgaravatto (0.4) Roma1 G. Mirabelli (0.3) Torino C. Anglano (0.3), S. Donatelli (0.3), L. Gaido (0.3), A. Welbrouck (0.3), YYY (0.9), ZZZ (1) Total: 13.1 FTE INFN participation to Workload Mgmt in DataGrid included

Workload Mgmt: First Activities and Results CMS-HLT use case analyzed in terms of GRID requirements and GRID tools availability Discussions with Globus team and Condor team Good and productive collaborations already in place Definition of a possible high throughput workload management system architecture Use of Globus and Condor mechanisms But major developments needed On going activities in putting together the various building blocks Deep tests on GRAM functionalities already done and Globus-Condor interface already in place

High throughput workload management system architecture (simplified design) Globus GRAM CONDOR Globus GRAM LSF Globus GRAM PBS globusrun Site1 Site2Site3 condor_submit (Globus Universe) Condor-G Master Grid Information Service (GIS) Submit jobs (using Class-Ads) Resource Discovery Information on characteristics and status of local resources Local Resource Management Systems Globus GRAM as uniform interface to different local resource management systems Condor-G able to provide reliability Use of Condor tools for job monitoring, logging, … Master chooses in which Globus resources the jobs must be submitted Farms

Other info INFN-GRID WP 1 (Globus) web site: INFN-GRID/Globus