EGEE is a project funded by the European Union under contract IST-2003-508833 Issues from current Experience SA1 Feedback to JRA1 A. Pacheco PIC Barcelona.

Presentation transcript:

Issues from current Experience: SA1 Feedback to JRA1
A. Pacheco, PIC Barcelona
LCG Workshop, 2-4 Nov
EGEE is a project funded by the European Union under contract IST-2003-508833

Motivation
- Goal of this talk: to start the discussion on the grid middleware components.
- Some experience has now been collected thanks to LCG2.
- This year LCG has gained experience operating with the number of sites expected at the start of LHC and half the number of nodes expected in the Tier-0.
- However, the start-up of the EGEE project and its ROC-based structure bring new challenges.

Trends observed in SA1
- In EGEE SA1 we are observing a change in the profile of site administrators with respect to LCG.
- SA1's experience is that new sites use untrained or recently recruited people to deploy their initial grid middleware services. Some of them have never installed a Linux cluster before.
- Site administrators expect a complete solution pack with documentation, recipes and point-and-click installation kits.
- They do not expect to edit, copy, cut and paste configuration data, or to search for hidden log files.
- The effort available to install and run the grid is not persistent. Once the middleware is installed, they expect it to run unless something breaks or an upgrade is needed. No baby-sitting.

Typical scenario for new sites
- One or more Linux administrators with the mandate to install an EGEE grid site by tomorrow, especially in order to appear on the GOC site map!
- One PC with and web access...
- One printer (for the documentation,...)...
- One web address
- A number 1..n of PC boxes in a rack, on a shelf, on a table or even on the floor....

What do they require or expect...
- A complete set of documentation in electronic format (introduction, installation, operations, user and reference manuals) for all components.
- A complete set of products with the associated installation kits.
- A secure web interface to apply, register and maintain site information, service information and user profiles.
- A script to install the components in the appropriate order, with or without automatic installation tools.
- A script to configure the components for the most common scenarios.
- A script to verify the installation and configuration of the components.
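
The requirement for a script that installs components in the appropriate order can be illustrated with a dependency sort. This is only a minimal sketch: the component names and the dependencies between them are hypothetical, chosen for illustration, and are not taken from the talk or from any real middleware release.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical components: each key maps to the set of components
# that must be installed before it.
DEPENDENCIES = {
    "base-os-packages": set(),
    "information-system": {"base-os-packages"},
    "storage-element": {"base-os-packages", "information-system"},
    "computing-element": {"base-os-packages", "information-system"},
    "worker-node": {"computing-element"},
}


def install_order(dependencies):
    """Return the components so that each one comes after its dependencies.

    TopologicalSorter raises graphlib.CycleError if the dependencies
    contain a cycle, which would indicate a packaging conflict.
    """
    return list(TopologicalSorter(dependencies).static_order())


if __name__ == "__main__":
    for step, name in enumerate(install_order(DEPENDENCIES), start=1):
        print(f"{step}. install {name}")
```

A real installer would replace the print statement with the site's own installation command. The point of the sketch is only that the ordering can be derived from declared dependencies rather than maintained by hand.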

What do they require or expect (2)
- A complete management toolkit with an interface to operate the components and services.
- A standard environment for easy log file examination, debugging, understanding of error codes and problem tracing.
- An automatic tool to update the configuration files and to upgrade to the latest version of the components.
- Access to a professional support channel to get solutions when needed.
- Access to a technical or operations channel to stay informed of new releases, operational issues, etc.

Preliminary conclusions
- Middleware delivery and requirements must be adapted to the new scenarios in EGEE. Clients are no longer computer experts located in HEP institutes.
- The next slides list issues concerning the current deployment of middleware. These issues have been discussed at ROC management level.
- This requirement list is incomplete. A more detailed list can be obtained from the JRA1-SA1 meetings.

Issues
- Need for easy installation of middleware software
  - Current experience is that the released middleware is very difficult to install and integrate in preproduction testbeds.
  - Dependencies and conflicts with other packages should be clearly identified and eliminated.
  - Use of tools like apt4rpm to convert an rpm repository into an apt, yum or metadata repository should be encouraged.
- Need for easy configuration and verification scripts
  - Some components, such as VO Support and R-GMA, are known to be particularly difficult to configure.
  - A complete verification procedure should be provided, to be run before certification is requested.
  - Configuration files should be concentrated in one directory or in a single file.
  - Configuration management should be centralized using lcfgng/quattor tools.
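
As an illustration of what a verification script over a single configuration file might look like, here is a minimal sketch. The file layout, section names and keys are all assumptions made for this example. They do not describe the format used by LCG2, gLite, lcfgng or quattor.

```python
import configparser

# Hypothetical required settings: section name -> keys that must be non-empty.
REQUIRED = {
    "site": ["name", "contact_email"],
    "computing-element": ["hostname", "batch_system"],
    "storage-element": ["hostname"],
}

# Hypothetical site configuration kept in one file (inlined here as a string).
SAMPLE = """
[site]
name = EXAMPLE-SITE
contact_email = admin@example.org

[computing-element]
hostname = ce.example.org
batch_system =

[storage-element]
hostname = se.example.org
"""


def verify_config(text):
    """Return a list of readable problems; an empty list means the check passed."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    problems = []
    for section, keys in REQUIRED.items():
        if not parser.has_section(section):
            problems.append(f"missing section [{section}]")
            continue
        for key in keys:
            # A key that is absent or set to an empty value counts as a problem.
            if not parser.get(section, key, fallback="").strip():
                problems.append(f"[{section}] {key} is missing or empty")
    return problems


if __name__ == "__main__":
    for problem in verify_config(SAMPLE):
        print("PROBLEM:", problem)
```

Reporting every problem in one pass, rather than stopping at the first, suits administrators who are expected to fix a site quickly and without deep knowledge of the components.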

Issues (2)
- Need for easy upgrade and site maintenance
  - Continuous effort may be scarce. Once the middleware is installed, automate as much as possible.
- Need for management tools
  - In order to be operated and configured correctly, the services should provide a secure interface that allows remote query and management of their operational state.
- Use of coherent naming conventions and procedures
  - Important for fast debugging and inter-component problem tracing.
- Need for an improved computing element component
  - The middleware has difficulty expressing job requirements to the job scheduler.

Issues (3)
- Need for improvements in the storage element
  - No tools to manage the request queue or to redirect requests.
  - Any reconfiguration or maintenance results in lost service requests.
  - GridFTP storage elements offer no way to request an asynchronous copy from tape to the HSM disk cache (pre-staging) before the files are actually copied. This would be useful for data challenges. Solved with SRM GET.
- Need for a single support channel for middleware
  - Clarify the support interfaces: GGUS, Savannah, LCG-ROLLOUT.
- Need for a realistic plan to deploy gLite
  - Experiments depend on the new middleware, but planning its deployment is difficult with 80 sites and 7000 computers widely distributed.

Conclusions
- The key to the EGEE project is the spread of grid technology worldwide.
- The configuration and management areas should be improved.
- A state machine for operational supervision of components and their interdependencies, from a local as well as a global perspective, is needed. HEP is very experienced in DAQ systems...
- The discussions at this LCG workshop will be very important for prioritizing or refocusing the work to be done.
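
The state machine proposed in the conclusions is not specified in the talk. The following is one possible sketch under stated assumptions: the states, the allowed transitions, the component names and the dependency map are all hypothetical. The local perspective is the transition table of a single component. The global perspective marks a running component as degraded when something it depends on is not running.

```python
from enum import Enum


class State(Enum):
    STOPPED = "stopped"
    RUNNING = "running"
    FAILED = "failed"
    DEGRADED = "degraded"  # only appears in the global view


# Local perspective: transitions a single component may make (assumed).
ALLOWED = {
    State.STOPPED: {State.RUNNING},
    State.RUNNING: {State.STOPPED, State.FAILED},
    State.FAILED: {State.STOPPED},
}

# Hypothetical interdependencies: component -> components it relies on.
# The map is assumed to be acyclic.
DEPENDS_ON = {
    "information-system": [],
    "computing-element": ["information-system"],
    "storage-element": ["information-system"],
}


def transition(current, target):
    """Return the new local state, or raise ValueError if the move is not allowed."""
    if target not in ALLOWED.get(current, set()):
        raise ValueError(f"cannot go from {current.value} to {target.value}")
    return target


def global_view(local, depends_on):
    """Combine local states with dependencies into a site-wide picture."""
    resolved = {}

    def resolve(name):
        if name in resolved:
            return resolved[name]
        state = local[name]
        if state is State.RUNNING:
            # A running component is degraded if any dependency is not healthy.
            for dep in depends_on.get(name, []):
                if resolve(dep) is not State.RUNNING:
                    state = State.DEGRADED
                    break
        resolved[name] = state
        return state

    for name in local:
        resolve(name)
    return resolved
```

For example, if the information system has failed while the computing element is still running locally, the global view reports the computing element as degraded, which is the kind of inter-component diagnosis the slides ask for.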