Pierre Girard Réunion CMS

Slides:



Advertisements
Similar presentations
Mardi 30 mars 2010 Lavoisier : a way to integrate heteregeneous monitoring systems. Cyril LOrphelin IN2P3/CNRS Computing Centre, Lyon, France.
Advertisements

1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
A module to customize CREAM jobs according to site policies Tsukuba, KEK, 21 December 2010 Sylvain Reynaud JWGEN :
The Community Authorisation Service – CAS Dr Steven Newhouse Technical Director London e-Science Centre Department of Computing, Imperial College London.
Workload Management Massimo Sgaravatto INFN Padova.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
INFSO-RI Enabling Grids for E-sciencE XACML and G-PBox update MWSG 14-15/09/2005 Presenter: Vincenzo Ciaschini.
G RID M IDDLEWARE AND S ECURITY Suchandra Thapa Computation Institute University of Chicago.
Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI.
11/30/2007 Overview of operations at CC-IN2P3 Exploitation team Reported by Philippe Olivero.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System on gLite middleware Vincent.
13 May 2004EB/TB Middleware meeting Use of R-GMA in BOSS for CMS Peter Hobson & Henry Nebrensky Brunel University, UK Some slides stolen from various talks.
INFSO-RI Enabling Grids for E-sciencE 1 Downtime Process Author : Osman AIDEL Hélène Cordier.
Mine Altunay July 30, 2007 Security and Privacy in OSG.
Light weight Disk Pool Manager experience and future plans Jean-Philippe Baud, IT-GD, CERN September 2005.
1 User Analysis Workgroup Discussion  Understand and document analysis models  Best in a way that allows to compare them easily.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
USATLAS deployment We currently use VOMS Role based authorization in production within USATLAS. In the VO we have defined 4 groups/roles that satisfy our.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
The CMS Top 5 Issues/Concerns wrt. WLCG services WLCG-MB April 3, 2007 Matthias Kasemann CERN/DESY.
The new FTS – proposal FTS status. EMI INFSO-RI /05/ FTS /05/ /05/ Bugs fixed – Support an SE publishing more than.
LCG WLCG Accounting: Update, Issues, and Plans John Gordon RAL Management Board, 19 December 2006.
INFSO-RI Enabling Grids for E-sciencE Policy management and fair share in gLite Andrea Guarise HPDC 2006 Paris June 19th, 2006.
2011/11/03 Partial downtimes management Pierre Girard WLCG T1 Service Coordination Meeting.
SAM Database and relation with GridView Piotr Nyczyk SAM Review CERN, 2007.
StoRM + Lustre Proposal YAN Tian On behalf of Distributed Computing Group
Operation team at Ccin2p3 Suzanne Poulat –
DGAS Distributed Grid Accounting System INFN Workshop /05/1009, Palau Giuseppe Patania Andrea Guarise 6/18/20161.
CERN LCG1 to LCG2 Transition Markus Schulz LCG Workshop March 2004.
2007/05/22 Integration of virtualization software Pierre Girard ATLAS 3T1 Meeting
OSG VO Security Policies and Requirements Mine Altunay OSG Security Team July 2007.
Architectural Framework Presentation Vincenzo Ciaschini CNAF 15/5/06.
Job Priorities and Resource sharing in CMS A. Sciabà ECGI meeting on job priorities 15 May 2006.
Vendredi 27 avril 2007 Management of ATLAS CC-IN2P3 Specificities, issues and advice.
CE design report Luigi Zangrando
EGEE-II INFSO-RI Enabling Grids for E-sciencE Simone Campana (CERN) Job Priorities: status.
LCG A few slides for the discussion on VOMS Kors Bos, NIKHEF, Amsterdam GDB Oct.4, 2006.
The Grid Information System Maria Alandes Pradillo IT-SDC White Area Lecture, 4th June 2014.
CMS-specific services and activities at CC-IN2P3 Farida Fassi October 23th.
EGRID Project: Experience Report Implementation of a GRID Infrastructure for the Analysis of Economic and Financial data.
HTCondor Accounting Update
Service Availability Monitoring
Workload Management Workpackage
Job monitoring and accounting data visualization
The EDG Testbed Deployment Details
Classic Storage Element
Open Science Grid Progress and Status
VOs and ARC Florido Paganelli, Lund University
Key Activities. MND sections
GDB 8th March 2006 Flavia Donno IT/GD, CERN
lcg-infosites documentation (v2.1, LCG2.3.1) 10/03/05
Sergio Fantinel, INFN LNL/PD
Lavoisier : a way to integrate heteregeneous monitoring systems.
Grid services for CMS at CC-IN2P3
CREAM-CE/HTCondor site
CC IN2P3 - T1 for CMS: CSA07: production and transfer
Grid Deployment Board meeting, 8 November 2006, CERN
Short update on the latest gLite status
Scalability Tests With CMS, Boss and R-GMA
Glexec/SCAS Pilot: IN2P3-CC status
Artem Trunov and EKP team EPK – Uni Karlsruhe
Summary from last MB “The MB agreed that a detailed deployment plan and a realistic time scale are required for deploying glexec with setuid mode at WLCG.
The Scheduling Strategy and Experience of IHEP HTCondor Cluster
Pierre Girard ATLAS Visit
lundi 25 février 2019 FTS configuration
Grid Management Challenge - M. Jouvin
Information System (BDII)
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

Pierre Girard (pierre.girard@in2p3.fr) Réunion CMS 2007-07-11 18/07/2018 2007/07/11 T1 & T2 at CCIN2P3 Pierre Girard (pierre.girard@in2p3.fr) Réunion CMS 2007-07-11

Administrative and Technical issues CMS use case 18/07/2018 Content Objectives Administrative and Technical issues CMS use case Current CMS’ requirements Last changes made at IN2P3-CC Next steps Expected changes Open questions Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

Deployment of both a T1 and a T2 over the same computing centre 18/07/2018 Objectives Deployment of both a T1 and a T2 over the same computing centre Sharing the same computing farm and using the same LRMS Being able to manage separetely the production of each grid site Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

Administrative and Technical issues 18/07/2018 Administrative and Technical issues Administrative matters A MoU by grid site Publishing separate accounting for T1 and T2 Requiring to declare a second site in the GOC DB Different VO activities to manage through BQS Jobs management policies must implement both commitments of T1 and commitments of T2 Technical issues we had to solve How to publish accounting for 2 sites while using same farm How to implement different site policies while using same farm Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

CMS use case CMS’ requirements 18/07/2018 CMS use case CMS’ requirements T1 site policy T1 job slots = (CMS’ job slots x #CPUT1) / (#CPUT1 + #CPUT2) VOMS Role « lcgadmin » VOMS Role « production » Regular users T2 site policy T2 job slots = (CMS’ job slots x #CPUT2) / (#CPUT1 + #CPUT2) Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

CMS use case Last changes at IN2P3-CC(1) 18/07/2018 CMS use case Last changes at IN2P3-CC(1) Taking benefit of the last downtime We revisited our CEs mapping strategy By prohibiting account overlapping between local sites By splitting the grid accounts into 2 subsets We put in production a new site Site-BDII CE (Atlas, Cms) No SE for now, but T1’s SRM SE is declared as close SE of T2’s CE We extended our accounting system to take into account multiple (logical) sites logical sites are mutually exclusive subsets of CEs Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

CMS use case Last changes at IN2P3-CC(2) 18/07/2018 CMS use case Last changes at IN2P3-CC(2) State before last downtime T1 Site BDII CE01 CE02 CE03 CMS Mapping policy Site policy AFS rw access BQS priorities Jobs slot max Role production Role lcgadmin All others cms050 cmsgrid cms[001-049] Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

CMS use case Last changes at IN2P3-CC(3) 18/07/2018 CMS use case Last changes at IN2P3-CC(3) State after downtime T1 Site BDII T2 Site BDII CE01 CE02 CE03 CE04 CE05 Mapping policy Mapping policy production lcgadmin All others cms050 cmsgrid cms[001-024] production lcgadmin All others cms049 cmsgrid cms[025-048] Site T1 policy Site T2 policy Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

CMS use case Last changes at IN2P3-CC(4) 18/07/2018 CMS use case Last changes at IN2P3-CC(4) Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

CMS use case Last changes at IN2P3-CC(5) 18/07/2018 CMS use case Last changes at IN2P3-CC(5) Accounting is published for both T1 and T2 http://www3.egee.cesga.es/gridsite/accounting/CESGA/egee_view.php Accounting RGMA T1 Site BDII CE4 CE3 CE2 CE1 CC/T1 CC/T2 BQS Anastasie WN Computing MonBox 5 Sites T2 Site BDII CE5 Filtering from CE Hostname Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

CMS use case Last changes at IN2P3-CC(6) 18/07/2018 CMS use case Last changes at IN2P3-CC(6) Cherry on top of the cake We are now publishing several clusters by CE Each queue is linked to one cluster according to its type (short, medium, long) Each cluster defines the max amount of memory by job for the related queues Should solve the problem of jobs submitted on the wrong queue because of a requirement on RAMSize only Classical BQS error :« Memory size exceeded …» Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

CMS use case Last changes at IN2P3-CC(6) 18/07/2018 CMS use case Last changes at IN2P3-CC(6) Cherry on top of the cake We are now publishing several clusters by CE Each queue is linked to one cluster according to its type (short, medium, long) Each cluster defines the max amount of memory by job for the related queues Should solve the problem of jobs submitted on the wrong queue because of a requirement on RAMSize only Classical BQS error :« Memory size exceeded …» {ccali22}~(0)>lcg-info --list-ce --vo cms --attrs Memory --query CE="cclcgceli05*" - CE: cclcgceli05.in2p3.fr:2119/jobmanager-bqs-cms_long - Memory 2048 - CE: cclcgceli05.in2p3.fr:2119/jobmanager-bqs-medium - Memory 1024 - CE: cclcgceli05.in2p3.fr:2119/jobmanager-bqs-short - Memory 512 Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

18/07/2018 Next steps Sites policies must be adapted to meet both T1 and T2 commitments Update the current prioritization script to apply the quotas with the new mapping policy Sites publications must reflect the difference between T1 and T2 Ongoing work New BQS information provider should integrate the production CEs during summer Must enforce that each site is well used for what it is made For accounting concerns, it is important to use the T1 this summer Number of accounts by pool will be increased cms[001-100] Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

18/07/2018 Expected changes CMS should define another roles/groups combination for T2 than the one for T1 For example: /cms/reconstruction/Role=production (T1) /cms/simulation/Role=production (T2) /cms/analysis/Role=production (T2) In order to clearly separate T1 and T2 VOMS information publication is needed to definitely identify what/whom a queue is for Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11

Open questions Close SE issue Classic SE issues CMS is using CloseSE to choose the CE T1 and T2 are sharing ccsrm SE for now This strategy doesn’t work to choose either CCIN2P3 T1 site, or CCIN2P3 T2 site How do you proceed otherwise ? Classic SE issues cclcgseli02 is used to access SPS through gridftp This solution is not scalable Classic SE are about to definitely disappear (end of summer) Is there any plan B ? Pierre Girard / CMS / T1 & T2 at CCIN2P3 2007/07/11