Status of the Production and Nagios news ALICE TF Meeting 29/07/2010.

Slides:



Advertisements
Similar presentations
1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
Advertisements

Calculating Partial Benefits Problems and Solutions.
CREAM: Update on the ALICE experiences WLCG GDB Meeting Patricia Méndez Lorenzo (IT/GS) CERN, 11th March 2009.
15/07/2010Swiss WLCG Operations Meeting Summary of the last GridKA Cloud Meeting (07 July 2010) Marc Goulette (University of Geneva)
Please open your laptops, log in to the MyMathLab course web site, and open Daily Quiz 18. You will have 10 minutes for today’s quiz. The second problem.
ALICE Operations short summary and directions in 2012 Grid Deployment Board March 21, 2011.
ALICE Operations short summary and directions in 2012 WLCG workshop May 19-20, 2012.
March 27, IndiaCMS Meeting, Delhi1 T2_IN_TIFR of all-of-us, for all-of-us, by some-of-us Tier-2 Status Report.
On Sunday, it was 59° at 10:00. From 8:00 until 10:00 it had cooled off by 4 degrees. What was the temperature at 8:00?
AMOD Report Doug Benjamin Duke University. Hourly Jobs Running during last week 140 K Blue – MC simulation Yellow Data processing Red – user Analysis.
Large scale data flow in local and GRID environment V.Kolosov, I.Korolko, S.Makarychev ITEP Moscow.
Patricia Méndez Lorenzo (IT/GS) ALICE Offline Week (18th March 2009)
WLCG ‘Weekly’ Service Report ~~~ WLCG Management Board, 22 th July 2008.
Computing Infrastructure Status. LHCb Computing Status LHCb LHCC mini-review, February The LHCb Computing Model: a reminder m Simulation is using.
Status of PDC’07 Latchezar Betev TF meeting – April 5, 2007.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES P. Saiz (IT-ES) AliEn job agents.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
GGUS summary ( 4 weeks ) VOUserTeamAlarmTotal ALICE ATLAS CMS LHCb Totals 1.
LCG Plans for Chrsitmas Shutdown John Gordon, STFC-RAL GDB December 10 th, 2008.
ACTIVE VOICE. TO BE PastPresentFuture Regular Action Finished Action Simple /Indefinite Perfect Yesterday, 3 days ago, last winter, in 1917 Today Tomorrow,
Status of the production and news about Nagios ALICE TF Meeting 22/07/2010.
WLCG Service Report ~~~ WLCG Management Board, 1 st September
CCRC’08 Weekly Update Jamie Shiers ~~~ LCG MB, 1 st April 2008.
1 PRAGUE site report. 2 Overview Supported HEP experiments and staff Hardware on Prague farms Statistics about running LHC experiment’s DC Experience.
CSCS Status Peter Kunszt Manager Swiss Grid Initiative CHIPP, 21 April, 2006.
Update on replica management
WLCG Service Report ~~~ WLCG Management Board, 9 th August
The LHCb Italian Tier-2 Domenico Galli, Bologna INFN CSN1 Roma,
 Status of the ALICE Grid Patricia Méndez Lorenzo (IT)ALICE OFFLINE WEEK, CERN 18 October 2010.
CERN – Alice Offline – Thu, 27 Mar 2008 – Marco MEONI - 1 Status of RAW data production (III) ALICE-LCG Task Force weekly.
CREAM: ALICE Experience WLCG GDB Meeting, CERN 11th November 2009 Stefano Bagnasco (INFN-Torino), Jean-Michel Barbet (Subatech), Latchezar Betev (ALICE),
Experiment Operations: ALICE Report WLCG GDB Meeting, CERN 14th October 2009 Patricia Méndez Lorenzo, IT/GS-EIS.
1 WLCG-GDB Meeting. CERN, 12 May 2010 Patricia Méndez Lorenzo (CERN, IT-ES)
May Donatella Lucchesi 1 CDF Status of Computing Donatella Lucchesi INFN and University of Padova.
13 October 2004GDB - NIKHEF M. Lokajicek1 Operational Issues in Prague Data Challenge Experience.
WLCG Service Report ~~~ WLCG Management Board, 7 th September 2010 Updated 8 th September
Status of the Production ALICE TF MEETING 11/02/2010.
Production Activities and Results by ALICE Patricia Méndez Lorenzo (on behalf of the ALICE Collaboration) Service Challenge Technical Meeting CERN, 15.
WLCG Service Report ~~~ WLCG Management Board, 31 st March 2009.
WLCG Service Report ~~~ WLCG Management Board, 18 th September
WLCG Service Report ~~~ WLCG Management Board, 23 rd November
OPERATIONS REPORT JUNE – SEPTEMBER 2015 Stefan Roiser CERN.
WLCG ‘Weekly’ Service Report ~~~ WLCG Management Board, 5 th August 2008.
SL5 Site Status GDB, September 2009 John Gordon. LCG SL5 Site Status ASGC T1 - will be finished before mid September. Actually the OS migration process.
Christmas running post- mortem (Part III) ALICE TF Meeting 15/01/09.
8 August 2006MB Report on Status and Progress of SC4 activities 1 MB (Snapshot) Report on Status and Progress of SC4 activities A weekly report is gathered.
Data transfers and storage Kilian Schwarz GSI. GSI – current storage capacities vobox LCG RB/CE GSI batchfarm: ALICE cluster (67 nodes/480 cores for batch.
ALICE Computing Model A pictorial guide. ALICE Computing Model External T1 CERN T0 During pp run i (7 months): P2: data taking T0: first reconstruction.
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
Current status WMS and CREAM CE deployment Patricia Mendez Lorenzo ALICE TF Meeting (CERN, 02/04/09)
SAM Status Update Piotr Nyczyk LCG Management Board CERN, 5 June 2007.
GRID interoperability and operation challenges under real load for the ALICE experiment F. Carminati, L. Betev, P. Saiz, F. Furano, P. Méndez Lorenzo,
CERN IT Department CH-1211 Genève 23 Switzerland t CHEP 2009, Monday 26rd March 2009 (Prague) Patricia Méndez Lorenzo on behalf of the IT/GS-EIS.
CREAM CE: upgrades in the system  Migration of the ALICE production queue in the CREAM CE: DONE  From pps-cream-fzk.gridka.de:8443/cream-pbs-pps to.
Status of GSDC, KISTI Sang-Un Ahn, for the GSDC Tier-1 Team
Dominique Boutigny December 12, 2006 CC-IN2P3 a Tier-1 for W-LCG 1 st Chinese – French Workshop on LHC Physics and associated Grid Computing IHEP - Beijing.
GGUS summary (3 weeks) VOUserTeamAlarmTotal ALICE7029 ATLAS CMS LHCb Totals
UK Status and Plans Catalin Condurache – STFC RAL ALICE Tier-1/Tier-2 Workshop University of Torino, February 2015.
The ALICE Christmas Production L. Betev, S. Lemaitre, M. Litmaath, P. Mendez, E. Roche WLCG LCG Meeting 14th January 2009.
Status of the SL5 migration ALICE TF Meeting
ALICE Workload Model – WMS and CREAM
LCG Service Challenge: Planning and Milestones
Status of the Production
Philippe Charpentier CERN – LHCb On behalf of the LHCb Computing Group
WLCG Management Board, 16th July 2013
Simulation use cases for T2 in ALICE
ALICE – FAIR Offline Meeting KVI (Groningen), 3-4 May 2010
Status of MC production on the grid
Universita’ di Torino and INFN – Torino
The LHCb Computing Data Challenge DC06
Presentation transcript:

Status of the Production and Nagios news ALICE TF Meeting 29/07/2010

Status of the production Since yesterday (28/07/2010) ALICE is running out of MC production – Raw data reconstruction: Currently running at CERN (LHC10e). Decrease of the activity during the week – Analysis trains: Ongoing – User analysis: Ongoing – MC production: Finished for the moment. No new MC requirements on pipe

Job profile this week Decrease due to the stop of the MC production

Job profile per users Production clearly dominated by the MC jobs this week As usual, important user analysis activity also this week

Raw data transfers and production Low raw data transfer activity this week: 1.3TB of raw data transferred. (Compatible with the raw data taking regime this week) Around 25TB of raw data recorded in

Status of the sites T1 sites – CNAF: The site has been running a very low number of Alice jobs since more than a week. A GPFS migration caused this problem Still today the number of jobs is low although the operation is finished # jobs should increase in the next hours – RAL ALICE is running over the number of assigned resources Site proposed to put a cap on the number of Alice jobs at This is about 25% of the farm, and is around 10 times Alice's current fairshare allocation, (Alice's current usage is about 65%)This is necessary as the recent high volumes on Alice work caused CMS to run a high priority workload elsewhere.

Status of the sites T2 sites – Subatech will be down starting tomorrow Friday at 16:00 GMT+2 until Monday in the morning. Electrical maintenance In addition some French sites had cooling problems already solved – Grenoble: External network will be down on Saturday, July 31st from 5:30 am till 6:00pm. – Poznan: SE failed during the week, already solved – IPNL: CREAM1.6 migration completed – Torino: CREAM1.6 migration completed – Madrid: SE failing today. Migration activities ongoing. The CREAM system already migrared to CREAM1.6 – Trujillo: Out of production since a long time, in addition SE failing – LBL: SE failing today – Small activities at some Russian sites (new host certificates of the voboxes)

Pending issues Issue reported last week: – Large amount of zombies or extremely long jobs running at the sites (over 46h) Declared as pathological jobs which should be killed Sites were encouraged to whether kill those jobs or decrease the CPU limit time of the ALICE queues to 24h – No news after this during this week

Quattor recipe for the CREAM-CE migration Thanks to Jerome for this instructions – Available at: – ent&view=article&id=46&Itemid=103

Status of Nagios SAM will switched off in September ALL VOBOXES MUST BE PINGABLE AND ACCESIBLE FROM samnag014.cern.ch