Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland www.cern.ch/i t DBES A. Abramyan, S. Bagansco, S. Banerjee, L. Betev, F. Carminati,

Slides:



Advertisements
Similar presentations
DataTAG WP4 Meeting CNAF Jan 14, 2003 Interfacing AliEn and EDG 1/13 Stefano Bagnasco, INFN Torino Interfacing AliEn to EDG Stefano Bagnasco, INFN Torino.
Advertisements

During the last three years, ALICE has used AliEn continuously. All the activities needed by the experiment (Monte Carlo productions, raw data registration,
1 Grid services based architectures Growing consensus that Grid services is the right concept for building the computing grids; Recent ARDA work has provoked.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
ALICE Operations short summary and directions in 2012 Grid Deployment Board March 21, 2011.
ALICE Operations short summary and directions in 2012 WLCG workshop May 19-20, 2012.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
ALICE DATA ACCESS MODEL Outline ALICE data access model - PtP Network Workshop 2  ALICE data model  Some figures.
AliEn Tutorial MODEL th May, May 2009 Installation of the AliEn software AliEn and the GRID Authentication File Catalogue.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
ALICE data access WLCG data WG revival 4 October 2013.
CERN IT Department CH-1211 Geneva 23 Switzerland t The Experiment Dashboard ISGC th April 2008 Pablo Saiz, Julia Andreeva, Benjamin.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES P. Saiz (IT-ES) AliEn job agents.
INFSO-RI Enabling Grids for E-sciencE Project Gridification: the UNOSAT experience Patricia Méndez Lorenzo CERN (IT-PSS/ED) CERN,
- Distributed Analysis (07may02 - USA Grid SW BNL) Distributed Processing Craig E. Tull HCG/NERSC/LBNL (US) ATLAS Grid Software.
Costin Grigoras ALICE Offline. In the period of steady LHC operation, The Grid usage is constant and high and, as foreseen, is used for massive RAW and.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
1 LCG-France sites contribution to the LHC activities in 2007 A.Tsaregorodtsev, CPPM, Marseille 14 January 2008, LCG-France Direction.
Status of PDC’07 and user analysis issues (from admin point of view) L. Betev August 28, 2007.
Overview of ALICE monitoring Catalin Cirstoiu, Pablo Saiz, Latchezar Betev 23/03/2007 System Analysis Working Group.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
1 WLCG-GDB Meeting. CERN, 12 May 2010 Patricia Méndez Lorenzo (CERN, IT-ES)
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES A. Abramyan, S. Bagansco, S. Banerjee, L. Betev, F. Carminati,
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT DPM / LFC and FTS news Ricardo Rocha ( on behalf of the IT/GT/DMS.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES GGUS Ticket review T1 Service Coordination Meeting 2010/10/28.
Database authentication in CORAL and COOL Database authentication in CORAL and COOL Giacomo Govi Giacomo Govi CERN IT/PSS CERN IT/PSS On behalf of the.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
JAliEn Java AliEn middleware A. Grigoras, C. Grigoras, M. Pedreira P Saiz, S. Schreiner ALICE Offline Week – June 2013.
AliEn central services Costin Grigoras. Hardware overview  27 machines  Mix of SLC4, SLC5, Ubuntu 8.04, 8.10, 9.04  100 cores  20 KVA UPSs  2 * 1Gbps.
+ AliEn site services and monitoring Miguel Martinez Pedreira.
ANALYSIS TOOLS FOR THE LHC EXPERIMENTS Dietrich Liko / CERN IT.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
CERN IT Department CH-1211 Genève 23 Switzerland t ALICE XROOTD news New xrootd bundle release Fixes and caveats A few nice-to-know-better.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The LCG interface Stefano BAGNASCO INFN Torino.
Patricia Méndez Lorenzo (CERN, IT/GS-EIS) ċ. Introduction  Welcome to the first ALICE T1/T2 tutorial  Delivered for site admins and regional experts.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES L. Betev, A. Grigoras, C. Grigoras, P. Saiz, S. Schreiner AliEn.
03/09/2007http://pcalimonitor.cern.ch/1 Monitoring in ALICE Costin Grigoras 03/09/2007 WLCG Meeting, CHEP.
Distributed Analysis Tutorial Dietrich Liko. Overview  Three grid flavors in ATLAS EGEE OSG Nordugrid  Distributed Analysis Activities GANGA/LCG PANDA/OSG.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES P. Saiz The future of AliEn.
Status of AliEn2 Services ALICE offline week Latchezar Betev Geneva, June 01, 2005.
+ AliEn status report Miguel Martinez Pedreira. + Touching the APIs Bug found, not sending site info from ROOT to central side was causing the sites to.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES The AliEn File Catalogue Jamboree on Evolution of WLCG Data &
GRID interoperability and operation challenges under real load for the ALICE experiment F. Carminati, L. Betev, P. Saiz, F. Furano, P. Méndez Lorenzo,
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES The Common Solutions Strategy of the Experiment Support group.
Gestion des jobs grille CMS and Alice Artem Trunov CMS and Alice support.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
Care and feeding of the alice grid Torino, Jan 15-16, 2009.
Creating a simplified global unique file catalogue Miguel Martinez Pedreira Pablo Saiz.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES A. Abramyan, S. Bagnasco, L. Betev, D. Goyal, A. Grigoras, C.
The ALICE Analysis -- News from the battlefield Federico Carminati for the ALICE Computing Project CHEP 2010 – Taiwan.
Federating Data in the ALICE Experiment
Jean-Philippe Baud, IT-GD, CERN November 2007
ALICE and LCG Stefano Bagnasco I.N.F.N. Torino
UML diagrams for the AliEn job execution part and PackMan service
Practical: The Information Systems
Torrent-based software distribution
CREAM Status and Plans Massimo Sgaravatto – INFN Padova
ALICE FAIR Meeting KVI, 2010 Kilian Schwarz GSI.
INFN-GRID Workshop Bari, October, 26, 2004
ALICE Physics Data Challenge 3
ALICE Computing : 2012 operation & future plans
LHCb Computing Model and Data Handling Angelo Carbone 5° workshop italiano sulla fisica p-p ad LHC 31st January 2008.
MC data production, reconstruction and analysis - lessons from PDC’04
Short update on the latest gLite status
Simulation use cases for T2 in ALICE
ALICE – FAIR Offline Meeting KVI (Groningen), 3-4 May 2010
LCG middleware and LHC experiments ARDA project
Presentation transcript:

Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES A. Abramyan, S. Bagansco, S. Banerjee, L. Betev, F. Carminati, D. Goyal, A. Grigoras, C. Grigoras, M. Litmaath, N. Manukyan, M. Martinez, A. Montiel, J. Porter, P. Saiz, S. Sankar, S. Schreiner, J. Zhu ALICE Environment on the GRID

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 221 May 2012 Pablo Saiz AliEn –File Catalogue –TaskQueue –Data access AliEn in ALICE New development Conclusions Outline

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 321 May 2012 Pablo Saiz All components to create a GRID File Catalogue –UNIX-like file system –Mapping to physical files –Metadata information –SE discovery Transfer Model –With different plugins TaskQueue –Job Agent & pull model –Automatic installation of software packages –Simulation, reconstruction, analysis... Developed by ALICE –Used by several communities AliEn

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 421 May 2012 Pablo Saiz AliEn File Catalogue Global Unique name space –Mapping from LFN to PFN UNIX-like file system interface Powerful metadata catalogue Automatic SE selection Integrated quota system Multiple storage protocols: xrootd, torrent, srm, file Collections of files Physical file archival Roles and users

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 521 May 2012 Pablo Saiz Hierarchical structure Entries can be retrieved by LFN or GUID Flexible table structure, allowing for easy scalability ALICE: 350M entries (270GB DB size) 1-JAN JAN FEB AUG-2008 … / /alice /alice/user/p/psaiz /alice/simulation/2006 … Index LFN->GUID LFN Catalogue Index GUID->PFN GUID Catalogue

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 621 May 2012 Pablo Saiz Data access Mostly xrootd protocol Jobs only download configuration files Data files are accessed remotely –The closest replica to the job, local replica first JobAgent uploads N replicas of the output Schedules data transfers only involve the raw data –Done via xrd3cp Use of torrent for AliEn and software package distribution

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 721 May 2012 Pablo Saiz SE selection Client automatically directed to the best SE to store/retrieve files –Find the closest working SE of a given QoS Built-in retrial mechanism Client Authen File Catalogue SERank Optimizer I’m in ‘New York’ Give me SEs! Try: LBL, LLNL, CNAF (auth. envelope)

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 821 May 2012 Pablo Saiz Job Execution Central TaskQueue with all jobs –Priorities & quotas on user/role level

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 921 May 2012 Pablo Saiz xrootd Job execution Job Manager JOB TASKQUEUE Job Broker CE MonALISA xrootd Site A JOB MonALISA xrootd Site B MonALISA Site C File catalogue LFN GUID Meta data JOB CE JA CREAMCE

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 1021 May 2012 Pablo Saiz Middleware interfaces AliEn AliEn user interface VTDEDG LCG /GLITE CONDOR ARC/ NORDUGRID Nice! I STILL do not have to worry about ever changing GRID environment… LCG /EMI

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 1121 May 2012 Pablo Saiz ALICE sites More than 80 centres

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 1221 May 2012 Pablo Saiz ALICE Results

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 1321 May 2012 Pablo Saiz Distribution of successful jobs

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 1421 May 2012 Pablo Saiz Evolution - Intelligent Design Several major AliEn iterations –Starting in 2001 –Scale testing –Evaluating different approaches And learning from the exercises Keeping (almost) the same CLI –Even if backward incompatible under the hood

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 1521 May 2012 Pablo Saiz Human grid Chile, Trust model USA, JA memory Armenia, XML model PackMan India, File deletion South Korea, Quota system Germany, ORACLE Italy, CREAMCE Scotland, VO to VO Switzerland, Main dev. China, Trust Model

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 1621 May 2012 Pablo Saiz Plenty of new improvements –Catalogue simplification –Extreme Job Brokering –Removal of PackMan –New JDL fields –Proxy renewal –Job Memory checkup –… And baseline for new development What’s new

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 1721 May 2012 Pablo Saiz Extreme Brokering Postpone splitting of job until last moment Decide data to be analyzed based on current location of JA & files not analyzed yet Can define Max/Min number of files to be analyzed –Even if the files are not local Less subjobs: –Easier merging

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 1821 May 2012 Pablo Saiz Plenty of areas to contribute File popularity Interactive jobs Correlate Monitoring data Multi core jobagents Catalogue crawler Error classification Distributed brokering Trust model …

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 1921 May 2012 Pablo Saiz Brokering alternatives Multicore jobagent –One agent per core (overkill) or –One agent per machine (needs development) Combining jobs with similar input –And dispatch them together Distribute (part of) brokering to the vobox –Sent multiple jobs to Vobox Let the vobox distribute them among the JA –Reduce load on central services –Possibility to reuse files from previous jobs

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 2021 May 2012 Pablo Saiz Extreme Brokering Postpone splitting of job until last moment Decide data to be analyzed based on current location of JA & files not analyzed yet Can define Max/Min number of files to be analyzed –Even if the files are not local Less subjobs: –Easier merging

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 2121 May 2012 Pablo Saiz Current situation Works nicely if one replica per file Job Manager JOB A bit more complex with 3 SE and 2 replicas Job Manager JOB And a lot more with 50 SE and 3 replicas Job Manager JOB

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 2221 May 2012 Pablo Saiz Example Site ASite BSite C File 1 File 2 File 3 File 4 File 5 Current schema Submit 4 jobs: File1 File 4 File2File3File 5 Broker per file Submit 3 empty subjobs File1, 2,4,5 When a job starts, analyze as much as possible File 3 If nothing left, just exit

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 2321 May 2012 Pablo Saiz How to test new versions… Build system: –Multiple platforms –Integration & basic functionality tests No API/access from ROOT tests  –Similar to the AliROOT, ROOT build systems –Running the whole system on a single machine –

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 2421 May 2012 Pablo Saiz Summary AliEn v2.20 ready for deployment –With plenty of new features and bug fixes Minimize upgrade downtime –Create testing setup with several sites, and with all the SE –More effort on testing (also from site admins) Deploy Test V0 with ALICE sites And say goodbye to v2-19 in two months

CERN IT Department CH-1211 Geneva 23 Switzerland t ES 2521 May 2012 Pablo Saiz ALICE has been using the GRID since 2001 AliEn provides an interface to the GRID –Access to multiple resources –Transparent to the user AliEn Components –File & metadata catalogue –TaskQueue –Transfer model Can be used by other communities Plenty of areas for research Summary