Interactive Job Monitor: CafMon kill CafMon tail CafMon dir CafMon log CafMon top CafMon ps LcgCAF: CDF submission portal to LCG resources Francesco Delli.

Slides:



Advertisements
Similar presentations
EU 2nd Year Review – Jan – Title – n° 1 WP1 Speaker name (Speaker function and WP ) Presentation address e.g.
Advertisements

Workload management Owen Maroney, Imperial College London (with a little help from David Colling)
INFSO-RI Enabling Grids for E-sciencE Workload Management System and Job Description Language.
Development of test suites for the certification of EGEE-II Grid middleware Task 2: The development of testing procedures focused on special details of.
SEE-GRID-SCI Hands-On Session: Workload Management System (WMS) Installation and Configuration Dusan Vudragovic Institute of Physics.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
Sep Donatella Lucchesi 1 CDF Status of Computing Donatella Lucchesi INFN and University of Padova.
22/04/2005Donatella Lucchesi1 CDFII Computing Status OUTLINE:  New CDF-Italy computing group organization  Usage status at FNAL and CNAF  Towards GRID:
A tool to enable CMS Distributed Analysis
JIM Deployment for the CDF Experiment M. Burgon-Lyon 1, A. Baranowski 2, V. Bartsch 3,S. Belforte 4, G. Garzoglio 2, R. Herber 2, R. Illingworth 2, R.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Enabling Grids for E-sciencE Medical image processing web portal : Requirements analysis. An almost end user point of view … H. Benoit-Cattin,
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Riccardo Bruno INFN.CT Sevilla, Sep 2007 The GENIUS Grid portal.
LcgCAF:CDF submission portal to LCG Federica Fanzago for CDF-Italian Computing Group Gabriele Compostella, Francesco Delli Paoli, Donatella Lucchesi, Daniel.
Building a distributed software environment for CDF within the ESLEA framework V. Bartsch, M. Lancaster University College London.
F.Fanzago – INFN Padova ; S.Lacaprara – LNL; D.Spiga – Universita’ Perugia M.Corvo - CERN; N.DeFilippis - Universita' Bari; A.Fanfani – Universita’ Bologna;
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
SAMGrid as a Stakeholder of FermiGrid Valeria Bartsch Computing Division Fermilab.
CDF Grid at KISTI 정민호, 조기현 *, 김현우, 김동희 1, 양유철 1, 서준석 1, 공대정 1, 김지은 1, 장성현 1, 칸 아딜 1, 김수봉 2, 이재승 2, 이영장 2, 문창성 2, 정지은 2, 유인태 3, 임 규빈 3, 주경광 4, 김현수 5, 오영도.
Group 1 : Grid Computing Laboratory of Information Technology Supervisors: Alexander Ujhinsky Nikolay Kutovskiy.
The EDGeS project receives Community research funding 1 SG-DG Bridges Zoltán Farkas, MTA SZTAKI.
Grid infrastructure analysis with a simple flow model Andrey Demichev, Alexander Kryukov, Lev Shamardin, Grigory Shpiz Scobeltsyn Institute of Nuclear.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Movement of MC Data using SAM & SRM Interface Manoj K. Jha (INFN- Bologna) Gabriele Compostella, Antonio Cuomo, Donatella Lucchesi,, Simone Pagan (INFN.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
22 nd September 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
Dzero MC production on LCG How to live in two worlds (SAM and LCG)
Evolution of a High Performance Computing and Monitoring system onto the GRID for High Energy Experiments T.L. Hsieh, S. Hou, P.K. Teng Academia Sinica,
Condor Week 2004 The use of Condor at the CDF Analysis Farm Presented by Sfiligoi Igor on behalf of the CAF group.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
May Donatella Lucchesi 1 CDF Status of Computing Donatella Lucchesi INFN and University of Padova.
Outline: Status: Report after one month of Plans for the future (Preparing Summer -Fall 2003) (CNAF): Update A. Sidoti, INFN Pisa and.
Workload Management System Jason Shih WLCG T2 Asia Workshop Dec 2, 2006: TIFR.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid2Win : gLite for Microsoft Windows Roberto.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
Daniele Spiga PerugiaCMS Italia 14 Feb ’07 Napoli1 CRAB status and next evolution Daniele Spiga University & INFN Perugia On behalf of CRAB Team.
Proxy management mechanism and gLExec integration with the PanDA pilot Status and perspectives.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical using WMProxy advanced job submission.
EGEE 3 rd conference - Athens – 20/04/2005 CREAM JDL vs JSDL Massimo Sgaravatto INFN - Padova.
User Interface UI TP: UI User Interface installation & configuration.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
A Data Handling System for Modern and Future Fermilab Experiments Robert Illingworth Fermilab Scientific Computing Division.
Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA Grid2Win : gLite for Microsoft Windows Elisa Ingrà - INFN.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
Open Science Grid Consortium Storage on Open Science Grid Placing, Using and Retrieving Data on OSG Resources Abhishek Singh Rana OSG Users Meeting July.
Antonio Fuentes RedIRIS Barcelona, 15 Abril 2008 The GENIUS Grid portal.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Work Load Management & Simple Job Submission Practical Shu-Ting Liao APROC, ASGC EGEE Tutorial.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
CDF Monte Carlo Production on LCG GRID via LcgCAF Authors: Gabriele Compostella Donatella Lucchesi Simone Pagan Griso Igor SFiligoi 3 rd IEEE International.
IFAE Apr CDF Computing Experience - Gabriele Compostella1 IFAE Apr CDF Computing Experience Gabriele Compostella, University.
FESR Trinacria Grid Virtual Laboratory Practical using WMProxy advanced job submission Emidio Giorgio INFN Catania.
Gri2Win: Porting gLite to run under Windows XP Platform
Grid Computing: Running your Jobs around the World
LcgCAF:CDF submission portal to LCG
Brief overview on GridICE and Ticketing System
Introduction to Grid Technology
Ruslan Fomkin and Tore Risch Uppsala DataBase Laboratory
Grid2Win: Porting of gLite middleware to Windows XP platform
Short update on the latest gLite status
Gri2Win: Porting gLite to run under Windows XP Platform
Interoperability & Standards
#01 Client/Server Computing
Initial job submission and monitoring efforts with JClarens
#01 Client/Server Computing
The LHCb Computing Data Challenge DC06
Presentation transcript:

Interactive Job Monitor: CafMon kill CafMon tail CafMon dir CafMon log CafMon top CafMon ps LcgCAF: CDF submission portal to LCG resources Francesco Delli Paoli, Donatella Lucchesi, Daniel Jeans, Subir Sarkar and Igor Sfiligoi CDF-UI Submitter accepts incoming job submission requests and creates JDL. It retrieves user tarball and make it available to the WN. It submits the job in asynchronous way CDF-UI delegates jobs to the gLite WMS and hosts several CDF specific services … General architecture Job Submission and Execution CDF user sends the request of jobs submission to CDF-UI. CAF clients and Kerberos V ticked are needed WMS accepts jobs, and submits them to computing elements (CE) selected by CDF Computing Elements and Worker Node (WN) where jobs run Output is temporary stored on CDF Storage Elements job output Output permanently stored to FNAL tape storage Mailer sends a mail to the user as soon as the job finish Monitor (data_collector) provides job information to user job Resource Broker sends the job to LCG site chosen using the Match Making procedure based on the “ Estimated wait time” WMS keeps track of job information … When the job reach the worker node CafExe job Wrapper is executed. It takes care of user job. It retrieves the tarball From CDF-UI via HTTP and prepares environment for user job. Then it forks the job as required by user. When the job finishes it packs the working directory into output tarball. Job Monitor CDF code distribution HTTP Server CDF Software LCG site: CNAF LCG site: Lyon Proxy cache /home/cdfsoft  Parrot translates local I/O in HTTP requests  Possible to dynamically create a virtual file in user space system: mount /home/cdfsoft location on each WN when system call is translated  For big sites need to cache HTTP near WN to make Parrot fast and scalable CDF-UI Information System Job information WN information WMS keeps tracks of job information The job wrapper on WN collects information like load, dir, ps, top Performances SamplesEvents on tape   with 1 Recovery B s ->D s  D s ->  ~92%~100% B s ->D s  D s ->K*K ~98%~100% B s ->D s  D s ->K s K ~92%~100% CDF B Monte Carlo production #  segments succeeded # segments submitted  GRID failures: site miss-configuration, temporary authentication problems, WMS overload LcgCAF failures: Parrot and Squid cache stacked Useful Links LcgCAF Home Page - CAF Home page - SAM Home page - INFN-GRID Home page – GRID-IT CNAF Home Page - Squid Web Proxy Cache - Parrot Home page - LcgCAF LcgCAF is a complete rewrite of the CDF Analysis Farm (CAF) software developed by H. Shih-Chieh, E. Lipeles, M. Neubauer and F. Wuerthwein. It is constituted by several services running on CDF-UI which is like the farm head node for CDF. Based on the LCG/gLite middleware it allows the direct submission to the gLite Workload Management System (WMS). Introduction Future Developments … CDF-SE Job output Currently the output of the job, i.e. simulated data, is copied to FNAL tapes using a CDF procedure. Future: An interface between SRM and SAM (CDF catalogue) is under development. This will allow: - File declaration into the catalogue directly from LcgCAF - Automatic transfer of files from LCG sites to FNAL tape - Possibility of data processing on LCG sites Retry mechanism: necessary to achieve ~100% efficiency. It exploits the WMS retry for GRID failures and the LcgCAF “home-made” retry based on logs parsing