INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org UNOSAT and Geant4: Experiences of their merge in the LCG Environment Patricia Méndez Lorenzo.

Slides:



Advertisements
Similar presentations
1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
Advertisements

Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
D. Düllmann - IT/DB LCG - POOL Project1 POOL Release Plan for 2003 Dirk Düllmann LCG Application Area Meeting, 5 th March 2003.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
K.Harrison CERN, 23rd October 2002 HOW TO COMMISSION A NEW CENTRE FOR LHCb PRODUCTION - Overview of LHCb distributed production system - Configuration.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
The ATLAS Production System. The Architecture ATLAS Production Database Eowyn Lexor Lexor-CondorG Oracle SQL queries Dulcinea NorduGrid Panda OSGLCG The.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
Cluster currently consists of: 1 Dell PowerEdge Ghz Dual, quad core Xeons (8 cores) and 16G of RAM Original GRIDVM - SL4 VM-Ware host 1 Dell PowerEdge.
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
BaBar Grid Computing Eleonora Luppi INFN and University of Ferrara - Italy.
F.Fanzago – INFN Padova ; S.Lacaprara – LNL; D.Spiga – Universita’ Perugia M.Corvo - CERN; N.DeFilippis - Universita' Bari; A.Fanfani – Universita’ Bologna;
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Geant4 Acceptance Suite for Key Observables CHEP06, T.I.F.R. Mumbai, February 2006 J. Apostolakis, I. MacLaren, J. Apostolakis, I. MacLaren, P. Mendez.
INFSO-RI Enabling Grids for E-sciencE Geant4 Physics Validation: Use of the GRID Resources Patricia Mendez Lorenzo CERN (IT-GD)
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
8th November 2002Tim Adye1 BaBar Grid Tim Adye Particle Physics Department Rutherford Appleton Laboratory PP Grid Team Coseners House 8 th November 2002.
Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI.
INFSO-RI Enabling Grids for E-sciencE Project Gridification: the UNOSAT experience Patricia Méndez Lorenzo CERN (IT-PSS/ED) CERN,
1 st December 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
Belle MC Production on Grid 2 nd Open Meeting of the SuperKEKB Collaboration Soft/Comp session 17 March, 2009 Hideyuki Nakazawa National Central University.
IST E-infrastructure shared between Europe and Latin America High Energy Physics Applications in EELA Raquel Pezoa Universidad.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Increased productivity for merging Grid applications:
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
Databases E. Leonardi, P. Valente. Conditions DB Conditions=Dynamic parameters non-event time-varying Conditions database (CondDB) General definition:
The huge amount of resources available in the Grids, and the necessity to have the most up-to-date experimental software deployed in all the sites within.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
A Short Course on Geant4 Simulation Toolkit How to learn more?
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
3 rd EGEE Conference Athens 18th-22nd April EGEE is a project funded by the European Union under contract IST Geant4 Production in the LCG.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
INFSO-RI Enabling Grids for E-sciencE OSG-LCG Interoperability Activity Author: Laurence Field (CERN)
…building the next IT revolution From Web to Grid…
T3 analysis Facility V. Bucard, F.Furano, A.Maier, R.Santana, R. Santinelli T3 Analysis Facility The LHCb Computing Model divides collaboration affiliated.
David Adams ATLAS DIAL/ADA JDL and catalogs David Adams BNL December 4, 2003 ATLAS software workshop Production session CERN.
Les Les Robertson LCG Project Leader High Energy Physics using a worldwide computing grid Torino December 2005.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
FRANEC and BaSTI grid integration Massimo Sponza INAF - Osservatorio Astronomico di Trieste.
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
INFSO-RI Enabling Grids for E-sciencE CRAB: a tool for CMS distributed analysis in grid environment Federica Fanzago INFN PADOVA.
M. Oldenburg GridPP Metadata Workshop — July 4–7 2006, Oxford University 1 Markus Oldenburg GridPP Metadata Workshop July 4–7 2006, Oxford University ALICE.
INFSO-RI Enabling Grids for E-sciencE Policy management and fair share in gLite Andrea Guarise HPDC 2006 Paris June 19th, 2006.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
Susanna Guatelli Geant4 in a Distributed Computing Environment S. Guatelli 1, P. Mendez Lorenzo 2, J. Moscicki 2, M.G. Pia 1 1. INFN Genova, Italy, 2.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite configuration (plans) Robert Harakaly.
Distributed Physics Analysis Past, Present, and Future Kaushik De University of Texas at Arlington (ATLAS & D0 Collaborations) ICHEP’06, Moscow July 29,
Enabling Grids for E-sciencE CMS/ARDA activity within the CMS distributed system Julia Andreeva, CERN On behalf of ARDA group CHEP06.
Enabling Grids for E-sciencE Experience Supporting the Integration of LHC Experiments Computing Systems with the LCG Middleware Simone.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 Technical Overview Jakub T. Moscicki, CERN.
ATLAS Distributed Analysis DISTRIBUTED ANALYSIS JOBS WITH THE ATLAS PRODUCTION SYSTEM S. González D. Liko
Experiences on Grid production for Geant4 EGEE User Forum, CERN, 1st March 2006 P. Mendez Lorenzo, A. Ribon CERN CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
Geant4 Training 2003 A Short Course on Geant4 Simulation Toolkit How to learn more? The full set of lecture notes of this Geant4.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
INFN/IGI contributions Federated Clouds Task Force F2F meeting November 24, 2011, Amsterdam.
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
GDB Meeting CERN 09/11/05 EGEE is a project funded by the European Union under contract IST A new LCG VO for GEANT4 Patricia Méndez Lorenzo.
Enabling Grids for E-sciencE LRMN ThIS on the Grid Sorina CAMARASU.
Eleonora Luppi INFN and University of Ferrara - Italy
European Organization for Nuclear Research
LHCb Computing Model and Data Handling Angelo Carbone 5° workshop italiano sulla fisica p-p ad LHC 31st January 2008.
Simulation use cases for T2 in ALICE
Presentation transcript:

INFSO-RI Enabling Grids for E-sciencE UNOSAT and Geant4: Experiences of their merge in the LCG Environment Patricia Méndez Lorenzo CERN (IT-GD) / CNAF 1st IEEE International Conference on e-Science and Grid Computing Dec, 5-8 Melbourne (Australia)

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Outlook Scope of the talk Geant4 Experiences UNOSAT Experiences Summary This talk assumes you are familiar with the LCG/EGEE infrastructure and architecture

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Scope of this talk  LCG/EGEE is providing a Grid infrastructure to high energy physics experiments at CERN  HOWEVER...  An important number of other communities are getting in contact with us to use the Grid  LCG/EGEE foreseen the support to this new groups  The question is: How to begin and what to do to get involved  The scope of this talk is to show you how the gridifications of new projects are performed

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo First Example: Geant4  What is Geant4 (GEometry ANd Tracking)?  Generic Toolkit for Monte Carlo simulation of particle interactions with the matter (i.e. detectors)  Application domains:  High-Energy Physics: ATLAS, CMS and LHCb (LHC), BaBar (SLAC), etc  Space Radiation: ESA  Medical Physics: Proton and brachy therapies, etc  Object-Oriented (C++) project, modular and extensible. Significant improved with respect its predecessor, Geant3, not only from the software structure, but mainly for the physics coverage  Electromagnetic physics of Geant4 and even more Hadronic physics are complex fields. It is fundamental to test their models covering the widest possible range of particles, materials and energies

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Geant4 Requirements  Geant4 works with those physics models the experiments will have to face to  Electromagnetic and Hadronic physics are fundamental features to be properly simulated by Geant4. However they are extremely CPU demanding  Number of events and energy depending:  1 event of 1GeV ~ 0.03 sec (2.4 GHz)  1 event of 300 Gev ~ 9-10 sec Geant4 wants to use the LCG environment to validate the software they provide to their users twice per year  Two large productions per year  Goal during the software validation: Comparison some shower observables between the two different Geant4 versions and check statistical significant changes  Small productions (some few thousands of jobs) during the whole year

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Geant4 Parameters and Strategy  In order to test the software they provide, Geant4 has to run over a large range of parameters:  8 different particles  23 Beam energies  5 physics list (physics models)  7 simplified detectors  Make all the combinations; these are the independent jobs to be run in each validation of the Geant4 software  Geant4 Procedure  Already two productions have been validated in LCG  Comparison of two different software versions  Generation of samples for each version  First time separately  Second time both versions together in each job  Analysis procedure outside Grid  Third production going on at this moment

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Geant4 Production in LCG  Stages: 1. Software installation:  Installation of the Geant4 packages (with all the required external additional packages: PI, AIDA, etc)  Software provided via a tar file  Installation through jobs using specific LCG tools  Fundamental request for the sites: Shared area between WNs and perfectly definition of the software installation region 2. Events production:  Jobs sent by bunches of about 1000 jobs (remember all the possible parameter combinations) defined by each physics list  5000 events in each job were produced 3. Analysis:  Statistical tests to perform the comparison between the two G4 versions

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Geant4 Production in LCG  General Characteristics:  VO:  1 st Production: dteam (6 certificates, one as dteamsgm)  2 nd Production: alice (2certificates, one as alicesgm)  3 rd Production: First time as Geant4!  The group is following all the necessary streps to become VO  Sites and middleware operating system:  1 st Production: RedHat7.3  2 nd and 3 rd Productions: Scientific Linux  Resources:  1 st Production: Own RB+BDII+UI: lxb2006 at CERN  2 nd and 3 rd Productions: lxplus resources and 2 BDII  All output:  1 st Production: About 30 GB stored at CERN (lxn1183)  2 nd Production: Comparable quantity stored at CERN (lxn1180)  3 rd Production: Retrieving the outputs to a delivered afs area at CERN

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Framework developed for Geant4  Generation of a general framework consisting of 3 major tools:  Tool for general and automatic job submission  Tool for events generation in all those sites where the software has been installed  Tool for data analysis (not needed during the 2 nd Production)  First Part: Tool for job submission  Copy and registry of the Geant4 package  A file containing the TURL is created and is passed to the WN  Follow up of candidates able to admit Geant4 jobs  Selection of long queues only  Automatic built of the.jdl files for each long queue  Built taking as base those proposed by the user joining the name of the queue where to submit the job  Software Installation tools are used to perform the installation  Submission of these files to each queue

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Framework developed for Geant4  Software Installation tool (Tool submitted in the first step to all sites to install the software)  First step:  The tar file is copied from the SE at CERN to the WN  It is untar and copied to the VO_DTEAM_SW_DIR area  Second Step: Software Installation tool  Some Geant4 tests are performed to validate the installation  If succeeded a tag is published in the Information System  Results:  The software installation was tried in 63 sites  1 st Production: 28 sites  2 nd Production: 35 sites  3 rd Production: Transition phase: configuring the G4 sites  Main Problems:  Sites were having submission problems  Sites did not have defined the VO_ _SW_DIR area or did not have shared area among WNs

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Framework developed for Geant4  Second Step: Tool for the ProductionStrategy:  Only long queues will be used to run the production  All outputs (hbook files) will be stored at CERNMethodology:  Geant4 provides their own code to perform the events production  Python Script for each type of particle, energy, physics list and calorimeter is created by the framework from one template provided by Geant4  Generation of one jdl per job containing the code provided by Geant4 (the same for all jobs) + that script generated by the framework and changing for each job  Submission of all jdl files to all sites containing the Geant4 installation

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Framework developed for Geant4  Results (First and Second Production):  A hbook file containing 5000 event is created in the case the production succeeded  The name of the file is created by the framework containing the type of particles, the energy, the physics list and the calorimeter within the name (important to perform later the comparison)  The hbook file is copied and registered to a disk at CERN  During the 2nd production a tar file containing different files should have been created in the case the job succeeded. This file was retrieved to the afs area delivered for this aim and copied and registered to the grid  Around 4508 jobs (two physics list for both Geant4 versions) were run in lest than 2 weeks in 28 sites with a efficiency of about 87%

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Update of the Framework  This framework covered the Geant4 requirements for its first production  It’s not useful for larger production  Difficult to deal with the output and visualize the results  A new complete tool has been developed for large production  Flexible enough to be used for any VO and any user application  Most of the improvements mostly relative to the outputs handle Documentation: “LCG2 User Guide” bin/index.cgi?var=eis/docs Download:

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Update of the Framework  The new framework consists mainly of two tools:  Tool to perform the automatic job submission  Tool to retrieve and handle the corresponding output 1. Automatic job submission Overview:  Given an user’s jdl this tool performs the following actions:  It lists all sites able to run the jdl provided by the user  It creates automatically a jdl file based on that provided by the user  It submits the just created jdl containing the user application(s)  Moreover it creates a subdirectory (defined by the user) containing a list of the sites where the jobs have been submitted, the corresponding jdls and the jobs IDs

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Update of the Framework  Additional Features:  The user can define the queues where the jobs are submitted. These queues are checked to see whether it fixes the job requirements  Requested LFN files can be included. The corresponding TURLs are searched and included in a file passed in the InputSandbox to the WN 2. Retrieve and handle of the outputs  The 2 nd tool checks the status of the jobs from the job IDs included in the directory given by the user  It provides the following output: The job run in ramses.dcic.ups.es:2119/jobmanager-torque-dteam is in status: Scheduled The job run in grid01.phy.ncu.edu.tw:2119/jobmanager-torque-dteam is in status: running The job run in scaic10.scai.frauhofer.de:2119/jobmanager-torque-dteam is in status: over The user is queried to retrieve the output to the destination he has previously decided

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Update of the Framework Additional Features: It is possible to visualize the outputs on the web A html report is provided showing the files decided by the user

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo New Geant4 Production: DIANE Results obtained for another community: ITU

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Second Example: UNOSAT  Satellite imagery based web mapping service  Objectives  Easy access to quality geoinformation service  Organize the demand for geoinformation  Ensure cost-effective and timely products  Core Services  Humanitarian Mapping  Image Processing

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Second Example: UNOSAT Data suppliers UNOSAT Central Unit USER WWW Ground station    

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Relief Projects of UNOSAT  Case Study: Indian Ocean Tsunami Relief and Development  29 th Dec 2004: First Map distributed online to field users  14 th Jan 2005: Imagery Bank online:  100 Tsunami-related maps (pre and post)  670 raw satellite images  January: 200,000 tsunami maps downloaded in total  UNOSAT has a huge amount of data to stored  CERN has provided a good amount of space for this aim  From Summer 2005 the collaboration with GRID began  Running and storing data in LCG/EGEE can certainly assist UNOSAT in their purposes

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo UNOSAT and LCG/EGEE  In summer 2005 we have provided a whole structure at CERN for UNOSAT  UNOSAT Virtual Organization (VO)  3.5TB in CASTOR  Computing Elements, Resource Brokers  Collaboration with ARDA group  AFS area of 5GB  We have run some UNOSAT tests (images compression) inside the GRID environment (quite successful)  The framework developed for Geant4 has been adapted for UNOSAT needs

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo ARDA Support LFC ARDA APP Oracle DB CASTOR Metadata (x,y,z) LFN PFN SRM

Enabling Grids for E-sciencE INFSO-RI Melbourne 5th-8 th December Patricia Méndez Lorenzo Summary  Just one message  We are involving already different communities inside the GRID  Huge applications field for GRID  We have created different frameworks to gridify in a short time the new projects  Thanks to ARDA developers we have covered many needs of each community  One of the EGGE purposes (involved different communities inside the GRID) is already a reality