Susanna Guatelli Geant4 in a Distributed Computing Environment S. Guatelli 1, P. Mendez Lorenzo 2, J. Moscicki 2, M.G. Pia 1 1. INFN Genova, Italy, 2.

Slides:



Advertisements
Similar presentations
NGS computation services: APIs and.
Advertisements

Maria Grazia Pia Simulation in a Distributed Computing Environment Simulation in a Distributed Computing Environment S. Guatelli 1, A. Mantero 1, P. Mendez.
28 April, 2005ISGC 2005, Taiwan The Efficient Handling of BLAST Applications on the GRID Hurng-Chun Lee 1 and Jakub Moscicki 2 1 Academia Sinica Computing.
Maria Grazia Pia Geant4 LowE Workshop 30-31/5/2002 ow Energy e.m. Workshop CERN, May 2002.
Maria Grazia Pia, INFN Genova 1 Part V The lesson learned Summary and conclusions.
Monte Carlo simulation for radiotherapy in a distributed computing environment S. Chauvie 2,3, S. Guatelli 2, A. Mantero 2, J. Moscicki 1, M.G. Pia 2 CERN.
Other GEANT4 capabilities Event biasing Parameterisation (fast simulation) Persistency Parallelisation and integration in a distributed computing environment.
DIANE Project Seminar on Innovative Detectors, Siena Oct 2002 Distributed Computing in Physics Parallel Geant4 Simulation in Medical.
Parallel Programming on the SGI Origin2000 With thanks to Moshe Goldberg, TCC and Igor Zacharov SGI Taub Computer Center Technion Mar 2005 Anne Weill-Zrahia.
S. Guatelli, M.G Pia, INFN Genova S. Guatelli ( CERN, INFN Genova ) CERN, 13 November 2002 Users Workshop Where to put analysis in Geant4 Applications.
Susanna GuatelliGeant4 Workshop 2004 Use of Analysis Tools Geant4 Workshop 2004, Catania Susanna Guatelli, INFN Genova.
Maria Grazia Pia, INFN Genova CERN, 26 July 2004 Background of the Project.
Hadoop Team: Role of Hadoop in the IDEAL Project ●Jose Cadena ●Chengyuan Wen ●Mengsu Chen CS5604 Spring 2015 Instructor: Dr. Edward Fox.
DIANE Overview Germán Carrera, Alfredo Solano (CNB/CSIC) EMBRACE COURSE Monday 19th of February to Friday 23th. CNB-CSIC Madrid.
Physics data management tools: computational evolutions and benchmarks Mincheol Han 1, Chan-Hyeung Kim 1, Lorenzo Moneta 2, Maria Grazia Pia 3, Hee Seo.
IEEE Nuclear Science Symposium and Medical Imaging Conference Short Course The Geant4 Simulation Toolkit Sunanda Banerjee (Saha Inst. Nucl. Phys., Kolkata,
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Alexandre A. P. Suaide VI DOSAR workshop, São Paulo, 2005 STAR grid activities and São Paulo experience.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
Data Analysis using Java Mobile Agents Mark Dönszelmann, Information, Process and Technology Group, IT, CERN ATLAS Software Workshop Analysis Tools Meeting,
Geant4 Acceptance Suite for Key Observables CHEP06, T.I.F.R. Mumbai, February 2006 J. Apostolakis, I. MacLaren, J. Apostolakis, I. MacLaren, P. Mendez.
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
INFSO-RI Enabling Grids for E-sciencE Geant4 Physics Validation: Use of the GRID Resources Patricia Mendez Lorenzo CERN (IT-GD)
DIANE Project CHEP 03 DIANE Distributed Analysis Environment for semi- interactive simulation and analysis in Physics Jakub T. Moscicki,
Maria Grazia Pia, INFN Genova Test & Analysis Project aka “statistical testing” Maria Grazia Pia, INFN Genova on behalf of the T&A team
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
REMSIM Radiation Exposure and Mission Strategies for Interplanetary Manned Missions Susanna Guatelli, 9 th March 2004, Genova, Italy
A General Purpose Brachytherapy Software Simulation + Analysis (isodose calculation) 2/10/2002 Geant4 Workshop CERN Susanna Guatelli Univ. and INFN Genova.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Enabling Grids for E-sciencE EGEE-III INFSO-RI Using DIANE for astrophysics applications Ladislav Hluchy, Viet Tran Institute of Informatics Slovak.
1 Large-Scale Profile-HMM on the Grid Laurent Falquet Swiss Institute of Bioinformatics CH-1015 Lausanne, Switzerland Borrowed from Heinz Stockinger June.
Technological Transfer from HEP to Medical Physics How precise Brachytherapy MonteCarlo simulations can be applied in Clinics Reality Problem: How to achieve.
S. Guatelli, A. Mantero, J. Moscicki, M. G. Pia Geant4 medical simulations in a distributed computing environment 4th Workshop on Geant4 Bio-medical Developments.
IEEE Nuclear Science Symposium and Medical Imaging Conference Short Course The Geant4 Simulation Toolkit Sunanda Banerjee (Saha Inst. Nucl. Phys., Kolkata,
Benchmarks of medical dosimetry simulation on the grid S. Chauvie 1, A. Lechner 4, P. Mendez Lorenzo 5, J. Moscicki 5, M.G. Pia 6 G.A.P. Cirrone 2, G.
Avian Flu Data Challenge Hsin-Yen Chen ASGC 29 Aug APAN24.
PROOF and ALICE Analysis Facilities Arsen Hayrapetyan Yerevan Physics Institute, CERN.
Geant4 Training 2006 Short Course Katsuya Amako (KEK) Gabriele Cosmo (CERN) Susanna Guatelli (INFN Genova) Aatos Heikkinen (Helsinki Institute of Physics)
GRID activities in Wuppertal D0RACE Workshop Fermilab 02/14/2002 Christian Schmitt Wuppertal University Taking advantage of GRID software now.
Doug Benjamin Duke University. 2 ESD/AOD, D 1 PD, D 2 PD - POOL based D 3 PD - flat ntuple Contents defined by physics group(s) - made in official production.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Parallelization Geant4 simulation is an embarrassingly parallel computational problem – each event can possibly be treated independently 1.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
Geant4 Training 2004 Short Course Katsuya Amako (KEK) Gabriele Cosmo (CERN) Giuseppe Daquino (CERN) Susanna Guatelli (INFN Genova) Aatos Heikkinen (Helsinki.
NGS computation services: APIs and.
1 IDGF International Desktop Grid Federation ASSESSING THE PERFORMANCE OF DESKTOP GRID APPLICATIONS A. Afanasiev, N. Khrapov, and M. Posypkin DEGISCO is.
Maria Grazia Pia, INFN Genova - G4 WG Coord. Meeting, 13/11/2001 ow Energy Electromagnetic Physics ow Energy Electromagnetic Physics New physics features.
Experiences on Grid production for Geant4 EGEE User Forum, CERN, 1st March 2006 P. Mendez Lorenzo, A. Ribon CERN CERN.
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
Enabling Grids for E-sciencE LRMN ThIS on the Grid Sorina CAMARASU.
CNAF - 24 September 2004 EGEE SA-1 SPACI Activity Italo Epicoco.
F. Foppiano3, S. Guatelli2, J. Moscicki1, M.G. Pia2 CERN1 INFN Genova2
Design rationale and status of the org.glite.overlay component
GWE Core Grid Wizard Enterprise (
INFN-GRID Workshop Bari, October, 26, 2004
Simulation in a Distributed Computing Environment
Ruslan Fomkin and Tore Risch Uppsala DataBase Laboratory
CompChem VO: User experience using MPI
Simulation use cases for T2 in ALICE
F. Foppiano, S. Guatelli, B. Mascialino, M. G. Pia, M. Piergentili
Other GEANT4 capabilities
Simulation in a Distributed Computing Environment
Short Course Siena, 5-6 October 2006
The Hadrontherapy Geant4 advanced example
Short Course IEEE NSS/MIC 2003 Katsuya Amako (KEK) Makoto Asai (SLAC)
Gridifying the LHCb Monte Carlo production system
Simulation in a Distributed Computing Environment
Distributed Simulation with Geant4
The Geant4 Hadrontherapy Advanced Example
Presentation transcript:

Susanna Guatelli Geant4 in a Distributed Computing Environment S. Guatelli 1, P. Mendez Lorenzo 2, J. Moscicki 2, M.G. Pia 1 1. INFN Genova, Italy, 2. CERN, Geneva, Switzerland Geant th user conference and collaboration workshop November 3-10, 2005, Bordeaux, France

Susanna Guatelli Vision Problem How to obtain a quick response from a Geant4 simulation  Case 1: quick response in few minutes i.e. dosimetry, study the efficiency of detectors  Case 2: reasonable time for response from G4 simulations requiring high statistics i.e. medical, space science, high energy physics applications, tests of Geant4 physics models Solution Parallelisation  On dedicated pc clusters  On the GRID Study a general approach, independent from the specific Geant4 application

Susanna Guatelli Quick response Transparent configuration in sequential or parallel mode Transparent access to the GRID through an intermediate software layer Parallelisation Access to distributed computing resources

Susanna Guatelli G4 IMRT application Execution time of 10 9 events ~ 9 days and half Goal: quick response ~ few hours Strategy Study the performance of two Geant4 applications as typical examples: –Geant4 Brachytherapy application –Geant4 IMRT application Parallelisation through DIANE Performance tests –On a single CPU –On clusters –On the GRID Quantitative analysis of the results G4 Brachytherapy application Execution time of 20 M events ~ 5 hours Goal: quick response ~ few minutes Sequential mode on a Pentium IV, 3 GHz

Susanna Guatelli Outline Diane overview How to dianize a G4 application Results of performance tests Conclusions

Susanna Guatelli DIANE Overview DIANE R&D Project started in 2001 in CERN/IT with very limited resources collaboration with Geant 4 groups at CERN, INFN, ESA succesful prototypes running on LSF and EDG Parallel cluster processing – make fine tuning and customisation easy – transparently using GRID technology – application independent Developed by J. Moscicki, CERN

Susanna Guatelli Practical Example Example: simulation with analysis  The job is divided into tasks  The tasks are executed on worker components  Each task produces a file with histograms  Job result = sum of histograms produced by tasks Master-worker model  Client starts a job  Workers perform tasks and produce histograms  Master integrates the results

Susanna Guatelli Running in a distributed environment Not affecting the original code of the application same code – standalone and distributed case is the same code Good separation of the subsystems – the application does not need to know that it runs in a distributed environment – the distributed framework (DIANE) does not need to care about what actions an application performs internally The application developer is shielded from the complexity of underlying technology via DIANE

Susanna Guatelli How to dianize a G4 application Look at the Geant4 extended example: ExDIANE in the parallel directory Completely transparent to the user: same G4 code Documentation at specific for Geant4 applications –Installing and compiling DIANE –Compiling and running a Geant4 application through DIANE

Susanna Guatelli Test results Study the performance of the execution of the dianized G4Brachy: –Test on a single CPU –Test on a dedicated farm (60 CPUs) –Test on a farm, shared with other users (LSF, CERN) –Test on the GRID Tools and libraries: Simulation toolkit: Geant4 7.0.p01 Analysis tools: AIDA and PI DIANE: DIANE CLHEP: G4EMLOW2.3

Susanna Guatelli Results – G4Brachy: 1 CPU with respect to the number of events Test on a single dedicated CPU (Intel ®, Pentium IV, 3.00 GHz) Ratio = Execution time with respect to the number of events of the job The overhead of DIANE is negligible in high statistics jobs

Susanna Guatelli Dedicated farm : 30 identical biprocessors (Pentium IV, 3 GHz) Thanks to Hurng-Chun Lee (Academia Sinica Grid Computing Center, Taiwan) Thanks to Regional Operation Centre (ROC) Team, Taiwan Results – G4Brachy: farm

Susanna Guatelli Comment The job ends when all the tasks are executed on the workers If the job is split into a higher number of tasks, there is a higher chance that the workers finish the tasks at the same moment Example of a good job balancingExample of a job that can be improved from a performance point of view Worker number Time (seconds) Worker number Time (seconds)

Susanna Guatelli Ratio = Comparison Preliminary! Results – G4Brachy: farm (3) Test on LSF cluster of CERN: case of farm shared with other users The load of the cluster changes quickly in time The conditions of the test are not reproducible Execution in parallel mode on 5 workers of LSF DIANE used as intermediate layer

Susanna Guatelli Results – G4Brachy: GRID (1) The load of the GRID changes quickly in time The conditions of the test are not reproducible G4Brachy executed on the GRID on nodes located in Spain, Russia, Italy, Germany, Switzerland Execution on the GRID through DIANE, 20 M events,180 tasks, 30 workers Execution on the GRID, without DIANE Without DIANE: - 2 jobs not successfully executed due to set-up problems of the workers Through DIANE: - All the tasks are executed successfully on 22 workers - Not all the workers are initialized and used: on-going investigation Worker number Time (seconds) Worker number Time (seconds)

Susanna Guatelli How the GRID load changes Execution time of G4Brachy in two different conditions of the GRID DIANE used as intermediate layer Worker number Time (seconds) Worker number Time (seconds) 20 M events, 60 workers initialized, 360 tasks

Susanna Guatelli Conclusions General approach to obtain quick response from Geant4 simulations Advantage of using DIANE as intermediate layer in a dedicated farm or GRID –Transparency –Good separation of the subsystems –Good management of the CPU resources DIANE is very advantageous as an intermediate layer to the GRID from a performance point of view A quantitative analysis of the performance results is in progress Submission of this work for publication in IEEE Trans. Nucl. Sci. Acknowledgments to: M. Lamanna (CERN), Hurng-Chun Lee (ASGC, Taiwan), L. Moneta (CERN), A. Pfeiffer (CERN) Thanks to the GRID team of CERN and the Regional Operation Centre Team of Taiwan