Analysis Trains - Reloaded

Andrei Gheata, Costin Grigoras, Jan Fiete Grosse-Oetringhaus

Idea
- Presented in the offline meeting in June and the offline week that followed
- Manage trains using MonALISA
- Users register wagons
- Train operators compose trains
- Automatic testing per wagon
- Train file generation
- Submission managed by ML (existing LPM infrastructure)

Configuration & Testing
Train Configuration
- New class AliAnalysisTaskCfg
- Contains description of wagons (add task macro, libraries, dependencies)
- Reads/writes to a text file format (used to read train configuration from ML)
Testing
- Uses alientest04 machine
- Downloads AliEn packages (ROOT, AliRoot)
- Copies a part of the input data set locally
- Runs tests per wagon
- Uses syswatch to extract mem/cpu information
- Also tests a "baseline" task, which is empty
[Diagram: train of wagons: Phys Sel, Centr Sel, User A, User B, User C]
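The wagon description and its text round trip can be illustrated with a minimal Python sketch. It is not the AliAnalysisTaskCfg class and does not reproduce its file format: the `WagonCfg` name, the key=value layout, and the macro and library names are all invented for illustration. It only shows the idea of storing macro, libraries and dependencies as text and deriving a valid run order from the dependencies.

```python
from dataclasses import dataclass, field
from graphlib import TopologicalSorter  # Python 3.9+


@dataclass
class WagonCfg:
    """Hypothetical stand-in for one wagon description."""
    name: str
    macro: str                                             # AddTask macro location
    libraries: list[str] = field(default_factory=list)     # libraries to load
    dependencies: list[str] = field(default_factory=list)  # wagons that must run first

    def to_text(self) -> str:
        # Invented key=value layout, NOT the real AliAnalysisTaskCfg format.
        return "\n".join([
            f"name={self.name}",
            f"macro={self.macro}",
            f"libraries={','.join(self.libraries)}",
            f"dependencies={','.join(self.dependencies)}",
        ])

    @classmethod
    def from_text(cls, text: str) -> "WagonCfg":
        pairs = dict(line.split("=", 1) for line in text.splitlines() if line.strip())

        def items(value: str) -> list[str]:
            return [item for item in value.split(",") if item]

        return cls(pairs["name"], pairs["macro"],
                   items(pairs["libraries"]), items(pairs["dependencies"]))


def run_order(wagons: list[WagonCfg]) -> list[str]:
    """Return wagon names so that every wagon comes after its dependencies."""
    graph = {w.name: set(w.dependencies) for w in wagons}
    return list(TopologicalSorter(graph).static_order())


# Hypothetical wagons mirroring the diagram on the slide.
wagons = [
    WagonCfg("UserA", "AddTaskUserA.C", ["libUserA"], ["PhysSel", "CentrSel"]),
    WagonCfg("PhysSel", "AddTaskPhysSel.C"),
    WagonCfg("CentrSel", "AddTaskCentrSel.C", [], ["PhysSel"]),
]
order = run_order(wagons)
restored = [WagonCfg.from_text(w.to_text()) for w in wagons]
```

Here `order` places PhysSel before CentrSel and both before UserA, and `restored` equals the original list, which is the property a text-based exchange with ML would need.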

Workflow
1. User adds wagons
2. Train operator composes train
3. Test files are generated and the test is executed (test machine)
4. Train operator recomposes the train after the test, using the test results
5. Train jdl + scripts are generated
6. Train operator runs train
[Diagram: User, Train operator, LPM, MonALISA, AliEn and the test machine, exchanging config, train files and test results]

Screenshot
- Handler configuration
- Wagon configuration
- Data configuration
- Testing and running status

Handler

Wagon

Dataset

Run

Syswatch

Demo…
Enough theory, let's do some clicking…
http://alimonitor.cern.ch/trains

Some More Details
- Train runs with an analysis tag
- All code + "AddTask" macro has to be in the tag (no par file!)
- Output stored in the input data directory (like AOD, QA trains). E.g.:
  /alice/data/2010/LHC10h/000137366/ESDs/pass2/PWG4/CorrelationTrain/7_20111117_1350
- Current infrastructure only allows per-run merging
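To make the layout of the example output directory concrete, the following Python sketch splits it into its components. The field names, and in particular the reading of the last element as "train run number, date, time", are assumptions based only on the single example path on the slide.

```python
from datetime import datetime
from typing import NamedTuple


class TrainOutput(NamedTuple):
    year: int
    period: str
    run: int
    data_type: str
    reco_pass: str
    group: str
    train: str
    train_run: int      # assumed meaning of the leading number in the last element
    started: datetime   # assumed meaning of the YYYYMMDD_HHMM part


def parse_output_dir(path: str) -> TrainOutput:
    """Split a train output directory of the form shown on the slide."""
    parts = path.strip("/").split("/")
    if len(parts) != 10 or parts[:2] != ["alice", "data"]:
        raise ValueError(f"unexpected path layout: {path}")
    _, _, year, period, run, data_type, reco_pass, group, train, tag = parts
    number, date, time = tag.split("_")
    return TrainOutput(
        year=int(year),
        period=period,
        run=int(run),  # drops the zero padding of the run number
        data_type=data_type,
        reco_pass=reco_pass,
        group=group,
        train=train,
        train_run=int(number),
        started=datetime.strptime(date + time, "%Y%m%d%H%M"),
    )


info = parse_output_dir(
    "/alice/data/2010/LHC10h/000137366/ESDs/pass2/PWG4/CorrelationTrain/7_20111117_1350"
)
```

Because the output lives under one run directory, grouping such paths by `info.run` matches the per-run merging mentioned above.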

Open Issues
ROOT
- Fix in TGridJDL was required
- Is in v5-30-00-patches, but not yet deployed in the Grid
- Needed for train operation
AOD analysis
- Found huge leak even in empty analysis (20kb/event)
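A per-event leak figure like the one quoted above can be estimated by fitting a straight line to memory readings taken at different event counts. The Python sketch below does this with an ordinary least-squares slope. The sample values are made up, and the input is assumed to be already extracted (events processed, memory) pairs, not the actual syswatch output format.

```python
def leak_per_event(samples: list[tuple[float, float]]) -> float:
    """Least-squares slope of memory versus number of processed events.

    samples: (events_processed, memory) pairs; the result is in memory units per event.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    mean_x = sum(x for x, _ in samples) / n
    mean_y = sum(y for _, y in samples) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in samples)
    if sxx == 0:
        raise ValueError("all samples were taken at the same event count")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in samples)
    return sxy / sxx


# Made-up readings: memory grows by 20 units for every event processed.
samples = [(0, 500_000), (1000, 520_000), (2000, 540_000), (3000, 560_000)]
slope = leak_per_event(samples)
```

A fit over several readings is less sensitive to a single noisy measurement than the difference between the first and last reading.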

Old Slides

Idea
- Setting up and operating analysis trains is a lot of work
  - Specific settings for each wagon
  - Wagons have bugs, leaks etc.
→ Automatic configuration needed
→ Automatic testing needed (on a subset of the same data the train will run on)
- We have collected some ideas which we want to try out, starting with the PWG4 train

High Level Description
- Train runs on analysis tag (no modifications allowed)
- User registers task
- Train operator triggers train test
- Test results are fed back to MonALISA, where the user & operator can see them
- Operator starts train with tasks that succeeded and have no (significant) leaks
- These steps are operated from MonALISA
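The operator's selection step can be pictured as a simple filter over the per-wagon test results. In this Python sketch the record layout and the leak threshold are hypothetical; the slides do not define what counts as a "significant" leak.

```python
from dataclasses import dataclass


@dataclass
class WagonTest:
    """Hypothetical summary of one per-wagon test."""
    name: str
    succeeded: bool
    leak_per_event: float  # memory growth per event, in the units of the test report


def select_wagons(results: list[WagonTest], max_leak_per_event: float) -> list[str]:
    """Keep wagons whose test succeeded and whose leak is within the threshold."""
    return [
        r.name
        for r in results
        if r.succeeded and r.leak_per_event <= max_leak_per_event
    ]


# Made-up results; the threshold of 1.0 is an arbitrary illustration.
results = [
    WagonTest("UserA", succeeded=True, leak_per_event=0.1),
    WagonTest("UserB", succeeded=False, leak_per_event=0.0),
    WagonTest("UserC", succeeded=True, leak_per_event=20.0),
]
selected = select_wagons(results, max_leak_per_event=1.0)
```

With these inputs only UserA is selected: UserB failed its test and UserC leaks more than the chosen threshold.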

Some Technical Details
Container that contains task configuration (already shown by Andrei)
- Currently identified configuration items
  - Location of AddTask macro + parameters
  - Required libraries
  - Tasks that have to run before
Train testing
- Tasks are tested one by one
- On subset of data on which the train will run
- CPU/Real time, memory extracted w.r.t. baseline
- Baseline from test with just PhysSel + Centrality
Train macro generation
- By analysis framework using the wagons selected by the operator
- Macros for testing (wagon by wagon)
- Macros for full train (all wagons)
Overall train submission
- Using the already existing ML submission framework (including merging jobs)
[Diagram: train of wagons: Phys Sel, Centr Sel, User A, User B, User C]
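Extracting resource usage with respect to the baseline amounts to a subtraction. The Python sketch below assumes each wagon test runs the baseline tasks plus one wagon, so the difference approximates the cost of the wagon alone. The `Usage` record and all numbers are invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Usage:
    """Hypothetical resource summary of one test job."""
    cpu_s: float   # CPU time in seconds
    wall_s: float  # real (wall-clock) time in seconds
    mem: float     # memory at the end of the test


def relative_to_baseline(wagon_test: Usage, baseline: Usage) -> Usage:
    """Subtract the baseline-only test from a baseline + wagon test."""
    return Usage(
        cpu_s=wagon_test.cpu_s - baseline.cpu_s,
        wall_s=wagon_test.wall_s - baseline.wall_s,
        mem=wagon_test.mem - baseline.mem,
    )


# Made-up measurements on the same data subset.
baseline = Usage(cpu_s=120.0, wall_s=150.0, mem=400_000.0)
user_a = Usage(cpu_s=180.0, wall_s=230.0, mem=650_000.0)
extra = relative_to_baseline(user_a, baseline)
```

Running the baseline on the same data subset as the wagon tests keeps the two measurements comparable.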