Andrei Gheata, Mihaela Gheata, Andreas Morsch ALICE offline week, 5-9 July 2010.

Slides:



Advertisements
Similar presentations
5/2/  Online  Offline 5/2/20072  Online  Raw data : within the DAQ monitoring framework  Reconstructed data : with the HLT monitoring framework.
Advertisements

CERN LCG Overview & Scaling challenges David Smith For LCG Deployment Group CERN HEPiX 2003, Vancouver.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
The LEGO Train Framework
– Unfortunately, this problems is not yet fully under control – No enough information from monitoring that would allow us to correlate poor performing.
Hands-On Microsoft Windows Server 2003 Administration Chapter 6 Managing Printers, Publishing, Auditing, and Desk Resources.
Backup and Recovery Part 1.
Trains status&tests M. Gheata. Train types run centrally FILTERING – Default trains for p-p and Pb-Pb, data and MC (4) Special configuration need to be.
Staging to CAF + User groups + fairshare Jan Fiete Grosse-Oetringhaus, CERN PH/ALICE Offline week,
Active Security Infrastructure Stuart Kenny Trinity College Dublin.
The ATLAS Production System. The Architecture ATLAS Production Database Eowyn Lexor Lexor-CondorG Oracle SQL queries Dulcinea NorduGrid Panda OSGLCG The.
Unit tests, Integration tests Physics tests Andrea Dotti, Gunter Folger, Pere Mato CERN – PH/SFT Geant4 workshop 2012.
The ALICE Analysis Framework A.Gheata for ALICE Offline Collaboration 11/3/2008 ACAT'081A.Gheata – ALICE Analysis Framework.
Rsv-control Marco Mambelli – Site Coordination meeting October 1, 2009.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
The Pipeline Processing Framework LSST Applications Meeting IPAC Feb. 19, 2008 Raymond Plante National Center for Supercomputing Applications.
RISICO on the GRID architecture First implementation Mirko D'Andrea, Stefano Dal Pra.
Experience with analysis of TPC data Marian Ivanov.
Costin Grigoras ALICE Offline. In the period of steady LHC operation, The Grid usage is constant and high and, as foreseen, is used for massive RAW and.
The huge amount of resources available in the Grids, and the necessity to have the most up-to-date experimental software deployed in all the sites within.
CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Tracking your tasks with Task Monitoring PAT eLearning – Module 11 Edward.
Analysis infrastructure/framework A collection of questions, observations, suggestions concerning analysis infrastructure and framework Compiled by Marco.
Chapter 10 Chapter 10: Managing the Distributed File System, Disk Quotas, and Software Installation.
Infrastructure for QA and automatic trending F. Bellini, M. Germain ALICE Offline Week, 19 th November 2014.
David Adams ATLAS DIAL/ADA JDL and catalogs David Adams BNL December 4, 2003 ATLAS software workshop Production session CERN.
PWG3 Analysis: status, experience, requests Andrea Dainese on behalf of PWG3 ALICE Offline Week, CERN, Andrea Dainese 1.
Analysis trains – Status & experience from operation Mihaela Gheata.
5/2/  Online  Offline 5/2/20072  Online  Raw data : within the DAQ monitoring framework  Reconstructed data : with the HLT monitoring framework.
ALICE analysis framework References for Analysis Tools used to the ALICE simulated data.
PROOF and ALICE Analysis Facilities Arsen Hayrapetyan Yerevan Physics Institute, CERN.
HOW A COMPUTER PROCESSES DATA. What is hardware? Hardware is the electric, electronic and mechanical equipment that makes up a computer What is software?
(1) Introduction to Continuous Integration Philip Johnson Collaborative Software Development Laboratory Information and Computer Sciences University of.
(1) Introduction to Continuous Integration Philip Johnson Collaborative Software Development Laboratory Information and Computer Sciences University of.
Development of test suites for the certification of EGEE-II Grid middleware Task 2: The development of testing procedures focused on special details of.
A. Gheata, ALICE offline week March 09 Status of the analysis framework.
M. Oldenburg GridPP Metadata Workshop — July 4–7 2006, Oxford University 1 Markus Oldenburg GridPP Metadata Workshop July 4–7 2006, Oxford University ALICE.
AliRoot survey: Analysis P.Hristov 11/06/2013. Are you involved in analysis activities?(85.1% Yes, 14.9% No) 2 Involved since 4.5±2.4 years Dedicated.
Gustavo Conesa ALICE offline week Gamma and Jet correlations analysis framework Short description, Status, HOW TO use and TO DO list 1/9.
1 Offline Week, October 28 th 2009 PWG3-Muon: Analysis Status From ESD to AOD:  inclusion of MC branch in the AOD  standard AOD creation for PDC09 files.
Pavel Nevski DDM Workshop BNL, September 27, 2006 JOB DEFINITION as a part of Production.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
Data processing Offline review Feb 2, Productions, tools and results Three basic types of processing RAW MC Trains/AODs I will go through these.
M. Gheata ALICE offline week, October Current train wagons GroupAOD producersWork on ESD input Work on AOD input PWG PWG31 (vertexing)2 (+
Analysis Trains Costin Grigoras Jan Fiete Grosse-Oetringhaus ALICE Offline Week,
PWG3 Analysis: status, experience, requests Andrea Dainese on behalf of PWG3 ALICE Offline Week, CERN, Andrea Dainese 1.
PWG3 analysis (barrel)
Analysis experience at GSIAF Marian Ivanov. HEP data analysis ● Typical HEP data analysis (physic analysis, calibration, alignment) and any statistical.
Analysis train M.Gheata ALICE offline week, 17 March '09.
M. Gheata ALICE offline week, 24 June  A new analysis train macro was designed for production  /ANALYSIS/macros/AnalysisTrainNew.C /ANALYSIS/macros/AnalysisTrainNew.C.
Alien and GSI Marian Ivanov. Outlook GSI experience Alien experience Proposals for further improvement.
Analysis efficiency Andrei Gheata ALICE offline week 03 October 2012.
Analysis framework plans A.Gheata Offline week 13 July 2011.
Some topics for discussion 31/03/2016 P. Hristov 1.
AliRoot survey: Calibration P.Hristov 11/06/2013.
AAF tips and tricks Arsen Hayrapetyan Yerevan Physics Institute, Armenia.
LEGO train limits Offline week, 31/03/2016 LB 1. Incident of 22/03/2016 A train operator has submitted 107 LEGO trains in one go – These resulted in more.
V4-19-Release P. Hristov 11/10/ Not ready (27/09/10) #73618 Problems in the minimum bias PbPb MC production at 2.76 TeV #72642 EMCAL: Modifications.
ANALYSIS TRAIN ON THE GRID Mihaela Gheata. AOD production train ◦ AOD production will be organized in a ‘train’ of tasks ◦ To maximize efficiency of full.
Jan Fiete Grosse-Oetringhaus
Analysis trains – Status & experience from operation
Analysis tools in ALICE
Running a job on the grid is easier than you think!
Developments of the PWG3 muon analysis code
Status of the CERN Analysis Facility
ALICE analysis preservation
Experience in ALICE – Analysis Framework and Train
Analysis Trains - Reloaded
Analysis framework - status
Overview Activities from additional UP disciplines are needed to bring a system into being Implementation Testing Deployment Configuration and change management.
Presentation transcript:

Andrei Gheata, Mihaela Gheata, Andreas Morsch ALICE offline week, 5-9 July 2010

Parameter containers Allow saving configuration AND/OR post-processed data using the output container formalism Already introduced last time, now few bugs fixed in the implementation Simple procedure allowing saving this kind of data together with the final merged outputs Define an output slot in the task corresponding to this data In the AddTask macro, create a container of type kParamContainer and connect to this slot mgr->CreateContainer(“cMyTaskConfig”, TList::Class(), mgr->CreateContainer(“cMyTaskConfig”, TList::Class(), AliAnalysisManager::kParamContainer, Form(“%s:ConfigData”, myOutputFile)); In LocalInit() use PostData() for the configuration data. In Terminate() one can also PostData() for any post-processed data (like normalization or fitting) Please start using this very important new feature ! Please start using this very important new feature ! Allows keeping the configuration with the results Allows keeping the configuration with the results We will also do it for the physics selection task

Alien plugin (1) Most delevopment done in this area The plugin became the preferred way of doing grid analysis… Merging via generated JDL script plugin->SetMergeViaJDL() Files for merging (JDL, merge macro, executable and validation script) are generated and copied in alien space. Submitted in “terminate” mode only The Terminate for the tasks get executed at the end of the merging jobs (!) Currently the merging is done per master job – to be extended

Alien plugin (2) plugin->SetOverwriteMode() Will trigger overwriting input data collections AND existing output files. Make sure not to have this if you need to resume a failed merging of outputs plugin->SetFastReadOption() Try to reduce default xrootd timeouts to deal with the fact that currently alien does not prioritize copies which are closer or more accessible Use on your own risk – you will be warned whenever you use this feature.

Alien plugin (3) Improved feedback from failing steps of running via the plugin Verbosity of copy/submit commands Check if file copying works, printing out alien_CLOSE_SE Better indications of what to do in case of failure Automatic check if the generated dataset is empty “Micro-control” of plugin components Enable/disable file copying in AliEn Submission, merging (ON/OFF) to be implemented

Alien plugin (4) SetDefaultOutputs() enabled as default Do not specify your outputs by hand anymore Will automatically archive all outputs in root_archive.zip – SetOutputArchive produces a Fatal if default outputs are set Using specific SE’s via: or via: plugin->SetPreferredSE() not allowed used now by default Use: SetNumberOfReplicas(max=4) stdout and stderr kept only for the jobs that fail for some reason

Plans Unified plugin submission both in GRID and PROOF- based analysis facilities Extend plugin “micro” controls to allow automatic generation and update for production jobs Extended list of cuts accommodating user cuts besides the standard physics selection

Analysis tender Supplies accessing and applying calibration improvements w.r.t pass1 to ESD Tasks benefit of this change Event selection can be used AOD’s can be streamed out

Tender in production Used for pass1 filtering

Analysis trains PWG1 QA trains running now automatically Policy: A new QA train version every new AN tag Memory was an issue – now under control (see next) No QA flag for the run yet as outcome of this procedure ! Event selected filtering trains starting Producing standard AOD’s (UA1 jets included) + muon AOD’s + Vertexing deltas Mostly starting from pass2 data, but can run also on pass1 using the tender Please test your analysis on this data ! PWGn physics wagons should validate their configuration used in the central analysis train for running on data !

Trains

Task buzzer Investigating some huge leaks of O(100K/event) Testing tcmalloc from Google performance tools In the hope of fixing fragmentation problems The hope was all that’s left at the end… Found guilty tasks creating in-memory ntuples and added a couple of blockers in Savannah Some more strict policy for the task output will probably need to be enforced Very time consuming process cannot continue like that A new automatic tool is running in grid – the task buzzer… What it does: memory profiling for single tasks

Buzzer in action Run after every deployed AN tag Will start buzzing when detecting leaks…

Documentation Analysis framework FAQ ion_of_the_day ion_of_the_day An attempt to collect useful information in one place Will hopefully grow bigger