AliEn Tutorial Panos Christakoglou University of Athens - CERN

Slides:



Advertisements
Similar presentations
May 25, ALICE Offline Tutorial Markus Oldenburg – CERN May 25, 2007 – University of São Paulo.
Advertisements

Proof/Proof-on-Demand at Atlas T3 Shuwei Ye (BNL) Aug 20, 2014.
Workload Management meeting 07/10/2004 Federica Fanzago INFN Padova Grape for analysis M.Corvo, F.Fanzago, N.Smirnov INFN Padova.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
Introduction to UNIX/Linux Exercises Dan Stanzione.
AliEn Tutorial MODEL th May, May 2009 Installation of the AliEn software AliEn and the GRID Authentication File Catalogue.
Enabling Grids for E-sciencE gLite training at Sinaia '06 Victor Penso Kilian Schwarz GSI Darmstadt Germany.
The ATLAS Production System. The Architecture ATLAS Production Database Eowyn Lexor Lexor-CondorG Oracle SQL queries Dulcinea NorduGrid Panda OSGLCG The.
ALICE Offline Tutorial Using the AliEn Grid Client GSI, 4 th Mar
BaBar WEB job submission with Globus authentication and AFS access T. Adye, R. Barlow, A. Forti, A. McNab, S. Salih, D. H. Smith on behalf of the BaBar.
AliEn uses bbFTP for the file transfers. Every FTD runs a server, and all the others FTD can connect and authenticate to it using certificates. bbFTP implements.
WORK ON CLUSTER HYBRILIT E. Aleksandrov 1, D. Belyakov 1, M. Matveev 1, M. Vala 1,2 1 Joint Institute for nuclear research, LIT, Russia 2 Institute for.
Chapter 3.  Help you understand different types of servers commonly found on a network including: ◦ File Server ◦ Application Server ◦ Mail Server ◦
Alice off-line meeting Alberto Colla Cern, October 3, 2005 AliEn How-To Alice off-line meeting Cern, October 3, 2005 Alberto Colla (Alice off-line Calibration.
The gLite API – PART I Giuseppe LA ROCCA INFN Catania ACGRID-II School 2-14 November 2009 Kuala Lumpur - Malaysia.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES P. Saiz (IT-ES) AliEn job agents.
Compiled Matlab on Condor: a recipe 30 th October 2007 Clare Giacomantonio.
110/10/06 - AliEn AliEn Tutorial Solutions Panos Christakoglou University of Athens - CERN.
Nadia LAJILI User Interface User Interface 4 Février 2002.
Group 1 : Grid Computing Laboratory of Information Technology Supervisors: Alexander Ujhinsky Nikolay Kutovskiy.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Andrey Meeting 7 October 2003 General scheme: jobs are planned to go where data are and to less loaded clusters SUNY.
GRID. Register Fill the form. Your IP (Juanjo) signature is needed and the one from the.
ROOT-CORE Team 1 PROOF xrootd Fons Rademakers Maarten Ballantjin Marek Biskup Derek Feichtinger (ARDA) Gerri Ganis Guenter Kickinger Andreas Peters (ARDA)
T3 analysis Facility V. Bucard, F.Furano, A.Maier, R.Santana, R. Santinelli T3 Analysis Facility The LHCb Computing Model divides collaboration affiliated.
Andrei Gheata, Mihaela Gheata, Andreas Morsch ALICE offline week, 5-9 July 2010.
Nurcan Ozturk University of Texas at Arlington US ATLAS Transparent Distributed Facility Workshop University of North Carolina - March 4, 2008 A Distributed.
Working with AliEn Kilian Schwarz ALICE Group Meeting April
ALICE analysis framework References for Analysis Tools used to the ALICE simulated data.
EGEE is a project funded by the European Union under contract IST “Interfacing to the gLite Prototype” Andrew Maier / CERN LCG-SC2, 13 August.
PROOF and ALICE Analysis Facilities Arsen Hayrapetyan Yerevan Physics Institute, CERN.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
Korea Workshop May GAE CMS Analysis (Example) Michael Thomas (on behalf of the GAE group)
JAliEn Java AliEn middleware A. Grigoras, C. Grigoras, M. Pedreira P Saiz, S. Schreiner ALICE Offline Week – June 2013.
M. Oldenburg GridPP Metadata Workshop — July 4–7 2006, Oxford University 1 Markus Oldenburg GridPP Metadata Workshop July 4–7 2006, Oxford University ALICE.
AliEn2 and GSI batch farm/disks/tape Current status Kilian Schwarz.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES L. Betev, A. Grigoras, C. Grigoras, P. Saiz, S. Schreiner AliEn.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
ALICE Computing Status1 ALICE Computing Status Are we ready? What about our choices? Workshop on LHC Computing 26 October Ren é Brun CERN Several slides.
+ AliEn status report Miguel Martinez Pedreira. + Touching the APIs Bug found, not sending site info from ROOT to central side was causing the sites to.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES The AliEn File Catalogue Jamboree on Evolution of WLCG Data &
AliEn Tutorial ALICE workshop Sibiu 20 th August, 2008 Pablo Saiz.
Wouter Verkerke, NIKHEF 1 Using ‘stoomboot’ for NIKHEF-ATLAS batch computing What is ‘stoomboot’ – Hardware –16 machines, each 2x quad-core Pentium = 128.
Geant4 GRID production Sangwan Kim, Vu Trong Hieu, AD At KISTI.
AAF tips and tricks Arsen Hayrapetyan Yerevan Physics Institute, Armenia.
ANALYSIS TRAIN ON THE GRID Mihaela Gheata. AOD production train ◦ AOD production will be organized in a ‘train’ of tasks ◦ To maximize efficiency of full.
HYDRA Framework. Setup of software environment Setup of software environment Using the documentation Using the documentation How to compile a program.
SEE-GRID-SCI WRF-ARW model: Grid usage The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures.
Advanced Computing Facility Introduction
Integrating ArcSight with Enterprise Ticketing Systems
Welcome to Indiana University Clusters
Integrating ArcSight with Enterprise Ticketing Systems
Installation of the ALICE Software
Open OnDemand: Open Source General Purpose HPC Portal
Basic aliensh S. Bagnasco, INFN Torino CNAF Nov 27-28, 2007.
MCproduction on the grid
ATLAS Use and Experience of FTS
UML diagrams for the AliEn job execution part and PackMan service
Running a job on the grid is easier than you think!
Running a job on the grid is easier than you think!
Status of the Analysis Task Force
Status of the CERN Analysis Facility
INFN-GRID Workshop Bari, October, 26, 2004
MC data production, reconstruction and analysis - lessons from PDC’04
Managing Software.
Compiling and Job Submission
Michael P. McCumber Task Force Meeting April 3, 2006
gLite Job Management Christos Theodosiou
Job Application Monitoring (JAM)
GENIUS Grid portal Hands on
Presentation transcript:

AliEn Tutorial Panos Christakoglou University of Athens - CERN

OUTLINE (1) Installation of the AliEn software. Authentication. General description of the shell: Basic commands. Basic functionalities. News on production. Working with the file catalog: Copying files from/to the catalog. File catalog structure. Querying the file catalog.

OUTLINE (2) Flow of the overall analysis procedure. Local analysis. Creation of tag files. Local analysis using the Event Tag System. Interactive analysis with AliEn stored files. Batch analysis: Flow of the procedure. Description of the files needed. Description of the jdl fields. Practical examples.

Installation of AliEn software Getting the installer

Installation of AliEn software Selecting the version

Installation of AliEn software Reuse of software

Installation of AliEn software Workspace directory

Installation of AliEn software Platform

Installation of AliEn software Supported platforms

Installation of AliEn software Installation directory

Installation of AliEn software Installation directory

Installation of AliEn software Overwrite files

Installation of AliEn software Selecting packages

Installation of AliEn software Progress bar

Installation of AliEn software Try it out Download the alien installer from http://alien.cern.ch. Make the file executable. Run the installer. Select v2.11. Platform should be i686. Select installation directory. Select the following packages: Client gShell ROOT

Password that you selected when making the request Authentication Getting a grid proxy Password that you selected when making the request for the certificate

alien-token-init <username> Authentication Getting an alien proxy alien-token-init <username>

Authentication Try it out Get a valid grid proxy. Get a valid alien token. Check the information of your proxy by typing: grid-proxy-info Check the information of your token by typing: alien-token-info

General description of the shell Accessing the shell

General description of the shell Basic commands (1)

General description of the shell Basic commands (2)

General description of the shell Whereis command

General description of the shell Viewing files (1)

General description of the shell Viewing files (2)

General description of the shell Try it out Check your user name by typing whoami. List the contents of your home directory. Check the working directory. Create the following directory structure: $HOME/bin $HOME/Tutorial/XML/jdl $HOME/Tutorial/XML/par $HOME/Tutorial/XML/output $HOME/Tutorial/XML/selectors $HOME/Tutorial/XML/macros Get the information of the file: /alice/cern.ch/user/p/pchrist/Tutorial/XML/selectors/esdPt.C

News on production File location p+p min bias @ 14TeV --- NEvents = 3579700: /alice/cern.ch/user/a/aliprod/prod2006_2/output_pp/ All RUNS have been tested (100-111 & 200-246). Merged tag files have been produced at the RUN level for all RunIds. p+p min bias @ 900GeV --- NEvents = 190400: /alice/sim/2006/pp_900GeV/ All RUNS have been tested (600-601). Muon events --- NEvents = 61150: /alice/sim/2006/muon/ All RUNS have been tested (500-502).

Working with the file catalog Copying from/to the catalog

File catalog structure Working with the file catalog The path name will be: for ‘real’ data: /data/<Year>/<AcceleratorPeriod>/<RunNumber>/ for simulated data: /sim/<Year>/<ProductionType>/<RunNumber>/ Subdirectories will be called: Raw/ cond/ reco/<PassX>/ESD/ reco/<PassX>/AOD/ … File names will look like this: <xxxx>.AliESD.root For further information see http://indico.cern.ch/conferenceDisplay.py?confId=3280 http://cern.ch/Oldenburg/MetaData/MetaData.doc MARKUS OLDENBURG AN INTERNAL NOTE IS ON THE WAY

Querying the file catalog (1) Working with the file catalog $ find -x pp /alice/cern.ch/user/p/pchrista/production/pp/PDC06/* AliESDs.root > pp.xml Redirect the output to the xml collection.

Working with the file catalog Querying the file catalog (2) $ find -x pp /alice/data/2008/LHC08a/*/reco/Pass3/* AliESDs.root Run:collision_system=”pp” and Run:stop<"2008-03-20 10:20:33" and Run:start>"2008-03-19" > pp.xml

Working with the file catalog Try it out (1) Create the following directory structure locally: $HOME/AliEn/PDC06/001 $HOME/AliEn/PDC06/002 $HOME/AliEn/Tags $HOME/AliEn/Local $HOME/AliEn/Interactive $HOME/AliEn/Batch Copy the following files to your local $HOME/AliEn/Local: /alice/cern.ch/user/p/pchrist/Tutorial/LOCAL/ESD.par /alice/cern.ch/user/p/pchrist/Tutorial/LOCAL/esdPt.h /alice/cern.ch/user/p/pchrist/Tutorial/LOCAL/esdPt.C /alice/cern.ch/user/p/pchrist/Tutorial/LOCAL/demoLocal.C

Working with the file catalog Try it out (2) Copy the following files to your local $HOME/AliEn/Tags: /alice/cern.ch/user/p/pchrist/Tutorial/TAGS/ESD.par /alice/cern.ch/user/p/pchrist/Tutorial/TAGS/CreateTags.C Copy the following files to $HOME/AliEn/PDC06/001 and $HOME/AliEn/PDC06/002: /alice/cern.ch/user/a/aliprod/prod2006_2/output_pp/100/002/AliESDs.root /alice/cern.ch/user/a/aliprod/prod2006_2/output_pp/100/004/AliESDs.root Query the f.c. and get all the tag files (*.tag.root) under: /alice/cern.ch/user/a/aliprod/prod2006_2/output_pp/100/* Get the output on your terminal. Count the files using the shell. Redirect the results to the tag100.xml collection. Repeat the previous exercise limiting the number of output files to 10 (find -l 10 ...) and copy the xml to your local $HOME/AliEn/Interactive.

Flow of the analysis procedure Query the file catalog Tag xml collection Selection criteria Query the Event Tag System User's macro Create an esdTree chain Extract an esd xml collection PROOF AliEn Local Process chain with selector

Local analysis Event Tag System

Local analysis Create tag files Setup par archive Load the needed libraries AliTagCreator *t = new AliTagCreator(); t->SetStorage(0); t->ReadLocalCollection(“/home/pchrist/PDC06/pp14TeV/”); t->MergeTags();

Local analysis with tags Setup par archive Load the needed libraries AliEventTagCuts *EvCuts = new AliEventTagCuts(); EvCuts->SetMultiplicityRange(0,1500); AliTagAnalysis *TagAna = new AliTagAnalysis(); TagAna->ChainLocalTags(“.”); analysischain = TagAna->QueryTags(EvCuts); const char *selectorfile = "esdPt.C"; analysischain->Process(selectorfile);

Local analysis Try it out Open your local $HOME/AliEn/Tags/CreateTags.C file and modify it accordingly: Change the line where you define where you have the locally stored ESDs. Run it to create the tag files. Delete the single tag files and stay with just the merged one. Go to your local $HOME/AliEn/Local directory and open the demoLocal.C file. Change the line where you define the location of the tag files and run the macro with root. Impose some selection criteria and rerun the example.

Interactive analysis with tags Setup par archive Load the needed libraries TGrid::Connect("alien://pcapiserv01.cern.ch:10000","pchrist") AliEventTagCuts *EvCuts = new AliEventTagCuts(); EvCuts->SetMultiplicityRange(0,1500); TAlienCollection* coll = TalienCollection::Open("tag100.xml"); TGridResult* TagResult = coll->GetGridResult(""); AliTagAnalysis *TagAna = new AliTagAnalysis(); TagAna->ChainGridTags(TagResult); analysischain = TagAna->QueryTags(EvCuts); const char *selectorfile = "esdPt.C"; analysischain->Process(selectorfile);

Interactive analysis Try it out Copy the following files to your local $HOME/AliEn/Interactive: /alice/cern.ch/user/p/pchrist/Tutorial/INTERACTIVE/ESD.par /alice/cern.ch/user/p/pchrist/Tutorial/INTERACTIVE/esdPt.h /alice/cern.ch/user/p/pchrist/Tutorial/INTERACTIVE/esdPt.C /alice/cern.ch/user/p/pchrist/Tutorial/INTERACTIVE/demoInteractive.C Go to your local $HOME/AliEn/Interactive directory and open the demoInteractive.C file. Change the line where you define the tag collection and put the name of the file you created by querying the f.c (it should be tag100.xml). Run the macro. Impose some selection criteria and rerun the example.

Batch analysis Analysis flow <collection> <event>...guid=”asdf1”...lfn=”/alice/cern.ch/.../file1.root” <event>...guid=”asdf2”...lfn=”/alice/cern.ch/.../file2.root” <event>...guid=”asdf3”...lfn=”/alice/cern.ch/.../file3.root” </collection> F.C. query JDL::InputDataCollection Job Optimiser Splitting <collection1> <event> lfn=”/alice/.../file1.root” </collection1> <event> lfn=”/alice/.../file2.root” <event> lfn=”/alice/.../file3.root” JDL::InputData JOB ANGENT ROOT XROOTD MSS TAlienCollection SITE A ROOT execution SITE B SITE C

Analysis flow with the tags Batch analysis Analysis flow with the tags Optimiser Splitting GUID-LFN EventList Global Chain C Event list gets lost during splitting InputData in new JDL Job agents C2 C1 C3 LFN-GUID EventList Cn AND C

Batch analysis Files needed Executable Par file Macro Selectors xml collection jdl

Executable Batch analysis #!/bin/bash echo =========================== echo $PATH echo $ROOTSYS echo $LD_LIBRARY_PATH echo ========================== root -b -x Analysis.C; IT SHOULD BE STORED UNDER $HOME/bin IN THE FILE CATALOG!!!

Macro Batch analysis Setup the par file – compile and load the libESD.so (or any necessary library that is needed for the analysis). Get the xml collection. Convert the collection to a list of files. Process the chain with the selector.

Batch analysis JDL fields (1) Executable: Compulsory field where we give the lfn of the executable that should be stored in /bin or $V0/bin or $HOME/bin. Arguments: They will be passed to the executable. Packages: Type packages in the shell to see what kind of packages are installed. InputFile: The files that will be transported to the node where the job will run. InputData: It will require that the job will be executed in a site close to the files specified here. InputDataList: The filename in which the Job Agent will write the InputData list. InputDataListFormat: The format of the InputData list.

Batch analysis JDL fields (2) OutputFile: The files that will be registered in the catalog once the job finishes. OutputArchive: What files will be archived in a zip file. Validationcommand: Specifies the script to be used as a validation script. Email: Receive a mail when the job finishes, TTL: The maximum run time of your job. Split: Split the jobs in several sub jobs.

Batch analysis Try it out (1) Copy the following files to $HOME/AliEn/Batch: /alice/cern.ch/user/p/pchrist/Tutorial/BATCH/ESD.par /alice/cern.ch/user/p/pchrist/Tutorial/BATCH/esdPt.h /alice/cern.ch/user/p/pchrist/Tutorial/BATCH/esdPt.C /alice/cern.ch/user/p/pchrist/Tutorial/BATCH/demoBatch.C /alice/cern.ch/user/p/pchrist/bin/batch.sh /alice/cern.ch/user/p/pchrist/Tutorial/BATCH/pp10.xml /alice/cern.ch/user/p/pchrist/Tutorial/BATCH/analysis.jdl Modify the Email, InputFile, InputDataCollection and OutputDir fields of your analysis.jdl.

Batch analysis Try it out (2) Copy the local Analysis.sh to your AliEn $HOME/bin. Copy the local analysis.jdl to your AliEn $HOME/Tutorial/XML/jdl. Copy the local pp10.xml to your AliEn $HOME/Tutorial/XML/xml. Copy the local ESD.par to your AliEn $HOME/Tutorial/XML/par. Copy the local esdPt.* to your AliEn $HOME/Tutorial/XML/selectors. Copy the local demoBatch.C to your AliEn $HOME/Tutorial/XML/macros. Go to your AliEn $HOME/Tutorial/XML/jdl and submit the job by typing: “submit analysis.jdl”.

Batch analysis Try it out (3) Check your job priority by typing: “queue priority jobs $username”. Display the jdl of your job by typing: “ps -jdl $jobid”. Trace the status of your job by typing: “ps -trace $jobid”. When the job 's status turns to RUNNING you can get the stdout and stderr of the job by typing: “spy $jobid stdout”. “spy $jobid stderr”.

References (1) Registration – Certificates: AliEn: Gshell: http://alien.cern.ch/twiki/bin/view/Alice/UserRegistration AliEn: http://alien.cern.ch Gshell: http://alien.cern.ch/twiki/bin/view/AliEn/GAPI User's guide: http://project-arda-dev.web.cern.ch/project-arda- dev/alice/apiservice/AA-UserGuide-0.0m.pdf

References (2) Previous tutorials: File catalog structure – Queries: http://project-arda-dev.web.cern.ch/project-arda- dev/alice/apiservice/Tutorial-Exercises.pdf http://project-arda-dev.web.cern.ch/project-arda- dev/alice/apiservice/Tutorial-Exercises-Solutions.pdf http://project-arda-dev.web.cern.ch/project-arda- dev/alice/apiservice/Tutorial2-Exercises.pdf http://project-arda-dev.web.cern.ch/project-arda- dev/alice/apiservice/Tutorial2-Exercises-Solutions.pdf File catalog structure – Queries: http://pcaliweb02.cern.ch/Offline/Analysis/RunEventTagSystem/RunTags .html#Run/File%20metadata

References (3) Event Tag System: Creation of tag files: http://pcaliweb02.cern.ch/Offline/Analysis/RunEventTagSystem/EventTa gs.html#Event%20tag%20system ALICE internal note. Creation of tag files: http://pcaliweb02.cern.ch/Offline/Analysis/RunEventTagSystem/EventTa gsCreation.html#Create%20tags%20howto Analysis using the Event Tag System: http://pcaliweb02.cern.ch/Offline/Analysis/RunEventTagSystem/EventTa gsAnalysis.html#Analysis%20with%20tags

Reference (4) File level metadata: http://cern.ch/Oldenburg/MetaData/MetaData.doc

BACKUP

Problems with gcc versions Installation Problems with gcc versions AliEn software comes with a precompiled library with gcc 3.2.3 which is also shipped with AliEn. If you are using a different version of gcc (check it with gcc –v) then do the following: cd $ALIEN/api/src ./recompile.gapi cd $ALIEN/api/lib Copy all the libgapiUIXX files to the libgapiUI.so.XX