Introduction to PanDA Client Tools – pathena, prun and others

Slides:



Advertisements
Similar presentations
Grid Wizard Enterprise Basic Tutorial Using Web Control Panel.
Advertisements

Metadata Progress GridPP18 20 March 2007 Mike Kenyon.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
Submitting jobs to the grid Argonne Jamboree January 2010 R. Yoshida Esteban Fullana.
ATLAS Software Kaushik De University of Texas At Arlington based on a tutorial by P. Calafiura (LBNL) LHC Computing Workshop, Ankara May 2, 2008.
DIRAC API DIRAC Project. Overview  DIRAC API  Why APIs are important?  Why advanced users prefer APIs?  How it is done?  What is local mode what.
GANGA PANDA Dietrich Liko. Motivation Access to OSG resources by GANGA Collaboration with US colleagues Possibly an alternative way of submitting jobs.
Advanced File Processing
The ATLAS Production System. The Architecture ATLAS Production Database Eowyn Lexor Lexor-CondorG Oracle SQL queries Dulcinea NorduGrid Panda OSGLCG The.
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
Nurcan Ozturk University of Texas at Arlington Grid User Training for Local Community TUBITAK ULAKBIM, Ankara, Turkey April 5 - 9, 2010 Overview of ATLAS.
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL March 18, 2004 ATLAS Software Workshop Grid session.
Marianne BargiottiBK Workshop – CERN - 6/12/ Bookkeeping Meta Data catalogue: present status Marianne Bargiotti CERN.
How to Install and Use the DQ2 User Tools US ATLAS Tier2 workshop at IU June 20, Bloomington, IN Marco Mambelli University of Chicago.
David Adams ATLAS AJDL: Analysis Job Description Language David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
Advanced File Processing. 2 Objectives Use the pipe operator to redirect the output of one command to another command Use the grep command to search for.
Chapter Five Advanced File Processing. 2 Objectives Use the pipe operator to redirect the output of one command to another command Use the grep command.
Stuart Wakefield Imperial College London Evolution of BOSS, a tool for job submission and tracking W. Bacchi, G. Codispoti, C. Grandi, INFN Bologna D.
Submitting jobs to the grid Argonne Jamboree January 2010 R. Yoshida (revised March 2010) Esteban Fullana.
Enabling Grids for E-sciencE EGEE-III INFSO-RI Using DIANE for astrophysics applications Ladislav Hluchy, Viet Tran Institute of Informatics Slovak.
Tour Overview Introduction Collage Basics Collage Basics (Templates and Tools) Computer Configuration Bookmark Collage Getting Started Tour Collage Terminology.
October 8, 2002P. Nilsson, SPD General Meeting1 Paul Nilsson, SPD General Meeting, Oct. 8, 2002 New tools and software updates Test beam analysis Software.
Getting started DIRAC Project. Outline  DIRAC information system  Documentation sources  DIRAC users and groups  Registration with DIRAC  Getting.
Chapter Five Advanced File Processing. 2 Lesson A Selecting, Manipulating, and Formatting Information.
David Adams ATLAS DIAL/ADA JDL and catalogs David Adams BNL December 4, 2003 ATLAS software workshop Production session CERN.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Katy Wolstencroft myGrid University of Manchester IMPACT/Taverna Hackathon 2011.
Nurcan Ozturk University of Texas at Arlington US ATLAS Transparent Distributed Facility Workshop University of North Carolina - March 4, 2008 A Distributed.
A PanDA Backend for the Ganga Analysis Interface J. Elmsheuser 1, D. Liko 2, T. Maeno 3, P. Nilsson 4, D.C. Vanderster 5, T. Wenaus 3, R. Walker 1 1: Ludwig-Maximilians-Universität.
Introduction to Taverna Online and Interaction service Aleksandra Pawlik University of Manchester.
David Adams ATLAS DIAL: Distributed Interactive Analysis of Large datasets David Adams BNL August 5, 2002 BNL OMEGA talk.
Module: Software Engineering of Web Applications Chapter 2: Technologies 1.
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
PERFORMANCE AND ANALYSIS WORKFLOW ISSUES US ATLAS Distributed Facility Workshop November 2012, Santa Cruz.
K. Harrison CERN, 22nd September 2004 GANGA: ADA USER INTERFACE - Ganga release status - Job-Options Editor - Python support for AJDL - Job Builder - Python.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
TAGS in the Analysis Model Jack Cranshaw, Argonne National Lab September 10, 2009.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
LHCbDirac and Core Software. LHCbDirac and Core SW Core Software workshop, PhC2 Running Gaudi Applications on the Grid m Application deployment o CVMFS.
ELSSISuite Services QIZHI ZHANG Argonne National Laboratory on behalf of the TAG developers group ATLAS Software and Computing Week, 4~8 April, 2011.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
Finding Data in ATLAS. May 22, 2009Jack Cranshaw (ANL)2 Starting Point Questions What is the latest reprocessing of cosmics? Are there are any AOD produced.
David Adams ATLAS ATLAS Distributed Analysis and proposal for ATLAS-LHCb system David Adams BNL March 22, 2004 ATLAS-LHCb-GANGA Meeting.
Future of Distributed Production in US Facilities Kaushik De Univ. of Texas at Arlington US ATLAS Distributed Facility Workshop, Santa Cruz November 13,
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
Acronyms GAS - Grid Acronym Soup, LCG - LHC Computing Project EGEE - Enabling Grids for E-sciencE.
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
Starting Analysis with Athena (Esteban Fullana Torregrosa) Rik Yoshida High Energy Physics Division Argonne National Laboratory.
ATLAS Distributed Computing Tutorial Tags: What, Why, When, Where and How? Mike Kenyon University of Glasgow.
J. Shank DOSAR Workshop LSU 2 April 2009 DOSAR Workshop VII 2 April ATLAS Grid Activities Preparing for Data Analysis Jim Shank.
Panda Monitoring, Job Information, Performance Collection Kaushik De (UT Arlington), Torre Wenaus (BNL) OSG All Hands Consortium Meeting March 3, 2008.
9/21/04 James Gallagher Server Installation and Testing: Hands-on ● Install the CGI server with the HDF and FreeForm handlers ● Link data so the server.
Scientific Data Processing Portal and Heterogeneous Computing Resources at NRC “Kurchatov Institute” V. Aulov, D. Drizhuk, A. Klimentov, R. Mashinistov,
ATLAS TAG Services Jack Cranshaw with support from Thomas Doherty, Julius Hrivnac, Marcin Nowak.
Development Environment
L’analisi in LHCb Angelo Carbone INFN Bologna
Practical: The Information Systems
IW2D migration to HTCondor
Tango Administrative Tools
lcg-infosites documentation (v2.1, LCG2.3.1) 10/03/05
A full demonstration based on a “real” analysis scenario
Status of Storm+Lustre and Multi-VO Support
Vandy Berten Luc Goossens Alvin Tan
Guide To UNIX Using Linux Third Edition
Exploring the Power of EPDM Tasks - Working with and Developing Tasks in EPDM By: Marc Young XLM Solutions
EGEE Middleware: gLite Information Systems (IS)
Introduction to Athena
DIBBs Brown Dog BDFiddle
Presentation transcript:

Introduction to PanDA Client Tools – pathena, prun and others Nurcan Ozturk University of Texas at Arlington First ATLAS-South Caucasus Software / Computing Workshop & Tutorial 25-29 October 2010, Tbilisi, Georgia

PanDA Client Tools PanDA = Production and Distributed Analysis System for ATLAS PanDA client consists of five tools to submit or manage analysis jobs on PanDA DA on PanDA page: https://twiki.cern.ch/twiki/bin/viewauth/Atlas/DAonPanda pathena How to submit Athena jobs prun How to submit general jobs (ROOT, python, sh, exe, …) psequencer How to perform sequential jobs/operations (e.g. submit job + download output) pbook Bookkeeping (browsing, retry, kill) of analysis jobs puserinfo Access control on PanDA analysis queues Nurcan Ozturk

What is pathena? To submit Athena jobs to PanDA A simple command line tool, but contains advanced capabilities for more complex needs A consistent user interface to Athena When you run Athena with: $ athena jobOptions.py all you need to do is: $ pathena jobOptions.py --inDS inputDatasetName --outDS outputDatasetName a dataset which contains the input files a dataset which will contain the output files Nurcan Ozturk

Launching a pathena job $ pathena jobOptions.pythia.py --outDS user.nurcan.pythiaEventGeneration INFO : extracting run configuration INFO : ConfigExtractor > No Input INFO : ConfigExtractor > Output=STREAM1 pythia.pool.root INFO : ConfigExtractor > RndmStream PYTHIA INFO : ConfigExtractor > RndmStream PYTHIA_INIT INFO : archiving source files INFO : archiving InstallArea INFO : checking symbolic links INFO : uploading source/jobO files INFO : trying to get the latest version number for DBRelease=LATEST INFO : use ddo.000001.Atlas.Ideal.DBRelease.v120901:DBRelease-12.9.1.tar.gz INFO : query files in ddo.000001.Atlas.Ideal.DBRelease.v120901 INFO : submit to ANALY_SLAC =================== JobsetID : 14390 JobID : 14391 Status : 0 > build PandaID=1132989222 > run PandaID=1132989223 Recreates the job’s environment at the grid site Runs the job option file at the grid site Nurcan Ozturk

Job Cycle Nurcan Ozturk

What is prun? To submit general jobs to PanDA: ROOT (ARA- AthenaRootAccess), Python, shell script, exe, … ATLAS analysis has two stages Run Athena on AOD/ESD to produce DPD  pathena Run ROOT, Python, shell scripts, etc. to produce final plots  prun How to run prun: Example in the twiki page: https://twiki.cern.ch/twiki/bin/viewauth/Atlas/SouthCaucasusComputingTutorial $ prun --outDS user.nurcan.pruntest --exec HelloWorld.py output dataset name name of the python script Nurcan Ozturk

Launching a prun job $ prun --outDS user.nurcan.pruntest --exec HelloWorld.py INFO : gathering files under /afs/cern.ch/user/n/nozturk/scratch0/16.0.1/run INFO : upload source files INFO : submit to ANALY_SARA =================== JobsetID : 14388 JobID : 14389 Status : 0 > build PandaID=1132981745 > run PandaID=1132981750 Recreates the job’s environment at the grid site Runs the job option file at the grid site Nurcan Ozturk

What is pbook? Bookkeeping of PanDA jobs: Browsing Retry Kill Makes a local sqlite3 repository to keep personal job information: IMAP like sync-diff mechanism Not scanning the global PanDA repository, thus quick response Dual user interface Command-line Graphical Nurcan Ozturk

Monitoring a PanDA job (1/2) Go to PanDA monitor at http://panda.cern.ch and enter your Panda jobID on the left panel. Find yourself here Nurcan Ozturk

Monitoring a PanDA job (2/2) pathena run job finished. output is dataset container (now default in pathena/prun) build job Nurcan Ozturk

More options with pathena $ pathena -h Usage: pathena [options] <jobOption1.py> [<jobOption2.py> [...]] 'pathena --help' prints a summary of the options HowTo is available at https://twiki.cern.ch/twiki/bin/view/Atlas/PandaAthena Options: -h, --help show this help message and exit --version Displays version --split=SPLIT Number of sub-jobs to which a job is split --nFilesPerJob=NFILESPERJOB Number of files on which each sub-job runs --nEventsPerJob=NEVENTSPERJOB Number of events on which each sub-job runs --nEventsPerFile=NEVENTSPERFILE Number of events per file --nGBPerJob=NGBPERJOB Instantiate one sub job per NGBPERJOB GB of input files. --nGBPerJob=MAX sets the size to the default maximum value --site=SITE Site name where jobs are sent (default:AUTO) Many options available Nurcan Ozturk

More options with prun $ prun -h Many options available Usage: prun [options] HowTo is available at https://twiki.cern.ch/twiki/bin/view/Atlas/PandaRun Options: -h, --help show this help message and exit --version Displays version --inDS=INDS Name of an input dataset or dataset container --goodRunListXML=GOODRUNLISTXML Good Run List XML which will be converted to datasets by AMI --goodRunListDataType=GOODRUNDATATYPE specify data type when converting Good Run List XML to datasets, e.g, AOD (default) --goodRunListProdStep=GOODRUNPRODSTEP specify production step when converting Good Run List to datasets, e.g, merge (default) --goodRunListDS=GOODRUNLISTDS A comma-separated list of pattern strings. Datasets which are converted from Good Run List XML will be used when they match with one of the pattern strings. Many options available Nurcan Ozturk

More Information Documentation about PanDA tools together with analysis examples and FAQ (Frequently Asked Questions): https://twiki.cern.ch/twiki/bin/viewauth/Atlas/DAonPanda pathena examples: how to run production transformations, TAG selection, on good run lists, event picking, etc. prun examples: how to run CINT macro, C++ ROOT, python job, pyROOT script, skim RAW/AOD/ESD data, merge ROOT files, etc. Your tutorial page: https://twiki.cern.ch/twiki/bin/viewauth/Atlas/SouthCaucasusComputingTutorial Regular Offline Software Tutorial page: https://twiki.cern.ch/twiki/bin/viewauth/Atlas/RegularComputingTutorial How to get support if you need help: hn-atlas-dist-analysis-help@cern.ch https://groups.cern.ch/group/hn-atlas-dist-analysis-help/default.aspx Nurcan Ozturk