Presentation is loading. Please wait.

Presentation is loading. Please wait.

Panda-based Software Installation

Similar presentations


Presentation on theme: "Panda-based Software Installation"— Presentation transcript:

1 Panda-based Software Installation
Tadashi Maeno (BNL)

2 Installation using Panda (1/3)
server ProdSys job site B pull https Installation job job pilot https submit site A $OSG_APP install pilot Operator Worker Nodes

3 Installation using Panda (2/3)
Someone submits installation jobs to Panda through usual HTTP I/F The same I/F is used for production/analysis as well Authentication Scheduling (priority, retry, …) Pilots retrieve jobs Each pilot knows which type of jobs it should retrieve Production pilots run ATLAS TRF Installation pilots run Installation TRF

4 Installation using Panda (3/3)
Installation TRF Downloads pacman-latest.tgz from Setup pacman Scans destination dir to find setup.sh for Athena runtime Installs Atlas releases and/or Production caches when setup.sh is missing Runs Kit-Validation Advantages Automatization Scalability Panda infrastructure like monitoring

5 How to submit jobs import userinterface.Client as Client
from taskbuffer.JobSpec import JobSpec jobList = [] for site in [‘SLACXRD’,’AGLT2’]: job = JobSpec() job.transformation = ‘…/installAtlasSW‘ job.computingSite = site job.jobParameters="-s p slc3+gcc -c AtlasProduction_12_0_6_3_i686_slc3_gcc323_opt, AtlasProduction_12_0_6_4_i686_slc3_gcc323_opt“ jobList.append(job) Client.submitJobs(jobList) List of sites TRF site Release + Package + Caches submit

6 Remaining Issues The installation pilot needs write-permission on $OSG_APP “Normal” pilots are mapped to usatlas1 because schedulers are running with the production role Special scheduler running with software role to map pilots to usatlas2? gLExec? Integration with schedconfig DB schedconfig contains what releases are available at each site An intelligent client is possible Get a list of sites where a release is missing Submit a bunch of jobs to install the release Update schedconfig when installation is succeeded Who has responsibility on operations?

7 Test at SLAC Tried 13.0.25 as it is unused for production
Required modifications to SLAC Outbound HTTP connection BU : to download pacman CNAF : to download KV cache Gave temporary write-permission on $OSG_APP/ to Nurcan’s DN Submitted an job from BNL Job= Installation succeeded and KV passed log

8 Conclusions Release installation using Panda is ready
Tested to install to SLAC successfully A few issues Permission of installation pilot Operator Integration with schedconfig


Download ppt "Panda-based Software Installation"

Similar presentations


Ads by Google