ORNL is managed by UT-Battelle for the US Department of Energy Process Management Adam Simpson OLCF User Support.

Presentation transcript:

ORNL is managed by UT-Battelle for the US Department of Energy Process Management Adam Simpson OLCF User Support

2 Intellectual Property Training August 2014
Process Management
Daemon process
– Perpetually running program
– Runs even when the user is not logged in
Scheduled process
– Process that runs at scheduled time intervals
  – Run process every x hours
  – Run process at xx:xx every day

3 Cron
Don't use Cron at the OLCF
– Jobs are subject to deletion without notice
– The Cron utility is subject to being disabled
– Crontab is not persistent between OS upgrades
– Not robust with respect to system downtimes
– Only available on a small set of resources

4 Daemons: Chained Jobs
Recursively submitted PBS jobs
– The (n+1)th job depends on completion of the nth job
Recommended option for OLCF resources
Size 0 job
– Does not count against allocation
– Runs on a service node
Create the dependency with the qsub flag -W, using one of:
    -W depend=afterok:{jobid}
    -W depend=afternotok:{jobid}
    -W depend=afterany:{jobid}
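The dependency flag above can be assembled programmatically. The following is a minimal sketch, not part of the original slides: the helper name dependency_args and the job ID "12345" are made up for illustration, and the function only builds the argument list without calling qsub.

```python
def dependency_args(job_id, script, condition="afterok"):
    """Return a qsub argument list for a job that waits on job_id.

    Only the three dependency types named on the slide are accepted.
    """
    allowed = ("afterok", "afternotok", "afterany")
    if condition not in allowed:
        raise ValueError("unsupported dependency type: %s" % condition)
    return ["qsub", "-W", "depend=%s:%s" % (condition, job_id), script]


if __name__ == "__main__":
    # "12345" is a hypothetical job ID used only for illustration.
    print(" ".join(dependency_args("12345", "launcher.pbs")))
```

The returned list could be passed to subprocess.call on a system where qsub is installed; that step is left out here so the sketch runs anywhere.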

5 Daemons: daemon.py
    import time
    from datetime import datetime

    # Print a timestamp every 30 seconds until the job is killed
    while True:
        print(str(datetime.now()))
        time.sleep(30)

6 Daemons: launcher.pbs
    #PBS -l walltime=24:00:00
    #PBS -l nodes=0
    #PBS -A PRJ123

    # Queue the next link in the chain, then start the daemon
    qsub -W depend=afterok:$PBS_JOBID launcher.pbs
    python -u $HOME/daemon.py >> $HOME/daemon.out 2>&1

7 Daemons: running
Start
    $ qsub launcher.pbs
Monitor
    $ showq -u $USER
    active jobs
    {jobid} Running
    blocked jobs
    {jobid#2} Hold
Stop
    $ qdel {jobid} {jobid#2}

8 Scheduled Jobs
Instead of Cron, use qsub
    qsub -a [[[[CC]YY]MM]DD]hhmm[.SS]
[CC] – first two digits of the year (century)
[YY] – last two digits of the year
[MM] – month
[DD] – day
hhmm – hour and minute (required)
[.SS] – seconds
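As an illustration of the timestamp layout above, here is a small sketch that formats a Python datetime into the fully specified CCYYMMDDhhmm[.SS] form. The function name qsub_start_time is hypothetical and the date used is arbitrary; the shorter forms that omit the leading fields are not covered.

```python
from datetime import datetime


def qsub_start_time(when, include_seconds=False):
    """Format a datetime as CCYYMMDDhhmm, optionally followed by .SS."""
    fmt = "%Y%m%d%H%M.%S" if include_seconds else "%Y%m%d%H%M"
    return when.strftime(fmt)


if __name__ == "__main__":
    # An arbitrary example date: 15 August 2014 at 10:30.
    stamp = qsub_start_time(datetime(2014, 8, 15, 10, 30))
    print("qsub -a %s launcher.pbs" % stamp)
```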

9 Scheduled Jobs: sched.py
    from datetime import datetime

    # Print the time at which the scheduled job ran
    print(str(datetime.now()))

10 Scheduled jobs: launcher.pbs
    #PBS -l walltime=00:05:00
    #PBS -l nodes=0
    #PBS -A PRJ123

    # Resubmit this script to start at 10:30, then do the work
    qsub -a 1030 launcher.pbs
    python $HOME/sched.py >> $HOME/sched.out 2>&1

11 Scheduled jobs: launcher.pbs
    #PBS -l walltime=00:05:00
    #PBS -l nodes=0
    #PBS -A PRJ123

    # Resubmit this script to start 12 hours from now, then do the work
    DT=$(date -d "+12 hours" +%H%M)
    qsub -a $DT launcher.pbs
    python $HOME/sched.py >> $HOME/sched.out 2>&1
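The date arithmetic in the script above can also be sketched in Python. This is an illustration only: the function name next_start is hypothetical, and it assumes that a bare hhmm value is resolved by qsub to the next occurrence of that time, so intervals of 24 hours or more are written with the full date fields instead.

```python
from datetime import datetime, timedelta


def next_start(now, hours):
    """Return a qsub -a value for a start time `hours` after `now`.

    Under 24 hours a bare hhmm is returned, mirroring
    date -d "+12 hours" +%H%M. Longer intervals (an assumption of this
    sketch) use CCYYMMDDhhmm so the intended day is unambiguous.
    """
    later = now + timedelta(hours=hours)
    if hours < 24:
        return later.strftime("%H%M")
    return later.strftime("%Y%m%d%H%M")


if __name__ == "__main__":
    print("qsub -a %s launcher.pbs" % next_start(datetime.now(), 12))
```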