Condor Project Computer Sciences Department University of Wisconsin-Madison Condor Job Router.

Slides:



Advertisements
Similar presentations
Jaime Frey Computer Sciences Department University of Wisconsin-Madison OGF 19 Condor Software Forum Routing.
Advertisements

Community Grids Lab1 CICC Project Meeting VOTable Developed VotableToSpreadsheet Service which accepts VOTable file location as an input, converts to Excel.
Dan Bradley Computer Sciences Department University of Wisconsin-Madison Schedd On The Side.
Greg Thain Computer Sciences Department University of Wisconsin-Madison Condor Parallel Universe.
Greg Quinn Computer Sciences Department University of Wisconsin-Madison Condor on Windows.
Condor Project Computer Sciences Department University of Wisconsin-Madison Stork An Introduction Condor Week 2006 Milan.
Heterogeneous Grid Design and Implementation Thesis Presentation By Jeffrey Wells State University New York Institute of Technology May 7, 2008 CSC 599.
Post ASP 2012: Protein Docking at the IU School of Medicine.
High Throughput Computing with Condor at Notre Dame Douglas Thain 30 April 2009.
Jaeyoung Yoon Computer Sciences Department University of Wisconsin-Madison Virtual Machine Universe in.
Zach Miller Condor Project Computer Sciences Department University of Wisconsin-Madison Flexible Data Placement Mechanisms in Condor.
Minerva Infrastructure Meeting – October 04, 2011.
Condor Project Computer Sciences Department University of Wisconsin-Madison What’s new in Condor? What’s coming up? Condor Week 2009.
Introduction to information theory
Condor Project Computer Sciences Department University of Wisconsin-Madison Virtual Machines in Condor.
Zach Miller Computer Sciences Department University of Wisconsin-Madison What’s New in Condor.
HTPC - High Throughput Parallel Computing (on the OSG) Dan Fraser, UChicago OSG Production Coordinator Horst Severini, OU (Greg Thain, Uwisc) OU Supercomputing.
High Throughput Parallel Computing (HTPC) Dan Fraser, UChicago Greg Thain, Uwisc.
Workflow Management in Condor Gökay Gökçay. DAGMan Meta-Scheduler The Directed Acyclic Graph Manager (DAGMan) is a meta-scheduler for Condor jobs. DAGMan.
Rsv-control Marco Mambelli – Site Coordination meeting October 1, 2009.
Condor Tugba Taskaya-Temizel 6 March What is Condor Technology? Condor is a high-throughput distributed batch computing system that provides facilities.
Condor Project Computer Sciences Department University of Wisconsin-Madison Advanced Condor mechanisms CERN Feb
1 Evolution of OSG to support virtualization and multi-core applications (Perspective of a Condor Guy) Dan Bradley University of Wisconsin Workshop on.
Peter Keller Computer Sciences Department University of Wisconsin-Madison Quill Tutorial Condor Week.
High Throughput Parallel Computing (HTPC) Dan Fraser, UChicago Greg Thain, UWisc Condor Week April 13, 2010.
Parallel Optimization Tools for High Performance Design of Integrated Circuits WISCAD VLSI Design Automation Lab Azadeh Davoodi.
3-2.1 Topics Grid Computing Meta-schedulers –Condor-G –Gridway Distributed Resource Management Application (DRMAA) © 2010 B. Wilkinson/Clayton Ferner.
Condor Project Computer Sciences Department University of Wisconsin-Madison A Scientist’s Introduction.
Condor Project Computer Sciences Department University of Wisconsin-Madison Condor-G Operations.
Grid job submission using HTCondor Andrew Lahiff.
Grid Compute Resources and Job Management. 2 Local Resource Managers (LRM)‏ Compute resources have a local resource manager (LRM) that controls:  Who.
Dealing with real resources Wednesday Afternoon, 3:00 pm Derek Weitzel OSG Campus Grids University of Nebraska.
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison Condor RoadMap.
The Roadmap to New Releases Derek Wright Computer Sciences Department University of Wisconsin-Madison
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison Quill / Quill++ Tutorial.
Condor-G A Quick Introduction Alan De Smet Condor Project University of Wisconsin - Madison.
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
Dan Bradley University of Wisconsin-Madison Condor and DISUN Teams Condor Administrator’s How-to.
Job Management DIRAC Project. Overview  DIRAC JDL  DIRAC Commands  Tutorial Exercises  What do you have learned? KEK 10/2012DIRAC Tutorial.
Condor Project Computer Sciences Department University of Wisconsin-Madison Grids and Condor Barcelona,
Derek Wright Computer Sciences Department University of Wisconsin-Madison Condor and MPI Paradyn/Condor.
Peter F. Couvares Computer Sciences Department University of Wisconsin-Madison Condor DAGMan: Managing Job.
Condor Project Computer Sciences Department University of Wisconsin-Madison Condor and DAGMan Barcelona,
Grid Compute Resources and Job Management. 2 Grid middleware - “glues” all pieces together Offers services that couple users with remote resources through.
Jaime Frey Computer Sciences Department University of Wisconsin-Madison What’s New in Condor-G.
Matthew Farrellee Computer Sciences Department University of Wisconsin-Madison Condor and Web Services.
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison Condor NT Condor ported.
Condor Project Computer Sciences Department University of Wisconsin-Madison Running Interpreted Jobs.
HTCondor-CE for USATLAS Bob Ball AGLT2/University of Michigan OSG AHM March, 2015 Bob Ball AGLT2/University of Michigan OSG AHM March, 2015.
Condor Project Computer Sciences Department University of Wisconsin-Madison Using New Features in Condor 7.2.
HTCondor’s Grid Universe Jaime Frey Center for High Throughput Computing Department of Computer Sciences University of Wisconsin-Madison.
1 Getting Started with OSG Connect ~ an Interactive Tutorial ~ Emelie Harstad, Mats Rynge, Lincoln Bryant, Suchandra Thapa, Balamurugan Desinghu, David.
Gabi Kliot Computer Sciences Department Technion – Israel Institute of Technology Adding High Availability to Condor Central Manager Adding High Availability.
An Introduction to Using
1 Lecture 8 Shell Programming – Control Constructs COP 3353 Introduction to UNIX.
Job submission overview Marco Mambelli – August OSG Summer Workshop TTU - Lubbock, TX THE UNIVERSITY OF CHICAGO.
Condor Project Computer Sciences Department University of Wisconsin-Madison Condor-G: Condor and Grid Computing.
Parag Mhashilkar (Fermi National Accelerator Laboratory)
Greg Thain Computer Sciences Department University of Wisconsin-Madison HTPC on the OSG.
Primer for Site Debugging
Mardi Gras Distributed Applications Conference Baton Rouge, LA
Adding High Availability to Condor Central Manager Tutorial
Job Management with DATA
Condor: Job Management
HTCondor Command Line Monitoring Tool
Globus Job Management. Globus Job Management Globus Job Management A: GRAM B: Globus Job Commands C: Laboratory: globusrun.
The Condor JobRouter.
gLite Job Management Christos Theodosiou
PU. Setting up parallel universe in your pool and when (not
Presentation transcript:

Condor Project Computer Sciences Department University of Wisconsin-Madison Condor Job Router

2 Dan, Condor Week 2008 The Job Router A Flexible Job Transformer › Acts upon jobs in queue › Policy controls when:  (jobs currently routed to site X) < max  (idle jobs routed to site X) < max  (rate of recent failure at site X) < max › And how to:  Change attribute values (e.g. Universe)  Insert new attributes (e.g. GridResource)  Other arbitrary actions in hooks

3 Dan, Condor Week 2008 Example: sending excess vanilla jobs to a grid site Universe = “vanilla” Executable = “sim” Arguments = “seed=345” Output = “stdout.345” Error = “stderr.345” ShouldTransferFiles = True WhenToTransferOutput = “ON_EXIT” Universe = “grid” GridType = “gt2” GridResource = \ “cmsgrid01.hep.wisc.edu/jobmanager-condor” Executable = “sim” Arguments = “seed=345” Output = “stdout” Error = “stderr” ShouldTransferFiles = True WhenToTransferOutput = “ON_EXIT” JobRouter Routing Table: Site 1 … Site 2 … final status routed (grid) joboriginal (vanilla) job

4 Dan, Condor Week 2008 Using the Job Router › Job Router is a daemon › Disabled by default › To use…  Modify the condor_config.local file  Run condor_reconfig and condor_on  Submit jobs that want the Job Router

# Enable the Job Router DAEMON_LIST = $(DAEMON_LIST) JOB_ROUTER # These settings become the default settings for all routes JOB_ROUTER_DEFAULTS = \ [ \ requirements=target.WantJobRouter is True; \ MaxIdleJobs = 50; \ MaxJobs = 200; \ /* now modify routed job attributes */ \ delete_WantJobRouter = true; \ set_x509userproxy = "/home/jfrey/epikh.proxy"; \ ] # Now we define each of the routes to send jobs on JOB_ROUTER_ENTRIES = \ [ GridResource = "cream cream/services/CREAM2 condor stress3.chtc.wisc.edu"; \ name = "CHTC"; ] # How often the job router should check for jobs JOB_ROUTER_POLLING_PERIOD = 10 Config File Settings 5

Sample Job › You can use any vanilla job, but here’s a simple example: #!/bin/sh /bin/date echo echo whoami /usr/bin/whoami echo echo hostname /bin/hostname sleep 300 6

Submit File › Note the added line universe = vanilla executable = job.sh output = out.$(cluster).$(process) error = err.$(cluster).$(process) log = job.log should_transfer_files = YES when_to_transfer_output = ON_EXIT +WantJobRouter = time() - EnteredCurrentStatus > 60 queue 50 7