Alain Roy Computer Sciences Department University of Wisconsin-Madison An Introduction To Condor International.

Slides:



Advertisements
Similar presentations
Jaime Frey Computer Sciences Department University of Wisconsin-Madison OGF 19 Condor Software Forum Routing.
Advertisements

Dan Bradley Computer Sciences Department University of Wisconsin-Madison Schedd On The Side.
1 Concepts of Condor and Condor-G Guy Warner. 2 Harvesting CPU time Teaching labs. + Researchers Often-idle processors!! Analyses constrained by CPU time!
Setting up of condor scheduler on computing cluster Raman Sehgal NPD-BARC.
More HTCondor 2014 OSG User School, Monday, Lecture 2 Greg Thain University of Wisconsin-Madison.
Condor and GridShell How to Execute 1 Million Jobs on the Teragrid Jeffrey P. Gardner - PSC Edward Walker - TACC Miron Livney - U. Wisconsin Todd Tannenbaum.
Matchmaking in the Condor System Rajesh Raman Computer Sciences Department University of Wisconsin-Madison
Dr. David Wallom Use of Condor in our Campus Grid and the University September 2004.
Alain Roy Computer Sciences Department University of Wisconsin-Madison An Introduction To Condor International.
Condor Project Computer Sciences Department University of Wisconsin-Madison A Scientist’s Introduction.
Distributed Computing Overviews. Agenda What is distributed computing Why distributed computing Common Architecture Best Practice Case study –Condor –Hadoop.
Jaeyoung Yoon Computer Sciences Department University of Wisconsin-Madison Virtual Machines in Condor.
Introduction to Condor DMD/DFS J.Knudstrup December 2005.
Cheap cycles from the desktop to the dedicated cluster: combining opportunistic and dedicated scheduling with Condor Derek Wright Computer Sciences Department.
COMP3019 Coursework: Introduction to M-grid Steve Crouch School of Electronics and Computer Science.
An Introduction to High-Throughput Computing Rob Quick OSG Operations Officer Indiana University Content Contributed by the University of Wisconsin Condor.
Zach Miller Computer Sciences Department University of Wisconsin-Madison What’s New in Condor.
Miron Livny Computer Sciences Department University of Wisconsin-Madison Harnessing the Capacity of Computational.
April Open Science Grid Building a Campus Grid Mats Rynge – Renaissance Computing Institute University of North Carolina, Chapel.
Track 1: Cluster and Grid Computing NBCR Summer Institute Session 2.2: Cluster and Grid Computing: Case studies Condor introduction August 9, 2006 Nadya.
April Open Science Grid Campus Condor Pools Mats Rynge – Renaissance Computing Institute University of North Carolina, Chapel Hill.
An Introduction to High-Throughput Computing Rob Quick OSG Operations Officer Indiana University Some Content Contributed by the University of Wisconsin.
An Introduction to High-Throughput Computing Monday morning, 9:15am Alain Roy OSG Software Coordinator University of Wisconsin-Madison.
Large, Fast, and Out of Control: Tuning Condor for Film Production Jason A. Stowe Software Engineer Lead - Condor CORE Feature Animation.
Condor Tugba Taskaya-Temizel 6 March What is Condor Technology? Condor is a high-throughput distributed batch computing system that provides facilities.
Installing and Managing a Large Condor Pool Derek Wright Computer Sciences Department University of Wisconsin-Madison
1 HawkEye A Monitoring and Management Tool for Distributed Systems Todd Tannenbaum Department of Computer Sciences University of.
Grid Computing I CONDOR.
GridShell + Condor How to Execute 1 Million Jobs on the Teragrid Jeffrey P. Gardner Edward Walker Miron Livney Todd Tannenbaum The Condor Development Team.
Intermediate Condor Rob Quick Open Science Grid HTC - Indiana University.
1 The Roadmap to New Releases Todd Tannenbaum Department of Computer Sciences University of Wisconsin-Madison
Hunter of Idle Workstations Miron Livny Marvin Solomon University of Wisconsin-Madison URL:
Condor: High-throughput Computing From Clusters to Grid Computing P. Kacsuk – M. Livny MTA SYTAKI – Univ. of Wisconsin-Madison
Alain Roy Computer Sciences Department University of Wisconsin-Madison ClassAds: Present and Future.
Douglas Thain, John Bent Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau, Miron Livny Computer Sciences Department, UW-Madison Gathering at the Well: Creating.
The Roadmap to New Releases Derek Wright Computer Sciences Department University of Wisconsin-Madison
Review of Condor,SGE,LSF,PBS
Dan Bradley University of Wisconsin-Madison Condor and DISUN Teams Condor Administrator’s How-to.
Working with Condor. Links: Condor’s homepage:  Condor manual (for the version currently.
Derek Wright Computer Sciences Department University of Wisconsin-Madison Condor and MPI Paradyn/Condor.
Proposal for a IS schema Massimo Sgaravatto INFN Padova.
Pilot Factory using Schedd Glidein Barnett Chiu BNL
Nick LeRoy Computer Sciences Department University of Wisconsin-Madison Hawkeye.
Greg Thain Computer Sciences Department University of Wisconsin-Madison Configuring Quill Condor Week.
An Introduction to High-Throughput Computing With Condor Tuesday morning, 9am Zach Miller University of Wisconsin-Madison.
Scheduling & Resource Management in Distributed Systems Rajesh Rajamani, May 2001.
Weekly Work Dates:2010 8/20~8/25 Subject:Condor C.Y Hsieh.
Nicholas Coleman Computer Sciences Department University of Wisconsin-Madison Distributed Policy Management.
Douglas Thain, John Bent Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau, Miron Livny Computer Sciences Department, UW-Madison Gathering at the Well: Creating.
April 25, 2006Parag Mhashilkar, Fermilab1 Resource Selection in OSG & SAM-On-The-Fly Parag Mhashilkar Fermi National Accelerator Laboratory Condor Week.
An Introduction to High-Throughput Computing Monday morning, 9:15am Alain Roy OSG Software Coordinator University of Wisconsin-Madison.
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison Condor NT Condor ported.
Condor on Dedicated Clusters Peter Couvares and Derek Wright Computer Sciences Department University of Wisconsin-Madison
UCS D OSG Summer School 2011 Overlay systems OSG Summer School An introduction to Overlay systems Also known as Pilot systems by Igor Sfiligoi University.
Profiling Applications to Choose the Right Computing Infrastructure plus Batch Management with HTCondor Kyle Gross – Operations Support.
Intermediate Condor Monday morning, 10:45am Alain Roy OSG Software Coordinator University of Wisconsin-Madison.
Condor Week May 2012No user requirements1 Condor Week 2012 An argument for moving the requirements out of user hands - The CMS experience presented.
Quick Architecture Overview INFN HTCondor Workshop Oct 2016
Condor Introduction and Architecture for Vanilla Jobs CERN Feb
Monitoring HTCondor with Ganglia
Building Grids with Condor
A Distributed Policy Scenario
Accounting, Group Quotas, and User Priorities
Job Matching, Handling, and Other HTCondor Features
Basic Grid Projects – Condor (Part I)
Genre1: Condor Grid: CSECCR
HTCondor Training Florentia Protopsalti IT-CM-IS 1/16/2019.
Condor Administration in the Open Science Grid
Condor-G Making Condor Grid Enabled
GLOW A Campus Grid within OSG
Presentation transcript:

Alain Roy Computer Sciences Department University of Wisconsin-Madison An Introduction To Condor International Summer School on Grid Computing 2007

2 …And matches them Condor Takes Computers… Dedicated Clusters Desktop Computers …And jobs I need a Mac! I need a Linux box with 2GB RAM! Matchmaker

3 Matchmaking › Matchmaking is fundamental to Condor › Matchmaking is two-way  Job describes what it requires: I need Linux && 2 GB of RAM  Machine describes what it requires: I will only run jobs from the Physics department › Matchmaking allows preferences  I need Linux, and I prefer machines with more memory but will run on any machine you provide me

4 Why Two-way Matching? › Condor conceptually divides people into three groups:  Job submitters  Machine owners  Pool (cluster) administrator › All three of these groups have preferences } May or may not be the same people

5 Machine owner preferences › I prefer jobs from the physics group › I will only run jobs between 8pm and 4am › I will only run certain types of jobs › Jobs can be preempted if something better comes along (or not) ‏ › Remember BNL’s “START” expression?  It expressed policies like this  People use complex policies to control resource usage—Condor supports them

6 System Admin Prefs › When can jobs preempt other jobs? › Which users have higher priority?

7 ClassAds › ClassAds state facts  My job’s executable is analysis.exe  My machine’s load average is 5.6 › ClassAds state preferences  I require a computer with Linux

8 Example: MyType = "Job" TargetType = "Machine" ClusterId = 1377 Owner = "roy“ Cmd = “analysis.exe“ Requirements = (Arch == "INTEL") && (OpSys == "LINUX") && (Disk >= DiskUsage) && ((Memory * 1024)>=ImageSize)‏ … ClassAds ClassAds are: – semi-structured – user-extensible – schema-free – Attribute = Expression String Number Boolean

9 Schema-free ClassAds › Condor imposes some schema  Owner is a string, ClusterID is a number… › But users can extend it however they like, for jobs or machines  AnalysisJobType = “simulation”  HasJava_1_4 = TRUE  ShoeLength = 7 › Matchmaking can use these attributes  Requirements = OpSys == "LINUX" && HasJava_1_4 == TRUE

10 Submitting jobs › Users submit jobs from a computer  Jobs described as ClassAds  Each submission computer has a queue  Queues are not centralized  Submission computer watches over queue  Can have multiple submission computers  Submission handled by condor_schedd Condor_schedd Queue

11 Advertising computers › Machine owners describe computers  Configuration file extends ClassAd  ClassAd has dynamic features Load Average Free Memory …  ClassAds are sent to Matchmaker Matchmaker (Collector) ‏ ClassAd Type = “Machine” Requirements = “…”

12 Matchmaking › Negotiator collects list of computers › Negotiator contacts each schedd  What jobs do you have to run? › Negotiator compares each job to each computer  Evaluate requirements of job & machine  Evaluate in context of both ClassAds  If both evaluate to true, there is a match › Upon match, schedd contacts execution computer

13 Matchmaking Service Job queue service Information service Matchmaking diagram condor_schedd Queue Matchmaker Collector Negotiator 1 2 3

14 Manages Remote Job Manages Machine Running a Job condor_schedd Queue Matchmaker condor_collector condor_negotiator condor_startd condor_submit Manages Local Job condor_shadow condor_starter Job