Basic Grid Projects – Condor (Part I)

Slides:



Advertisements
Similar presentations
Current methods for negotiating firewalls for the Condor ® system Bruce Beckles (University of Cambridge Computing Service) Se-Chang Son (University of.
Advertisements

Dan Bradley Computer Sciences Department University of Wisconsin-Madison Schedd On The Side.
1 Concepts of Condor and Condor-G Guy Warner. 2 Harvesting CPU time Teaching labs. + Researchers Often-idle processors!! Analyses constrained by CPU time!
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
Condor and GridShell How to Execute 1 Million Jobs on the Teragrid Jeffrey P. Gardner - PSC Edward Walker - TACC Miron Livney - U. Wisconsin Todd Tannenbaum.
WP 1 Grid Workload Management Massimo Sgaravatto INFN Padova.
CONDOR CISC 879 Parallel Computation Spring 2003 Preethi Natarajan.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
Condor and the Grid D. Thain, T. Tannenbaum, M. Livny Christopher M. Moretti 23 February 2007.
Workload Management Massimo Sgaravatto INFN Padova.
Design and Evaluation of a Resource Selection Framework for Grid Applications University of Chicago.
First steps implementing a High Throughput workload management system Massimo Sgaravatto INFN Padova
Condor Overview Bill Hoagland. Condor Workload management system for compute-intensive jobs Harnesses collection of dedicated or non-dedicated hardware.
Evaluation of the Globus GRAM Service Massimo Sgaravatto INFN Padova.
Distributed Computing Overviews. Agenda What is distributed computing Why distributed computing Common Architecture Best Practice Case study –Condor –Hadoop.
Jaeyoung Yoon Computer Sciences Department University of Wisconsin-Madison Virtual Machines in Condor.
Derek Wright Computer Sciences Department, UW-Madison Lawrence Berkeley National Labs (LBNL)
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
Cheap cycles from the desktop to the dedicated cluster: combining opportunistic and dedicated scheduling with Condor Derek Wright Computer Sciences Department.
Miron Livny Computer Sciences Department University of Wisconsin-Madison Harnessing the Capacity of Computational.
Alain Roy Computer Sciences Department University of Wisconsin-Madison An Introduction To Condor International.
Grid Computing 7700 Fall 2005 Lecture 17: Resource Management Gabrielle Allen
Distributed Systems Early Examples. Projects NOW – a Network Of Workstations University of California, Berkely Terminated about 1997 after demonstrating.
Parallel Computing The Bad News –Hardware is not getting faster fast enough –Too many architectures –Existing architectures are too specific –Programs.
National Alliance for Medical Image Computing Grid Computing with BatchMake Julien Jomier Kitware Inc.
Track 1: Cluster and Grid Computing NBCR Summer Institute Session 2.2: Cluster and Grid Computing: Case studies Condor introduction August 9, 2006 Nadya.
An Introduction to High-Throughput Computing Monday morning, 9:15am Alain Roy OSG Software Coordinator University of Wisconsin-Madison.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
Installing and Managing a Large Condor Pool Derek Wright Computer Sciences Department University of Wisconsin-Madison
Ashish Patro MinJae Hwang Thanumalayan S. Thawan Kooburat.
Grid Computing I CONDOR.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Hunter of Idle Workstations Miron Livny Marvin Solomon University of Wisconsin-Madison URL:
Rochester Institute of Technology Job Submission Andrew Pangborn & Myles Maxfield 10/19/2015Service Oriented Cyberinfrastructure Lab,
Grid Workload Management Massimo Sgaravatto INFN Padova.
Condor Week 2005Optimizing Workflows on the Grid1 Optimizing workflow execution on the Grid Gaurang Mehta - Based on “Optimizing.
The Owner Share scheduler for a distributed system 2009 International Conference on Parallel Processing Workshops Reporter: 李長霖.
Condor: High-throughput Computing From Clusters to Grid Computing P. Kacsuk – M. Livny MTA SYTAKI – Univ. of Wisconsin-Madison
Derek Wright Computer Sciences Department University of Wisconsin-Madison MPI Scheduling in Condor: An.
1 Condor BirdBath SOAP Interface to Condor Charaka Goonatilake Department of Computer Science University College London
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
Review of Condor,SGE,LSF,PBS
Dan Bradley University of Wisconsin-Madison Condor and DISUN Teams Condor Administrator’s How-to.
A Personal Cloud Controller Yuan Luo School of Informatics and Computing, Indiana University Bloomington, USA PRAGMA 26 Workshop.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 14 February 2006.
Derek Wright Computer Sciences Department University of Wisconsin-Madison Condor and MPI Paradyn/Condor.
Derek Wright Computer Sciences Department University of Wisconsin-Madison New Ways to Fetch Work The new hook infrastructure in Condor.
Pilot Factory using Schedd Glidein Barnett Chiu BNL
Ian D. Alderman Computer Sciences Department University of Wisconsin-Madison Condor Week 2008 End-to-end.
An Introduction to High-Throughput Computing With Condor Tuesday morning, 9am Zach Miller University of Wisconsin-Madison.
Scheduling & Resource Management in Distributed Systems Rajesh Rajamani, May 2001.
Condor on WAN D. Bortolotti - INFN Bologna T. Ferrari - INFN Cnaf A.Ghiselli - INFN Cnaf P.Mazzanti - INFN Bologna F. Prelz - INFN Milano F.Semeria - INFN.
Nicholas Coleman Computer Sciences Department University of Wisconsin-Madison Distributed Policy Management.
Condor Services for the Global Grid: Interoperability between OGSA and Condor Clovis Chapman 1, Paul Wilson 2, Todd Tannenbaum 3, Matthew Farrellee 3,
Grid Workload Management (WP 1) Massimo Sgaravatto INFN Padova.
UCS D OSG Summer School 2011 Overlay systems OSG Summer School An introduction to Overlay systems Also known as Pilot systems by Igor Sfiligoi University.
Workload Management Workpackage
Condor A New PACI Partner Opportunity Miron Livny
Quick Architecture Overview INFN HTCondor Workshop Oct 2016
Dynamic Deployment of VO Specific Condor Scheduler using GT4
Operating a glideinWMS frontend by Igor Sfiligoi (UCSD)
Monitoring HTCondor with Ganglia
A Distributed Policy Scenario
The Scheduling Strategy and Experience of IHEP HTCondor Cluster
Accounting, Group Quotas, and User Priorities
HTCondor Training Florentia Protopsalti IT-CM-IS 1/16/2019.
Condor: Firewall Mirroring
Condor-G Making Condor Grid Enabled
GLOW A Campus Grid within OSG
Presentation transcript:

Basic Grid Projects – Condor (Part I) Sathish Vadhiyar Sources/Credits: Condor Project web pages

Condor Motivation Most of the cycles (70%) of workstation pools are underutilized High throughput computing – Large amounts of processing capacity over long periods of time In contrast to High Performance Computing Support system with distributed ownerships Owners specify access policies

Condor Features Specialized workload management system Provides a job queueing mechanism, scheduling policy, resource monitoring, and resource management Can effectively harness wasted CPU power from otherwise idle desktop workstations Can checkpoint and migrate a job to a different machine

Condor Architecture – daemons / processes master Startd Represents a machine to the Condor pool Implement owner’s access control policies Starts, stops, suspends jobs Runs on executing machines Starter Spawned by startd for a job Coordinates with the job

Condor Architecture - daemons Schedd Represents jobs to the condor pool Maintain persistent queues of user’s requests Runs on submit machines Shadow Similar to starter functionality, but runs on submit machine Job specific Manager Collector Collects machine and resource information from all other daemons Answers queries Negotiator Retrieves information from collector Does match making

Job Submission Steps

Idle capacity utilization When owner returns, Condor checkpoints and migrates jobs

classads Language used in Condor For describing jobs, workstations, and other resources Mapping from attribute names to expressions Used by condor central manager to decide on job scheduling

Steps Step 1 – entities express their characteristics through classads Also expresses constraints for other entities to follow Also specifies ranks (preferences) to break ties in case there are more than one entities following the constraints. Step 2 – matchmaker matches different classads Step 3 – match maker notifies matched entities Step 4 – matched entities establish allocation

ClassAds

More on ClassAds Resource owners and customers can dynamically define own models – suitable for distributed setting Matching and claiming as 2 distinct operations 5 components of matchmaking protocol classAd specification advertising protocol matchmaking algorithm matchmaking protocol claiming protocol Constraints, i.e. queries, may be expressed as attributes of classAd classAd definition – mapping from attribute names to expressions

Examples

Examples