Job Submission Condor, Globus, Java CoG Kit Young Suk Moon.

Slides:



Advertisements
Similar presentations
CSF4 Meta-Scheduler Tutorial 1st PRAGMA Institute Zhaohui Ding or
Advertisements

Generic MPI Job Submission by the P-GRADE Grid Portal Zoltán Farkas MTA SZTAKI.
Legacy code support for commercial production Grids G.Terstyanszky, T. Kiss, T. Delaitre, S. Winter School of Informatics, University.
C. Grimme, A. Papaspyrou Scheduling in C3-Grid AstroGrid-D Workshop Project: C3-Grid Collaborative Climate Community Data and Processing Grid Scheduling.
Grid Resource Allocation Management (GRAM) GRAM provides the user to access the grid in order to run, terminate and monitor jobs remotely. The job request.
Setting up of condor scheduler on computing cluster Raman Sehgal NPD-BARC.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
GRID workload management system and CMS fall production Massimo Sgaravatto INFN Padova.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Tutorial on Distributed High Performance Computing 14:30 – 19:00 (2:30 pm – 7:00 pm) Wednesday November 17, 2010 Jornadas Chilenas de Computación 2010.
6a.1 Globus Toolkit Execution Management. Data Management Security Common Runtime Execution Management Information Services Web Services Components Non-WS.
GRID Workload Management System Massimo Sgaravatto INFN Padova.
Workload Management Massimo Sgaravatto INFN Padova.
First steps implementing a High Throughput workload management system Massimo Sgaravatto INFN Padova
Evaluation of the Globus GRAM Service Massimo Sgaravatto INFN Padova.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
Overview of TeraGrid Resources and Usage Selim Kalayci Florida International University 07/14/2009 Note: Slides are compiled from various TeraGrid Documentations.
Grid Computing, B. Wilkinson, 20046d.1 Schedulers and Resource Brokers.
6d.1 Schedulers and Resource Brokers ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson.
Grid Computing 7700 Fall 2005 Lecture 17: Resource Management Gabrielle Allen
Grid Computing, B. Wilkinson, 20046d.1 Schedulers and Resource Brokers.
Grid Toolkits Globus, Condor, BOINC, Xgrid Young Suk Moon.
- 1 - Grid Programming Environment (GPE) Ralf Ratering Intel Parallel and Distributed Solutions Division (PDSD)
Vladimir Litvin, Harvey Newman Caltech CMS Scott Koranda, Bruce Loftis, John Towns NCSA Miron Livny, Peter Couvares, Todd Tannenbaum, Jamie Frey Wisconsin.
Track 1: Cluster and Grid Computing NBCR Summer Institute Session 2.2: Cluster and Grid Computing: Case studies Condor introduction August 9, 2006 Nadya.
A Web 2.0 Portal for Teragrid Fugang Wang Gregor von Laszewski May 2009.
OPEN GRID SERVICES ARCHITECTURE AND GLOBUS TOOLKIT 4
Condor Tugba Taskaya-Temizel 6 March What is Condor Technology? Condor is a high-throughput distributed batch computing system that provides facilities.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Grid Computing I CONDOR.
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
COMP3019 Coursework: Introduction to GridSAM Steve Crouch School of Electronics and Computer Science.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Condor Birdbath Web Service interface to Condor
GRAM5 - A sustainable, scalable, reliable GRAM service Stuart Martin - UC/ANL.
3-2.1 Topics Grid Computing Meta-schedulers –Condor-G –Gridway Distributed Resource Management Application (DRMAA) © 2010 B. Wilkinson/Clayton Ferner.
Part 6: (Local) Condor A: What is Condor? B: Using (Local) Condor C: Laboratory: Condor.
A Geographically-Distributed, Assignment-Structured Undergraduate Grid Computing Course Mark A. Holliday, Barry Wilkinson, Jeffrey House, Samir Daoud,
Rochester Institute of Technology Job Submission Andrew Pangborn & Myles Maxfield 10/19/2015Service Oriented Cyberinfrastructure Lab,
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
Experiment Management with Microsoft Project Gregor von Laszewski Leor E. Dilmanian Acknowledgement: NSF NMI, CMMI, DDDAS
Condor: High-throughput Computing From Clusters to Grid Computing P. Kacsuk – M. Livny MTA SYTAKI – Univ. of Wisconsin-Madison
Grid Compute Resources and Job Management. 2 Local Resource Managers (LRM)‏ Compute resources have a local resource manager (LRM) that controls:  Who.
Report from USA Massimo Sgaravatto INFN Padova. Introduction Workload management system for productions Monte Carlo productions, data reconstructions.
Grid Security: Authentication Most Grids rely on a Public Key Infrastructure system for issuing credentials. Users are issued long term public and private.
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Review of Condor,SGE,LSF,PBS
Grid Compute Resources and Job Management. 2 How do we access the grid ?  Command line with tools that you'll use  Specialised applications Ex: Write.
Job Submission with Globus, Condor, and Condor-G Selim Kalayci Florida International University 07/21/2009 Note: Slides are compiled from various TeraGrid.
Grid Interoperability Update on GridFTP tests Gregor von Laszewski
Rochester Institute of Technology 1 Job Submission Andrew Pangborn & Myles Maxfield 01/19/09Service Oriented Cyberinfrastructure Lab,
Grid Compute Resources and Job Management. 2 Grid middleware - “glues” all pieces together Offers services that couple users with remote resources through.
The Gateway Computational Web Portal Marlon Pierce Indiana University March 15, 2002.
Grid Workload Management (WP 1) Massimo Sgaravatto INFN Padova.
CSF. © Platform Computing Inc CSF – Community Scheduler Framework Not a Platform product Contributed enhancement to The Globus Toolkit Standards.
HTCondor’s Grid Universe Jaime Frey Center for High Throughput Computing Department of Computer Sciences University of Wisconsin-Madison.
Workload Management Workpackage
Duncan MacMichael & Galen Deal CSS 534 – Autumn 2016
Peter Kacsuk – Sipos Gergely MTA SZTAKI
Condor: Job Management
Standard Portlet Architecture Supports Reusable Components
Wide Area Workload Management Work Package DATAGRID project
OGCE Architecture: Portlets and Services for Science Gateways
Condor-G Making Condor Grid Enabled
Condor-G: An Update.
Presentation transcript:

Job Submission Condor, Globus, Java CoG Kit Young Suk Moon

Workflow The computerized facilitation or automation of a business process, in whole or part. W i T = ( G r T, G s T, Q u T, W m T ) – W i = Workflow instantiation – G r = Grid resources – G s = Grid services – Q u = Quality expectations from the user – W m = Workflow model. von Laszewski, G. (2006). Java CoG Kit Workflow Concepts. Journal of Grid Computing.

Condor High throughput computing system – Job management – Scheduling policy – Priority scheme – Resource monitoring / management Checkpointing – Fault tolerance – Migration 3http://grid.rit.edu

Condor Kernel architecture Thain, D., Tannenbaum, T., & Livny, M. (2005). Distributed computing in practice: the Condor experience. Concurrency - Practice and Experience, 17(2-4), http://grid.rit.edu

Condor Job submission in a condor pool Thain, D., Tannenbaum, T., & Livny, M. (2005). Distributed computing in practice: the Condor experience. Concurrency - Practice and Experience, 17(2-4), http://grid.rit.edu

Condor Gateway flocking Thain, D., Tannenbaum, T., & Livny, M. (2005). Distributed computing in practice: the Condor experience. Concurrency - Practice and Experience, 17(2-4), http://grid.rit.edu

Condor-G Interacting with GRAM Disadvantage – the agent does not know which resource is available in the foreign system Thain, D., Tannenbaum, T., & Livny, M. (2005). Distributed computing in practice: the Condor experience. Concurrency - Practice and Experience, 17(2-4), http://grid.rit.edu

Condor – Gliding In Step 1 – submit Condor servers as jobs Thain, D., Tannenbaum, T., & Livny, M. (2005). Distributed computing in practice: the Condor experience. Concurrency - Practice and Experience, 17(2-4), http://grid.rit.edu

Condor – Gliding In Step 2 – submitted servers form an ad-hoc Condor pool with the matchmaker Thain, D., Tannenbaum, T., & Livny, M. (2005). Distributed computing in practice: the Condor experience. Concurrency - Practice and Experience, 17(2-4), http://grid.rit.edu

Condor – Gliding In Step 3 – use the Condor pool as normal Thain, D., Tannenbaum, T., & Livny, M. (2005). Distributed computing in practice: the Condor experience. Concurrency - Practice and Experience, 17(2-4), http://grid.rit.edu

Condor - Problem Solvers Master-Worker – Master process assigns jobs to worker processes – Suitable for a large problem with no dependency DAGMan (The Directed Acyclic Graph Manager) – For multiple jobs with dependancy Thain, D., Tannenbaum, T., & Livny, M. (2005). Distributed computing in practice: the Condor experience. Concurrency - Practice and Experience, 17(2-4), A Directed Acyclic Graph 11

Condor Universes Define execution environments – Standard (default) : checkpointing & remote system calls – Vanila : run programs which cannot be re-linked – PVM : run Parallel Virtual Machine programs – MPI : run MPICH programs – Globus : use a Globus system – Java : run on JVM – Scheduler : executes jobs without waiting to be matched to resources. Used for DAGMan 12http://grid.rit.edu

Condor Example submit description file – submit.hello Executable = hello Universe = standard Output = hello.out Log = hello.log Queue To submit the job – > condor_submit submit.hello 13http://grid.rit.edu

Globus GRAM Grid Resource Allocation and Management Program execution requirements – State – User executables – Staging of input and output – Streaming output – Control – Scheduler – Monitoring 14http://grid.rit.edu

Globus GRAM State transition diagram for GRAM jobs Foster, I. (2005). A Globus Primer: Describing Globus Toolkit 4. 15http://grid.rit.edu

Globus GRAM Client APIs – Four portTypes: Managed Job Factory, Managed Job, Delegation Factory, Delegation Foster, I. (2005). A Globus Primer: Describing Globus Toolkit 4 operations interfaces resources 16http://grid.rit.edu

Globus GRAM GT4 Container – Hosts Web Services Foster, I. (2005). A Globus Primer: Describing Globus Toolkit 4 17http://grid.rit.edu

Globus GRAM GRAM implementation structure Foster, I. (2005). A Globus Primer: Describing Globus Toolkit 4 18http://grid.rit.edu

Globus GRAM Job submission % globusrun-ws -submit -job-command /bin/program arg Submitting job…Done Job ID: uuid:c51fe35a-4fa3-11d9-9cfc Termination time: 12/17/ :47 GMT Current job state: Active Current job state: CleanUp Current job state: Done Destroying job...Done. Foster, I. (2005). A Globus Primer: Describing Globus Toolkit 4 19http://grid.rit.edu

Globus GRAM Job submission with job description files % globusrun-ws -submit -job-description-file touch.xml -factory-epr-file gram.epr #touch.xml /bin/program arg #gram.epr PBS Foster, I. (2005). A Globus Primer: Describing Globus Toolkit 4 20http://grid.rit.edu

Java CoG Kit Java Commodity Grid Toolkit Maps Grid functionality into a commodity framework (Java) Extends the use of Globus Purpose – To use Grid services more efficiently by using / developing Java classes 21http://grid.rit.edu

Java CoG Kit Workflow management layer – It supports multiple Grid middleware Grid Middleware + CoG Kit providers GT2, SSH, GT4 WS-RF, WebDAV Grid Middleware + CoG Kit providers GT2, SSH, GT4 WS-RF, WebDAV Job submission file transfer & authentication Workflow Abstractions, Queue/Set Abstractions Job submission file transfer & authentication Workflow Abstractions, Queue/Set Abstractions Task & workflow management Karajan (GridAnt, ant) Task & workflow management Karajan (GridAnt, ant) Workflow monitoring Workflow Portlet & visualizer/Editor Workflow monitoring Workflow Portlet & visualizer/Editor Application Nano Materials, Bio-Informatics, etc. Application Nano Materials, Bio-Informatics, etc. 22http://grid.rit.edu

Java CoG Kit - Job Submission (GUI) 23http://grid.rit.edu

Java CoG Kit – Command Line Job execution – globusrun File transfer – globus-url-copy GASS (Global Access to Secondary Storage) – globus-gass-server – globus-gass-server-shutdown 24http://grid.rit.edu

Java CoG Kit - Shell

References Thain, D., Tannenbaum, T., & Livny, M. (2005). Distributed computing in practice: the Condor experience. Concurrency - Practice and Experience, 17(2-4), Foster, I. (2005). A Globus Primer: Describing Globus Toolkit 4 von Laszewski, G. (2006). Java CoG Kit Workflow Concepts. Journal of Grid Computing. von Laszewski, G., Foster, I., Gawor, J., & Lane, P. (2001). A Java Commodity Grid Kit. Concurrency and Computation: Practice and Experience, 13(8-9), http://grid.rit.edu