Download presentation
Presentation is loading. Please wait.
Published byPaul Casey Modified over 9 years ago
1
Use of Condor on the Open Science Grid Chris Green, OSG User Group / FNAL Condor Week, April 30 2008
2
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 1 What is OSG? Links OSG home page.OSG home page VORS resource map and information.VORS VDT (Virtual Data Toolkit) home page.VDT Current use of OSG.Current use "Virtual Organizations" (VOs): trust point for authorization; role-based personalities. Works with multiple underlying batch systems (Condor, PBS family, LSF, SGE). Collection of mostly US-based scientific / academic sites sharing computing and storage resources via common software stack. Job submission and management based around Globus / CondorG.
3
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 2 OSG facts and figures 83 registered computing resources. 30 registered VOs. Usage breakdown for 2008/04/19 – 2008/04/25:
4
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 3 Survey of Condor use on OSG Out of the box: CondorG for inter-site job transfer via Globus/GRAM: GT2 submissions via CondorG still (by far) the most common method of grid job submission on OSG. Task scheduling for site health monitoring. One of several batch systems supported on OSG. "ManagedFork" job management.
5
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 4 Survey of Condor use on OSG External projects Glidein / WMS: "pilot" job submission and management. FermiGrid: job forwarding, "campus grid" management. OSGMM / ReSS: job forwarding and attribute-based matchmaking across multiple OSG sites. "condorview:" enhanced job monitoring and control – not the web-based statistics client of the same name. Complex workflows (eg LIGO: Pegasus/DAGMAN). Gratia: accounting system leverages features of condor where available: condor_history, PER_JOB_HISTORY_DIR, DN.
6
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 5 More detail: Glidein/WMS Workload Management System (Igor Sfiligoi, FNAL) uses Condor Glideins -- startd submitted as a grid job ("pilot") makes remote batch nodes look like local ones.Workload Management SystemIgor Sfiligoi Two main components: One or more glidein factories: manage available grid sites and submit pilot jobs. One or more VO frontends: receive payload submissions from users for distribution to sites. Pilots receive user payloads as distributed by VO frontends.
7
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 6 More detail: Glidein/WMS
8
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 7 More detail: Glidein/WMS Uses GCB for firewall / NAT management. Intra-VO priority management. Works with glExec: application running on worker nodes which handles authorization and UID mapping for payloads – per user accountability to the site.glExec Unaffected by grid site batch manager choice. V1.0 released Dec.'07; v1.1 Jan'08.v1.1 Jan'08 In use by: CDF; Minos (FNAL); being commissioned for CMS.
9
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 8 More detail: "condorview" Michael Thomas, Caltech.Michael Thomas Graphical tool for browsing and managing a condor queue. Hooks to vacate and kill jobs. Hooks to ssh into job directory on worker node and print out process tree. Uses condor_q, condor_config_val, and condor_fetchlog.
10
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 9 More detail: condorview
11
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 10 More detail: condorview
12
April 30, 2008 Condor Week Chris Green OSG User Group / FNAL 11 Concluding statements Condor essential to the OSG. Condor use underpins connectivity of sites within the OSG. Close ties: Miron is OSG PI; VDT team at Wisconsin; new Condor features often a result of OSG needs. Widely used on OSG; many novel uses of and applications building on Condor features. More details in later talks!
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.