Peter Keller Computer Sciences Department University of Wisconsin-Madison Quill Tutorial Condor Week.

Peter Keller Computer Sciences Department University of Wisconsin-Madison Quill Tutorial Condor Week 2006

What is Quill?
A non-invasive method of storing a read-only version of the job queue and job historical data in a relational database.

Why Do We Need It?
› Presents the job queue information as a set of tables in a relational database (Big Win!)
› Fault tolerance
› Provides performance enhancements in very large and busy pools

Job Queue Management
[Figure: two panels. Without Quill: job queue, schedd. With Quill: job queue, schedd, quilld, database.]

Deployment
› One Quill daemon per schedd
› Quill daemons must be uniquely named
› Each Quill daemon uses a unique DB name
› Multiple Quill daemons may utilize one database server
› Currently uses PostgreSQL
    - Recommend PostgreSQL 8.1 or later for automatic vacuuming of tables
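The deployment rules above lend themselves to a quick sanity check before a pool is rolled out. The following is a minimal sketch, assuming the layout is described as a list of records; the `QuillDaemon` class, its field names, and the example values in the usage note are hypothetical and are not part of Condor.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class QuillDaemon:
    # Hypothetical record describing one planned Quill daemon.
    schedd: str      # the schedd this Quill daemon mirrors
    quill_name: str  # must be unique among Quill daemons
    db_name: str     # must be unique per Quill daemon
    db_server: str   # may be shared by several Quill daemons


def deployment_problems(daemons):
    """Return a list of rule violations; an empty list means the layout is fine."""
    problems = []
    checks = (
        ("schedd", [d.schedd for d in daemons]),
        ("Quill name", [d.quill_name for d in daemons]),
        ("database name", [d.db_name for d in daemons]),
    )
    for label, values in checks:
        for value, count in Counter(values).items():
            if count > 1:
                problems.append(f"{label} {value!r} is used by {count} Quill daemons")
    return problems
```

For example, two daemons that share a database server but accidentally share a database name would be reported, while sharing only the server passes the check.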

Condor’s Interface to Quill
› Modified two tools to utilize the DB
    - condor_q
    - condor_history
› Very minor modifications to schedd
› Multiple sources for Job Queue & History pose an interesting problem

Job Queue Discovery Sequence (Local Query)
[Figure: condor_q, database, quilld, schedd, and job queue, with the query steps numbered 1 to 3.]

Job Queue Discovery Sequence (Remote Query)
[Figure: condor_q, collector, database, quilld, schedd, and job queue.]

A User Perspective: condor_q
› condor_q changes
    - -name takes a ScheddName or QuillName
    - -avgqueuetime details average time in queue for all jobs

A User Perspective: condor_q
Example: condor_q -name
    Linux merlin > condor_q -name
    -- DB: : : psilord_db
    ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
    92.0 psilord 4/21 09: :00:00 I foo
    1 jobs; 1 idle, 0 running, 0 held

A User Perspective
Example: condor_q -avgqueuetime
    Linux merlin > condor_q -avgqueuetime
    -- DB: : : psilord_db
    Average time in queue for uncompleted jobs (in hh:mm:ss) 00:40:

Job History Discovery Sequence (Local Query)
[Figure: condor_history, database, quilld, job queue, and history file, with the query steps numbered 1 and 2.]
The quilld is never queried directly!

Job History Discovery (Remote Query) NEW!
[Figure: condor_history, collector, database, quilld, job queue, and history file, with numbered query steps.]
The quilld is never queried directly!

A User Perspective: condor_history
› condor_history changes
    - -name takes a Quill name to retrieve job histories from a remote Quill’s database
    - -completedsince returns all jobs completed since a PostgreSQL-formatted date

A User Perspective: condor_history
Example: condor_history -name
    Linux merlin > condor_history -name
    -- DB: : : psilord_db
    ID OWNER SUBMITTED RUN_TIME ST COMPLETED CMD
    91.0 psilord 4/20 14: :00:00 X ??? /scratch/psilor
    92.0 psilord 4/21 09: :00:00 X ??? /scratch/psilor
    93.0 psilord 4/21 10: :00:01 C 4/21 10:12 /scratch/psilor

A User Perspective: condor_history
Example: condor_history -completedsince
    Linux merlin > condor_history -completedsince " :00:01"
    -- DB: : : psilord_db
    ID OWNER SUBMITTED RUN_TIME ST COMPLETED CMD
    93.0 psilord 4/21 10: :00:01 C 4/21 10:12 /scratch/psilor
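Because -completedsince expects a PostgreSQL-formatted date, a script that drives condor_history has to render its timestamp as a string first. The sketch below is one way to do that, assuming the common "YYYY-MM-DD HH:MM:SS" form that PostgreSQL accepts; the helper name `completed_since_args` and the date in the usage note are hypothetical, and the exact format Quill requires is not confirmed by the slides.

```python
from datetime import datetime


def completed_since_args(moment: datetime) -> list[str]:
    """Build an argument list for `condor_history -completedsince`."""
    # Assumption: an ISO 8601 style timestamp, which PostgreSQL parses.
    stamp = moment.strftime("%Y-%m-%d %H:%M:%S")
    # Returning a list (not a shell string) avoids quoting problems when
    # the arguments are later handed to subprocess.run().
    return ["condor_history", "-completedsince", stamp]
```

Calling `completed_since_args(datetime(2006, 1, 2, 3, 4, 5))` yields an argument list whose last element is the formatted timestamp, kept as a single argument because it contains a space.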

Short-Circuiting the Discovery Sequence
› Use the -direct option!
› Examples
    - condor_q -direct rdbms
    - condor_q -direct quilld
    - condor_q -direct schedd
› “rdbms”, “quilld”, and “schedd” are the literal parameter values.
› Invaluable for debugging!
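The difference between the normal discovery sequence and -direct can be pictured as a failover loop that -direct collapses to a single attempt. This is only an illustrative sketch: the order rdbms, quilld, schedd is assumed from the order of the -direct values, and `query_queue` with its `sources` mapping is a hypothetical stand-in, not Condor code.

```python
from typing import Callable, Optional

# Assumed failover order; the slides do not state it explicitly.
DISCOVERY_ORDER = ("rdbms", "quilld", "schedd")


def query_queue(sources: dict[str, Callable[[], list]],
                direct: Optional[str] = None):
    """Return (source_name, jobs) from the first source that answers.

    With `direct` set, only that source is tried and nothing fails over.
    """
    if direct is not None and direct not in DISCOVERY_ORDER:
        raise ValueError(f"unknown source: {direct!r}")
    order = (direct,) if direct is not None else DISCOVERY_ORDER
    errors = []
    for name in order:
        try:
            return name, sources[name]()
        except Exception as exc:  # any failure moves on to the next source
            errors.append(f"{name}: {exc}")
    raise RuntimeError("no job queue source answered: " + "; ".join(errors))
```

With the database down, the default call quietly answers from the next source, which hides the outage; the direct call raises instead, which is why the option is useful for debugging.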

PostgreSQL 8.1 Installation
› ./configure
› gmake && gmake install
› mkdir /path/to/pgsql/data
› initdb -D /path/to/pgsql/data
› postmaster -D /path/to/pgsql/data
› Note: Default port binding is 5432.

PostgreSQL Configuration
› Add two special user accounts: quillreader and quillwriter
    - createuser quillreader --no-createdb --no-adduser --pwprompt
    - createuser quillwriter --createdb --no-adduser --pwprompt

PostgreSQL Configuration (cont)
› Allow TCP/IP connections
    - Edit file postgresql.conf
    - Add listen_addresses = '*'
› Allow connections from specific hosts
    - Edit file pg_hba.conf
    - host all quillreader password
    - host all quillwriter password
› Note: only use ‘password’ authentication at this time.

Quill Configuration
› User quillwriter needs a write password.
› Store it in a file called .quillwritepassword in the $(SPOOL) directory.
› If Condor is running as root, ensure that only the condor uid can read it.
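Creating the password file with restrictive permissions from the start avoids a window in which other users could read it. The following is a minimal sketch, assuming it runs as the user that should own the file and that the file holds just the password followed by a newline; the function name `write_quill_password` is hypothetical and the exact file format Quill expects is an assumption.

```python
import os
import stat


def write_quill_password(spool_dir: str, password: str) -> str:
    """Create .quillwritepassword in spool_dir, readable only by its owner."""
    path = os.path.join(spool_dir, ".quillwritepassword")
    # O_EXCL refuses to overwrite an existing file; 0o600 means
    # read/write for the owner and no access for anyone else.
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_EXCL, 0o600)
    with os.fdopen(fd, "w") as handle:
        handle.write(password + "\n")  # assumed format: one line
    # Set the mode explicitly in case the process umask altered it.
    os.chmod(path, stat.S_IRUSR | stat.S_IWUSR)
    return path
```

When Condor runs as root, the file would additionally need to be owned by the condor uid, which this sketch leaves to the caller.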

Quill Configuration (cont)
› Condor system specific attributes in file condor_config.local
    - QUILL = $(SBIN)/condor_quill
    - QUILL_LOG = $(LOG)/QuillLog
    - QUILL_ADDRESS_FILE = $(LOG)/.quill_address
    - DAEMON_LIST = …, QUILL
    - VALID_SPOOL_FILES = …, .quillwritepassword
    - DC_DAEMON_LIST = …, QUILL

Quill Configuration (cont)
› Quill specific attributes
    - QUILL_ENABLED = TRUE
    - # The quill name must be unique across all quill daemons AND schedds
    - QUILL_NAME =
    - QUILL_DB_NAME = psilord_db
    - QUILL_DB_IP_ADDR = merlin.cs.wisc.edu:42999
    - QUILL_POLLING_PERIOD = 10 (seconds)

Quill Configuration (cont)
› QUILL_HISTORY_CLEANING_INTERVAL = 24 (hours)
› QUILL_HISTORY_DURATION = 30 (days)
› QUILL_MANAGE_VACUUM = FALSE
› QUILL_IS_REMOTELY_QUERYABLE = TRUE
› QUILL_DB_QUERY_PASSWD = xxx

DB Storage Method
› Schema designed to store and query classads
    - 4 tables to represent the job queue classads
    - 2 for history data
    - 1 for metadata
› Some queries are easier than others
› Ask more questions at the BOF!
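One way to see why some queries are easier than others is to store classad attributes as one row per (job, attribute) pair and compare a single-attribute query with a two-attribute one. The sketch below is purely illustrative: it uses an in-memory SQLite database instead of PostgreSQL, and the table `job_attrs`, its columns, and the sample rows are hypothetical, not the actual Quill schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical layout: one row per (job, attribute) pair.
conn.execute(
    "CREATE TABLE job_attrs (cluster_id INTEGER, proc_id INTEGER, attr TEXT, val TEXT)"
)
conn.executemany(
    "INSERT INTO job_attrs VALUES (?, ?, ?, ?)",
    [
        (92, 0, "Owner", "psilord"),
        (92, 0, "JobStatus", "1"),  # 1 = idle in Condor
        (93, 0, "Owner", "psilord"),
        (93, 0, "JobStatus", "4"),  # 4 = completed in Condor
    ],
)

# Easy: filter on a single attribute.
idle = conn.execute(
    "SELECT cluster_id, proc_id FROM job_attrs "
    "WHERE attr = 'JobStatus' AND val = '1'"
).fetchall()

# Harder: combining two attributes needs a self-join.
idle_by_owner = conn.execute(
    """
    SELECT s.cluster_id, s.proc_id
    FROM job_attrs AS s
    JOIN job_attrs AS o
      ON o.cluster_id = s.cluster_id AND o.proc_id = s.proc_id
    WHERE s.attr = 'JobStatus' AND s.val = '1'
      AND o.attr = 'Owner' AND o.val = 'psilord'
    """
).fetchall()
```

Each extra attribute in the filter adds another self-join in this layout, which is the trade-off for being able to store arbitrary classad attributes without changing the table definition.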

Thank you!
› Want more information?
› BOF “Databases in Condor: Now and in the Future”