Greg Thain Computer Sciences Department University of Wisconsin-Madison Configuring Quill Condor Week.

Slides:



Advertisements
Similar presentations
Jaime Frey Computer Sciences Department University of Wisconsin-Madison OGF 19 Condor Software Forum Routing.
Advertisements

Chapter 20 Oracle Secure Backup.
Dan Bradley Computer Sciences Department University of Wisconsin-Madison Schedd On The Side.
Greg Thain Computer Sciences Department University of Wisconsin-Madison Condor Parallel Universe.
Greg Quinn Computer Sciences Department University of Wisconsin-Madison Condor on Windows.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 1: Introduction to Windows Server 2003.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 1: Introduction to Windows Server 2003.
Condor Project Computer Sciences Department University of Wisconsin-Madison Security in Condor.
Jaeyoung Yoon Computer Sciences Department University of Wisconsin-Madison Virtual Machine Universe in.
Jaeyoung Yoon Computer Sciences Department University of Wisconsin-Madison Virtual Machines in Condor.
Derek Wright Computer Sciences Department, UW-Madison Lawrence Berkeley National Labs (LBNL)
Condor Project Computer Sciences Department University of Wisconsin-Madison Virtual Machines in Condor.
Jaime Frey Computer Sciences Department University of Wisconsin-Madison Virtual Machines in Condor.
Cheap cycles from the desktop to the dedicated cluster: combining opportunistic and dedicated scheduling with Condor Derek Wright Computer Sciences Department.
Zach Miller Computer Sciences Department University of Wisconsin-Madison What’s New in Condor.
Implementing a Central Quill Database in a Large Condor Installation Preston Smith Condor Week April 30, 2008.
Alain Roy Computer Sciences Department University of Wisconsin-Madison An Introduction To Condor International.
April Open Science Grid Building a Campus Grid Mats Rynge – Renaissance Computing Institute University of North Carolina, Chapel.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison What’s New in Condor.
Progress Report Barnett Chiu Glidein Code Updates and Tests (1) Major modifications to condor_glidein code are as follows: 1. Command Options:
Installing and Managing a Large Condor Pool Derek Wright Computer Sciences Department University of Wisconsin-Madison
Ashish Patro MinJae Hwang Thanumalayan S. Thawan Kooburat.
Hao Wang Computer Sciences Department University of Wisconsin-Madison Security in Condor.
1 1 Vulnerability Assessment of Grid Software Jim Kupsch Associate Researcher, Dept. of Computer Sciences University of Wisconsin-Madison Condor Week 2006.
Peter Keller Computer Sciences Department University of Wisconsin-Madison Quill Tutorial Condor Week.
Greg Thain Computer Sciences Department University of Wisconsin-Madison cs.wisc.edu Interactive MPI on Demand.
1 The Roadmap to New Releases Todd Tannenbaum Department of Computer Sciences University of Wisconsin-Madison
Zach Miller Computer Sciences Department University of Wisconsin-Madison Bioinformatics Applications.
Hao Wang Computer Sciences Department University of Wisconsin-Madison Authentication and Authorization.
Alain Roy Computer Sciences Department University of Wisconsin-Madison ClassAds: Present and Future.
The Roadmap to New Releases Derek Wright Computer Sciences Department University of Wisconsin-Madison
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison Quill / Quill++ Tutorial.
Condor Project Computer Sciences Department University of Wisconsin-Madison Grids and Condor Barcelona,
Derek Wright Computer Sciences Department University of Wisconsin-Madison Condor and MPI Paradyn/Condor.
Derek Wright Computer Sciences Department University of Wisconsin-Madison New Ways to Fetch Work The new hook infrastructure in Condor.
Pilot Factory using Schedd Glidein Barnett Chiu BNL
Ian D. Alderman Computer Sciences Department University of Wisconsin-Madison Condor Week 2008 End-to-end.
Linux Operations and Administration
Configuring and Deploying Web Applications Lesson 7.
Jaime Frey Computer Sciences Department University of Wisconsin-Madison What’s New in Condor-G.
Dan Bradley Condor Project CS and Physics Departments University of Wisconsin-Madison CCB The Condor Connection Broker.
Matthew Farrellee Computer Sciences Department University of Wisconsin-Madison Condor and Web Services.
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison Condor NT Condor ported.
Condor Project Computer Sciences Department University of Wisconsin-Madison Running Interpreted Jobs.
Jaime Frey Computer Sciences Department University of Wisconsin-Madison Condor and Virtual Machines.
Leveraging Database Technologies in Condor Jeff Naughton March 14, 2005.
HTCondor Security Basics HTCondor Week, Madison 2016 Zach Miller Center for High Throughput Computing Department of Computer Sciences.
Gabi Kliot Computer Sciences Department Technion – Israel Institute of Technology Adding High Availability to Condor Central Manager Adding High Availability.
Leveraging Database Technologies in Condor Jeff Naughton April 25, 2006.
Condor Week May 2012No user requirements1 Condor Week 2012 An argument for moving the requirements out of user hands - The CMS experience presented.
HTCondor Networking Concepts
HTCondor Networking Concepts
HTCondor Security Basics
Dynamic Deployment of VO Specific Condor Scheduler using GT4
Operating a glideinWMS frontend by Igor Sfiligoi (UCSD)
High Availability in HTCondor
Objectives Differentiate between the different editions of Windows Server 2003 Explain Windows Server 2003 network models and server roles Identify concepts.
Adding High Availability to Condor Central Manager Tutorial
Monitoring HTCondor with Ganglia
Building Grids with Condor
Accounting in HTCondor
The Scheduling Strategy and Experience of IHEP HTCondor Cluster
HTCondor Command Line Monitoring Tool
HTCondor Security Basics HTCondor Week, Madison 2016
Basic Grid Projects – Condor (Part I)
Condor: Firewall Mirroring
Condor-G Making Condor Grid Enabled
Presentation transcript:

Greg Thain Computer Sciences Department University of Wisconsin-Madison Configuring Quill Condor Week 2007

Typical Condor Pool Central Manager master collector negotiator = ClassAd Communication Pathway = Process Spawned Submit-Only master schedd Execute-Only master startd Execute-Only master startd

What is Quill? A technology to store a read only version of the job queue and job historical data in a relational database.

Why Quill? Offloads query overhead from sched  Performance boost! › Easier to make web portal  RDMS access easier than SOAP/CLI

Job Queue Management Job Queue scheddquilldDatabase Job Queue schedd Without QuillWith Quill

Quill downsides › Additional latency › More complicated setup › Handful of attributes not in DBMS

Quill and Quill++ › Quill in Condor since › Quill++ (quillpp) coming soon.  Support for all daemons  Multiple schedds in one database  Support for Oracle on some platforms  Replaces quill › We’ll talk about both

Typical Quill’d Condor Pool Central Manager master collector negotiator = ClassAd Communication Pathway = Process Spawned Submit-Only master schedd Execute-Only master startd Execute-Only master startd Database postgres query quill condor_q

Typical Quillpp’d Condor Pool Central Manager master collector negotiator = ClassAd Communication Pathway = Process Spawned Submit-Only master schedd Execute-Only master startd Execute-Only master startd Database postgres query quillquillpp condor_q quillpp

How to use Schema? › We’ll talk about this in another talk  Quill Front End and Schema BoF Thursday 11am

Quill (not Quill++) Deployment › One Quill daemon per schedd › Quill daemons must be uniquely named › Each Quill daemon uses a unique DB name › Currently uses PostgreSQL  Recommend PostgreSQL 8.2 or later Better disk management

Quill++ deployment › One condor_quillpp per machine › One condor_dbmsd per database › Manual installation of schema › One DB per pool › Uses Postgres or Oracle

Condor’s Interface to Quill › Modified two tools to utilize the DB  condor_q  condor_history

A User Perspective: condor_q › condor_q changes  When QUILL_ENABLED, goes to rdbms  -name takes a ScheddName or QuillName  -avgqueuetime details average time in queue for all jobs

Condor_q -direct › -direct rdbms  (default when QUIL_ENABLE=true) › -direct quilld  (useful for firewall traversal) › -direct schedd  (100% up-to-date view)

A User Perspective: condor_history › condor_history changes  -name takes a Quill Name to retrieve job histories from a remote quill’s database

Condor_history -direct › There isn’t any (yet) › Condor_history –f \  `condor_config_val HISTORY` › No –direct quilld equivalent

PostgreSQL Configuration › Add two special user accounts: quillreader and quillwriter  createuser quillreader --no-createdb --no-adduser --pwprompt  createuser quillwriter --createdb --no-adduser --pwprompt

PostgreSQL Configuration (cont) › Allow TCP/IP connections  Edit file postgresql.conf Add listen_address = '*' › Allow connections from specific hosts  Edit file pg_hba.conf host all quillreader password host all quillwriter password › Note: only use ‘password’ authentication at this time.

Quill Configuration › User quillwriter needs a password. › Store it in › $(SPOOL) /.quillwritepassword (quill) › $(SPOOL) /.pgpass (quill++) .pgpass has host:port:db:user:pass › Ensure only the condor uid can read it if Condor is running as root

Quill Configuration (cont) › Condor system specific attributes in file condor_config.local  QUILL = $(SBIN)/condor_quill  QUILL_LOG = $(LOG)/QuillLog  QUILL_ADDRESS_FILE = $(LOG)/.quill_address  DAEMON_LIST = …, QUILL  VALID_SPOOL_FILES = …,.quillwritepassword  DC_DAEMON_LIST = …, QUILL

Quill Configuration (cont) › Quill specific attributes  QUILL_ENABLED = TRUE  # The quill name must be unique across all  # quill daemons AND schedds  QUILL_NAME =  QUILL_DB_NAME = psilord_db  QUILL_DB_IP_ADDR = merlin.cs.wisc.edu:42999  QUILL_POLLING_PERIOD = 10 (seconds)

Quill Configuration (cont) › QUILL_HISTORY_CLEANING_INTERVAL = 24 (hours) › QUILL_HISTORY_DURATION = 30 (days) › QUILL_MANAGE_VACUUM = FALSE › QUILL_IS_REMOTELY_QUERYABLE = TRUE › QUILL_DB_QUERY_PASSWD = xxx

Schema management › Quill automatically loads schema  Upgrades itself automatically › Quill++ requires manual loading:  Psql –Uquillwriter<common_createddl.sql  Psql –Uquillwriter<pgsql_createddl.sql

Conversion to Quill++ › Conversion only matters for history › Conversion is one-way-only! › Two steps:  Dump quill history tables to file with Condor_dump_history  Load quill++ history tables from file with Condor_load_history

Data Management › Constrain database size  History truncation Quill++ other tables, too  Postgres Index management  Oracle cleans itself › Careful of long queries, esp with Quill

Data Management: Quill › HISTORY_CLEANING_INTERVAL  In hours (24 hours) › HISTORY_DURATION  How long in days (7 days) › QUILL_SHOULD_REINDEX  Boolean (false) › QUILL_MANAGE_VACUUM (false)

Data Management: Quill++ › Condor_dbmsd does all the work  QUILL_DBSIZE_LIMIT (20 Gb) – s warning when 75% is hit  DATABASE_PURGE_INTERVAL (s (24 hours))  DATABASE_REINDEX_INTERVAL (s (24 hours))  QUILL_DB_TYPE (oracle, pgsql)  QUILL_RESOURCE_HISTORY_DURATION (7 days)  QUILL_JOB_HISTORY_DURATION (10 years!)  QUILL_RUN_HISTORY_DURATION (7 days)

Thank you! › Want more information? › BOF “Databases in Condor”