Leveraging Database Technologies in Condor Jeff Naughton April 25, 2006.

Slides:



Advertisements
Similar presentations
How We Manage SaaS Infrastructure Knowledge Track
Advertisements

Implementing Tableau Server in an Enterprise Environment
Jaime Frey Computer Sciences Department University of Wisconsin-Madison OGF 19 Condor Software Forum Routing.
Current methods for negotiating firewalls for the Condor ® system Bruce Beckles (University of Cambridge Computing Service) Se-Chang Son (University of.
ICS 434 Advanced Database Systems
Dan Bradley Computer Sciences Department University of Wisconsin-Madison Schedd On The Side.
Condor and GridShell How to Execute 1 Million Jobs on the Teragrid Jeffrey P. Gardner - PSC Edward Walker - TACC Miron Livney - U. Wisconsin Todd Tannenbaum.
1 The IIPC Web Curator Tool: Steve Knight The National Library of New Zealand Philip Beresford and Arun Persad The British Library An Open Source Solution.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Chapter 9: The Client/Server Database Environment
First steps implementing a High Throughput workload management system Massimo Sgaravatto INFN Padova
WDK Driver Test Manager. Outline HCT and the history of driver testing Problems to solve Goals of the WDK Driver Test Manager (DTM) Automated Deployment.
Lecture The Client/Server Database Environment
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
Understanding and Managing WebSphere V5
The Client/Server Database Environment
Dynamics AX Technical Overview Application Architecture Dynamics AX Technical Overview.
® IBM Software Group © IBM Corporation IBM Information Server Service Oriented Architecture WebSphere Information Services Director (WISD)
Jaeyoung Yoon Computer Sciences Department University of Wisconsin-Madison Virtual Machines in Condor.
“This presentation is for informational purposes only and may not be incorporated into a contract or agreement.”
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
EJB Overview Celsina Bignoli Distributed Business Applications Server DB Client DB Server DB.
September 6, 2015 Connecting Client Applications to Informix Databases using IBM Informix Connect and ODBC James Edmiston Database Consultant Quest Information.
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison What’s New in Condor.
TechEd /22/2017 5:40 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
The Glidein Service Gideon Juve What are glideins? A technique for creating temporary, user- controlled Condor pools using resources from.
PCGRID ‘08 Workshop, Miami, FL April 18, 2008 Preston Smith Implementing an Industrial-Strength Academic Cyberinfrastructure at Purdue University.
WITSML Service Platform - Enterprise Drilling Information
Fundamentals of Database Chapter 7 Database Technologies.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
CS480 Computer Science Seminar Introduction to Microsoft Solutions Framework (MSF)
Peter Keller Computer Sciences Department University of Wisconsin-Madison Quill Tutorial Condor Week.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Condor Birdbath Web Service interface to Condor
ETICS All Hands meeting Bologna, October 23-25, 2006 NMI and Condor: Status + Future Plans Andy PAVLO Peter COUVARES Becky GIETZEL.
ArcGIS Server for Administrators
Database Architectures Database System Architectures Considerations – Data storage: Where do the data and DBMS reside? – Processing: Where.
Oracle Data Integrator Architecture Components.
The Roadmap to New Releases Derek Wright Computer Sciences Department University of Wisconsin-Madison
9 Systems Analysis and Design in a Changing World, Fourth Edition.
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison Quill / Quill++ Tutorial.
1 Condor BirdBath SOAP Interface to Condor Charaka Goonatilake Department of Computer Science University College London
Windows Role-Based Access Control Longhorn Update
Workforce Scheduling Release 5.0 for Windows Implementation Overview OWS Development Team.
Greg Thain Computer Sciences Department University of Wisconsin-Madison Configuring Quill Condor Week.
Integrating the Mainframe Liberating Enterprise Data.
Linux Operations and Administration
Condor Services for the Global Grid: Interoperability between OGSA and Condor Clovis Chapman 1, Paul Wilson 2, Todd Tannenbaum 3, Matthew Farrellee 3,
ViaSQL Technical Overview. Viaserv, Inc. 2 ViaSQL Support for S/390 n Originally a VSE product n OS/390 version released in 1999 n Identical features.
SAN DIEGO SUPERCOMPUTER CENTER Welcome to the 2nd Inca Workshop Sponsored by the NSF September 4 & 5, 2008 Presenters: Shava Smallen
Leveraging Database Technologies in Condor Jeff Naughton March 14, 2005.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
Building web applications with the Windows Azure Platform Ido Flatow | Senior Architect | Sela | This session.
Interstage BPM v11.2 1Copyright © 2010 FUJITSU LIMITED INTERSTAGE BPM ARCHITECTURE BPMS.
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
Maintaining and Updating Windows Server 2008 Lesson 8.
17 Copyright © 2006, Oracle. All rights reserved. Information Publisher.
The Holmes Platform and Applications
Business System Development
GPIR GridPort Information Repository
Netscape Application Server
Dynamic Deployment of VO Specific Condor Scheduler using GT4
The Client/Server Database Environment
Consulting Services JobScheduler Architecture Decision Template
CERN-Russia Collaboration in CASTOR Development
The Scheduling Strategy and Experience of IHEP HTCondor Cluster
Basic Grid Projects – Condor (Part I)
Condor-G Making Condor Grid Enabled
Workforce Central v6.3 Technology
Condor-G: An Update.
Presentation transcript:

Leveraging Database Technologies in Condor Jeff Naughton April 25, 2006

Overview › Introducing ourselves › What we have done since last year Obtained funding (Yay! thank you NSF!) Quill: deployed DB-centric data tool Quill++: more comprehensive, deployed in test- cluster, running (guinea pig) user jobs Condor J2EE: radical departure experimental system, deployed last week in test cluster Published some research papers… › BOF 1:30 on Thursday

Who we are › Faculty: David DeWitt, Jeff Naughton › Students: Jiansheng Huang, Ameet Kini, Christine Reilly, Eric Robinson, Srinath Shankar, Lakshmikant Shrinivas

How do we fit in? › Advanced Development/Research group focused on data management. › Goal: Interact frequently with Condor Dev. Team and users Design and prototype new technology; transfer to Condor team for deployment. › What we don’t do: Determine roadmap and schedules for deployment within Condor.

Why Condor and DBMS? › Premise: A running Condor system is awash in data: Operational data, Historical data, User data › DBMS technology can help capture, organize, manage, archive, and query this data. › This can make Condor even more powerful, usable, and useful.

Quill > Non-invasive approach to capturing job related information > Works by sniffing updates to the job queue log > Serves condor_q and condor_history queries > Independent, reliable, and efficient querying of job related information, with underlying SQL interface So how does it work?

Quill Architecture QuillSchedd Job Queue log Get new events Write events Store events RDBMS Startd … Master Queue + History Tables

Quill++ › More comprehensive than Quill (data from all daemons, not just SchedD) › Built on Quill code base › Condor daemons write to SQL logs, Quill daemon reads and inserts in DBMS › Central database serves entire pool › Web-based query GUI

Data Capture in Quill++ › Condor daemons augmented to record important events in a database › Database is in addition to standard daemon logs › Pool will run unaffected even in the absence of a database Schedd Negotiator Starter Startd Shadow A Machine Schedd

Quill++ Architecture Quill++Schedd Job Queue log Get new events Write events Store events RDBMS Startd … Master Queue, History, Machine, Match etc. Event logs

Implementation Details › Quill++: First class condor daemon Managed by Condor Master Native PostgreSQL API Can be ported to any platform for which PostgreSQL drivers are available (AIX, BSD, IRIX, HP-UX, Linux, Solaris, Windows etc.) Porting Quill++ to other databases involves implementing a database virtual class

Web Interface › Useful for: User job monitoring Administrative monitoring over jobs and resources Debugging

Condordb Admin Screen Recency summary Machine Status Jobs in queue History jobs

Job history by owner

Machine Report

Status about a job Classad Info Run Info Event Info Match Info Rejects Info

Recency info for exceptional data sources

Present Status › Deployed in testbed dbc cluster (93 machines) Has successfully run almost 100,000 jobs. Working with Condor team planning future distribution with Condor.

Caveats › Web interface to DB Basic prototype implemented Needs to be made more robust, user friendly (!) › Gathers incomplete information in multiple pool scenarios ( flocking, glide- in, condor-c)

CondorJ2 › To boldly go where no one has gone before Quill/Quill++: Database reflects state of Condor pool Condor J2: Database is the state of Condor pool › Overview of CondorJ2 Use database to maintain operational data (workflow state, machine state, config policies, etc.) Implement workflow management, resource management and resource allocation in Application Server environment Modify master, startd and starter to be thin web service clients Provide web interface for all system services (workflow submission, machine reconfiguration etc.)

Motivation › Scalability › Flexibility › Administratibility

Java Application Servers › Industrial strength middleware for high performance & scalable web applications › Widely deployed systems Oracle AS 10g, IBM WebSphere, BEA WebLogic, JBoss (open source) › Key features Support for transactions Web service interfaces Support for clustering (for scalability) Configurable security Backend database independence

Condor Web Services User’s Web Browser Condor Pool Web Site Condor Database Web Service Clients SOAP over HTTP master startd starter Execute Machines Application Server User’s Custom Tools Machine Modules Matchmaking Modules Workflow Modules HTTP JDBC

What can do in CondorJ2 via browsers and web services? › Add and configure new machines › Reconfigure machines on the fly › Specify, submit, monitor and manage workflows › Monitor global system state

Virtuous Cycle › As we learn where Condor can use DBMS technology, we learn where DBMS technology can be (must be?) improved. Support for sparse data sets [ICDE 2006]. Pushing match-making style operations into DBMS [SIGMOD 2006]. Data provenance as byproduct of Quill++ data capture. [IPAW 2006] › Improving DBMS technology will lead to more places that it can be installed.

Other ongoing work… › File caching in Condor pools › Techniques for explaining data consistency rather than dictating consistency › Automatic monitoring of system “health” by mining captured data

Visit us and see demos! › Come see demos of Quill, Quill++, and CondorJ2 in Rm. 216/218 Fluno Center on Thurs. afternoon 1:30 – 2:30pm.