Leveraging Database Technologies in Condor Jeff Naughton March 14, 2005.

Slides:



Advertisements
Similar presentations
PHINMS: Application Integration
Advertisements

Jaime Frey Computer Sciences Department University of Wisconsin-Madison OGF 19 Condor Software Forum Routing.
Database System Concepts and Architecture
17th February, 2000 by Maciej Korzeniowski (CERN-IT-IA-MI) 1 Oracle Discoverer Product Presentation  This is an ad hoc query and analysis tool for.
Test Case Management and Results Tracking System October 2008 D E L I V E R I N G Q U A L I T Y (Short Version)
OnContact CRM Customer Relationship Management. CRM 7 Benefits Rich "client" experience, completely web-based Access data anytime, anywhere. Ease of navigation.
Dan Bradley Computer Sciences Department University of Wisconsin-Madison Schedd On The Side.
Brian Alderman | MCT, CEO / Founder of MicroTechPoint Pete Harris | Microsoft Senior Content Publisher.
Access 2007 Product Review. With its improved interface and interactive design capabilities that do not require deep database knowledge, Microsoft Office.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Ch1: File Systems and Databases Hachim Haddouti
Database Administration
Workload Management Massimo Sgaravatto INFN Padova.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Lecture The Client/Server Database Environment
© 2006, Cognizant Technology Solutions. All Rights Reserved. The information contained herein is subject to change without notice. Automation – How to.
The Client/Server Database Environment
Jaeyoung Yoon Computer Sciences Department University of Wisconsin-Madison Virtual Machines in Condor.
CSCI ClearQuest 1 Rational ClearQuest Michel Izygon - Jim Helm.
IT – DBMS Concepts Relational Database Theory.
Distributed Systems Early Examples. Projects NOW – a Network Of Workstations University of California, Berkely Terminated about 1997 after demonstrating.
Introduction to the Enterprise Library. Sounds familiar? Writing a component to encapsulate data access Building a component that allows you to log errors.
1 DAN FARRAR SQL ANYWHERE ENGINEERING JUNE 7, 2010 SCHEMA-DRIVEN EXPERIMENT MANAGEMENT DECLARATIVE TESTING WITH “DEXTERITY”
 Introduction Introduction  Purpose of Database SystemsPurpose of Database Systems  Levels of Abstraction Levels of Abstraction  Instances and Schemas.
Database Concepts & Introduction to MS Access 1. Outline Database Overview  Database Management System Concepts  Database Structures Database, tables,
PRESENTED BY: LASONYA SHELBY 04/18/2010 LSTE 7309 The Importance of Databases.
Tutorial 10 Adding Spry Elements and Database Functionality Dreamweaver CS3 Tutorial 101.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS (Cont’d) Instructor Ms. Arwa Binsaleh.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
1 HawkEye A Monitoring and Management Tool for Distributed Systems Todd Tannenbaum Department of Computer Sciences University of.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
CS480 Computer Science Seminar Introduction to Microsoft Solutions Framework (MSF)
Peter Keller Computer Sciences Department University of Wisconsin-Madison Quill Tutorial Condor Week.
Csi315csi315 Client/Server Models. Client/Server Environment LAN or WAN Server Data Berson, Fig 1.4, p.8 clients network.
Yokogawa Electric Corporation Copyright © Yokogawa Electric Corporation Release 2.10 Functionality Overview September 2004.
FORUM II Best Practices in Data Warehousing in Higher Education: A Framework for Higher Education Reporting April 18, 2005 Slide 1 Cornell University’s.
Copyright © Yokogawa Electric Corporation Release 2.10 Functionality Overview September 2004.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
- Ahmad Al-Ghoul Data design. 2 learning Objectives Explain data design concepts and data structures Explain data design concepts and data structures.
Database Architectures Database System Architectures Considerations – Data storage: Where do the data and DBMS reside? – Processing: Where.
IPortal Bringing your company and your business partners together through customized WEB-based portal software. SanSueB Software Presents iPortal.
CERN – European Organization for Nuclear Research Administrative Support - Internet Development Services CET and the quest for optimal implementation and.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
INFO1408 Database Design Concepts Week 15: Introduction to Database Management Systems.
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison Quill / Quill++ Tutorial.
Transparently Gathering Provenance with Provenance Aware Condor Christine Reilly and Jeffrey Naughton Department of Computer Sciences University of Wisconsin.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
Distributed database system
ClearQuest XML Server with ClearCase Integration Northwest Rational User’s Group February 22, 2007 Frank Scholz Casey Stewart
Greg Thain Computer Sciences Department University of Wisconsin-Madison Configuring Quill Condor Week.
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
Library Online Resource Analysis (LORA) System Introduction Electronic information resources and databases have become an essential part of library collections.
FTS monitoring work WLCG service reliability workshop November 2007 Alexander Uzhinskiy Andrey Nechaevskiy.
Week 7 Lecture Part 2 Introduction to Database Administration Samuel S. ConnSamuel S. Conn, Asst Professor.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Leveraging Database Technologies in Condor Jeff Naughton April 25, 2006.
I/Watch™ Weekly Sales Conference Call Presentation (See next slide for dial-in details) Andrew May Technical Product Manager Dax French Product Specialist.
Workload Management Workpackage
The Client/Server Database Environment
The Client/Server Database Environment
Building Grids with Condor
Database Management Systems
Database Management System (DBMS)
Lecture 1: Multi-tier Architecture Overview
Basic Grid Projects – Condor (Part I)
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Condor: Firewall Mirroring
New Technologies for Storage and Display of Meteorological Data
Presentation transcript:

Leveraging Database Technologies in Condor Jeff Naughton March 14, 2005

Overview › Introducing ourselves › How we got involved › What we are doing and what we hope to do › Request for input

Who we are › Faculty: David DeWitt, Jeff Naughton › Students: Jiansheng Huang, Ameet Kini, Christine Reilly, Eric Robinson, Srinath Shankar, Lakshmikant Shrinivas

Wisconsin DB Group › A world-leading DB research group for over 20 years. › Strong presence in:  Research publications.  Grads on faculty at top schools (Berkeley X 2, Cornell X 2, CMU)  Grads at top industrial DB research centers (IBM Almaden, MS Research)  Grads in development organizations of main DB companies (IBM DB2, Oracle, MS SQL Server) › History of influential software artifacts (WiSS, Gamma, Exodus, SHORE, Paradise)

So how did we get to Condor/Paradyn week? › 4 th floor of CS building: 4361 Naughton, 4367 DeWitt, 4369 Livny (adjacent offices!) › Miron was very persuasive. His algorithm: 1. Enter our offices. 2. Describe some challenging and interesting data management problem Condor faces or will face. 3. Leave office, get on airplane. 4. Return to Madison, go to 1.

Why Condor and DBMS? › Premise: A running Condor system is awash in data:  Operational data  Historical data  User data › DBMS technology can help capture, organize, manage, archive, and query this data.

Three potential levels of involvement 1. Passively collect and organize data, expose it through DB query interfaces. 2. Move/extend some data-related portions of Condor to DBMS (Condor writes to and reads from DBMS) 3. Provide services to help users manage their data.

Why do this? › For Condor developers:  Easier to trouble shoot and debug the system;  Easier to implement new functionality; Less time hassling with data management issues; Power of declarative data management language.  Easier to make data management aspects of the system scalable; Leverage 25 years of DBMS research on scalable data management.

Why do this? › For Condor administrators  Easier to analyze and trouble shoot;  Easier to audit;  Easier to explore current and past system status and behavior.

Why do this? › For Condor users:  An ever-improving system due to more productive developers and administrators.  Easier to monitor and understand performance of their jobs.  Easier to analyze history of their use of the system. Complete record of every job they have submitted, and everything that happened to every job while it was running. Support for detailed data lineage queries.  Data management facilities to assist them in handling large, complex, inter-related data sets.

Our projects and plans › Quill: Transparently provide a DBMS query interface to job_queue and history data. [ready to deploy!] › CondorDB: Transparently captures and provides interface to critical data from all Condor daemons. [status: partial prototype working in our own “sandbox”]

Longer-term plans › Tight integration of DBMS technology and Condor [status: thinking hard!]. › DBMS-inspired data management services to help Condor users manage their own data. [status: thinking really hard!]

Why doesn’t Condor currently use DBMS technology? › Simple answer: Condor and DBMSs “grew up” together.  Condor project started  Postgres project started › Now both are ready for each other.

Project 1: Quill > Non-invasive approach to capturing job related information > Works by sniffing updates to the job queue log > Serves condor_q and condor_history queries > Independent, reliable, and efficient querying of job related information So how does it work?

Quill Architecture QuillSchedd Job Queue log Get new events Write events Store events RDBMS Startd … Master Queue + History Tables

Querying Job Related Information RDBMS … Master StartdScheddQuill condor_q queries condor_q++ queries Querying an already busy schedd!! Independent and a more powerful query functionality

Quill benefits › Robustness: Monitored by master just like other condor daemons – resilient to failure › Independence: Not in critical path of any other condor daemons › Performance: Derive benefits of SQL to serve job related queries an order of magnitude faster › Functionality: A broader range of queries › Extensibility: Easy to add more complex queries › Downside: only handles job queue and history data.

Project 2: CondorDB › CondorDB is a passive approach to capturing operational data in a condor pool › Modified daemons log events to the database at run time – no log sniffing › Central database serves entire pool › Web-based query GUI

Data Capture in CondorDB › Condor daemons augmented to record important events in a database › Database is in addition to standard daemon logs › Pool will run unaffected even in the absence of a database Schedd Negotiator Starter Startd Shadow A Machine Schedd

CondorDB User Interface › Users can access Condor through a web-interface  Job queue, job history, machine info, match and reject info, aggregates and summaries, etc… › The web server queries the database with PHP

Users see only their own job information

Users see only their own job queue on a shared machine

Drill-down to get detailed job information

Matchmaking data at your fingertips Matches Rejects

Machine information in a single central repository

The data-centric approach makes many tasks easier › Privacy enhanced by presenting user with queue/history information about her jobs only › Intuitive “drill-down” navigation to get increasingly detailed information › All information about a job from submit-time until present available from a single screen › Useful summary information presented in tabular and graphical format › Optionally query database directly for ad hoc information on job queue, job history, matchmaking and file usage

Acknowledgement › The Condor team has been wonderfully responsive and supportive throughout this effort.

Demos! › Come see demos of Quill and CondorDB in room 4360 CS on Wed. afternoon.

Virtuous Cycle › As we learn where Condor can use DBMS technology, we also learn where DBMS technology can be (must be?) improved.  Support for dynamic-schema sparse data sets.  Extreme requirements of self-installation and self-maintenance.  Pushing match-making style operations into DBMS. › Improving DBMS technology will lead to more places that it can be installed.

Request › We want your input! › We have a lot of ideas but want to filter, modify, and augment them through the benefit of your experience. › Send mail to anytime.