DUCKS – Distributed User-mode Chirp- Knowledgeable Server Joe Thompson Jay Doyle.

Slides:



Advertisements
Similar presentations
How We Manage SaaS Infrastructure Knowledge Track
Advertisements

Parasol Architecture A mild case of scary asynchronous system stuff.
P-GRADE and WS-PGRADE portals supporting desktop grids and clouds Peter Kacsuk MTA SZTAKI
A CHAT CLIENT-SERVER MODULE IN JAVA BY MAHTAB M HUSSAIN MAYANK MOHAN ISE 582 FALL 2003 PROJECT.
Multiple Tiers in Action
1 Multiple class queueing networks Mean Value Analysis - Open queueing networks - Closed queueing networks.
Performance testing of Progress Appservers and a plug-in for Jmeter
Application for Internet Radio Directory 19/06/2012 Industrial Project (234313) Kickoff Meeting Supervisors : Oren Somekh, Nadav Golbandi Students : Moran.
Web Proxy Server. Proxy Server Introduction Returns status and error messages. Handles http CGI requests. –For more information about CGI please refer.
Conceptual Architecture of PostgreSQL PopSQL Andrew Heard, Daniel Basilio, Eril Berkok, Julia Canella, Mark Fischer, Misiu Godfrey.
31 January 2007Craig E. Ward1 Large-Scale Simulation Experimentation and Analysis Database Programming Using Java.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
- 1 - Grid Programming Environment (GPE) Ralf Ratering Intel Parallel and Distributed Solutions Division (PDSD)
WINDOWS SERVICES. Introduction You often need programs that run continuously in the background Examples: – servers –Print spooler You often need.
GRID job tracking and monitoring Dmitry Rogozin Laboratory of Particle Physics, JINR 07/08/ /09/2006.
AliEn uses bbFTP for the file transfers. Every FTD runs a server, and all the others FTD can connect and authenticate to it using certificates. bbFTP implements.
Institute of Computer and Communication Network Engineering OFC/NFOEC, 6-10 March 2011, Los Angeles, CA Lessons Learned From Implementing a Path Computation.
Software Engineering 2003 Jyrki Nummenmaa 1 CASE Tools CASE = Computer-Aided Software Engineering A set of tools to (optimally) assist in each.
The EDGeS project receives Community research funding 1 Specific security needs of Desktop Grids Desktop Grids Desktop Grids EDGeS project EDGeS project.
M1G Introduction to Database Development 6. Building Applications.
MapReduce: Hadoop Implementation. Outline MapReduce overview Applications of MapReduce Hadoop overview.
Introduction to Hadoop and HDFS
Cloud computing for internet emulator. Professor Muthucumaru Maheswaran Team Members Mia Hochar Simon Foucher David El Achkar David El Achkar Marc Atie.
BaBar MC production BaBar MC production software VU (Amsterdam University) A lot of computers EDG testbed (NIKHEF) Jobs Results The simple question:
GraphLab: how I understood it with sample code Aapo Kyrola, Carnegie Mellon Univ. Oct 1, 2009.
NMED 3850 A Advanced Online Design January 12, 2010 V. Mahadevan.
Backdrop Particle Paintings created by artist Tom Kemp September Grid Information and Monitoring System using XML-RPC and Instant.
System Initialization 1)User starts application. 2)Client loads settings. 3)Client loads contact address book. 4)Client displays contact list. 5)Client.
Mainframe (Host) - Communications - User Interface - Business Logic - DBMS - Operating System - Storage (DB Files) Terminal (Display/Keyboard) Terminal.
Giuseppe Codispoti INFN - Bologna Egee User ForumMarch 2th BOSS: the CMS interface for job summission, monitoring and bookkeeping W. Bacchi, P.
Claudio Grandi INFN Bologna CHEP'03 Conference, San Diego March 27th 2003 BOSS: a tool for batch job monitoring and book-keeping Claudio Grandi (INFN Bologna)
Framework of Job Managing for MDC Reconstruction and Data Production Li Teng Zhang Yao Huang Xingtao SDU
Development of the distributed monitoring system for the NICA cluster Ivan Slepov (LHEP, JINR) Mathematical Modeling and Computational Physics Dubna, Russia,
Server to Server Communication Redis as an enabler Orion Free
GLAST Science Support CenterJuly, 2003 LAT Ground Software Workshop Status of the D1 (Event) and D2 (Spacecraft Data) Database Prototypes for DC1 Robert.
Grid Scheduler: Plan & Schedule Adam Arbree Jang Uk In.
Design and Implementation of PARK (PARallel Kernel for data fitting) Paul KIENZLE, Wenwu CHEN and Ziwen FU Reflectometry Group.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
CASTOR evolution Presentation to HEPiX 2003, Vancouver 20/10/2003 Jean-Damien Durand, CERN-IT.
 Registry itself is easy and straightforward in implementation  The objects of registry are actually complicated to store and manage  Objects of Registry.
7 February 2008Dietrich Beck A LabVIEW Interface to ELOG ELOG Possible solution paths Status Integration into – CS ? – Domain Management System?
DATABASE CONNECTIVITY TO MYSQL. Introduction =>A real life application needs to manipulate data stored in a Database. =>A database is a collection of.
EGEE is a project funded by the European Union under contract IST “Interfacing to the gLite Prototype” Andrew Maier / CERN LCG-SC2, 13 August.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
WSV207. Cluster Public Cloud Servers On-Premises Servers Desktop Workstations Application Logic.
Maintenance Practices. Goal  Automate the necessary DBA chores to put organizations on the path of having healthier, consistent and more trustworthy.
Display Page (HTML/CSS)
Master Cluster Manager User Interface (API Level) User Interface (API Level) Query Translator Avro NTA Query Engine NTA Query Engine Job Scheduler Avro.
The Storage Resource Broker and.
Creating Simple and Parallel Data Loads With DTS.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
Distributed Storage Middleware To build a distributed web storage service for small files; To provides RESTFUL interface to access files and directories.
Review of PARK Reflectometry Group 10/31/2007. Outline Goal Hardware target Software infrastructure PARK organization Use cases Park Components. GUI /
MESA A Simple Microarray Data Management Server. General MESA is a prototype web-based database solution for the massive amounts of initial data generated.
A Web Based Job Submission System for a Physics Computing Cluster David Jones IOP Particle Physics 2004 Birmingham 1.
Review of Last Year’s Midterm
What are they? The Package Repository Client is a set of Tcl scripts that are capable of locating, downloading, and installing packages for both Tcl and.
How to connect your DG to EDGeS? Zoltán Farkas, MTA SZTAKI
BOSS: the CMS interface for job summission, monitoring and bookkeeping
BOSS: the CMS interface for job summission, monitoring and bookkeeping
BOSS: the CMS interface for job summission, monitoring and bookkeeping
CERN-Russia Collaboration in CASTOR Development
Replication Middleware for Cloud Based Storage Service
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
Bob Tabor | Microsoft Azure Fundamentals: Data Understanding Microsoft Azure Storage Queues Bob Tabor |
Building a Database on S3
Tiers vs. Layers.
A Scripting Server for Domain Automation Tasks
Web Application Development Using PHP
Presentation transcript:

DUCKS – Distributed User-mode Chirp- Knowledgeable Server Joe Thompson Jay Doyle

Performance Usability Chirp & Condor Chirp & Condor DUCKS Motivation

DUCKS Goals Bring together functionality of Condor and CHIRP in an easy to use package. Abstract Condor and CHIRP interfaces. Intelligently distribute files over CHIRP servers. Provide simple interface for the Chirp Active Storage Program-To-Data model. Provide simple interface for the Condor Data-To- Program model.

MySQL Garbage Collector Garbage Collector Chirp Tracker Chirp Tracker Timeout Handler Timeout Handler Incoming Message Queue Incoming Message Queue Incoming Message Queue Incoming Message Queue Incoming Message Queue Incoming Message Queue Chirp Server List Chirp Server List Transaction List Transaction List Transaction Handler Transaction Handler DUCKS Client DUCKS Client

ducks_put Local MachineDUCKS Server Chirp Node Store_Request 1) Verify the pair is not already in the DB 2) Find a Chirp node with enough free space to store the file 3) Query database for path name to use on the Chirp node Store_Response 4) Parse response to get the Chirp storage location for chirp_put Store_success 5) Update DB to reflect this file storage File

ducks_get Local MachineDUCKS Server Chirp Node Get_Request 1) Query DB for Chirp location of Get_Response 2) Parse response to get the Chirp storage location for chirp_get File

ducks_delete Local MachineDUCKS Server Garbage Collector delete_Request 1) Set the delete_flag of the entry in the DB 2) Periodically query the DB for files with the delete_flag set DUCKS DB … … Result Set 3) Delete files in the list Chirp Node Delete File 4) Remove files from DB

ducks_ls Local MachineDUCKS Server ls_Request 1) Query the DB for all files owned by the user that match “% %”; 2) Start a background thread and pass it the result set containing the found files Result Set 3) Iterate through the set and send the file information to the client File_info ls Thread

Distribution via Chirp Active Storage to nodes already storing input Wrapper/ Submit Scripts input01.txt input02.txt input03.txt input04.txt exe/libs request ducks_get Chirp Nodes Client DUCKS Server Job Request Response with locations of input files

Client Condor exe/libs/input_file request Chirp Nodes DUCKS Server Distribute tasks to any available node Get input and exe/lib files from other nodes in the Chirp cluster with ducks_get

DUCKS Future Work The basic framework is implemented. Add a more robust file interface. Implement DUCKS management of job status information. (Queued, Running, Complete)

?