The Kangaroo Approach to Data Movement on the Grid Douglas Thain, Jim Basney, Se-Chang Son, and Miron Livny

Presentation transcript:


“Half-Interactive” Jobs on the Grid
• The “Grid” is a vision of an international, highly available, robust, shared computation service.
• Grid applications generally...
    ...are half-interactive
    ...are brittle with respect to errors
    ...have weak consistency requirements
• Can we perform on-line data access without subjecting apps to unexpected errors?
[Figure labels: App, Unreliable Network]

The Kangaroo Approach To Data Movement
• Make a third party responsible for executing each application’s I/O operations.
• Use all available resources to hide latencies.
• Never return an error to the application. (Maybe tell the user or scheduler.)
• Benefits: Fault tolerance, higher throughput.
• Cost: Weaker consistency.
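The approach above can be illustrated with a small write-behind spooler. This is a minimal sketch, not the Kangaroo implementation: the class name `SpoolingWriter` and the `deliver` and `report` callbacks are hypothetical, and the retry limit is an arbitrary choice. The application's `write` only queues data, a background thread retries delivery, and a persistent failure is reported to a third party instead of being raised to the application.

```python
import queue
import threading


class SpoolingWriter:
    """Accept writes immediately and deliver them in the background."""

    def __init__(self, deliver, report, max_attempts=5):
        self._deliver = deliver        # hypothetical: sends one block to the destination
        self._report = report          # hypothetical: tells the user or scheduler
        self._max_attempts = max_attempts
        self._queue = queue.Queue()
        worker = threading.Thread(target=self._run, daemon=True)
        worker.start()

    def write(self, offset, data):
        """Queue the block; network trouble is never raised from here."""
        self._queue.put((offset, bytes(data)))
        return len(data)

    def _run(self):
        while True:
            offset, data = self._queue.get()
            for attempt in range(1, self._max_attempts + 1):
                try:
                    self._deliver(offset, data)
                    break
                except OSError as exc:
                    # After the last attempt, report instead of raising.
                    if attempt == self._max_attempts:
                        self._report(offset, exc)
            self._queue.task_done()

    def flush(self):
        """Block until every queued write was delivered or reported."""
        self._queue.join()
```

The cost named on the slide is visible here: between `write` and the eventual delivery, the destination does not yet hold the data, so readers elsewhere may see stale contents.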

The Vision: A Data Grid File System
[Figure labels: App, Disk, File System (repeated), K (repeated), Data Movement System]

Kangaroo Prototype
• An application may contact any node in the system and perform partial-file reads and writes.
• The node may then execute or buffer operations as conditions warrant.
[Figure labels: App, K (three nodes), Buffer, Disk]

User Interface  An interposition agent converts POSIX operations into the Kangaroo protocol. % vi /kangaroo/coral.cs.wisc.edu/etc/hosts % gcc -c /gsiftp/ftp.cs.wisc.edu/test.c -o /kangaroo/coral/tmp/test.o K App POSIX Kangaroo Agent get, putread, write

Macrobenchmark: Image Processing
• Post-processing of satellite image data: need to compute various enhancements and produce output for each.
    Read input image
    For I = 1 to N
        Compute transformation of image
        Write output image
• Example:
    Image size: about 5 MB
    Compute time: about 6 sec
    I/O-to-CPU ratio: about 0.9 MB/s
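The ratio in the example can be checked with simple arithmetic. One output image per compute step gives 5 MB / 6 s, roughly 0.83 MB/s, slightly below the slide's "about 0.9". One possible reading, which is an assumption here and not stated on the slide, is that the single input read is counted as well. The helper `io_cpu_ratio` below is hypothetical, and N = 10 in the usage note is an illustrative value only.

```python
def io_cpu_ratio(image_mb, compute_sec, n_outputs, count_input=True):
    """Megabytes moved per second of CPU time for the loop on the slide."""
    if n_outputs <= 0:
        raise ValueError("n_outputs must be positive")
    # N output images, plus optionally the one input image read up front.
    total_mb = image_mb * (n_outputs + (1 if count_input else 0))
    total_cpu_sec = compute_sec * n_outputs
    return total_mb / total_cpu_sec
```

With `io_cpu_ratio(5, 6, 10)` the input read is included and the result is 55 / 60, close to 0.92 MB/s. With `count_input=False` the result is 5 / 6 regardless of N.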

I/O Models Compared
[Figure: timelines for Online I/O, Offline I/O, and Kangaroo, built from INPUT, CPU, OUTPUT, and PUSH phases, with "CPU Released" and "Task Done" markers on each timeline]
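The difference between "CPU Released" and "Task Done" can be made concrete with a toy timing model. This is a sketch under stated assumptions, not a reconstruction of the slide's figure: all durations are hypothetical parameters, the overlapped case assumes a single background channel that pushes one output at a time, and the cost of writing to the local buffer is ignored. Both function names are hypothetical.

```python
def online_times(n, t_in, t_cpu, t_out):
    """CPU waits for every output, so release and completion coincide."""
    done = t_in + n * (t_cpu + t_out)
    return done, done  # (cpu_released, task_done)


def overlapped_times(n, t_in, t_cpu, t_out):
    """Outputs are pushed in the background while the next step computes."""
    cpu_clock = t_in    # time at which the CPU finishes its current step
    push_clock = 0.0    # time at which the background channel becomes free
    for _ in range(n):
        cpu_clock += t_cpu
        # A push starts once its data exists and the channel is free.
        push_clock = max(push_clock, cpu_clock) + t_out
    return cpu_clock, push_clock  # (cpu_released, task_done)
```

For three steps with `t_in=1`, `t_cpu=6`, `t_out=2`, the online model finishes at 25, while the overlapped model releases the CPU at 19 and completes the task at 21. When output is slower than compute, the CPU is still released early, but the task finishes only when the backlog of pushes drains.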