MyGrid: A User-Centric Approach for Grid Computing Walfredo Cirne Universidade Federal da Paraíba.

Slides:



Advertisements
Similar presentations
CSF4 Meta-Scheduler Tutorial 1st PRAGMA Institute Zhaohui Ding or
Advertisements

1 Bogotá, EELA-2 1 st Conference, On the Co-existence of Service and Opportunistic Grids Francisco Brasileiro Universidade Federal.
University of Southampton Electronics and Computer Science M-grid: Using Ubiquitous Web Technologies to create a Computational Grid Robert John Walters.
PlanetLab What is PlanetLab? A group of computers available as a testbed for computer networking and distributed systems research.
CERN LCG Overview & Scaling challenges David Smith For LCG Deployment Group CERN HEPiX 2003, Vancouver.
Setting up of condor scheduler on computing cluster Raman Sehgal NPD-BARC.
The OurGrid Project Walfredo Cirne Universidade Federal de Campina Grande.
The OurGrid Project Walfredo Cirne Universidade Federal de Campina Grande.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
City University London
Minimum intrusion GRID. Build one to throw away … So, in a given time frame, plan to achieve something worthwhile in half the time, throw it away, then.
Running Thor over MyGrid Walfredo Cirne Universidade Federal de Campina Grande.
1 CIS450/IMSE450/ECE478 Operating Systems Winter 2003 Professor Jinhua Guo.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
1 GRID D. Royo, O. Ardaiz, L. Díaz de Cerio, R. Meseguer, A. Gallardo, K. Sanjeevan Computer Architecture Department Universitat Politècnica de Catalunya.
Labs of The World, Unite!!! Walfredo Cirne Universidade Federal de Campina Grande.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Parallel Computing The Bad News –Hardware is not getting faster fast enough –Too many architectures –Existing architectures are too specific –Programs.
STRATEGIES INVOLVED IN REMOTE COMPUTATION
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
 Introduction to Operating System Introduction to Operating System  Types Of An Operating System Types Of An Operating System  Single User Single User.
A Distributed Computing System Based on BOINC September - CHEP 2004 Pedro Andrade António Amorim Jaime Villate.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
E-science grid facility for Europe and Latin America OurGrid E2GRIS1 Rafael Silva Universidade Federal de Campina.
Job Submission Condor, Globus, Java CoG Kit Young Suk Moon.
TRASC Globus Application Launcher VPAC Development Team Sudarshan Ramachandran.
Computational grids and grids projects DSS,
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
Loosely Coupled Parallelism: Clusters. Context We have studied older archictures for loosely coupled parallelism, such as mesh’s, hypercubes etc, which.
Alexandre Duarte Gustavo Wagner Francisco Brasileiro Walfredo Cirne Multi-Environment Software Testing on the Grid Universidade Federal de Campina Grande.
Application Layer Khondaker Abdullah-Al-Mamun Lecturer, CSE Instructor, CNAP AUST.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Introduction to Grid Computing Ed Seidel Max Planck Institute for Gravitational Physics
1 Bogotá, EELA-2 1 st Conference, The OurGrid Approach for Opportunistic Grid Computing Francisco Brasileiro Universidade Federal.
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
Silberschatz and Galvin  Operating System Concepts Module 1: Introduction What is an operating system? Simple Batch Systems Multiprogramming.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 14 February 2006.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
What is virtualization? virtualization is a broad term that refers to the abstraction of computer resources in order to work with the computer’s complexity.
© GCSE Computing Computing Hardware Starter. Creating a spreadsheet to demonstrate the size of memory. 1 byte = 1 character or about 1 pixel of information.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
COMP381 by M. Hamdi 1 Clusters: Networks of WS/PC.
HPC HPC-5 Systems Integration High Performance Computing 1 Application Resilience: Making Progress in Spite of Failure Nathan A. DeBardeleben and John.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
LSF Universus By Robert Stober Systems Engineer Platform Computing, Inc.
3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-1.
1.1 Sandeep TayalCSE Department MAIT 1: Introduction What is an operating system? Simple Batch Systems Multiprogramming Batched Systems Time-Sharing Systems.
Introduction to Makeflow and Work Queue Nicholas Hazekamp and Ben Tovar University of Notre Dame XSEDE 15.
UNIX U.Y: 1435/1436 H Operating System Concept. What is an Operating System?  The operating system (OS) is the program which starts up when you turn.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Advanced Network Administration Computer Clusters.
GridOS: Operating System Services for Grid Architectures
Applied Operating System Concepts
Deploying Research in the Real World: The OurGrid Experience
Clouds , Grids and Clusters
Example: Rapid Atmospheric Modeling System, ColoState U
GWE Core Grid Wizard Enterprise (
Grid Computing.
Grid Computing Colton Lewis.
Chapter 16: Distributed System Structures
Unit 27: Network Operating Systems
Operating System Concepts
Introduction to Operating Systems
Introduction to Operating Systems
Chapter 1: Introduction
Operating System Concepts
Presentation transcript:

MyGrid: A User-Centric Approach for Grid Computing Walfredo Cirne Universidade Federal da Paraíba

High-Performance Computing High-Performance Computing means running faster than the typical machine du jour Unbeatable price/performance of microprocessors has killed specialized high-performance machines Therefore, paralelism currently is the way to do High-Performance Computing –Parallel supercomputers

Solving a Real Problem I had hundreds of thousands of independent simulations to run Parallel supercomputers are typically –hard to get acess to –slow (too much time in the queue) Since my simulations were independent, I had the perfect application for the Computational Grid

Grid Computing Grid Computing aims to enable the execution of parallel applications over processors that are: –Geographically distributed –Under multiple administrative domains –Not dedicated The potential for resource gathering is enormous –“Let´s run over the Internet”

Grid Applications Not all applications can benefit from the Grid Loosely coupled applications match the Grid characteristics much better than tightly coupled applications

State of Art in Grid Computing Most services are provided by the Grid Infrastructure –Naming, remote execution/task control, security, etc Scheduling is done at the application level Globus “Virtual Organizations”

Back to the Real Problem I had hundreds of thousands of independent simulations to run I was working in a top research lab in Grid Computing I could not manage to use the Grid It is hard to get the Grid Infrastructure Software installed everywhere

The Motivation for MyGrid Users of loosely coupled applications could benefit from the Grid now However, they don´t run on the Grid today because the Grid Infrastructure is not widely deployed What if we build a solution at the user level? That is, a solution that does not depend upon installed infrastructure?

MyGrid MyGrid is a framework to build infrastructure-independent grid applications The user provides: –A description of her Grid –A way to do remote execution and file transfer –“The application” MyGrid provides: –Grid abstractions –Scheduling

MyGrid Goals open = do not require a particular infrastructure self-installable = do not require manual installation on a given machine extensible = simple to add refinements complete = cover the whole production cycle

MyGrid Concepts Job = set of independent tasks –Tasks have three pieces: init, remote and final Home machine  Grid machine Grid abstractions –remote execution –file transfer –playpen –mirroring

Defining My Personal Grid bagre.dsc.ufpb.br dsc, linux ssh %machine %command scp %localdir/%file %machine:%remotedir scp %machine:%remotedir/%file %localdir traira.dsc.ufpb.br dsc, linux ssh %machine %command scp %localdir/%file %machine:%remotedir scp %machine:%remotedir/%file %localdir quidam.ucsd.edu cse, linux ssh %machine %command scp %localdir/%file %machine:%remotedir scp %machine:%remotedir/%file %localdir

Fatoring with MyGrid Fatora n gerates tasks, init, remotei, and collect User runs mygrid.ui.AddTask < tasks tasks task: init= init remote= remote1 final= collect processor= linux playpensize= 0 cost = 1 task: init= init remote= remote2 …

Fatoring with MyGrid init java mygrid.ui.MyGridUI p $PROC./Fat.class $PLAYPEN remote1 java Fat output-$TASK remote2 java Fat output-$TASK collect java mygrid.ui.MyGridUI g $PROC "" $PLAYPEN saida-$TASK.

Running an MyGrid Task (3c) (3b) task-done (4) remote exec (3) playpen, file xfer, and remote exec (3a) (2) add-task (1) Home Machine Grid Machine Task Manager User Agent Server home stasks User Agent Daemon grid stask

User Agent User Agent provides the grid abstractions User Agent Daemon runs on grid machines User Agent Server runs on home machines The Daemon and the Server rely upon public-key cryptography to authenticate each other

Self Instalation We are working on having MyGrid install and start-up User Agents everywere The user provides a way to do remote execution and file transfer to make that possible

Scheduling in MyGrid Grid scheduling is application dependent and effort intensive Most people don´t want to spend months to write good schedulers for their applications MyGrid provides a sensible default scheduler –The user can of course replace the default scheduler

Default Scheduler How to provide good performance with no knowledge about the application or the current state of the Grid –The key is to avoid having the job waiting for a task that runs in a slow/loaded machine Task replication is our answer for this problem –Task replication is only done when the jobs has no other tasks

Preliminary Results During a 40-day period, we ran 600,000 simulations using 178 processors located in 6 different administrative domains widely spread in the USA MyGrid took 16.7 days to run the simulations My desktop machine would have taken 5.3 years to do so Speed-up is for 178 processors

Conclusions Running Grid Applications at the user-level is a viable strategy Bag-of-tasks parallel applications can currently benefit from the Grid Is “upperware” the way to go for new middleware development?

Future Work Turn MyGrid into a production-quality software Investigate the impact of task replication in resource consumption Develop a default scheduler for data intensive applications –Such a scheduler should try to minimize data movement