Minimum intrusion GRID. Build one to throw away … So, in a given time frame, plan to achieve something worthwhile in half the time, throw it away, then.

Slides:



Advertisements
Similar presentations
Building Portals to access Grid Middleware National Technical University of Athens Konstantinos Dolkas, On behalf of Andreas Menychtas.
Advertisements

GXP in nutshell You can send jobs (Unix shell command line) to many machines, very fast Very small prerequisites –Each node has python (ver or later)
Overview of local security issues in Campus Grid environments Bruce Beckles University of Cambridge Computing Service.
Creating HIPAA-Compliant Medical Data Applications with Amazon Web Services Presented by, Tulika Srivastava Purdue University.
Grid Resource Allocation Management (GRAM) GRAM provides the user to access the grid in order to run, terminate and monitor jobs remotely. The job request.
Chapter 7 Advantages and Disadvantages of Web Services
Distributed components
State Machine Replication Project Presentation Ido Zachevsky Marat Radan Supervisor: Ittay Eyal Winter Semester 2010.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Minimum intrusion GRID. Build one to throw away … So, in a given time frame, plan to achieve something worthwhile in half the time, throw it away, then.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 1, pp For educational use only.
OxGrid, A Campus Grid for the University of Oxford Dr. David Wallom.
Software Frameworks for Acquisition and Control European PhD – 2009 Horácio Fernandes.
MiG Projects DM75. Runtime Environment DB Problem: –Many Grid jobs assume that the binaries and libraries are already available –Even if this is the case.
Scaling Service Requests Linux: ipvsadm & iptoip.
Workload Management Massimo Sgaravatto INFN Padova.
How Clients and Servers Work Together. Objectives Learn about the interaction of clients and servers Explore the features and functions of Web servers.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Centralized Systems Run on a single computer system and do not interact with other computer.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
Distributed Databases
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
FTP File Transfer Protocol. Introduction transfer file to/from remote host client/server model  client: side that initiates transfer (either to/from.
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
Portals and Credentials David Groep Physics Data Processing group NIKHEF.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
The material in this presentation is the property of Fair Isaac Corporation. This material has been provided for the recipient only, and shall not be used,
Selected Topics in Software Computing Distributed Software Development CVSQL Final Project Presentation.
Riccardo Bruno INFN.CT Sevilla, Sep 2007 The GENIUS Grid portal.
ILDG5QCDgrid1 QCDgrid status report UKQCD data grid Chris Maynard.
DISTRIBUTED COMPUTING
Presented by Xiaoyu Qin Virtualized Access Control & Firewall Virtualization.
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
DOSAR Workshop, Sao Paulo, Brazil, September 16-17, 2005 LCG Tier 2 and DOSAR Pat Skubic OU.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Database Architectures Database System Architectures Considerations – Data storage: Where do the data and DBMS reside? – Processing: Where.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Data Communications and Computer Networks Chapter 2 CS 3830 Lecture 8 Omar Meqdadi Department of Computer Science and Software Engineering University of.
240-Current Research Easily Extensible Systems, Octave, Input Formats, SOA.
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Getting started DIRAC Project. Outline  DIRAC information system  Documentation sources  DIRAC users and groups  Registration with DIRAC  Getting.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
SKYPIAX, how to add Skype capabilities to FreeSWITCH (and Asterisk) CHICAGO, USA, September 2009.
Portal Update Plan Ashok Adiga (512)
Introduction to Taverna Online and Interaction service Aleksandra Pawlik University of Manchester.
Using the ARCS Grid and Compute Cloud Jim McGovern.
VO Box Issues Summary of concerns expressed following publication of Jeff’s slides Ian Bird GDB, Bologna, 12 Oct 2005 (not necessarily the opinion of)
Lecture 4 Page 1 CS 111 Online Modularity and Virtualization CS 111 On-Line MS Program Operating Systems Peter Reiher.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Portals, Services, Interfaces Marlon Pierce Indiana University March 15, 2002.
USGS GRID Exploratory Status Review Stuart Doescher Mike Neiers USGS/EDC May
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
Open Map Yamama Dagash & Haitham Khateeb under the supervision of: Benny Daon & Eyal Levin Open Map.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
DGAS Distributed Grid Accounting System INFN Workshop /05/1009, Palau Giuseppe Patania Andrea Guarise 6/18/20161.
BIG DATA/ Hadoop Interview Questions.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
By Manish Shrotriya CSE MS Software Programs Shrink Wrap Software : Software that one can buy off the shelf and can install on his computer. They.
Antonio Fuentes RedIRIS Barcelona, 15 Abril 2008 The GENIUS Grid portal.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Contents Software components All users in one location:
Web Development Web Servers.
Lecture 1: Multi-tier Architecture Overview
Production Manager Tools (New Architecture)
Presentation transcript:

Minimum intrusion GRID

Build one to throw away … So, in a given time frame, plan to achieve something worthwhile in half the time, throw it away, then rebuild what has to be a better version, given the advantages of hindsight.

MiG Minimum intrusion GRID This really is the philosophy –We want to develop a GRID middleware that makes as few requirements as possible

What’s wrong with the GLOBUS model? Single point of failure –There may be redundancy –But algorithmically these collapse onto one component –We need a model where liveliness scales with the system-size

What’s wrong with the GLOBUS model? Lack of scheduling –Parallel submissions end up on the same resource –No means of addressing changes to timing that originates from the local queuing system –We need real scheduling with online properties

What’s wrong with the GLOBUS model? Poor scalability –Paradox: The more resources Grid has the faster your job can be completed – but the time to submit your job grows linearly –If you assume that the number of users grow with the size of the Grid, the users actually looses from larger Grid’s

What’s wrong with the GLOBUS model? No means of implementing privacy –Users need to authenticate themselves to the resource –Users contact the resources directly –Some users are concerned with exposing what they are working on –We need to allow anonymity

What’s wrong with the GLOBUS model? No means of utilizing ‘cycle-scavenging’ –The Globus model expect the machine to be dedicated –And be available for a predefined period –Condor + NorduGrid do address this But not with deadlines –We would like a model that uses cycle- scavenging but with deadlines

What’s wrong with the GLOBUS model? Require a very large installation on each resource and on the user site –The resource software is huge –The client software is pretty large Though ARC-lib limits the client side enormously –We need a model that require no software to be installed on either resource or client side

What’s wrong with the GLOBUS model? Firewall dependency –Globus requires a number of ports to be opened in the firewall 2135, , –System administrators are very reluctant to do this –We need a setup that exists within the confines of a standard setup

What’s wrong with the GLOBUS model? Highly bloated middleware –The middleware has become huge –There are large amounts of dead code –Much replicated code –We need middleware that is more compact and maintainable

What’s wrong with the GLOBUS model? Complex implementation using multiple languages –Mutiple languages are used in Globus-2 –None (few) seems to be well chosen Much C code – but the functions are not system related or performance limited Some Perl code – actually well chosen for the functions but unreadable and hard to maintain

MiG Idea MiG should address all the previous concerns GRID should be a system –not just a protocol between sites By having a set of servers that are GRID we get full control of GRID –we upgrade all machines at the same time Users and resources do not have to maintain anything

The desired user view of MiG MiG should be a virtual computer to the user Files are stored within MiG – each user has his own home-directory –In job descriptions files are relative to the users home-directory Eventually we’d like interactive application support Economically MiG should be fair

The desired resource view of MiG MiG is just another user –or set of users Economically MiG should be fair

MiG Rules Nothing produced by MiG can be required to be installed on either the resource or client end Everything within MiG must be implemented in Python unless another language is absolutely required Any design and implementation decision must optimize towards transparency for the users Anything that is not right must be thrown away

GRID User Resource The abstract MiG model

MiG requirements - User Users should depend on no installed SW –A web-browser should be enough In addition to a browser the users need only a certificate –A scriptable user-interface is also available – but not required

MiG requirements - Resource Resources should only create a MiG user As few requirements to the resource as possible –support ssh inbound –support https outbound The system administrator also needs a certificate to register the resource

The simple MiG model The user sends a job to the GRID machine The GRID submits the job to a free resource –When one is available –Otherwise the job will be stored The users can then retrieve the result

GRID User Resource The simple MiG model

Considering the simple model Single point of failure –and bottleneck Solution –add more GRID machines

GRID Resource User The full MiG model

Considering the full model How do we survive the crash of a GRID machine? –make redundant copies of jobs and results How do we schedule on multiple GRID servers? –Envy based load-balancing How is the user informed of GRID machines? –Users automatically get new hostlist when contacting GRID –But he only needs to access one

Future work Shared data-structures for MiG Fault-tolerance Load balancing Accounting and Job Pricing User defined scheduling Grid File System Occam like job description language

Future work Interfacing with other Grid Implementations Programmers API Stand-alone http server in Python Monitoring Statistics Application User interfaces Graphics rendering on GRID Command line interface Screensaver model

MiG and DCGC MiG is available to DCGC – naturally We would like more systems to join as we mature –MiG is non intrusive so it will co-exist with any other Grid system as long as ordinary users are supported MiG is already being used for our “bio-user- case” Anybody within DCGC is very welcome to join the MiG effort

Next The simple model – Implementation and status –Henrik MiG Remote File Access –Rasmus Economics and Loadbalancing –Jonas