Legion: The Grid OS Architecture and User View Anand Natrajan ( ) Marty Humphrey ( ) The Legion Project, University.

Slides:



Advertisements
Similar presentations
N ATIONAL P ARTNERSHIP FOR A DVANCED C OMPUTATIONAL I NFRASTRUCTURE Protein Folding Landscapes in a Distributed Environment All Hands Meeting, 2001 University.
Advertisements

USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Web Server Hardware and Software
Network+ Guide to Networks, Fourth Edition Chapter 10 Netware-Based Networking.
Distributed Object Computing Weilie Yi Dec 4, 2001.
October, 2003 – Linkoping, Sweden Andrew Grimshaw Department of Computer Science, Virginia CTO & Founder Avaki Corporation From Clusters to Grids.
Legion Worldwide virtual computer. About Legion Made in University of Virginia Object-based metasystems software project middleware that connects computer.
Asynchronous Solution Appendix Eleven. Training Manual Asynchronous Solution August 26, 2005 Inventory # A11-2 Chapter Overview In this chapter,
Xuan Guo Chapter 1 What is UNIX? Graham Glass and King Ables, UNIX for Programmers and Users, Third Edition, Pearson Prentice Hall, 2003 Original Notes.
Network File System (NFS) in AIX System COSC513 Operation Systems Instructor: Prof. Anvari Yuan Ma SID:
AN INTRODUCTION TO LINUX OPERATING SYSTEM Zihui Han.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
What is Unix Prepared by Dr. Bahjat Qazzaz. What is Unix UNIX is a computer operating system. An operating system is the program that – controls all the.
Windows interoperability with Unix/Linux. Introduction to Active Directory Integration for Unix and Linux Systems Unix/Linux interoperability components.
Web Servers Web server software is a product that works with the operating system The server computer can run more than one software product such as .
What is UNIX? UNIX is an Operating System (OS). An operating system is a control program that helps the user communicate with the computer hardware. UNIX.
Parallel Computing The Bad News –Hardware is not getting faster fast enough –Too many architectures –Existing architectures are too specific –Programs.
Network Operating Systems : Tasks and Examples Instructor: Dr. Najla Al-Nabhan
Chapter 10 – UNIX. History In late 1960s, two employees of Bell Labs (Ken Thompson & Dennis Ritchie) designed a new operating system to overcome the constraints.
GridFS Targeting Data Sharing in Grid Environments Marcelo Nery dos Santos / Renato Cerqueira PUC-Rio, Brazil Presented by: Francisco Silva.
1 Guide to Novell NetWare 6.0 Network Administration Chapter 13.
CS240 Computer Science II Introduction to Unix Based on “UNIX for Programmers and Users” by G.Class and K. Ables.
University of Virginia Experiences with NMI at the University of Virginia NMI Integration Testbed: Experiences in Middleware Deployment Spring 2003 Internet2.
Tools and Utilities for parallel and serial codes in ENEA-GRID environment CRESCO Project: Salvatore Raia SubProject I.2 C.R. ENEA-Portici. 11/12/2007.
Grid Andrew Grimshaw September, What is a Grid System? A Grid system is a collection of distributed resources connected by a network. Examples of.
Database-Driven Web Sites, Second Edition1 Chapter 5 WEB SERVERS.
Chapter 10 Netware-Based Networking Network+ Guide to Networks, Fourth Edition.
Network+ Guide to Networks, Fourth Edition Chapter 10 Netware-Based Networking.
CS 390 Unix Programming Summer Unix Programming - CS 3902 Course Details Online Information Please check.
1 The Roadmap to New Releases Todd Tannenbaum Department of Computer Sciences University of Wisconsin-Madison
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 3: Operating-System Structures System Components Operating System Services.
1 Introduction to Microsoft Windows 2000 Windows 2000 Overview Windows 2000 Architecture Overview Windows 2000 Directory Services Overview Logging On to.
How to create DNS rule that allow internal network clients DNS access Right click on Firewall Policy ->New- >Access Rule Right click on Firewall.
Grid Chemistry System Architecture Overview Akylbek Zhumabayev.
GCRC Meeting 2004 Introduction to the Grid and Security Philip Papadopoulos.
The Roadmap to New Releases Derek Wright Computer Sciences Department University of Wisconsin-Madison
Copyright © cs-tutorial.com. Overview Introduction Architecture Implementation Evaluation.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
1 Linux Networking and Security Chapter 5. 2 Configuring File Sharing Services Configure an FTP server for anonymous or regular users Set up NFS file.
1 Alexandru V Staicu 1, Jacek R. Radzikowski 1 Kris Gaj 1, Nikitas Alexandridis 2, Tarek El-Ghazawi 2 1 George Mason University 2 George Washington University.
UNICOS. When it comes to solving real-world problems, leading-edge hardware is only part of the solution. A complete solution also requires a powerful.
Studying Protein Folding on the Grid: Experiences Using CHARMM on NPACI Resources under Legion University of Virginia Anand Natrajan Marty A. Humphrey.
Legion - A Grid OS. Object Model Everything is object Core objects - processing resource– host object - stable storage - vault object - definition of.
Shell Interface Shell Interface Functions Data. Graphical Interface Graphical Interface Command-line Interface Command-line Interface Experiments Private.
Course ILT Unit objectives Describe the network clients that are available to connect DOS-, Windows-, and Macintosh-based computers to a network Network.
Basic UNIX Concepts. Why We Need an Operating System (OS) OS interacts with hardware and manages programs. A safe environment for programs to run is required.
Capacity and Capability Computing using Legion Anand Natrajan ( ) The Legion Project, University of Virginia (
AMH001 (acmse03.ppt - 03/7/03) REMOTE++: A Script for Automatic Remote Distribution of Programs on Windows Computers Ashley Hopkins Department of Computer.
Introduction to UNIX CS465. What is UNIX? (1) UNIX is an Operating System (OS). An operating system is a control program that allocates the computer's.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
Chapter 1 Basic Concepts of Operating Systems Introduction Software A program is a sequence of instructions that enables the computer to carry.
LSF Universus By Robert Stober Systems Engineer Platform Computing, Inc.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
HNC COMPUTING - Network Concepts 1 Network Concepts Network Concepts Network Operating Systems Network Operating Systems.
CEG 2400 FALL 2012 Linux/UNIX Network Operating Systems.
User Interface UI TP: UI User Interface installation & configuration.
1 Chapter 2: Operating-System Structures Services Interface provided to users & programmers –System calls (programmer access) –User level access to system.
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
LINUX An Introduction Wan Ahmad Tajuddin Wan Abdullah Universiti Malaya KL ACGRID-II 2009.
Computer System Structures
Windows interoperability with Unix/Linux
Operating System & Application Software
File System Implementation
Peter Kacsuk – Sipos Gergely MTA SZTAKI
Chapter 2: System Structures
Chapter 1 What is UNIX? Graham Glass and King Ables,
STATEL an easy way to transfer data
Presentation transcript:

Legion: The Grid OS Architecture and User View Anand Natrajan ( ) Marty Humphrey ( ) The Legion Project, University of Virginia ( )

Grid Environment  Computers  Networks  People  Data  Devices  Disjoint file systems  Disjoint namespaces  Multiple administration domains  Unpredictable load, availability, failures  Security problems

Grid OS Requirements Wide-area High Performance Complexity Management Extensibility Security Site Autonomy Input / Output Heterogeneity Fault-tolerance Scalability Simplicity Single Namespace Resource Management Platform Independence Multi-language Legacy Support

Legion - A Grid OS

Tools MPI / PVM P-space studies - multi-run Parallel C++ Parallel object-based Fortran CORBA binding Object migration Accounting Remote builds and compilations Fault-tolerant MPI libraries Post-mortem debugger Console objects Parallel 2D file objects Collections Licence support

Commercial Support - Avaki Corp. Mentat Legion Avaki Web Venture funded Headquartered in Boston Growing number of employees Multi-tiered support offering

Protein Folding with CHARMM Molecular Dynamics Simulations structures to sample (r,R gyr ) space R gyr 

IBM Blue Horizon SDSC 375MHz Power3 512/1184 IBM Blue Horizon SDSC 375MHz Power3 512/1184 Resources Available HP V-class CalTech 440 MHz PA /128 HP V-class CalTech 440 MHz PA /128 IBM SP3 UMich 375MHz Power3 24/24 IBM SP3 UMich 375MHz Power3 24/24 IBM Azure UTexas 160MHz Power2 32/64 IBM Azure UTexas 160MHz Power2 32/64 Sun HPC SDSC 400MHz SMP 32/64 Sun HPC SDSC 400MHz SMP 32/64 DEC Alpha UVa 533MHz EV56 32/128 DEC Alpha UVa 533MHz EV56 32/128

Transparent Remote Execution User initiates “run” User/Legion selects site Legion copies binaries Legion copies input files Legion starts job(s) Legion monitors progress Legion copies output files

Mechanics of CHARMM Runs Legion Register binaries Create task directories & specification Dispatch runs Dispatch more runs

Types Of Applications Legacy applications Legion-aware applications –I/O library –2D file object Applications Using Stdgrid Parameter Space Studies Parallel Programs –MPI, PVM, MPL, Basic Fortran Support (BFS)

Grid Application Requirements Security Fault-tolerance Heterogeneity Collaboration … Legion supports these and other needs

Heterogeneous Runs BT-Med Ocean Model

Cross-Organisation Collaboration Different companies Proprietary simulations and data Each needs the other Form virtual partnership

Platforms Windows NT, 2K, 98, 95 Sun (Solaris) SGI (Irix, Origin) Intel (Linux, Free BSD) DEC (Unix, Linux) Cray (T90, T3E) IBM (AIX, SP-2) HP (HPUX) Codine LoadLeveler Maui PBS NQS LSF

Applications Biochemistry and Molecular Science Information Retrieval Materials Science Climate Modelling Neuroscience Aerospace Astronomy Graphics NPACI - SDSC, UCSD, Caltech, UTexas, Umich, UCB, UVa. DoD MSRCs - NAVO & ARL, NASA Ames

User View Command-Line Interface

Setup Setup shell environment variables. ~legion/setup.sh OR export LEGION=/home/legion/Legion export LEGION_OPR=/home/maya/OPR. $LEGION/bin/legion_env.sh Specifies where binaries and configuration files can be found Sets root context

Login Authentication to system legion_login /users/stephen Currently uses password - other mechanisms, e.g., Kerberos ticket possible Login object (a.k.a. Authentication object) - /users/stephen - is user’s proxy to world Login object generates certificate identifying user

Context Space / hostsusershome mach1mach2youmemydir progfile1ttysubdir Unix-like legion_ls legion_pwd legion_cd legion_cat...

Context Space Network-wide, transparent file system Location-independent read/write of files Convenient transfer of files between context space and local file system I/O libraries for access Unix-like utilities

Context Example legion_ls /

Another Context legion_ls /hosts

Yet Another Context legion_ls /users

More Context Fun

Other Context Commands Locate a LOID in context space legion_list_names Locate an object on a machine legion_whereis Find status of an object legion_object_info List metadata of an object legion_list_attributes

Status Of An Object legion_object_info -c work

Physical Location Of Object legion_whereis -c work

Context Space vs. Local Space Local space = your machine’s directory structure –OS-specific, Machine-specific –Use cp, copy, etc. –e.g., C:\Program Files\, /usr/bin, /mnt/disk1 Context space = Legion’s directory structure –OS-independent, Machine-independent –Use legion_cp, etc.

Context Space and Local Space Transfer one file from local space to context space legion_cp -localsrc Transfer one file from context space to local space legion_cp -localdest

Context Space and Local Space Copying local directory to context space legion_cp -r -localsrc OR legion_import_tree Copying context directory to local space legion_cp -r -localdest

Context Space and Local Space Map (not copy!) local directory to context space temporarily legion_export_dir Does NOT make copy of local directory Merely provides Legion-like access to local directory –Use legion_cat on local files

Making Context Space… Local sub-directory with Legion NFS daemon –Use cat on context files FTP directory with FTP interface Windows directory with Samba interface URL tree with HTTP interface

I/O Performance –X-Axis = number of clients simultaneously performing 1MB reads on 10MB files –Y-Axis = total read bandwidth –Each point = average of multiple runs –Clients = 400MHz Intels, NFS Server = 800MHz Intel

Making Context Space… Local sub-directory with Legion NFS daemon –Use cat on context files FTP directory with FTP interface Windows directory with Samba interface URL tree with HTTP interface

Flexible Context Space Context Directory Disk e ftp legion_export_dir legion_import_tree Samba NFSHTTP FTP

Access Control MayI for each object implements access control on a per-function basis Users named by login object Sets of users grouped by contexts legion_change_permissions [+-rwx] [-v] legion_change_permissions +r /users/fred /home/grimshaw/myfile

Access Control Example

Unified Console Prog. File TTY User creates tty object User starts running program Legion passes tty LOID to program Program produces stdout, stderr User shares tty LOID User shares tty LOID

TTY Object Redirect run-time output to central (or multiple) consoles Connect and disconnect dynamically Debug quickly and simply Monitor status, errors, easily Share console with others legion_tty

User View Web Interface

Logging In

Listing Contents Of A Context

Control Window

Status Window

StdOut Window

StdErr Window

Listing Classes (Contents of /class)

Listing Hosts (Contents of /hosts)

List Attributes Of An Object

Start A Run

Check The Status Of A Job

Start An Amber (BioGrid) Run

Check The Status Of An Amber Run

Graphically Check An Amber Run

Interact With Amber Run

Start A Hawley-Hydro Run

Check The Status Of A Hydro Run

Graphically Check A Hydro Run

Run RenderGrid Jobs (P-Space Jobs)

Check The Status of A RenderGrid Job

Check Accounting Logs

User View Windows Interface

Windows Browser

Context Space in Windows  Ability to export local directories into Legion’s context space  Easy-to-use interface  Ability of users to control when shared directories are visible to other users

Access Control  Ability of users to specify access control policies  Fine-grained nature of policies  Allow/Deny read access to users or groups  Allow/Deny write access to users or groups  Ease with which access rights can be changed  Speed at which access rights are propagated through Legion space

Windows Legion FTP Daemon

Windows Job Sandbox

Windows Process Control

National Legion Net

Summary Philosophy –Grid as a Single Virtual Machine –Provide mechanisms; let others build policies Architecture –Object-based, integrated –Default policies for scheduling, security, … User Interfaces –Command-line, Web, Windows, FTP, HTTP, …

Future Directions Improved user interfaces More robust system Research activities - University of Virginia Commercial activities - Avaki Corporation Legion-G? Continued Continued support for nationwide grid, grid applications