Grids and Globus at BNL Presented by John Scott Leita.

Slides:



Advertisements
Similar presentations
Data Management Expert Panel - WP2. WP2 Overview.
Advertisements

The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Grid Resource Allocation Management (GRAM) GRAM provides the user to access the grid in order to run, terminate and monitor jobs remotely. The job request.
CERN LCG Overview & Scaling challenges David Smith For LCG Deployment Group CERN HEPiX 2003, Vancouver.
High Performance Computing Course Notes Grid Computing.
Globus Overview CS Lecture Notes In courtesy of Andy Kowalski.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
Status of Globus activities within INFN Massimo Sgaravatto INFN Padova for the INFN Globus group
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
Application architectures
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Globus activities within INFN Massimo Sgaravatto INFN Padova for the INFN Globus group
Grid Services at NERSC Shreyas Cholia Open Software and Programming Group, NERSC NERSC User Group Meeting September 17, 2007.
Grid Computing, B. Wilkinson, 20046c.1 Globus III - Information Services.
1 GRID D. Royo, O. Ardaiz, L. Díaz de Cerio, R. Meseguer, A. Gallardo, K. Sanjeevan Computer Architecture Department Universitat Politècnica de Catalunya.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
INFN-GRID Globus evaluation (WP 1) Massimo Sgaravatto INFN Padova for the INFN Globus group
Introduction to NorduGrid ARC / Arto Teräs Slide 1(16) Introduction to NorduGrid ARC Arto Teräs Free and Open Source Software Developers' Meeting.
Grid Computing - AAU 14/ Grid Computing Josva Kleist Danish Center for Grid Computing
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Computational grids and grids projects DSS,
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
COMP3019 Coursework: Introduction to GridSAM Steve Crouch School of Electronics and Computer Science.
GridFE: Web-accessible Grid System Front End Jared Yanovich, PSC Robert Budden, PSC.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
The Anatomy of the Grid: An Integrated View of Grid Architecture Ian Foster, Steve Tuecke Argonne National Laboratory The University of Chicago Carl Kesselman.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Andrew McNabETF Firewall Meeting, NeSC, 5 Nov 2002Slide 1 Firewall issues for Globus 2 and EDG Andrew McNab High Energy Physics University of Manchester.
Production Tools in ATLAS RWL Jones GridPP EB 24 th June 2003.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
© 2007 UC Regents1 Track 1: Cluster and Grid Computing NBCR Summer Institute Session 1.1: Introduction to Cluster and Grid Computing July 31, 2007 Wilfred.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
MTA SZTAKI Hungarian Academy of Sciences Introduction to Grid portals Gergely Sipos
Globus Toolkit Massimo Sgaravatto INFN Padova. Massimo Sgaravatto Introduction Grid Services: LHC regional centres need distributed computing Analyze.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
INFSO-RI Enabling Grids for E-sciencE OSG-LCG Interoperability Activity Author: Laurence Field (CERN)
Creating and running an application.
High-Performance Computing Lab Overview: Job Submission in EDG & Globus November 2002 Wei Xing.
Introduction to Grid Computing and its components.
Globus Grid Tutorial Part 2: Running Programs Across Multiple Resources.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid2Win : gLite for Microsoft Windows Roberto.
Globus: A Report. Introduction What is Globus? Need for Globus. Goal of Globus Approach used by Globus: –Develop High level tools and basic technologies.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
The Globus Toolkit The Globus project was started by Ian Foster and Carl Kesselman from Argonne National Labs and USC respectively. The Globus toolkit.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA Grid2Win : gLite for Microsoft Windows Elisa Ingrà - INFN.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Duncan MacMichael & Galen Deal CSS 534 – Autumn 2016
Peter Kacsuk – Sipos Gergely MTA SZTAKI
Presentation transcript:

Grids and Globus at BNL Presented by John Scott Leita

Grid Basics What is a grid? What use is the grid in terms of computing? What are some grid requirements? How are these requirements met?

A grid is a bunch of nodes interconnected via a network. It includes a set of standards to make its use possible. Think of the US electric power Grid as an example. There are nodes that supply various amounts of power and nodes that use that power. In order for this to work the grid must include standards such as the 120V 60Hz signal that we all use. What is a grid?

Grid Basics What is a grid? What use is the grid in terms of computing? What are some grid requirements? How are these requirements met?

What use is the grid in terms of computing? The Grid is used to enforce standards and to offer libraries (APIs) that simplify writing grid compliant software. This levels the playing field so all nodes can use and offer services over a heterogeneous network.

Grid Basics What is a grid? What use is the grid in terms of computing? What are some grid requirements? How are these requirements met?

Resources – Large storage (HPSS), local drives, CPUs (Farms, Clusters, Supercomputers, PCs), queues (LSF). Data –Files and databases with storage elements abstracted. Jobs – Users should be able to submit various jobs to be executed. These jobs can include things like executables, parallel programs (MPI), batch jobs, SQL queries, and shell scripts. Much like like data the resources should be abstracted as much as the user would like. Security – Anyone who uses the grid should be authenticated and checked for authorization for requested resources or data access. What use is the grid in terms of computing?

Grid Basics What is a grid? What use is the grid in terms of computing? What are some grid requirements? How are these requirements currently met?

One popular way of implementing computational and data grid is by use of middleware. Middleware is software that lies between the operating system and user applications. This layer provides standards and functionality like the requirements listed on the previous slide. Globus is a software toolkit that realizes a middleware based grid.

Globus Questions? What specifically is Globus? What does it include? How does it work?

What specifically is Globus? Globus is a software package that implements the grid requirement mentioned on previous slides. It’s package consists of three distinct parts: 1.Client middleware – used to access remote data, and resources. Also used to submit, run and manage jobs. 2.Server middleware – used to offer data and resources. 3.Libraries and API – used by developers to allow easy production of grid friendly software. Also to easily convert existing software to be grid friendly.

Globus Questions? What specifically is Globus? What does it include? How does it work?

What does it include? Globus includes the following parts to implement the grid requirements: GRAM (Globus Resource Manager) – Used to submit and control jobs over the grid. NEXUS – A library that allows different jobs to communicate with each other. GSI (Grid Security Infrastructure) – Provides PKI authentication. GASS (Globus Access to Secondary Storage) – Makes accessing data the same as accessing web pages. MDS (Meta Computing Directory Service) – information about the availability of resources.

Globus Questions? What specifically is Globus? What does it include? How does it work?

It is best if each component is explained separately: GRAM NEXUS GSI GASS MDS

GRAM OVERVIEW The Globus Resource Allocation Manager (GRAM) is the lowest level of Globus resource management architecture. GRAM allows you to run jobs remotely, providing an API for submitting, monitoring, and terminating your job. A job is submitted, the request is sent to the gatekeeper of the remote computer. The gatekeeper handles the request and creates a job manager for the job. The job manager starts and monitors the remote program, communicating state changes back to the user on the local machine. When the remote application terminates, normally or by failing, the job manager terminates as well. NOTE: This text is from the Globus Website:

From GRAM Illustration

GRAM Example GRAM can be included in your program by using the GRAM API or you can use an existing client program to submit a job(s). Example %globus-job-run somehost.anl.gov /bin/echo hello This will cause the remote machine to execute the echo command. This is a very simple example with a very limited scope but still there is a lot going on behind the seines. First this program automatically writes a RSL to inform the remote machine of the argument (hello) the executable (echo), the host (somehost), and the environment variables needed (to redirect stdio). Authentication takes place at this time using SSL. Then it gives the remote gatekeeper the RSL. The remote gatekeeper starts a job manager which forks a child process that is the actual job. When the job is finished a GASS server is started and the local host is notified that the status of the job is complete. The local host uses a GASS client to get the standard output. GRAM can be used for much more powerful submissions however, for example submitting jobs to many different queues such as LSF that are detected using a resource broker.

Nexus As stated before nexus is a method of passing messages between processes the previous version of MPI-G used nexus as its protocol to pass messages. However the new version of MPI does this more efficiently itself. Nexus is still however an easy way to write programs that need to talk to one another. Here is the formal definition of Nexus from the website: Nexus is a runtime library designed primarily as a compiler target for languages supporting task-parallel and mixed data- and task-parallel execution. The Nexus interface and Nexus design are described elsewhere; here, we provide the information needed to execute programs that use Nexus services.

GSI The Grid Security Infrastructure (GSI) is a set of libraries and tools, provided with the Globus Metacomputing Toolkit, for doing secure authentication over an open network. The GSI allows you to use an X.509 certificate, normally used for authentication on a system running Globus, for other tasks such as system log on. It is possible to use just the authentication portion of Globus via two applications, GSI-enabled Secure Shell (SSH) and GSI-enabled FTP (gsiftp). From Globus supports proxy certificates. This means that a user can authenticate once per session and use all authorized grid services without constantly being asked to authenticate themselves.

GASS GASS simplifies the porting and running of applications that use file I/O to the Globus environment. Libraries and utilities are provided to eliminate the need to manually login to sites and ftp files install a distributed file system The APIs are designed to allow reuse of programs that use Unix or standard C I/O with little or no modification. Currently the ftp and x-gass (GASS server) protocols are supported. From Now for my description: GASS allows data to be accessed over the grid with ease. Much like GRAM you can create your own programs using the GASS-API or you can use clients already created. Example: %globus-rcp somehost.com:data.dat /here This will retrieve the requested file via http protocol. It is interesting to note that GRAM is used to start the GASS server on the remote end.

MDS The MDS is a directory service that is based on the LDAP protocol. It is used to query both static and dynamic information on grid resources such as: Available CPUs Available Storage Scientific instruments Each node with Globus installed has a Grid Resource information System (GRIS) component in the middleware. That node reports all relevant statuses to the GRIS. An organization or collaboration then maintains a Grid Index Information System (GIIS) that can pull information from the GRISs of any node that its in charge of.

Globus Hourglass Redraw of picture fount in The Grid : Blueprint for a New Computing Infrastructure by Ian Foster (Editor), Carl Kesselman

DataBase Replication

The Future  GRID User prompt  Grid Virtual File System  Globus Replica Catalog

References The Grid: Blueprint for a New Computing Infrastructure Edited by Ian Foster and Carl Kesselman.