Globus DataGrid Overview Bill Allcock, ANL GridPP Meeting 30 June 2003.

Slides:



Advertisements
Similar presentations
Giggle: A Framework for Constructing Scalable Replica Location Services Ann Chervenak, Ewa Deelman, Ian Foster, Leanne Guy, Wolfgang Hoschekk, Adriana.
Advertisements

The Globus Striped GridFTP Framework and Server Bill Allcock 1 (presenting) John Bresnahan 1 Raj Kettimuthu 1 Mike Link 2 Catalin Dumitrescu 2 Ioan Raicu.
The Replica Location Service In wide area computing systems, it is often desirable to create copies (replicas) of data objects. Replication can be used.
Case Study 1: Data Replication for LIGO Scott Koranda Ann Chervenak.
WP2 and GridPP UK Simulation W. H. Bell University of Glasgow EDG – WP2.
Cross-site data transfer on TeraGrid using GridFTP TeraGrid06 Institute User Introduction to TeraGrid June 12 th by Krishna Muriki
Data Management Expert Panel - WP2. WP2 Overview.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
GridFTP: File Transfer Protocol in Grid Computing Networks
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 1: Introduction to Windows Server 2003.
CS CS 5150 Software Engineering Lecture 13 System Architecture and Design 1.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
GridFTP Guy Warner, NeSC Training.
The Data Replication Service Ann Chervenak Robert Schuler USC Information Sciences Institute.
An easy way to manage Relational Databases in the Globus Community Sandro Fiore ISUFI/ Center for Advanced Computational Technologies Director: prof. Giovanni.
Globus Striped GridFTP Framework and Server Raj Kettimuthu, ANL and U. Chicago.
Thesis Proposal Ali Kaplan
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Globus Data Replication Services Ann Chervenak, Robert Schuler USC Information Sciences Institute.
DataGrid Middleware: Enabling Big Science on Big Data One of the most demanding and important challenges that we face as we attempt to construct the distributed.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Why GridFTP? l Performance u Parallel TCP streams, optimal TCP buffer u Non TCP protocol such as UDT u Order of magnitude greater l Cluster-to-cluster.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Secure, Collaborative, Web Service enabled and Bittorrent Inspired High-speed Scientific Data Transfer Framework.
1 Use of SRMs in Earth System Grid Arie Shoshani Alex Sim Lawrence Berkeley National Laboratory.
File and Object Replication in Data Grids Chin-Yi Tsai.
Reliable Data Movement using Globus GridFTP and RFT: New Developments in 2008 John Bresnahan Michael Link Raj Kettimuthu Argonne National Laboratory and.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
UDT as an Alternative Transport Protocol for GridFTP Raj Kettimuthu Argonne National Laboratory The University of Chicago.
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Part Four: The LSC DataGrid Part Four: LSC DataGrid A: Data Replication B: What is the LSC DataGrid? C: The LSCDataFind tool.
Globus Replica Management Bill Allcock, ANL PPDG Meeting at SLAC 20 Sep 2000.
Communicating Security Assertions over the GridFTP Control Channel Rajkumar Kettimuthu 1,2, Liu Wantao 3,4, Frank Siebenlist 1,2 and Ian Foster 1,2,3 1.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
The Earth System Grid (ESG) Computer Science and Technologies DOE SciDAC ESG Project Review Argonne National Laboratory, Illinois May 8-9, 2003.
Data Management and Transfer in High-Performance Computational Grid Environments B. Allcock, J. Bester, J. Bresnahan, A. L. Chervenak, I. Foster, C. Kesselman,
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
Globus – Part II Sathish Vadhiyar. Globus Information Service.
Replica Management Kelly Clynes. Agenda Grid Computing Globus Toolkit What is Replica Management Replica Management in Globus Replica Management Catalog.
Objective What is RFT ? How does it work Architecture of RFT RFT and OGSA Issues Demo Questions.
Rights Management in Globus Data Services Ann Chervenak, ISI/USC Bill Allcock, ANL/UC.
AERG 2007Grid Data Management1 Grid Data Management GridFTP Carolina León Carri Ben Clifford (OSG)
The Globus eXtensible Input/Output System (XIO): A protocol independent IO system for the Grid Bill Allcock, John Bresnahan, Raj Kettimuthu and Joe Link.
ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
Globus Data Storage Interface (DSI) - Enabling Easy Access to Grid Datasets Raj Kettimuthu, ANL and U. Chicago DIALOGUE Workshop August 2, 2005.
GridFTP Guy Warner, NeSC Training Team.
1 GridFTP and SRB Guy Warner Training, Outreach and Education Team, Edinburgh e-Science.
Protocols and Services for Distributed Data- Intensive Science Bill Allcock, ANL ACAT Conference 19 Oct 2000 Fermi National Accelerator Laboratory Contributors:
DataGrid is a project funded by the European Commission EDG Conference, Heidelberg, Sep 26 – Oct under contract IST OGSI and GT3 Initial.
A Sneak Peak of What’s New in Globus GridFTP John Bresnahan Michael Link Raj Kettimuthu (Presenting) Argonne National Laboratory and The University of.
Current Globus Developments Jennifer Schopf, ANL.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
OGSA-DAI.
System Architecture CS 560. Project Design The requirements describe the function of a system as seen by the client. The software team must design a system.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
The Data Grid: Towards an architecture for Distributed Management
Joseph JaJa, Mike Smorul, and Sangchul Song
A Replica Location Service
University of Technology
Presentation transcript:

Globus DataGrid Overview Bill Allcock, ANL GridPP Meeting 30 June 2003

Sources of Information / Support l Me –definitive source of information about GridFTP –Responsible for requirements gathering, feature prioritization, getting developer resources, directing the development work, etc.. l –Extensive archive that is worth searching –GridFTP developers monitor it and are good about answering, but not required. l Bugzilla –Used for submitting bugs

GridFTP Feature Set –GSI, Kerberos security –Third-party transfers –Parameter set/negotiate –Partial file access –Reliability/restart –Large file support –Data channel reuse –Defacto Standard on the Grid –Integrated instrumentation –Logging/audit trail –Parallel transfers –Striping –TCP Buffer size control –Policy-based access control –Server-side computation –Based on Standards

GridFTP at SC2000: Long-Running Dallas-Chicago Transfer SciNet Power Failure Other demos starting up (Congestion) Parallelism Increases (Demos) Backbone problems on the SC Floor DNS Problems Transition between files (not zero due to averaging)

Reliable File Transfer l Note that I said any *REMOTE* resource can fail l Local failure would mean loss of state since it is held in the clients memory. l Could modify the restart plug-in to write state to disk. l We opted for a service that accepts data transfer jobs and uses a database. l This provides increased robustness AND allows a client to initiate a long running job and not have to tie up the local computer to keep it running. l We call this server the Reliable File Transfer (RFT) service l One test ran 54 hours, moved 0.3 TB, and survived muliple failures both natural and intentional

GridFTP: Standards Based l Existing standards –RFC 949: File Transfer Protocol –RFC 2228: FTP Security Extensions –RFC 2389: Feature Negotiation for the File Transfer Protocol –Draft: FTP Extensions l New drafts –GridFTP: Protocol Extensions to FTP for the Grid >Grid Forum GridFTP Working Group >Submitted for public comment

GridFTP: Future Work l New Server Beta in August (wuftp replacement) w/ transport and security l Striping functionality and HPSS released in Q1/Q with HPSS 5.2b and logging. l Other features based on demand. l Improved testing and documentation l Inclusion of Protocol extensions from GGF l Interface in server for policy engine. I.e., allocate one stripe per 100MB of file size l New web services control channel protocol l Utilization of Non-TCP network protocols l Bandwidth Limiting

Basic Layout of GridFTP for HPSS

eXtensible IO Library (xio) l Abstract away the transport layer l Define standard function signatures for Read/Write/Open/Close l Two types of drivers: transport and transform l Transport has to be the first pushed on the stack l Can have an arbitrary number of transform drivers

Transform Driver Example (gsi) l Open does the authentication and if specified via an attribute, delegation. l Read/Write could be a simple pass through or if requested might do encryption or integrity. l Close in this case is a no-op. l Kerberos *should* be easier. Simply pop gsi and push kerberos.

Planned xio drivers l Basics: TCP, UDP, file, gsi l GridFTP: Make it simple for an application to access files under the control of a GridFTP server. –Note that xio drivers can call xio drivers: The GridFTP driver will call sockets which will call TCP l MultiStream Data Channel Protocol l HTTP l SABUL l Rate Limiting

Transport Stack in Globus Reliable File Transfer Service New GridFTP Server Extensible IO System (under all of Globus) Client / User App can poke down the stack as necessary

Replica Management

Replica Catalog Structure: A Climate Modeling Example Logical File Parent Logical File Jan 1998 Logical Collection C02 measurements 1998 Replica Catalog Location jupiter.isi.edu Location sprite.llnl.gov Logical File Feb 1998 Size: Filename: Jan 1998 Filename: Feb 1998 … Filename: Mar 1998 Filename: Jun 1998 Filename: Oct 1998 Protocol: gsiftp UrlConstructor: gsiftp://jupiter.isi.edu/ nfs/v6/climate Filename: Jan 1998 … Filename: Dec 1998 Protocol: ftp UrlConstructor: ftp://sprite.llnl.gov/ pub/pcmdi Logical Collection C02 measurements 1999

A Replica Location Service l A Replica Location Service (RLS) is a distributed registry service that records the locations of data copies and allows discovery of replicas l Maintains mappings between logical identifiers and target names –Physical targets: Map to exact locations of replicated data –Logical targets: Map to another layer of logical names, allowing storage systems to move data without informing the RLS l RLS was designed and implemented in a collaboration between the Globus project and the DataGrid project

LRC RLI LRC Replica Location Indexes Local Replica Catalogs LRCs contain consistent information about logical-to- target mappings on a site RLIs nodes aggregate information about LRCs Soft state updates from LRCs to RLIs: relaxed consistency of index information, used to rebuild index after failures Arbitrary levels of RLI hierarchy

A Flexible RLS Framework Five elements: 1. Consistent Local State: Records mappings between logical names and target names and answers queries 2. Global State with relaxed consistency: Global index supports discovery of replicas at multiple sites; relaxed consistency 3. Soft state mechanisms for maintaining global state: LRCs send information about their mappings (state) to RLIs using soft state protocols 4. Compression of state updates (optional): reduce communication and storage overheads 5. Membership service: for location of participating LRCs and RLIs and dealing with changes in membership

LRC RLI LRC Replica Location Indexes Local Replica Catalogs An RLS with No Redundancy, Partitioning of Index by Storage Sites

An RLS with Redundancy

Replica Location Service In Context l The Replica Location Service is one component in a layered data management architecture l Provides a simple, distributed registry of mappings l Consistency management provided by higher-level services

Components of RLS Implementation l Front-End Server – Multi-threaded – Supports GSI Authentication – Common implementation for LRC and RLI l Back-end Server – mySQL Relational Database – Holds logical name to target name mappings l Client APIs: C and Java

Implementation Features l Two types of soft state updates from LRCs to RLIs – Complete list of logical names registered in LRC – Bloom filter summaries of LRC l User-defined attributes – May be associated with logical or target names l Partitioning – Divide LRC soft state updates among RLI index nodes using pattern matching of logical names l Membership service –Static configuration only –Eventually use OGSA registration techniques