Managed Object Placement Service John Bresnahan, Mike Link and Raj Kettimuthu (Presenting) Argonne National Lab.

Slides:



Advertisements
Similar presentations
Wei Lu 1, Kate Keahey 2, Tim Freeman 2, Frank Siebenlist 2 1 Indiana University, 2 Argonne National Lab
Advertisements

The Globus Striped GridFTP Framework and Server Bill Allcock 1 (presenting) John Bresnahan 1 Raj Kettimuthu 1 Mike Link 2 Catalin Dumitrescu 2 Ioan Raicu.
GridFTP Challenges In Data Transport John Bresnahan Argonne National Laboratory The University of Chicago.
Cross-site data transfer on TeraGrid using GridFTP TeraGrid06 Institute User Introduction to TeraGrid June 12 th by Krishna Muriki
© 2003 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Performance Measurements of a User-Space.
Current Testbed : 100 GE 2 sites (NERSC, ANL) with 3 nodes each. Each node with 4 x 10 GE NICs Measure various overheads from protocols and file sizes.
CCNA – Network Fundamentals
Lecture 7 Transport Layer
Esma Yildirim Department of Computer Engineering Fatih University Istanbul, Turkey DATACLOUD 2013.
Answers of Exercise 7 1. Explain what are the connection-oriented communication and the connectionless communication. Give some examples for each of the.
GridFTP: File Transfer Protocol in Grid Computing Networks
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 OSI Transport Layer Network Fundamentals – Chapter 4.
1 © 2003, Cisco Systems, Inc. All rights reserved. CCNA 1 v3.0 Module 11 TCP/IP Transport and Application Layers.
Telnet/SSH Tim Jansen, Mike Stanislawski. TELNET is short for Terminal Network Enables the establishment of a connection to a remote system, so that the.
Module – 7 network-attached storage (NAS)
Process-to-Process Delivery:
Christopher Bednarz Justin Jones Prof. Xiang ECE 4986 Fall Department of Electrical and Computer Engineering University.
GridFTP Guy Warner, NeSC Training.
Chapter 7: Using Windows Servers to Share Information.
1 Chapter Client-Server Interaction. 2 Functionality  Transport layer and layers below  Basic communication  Reliability  Application layer.
Part Three: Data Management 3: Data Management A: Data Management — The Problem B: Moving Data on the Grid FTP, SCP GridFTP, UberFTP globus-URL-copy.
Globus Striped GridFTP Framework and Server Raj Kettimuthu, ANL and U. Chicago.
Globus GridFTP: What’s New in 2007 Raj Kettimuthu Argonne National Laboratory and The University of Chicago.
Jaringan Komputer Dasar OSI Transport Layer Aurelio Rahmadian.
Slide 1 DESIGN, IMPLEMENTATION, AND PERFORMANCE ANALYSIS OF THE ISCSI PROTOCOL FOR SCSI OVER TCP/IP By Anshul Chadda (Trebia Networks)-Speaker Ashish Palekar.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 7-Oct-15 OSI transport layer CCNA Exploration Semester 1 Chapter 4.
1 Version 3.0 Module 11 TCP Application and Transport.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 OSI Transport Layer Network Fundamentals – Chapter 4.
Secure, Collaborative, Web Service enabled and Bittorrent Inspired High-speed Scientific Data Transfer Framework.
UDT: UDP based Data Transfer Protocol, Results, and Implementation Experiences Yunhong Gu & Robert Grossman Laboratory for Advanced Computing / Univ. of.
The Globus GridFTP Framework and Server John Bresnahan, Mike Link and Raj Kettimuthu (Presenting) Math & Computer Science Division, Argonne National Laboratory,
Reliable Data Movement using Globus GridFTP and RFT: New Developments in 2008 John Bresnahan Michael Link Raj Kettimuthu Argonne National Laboratory and.
Globus GridFTP and RFT: An Overview and New Features Raj Kettimuthu Argonne National Laboratory and The University of Chicago.
Swapping to Remote Memory over InfiniBand: An Approach using a High Performance Network Block Device Shuang LiangRanjit NoronhaDhabaleswar K. Panda IEEE.
UDT as an Alternative Transport Protocol for GridFTP Raj Kettimuthu Argonne National Laboratory The University of Chicago.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Communicating Security Assertions over the GridFTP Control Channel Rajkumar Kettimuthu 1,2, Liu Wantao 3,4, Frank Siebenlist 1,2 and Ian Foster 1,2,3 1.
Transport Layer COM211 Communications and Networks CDA College Theodoros Christophides
LEGS: A WSRF Service to Estimate Latency between Arbitrary Hosts on the Internet R.Vijayprasanth 1, R. Kavithaa 2,3 and Raj Kettimuthu 2,3 1 Coimbatore.
Flexibility, Manageability and Performance in a Grid Storage Appliance John Bent, Venkateshwaran Venkataramani, Nick Leroy, Alain Roy, Joseph Stanley,
GridFTP GUI: An Easy and Efficient Way to Transfer Data in Grid
FTP File Transfer Protocol Graeme Strachan. Agenda  An Overview  A Demonstration  An Activity.
GridFTP Richard Hopkins
CCNA4 v3 Module 6 v3 CCNA 4 Module 6 JEOPARDY K. Martin.
CEDPS Data Services Ann Chervenak USC Information Sciences Institute.
Hepix LAL April 2001 An alternative to ftp : bbftp Gilles Farrache In2p3 Computing Center
Full and Para Virtualization
A Managed Object Placement Service (MOPS) using NEST and GridFTP Dr. Dan Fraser John Bresnahan, Nick LeRoy, Mike Link, Miron Livny, Raj Kettimuthu SCIDAC.
Lecture 4 Mechanisms & Kernel for NOSs. Mechanisms for Network Operating Systems  Network operating systems provide three basic mechanisms that support.
The Globus eXtensible Input/Output System (XIO): A protocol independent IO system for the Grid Bill Allcock, John Bresnahan, Raj Kettimuthu and Joe Link.
ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
Renesas Electronics America Inc. © 2010 Renesas Electronics America Inc. All rights reserved. Overview of Ethernet Networking A Rev /31/2011.
File Transfer And Access (FTP, TFTP, NFS). Remote File Access, Transfer and Storage Networks For different goals variety of approaches to remote file.
Globus Data Storage Interface (DSI) - Enabling Easy Access to Grid Datasets Raj Kettimuthu, ANL and U. Chicago DIALOGUE Workshop August 2, 2005.
GridFTP Guy Warner, NeSC Training Team.
1 GridFTP and SRB Guy Warner Training, Outreach and Education Team, Edinburgh e-Science.
New Development Efforts in GridFTP Raj Kettimuthu Math & Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, U.S.A.
A Sneak Peak of What’s New in Globus GridFTP John Bresnahan Michael Link Raj Kettimuthu (Presenting) Argonne National Laboratory and The University of.
Application Layer Functionality and Protocols Abdul Hadi Alaidi
Chapter 7: Using Windows Servers
Diskpool and cloud storage benchmarks used in IT-DSS
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
Client-Server Interaction
Process-to-Process Delivery:
Process-to-Process Delivery: UDP, TCP
Computer Networks Protocols
Preventing Privilege Escalation
Presentation transcript:

Managed Object Placement Service John Bresnahan, Mike Link and Raj Kettimuthu (Presenting) Argonne National Lab

Topics for discussion l MOPS overview l MOPS 0.1 u Pipelining u Gfork u GridFTP plugin for Gfork u Lotman l Other GridFTP enhancements u Performance u Ease of use l Future directions

Managed Object Placement Service (MOPS) l Resource management in GridFTP u Memory usage limitation u Enforce appropriate storage usage u Enforce appropriate bandwidth usage u Eliminates the potential to *overheat* a system l Bandwidth and storage reservation l Transfer scheduling l MOPS 0.1 is available at

MOPS l MOPS 0.1 includes u Optimization for lots of small files transfer u Globus fork (Gfork) - inetd like service that allows state to be maintained across connections u GridFTP plugin for Gfork - allows for dynamic addition/removal of data movers, limit memory usage u Lotman - manage storage u Plugin for GridFTP to enforce storage usage policies using lotman

Lots of Small Files (LOSF) Problem l GridFTP and FTP - command response protocols l A client must wait for a “Finished response” before sending the next command l Overhead added on a per file basis l Performance is best on large files u Overhead has less impact l Performance suffers for a large data set partitioned into many small files

LOSF l Traditional data transfer pattern File Request 1 File Request 2 File Request 3 DATA 1 DATA 2 DATA 3 ACK 1 ACK 2 ACK 3

Pipelining l Data transfer pattern with pipelining File Request 1 File Request 2 File Request 3 DATA 1 DATA 2 DATA 3 ACK 1 ACK 2 ACK 3

Pipelining WAN - SDSC and ANL with GSI security

Gfork l Under extreme loads it is possible that GridFTP servers require more memory than the system has and cause the system to fall over l Developed a service called gfork to help avoid this situation l Gfork - a service like inetd that listens on a TCP port and runs a configurable executable in a child process whenever a connection is made l Allows server state to be maintained across connections

Gfork l Associated with Gfork is a user defined master process l Master process runs for the lifetime of the gfork daemon Client Server Host GFork Server GridFTP Plugin GridFTP Server Instance Fork GridFTP Server Instance GridFTP Server Instance State Sharing Link Client Inherited Links Control Channel Connections

Dynamic data movers l GridFTP can be run as a striped server - there is a control process and several data movers. l The data movers run in tandem to transfer files faster by tying together many NICs. l When the control process is run out of inetd the list of possible data movers had to be statically configured. l But data movers tend to come and go. u Sometimes data movers fail u Sometimes data movers are added to a pool

Dynamic data movers l GridFTP plugin for Gfork allows for dynamic addition/removal of data movers l GridFTP plugin on the control node can be configured to listen for data mover registrations l GridFTP plugin on the data movers can be configured to register with the plugin on the control node

Dynamic data movers Control node GFork Server GridFTP Plugin Frontend Instance Fork Lookup available backend Backend Instance Data Mover Fork Registration Control Connection l Multiple data movers register with plugin u Plugin maintains the list of available DMs l Control process instance selects N DMs for use l If any one DM fails another can be used l DM pool can grow and shrink GFork Server GridFTP Plugin

Memory Management l GridFTP plugin for Gfork has a memory limiting option l Limit memory usage to a given value or to the maximum amount of RAM in the system. l Most of the memory is given to the first few connections l When the plugin detects that it is overloaded, each session is limited to half the available memory.

Memory Management l To measure the effectiveness of the memory limiting functionality, we performed experiments using xen VM. l Clients are run on a 2GHz processor with 512MB of RAM and the server was run in a xen VM with either 64MB or 128MB of RAM with a 2GHz AMD 64bit processor. l The machines were connected to each other via a 1Gb/s network with iperf measured max speeds of 600Mb/s.

No memory limiting

With memory limiting

Lotman l Manage disk space - interface to create disk space called lot l Lot defined by four characteristics u Owner, capacity, duration, files u Owner - client that is allowed to use the lot u Capacity - amount of data that can be stored u Duration - time a lot is guaranteed to exist u Lot contains a set of files

GridFTP with lotman Client GridFTP Server Storage Access Control Plugin DSI Plugin Main Codebase Lotman

Other Enhancements l Performance enhancement u GridFTP over UDT l Ease of Use enhancements u Alternate security mechanism u GridFTP Where there’s FTP

GridFTP over UDT l UDT is an application-level data transport protocol that uses UDP to transfer data l Implement its own reliability and congestion control mechanisms l Achieves good performance on high-bandwidth, high- delay networks where TCP has significant limitations l GridFTP uses Globus XIO interface to invoke network I/O operations

GridFTP over UDT l XIO framework presents a standard open/close/read/write interface to many different protocol implementations u including TCP, UDP, HTTP -- and now UDT l The protocol implementations are called drivers. u A driver can be dynamically loaded and stacked by any Globus XIO application. l Created an XIO driver for UDT reference implementation l Enabled GridFTP to use it as an alternate transport protocol

GridFTP over UDT Argonne to NZ Throughput in Mbit/s Argonne to LA Throughput in Mbit/s Iperf – 1 stream Iperf – 8 streams GridFTP mem TCP – 1 stream GridFTP mem TCP – 8 streams GridFTP disk TCP – 1 stream GridFTP disk TCP – 8 streams GridFTP mem UDT GridFTP disk UDT UDT mem UDT disk

Alternate security mechanism l GridFTP traditionally uses GSI for establishing secure connections l In some situations, preferable to use SSH security mechanism l Leverages the fact that an SSH client can remotely execute programs by forming a secure connection with SSHD u The client (globus-url-copy) acts as an SSH client and remotely executes a Globus GridFTP server u All of the standard IO from the remote program is routed back to the client.

SSH security mechanism l Client support for using SSH is automatically enabled l On the server side (where you intend the client to remotely execute a server) u setup-globus-gridftp-sshftp -server l In order to use SSH as a security mechanism, the user must provide urls that begin with sshftp:// as arguments. u globus-url-copy sshftp:// : / file:/ u is the port in which sshd listens on the host referred to by (the default value is 22).

GridFTP Where there’s FTP (GWFTP) l GridFTP has been in existence for some time and has proven to be quite robust and useful l Only few GridFTP clients available l FTP has innumerable clients l GWFTP - created to leverage the FTP clients l A proxy between FTP clients and GridFTP servers

GWFTP l Two security options provided with GWFTP to authenticate its client u Password based and host based FTP Client GWFTP (GSI Credential) wiggum.mcs.anl.gov GridFTP Server (2811) USER ::gsiftp://wiggum.mcs.anl.gov:2811/ PASS GSI Authentication Get request Data

Future direction Controller Network reservation service Enhanced GridFTP Storage Manager Controller l Enhance GridFTP servers to expose status as WS resource properties l Take advantage of the network provisioning services u Collaborating with Terapath and LambdaStation projects

Infiniband over SONET Need specialized hardware: Obsidian longbow 1. IB over SONET/Ethernet – frame conversion 2. Buffer-based termination of IB flow control Linux host Linux host ORNL 700 miles Linux host Linux host ChicagoCDCIChicagoCDCI SeattleCDCISeattleCDCISunnyvaleCDCISunnyvaleCDCI ORNLCDCIORNLCDCI longbowIB/SlongbowIB/S longbowIB/SlongbowIB/S 3300 miles4300 miles ORNL loop -0.2 mile: 7.5Gbps IB 4x: 8Gbps (full speed) Host-to-host local switch:7.5Gbps IB 4x ORNL-Chicago loop – 1400 miles: 7.46Gbps ORNL- Chicago - Seattle loop – 6600 miles: 7.23Gbps ORNL – Chicago – Seattle - Sunnyvale loop – 8600 miles: 7.20Gbps

GridFTP over Infiniband l Can use infiniband through Sockets Direct Protocol (SDP) l SDP provides a socket interface for Open Fabrics software stack (a standard implemented by infiniband and iwarp) u No kernel bypass l User level verbs to interface directly with infiniband hardware u Develop a XIO driver for verbs interface