1 SRM-Lite: overcoming the firewall barrier for large scale file replication Arie Shoshani Alex Sim Lawrence Berkeley National Laboratory April, 2007.

Slides:



Advertisements
Similar presentations
HEPiX GFAL and LCG data management Jean-Philippe Baud CERN/IT/GD.
Advertisements

Jens G Jensen CCLRC/RAL hepsysman 2005Storage Middleware SRM 2.1 issues hepsysman Oxford 5 Dec 2005.
1 The SciDAC Scientific Data Management Center: Infrastructure and Results Arie Shoshani Lawrence Berkeley National Laboratory SC 2004 November, 2004.
Computing Sciences Directorate, L B N L 1 CHEP 2003 Storage Resource Management In the Grid Environment Alex Sim Junmin Gu Arie Shoshani Scientific Data.
1 CHEP 2003 Arie Shoshani Experience with Deploying Storage Resource Managers to Achieve Robust File replication Arie Shoshani Alex Sim Junmin Gu Scientific.
Grid Collector: Enabling File-Transparent Object Access For Analysis Wei-Ming Zhang Kent State University John Wu, Alex Sim, Junmin Gu and Arie Shoshani.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Toni Saarinen, Tite4 Tomi Ruuska, Tite4 Earth System Grid - ESG.
A. Sim, CRD, L B N L 1 Data Management Foundations Workshop, Mar. 3, 2009 Storage in OSG and BeStMan Alex Sim Scientific Data Management Research Group.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Data Grid Web Services Chip Watson Jie Chen, Ying Chen, Bryan Hess, Walt Akers.
16 th May 2006Alessandra Forti Storage Alessandra Forti Group seminar 16th May 2006.
Presented by The Earth System Grid: Turning Climate Datasets into Community Resources David E. Bernholdt, ORNL on behalf of the Earth System Grid team.
A. Sim, CRD, L B N L GIN-Data : SRM Island Inter-Op Testing With SRM-TESTER Alex Sim, Vijaya Natarajan Computational Research Division Lawrence Berkeley.
A. Sim, CRD, L B N L 1 US CMS Workshop, Mar. 3, 2009 Berkeley Storage Manager (BeStMan) Alex Sim Scientific Data Management Research Group Computational.
A. Sim, CRD, L B N L 1 Oct. 23, 2008 BeStMan Extra Slides.
SDM Center February 2, 2005 Progress on MPI-IO Access to Mass Storage System Using a Storage Resource Manager Ekow J. Otoo, Arie Shoshani and Alex Sim.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
1 Alex Romosan,Derek Wright, Alex Romosan, Derek Wright, Ekow Otoo, Doron Rotem, Arie Shoshani (Guidance: Doug Olson) Lawrence Berkeley National Laboratory.
A. Sim, CRD, L B N L 1 OSG Applications Workshop 6/1/2005 OSG SRM/DRM Readiness and Plan Alex Sim / Jorge Rodriguez Scientific Data Management Group Computational.
1 Use of SRMs in Earth System Grid Arie Shoshani Alex Sim Lawrence Berkeley National Laboratory.
Computing Sciences Directorate, L B N L 1 SC 2003 Storage Resource Managers: Essential Components for the Grid Arie Shoshani Staff: Alex Sim, Junmin Gu,
File and Object Replication in Data Grids Chin-Yi Tsai.
Data Management The GSM-WG Perspective. Background SRM is the Storage Resource Manager A Control protocol for Mass Storage Systems Standard protocol:
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
MAGDA Roger Jones UCL 16 th December RWL Jones, Lancaster University MAGDA  Main authors: Wensheng Deng, Torre Wenaus Wensheng DengTorre WenausWensheng.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
NUG 2004 Grid File Yanker Demo Shreyas Cholia Mass Storage Group, NERSC 06/24/2004.
Bulk Data Movement: Components and Architectural Diagram Alex Sim Arie Shoshani LBNL April 2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
Andrew C. Smith – Storage Resource Managers – 10/05/05 Functionality and Integration Storage Resource Managers.
1 Meeting Location: LBNL Sept 18, 2003 The functionality of a Replica Registration Service Attendees Michael Haddox-Schatz, JLAB Ann Chervenak, USC/ISI.
Intergrid KoM Santander 22 june, 2006 E-Infraestructure shared between Europe and Latin America José Manuel Gutiérrez
The Earth System Grid: A Visualisation Solution Gary Strand.
Web Portal Design Workshop, Boulder (CO), Jan 2003 Luca Cinquini (NCAR, ESG) The ESG and NCAR Web Portals Luca Cinquini NCAR, ESG Outline: 1.ESG Data Services.
The Earth System Grid (ESG) Computer Science and Technologies DOE SciDAC ESG Project Review Argonne National Laboratory, Illinois May 8-9, 2003.
Fox 2 AISRP April 4-6, 2005  Earth System Grid  Grid-enabled OPeNDAP  Architecture - Server and Application access  Framework experience.
1 Grid File Replication using Storage Resource Management Presented By Alex Sim Contributors: JLAB: Bryan Hess, Andy Kowalski Fermi: Don Petravick, Timur.
GridFTP GUI: An Easy and Efficient Way to Transfer Data in Grid
Computing Sciences Directorate, L B N L 1 CHEP 2003 Standards For Storage Resource Management BOF Co-Chair: Arie Shoshani * Co-Chair: Peter Kunszt ** *
SRM Monitoring 12 th April 2007 Mirco Ciriello INFN-Pisa.
1 SRM-Lite: overcoming the firewall barrier for data movement Arie Shoshani Alex Sim Viji Natarajan Lawrence Berkeley National Laboratory SDM Center All-Hands.
January 26, 2003Eric Hjort HRMs in STAR Eric Hjort, LBNL (STAR/PPDG Collaborations)
CERN SRM Development Benjamin Coutourier Shaun de Witt CHEP06 - Mumbai.
Jens G Jensen RAL, EDG WP5 Storage Element Overview DataGrid Project Conference Heidelberg, 26 Sep-01 Oct 2003.
1 Research and Development. 2 R&D Agenda  Security  Bulk Data Movement  Data Replication and Mirroring  Monitoring  Metrics  Versioning  Product.
SDM Center Coupling Parallel IO to SRMs for Remote Data Access Ekow Otoo, Arie Shoshani and Alex Sim Lawrence Berkeley National Laboratory.
1 Use of SRM File Streaming by Gateway Alex Sim Arie Shoshani May 2008.
Computing Sciences Directorate, L B N L 1 SC 2003 Storage Resource Managers: Essential Components for the Grid Arie Shoshani Staff: Alex Sim, Junmin Gu,
ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
PPDG meeting, July 2000 Interfacing the Storage Resource Broker (SRB) to the Hierarchical Resource Manager (HRM) Arie Shoshani, Alex Sim (LBNL) Reagan.
1 Xrootd-SRM Andy Hanushevsky, SLAC Alex Romosan, LBNL August, 2006.
Climate-SDM (1) Climate analysis use case –Described by: Marcia Branstetter Use case description –Data obtained from ESG –Using a sequence steps in analysis,
Production Mode Data-Replication Framework in STAR using the HRM Grid CHEP ’04 Congress Centre Interlaken, Switzerland 27 th September – 1 st October Eric.
SRM-iRODS Interface Development WeiLong UENG Academia Sinica Grid Computing 1.
G RID D ATA M ANAGEMENT. D ATA M ANAGEMENT Distributed community of users need to access and analyze large amounts of data Requirement arises in both.
1 Scientific Data Management Group LBNL SRM related demos SC 2002 DemosDemos Robust File Replication of Massive Datasets on the Grid GridFTP-HPSS access.
A. Sim, CRD, L B N L 1 OSG Site Administrators Meeting, Dec. 13, 2007 Berkeley Storage Manager (BeStMan) Alex Sim Scientific Data Management Research Group.
Introduction to Storage Element Hsin-Wei Wu Academia Sinica Grid Computing Center, Taiwan.
9/20/04Storage Resource Manager, Timur Perelmutov, Jon Bakken, Don Petravick, Fermilab 1 Storage Resource Manager Timur Perelmutov Jon Bakken Don Petravick.
A. Sim, CRD, L B N L 1 Production Data Management Workshop, Mar. 3, 2009 BeStMan and Xrootd Alex Sim Scientific Data Management Research Group Computational.
Grid, Storage and SRM Jan , 2008.
Status of the SRM 2.2 MoU extension
The Earth System Grid: A Visualisation Solution
Berkeley Storage Manager (BeStMan)
Data Management cluster summary
A Web-Based Data Grid Chip Watson, Ian Bird, Jie Chen,
Presentation transcript:

1 SRM-Lite: overcoming the firewall barrier for large scale file replication Arie Shoshani Alex Sim Lawrence Berkeley National Laboratory April, 2007

2 First, some background: What are SRMs Storage Resource Managers (SRMs) are middleware components whose function is to provide:Storage Resource Managers (SRMs) are middleware components whose function is to provide: dynamic space allocation AND file management in spaces for storage components on the local or wide-area network Based on a common standard SRM (BeStMan) client/user applications Unix-based Disk Pools Examples of some storage systems currently supported by SRMs dCache CASTOR CCLRC RAL GPFS SRM (DPM) SRM/ dCache SRM/ CASTOR SRM (StoRM) Unix-based Disk Pools

3 SRM Functional Concepts Manage Spaces dynamically Reservation, lifetime Negotiation Manage files in spaces Request to put files in spaces Request to get files from spaces Lifetime, pining of files, release of files No logical name space management (done by replica location services) Access remote sites for files Bring files from other sites and SRMs as requested Use existing transport services (GridFTP, https, …) Transfer protocol negotiation Manage multi-file requests Manage request queues Manage caches Manage garbage collection Directory Management Uxix semantics: srmLs, srmMkdir, srmMv, srmRm, srmRmdir

4 Tomcat servlet engine Tomcat servlet engine MCS Metadata Cataloguing Services MCS Metadata Cataloguing Services RLS Replica Location Services RLS Replica Location Services SOAP RMI MyProxy server MyProxy server MCS client RLS client MyProxy client GRAM gatekeeper GRAM gatekeeper CAS Community Authorization Services CAS Community Authorization Services CAS client disk MSS Mass Storage System HPSS High Performance Storage System disk HPSS High Performance Storage System disk DRM Storage Resource Management DRM Storage Resource Management HRM Storage Resource Management HRM Storage Resource Management HRM Storage Resource Management HRM Storage Resource Management HRM Storage Resource Management HRM Storage Resource Management gridFTP server gridFTP server gridFTP server gridFTP server gridFTP server gridFTP server gridFTP server gridFTP server openDAPg server openDAPg server gridFTP Striped server gridFTP Striped server LBNL LLNL ISI NCAR ORNL ANL DRM Storage Resource Management DRM Storage Resource Management Example Use of SRMs in Earth Science Grid (in production for 3 years) 3100 users120 TBs managed LBNL’s SRMs inter-communicate between several sites and main portal site at NCAR to provide storage management and multi-file movement

5 DataMover: SRMs used to provide large scale robust data streaming between sites Problem: move thousands of files robustlyProblem: move thousands of files robustly Takes many hours Need error recovery Mass storage systems failures Network failures Solution: Use Storage Resource Managers (SRMs)Solution: Use Storage Resource Managers (SRMs) File streaming paradigm By reserving and releasing storage space automatically Problem: too slowProblem: too slow Solution:Solution: in GridFTP Use parallel streams Use large FTP windows Pre-stage files from MSS Use concurrent transfers NERSC Anywhere BNL Disk Cache Disk Cache SRM-COPY (thousands of files) SRM-GET (one file at a time) DataMover SRM (performs writes) SRM (performs reads) GridFTP GET (pull mode) stage files archive files Network transfer Get list of files Example setup for STAR high-energy-physics experiment

6 SRM-Lite Goal 1: automate file movement behind a firewall a client program to automate movement of multiple files to/from client’s directory to a remote site given a OTP firewall at one site Support entire directory transfers Recover from mid-transfer interruption and machine failure Goal 2: pull files into user’s workstation Use SRM-Lite by users to download files into their workstations Using various transfer protocols (GridFTP, bbcp, https, …) Have a GUI that shows transfer progress Or have a command line Support entire directory transfers Support suspend/resume operations (e.g. on laptops)

7 SRM-Lite: a client program to automate movement of multiple files to/from client’s directory to a remote SSH server given a OTP firewall at one site Disk Cache HPSS SSH Server NERSC SSH Request GridFTP/FTP/ SCP transfers Process StepsProcess Steps Login to ORNL using OTP At ORNL invoke SRM-Lite User composes XML input file, srmlite.xml for selected files/directories to copy from/to another site Or, user gives command line option for a selected file/directory SRM-Lite uses srmlite.xml or command line input to automatically Push/Pull files to/from NERSC Use multiple threads for concurrent transfers Disk Cache ORNL SRM- Lite OTP Login Use Case A: OTP firewall at local site (ORNL), SSH server at remote site (NERSC) srmlite.xml Local Commands

8 Scenario: one end has SRM, The other end has a firewall, use SRM Disk Cache HPSS SRM NERSC GridFTP/FTP/ SCP transfers Process StepsProcess Steps Login to ORNL using OTP At ORNL invoke SRM-Lite User composes XML input file, srmlite.txt for selected files/directories to copy over to/from another SRM controlled storage system Or, user gives command line option for a selected file/directory SRM-Lite uses srmlite.xml or command line input to automatically Push/Pull files to/from SRM at NERSC Use multiple threads for concurrent transfers Disk Cache ORNL SRM- Lite OTP Login srmlite.xml SRM Request Use Case B: OTP firewall at local site (ORNL), SRM server at remote site (NERSC)

9 Scenario: one end has SRM, The other end has a firewall, use either SSH/SRM, Disk Cache HPSS SRM SSH Server NERSC SSH Request GridFTP/FTP/ SCP transfers Process StepsProcess Steps Login to ORNL using OTP At ORNL invoke SRM-Lite User composes XML input file, srmlite.xml for selected files/directories to copy over to another site Or, user gives command line option for a selected file/directory SRM-Lite uses srmlite.xml or command line input to automatically Push/Pull files to/from NERSC using either SSH or SRM Use multiple threads for concurrent transfers Disk Cache ORNL SRM- Lite OTP Login srmlite.txt SRM Request Use Case C: OTP firewall at local site (ORNL), SRM/SSH server at remote site (NERSC)

10 Scenario: both ends have SRMs, both ends have a firewall, use SRM-lite on both ends, Use SSH to invoke SRM-Lite at other end Process StepsProcess Steps Login to ORNL using OTP Create a OTP SSH tunnel to NERSC User composes XML input file, srmlite.txt for selected files/directories to copy over to another site Or, user gives command line option for a selected file/directory SRM-Lite uses srmlite.txt or command line input, through SSH tunnel to automatically Communicate with SRM-lite at other end Push/Pull files to/from NERSC using SRM Use multiple threads for concurrent transfers Can use SCP onlyCan use SCP only NERSC SCP transfers ORNL OTP Login SSH-Tunneling OTP SSH Tunnel Disk Cache SRM- Lite srmlite.txt HPSS SRM Disk Cache HPSS SRM SRM- Lite SRM Request Use Case D: OTP firewall at both local site (ORNL), and remote site (NERSC), Use SRM-lite at both ends

11 SRM-Lite: Status SRM-Lite is developed Available from: Tested with GridFTP, SCP, HTTPS, HTTP Tested with large number of files Tested behind a firewall Access from local SRMs that acess HPSS – not tested yet Access between two filewalled system – not developed yet

12 SRM-Lite: GUI GUI was developed when used at user’s site (Linux, PC, MAC), called DataMover-lite (DML) Available from: Example GUI screen Shows info on: completed, active, and pending transfers Also, file sizes, transfer times, transfer speed

13 Extra Slides

14 Storage Resource Managers SRMs are middleware components whose function is to provide:SRMs are middleware components whose function is to provide: dynamic space allocation AND file management in spaces for storage components on the local or wide-area network Based on a common standard SRM (BeStMan) JASMine client/user applications Unix-based disks Examples of storage systems currently supported by SRMs dCache CASTOR CCLRC RAL GPFS SRM (DPM) SRM (Jlab-SRM) Unix-based disks CASTOR SRM (StoRM) SRM/ dCache SRM/ CASTOR SRM/ CASTOR SRM (StoRM) MSS SRM/ L-Store SRM (BeStMan) SRM (BeStMan)

15 DataMover-Lite use in ESG: a client program used to automate movement of multiple files to client’s directory Disk Cache MSS SRM ESG Portal Disk Cache User’s browser DataPortal NCAR User’s machine DataMover Lite release request GridFTP/FTP/ HTTP/HTTPS transfers Process StepsProcess Steps User downloads DataMoverLite User goes to portal, select files Portal gets ALL files into SRM disk Portal generates XML input file, datamover.txt, for user selected files DML uses datamover.txt to automatically get files, and release files after move completes successfully datamover.txt

16 Another example of DML GUI