Enabling Grids for E-sciencE EGEE-II INFSO-RI The Development of SRM interface for SRB Fu-Ming Tsai Academia Sinica Grid Computing.

Slides:



Advertisements
Similar presentations
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
Advertisements

Data Grid: Storage Resource Broker Mike Smorul. SRB Overview Developed at San Diego Supercomputing Center. Provides the abstraction mechanisms needed.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
Applying Data Grids to Support Distributed Data Management Storage Resource Broker Reagan W. Moore Ian Fisk Bing Zhu University of California, San Diego.
Magda – Manager for grid-based data Wensheng Deng Physics Applications Software group Brookhaven National Laboratory.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Jan Storage Resource Broker Managing Distributed Data in a Grid A discussion of a paper published by a group of researchers at the San Diego Supercomputer.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.
A. Sim, CRD, L B N L 1 Oct. 23, 2008 BeStMan Extra Slides.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data Grid Services/SRB/SRM & Practical Hai-Ning Wu Academia Sinica Grid Computing.
EGEE-III INFSO-RI Enabling Grids for E-sciencE The Medical Data Manager : the components Johan Montagnat, Romain Texier, Tristan.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
Data Management The GSM-WG Perspective. Background SRM is the Storage Resource Manager A Control protocol for Mass Storage Systems Standard protocol:
The LCG File Catalog (LFC) Jean-Philippe Baud – Sophie Lemaitre IT-GD, CERN May 2005.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware Data Management in gLite.
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Andrew C. Smith – Storage Resource Managers – 10/05/05 Functionality and Integration Storage Resource Managers.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks AMGA PHP API Claudio Cherubino INFN - Catania.
Replica Management Services in the European DataGrid Project Work Package 2 European DataGrid.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management and Interoperability Peter Kunszt (JRA1 DM Cluster) 2 nd EGEE Conference,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The GILDA t-Infrastructure Roberto Barbera.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
Managing Data DIRAC Project. Outline  Data management components  Storage Elements  File Catalogs  DIRAC conventions for user data  Data operation.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
Enabling Grids for E-sciencE EGEE-II INFSO-RI Medical Data Manager 1 Dicom retrieval : overview of the DPM One command line to retrieve a file:
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
David Adams ATLAS ATLAS distributed data management David Adams BNL February 22, 2005 Database working group ATLAS software workshop.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
EGI-Engage Data Services and Solutions Part 1: Data in the Grid Vincenzo Spinoso EGI.eu/INFN Data Services.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Storage Accounting for Grid Environments Fabio Scibilia INFN - Catania.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks A GRID based platform to host multiple repositories.
1 Xrootd-SRM Andy Hanushevsky, SLAC Alex Romosan, LBNL August, 2006.
The Storage Resource Broker and.
SRM-iRODS Interface Development WeiLong UENG Academia Sinica Grid Computing 1.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data management in EGEE.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
INFSO-RI Enabling Grids for E-sciencE University of Coimbra gLite 1.4 Data Management System Salvatore Scifo, Riccardo Bruno Test.
INFSO-RI Enabling Grids for E-sciencE University of Coimbra Data Management System gLite – LCG – FiReMan Salvatore Scifo INFN Catania.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Architecture of LHC File Catalog Valeria Ardizzone INFN Catania – EGEE-II NA3/NA4.
Enabling Grids for E-sciencE EGEE-II INFSO-RI Status of SRB/SRM interface development Fu-Ming Tsai Academia Sinica Grid Computing.
Bologna, March 30, 2006 Riccardo Zappi / Luca Magnoni INFN-CNAF, Bologna.
Enabling Grids for E-sciencE INFN Workshop – May 7-11 Rimini 1 Grid Accounting Status at INFN Riccardo Brunetti INFN-TORINO.
User Domain Storage Elements SURL  TURL LFC Domain (LCG File Catalogue) SA1 – Data Grid Interoperation Enabling Grids for E-sciencE EGEE-III INFSO-RI
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
2 nd EGEE/OSG Workshop Data Management in Production Grids 2 nd of series of EGEE/OSG workshops – 1 st on security at HPDC 2006 (Paris) Goal: open discussion.
Riccardo Zappi INFN-CNAF SRM Breakout session. February 28, 2012 Ingredients 1. Basic ingredients (Fabric & Conn. level) 2. (Grid) Middleware ingredients.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
EGEE Data Management Services
a brief summary for users
gLite Basic APIs Christos Filippidis
The Data Grid: Towards an architecture for Distributed Management
Vincenzo Spinoso EGI.eu/INFN
Cross-health enterprises Medical Data Management on the EGEE grid
Practical: The Information Systems
gLite Data management system overview
StoRM Architecture and Daemons
Middleware independent Information Service
Introduction to Data Management in EGI
GFAL 2.0 Devresse Adrien CERN lcgutil team
Data Management Ouafa Bentaleb CERIST, Algeria
AMGA Web Interface Vincenzo Milazzo
Data services in gLite “s” gLite and LCG.
Architecture of the gLite Data Management System
INFNGRID Workshop – Bari, Italy, October 2004
Presentation transcript:

Enabling Grids for E-sciencE EGEE-II INFSO-RI The Development of SRM interface for SRB Fu-Ming Tsai Academia Sinica Grid Computing

EGEE-II INFSO-RI Enabling Grids for E-sciencE 2 What is SRM? SRM is an unique interface for accessing diffident backend storages for diffident middleware. Easy to develop applications to adapt different backend storages. Provide space and file management on the storage system. SRM is the web service interface and the implementation usually depends on the backend storage technology.

EGEE-II INFSO-RI Enabling Grids for E-sciencE 3 Why SRM(Storage resource management)? Storage Elements (SE) can use different type of technologies –CASTOR, dCache, DPM, BeStMan,...,etc. –DRM (Disk Resource Manager)/TRM (Tape Resource Manager) /HRM (Hierarchical Resource Manager) Grid middleware needs to access files with an uniform interface –Manage storage resources –Not a file transfer protocol

EGEE-II INFSO-RI Enabling Grids for E-sciencE 4 SRM features Provides space management Provides an uniform access interface Manages DRM/Tape/HRM Does not transfer files itself. Manage the life time of file

EGEE-II INFSO-RI Enabling Grids for E-sciencE 5 What is SRB (Storage Resource Broker) ? Developed by San Diego Supercomputer Center A distributed file management system (Data Grid), based on a client-server architecture. Allows users to access files seamlessly across a distributed environment, based upon their attributes rather than just their names or physical locations. It replicates, syncs, archives, and connects heterogeneous resources in a logical and abstracted manner.

EGEE-II INFSO-RI Enabling Grids for E-sciencE 6 SRB Physical Structure Oracle ClientSRB Server Oracle RDBMS SRB location B SRB location B SRB location D SRB Server Storage Space Storage Driver SRB Server Storage Space Storage Driver SRB Server Storage Space Storage Driver location X

EGEE-II INFSO-RI Enabling Grids for E-sciencE 7 SRM for SRB – Our Implementation

EGEE-II INFSO-RI Enabling Grids for E-sciencE 8 Cache server (+gridftp server) Architecture Overview Core Cache repository SRB+DSI File catalog SRB/gridftp Gridftp/management API SRM API File transfer (gridftp) Web Service Data server management Users/applications

EGEE-II INFSO-RI Enabling Grids for E-sciencE 9 Architecture Overview (cont.) Web Service Core Data server management Cache server (+gridftp server) Non MES+DSI Cache repository MES Server (GSI enabled) SRB storage space Non MES File catalog (AMGA) Put a file (SURL) SURL->Path Where to put the file? Non MES (+DSI) Non MES (+DSI)

EGEE-II INFSO-RI Enabling Grids for E-sciencE 10 Architecture Overview (cont.) Web Service Core Data server management Cache server (+gridftp server) Non MES+DSI Cache repository MES Server (GSI enabled) SRB storage space Non MES (+DSI) Non MES (+DSI) File catalog (AMGA) Return TURL Transfer to TURL Server name and path name

EGEE-II INFSO-RI Enabling Grids for E-sciencE 11 Architecture Overview (cont.) Web Service Core Data server management Cache server (+gridftp server) Storage space Cache repository MES Server (+GSI enabled) Storage space Non MES (+DSI) Non MES (+DSI) File catalog (AMGA) Upload a file(gridftp)

EGEE-II INFSO-RI Enabling Grids for E-sciencE 12 Architecture Overview (cont.) Web Service Core Data server management Cache server (+gridftp server) Storage space Cache repository MES Server (GSI enabled) Storage space Non MES (+DSI) Non MES (+DSI) File catalog (AMGA) put file done SRBInfoServer InfoServer SURL -> Path Update status of this file Update all resource information Query InfoServe r Query SRBInfoServer Query InfoServer

EGEE-II INFSO-RI Enabling Grids for E-sciencE 13 Architecture Overview (cont.) Web Service Core Data server management Cache server (+gridftp server) Storage space Cache repository MES Server (GSI enabled) Storage space Non MES (+DSI) Non MES (+DSI) File catalog (AMGA) The Status of PutDone SRBInfoServer InfoServer Return the Status of PutDone Update status of each resource Resource info Logical Resource info Resource info

EGEE-II INFSO-RI Enabling Grids for E-sciencE 14 Support Flexible File/Space Types SRM system has a caching mechanism and has to take care of SRM issues like file lifetime, space management, …,etc. –Volatile space –Durable space –Permanent space In our implementation –Volatile space & Durable space  In the temporary storage space. –Permanent space  In the SRB env.

EGEE-II INFSO-RI Enabling Grids for E-sciencE 15 Checking Disk Status How to get the disk usage of the space? –Need to know the free and used space on cache server and SRB server –SRB does not provide the mechanism to monitor resource usage –We need to know the usage  Space management In our implementation –InfoServer:  Deployed on cache server and non-Mcat enabled SRB server –SRBInfoServer:  Deployed on Mcat-enabled SRB server

EGEE-II INFSO-RI Enabling Grids for E-sciencE 16 Naming Mapping between SURL, TURL, and file name –Different naming space on cache server and SRB server  /XXX/YYY/ZZZ: cache server  /Zone/home/UserAccount.UserDomain/: SRB server  How to map them?? In our implementation –Use AMGA to provide uniform naming space  /XXX/YYY/ZZZ –How to map to TURL  On cache server Use the mounted point gotten from InfoServer and use this mounted point to store user’s data, then reply TURL  On SRB server Try to login and get user’s home collection. Use user’s home collection to store user’s data, then reply TURL.

EGEE-II INFSO-RI Enabling Grids for E-sciencE 17 Typical Use Case

EGEE-II INFSO-RI Enabling Grids for E-sciencE 18 Testbed deployment Hostname: fct01.grid.sinica.edu.tw The end point: httpg://fct01.grid.sinica.edu.tw:8443/axis/services/srm Info: Cache server (gridftp server) and SRM interface Hostname: t-ap20.grid.sinica.edu.tw Info: SRB server (SRB-DSI installed) User InterfaceUser Interface SURL Gridftp/manage ment commands Return some information TURL Host information File information Hostname:t-ap51.grid.sinca.edu.tw Info: AMGA server

EGEE-II INFSO-RI Enabling Grids for E-sciencE 19 Demo Test the functions that we have implemented. Access data in –Cache server –SRB server Use LFC (just a simulated LFC host) to get SURL and use this SURL to connect to SRM server, then get TURL back.

EGEE-II INFSO-RI Enabling Grids for E-sciencE 20 Demo(1) Test the following functions –SrmPing –SrmGetTransferProtocols –SrmCheckPermission –SrmGetPermission –SrmMkdir –SrmRmdir –SrmPrepareToPut –SrmPutDone –SrmPrepareToGet –SrmReleaseFiles –SrmRm

EGEE-II INFSO-RI Enabling Grids for E-sciencE 21 Demo(2) Put/get data in/from cache env through SRM Put/get data in/from cache SRB env through SRM

EGEE-II INFSO-RI Enabling Grids for E-sciencE 22 User needs to store data in SE (from a UI) 1.Create a new LFN entry in LFC, return a SURL. 2.srmPrepateToPut (SURL) 3.Transfer the file 4.srmPutDone (SURL) Demo(3)

EGEE-II INFSO-RI Enabling Grids for E-sciencE 23 User needs to retrieve (onto the UI) data stored into SE 1.Query the file catalog to retrieve the SURL from the LFN. 2.srmPrepateToGet (SURL) 3.Transfer the file (read) 4.srmReleaseFile (SURL) Demo(3)

EGEE-II INFSO-RI Enabling Grids for E-sciencE 24 Progress What we have done: –We have finished the parts about authorizing users, web service interface (just interface) and gridftp deployment, and SRB-DSI, and some functions like directory functions, permission functions,..,etc. What we are doing: –Now, we are focusing on the implementation of our core (data transfer functions and space management).

EGEE-II INFSO-RI Enabling Grids for E-sciencE 25 Future Plan Short term –Implement interface compatible SRM version 2.2. Mid-term –Interact and test with other data management systems such as Castor, dCache, …,etc. Long-term –Interoperate with gLite.  Hope glite users who join VO can access files and resources in SRB and vice versa.

EGEE-II INFSO-RI Enabling Grids for E-sciencE 26 Future Plan

EGEE-II INFSO-RI Enabling Grids for E-sciencE 27 Future Plan SRB SRM

EGEE-II INFSO-RI Enabling Grids for E-sciencE 28 References SRM-SRB interface project wiki page – SRM working group: – SRB: – Globus: – CoG: – Axis: –

EGEE-II INFSO-RI Enabling Grids for E-sciencE 29 Thanks for your attention