1 Meeting Location: LBNL Sept 18, 2003 The functionality of a Replica Registration Service Attendees Michael Haddox-Schatz, JLAB Ann Chervenak, USC/ISI.

Presentation transcript:

1 Meeting Location: LBNL, Sept 18, 2003
The functionality of a Replica Registration Service
Attendees:
- Michael Haddox-Schatz, JLAB
- Ann Chervenak, USC/ISI
- Jean-Philippe Baud, CERN
- Timur Perelmutov, Fermi
- Don Petravick, Fermi
- Doug Olson, LBNL
- Alex Sim, LBNL
- Arie Shoshani, LBNL

2 Generalized Replica Management Service
- Replicate {LFN} → site
    - Lookup LFNs – find out which RLS to use
    - Planning: select PFNs / SURLs for accessing files
    - Allocate space
    - Move files robustly
    - Register files

3 Simple Replica Management: focus on three services
- Dynamic Space Allocation
    - Now space is pre-allocated
    - Future: request space reservation
- Multi-file replication
    - Managing the entire multi-file request
    - Use file transfer services
    - Guarantee robustness
- Replica Registration
    - Into one or more catalogs
    - Can register into standard Replica Catalogs
    - Can register into specialized experiment file catalogs

4 Replica Management Architecture
[Diagram. Components shown: Client; Replica Management Service; Multi-file Replication Service; Replica Registration Service; File Transfer Services; Replica/File catalog Services]

5 Replica Management for STAR Experiment
[Diagram. Components shown: Client; Replica Management Service; Replica Registration Service; GridFTP (bbftp); STAR File Catalog; DataMover; HRM]

6 Replica Management for STAR Experiment
[Diagram. Components shown: Client; Replica Registration Service; GridFTP (bbftp); STAR File Catalog; DataMover; HRM]

7 Generalizing the Concept: Need to Standardize APIs
[Diagram. Components shown: Client; Replica Management Service; and the four services below]
- Multi-file Replication Service (MRS): SRMs, Magda, SRB, …
- Replica Registration Service (RRS)
- File Transfer Services: GridFTP, bbftp, scp, …
- Replica/File catalog Services: RLS, MCAT, STAR-FC, ATLAS-FC, …
Interfaces to standardize:
- Multi-file Replication API / Multi-file Replication spec
- Replica Registration API / Replica Registration spec
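A standardized Replica Registration API over heterogeneous catalogs could be pictured as a small adapter interface, with one adapter per catalog type. The sketch below is only an illustration: the names `RepCatPlugin` and `InMemoryRepCat` and both method signatures are assumptions, not part of any spec discussed at the meeting.

```python
from abc import ABC, abstractmethod


class RepCatPlugin(ABC):
    """Hypothetical adapter that each catalog type (RLS, MCAT, STAR-FC, ...) would implement."""

    @abstractmethod
    def register(self, lfn, pfn):
        """Record that logical file `lfn` has a replica at physical location `pfn`."""

    @abstractmethod
    def unregister(self, lfn, pfn):
        """Remove the `lfn` -> `pfn` mapping if it is present."""


class InMemoryRepCat(RepCatPlugin):
    """Toy catalog used here in place of a real replica catalog."""

    def __init__(self):
        self.mappings = set()  # set of (lfn, pfn) pairs

    def register(self, lfn, pfn):
        # Assumption: a duplicate LFN-PFN pair is rejected as an error.
        if (lfn, pfn) in self.mappings:
            raise ValueError(f"already registered: {lfn} -> {pfn}")
        self.mappings.add((lfn, pfn))

    def unregister(self, lfn, pfn):
        self.mappings.discard((lfn, pfn))
```

With an interface of this shape, the RRS would only call `register` and `unregister`, and the differences between catalogs would stay inside each adapter.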

8 Three use cases
- Client generated a bunch of files
    - Call a RepReg service to register them in one or more RepCats
- Client wants to replicate a bunch of files (or a directory)
    - Call RepReg with different modes
- Client wants to delete one or more files
    - Should a RepReg service clean up the catalogs?
- Question: how are the RepCats chosen?
    - We propose to use the MDS for that
- Question: should RRS be a separate (external) service?
    - Yes, because it is too much to impose on underlying RepCats
    - Yes, because it is a common functionality to all RepCats
    - Yes, because it has value as a separate service

9 Replica Management Architecture
[Diagram. Components shown: Client; Replica Management Service; Multi-file Replication Service; Replica Registration Service; File Transfer Services; Replica/File catalog Services; Monitoring & Discovery Service. Labels: Default RepCats; Find RepCats (optional)]

10 Functionality of Replica Registration
- Registration modes
    - File-at-a-time registration
        - Mode 1: register all the files that were replicated successfully, and report which failed.
        - Mode 2: stop the registration process if there is a failure in registration, and report which failed.
    - Global registration: after the entire multi-file replication has finished
        - Mode 3: register only if all file replications are successful.
        - Mode 4: register all files that were successfully replicated, and report which failed, if any.
    - Failures are in terms of registering to a RepCat
        - E.g. RepCat database is full or down
    - Assume: immutable files; an LFN-PFN duplication is an error
- Register to any catalogs
    - Standard catalogs – e.g. RLS
    - Specialized catalogs – e.g. experiments' file catalogs
    - Issue: need a standard for the RepCat interface
    - In the meantime we need to have a plug-in type service
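The four registration modes can be compared side by side in a short sketch. Everything named here (`Mode`, `RegistrationError`, `register_files`, and the `register` callback) is hypothetical and chosen for illustration. The sketch is synchronous, so it does not capture the timing difference between file-at-a-time and global registration, and it does not model roll back.

```python
from enum import Enum


class RegistrationError(Exception):
    """Raised by the (hypothetical) catalog call when a registration fails."""


class Mode(Enum):
    FILE_REPORT = 1         # mode 1: register each replicated file, report failures
    FILE_STOP = 2           # mode 2: stop at the first registration failure
    GLOBAL_ALL_OR_NONE = 3  # mode 3: register only if every replication succeeded
    GLOBAL_REPORT = 4       # mode 4: register what was replicated, report failures


def register_files(mode, replicated, register):
    """Register replicated files according to `mode`.

    replicated: dict mapping LFN -> True if that file was replicated successfully.
    register:   callable taking an LFN; raises RegistrationError on failure.
    Returns (registered_lfns, failed_lfns); failures are registration failures.
    """
    registered, failed = [], []
    if mode is Mode.GLOBAL_ALL_OR_NONE and not all(replicated.values()):
        return registered, failed  # at least one replication failed: register nothing
    for lfn, ok in replicated.items():
        if not ok:
            continue  # never register a file that was not replicated
        try:
            register(lfn)
            registered.append(lfn)
        except RegistrationError:
            failed.append(lfn)
            if mode is Mode.FILE_STOP:
                break  # mode 2 abandons the remaining files
    return registered, failed
```

In this simplified form modes 1 and 4 behave identically; in a real service they would differ in when registration happens relative to the multi-file replication.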

11 Functionality of Replica Registration (Cont’d)
- Queue registration requests
    - Request token
    - Multi-file reg capability
    - Permit multiple registration modes
        - File-at-a-time (as quickly as you can): mode 1 or mode 2
        - Bulk (do it all or do none => roll back): mode 3 or mode 4
    - Failure: no-response, no space, file-already-registered
    - Permission – security: GSI proxy + user name for now
    - Access control – specify r/w for catalog (limited today, ACLs later)
    - Abort and roll back registration
    - Commit registration
    - Global reg request (commit implied)
- Robustness
    - Recover from transient failures
    - Retry per request – with time limit (in seconds)
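The robustness item (retry per request, with a time limit in seconds) might look like the following minimal sketch. `TransientError`, `retry_with_time_limit`, and the fixed pause between attempts are assumptions made for illustration; the slides do not say how retries are spaced.

```python
import time


class TransientError(Exception):
    """Hypothetical marker for failures worth retrying (e.g. RepCat not responding)."""


def retry_with_time_limit(operation, time_limit_s, pause_s=0.1,
                          clock=time.monotonic, sleep=time.sleep):
    """Call `operation` until it succeeds or the time limit would be exceeded.

    `clock` and `sleep` are injectable so the function can be exercised
    without real waiting.
    """
    deadline = clock() + time_limit_s
    while True:
        try:
            return operation()
        except TransientError:
            # Give up if another pause would carry us past the deadline.
            if clock() + pause_s > deadline:
                raise TimeoutError("registration request timed out")
            sleep(pause_s)
```

Only `TransientError` is retried here; a permanent failure such as file-already-registered would propagate to the caller immediately.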

12 Functionality of Replica Registration (Cont’d)
- Register to multiple catalogs simultaneously
    - Ability to specify which catalogs to register to
    - Refer to RepCats by URLs
- Manage multiple registration requests for multiple clients
    - Implementation dependent

13 Functionality of Replica Registration (Cont’d)
- Asynchronous service
    - Provide dynamic status on registration
        - Success, failed, pending – per file per RepCat
            - Failed: already-in-catalog, ill-formed (keep RepCat error string), not-authorized
        - RepCat not responding – per RepCat
        - RepCat database full – per RepCat
        - Waiting for commit – per request
        - Request timed-out – per request
        - Full success / partial success (but completed) – per request
    - Provide statistics and failure information
        - Summary per request (history activity information)
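One way to hold the per-file, per-RepCat status behind an asynchronous request token is sketched below. The class and field names are hypothetical, and the "failed" summary for a request in which nothing succeeded is an assumption, since the slides name only full and partial success.

```python
from dataclasses import dataclass, field
from enum import Enum


class FileStatus(Enum):
    PENDING = "pending"
    SUCCESS = "success"
    FAILED = "failed"


@dataclass
class RequestStatus:
    """Status record for one registration request, looked up by its token."""
    token: str
    # Maps (lfn, repcat_url) -> FileStatus, i.e. status per file per RepCat.
    per_file: dict = field(default_factory=dict)
    # Maps (lfn, repcat_url) -> reason string, e.g. "already-in-catalog".
    reasons: dict = field(default_factory=dict)

    def summary(self):
        """Collapse the per-file statuses into one per-request summary."""
        statuses = list(self.per_file.values())
        if not statuses or FileStatus.PENDING in statuses:
            return "pending"
        if all(s is FileStatus.SUCCESS for s in statuses):
            return "full success"
        if any(s is FileStatus.SUCCESS for s in statuses):
            return "partial success"
        return "failed"  # assumption: the slides do not name this case
```

A client polling with its request token would read `summary()` for the request-level answer and `per_file` plus `reasons` for the details.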

14 Un-register
- Problem: want to delete a file and un-register it from all RepCats that know about it.
    - 1) Need to find out where the file is registered, or 2) broadcast to all RepCats in the VO
    - Prefer 2) because we avoid keeping a state of where files are registered
- Who issues the notification to un-register a file?
    - SRMs – need to have permission
    - Maybe RepCats should check with SRM before removal
- Should this be a service of RRS?
    - Probably, but not in first version
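Option 2, broadcasting the un-register to every RepCat in the VO, can be sketched as a simple loop that keeps no record of where a file was registered. The function name and the representation of each catalog as a plain dict are assumptions made for illustration only.

```python
def broadcast_unregister(lfn, repcats):
    """Ask every known RepCat to drop `lfn`.

    repcats: dict mapping a RepCat URL -> catalog, where each catalog is
             modelled as a dict of LFN -> set of PFNs.
    Returns the list of RepCat URLs that actually held an entry for `lfn`.
    """
    removed_from = []
    for url, catalog in repcats.items():
        # pop() with a default makes the call harmless for catalogs
        # that never knew about this file.
        if catalog.pop(lfn, None) is not None:
            removed_from.append(url)
    return removed_from
```

The returned list is only a report of what happened; nothing is stored between calls, which matches the stated reason for preferring the broadcast approach.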