INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org Data Management + Practical Ruediger Berlich / Forschungszentrum Karlsruhe Mike Mineter /


INFSO-RI Enabling Grids for E-sciencE
Data Management + Practical
Ruediger Berlich / Forschungszentrum Karlsruhe
Mike Mineter / National e-Science Centre
Slides contributed by INFN Catania, NeSC Edinburgh
Brisbane

Enabling Grids for E-sciencE INFSO-RI EGEE Tutorial, Brisbane
Outline
- Introduction to Data Management
  - Storage Element
  - Data services in gLite
  - Catalogs
  - File Transfer
- Practicals
  - Data Management (LFC and FiReMan): commands, examples and exercises
  - Explore the GILDA Testbed

User Interface Access
- Host: glite-tutor.ct.infn.it
- Username: brisbaneXX (XX=01…30)
- Password: GridBRIXX (XX=01…30)
- PassPhrase: BRISBANE
- GENIUS: Please use the same logins as yesterday!

Introduction to Data Management

Storage Element in gLite
- Storage back-end with all the associated hardware and drivers
- SRM service implementation on the given storage
- Transfer service for a (set of) transfer protocol(s)
- gLite POSIX-like File I/O service
- Auxiliary Security and Logging services
  - If the SE supports ACLs (extensions to POSIX-like access control, e.g. multiple groups), the SE accesses the user and group data in the VOMS proxy
  - Optional logging and accounting services
- Currently, Mass Storage Systems:
  - Castor, dCache

SRM
[Diagram: a client side (user/applications, grid middleware) talking through SRM interfaces to storage systems. Labels shown: Enstore, JASMine, dCache, Castor, SE, CCLRC RAL, SRB. Caption: currently supported, via SRM, by gLite.]

Why use SRM?
- Becoming the standard for data management in grids (GGF working group)
- Will also allow gLite to provide data scheduling
  - Before jobs run
  - For file transfer
- wg/doc/ggf10.DataWorkshop.Arie.SRM.interface.pdf

Data services in gLite
- File access patterns:
  - Write once, read many
  - Rare append-only with one owner
  - Frequently updated at one source; replicas check/pull the new version
  - (NOT frequent updates, many users, many sites)
- 3 service types for data:
  - Storage
  - Catalogs
  - Movement

File names and identifiers in gLite
- Logical filename (LFN): unique, e.g. .glite/myVO.org/run22/ … (users need only see these)
- Globally unique identifier (GUID)
- Site URL (SURL)
- Transport URL (TURL): includes protocol
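The naming levels above show up later in the practicals as URL-style prefixes (lfn:, guid:, sfn:, srm:, gsiftp:, file:). As a rough illustration only, the sketch below classifies a name by its prefix. The function grid_name_kind is a hypothetical helper, not part of gLite or lcg-utils, and treating gsiftp: as the only TURL scheme is an assumption made for the example.

```shell
#!/bin/sh
# Hypothetical helper (not part of gLite or lcg-utils): classify a grid
# file name by its scheme prefix, using the prefixes seen in these slides.
grid_name_kind() {
    case "$1" in
        lfn:*)        echo "LFN" ;;     # logical file name
        guid:*)       echo "GUID" ;;    # globally unique identifier
        srm:*|sfn:*)  echo "SURL" ;;    # site URL (SRM or classical SE)
        gsiftp:*)     echo "TURL" ;;    # transport URL (assumption: GridFTP only)
        file:*)       echo "local" ;;   # file on the UI or worker node
        *)            echo "unknown" ;;
    esac
}

grid_name_kind "lfn:/grid/gilda/user.example"
```

Calling the helper on an lfn: name prints LFN; a name without a recognised prefix prints unknown.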

I/O server interactions
[Diagram: components are labelled as either provided by the site or provided by the VO.]

Cataloguing Requirements
- Catalogs built based on requirements from HEP experiments and the Biomedical EGEE community
- Started design from AliEn File Catalog
  - Logical namespace management
  - Virtual Filesystem view (DataSets via directory hierarchy)
  - Support Metadata attached to files
  - Bulk Operations
  - Strong security: basic unix permissions and fine-grained ACLs (i.e. not just directory but file granularity)
  - Support flexible deployment models
    - Single central catalog model
    - Site local catalogs connected to a single central catalog model
    - Site local catalogs without single central catalog model
  - Scalable to many clients and to a large number of entries; address performance issues seen with EDG RLS

Catalogs
- FiReMan
  - FiReMan = File and Replica Manager
    - Also interfaces to metadata catalog
  - Implements all file management interfaces
    - Using replica catalog: manage replicas using GUID
- File Authorization Service (FAS)
  - Request authorisation, based on the DN and the groups from the user's delegated credentials
  - The FAS and Catalog interfaces are implemented by the same service
- Metadata Catalog: not yet!
  - Metadata are application specific
  - All files in a directory have the same schema
  - (Many directories can share a schema)

gLite Catalog Releases
- FiReMan Catalog
  - Release 1: Single Central deployment model only
  - Release 2: Distributed catalog according to design, using Java Messaging Services to propagate updates between catalog instances
- Storage Index
  - Already in Release 1
  - Main interaction point with Workload Management
- Metadata Catalog
  - Release 1: Base implemented by FiReMan
  - Also a standalone service, single central instance
  - Release 2: distribution using a messaging infrastructure

Data Movement
- File movement is asynchronous: submit a job
  - Held in file transfer queue
- Data scheduler
  - Single service per VO; can be distributed
  - VO can apply policies (priorities, preferred sites, recovery modes, ...)
- Client interfaces:
  - Browser
  - APIs
  - Web service
- "File transfer"
  - Uses SURL
- "File placement"
  - Uses LFN or GUID, accesses Catalogues to resolve them

DMS Status
- gLite DMS is being released in phases
  - File placement and transfer in August
  - Metadata en route
  - Data scheduling later
- High energy physicists urgently needed better functionality than the LCG file management middleware
- Interim solution developed: LFC
  - LFC = LCG File Catalogue
  - LCG = LHC Computing Grid
  - LHC = Large Hadron Collider
- Effect: 2 data management middlewares temporarily:
  - gLite (emerging), LFC (interim; production, GILDA)

Summary
[Diagram labels: "Trigger and monitor transfer"; "Look up, register, authorise".]

For More Information
- JRA1 Data Management homepage
- EGEE Middleware Architecture and Planning
- gLite FiReMan user guide
  - Overview
  - Command Line tools
  - C/C++ API
  - Java API

Practicals on Data Management (LFC and lcg-utils)

Set up your environment
- Set the following environment variables to specify the catalog type and its location:
    export LCG_CATALOG_TYPE=lfc
    export LFC_HOST=lfc-gilda.ct.infn.it
- Ensure you have created a proxy certificate and that it is still valid. If not, create it with:
    voms-proxy-init --voms gilda
  - Remember: the passphrase is BRISBANE
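Before running any catalog command it can help to confirm that both variables are actually set in the current shell. The sketch below is one possible pre-flight check; check_lfc_env is a hypothetical helper name, and the check only looks at the two variables named on this slide (it does not validate the proxy).

```shell
#!/bin/sh
# Hypothetical pre-flight check: report which catalog variables are unset.
# Returns 0 when both are set, 1 otherwise.
check_lfc_env() {
    missing=0
    for name in LCG_CATALOG_TYPE LFC_HOST; do
        # Indirect lookup of the variable whose name is in $name.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "$name is not set" >&2
            missing=1
        fi
    done
    return "$missing"
}

# Values taken from the slide.
export LCG_CATALOG_TYPE=lfc
export LFC_HOST=lfc-gilda.ct.infn.it
check_lfc_env && echo "environment ready"
```

If either variable is missing, the helper names it on standard error and returns a non-zero status, so it can guard later commands with `&&`.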

LFC Catalog commands: listing the entries of an LFC directory
lfc-ls [-cdiLlRTu] [--comment] path…
where path specifies the LFC pathname (mandatory)
- Remember that LFC has a directory tree structure
- /grid/ / (slide labels: "LFC Namespace", "Defined by the user")
- All members of a given VO have read-write permissions under their directory
- The -l option (a lowercase "L") outputs a long listing
- The -R option lists the contents of directories recursively (don't use it too often!)
- You can set LFC_HOME to use relative paths:
    LFC_HOME=/grid/gilda/myDir
    /grid/gilda/myDir/myFile becomes myFile
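The effect of LFC_HOME on relative paths can be pictured with plain string handling. This is only an illustration of the rule stated on the slide, not how the LFC client is implemented; resolve_lfc_path is a hypothetical name, and the sketch assumes LFC_HOME is set.

```shell
#!/bin/sh
# Hypothetical illustration: show the absolute LFC path that a relative
# path stands for once LFC_HOME is set. Absolute paths pass through unchanged.
resolve_lfc_path() {
    case "$1" in
        /*) echo "$1" ;;
        # Drop one trailing slash from LFC_HOME so "/grid/gilda/" also works.
        *)  echo "${LFC_HOME%/}/$1" ;;
    esac
}

LFC_HOME=/grid/gilda/myDir
resolve_lfc_path myFile
```

With the value above, myFile resolves to /grid/gilda/myDir/myFile, matching the slide.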

lfc-ls examples
$ lfc-ls -l /grid/gilda/user.example
Examples:
> lfc-ls /grid/gilda
> lfc-ls -l /grid/gilda
> lfc-ls -l -R /grid/gilda
...
-rw-rw-r Jun 21 09:40 tutor02-rel-pippo-pluto
-rw-rw-r Jun 21 09:39 tutor14
-rw-rw-r Jun 21 09:40 tutor16-mytxt
-rw-rw-r Jun 21 09:32 unitprot-ibcp02
-rw-rw-r Jun 21 09:36 uploadfile
-rw-rw-r Jun 21 09:36 uploadfilelfn
-rw-rw-r Jun 21 09:38 user.example
-rw-rw-r Jun 21 09:38 user.example2
-rw-rw-r Jun 21 09:40 valencia15.ejemplo
-rw-rw-r Jun 21 09:40 valencia15.example
...
$ export LFC_HOME=/grid/gilda/
$ lfc-ls -l user.example
-rw-rw-r Jun 21 09:38 /grid/gilda/user.example

LFC Catalog commands: creating a symbolic link
lfc-ln -s file linkname
lfc-ln -s directory linkname
Creates a link to the specified file or directory with linkname
- Example:
    $ lfc-ln -s /grid/gilda/user.example /grid/gilda/brisb/linkToUser.ex
- Let's check the link using lfc-ls with long listing (-l):
    $ lfc-ls -l /grid/gilda/brisb
    lrwxrwxrwx Jul 17 12:06 linkToUser.ex -> /grid/gilda/user.example
  (slide labels: linkToUser.ex is the symbolic link, /grid/gilda/user.example the original file)

LFC Catalog commands: creating directories in the LFC
lfc-mkdir [-m mode] [-p] path...
where path specifies the LFC pathname
- Remember that when registering a new file (using lcg-cr, for example), the destination directory must already exist in the catalog
- Example:
    $ lfc-mkdir /grid/gilda/Examples
- You can check the directory with:
    $ lfc-ls -l /grid/gilda

LFC Catalog commands: adding/deleting metadata information
lfc-setcomment path comment
lfc-delcomment path
- lfc-setcomment adds/replaces a comment associated with a file/directory in the LFC Catalog
- lfc-delcomment deletes a comment previously added
Example:
    lfc-setcomment /grid/gilda/user.example "Hello Brisbane"
Check the result with:
    lfc-ls --comment /grid/gilda/user.example
    /grid/gilda/user.example Hello Brisbane

LFC Catalog commands
Example:
    lfc-delcomment /grid/gilda/user.example
Check the result with:
    lfc-ls -l --comment /grid/gilda/user.example
    -rw-rw-r Jun 21 09:38 /grid/gilda/user.example

Hands-on Session
Exercise No.1:
- Log onto a UI and initialize your proxy credentials if not already done
- Check and, if needed, set the environment variables so that you use the lfc-gilda.ct.infn.it catalog
- Have a look inside the catalog
- Create a directory with your surname (or a unique name)
- Put a link to an existing file inside the directory you just created
- Add a comment to that file and verify it

Summary of the LFC Catalog commands
lfc-chmod        Change access mode of the LFC file/directory
lfc-chown        Change owner and group of the LFC file/directory
lfc-delcomment   Delete the comment associated with the file/directory
lfc-getacl       Get file/directory access control lists
lfc-ln           Make a symbolic link to a file/directory
lfc-ls           List file/directory entries in a directory
lfc-mkdir        Create a directory
lfc-rename       Rename a file/directory
lfc-rm           Remove a file/directory
lfc-setacl       Set file/directory access control lists
lfc-setcomment   Add/replace a comment

lcg-utils
- The LCG Data Management tools (usually called lcg-utils) allow users to copy files between UI, CE, WN and an SE, to register entries in the File Catalogs and to replicate files between SEs.
- Check that the LCG_GFAL_INFOSYS environment variable is correctly set to the local GILDA Information Index (BDII):
    export LCG_GFAL_INFOSYS=grid004.ct.infn.it:2170

lcg-utils: lcg-cr
Upload a file to an SE and register it in the catalog
lcg-cr -d dest_file | dest_host [-g guid] [-l lfn] [-v | --verbose] --vo vo src_file
where
- dest_host is the fully qualified hostname of the destination SE
- dest_file is a valid SURL (both sfn:// and srm:// formats are valid)
- guid specifies the Grid Unique IDentifier. If this option is not present, a GUID is generated internally
- lfn specifies the Logical File Name associated with the file
- vo specifies the Virtual Organization the user belongs to
- src_file specifies the source file name: the protocol can be file:/// or gsiftp:///
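To see how the pieces of the synopsis fit together without touching the grid, one option is a dry-run helper that only prints the command line. The sketch below assumes a destination given as an SE hostname and a local source file given as an absolute path; build_lcg_cr is a hypothetical name, and only options that appear in the synopsis above are used.

```shell
#!/bin/sh
# Hypothetical dry-run helper: print an lcg-cr command line instead of
# running it. Arguments:
#   $1 = VO, $2 = destination SE host, $3 = LFN path, $4 = absolute local path
build_lcg_cr() {
    echo "lcg-cr -v -d $2 -l lfn:$3 --vo $1 file://$4"
}

build_lcg_cr gilda grid-se.bio.dist.unige.it \
    /grid/gilda/ruediger/note.txt /home/local/note.txt
```

Because the local path starts with a slash, the printed source URL has the file:/// form that the synopsis describes.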

lcg-utils: lcg-cr
To discover which SEs you are allowed to use, remember that you can use the lcg-infosites command:
    lcg-infosites --vo gilda se
The output is a list of SEs and related information on available/used space.
lcg-cr usage example:
    $ lcg-cr -v -d grid-se.bio.dist.unige.it -l lfn:/grid/gilda/ruediger/note.txt --vo gilda file:///home/local/note.txt
    Using grid catalog type: lfc
    Source URL: file:///home/local/note.txt
    File size: 51
    Destination specified: grid-se.bio.dist.unige.it
    Destination URL for copy: gsiftp://grid-se.bio.dist.unige.it/flatfiles/SE00/gilda/generated/ /file1f0e73d8-7e3f-47d1-bc95-c03c92aae569
    # streams: 1
    Alias registered in Catalog: lfn:/grid/gilda/SEOUL/note.txt
    Transfer took ms
    Destination URL registered in Catalog: sfn://grid-se.bio.dist.unige.it/flatfiles/SE00/gilda/generated/ /file1f0e73d8-7e3f-47d1-bc95-c03c92aae569
    guid:4c10a8e c38-bc98-ed98ae7cb94e

lcg-utils: lcg-aa and lcg-la
Adding an alias for a given GUID
lcg-aa --vo vo guid lfn
where
- vo specifies the Virtual Organization the user belongs to
- guid specifies the Grid Unique IDentifier of the file you want to add the alias to
- lfn specifies the new alias
Example:
    $ lcg-aa --vo gilda guid:4c10a8e c38-bc98-ed98ae7cb94e lfn:/grid/gilda/SEOUL/aliasToNote.txt
To check whether the previous command was successful, you can use the lcg-la command to list the aliases for a given LFN, GUID or SURL:
    $ lcg-la --vo gilda lfn:/grid/gilda/seoul/aliasToNote.txt
    lfn:/grid/gilda/seoul/note.txt
    lfn:/grid/gilda/seoul/aliasToNote.txt

Hands-on session
Exercise No.2:
- Verify that your LCG_GFAL_INFOSYS is correctly set up
- Create a dummy file
- Check the available storage elements
- Copy and register the file into the directory you created earlier
- Add an alias to the file you just uploaded
- Check that the alias was assigned correctly

lcg-utils commands for replicas (I)
Copying a file from one SE to another and registering it in the Catalog
lcg-rep -d dest_file | dest_host [-v | --verbose] --vo vo src_file
where
- dest_host is the fully qualified hostname of the destination SE
- dest_file is a valid SURL (both sfn:// and srm:// are valid)
- vo specifies the Virtual Organization the user belongs to
- src_file specifies the source file name: the protocol can be LFN, GUID or SURL. An SURL scheme can be sfn: for a classical SE or srm:
Example:
    $ lcg-rep -v -d grid009.ct.infn.it --vo gilda lfn:/grid/gilda/seoul/note.txt
    Using grid catalog type: lfc
    Source URL: lfn:/grid/gilda/seoul/note.txt
    File size: 51
    Destination specified: grid009.ct.infn.it
    Source URL for copy: gsiftp://grid-se.bio.dist.unige.it/flatfiles/SE00/gilda/generated/ /file1f0e73d8-7e3f-47d1-bc95-c03c92aae569
    Destination URL for copy: gsiftp://grid009.ct.infn.it/flatfiles/SE00/gilda/generated/ /file4f3b4cb2-b5fe-467e-9a3e-1ef602465a17
    # streams: 1
    Transfer took 2410 ms
    Destination URL registered in LRC: sfn://grid009.ct.infn.it/flatfiles/SE00/gilda/generated/ /file4f3b4cb2-b5fe-467e-9a3e-1ef602465a17

lcg-utils commands for replicas (II)
Listing the replicas for a given LFN, GUID or SURL
lcg-lr --vo vo file
where
- vo specifies the Virtual Organization the user belongs to
- file specifies the Logical File Name, the Grid Unique IDentifier or the Site URL. An SURL scheme can be sfn: for a classical SE or srm:
Example:
    $ lcg-lr --vo gilda lfn:/grid/gilda/SEOUL/note.txt
    sfn://grid-se.bio.dist.unige.it/flatfiles/SE00/gilda/generated/ /file1f0e73d8-7e3f-47d1-bc95-c03c92aae569
    sfn://grid009.ct.infn.it/flatfiles/SE00/gilda/generated/ /file4f3b4cb2-b5fe-467e-9a3e-1ef602465a17
We get the same output using the GUID:
    $ lcg-lr --vo gilda guid:4c10a8e c38-bc98-ed98ae7cb94e

lcg-utils commands for replicas (III)
Deleting replicas
lcg-del [ -a ] [ -s se ] [ -v | --verbose ] --vo vo file
where
- -a is used to delete all replicas of the given file
- se specifies the SE from which you want to remove the replica
- vo specifies the Virtual Organization the user belongs to
- file specifies the Logical File Name, the Grid Unique IDentifier or the Site URL. An SURL scheme can be sfn: for a classical SE or srm:
Example: delete one replica
    $ lcg-del --vo gilda -s grid009.ct.infn.it lfn:/grid/gilda/seoul/note.txt
Delete all the replicas
    $ lcg-del -a --vo gilda lfn:/grid/gilda/seoul/note.txt
Let's check whether the previous command was successful
    $ lcg-lr --vo gilda lfn:/grid/gilda/seoul/note.txt
    lcg_lr: No such file or directory
or with lfc-ls /grid/gilda/seoul (you will no longer see note.txt or its alias)

lcg-utils: lcg-cp
Downloading a Grid file from an SE to a local destination
lcg-cp [ -v | --verbose ] --vo vo src_file dest_file
where
- vo specifies the Virtual Organization the user belongs to
- src_file specifies the source file name: the protocol can be LFN, GUID, SURL or local file. An SURL scheme can be sfn: for a classical SE or srm:
- dest_file specifies the destination. The protocol can be file:/// or gsiftp:///
Example:
    $ lcg-cp --vo gilda lfn:/grid/gilda/seoul/note2.txt file:/home/local/note2.txt
    Source URL: lfn:/grid/gilda/SEOUL/note2.txt
    File size: 51
    Source URL for copy: gsiftp://gilda-se-01.pd.infn.it/shared/gilda/generated/ /file06c3b28c-465f-489c-be3c-b68728e1ca16
    Destination URL: file:/home/local/note2.txt
    # streams: 1
    Transfer took 1060 ms

Hands-on session
Exercise No.3:
- Create two replicas of the file you previously uploaded (you can also refer to it by its alias)
- Check that the operation was successful
- Download the file back to your UI
- Delete just one replica and verify the result
- Delete all the replicas and verify the result
- Verify whether the entry is still in the catalog

Final exercise
GOAL: submit a job that does data management: it will retrieve a file previously registered in the catalog.
Steps to follow:
- Create a new file in your UI and put some data into it
- Choose an SE to upload the file to (hint: use lcg-infosites) and use the appropriate command to perform this operation
    (lcg-cr -v --vo gilda -l lfn:/grid/gilda/brisb/ -d file:`pwd`/ )
- Create a script.sh file with the following content:
    #!/bin/sh
    /bin/hostname
    #Change the LFN_NAME to download from the Catalog.
    echo "Start to download.."
    lcg-cp --vo gilda lfn:/grid/gilda/brisb/ file:`pwd`/output.dat
    echo "Done.."

Final exercise (II)
- Create the JobWithData.jdl:
    Type = "job";
    JobType = "Normal";
    Executable = "/bin/sh";
    Arguments = "script.sh";
    VirtualOrganisation = "gilda";
    StdOutput = "std.out";
    StdError = "std.err";
    InputSandbox = {"script.sh"};
    OutputSandbox = {"std.out","std.err","output.dat"};
- Submit it to the grid
- Retrieve the output and verify the content of output.dat
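One way to create the JDL file without an editor is a here-document, sketched below. The attribute values are copied from the slide; the JDL_FILE override variable is an assumption added for convenience, and the sketch stops before submission because the submit command is not given here.

```shell
#!/bin/sh
# Write the JDL from the slide to a file. JDL_FILE is a hypothetical
# override; by default the file is JobWithData.jdl in the current directory.
jdl_file="${JDL_FILE:-JobWithData.jdl}"

# The quoted 'EOF' keeps the shell from expanding anything inside the JDL.
cat > "$jdl_file" <<'EOF'
Type = "job";
JobType = "Normal";
Executable = "/bin/sh";
Arguments = "script.sh";
VirtualOrganisation = "gilda";
StdOutput = "std.out";
StdError = "std.err";
InputSandbox = {"script.sh"};
OutputSandbox = {"std.out","std.err","output.dat"};
EOF

echo "wrote $jdl_file"
```

The job's output.dat comes back through the OutputSandbox, so script.sh has to write that file into the job's working directory, as the lcg-cp line in the previous slide does.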

Summary of lcg-utils commands
Replica Management
lcg-cp    Copies a grid file to a local destination
lcg-cr    Copies a file to an SE and registers the file in the catalog
lcg-del   Deletes one file
lcg-rep   Replication between SEs and registration of the replica
lcg-gt    Gets the TURL for a given SURL and transfer protocol
lcg-sd    Sets file status to "Done" for a given SURL in an SRM request
File Catalog Interaction
lcg-aa    Adds an alias in LFC for a given GUID
lcg-ra    Removes an alias in LFC for a given GUID
lcg-rf    Registers in LFC a file placed in an SE
lcg-uf    Unregisters in LFC a file placed in an SE
lcg-la    Lists the aliases for a given SURL, GUID or LFN
lcg-lg    Gets the GUID for a given LFN or SURL
lcg-lr    Lists the replicas for a given GUID, SURL or LFN

Bibliography
Information on the file catalogs
- LFC, gfal, lcg-utils: "Evolution of LCG-2 Data Management" (J-P Baud, J. Casey)
- LFC installation, administration, migration from RLS:
  - Wiki entries indicated through the presentation:
    http://goc.grid.sinica.edu.tw/gocwiki/How_to_migrate_the_RLS_entries_into_the_LCG_File_Catalog_%28LFC%29
- LFC contacts:

Practicals on Data Management (FiReMan)

Browsing the catalog
> glite-catalog-ls
Main options:
  -l   long output (with permissions)
  -s   URL, specify the service endpoint (i.e. the catalog to use)
  -c   display the creation time instead of the modification time
  -g   also display GUIDs
Check all the options with -h
ACL: the first character is - for regular files, d for directories, l for symbolic links and v for virtual directories. p indicates the permission to change attributes, and d the right to delete the entry. The following 12 bits indicate, for user (u), group (g) and other (o), the permission to read, write, list the contents or execute the content. The last two are reserved for metadata use and are currently unused; they will show the right to get or set the metadata.
> glite-catalog-ls -l /test1103
-pdrwl-gs--r-l-g :01:55 /test1103

Upload and register a file: glite-put
glite-put [-m ] [-c ]
- glite-put copies a local file to a gLite I/O server and also updates the File Catalog
- The file is copied to the I/O server pointed to by the UI's I/O client
- The catalog updated is the one associated with that I/O server
> glite-put file2register lfn:///test1156
[glite_put] Total 0.00 MB |====================| % [0.0 Mb/s]
Transfer Completed:
LFN : /test1156
GUID : 0002bb56-7ce9-12db-bf0e-c0a70228beef
SURL : srm://glite-se.ct.infn.it:8443/srm/managerv1?SFN=/pnfs/ct.infn.it/data/gilda/test1156
Data Written [bytes] : 30
Eff.Transfer Rate[Mb/s] :

Check FC entries: glite-catalog-stat
glite-catalog-stat [options] URI
- glite-catalog-stat gives all the information on a catalog entry, including file/directory permissions, GUID, owner/group, SURL location…
- It is very useful for verifying that permissions and ownership are set correctly

Download a file: glite-get
glite-get [-c ]
- glite-get downloads a file locally from the I/O server pointed to by the UI's client
glite-get lfn:///emacs localcopy
[glite_get] Total 0.00 MB |====================| % [0.0 Mb/s]
Transfer Completed:
LFN : /emacs
GUID : 0032f db c0a70219beef
SURL : srm://lxcde08.pd.infn.it:8443/srm/managerv1?SFN=/pnfs/pd.infn.it/data/gilda/emacs
Data Written [bytes] : 237
Eff.Transfer Rate[Mb/s] :

Remove a file
glite-rm [-c ]
- glite-rm removes a file from the I/O server pointed to by the UI and also updates the file catalog
glite-rm lfn:///emacs
Unlink Completed:
File : lfn:///emacs
Time [s] :

Exercise
- Create a text file containing some data
- Copy and register the file, using your username as the remote file name
- Verify that the file was created correctly
- Download the file you have just created, changing its local file name
- Delete the file from the catalog
- Verify that the file was deleted

Questions?