INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.

Presentation transcript:

INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh

Enabling Grids for E-sciencE INFSO-RI DMS Overview EGEE Tutorial, Seoul,
Acknowledgments
- EGEE Middleware Architecture and Planning
- SRM slides derived from presentation by Andrew Smith (NeSC)
- Roberto Barbera, ISSGC05, Vico Equense, July

Outline
- Storage Element
- Data services in gLite
- Catalogs
- File Transfer

Storage Element in gLite
- Storage back-end with all the associated hardware and drivers
- SRM service implementation on the given storage
- Transfer service for a (set of) transfer protocol(s)
- gLite POSIX-like File I/O service
- Auxiliary Security and Logging services
  - If the SE supports ACLs (extensions to POSIX-like access control, e.g. multiple groups), the SE accesses the user and group data in the VOMS proxy
  - Optional logging and accounting services
- Current Mass Storage Systems:
  - Castor, dCache

SRM
[Diagram: Client, USER/APPLICATIONS and Grid Middleware reach storage through SRM interfaces. Labels shown: Enstore, JASMine, dCache, Castor, SE, CCLRC RAL, SRB. Caption: "Currently supported, via SRM, by gLite".]

Why use SRM?
- Becoming the standard for data management in grids
  - GGF working group
- Will also allow gLite to provide data scheduling
  - Before jobs run
  - For file transfer
- Reference: wg/doc/ggf10.DataWorkshop.Arie.SRM.interface.pdf

Data services in gLite
- File access patterns:
  - Write once, read many
  - Rare append-only with one owner
  - Frequently updated at one source; replicas check/pull the new version
  - (NOT frequent updates, many users, many sites)
- 3 service types for data:
  - Storage
  - Catalogs
  - Movement

File names and identifiers in gLite
- Logical filename (LFN): unique, e.g. .glite/myVO.org/run22/ … (the user need only see these)
- Globally unique identifier (GUID)
- Site URL (SURL)
- Transport URL (TURL): includes protocol
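
The layering of names on this slide can be illustrated with a minimal sketch. It assumes a toy in-memory catalog; `ToyCatalog`, `register`, `replicas`, `to_turl` and the example host names are hypothetical and are not part of any gLite API. In particular, a real storage element hands out transport URLs through SRM; the scheme swap below only stands in for that step.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class CatalogEntry:
    """One logical file: a GUID plus the site URLs (SURLs) of its replicas."""
    guid: str
    surls: List[str] = field(default_factory=list)


class ToyCatalog:
    """Hypothetical in-memory stand-in for a file catalog (not the gLite API)."""

    def __init__(self) -> None:
        self._by_lfn: Dict[str, CatalogEntry] = {}

    def register(self, lfn: str, guid: str, surl: str) -> None:
        """Bind an LFN to a GUID and record one more replica location."""
        entry = self._by_lfn.setdefault(lfn, CatalogEntry(guid))
        if entry.guid != guid:
            raise ValueError(f"{lfn} is already bound to GUID {entry.guid}")
        entry.surls.append(surl)

    def guid(self, lfn: str) -> str:
        return self._by_lfn[lfn].guid

    def replicas(self, lfn: str) -> List[str]:
        return list(self._by_lfn[lfn].surls)


def to_turl(surl: str, protocol: str) -> str:
    """Toy SURL -> TURL step: replace the scheme with a transfer protocol."""
    scheme, sep, rest = surl.partition("://")
    if not sep:
        raise ValueError(f"not a URL: {surl}")
    return f"{protocol}://{rest}"


catalog = ToyCatalog()
lfn = "/myVO.org/run22/data01"  # hypothetical logical filename
catalog.register(lfn, "guid-0001", "srm://se1.example.org/data/f1")
catalog.register(lfn, "guid-0001", "srm://se2.example.org/store/f1")
turl = to_turl(catalog.replicas(lfn)[0], "gsiftp")
```

The user works only with `lfn`; the GUID, the replica SURLs and the protocol-specific TURL are resolved behind that name.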

I/O server interactions
[Diagram: components labelled "Provided by site" and "Provided by VO".]

AuthN and AuthZ in data management
[Diagram: Client application, VOMS, MyProxy, Data service (transfer or I/O), File AuthZ service, SE.]
1. voms-proxy-init
2. myproxy-init
3. Service call (with delegation)
4. renew
Steps 5 and 6 are numbered on the diagram; further labels shown: Local AA, map user.
- Bulk interfaces: reduce impact of AA

Cataloguing Requirements
- Catalogs built based on requirements from HEP experiments and the Biomedical EGEE community
- Started design from AliEn File Catalog
  - Logical namespace management
  - Virtual Filesystem view (DataSets via directory hierarchy)
  - Support Metadata attached to files
  - Bulk Operations
  - Strong security: basic Unix permissions and fine-grained ACLs (i.e. not just directory but file granularity)
  - Support flexible deployment models
    - Single central catalog model
    - Site local catalogs connected to a single central catalog model
    - Site local catalogs without single central catalog model
  - Scalable to many clients and to a large number of entries; address performance issues seen with EDG RLS
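
The "basic Unix permissions plus fine-grained ACLs" requirement can be sketched as follows. This is only an illustration under stated assumptions: `FileEntry`, `allowed` and the additive per-group ACL model are hypothetical and do not reproduce how the gLite catalogs actually evaluate permissions.

```python
from dataclasses import dataclass, field
from typing import Dict, Set

READ, WRITE = 4, 2  # Unix-style permission bits


@dataclass
class FileEntry:
    """Per-file security record (hypothetical layout)."""
    owner: str   # owner identity, e.g. a certificate subject
    group: str   # owning group
    mode: int    # Unix mode bits, e.g. 0o640
    acl: Dict[str, int] = field(default_factory=dict)  # extra bits per group


def allowed(entry: FileEntry, user: str, groups: Set[str], wanted: int) -> bool:
    """Return True if `user`, a member of `groups`, holds the `wanted` bits."""
    if user == entry.owner:
        bits = (entry.mode >> 6) & 7
    elif entry.group in groups:
        bits = (entry.mode >> 3) & 7
    else:
        bits = entry.mode & 7
    # Assumption: ACL entries only add rights, and they apply to this one
    # file rather than to its whole directory (file granularity).
    for group in groups:
        bits |= entry.acl.get(group, 0)
    return (bits & wanted) == wanted


entry = FileEntry(owner="alice", group="atlas", mode=0o640, acl={"biomed": READ})
```

Here `mode=0o640` gives the owner read/write and the owning group read-only, while the ACL additionally lets members of the hypothetical `biomed` group read this one file.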

Catalogs
- Fireman
  - Fireman = File and Replica Manager
    - Also interfaces to metadata catalog
  - Implements all file management interfaces
    - Using replica catalog: manage replicas using GUID
- File Authorization Service
  - Request authorisation, based on the DN and the Groups from the user's delegated credentials
  - The FAS and Catalog interfaces are implemented by the same service
- Metadata Catalog: not yet!
  - Metadata are application specific
  - All files in a directory have the same schema
  - (Many directories can share a schema)
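
The metadata rules on this slide (one schema per directory, a schema shareable by many directories) can be sketched like this. `ToyMetadataCatalog` and all of its methods are hypothetical names chosen for illustration; the sketch says nothing about the real metadata catalog interface.

```python
import posixpath
from typing import Any, Dict


class ToyMetadataCatalog:
    """Hypothetical catalog: every file inherits the schema of its directory."""

    def __init__(self) -> None:
        self._schemas: Dict[str, Dict[str, type]] = {}   # schema name -> attributes
        self._dir_schema: Dict[str, str] = {}            # directory -> schema name
        self._values: Dict[str, Dict[str, Any]] = {}     # LFN -> attribute values

    def define_schema(self, name: str, attributes: Dict[str, type]) -> None:
        self._schemas[name] = dict(attributes)

    def attach(self, directory: str, schema_name: str) -> None:
        """Several directories may be attached to the same schema."""
        if schema_name not in self._schemas:
            raise KeyError(f"unknown schema: {schema_name}")
        self._dir_schema[directory] = schema_name

    def set_attributes(self, lfn: str, **values: Any) -> None:
        """Validate values against the schema of the file's directory."""
        schema = self._schemas[self._dir_schema[posixpath.dirname(lfn)]]
        for key, value in values.items():
            if key not in schema:
                raise KeyError(f"{key} is not in the directory schema")
            if not isinstance(value, schema[key]):
                raise TypeError(f"{key} must be {schema[key].__name__}")
        self._values.setdefault(lfn, {}).update(values)

    def get_attributes(self, lfn: str) -> Dict[str, Any]:
        return dict(self._values.get(lfn, {}))


meta = ToyMetadataCatalog()
meta.define_schema("run", {"energy_gev": float, "detector": str})
meta.attach("/myVO.org/run22", "run")
meta.attach("/myVO.org/run23", "run")   # a second directory sharing the schema
meta.set_attributes("/myVO.org/run22/f1", energy_gev=7.0, detector="A")
```

Because the schema is application specific, the attribute names and types above are invented purely for the example.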

gLite Catalog Releases
- FiReMan Catalog
  - Release 1: Single Central deployment model only
  - Release 2: Distributed catalog according to design, using Java Messaging Services to propagate updates between catalog instances
- Storage Index
  - Already in Release 1
  - Main interaction point with Workload Management
- Metadata Catalog
  - Release 1: base implemented by FiReMan
  - Also a standalone service, single central instance
  - Release 2: distribution using a messaging infrastructure

Data Movement
- File movement is asynchronous: submit a job
  - Held in file transfer queue
- Data scheduler
  - Single service per VO; can be distributed
  - VO can apply policies (priorities, preferred sites, recovery modes, ...)
- Client interfaces:
  - Browser
  - APIs
  - Web service
- "File transfer"
  - Uses SURL
- "File placement"
  - Uses LFN or GUID, accesses Catalogues to resolve them
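
A minimal sketch of the asynchronous model described here: submitting returns immediately with a queued job, "file transfer" takes SURLs directly, and "file placement" first resolves an LFN through a catalog. Everything below is an assumption for illustration (`TransferJob`, `TransferQueue`, the state names, and the rule that a lower number means higher priority); it is not the gLite File Transfer Service interface.

```python
import heapq
import itertools
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass(order=True)
class TransferJob:
    priority: int                      # lower number = served first (assumption)
    seq: int                           # tie-breaker: submission order
    source_surl: str = field(compare=False)
    dest_surl: str = field(compare=False)
    state: str = field(default="Submitted", compare=False)


class TransferQueue:
    """Hypothetical per-VO queue; the catalog is a plain LFN -> SURLs mapping."""

    def __init__(self, catalog: Dict[str, List[str]]) -> None:
        self._catalog = catalog
        self._heap: List[TransferJob] = []
        self._seq = itertools.count()

    def submit_transfer(self, source_surl: str, dest_surl: str,
                        priority: int = 5) -> TransferJob:
        """'File transfer': the caller already knows the source SURL."""
        job = TransferJob(priority, next(self._seq), source_surl, dest_surl)
        heapq.heappush(self._heap, job)
        return job  # returned at once; the copy itself happens later

    def submit_placement(self, lfn: str, dest_surl: str,
                         priority: int = 5) -> TransferJob:
        """'File placement': resolve the LFN to a replica SURL first."""
        replicas = self._catalog.get(lfn, [])
        if not replicas:
            raise LookupError(f"no replica registered for {lfn}")
        return self.submit_transfer(replicas[0], dest_surl, priority)

    def run_next(self) -> Optional[TransferJob]:
        """Serve the highest-priority job; a real service would copy data here."""
        if not self._heap:
            return None
        job = heapq.heappop(self._heap)
        job.state = "Done"
        return job


queue = TransferQueue({"/myVO.org/run22/f1": ["srm://se1.example.org/f1"]})
job_a = queue.submit_transfer("srm://a.example.org/x", "srm://b.example.org/x")
job_b = queue.submit_placement("/myVO.org/run22/f1", "srm://se2.example.org/f1",
                               priority=1)
```

A VO policy such as "preferred sites" would plug in where `submit_placement` picks `replicas[0]`; the priority field is the only policy modelled here.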

DMS Status
- gLite DMS being released in phases
  - File placement and transfer in August
  - Metadata en route
  - Data scheduling later
- High energy physicists urgently needed better functionality than the LCG file management middleware
- Interim solution developed: LFC
  - LFC = LCG File Catalogue
  - LCG = LHC Compute Grid
  - LHC = Large Hadron Collider
- Effect: 2 data management middlewares temporarily:
  - gLite (emerging), LFC (interim; production, GILDA)

Summary
[Diagram labels: "Trigger and monitor transfer"; "Look up, register, authorise".]

For More Information
- JRA1 Data Management homepage
- EGEE Middleware Architecture and Planning
- gLite FiReMan user guide
  - Overview
  - Command Line tools
  - C/C++ API
  - Java API

Responses to question
- Relationship between SRB and SRM?
- GLUE schema
- Reference: "Mass Storage Management and the Grid", A. Earl, P. Clark, University of Edinburgh, Edinburgh, Scotland

SRB and SRM
- SRB: Storage Resource Broker
  - From San Diego Supercomputer Center
- SRM: Storage Resource Manager
  - Set of specifications for providing a Grid interface to storage management systems of various types
    - Specification for Unix-based systems
    - Web Service
- Reference: "Mass Storage Management and the Grid", A. Earl, P. Clark, University of Edinburgh, Edinburgh, Scotland