NorduGrid plans and questions for gLite Marko Niinimaki, NorduGrid 3 rd EGEE meeting Athens, April 2005.

Slides:



Advertisements
Similar presentations
1 WP2: Data Management Paul Millar eScience All Hands Meeting September
Advertisements

Data Management Expert Panel - WP2. WP2 Overview.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
Grid Data Management Assaf Gottlieb - Israeli Grid NA3 Team EGEE is a project funded by the European Union under contract IST EGEE tutorial,
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Data Grid Web Services Chip Watson Jie Chen, Ying Chen, Bryan Hess, Walt Akers.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Don Quijote Data Management for the ATLAS Automatic Production System Miguel Branco – CERN ATC
INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.
The LCG File Catalog (LFC) Jean-Philippe Baud – Sophie Lemaitre IT-GD, CERN May 2005.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware Data Management in gLite.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Globus Replica Management Bill Allcock, ANL PPDG Meeting at SLAC 20 Sep 2000.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Nov. 18, EGEE and gLite are registered trademarks gLite Middleware Usage Dusan.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
Replica Management Services in the European DataGrid Project Work Package 2 European DataGrid.
AERG 2007Grid Data Management1 Grid Data Management Replica Location Service Carolina León Carri Ben Clifford (OSG)
T3 analysis Facility V. Bucard, F.Furano, A.Maier, R.Santana, R. Santinelli T3 Analysis Facility The LHCb Computing Model divides collaboration affiliated.
CGW 04, Stripped replication for the grid environment as a web service1 Stripped replication for the Grid environment as a web service Marek Ciglan, Ondrej.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management and Interoperability Peter Kunszt (JRA1 DM Cluster) 2 nd EGEE Conference,
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
Lattice QCD Data Grid Middleware: Meta Data Catalog (MDC) -- CCS ( tsukuba) proposal -- M. Sato, for ILDG Middleware WG ILDG Workshop, May 2004.
Karolina Sarnowska, University of Virginia Andrew Grimshaw, University of Virginia Mark Morgan, University of Virginia Akos Frohner, CERN Erwin Laure,
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
Replica Management Kelly Clynes. Agenda Grid Computing Globus Toolkit What is Replica Management Replica Management in Globus Replica Management Catalog.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
Managing Data DIRAC Project. Outline  Data management components  Storage Elements  File Catalogs  DIRAC conventions for user data  Data operation.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
David Adams ATLAS ATLAS distributed data management David Adams BNL February 22, 2005 Database working group ATLAS software workshop.
Object storage and object interoperability
DIRAC Data Management: consistency, integrity and coherence of data Marianne Bargiotti CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Data management in LCG and EGEE David Smith.
LCG Distributed Databases Deployment – Kickoff Workshop Dec Database Lookup Service Kuba Zajączkowski Chi-Wei Wang.
Data Management The European DataGrid Project Team
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
Data Management The European DataGrid Project Team
Testing the HEPCAL use cases J.J. Blaising, F. Harris, Andrea Sciabà GAG Meeting April,
Distributed Data Access Control Mechanisms and the SRM Peter Kunszt Manager Swiss Grid Initiative Swiss National Supercomputing Centre CSCS GGF Grid Data.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
EGEE is a project funded by the European Union under contract IST Data Management Data Access From WN Paolo Badino Ricardo.
1 Data Management for Internet Backplane Protocol by Tang Ming Assoc/Prof. Francis Lee School of Computer Engineering, Nanyang Technological University,
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data management in EGEE.
Istituto Nazionale di Astrofisica Information Technology Unit INAF-SI Job with data management Giuliano Taffoni.
INFSO-RI Enabling Grids for E-sciencE University of Coimbra gLite 1.4 Data Management System Salvatore Scifo, Riccardo Bruno Test.
Grid Data Management Assaf Gottlieb Tel-Aviv University assafgot tau.ac.il EGEE is a project funded by the European Union under contract IST
INFSO-RI Enabling Grids for E-sciencE Practicals on LFC and gLite DMS Tony Calanducci Emidio Giorgio INFN Retreat between GILDA.
INFSO-RI Enabling Grids for E-sciencE ESR Database Access K. Ronneberger,DKRZ, Germany H. Schwichtenberg, SCAI, Germany S. Kindermann,
EGEE Data Management Services
GFAL Grid File Access Library
gLite Basic APIs Christos Filippidis
Oxana Smirnova, Jakob Nielsen (Lund University/CERN)
StoRM: a SRM solution for disk based storage systems
gLite 1.4. Data Mangement Exercises
Metadata Services on the GRID
Comparison of LCG-2 and gLite v1.0
Data Management in Release 2
Riccardo Bruno, Salvatore Scifo gLite - Tutorial Catania, dd.mm.yyyy
Data services in gLite “s” gLite and LCG.
Architecture of the gLite Data Management System
DIRAC Data Management: consistency, integrity and coherence of data
Metadata Services on the GRID
Presentation transcript:

NorduGrid plans and questions for gLite Marko Niinimaki, NorduGrid 3 rd EGEE meeting Athens, April 2005.

Finished ROME jobs

The interoperability scene: common protocols to make ARC – gLite FireMan connectivity. The “improvements” scene: enable searching by file attributes and metadata. Motivation: job submissions, data transfers etc. are handled already by both gLite, ARC, Grid3.. (job submission language may even eventually be the same). ARC uses RLS as a simple physical-logical filename mapper, users are unhappy: AA*, query language, performance*, no hierarchy*. using a “common” catalog-SE interface would make future integration easier for developers and prevent re-inventing the wheel. Plans

Does this make sense? Are there resources (manpower) to include this in the schedule? (From NorduGrid side yes) If yes, how to proceed (following slides) Questions

gLite FireMan gLite i/o storage soap over http/g rfio db clients FireMan architecture

FireMan/glite io client examples > voms-proxy-info VO : EGEE Valid from : Mar 7 12:36: GMT Valid to : Mar 8 00:36: GMT > glite-catalog-ls -l /tmp... dpdrwlxgspdrwlxgspdrwlxgs :37:15 /tmp/krzychu -pdrwlxgs :00:00 /tmp/la_vispa_teresa... > glite-catalog-mkdir /tmp/markoniinimaki > glite-catelog-rmdir /tmp/markoniinimaki

FireMan/glite io client examples (cont) > glite-put /etc/issue /tmp/markoniinimaki/mytest [glite_put] Total 0.00 MB |====================| % [0.0 Mb/s] Transfer Completed: LFN : /tmp/markoniinimaki/mytest GUID : 004d9388-4d47-122c-ad5e-808d2561beef SURL : srm://gridftp05.cern.ch:8443/srm/managerV1?SFN=/castor/cern.ch/user/g/gprodu ct/EGEETEST/SE/tmp/markoniinimaki/mytest Data Written [bytes] : 42 Eff.Transfer Rate[Mb/s] : > glite-catalog-find -name markoniinimaki /tmp > glite-catelog-setreplica /tmp/markoniinimaki/mytest > glite-catalog-getreplica /tmp/markoniinimaki/mytest srm://gridftp05.cern.ch:8443/srm/managerV1?SFN=/castor/cern.ch/user/g/gprodu ct/EGEETEST/SE/tmp/markoniinimaki/mytest

Part of ARC http/s/g data suite (other parts incl. data move) SSE is only part of full infrastructure and must be complemented with Data Indexing Services (IS). SSE has no internal structure for storing data units (files). Files are identified by their identity used in IS (Logical File Name, GUID, Logical Path, etc.) All operations on data units (creation, replication, deletion of replica, etc.) are done on request of the client through SSE. Access to data units on SSE is based on the identity of the client and uses GACL. SSE implements https/g protocols for data transfer. SSE can register stored files at Globus Replica Catalog (RC) and Replica Location System (RLS) indexing services. About SSE (

RLS (IS) ARC “Smart SE” storage db clients ARC SSE-RLS architecture

ARC SSE-RLS examples The SE in pcephc23 reports to RLS in grid.uio.no > ngcopy file:///etc/issue "se://pcephc23.cern.ch/se?foo" > ngls rls://grid.uio.no foo mytest > ngcopy rls://grid.uio.no/foo file:///tmp/foofile:///tmp/foo Normal RLS attribute searches can be applied, > globus-rls-cli attribute search size lfn '>' string 500 rls://grid.uio.no

Integration layer I simply do not know where it could be. ARC SSE emulates rfio (NG developers do not like this idea). SSE could either use part of glite-io which is responsible for talking to Fireman. Or Fireman interface could be used directly by SSE. [Unfortunately I wasn't able to make FireMan talk to anything else than gLite i/o].

The improvement part Boolean searches involving meta data, like: list all files owned by X, smaller than 2 GB, physically available in an SE whose name contains “uio.no”. Does the data that FireMan stores in db incluse such attributes? Can FireMan API (WSDL) be expanded to include such operations?

For completeness' sake.. Generated from manager/https/se/file_soap.h?rev=1.12;content-type=text%2Fplain <definitions name="ARCStorageElement" <schema targetNamespace=" xmlns:SOAP-ENV=" xmlns:SOAP-ENC=" xmlns:xsi=" xmlns:xsd=" xmlns:ns=" xmlns=" Manipulate access rules for named content

.. Add new content(file) Update metadata of content Search for named content and provide metadata Manipulate access rules for named content Remove contant and unregister it