DataGrid is a project funded by the European Commission under contract IST-2000-25182 WP2 – R2.1 Overview of WP2 middleware as present in EDG 2.1 release.

Slides:



Advertisements
Similar presentations
The Replica Location Service In wide area computing systems, it is often desirable to create copies (replicas) of data objects. Replication can be used.
Advertisements

1 WP2: Data Management Paul Millar eScience All Hands Meeting September
WP2: Data Management Gavin McCance University of Glasgow.
EU DataGrid TestBed 2 Component Review Paul Millar (University of Glasgow) (slides based on a presentation by Erwin Laure)
WP2 and GridPP UK Simulation W. H. Bell University of Glasgow EDG – WP2.
Data Management Expert Panel - WP2. WP2 Overview.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
EGEE is a project funded by the European Union under contract IST Using SRM: DPM and dCache G.Donvito,V.Spinoso INFN Bari
Stephen Burke – Heidelberg - 26/9/2003 Partner Logo Overview of applications view of the data management middleware Stephen Burke.
Andrew McNab - EDG Access Control - 14 Jan 2003 EU DataGrid security with GSI and Globus Andrew McNab University of Manchester
RLS Production Services Maria Girone PPARC-LCG, CERN LCG-POOL and IT-DB Physics Services 10 th GridPP Meeting, CERN, 3 rd June What is the RLS -
N° 1 LCG EDG Data Management Catalogs in LCG James Casey LCG Fellow, IT-DB Group, CERN
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Overview  Strong consistency  Traditional approach  Proposed approach  Implementation  Experiments 2.
DataGrid is a project funded by the European Commission under contract IST rd EU Review – 19-20/02/2004 DataGrid Quality Assurance On behalf.
OSG End User Tools Overview OSG Grid school – March 19, 2009 Marco Mambelli - University of Chicago A brief summary about the system.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
UCL workshop – 4-5 March 2004 – HEP Assessment of EDG – n° 1 HEP Applications Evaluation of the EDG Testbed and Middleware Stephen Burke (EDG HEP Applications.
Don Quijote Data Management for the ATLAS Automatic Production System Miguel Branco – CERN ATC
Scientific Data Grid on NGI Kai Nan Computer Network Information Center Chinese Academy of Sciences CANS 2004, Miami.
RLS Tier-1 Deployment James Casey, PPARC-LCG Fellow, CERN 10 th GridPP Meeting, CERN, 3 rd June 2004.
DataGrid is a project funded by the European Union CHEP 2003 – March 2003 – Next Generation Data Mgmt... – n° 1 James Casey CERN
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
The LCG File Catalog (LFC) Jean-Philippe Baud – Sophie Lemaitre IT-GD, CERN May 2005.
Part Four: The LSC DataGrid Part Four: LSC DataGrid A: Data Replication B: What is the LSC DataGrid? C: The LSCDataFind tool.
WP3 Information and Monitoring Steve Fisher / RAL 23/9/2003.
Replica Management Services in the European DataGrid Project Work Package 2 European DataGrid.
30-Sep-03D.P.Kelsey, SCG Summary1 Security Co-ordination Group (WP7 SCG) EDG Heidelberg 30 September 2003 David Kelsey CCLRC/RAL, UK
EGEE is a project funded by the European Union under contract IST R-GMA: Production Services for Information and Monitoring in the Grid John.
UKQCD Grid Status Report GridPP 13 th Collaboration Meeting Durham, 4th—6th July 2005 Dr George Beckett Project Manager, EPCC +44.
DataGRID Testbed Enlargement EDG Retreat Chavannes, august 2002 Fabio HERNANDEZ
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
Jens G Jensen RAL, EDG WP5 Storage Element Overview DataGrid Project Conference Heidelberg, 26 Sep-01 Oct 2003.
Managing Data DIRAC Project. Outline  Data management components  Storage Elements  File Catalogs  DIRAC conventions for user data  Data operation.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Database authentication in CORAL and COOL Database authentication in CORAL and COOL Giacomo Govi Giacomo Govi CERN IT/PSS CERN IT/PSS On behalf of the.
Project Wanzenhaus By Myat Min Mong-Hang Vo Pratik Dhupia.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
Ákos FROHNER – DataGrid Security n° 1 Security Group TODO
DGC Paris Spitfire A Relational DB Service for the Grid Leanne Guy Peter Z. Kunszt Gavin McCance William Bell European DataGrid Data Management.
EGEE is a project funded by the European Union under contract IST Information and Monitoring Services within a Grid R-GMA (Relational Grid.
The Storage Resource Broker and.
Accounting in DataGrid HLR software demo Andrea Guarise Milano, September 11, 2001.
Stephen Burke – Sysman meeting - 22/4/2002 Partner Logo The Testbed – A User View Stephen Burke, PPARC/RAL.
EGEE is a project funded by the European Union under contract IST Data Management Data Access From WN Paolo Badino Ricardo.
EGEE is a project funded by the European Union under contract IST New VO Integration Fabio Hernandez ROC Managers Workshop,
EGEE is a project funded by the European Union under contract IST Issues from current Experience SA1 Feedback to JRA1 A. Pacheco PIC Barcelona.
Overview of the New Security Model Akos Frohner (CERN) WP8 Meeting VI DataGRID Conference Barcelone, May 2003.
EGEE is a project funded by the European Union under contract IST Datamat Status Report F. Pacini Datamat S.p.a. Milan, IT-CZ JRA1 meeting,
Earth Observation inputs to ATF Annalisa Terracina EU-DataGrid Project Work Package 9 – EO Applications April 2003 CERN.
ATLAS DDM Developing a Data Management System for the ATLAS Experiment September 20, 2005 Miguel Branco
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
Jean-Philippe Baud, IT-GD, CERN November 2007
Regional Operations Centres Core infrastructure Centres
The EDG Testbed Deployment Details
StoRM: a SRM solution for disk based storage systems
JRA3 Introduction Åke Edlund EGEE Security Head
Technical Board Meeting, CNAF, 14 Feb. 2004
EGEE VO Management.
R-GMA Security Stephen Hicks UK Cluster Security
Introduction to Grid Technology
Installation, Configuration, Examples of use
Data Management in Release 2
Update on EDG Security (VOMS)
Public Key Infrastructure from the Most Trusted Name in e-Security
The GENIUS Security Services
INFNGRID Workshop – Bari, Italy, October 2004
The INFN-GRID RLS A. Cavalli - INFN-CNAF
Presentation transcript:

DataGrid is a project funded by the European Commission under contract IST WP2 – R2.1 Overview of WP2 middleware as present in EDG 2.1 release and outlook to demo functionality

WP2 in Release n° 2 Overview u Changes from R2.0 to R2.1 u Interfaces n Usage of info services n Usage of Storage services u Known Issues and possible resolutions u Outlook u Discussion

WP2 in Release n° 3 Changes from R2.0 to R2.1 u BUGFIXES n No major new functionalities n Almost all open bugs fixed with priority < 4 n Complied with some enhancement requests u Security n VOMS certificate parsing s --vo flag is not necessary if user has issued voms-proxy-init s All services accept VOMS certs as well as old-style certs n Unified tomcat configuration and installation through java-security s Easy maintenance, modular (even running on the same node, VO- specific services can be managed with no/little disruption to others s Same infrastructure for WP2,WP3,WP5

WP2 in Release n° 4 Changes from R2.0 to R2.1 u Replication n Full RLS, i.e. Introduction of Replica Location Indices n No visible change to the user, all should be transparent n Much better scalability: a local catalog is always available (LRC) n Single central LRC model is still possible Client LRC Client LRC RLI

WP2 in Release n° 5 Changes from R2.0 to R2.1 u Metadata n RLS & RMC: enhanced user-defined metadata s Support for int, float in addition to string type s Support for complex query using SQL where clause i.e get all files with "I > 2 AND KIND LIKE '%TOP' OR Z <= 0.54" n Spitfire: security support and much easier adaptability u Conformance checking n Logical file names, GUIDs and SURLs are checked for their 'correctness' i.e. adherence to their definitions n SURL normalization: //,../ resolution OKWRONG lfn:test guid:3254-3a c-8899db srm://lxshare0408.cern.ch/wpsix/data/myfile.dat mytest guid:test srm://path/data.txt

WP2 in Release n° 6 Interfaces – Storage u Full SE support. Issues: n Delete support – WP5 claims its there but we could not test yet n Fast file existence check – " " n Asynchronous operations – not for R2.1 n SRM interface support – not for R2.1 u GridFTP server support ("Classic SE mode") n Fake GridFTP server to make it look like an SE n Works well with CASTOR-GridFTP which provides MSS access u SRM support n SRM v1.1 support – integration tests with Fermilab, CASTOR n Not well tested

WP2 in Release n° 7 Interfaces – Infosystem u Full RGMA integration n Publication to proper RGMA interfaces (producers) n Lookup through native RGMA interfaces n Cron entries provided to keep information fresh u Full MDS integration for LCG n Info providers also exist for GLUE schema n MDS support through LDAP interface u Replica Manager also configurable using a configuration file (Stub). NOTE: The config file attributes have changed. Please see the release notes on the WP2 website

WP2 in Release n° 8 Known Issues : Performance u Performance of edg-rm Command Line Interface n Due mainly to startup of full JVM per call n Java API is fast enough n C++ API is most efficient u Example: listReplicas (insecure) n CLI: ~3.5 sec n Java API: ~0.2 sec n C++: ~30ms u Resolution: provide C++ - based CLI for most critical operations (register, lookup)

WP2 in Release n° 9 Known Issues : enabling security u All services have been tested in secure mode using certificates. u All Java connectivity works, all tests pass in secure mode n Performance penalty: extra 200ms per call (or more, depending on latency. Nothing we can do about that, this is the nature of the SSL handshake mechanism, i.e. the price we pay for security.) u Services do not provide authorization configuration options yet. This is something we want to provide for the last demo. u C++ secure client has had a lot of issues, which were resolved only very late n Not enough testing has been done n Not advisable to deploy for production yet n Primary interface of RB to WP2  Cannot enable security for the services on prod.TB NOW, but probably in a few weeks' time (which is probably too late)  Solution is there and exists and we have deployed far worse stuff in the past, but now we're more cautious (and smarter).

WP2 in Release n° 10 Known Issues: Complexity u Too much of the internals of the replication mechanism is exposed to the user. u Replica Manager Service deferred to next project n Provide ease-of-use n Failsafety n Consistency checking n Reliability n Needs proper security delegation mechanism (through OGSA?) u Solution to some extent: RLS Front-End (aka proxy) service, probable prototype to be demo'd at last review u NOTE: POOL does not solve the problem but obscures it a bit, at least currently. It is a step in the right direction, but architecturally in the wrong layer.

WP2 in Release n° 11 Known Issues: Bulk operations u Missing GDMP functionality: bulk transfer and subscription- based replication n Was planned for 2.1 but was cut off to keep features low n Experiments drove this feature cut n Effort was not available to keep GDMP supported or to adapt it to the new system u RSS – Replication Subscription Service is an unfinished prototype. Its future is unclear.

WP2 in Release n° 12 Next steps u BUGFIXES u Make full RLS work n It is essential to gather experience using a 'real' distributed Grid Middleware n RGMA integration and automatic subscription can only be fully tested on the real system. There may be a few hickups. Manual subscription is always possible and will probably have to be done. u Work with applications to have a real Spitfire instance in production n Gathering experience on this layer is also essential to be able to understand the requirements properly For the demo: u Provide RLS-FE service. This will lower the complexity on the client side and deal with the outbound connectivity issue at least for this service. u Provide authorization options for RMC, RLS (i.e. Who may do what?) based on VOMS proxy content.

WP2 in Release n° 13 Conclusion and Outlook u We have provided working services to EDG and LCG on a very aggressive timescale. n Gained a lot of expertise in the domain of data management and security n We established a lot of bonds to other organizations and have built good collaborations u We have learned a LOT on how to NOT do things in a project u Towards the end we have started to do things right but n People got overloaded while adapting the proper procedures and keeping the tight timelines at the same time u I think EDG has provided the most solid Grid M/W there is today! n We should advertise it at least as much as our friends from other projects do.