The Work Package 2 experience


The Work Package 2 experience
Data Management on the Grid
Peter.Kunszt@cern.ch

Outline
- Objectives and how they were met
- Achievements
- Lessons learned
- Future & Exploitation
- Questions

Objectives
DataGrid Technical Annex:
- Enable secure access to massive amounts of data in a universal global name space.
- Move and replicate data at high speed from one geographical site to another.
- Interface to heterogeneous mass storage management systems.
- Manage synchronisation of remote copies.
- Automate data caching and distribution according to dynamic usage patterns.
- Write-only versions.
- Network monitoring considerations.

Achievements
Delivering Middleware
- Many existing Grid components were included in the first release (such as GDMP and the Globus replica catalog).
- Based on first experience and on user feedback, the EDG2 services were designed and developed.
- Pioneering role in the use of J2EE-based web services (long before OGSI).
- EDG2 is a complete set of data management solutions, but should still be considered first generation.
Collaboration
- Close interaction with the Globus Project on the replica location service
- Collaboration with CrossGrid on storage resource metrics
- Strong participation in GGF and its various groups

Achievements per Task
Data Replication Task
- Replica Manager (EDG1 and EDG2)
- Replica Catalog (EDG1) and Replica Location Service (EDG2)
- Grid Data Mirroring Package (EDG1)
- Replica Metadata Catalog (EDG2)
[Figure: EDG2 Data Management. Components shown: Replica Manager Client, Replica Location Service, Replica Metadata Catalog, Replica Optimization Service, WP3 Information Services, WP5 Storage Element, Globus GridFTP. Connection labels: SOAP, GSIFTP.]
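The catalogs named on this slide can be pictured as a two-step lookup. The sketch below is illustrative only. It assumes that the Replica Metadata Catalog maps a logical file name to a unique identifier and that the Replica Location Service maps that identifier to the physical copies. The class names, method names, identifiers and host names are all hypothetical and are not the EDG2 API.

```python
from collections import defaultdict


class ReplicaMetadataCatalog:
    """Hypothetical stand-in: logical file name (LFN) -> unique identifier."""

    def __init__(self):
        self._guid_by_lfn = {}

    def add_alias(self, lfn, guid):
        self._guid_by_lfn[lfn] = guid

    def guid_for(self, lfn):
        # Raises KeyError if the logical name was never registered.
        return self._guid_by_lfn[lfn]


class ReplicaLocationService:
    """Hypothetical stand-in: unique identifier -> physical replica locations."""

    def __init__(self):
        self._replicas = defaultdict(list)

    def register(self, guid, pfn):
        # Ignore duplicate registrations of the same physical copy.
        if pfn not in self._replicas[guid]:
            self._replicas[guid].append(pfn)

    def locations(self, guid):
        return list(self._replicas.get(guid, []))


def list_replicas(lfn, rmc, rls):
    """Resolve a logical name to every known physical copy."""
    return rls.locations(rmc.guid_for(lfn))


rmc = ReplicaMetadataCatalog()
rls = ReplicaLocationService()
rmc.add_alias("lfn:example-dataset.root", "guid-0001")
rls.register("guid-0001", "gsiftp://se.site-a.example/data/f1")
rls.register("guid-0001", "gsiftp://se.site-b.example/data/f1")
print(list_replicas("lfn:example-dataset.root", rmc, rls))
```

Keeping the name mapping separate from the location mapping means a file can gain or lose replicas without its logical name changing.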

Achievements per Task
Optimization Task
- Replica Optimization Service (EDG2)
- Active research: OptorSim simulation package
  - Simulation of optimal replica placement strategies
  - Includes simulation of network conditions
Metadata Access Task
- Spitfire: Grid access to relational DB (Demo)
- Served as a prototype web service application for EDG2 services
- Replica Location Service metadata
- Replica Metadata Catalog metadata
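One way to picture what a replica optimization step might do is to pick the copy with the lowest estimated transfer time. The sketch below is a simplified illustration under that assumption. The cost model (file size divided by measured bandwidth), the function names and the site names are hypothetical, and it does not reproduce the algorithms of the Replica Optimization Service or OptorSim.

```python
def estimated_transfer_seconds(size_bytes, bandwidth_bytes_per_s):
    """Naive cost model: transfer time = size / bandwidth."""
    if bandwidth_bytes_per_s <= 0:
        # Unknown or unreachable link: treat as infinitely expensive.
        return float("inf")
    return size_bytes / bandwidth_bytes_per_s


def best_replica(replica_sites, size_bytes, bandwidth_to_destination):
    """Return the site whose replica is cheapest to fetch.

    replica_sites: sites that hold a copy of the file.
    bandwidth_to_destination: site -> measured bandwidth in bytes/s.
    """
    if not replica_sites:
        raise ValueError("no replicas available")
    return min(
        replica_sites,
        key=lambda site: estimated_transfer_seconds(
            size_bytes, bandwidth_to_destination.get(site, 0.0)
        ),
    )


sites = ["site-a", "site-b", "site-c"]
bandwidth = {"site-a": 10e6, "site-b": 40e6}  # site-c has no measurement
print(best_replica(sites, 2e9, bandwidth))
```

A real service would also have to account for storage load and changing network conditions, which is the kind of question the simulation work addresses.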

Achievements per Task
Security Task
- Secure Java-based web services:
  - TrustManager for authentication (EDG2). In use by WP3 and WP5 as well.
  - AuthorizationManager for authorization (in operation). It can apply VOMS authorization information to the service.
- Secure clients in Java and C++ to web services (EDG2)
- Strong participation in the EDG Security Group
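The split between authentication and authorization on this slide can be illustrated with a small policy check. This is a sketch under stated assumptions, not the TrustManager or AuthorizationManager API. The policy table, the attribute strings and the function name are hypothetical; the attribute strings only imitate the general shape of virtual organisation group and role attributes.

```python
# Hypothetical policy: VO attribute -> operations it permits.
POLICY = {
    "/examplevo/Role=production": {"read", "write", "delete"},
    "/examplevo": {"read"},
}


def is_authorized(authenticated, attributes, operation, policy=POLICY):
    """Allow an operation only for an authenticated caller holding an
    attribute that grants it. Authentication itself is assumed to have
    happened earlier, for example during the secure connection setup."""
    if not authenticated:
        return False
    return any(operation in policy.get(attr, set()) for attr in attributes)


print(is_authorized(True, ["/examplevo"], "read"))
print(is_authorized(True, ["/examplevo"], "write"))
```

Keeping the decision in one function that every service calls reflects the idea that security runs through all services rather than being bolted on per service.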

Lessons Learned
Development Cycle
- In EDG it was not possible to carry out proper requirements gathering, prototyping, testing and development fast enough for the lifetime of the project.
- A faster release cycle to the end users will be possible from now on, since future projects won't start from zero.
- We focussed on core services in EDG. The much-needed end-to-end capabilities can now be added more easily, since the users also know better what they want and how they want it.
- User interface and documentation are important and difficult to get right the first time.
Less is more
- Focusing on the basics, stability and usability, paid off.
- Extra features are good, but should be pluggable because not all users want them.

Lessons Learned
Security is key
- Can't 'add security later': it runs horizontally through all services
- Security mechanisms are deeply reflected in the design
- Lots of open issues: performance, delegation, site buy-in…
Web Services work well
- Modular web service structure
- Pluggable QoS (deployable in open source or commercial environments)
- Based on standards: well supported by industry and open source community

Future & Exploitation
Products
- The LHC Computing Grid is running WP2 services (except Replica Optimization and Spitfire) and will maintain and support them for at least this year for their community.
- Spitfire has already served as an example for other projects.
- The security infrastructure will serve as one of the bases for Java-based web service infrastructures over SSL for the next projects.
- The optimization work has enriched the computer science community with many valuable insights through its many publications.
- The code base will be made available through GridForge.
- The publications, documentation and tutorials serve as a reference for future projects.

Future & Exploitation
People
- All members of WP2 have gained valuable experience while working on EDG. Their expertise will be very useful to their future projects.
Processes
- The lessons learned in EDG will help improve the processes of the future projects that EDG members participate in.
A lot of work remains to be done
- Data sets and virtual data
- Application metadata bindings into the low-level services
- End-to-end integration with user applications
- …

BIG THANKS
To all people who have contributed to WP2.
CERN: Diana Bosio, Akos Frohner, Leanne Guy, Wolfgang Hoschek, Javier Jaen-Martinez, Marcin Kania, Arnaud Lacroix, Erwin Laure, Levi Lucio, Ben Segal, Heinz Stockinger, Kurt Stockinger
INFN: Giuseppe Andronico, Federico Di Carlo, Andrea Domenici, Flavia Donno, Livio Salconi, Marco Serra
PPARC: William Bell, David Cameron, Gavin McCance, Paul Millar, Caitriana Nicholson
HIP/CSC: Joni Hahkala, Niklas Karlsson, Juho Karppinen, Ville Nenonen, Marko Niinimäki, Tuomas Nissi, Henri Mikkonen, Olli Serimaa, Mika Silander, John White
KDC: Olle Mulmo, Bjorn Torkelsson, Gian Luca Volpato
ITC-IRST: Paolo Busetta, Luigi Capozza, Mark Carman, Ruben Carvajal-Schiaffino, Luciano Serafini, Floriano Zini
LCG: Itzhak Ben-Akiva, James Casey, Radovan Chytracek, Kálmán Kövári, Sophie Lemaitre
PPDG: Andrew Hanushevsky, Shahzad Muzaffar, Asad Samar
And: Brian Tierney (BNL), Aleksandr Konstantinov (NorduGrid/CrossGrid)