WP2: Data Management Gavin McCance University of Glasgow.


2/18 GridPP, September 2002, Gavin McCance, University of Glasgow

Overview
- Current status of components
- Testbed 1.2 strategy
- Testbed 2 strategy
- Metadata: Spitfire
- OptorSim

3/18 Current software status
- Replication: edg-replica-catalog (deployed)
    - GSI version available.
    - Graphical user interface available.
- Replication: edg-replica-manager (deployed)
    - Atomic replication operations.
    - This component should be used for data management tests.
- Replication: GDMP (deployed)
    - More complex subscription-based functionality.
    - Has been tested and is proven to work.
    - Can be abused to provide the same functionality as edg-replica-manager, but it is overkill and much harder to do.
- Metadata storage: SQL database service, Spitfire Browser (deployed)
    - Web-browser-based access to metadata.
    - GSI enabled, with authorisation.
- Optimisation: replica selection, OptorSim (working)
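The "atomic replication operations" listed for edg-replica-manager can be pictured with a minimal Python sketch. This is not the EDG implementation: the `ReplicaCatalog` class, the `replicate` function and the file names are hypothetical, and a local file copy stands in for a grid transfer. The sketch only illustrates the idea that the copy and the catalogue registration either both happen or leave no trace.

```python
import os
import shutil
import tempfile


class ReplicaCatalog:
    """Hypothetical in-memory catalogue: logical file name -> physical copies."""

    def __init__(self):
        self._replicas = {}

    def register(self, lfn, pfn):
        self._replicas.setdefault(lfn, set()).add(pfn)

    def lookup(self, lfn):
        return sorted(self._replicas.get(lfn, ()))


def replicate(catalog, lfn, source_pfn, dest_pfn):
    """Copy a file and register the new copy as one all-or-nothing step."""
    existed_before = os.path.exists(dest_pfn)
    try:
        shutil.copyfile(source_pfn, dest_pfn)  # stand-in for a grid transfer
        catalog.register(lfn, dest_pfn)
    except Exception:
        # Roll back: never leave an unregistered copy that this call created.
        if not existed_before and os.path.exists(dest_pfn):
            os.remove(dest_pfn)
        raise


# Small demonstration using a temporary directory as two "storage elements".
with tempfile.TemporaryDirectory() as workdir:
    source = os.path.join(workdir, "site_a.dat")
    target = os.path.join(workdir, "site_b.dat")
    with open(source, "w") as handle:
        handle.write("event data")

    catalog = ReplicaCatalog()
    catalog.register("lfn:run42", source)
    replicate(catalog, "lfn:run42", source, target)
    print(len(catalog.lookup("lfn:run42")), "replicas registered")
```

The design choice shown is the rollback in the `except` branch: a failed registration removes the copy, so the catalogue and the storage never disagree.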

4/18 …Current software status
- Replication: Replica Location Service (deployable)
    - Joint Globus-EDG development.
    - Unit and performance tests successful.
    - Distributed Replica Catalog service.
    - Graphical user interface available.
- Security: edg-security module (in testing)
    - Security for other Web Services components.
- Metadata storage: SQL database service, Spitfire (in testing)
    - Web Service interface over SOAP Remote Procedure Call.
    - Security through edg-security and proper local authorisation.
    - Complementary service to the web-based Spitfire Browser.
- Replication: Replica Manager, Reptor (in testing and development)
- Optimisation: replica selection, Optor (in testing and development)
- Replication: Replica Metadata Catalog, RepMec (in testing and development)
- Security: edg-voms (in development; joint effort with WP6 and WP7)

5/18 Strategy for 1.2
- Integrate a single-instance RLS into TB1.2 (replacement for edg-replica-catalog).
- Integrate RLS into edg-replica-manager and GDMP.
- Steer users away from GDMP and towards edg-replica-manager for simple data management use cases.
- Improve the stability and usability of edg-replica-manager.
- Configuration
    - Simplify the configuration and build system of deployed components.
    - Unify the build for all components; adopt EDG standards.
- Testing
    - Large effort on unit testing and functionality tests inside WP2 for all components.
    - Close interaction with the ITeam and the new testing group.

6/18 Towards Testbed 2
- Focus away from GDMP.
- Introduction of Reptor (a refactoring of GDMP)
    - Initially with identical functionality.
    - Incremental and controlled addition of requested functionality.
- Use R-GMA for information gathering and publishing.
- edg-replica-manager will stay as a command-line interface to Reptor, giving users an easy migration path.
- Security: work towards a new security model with VOMS and local authorisation.
- Documentation: guides, examples, how-tos, tutorials.

7/18 Overview
- Current status of components
- Testbed 1.2 strategy
- Testbed 2 strategy
- Metadata: Spitfire
- OptorSim

8/18 Metadata interfaces
- The Spitfire metadata interface will handle general metadata, e.g. calibration data and bookkeeping data.
- The Replica Metadata interface (RepMec) is a convenient extra service interface that will handle metadata keyed on Logical File Names.
- Both technical and application metadata.

9/18 Spitfire
- Now split into two independent products:
    - Spitfire-browser (web pages)
    - Spitfire service itself (web services)

10/18 Spitfire browser
- The client is a web browser (optionally with a GSI certificate loaded).
- The service sits in front of a relational database.
- The service exposes specific views and operations of a database via web page forms.

11/18
- A web form sends the user request to a predefined template.
- Any operation is possible, although typical use would be a simple select or a simple update.
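The idea of sending a request to a predefined template, rather than sending free-form SQL, can be sketched as follows. This is an illustration only, not Spitfire code: the `TEMPLATES` registry, the `run_template` helper and the `calibration` table are hypothetical, and an in-memory SQLite database stands in for the relational database behind the service.

```python
import sqlite3

# Hypothetical template registry: the client names a template and supplies
# parameter values only; it never sends raw SQL.
TEMPLATES = {
    "select_calibration":
        "SELECT run, value FROM calibration WHERE name = ? ORDER BY run",
    "update_calibration":
        "UPDATE calibration SET value = ? WHERE name = ? AND run = ?",
}


def run_template(conn, template_name, params):
    """Execute a predefined template with bound parameters; return any rows."""
    sql = TEMPLATES[template_name]  # an unknown template name raises KeyError
    with conn:  # commit on success, roll back on error
        return conn.execute(sql, params).fetchall()


# Stand-in database with two calibration records.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE calibration (name TEXT, run INTEGER, value REAL)")
conn.executemany(
    "INSERT INTO calibration VALUES (?, ?, ?)",
    [("cal1", 1, 0.98), ("cal1", 2, 1.02)],
)

# A simple select followed by a simple update, the two typical uses.
rows = run_template(conn, "select_calibration", ("cal1",))
run_template(conn, "update_calibration", (1.05, "cal1", 2))
print(rows)
```

Because the statement text is fixed on the server side and values are bound as parameters, the set of operations a form can trigger is exactly the set of templates the service exposes.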

12/18
- After querying, the result set is returned and can be formatted as necessary.
- GSI authentication and authorisation; full integration with the WP2 security module.

13/18 Spitfire web service
- WSDL-defined Remote Procedure Call (RPC) interface to permit secure access to a database.
- Sits in front of an RDBMS.
- Currently available for Java, C and C++. Beta just released.
- GSI authentication and authorisation via the WP2 security module.

14/18 Spitfire for metadata access
[Diagram. Labels: Application in GAUDI framework; GANGA; Services; Registry; Spitfire client library; secure web service connection; Spitfire Web Service; RDBMS. Example calls shown: Component::getCalibration( …, blah ) and Spitfire::select( …, WHERE cal1=blah).]
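The layering on this slide, where an application-level call such as getCalibration is translated by the client library into a select with a WHERE clause, can be sketched in Python. Everything here is an assumption made for illustration: the `SpitfireClient` and `Component` classes, their method signatures and the `calibration` table are hypothetical, and a direct SQLite connection stands in for the secure web service connection to the RDBMS.

```python
import sqlite3


class SpitfireClient:
    """Hypothetical stand-in for the client library. A real client would make
    a secure web service call instead of touching the database directly."""

    # Identifiers cannot be bound as SQL parameters, so only known ones pass.
    ALLOWED = {"calibration": {"cal1"}}

    def __init__(self, conn):
        self._conn = conn

    def select(self, table, column, value):
        if column not in self.ALLOWED.get(table, ()):
            raise ValueError("unknown table or column")
        sql = f"SELECT * FROM {table} WHERE {column} = ?"
        return self._conn.execute(sql, (value,)).fetchall()


class Component:
    """Hypothetical application component that needs calibration data."""

    def __init__(self, client):
        self._client = client

    def get_calibration(self, key):
        # The application asks for calibration data; the client library
        # turns that into a select with a WHERE clause on cal1.
        return self._client.select("calibration", "cal1", key)


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE calibration (cal1 TEXT, constant REAL)")
conn.execute("INSERT INTO calibration VALUES ('blah', 1.5)")

component = Component(SpitfireClient(conn))
print(component.get_calibration("blah"))
```

The application code never sees SQL or connection details, which is the separation the diagram appears to show between the framework component and the Spitfire service.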

15/18 Build system
- WP2 is working to extend the build and configuration system for web services and servlet software.
    - Currently largely absent from the testbed documentation.
- Merge with other WP approaches.
- Common standard for deploying Web Services.

16/18 OptorSim
- Optimise data access and replication based on an economic model.
- View files as digital assets which can be bought and sold for profit.
- Let nodes on the grid interact according to an economic model, and thus optimise the system implicitly rather than explicitly.
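A toy version of the economic idea can make it more concrete. The sketch below is not OptorSim's algorithm: the `Site` class, the valuation rule (a file is worth the number of times it has been requested locally), the fixed prices and the lowest-bid auction are all assumptions chosen for illustration. It shows how purely local buy and sell decisions can shape replica placement without any global optimiser.

```python
from collections import Counter


class Site:
    """Hypothetical grid site with a fixed number of storage slots."""

    def __init__(self, name, capacity, files=()):
        self.name = name
        self.capacity = capacity
        self.files = set(files)
        self.requests = Counter()  # local access history per file

    def value(self, lfn):
        # Assumed valuation: a file is worth as much as it was requested here.
        return self.requests[lfn]


def cheapest_seller(lfn, sellers, price):
    """Reverse auction: every site holding the file bids; lowest price wins."""
    bids = [(price[s.name], s.name) for s in sellers if lfn in s.files]
    return min(bids)[1] if bids else None


def access(buyer, lfn, sellers, price):
    """Serve one request; buy a local replica only if it looks profitable."""
    buyer.requests[lfn] += 1
    if lfn in buyer.files:
        return ("local", buyer.name)
    seller = cheapest_seller(lfn, sellers, price)
    if seller is None:
        raise LookupError(lfn)
    if len(buyer.files) < buyer.capacity:
        buyer.files.add(lfn)
        return ("replicated", seller)
    # Storage is full: give up the least valuable file only for a better one.
    victim = min(sorted(buyer.files), key=buyer.value)
    if buyer.value(lfn) > buyer.value(victim):
        buyer.files.remove(victim)
        buyer.files.add(lfn)
        return ("replicated", seller)
    return ("remote", seller)


sellers = [Site("cern", 10, {"A", "B"}), Site("ral", 10, {"A"})]
price = {"cern": 5, "ral": 2}  # assumed cost of buying from each site
glasgow = Site("glasgow", capacity=1)

for lfn in ["A", "B", "B", "B"]:
    print(lfn, access(glasgow, lfn, sellers, price))
```

In this run the site first buys file A from the cheaper seller, reads B remotely once, then replaces A with B when B has become the more valuable asset.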

17/18 Results of OptorSim 0.4
[Plot. Access patterns shown: Sequential, Flat Random, Unitary Walk, Gaussian. Axis label: Access Pattern. Series label: Basic Economic Model.]
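The access pattern names on this slide can be illustrated with small generators of file-request sequences. The definitions below are one plausible reading of the names, not taken from OptorSim itself: the function names, the wrap-around at the ends of the file list and the `sigma` default are assumptions.

```python
import random


def sequential(n_files, n_requests):
    """Files are requested in order, wrapping around at the end."""
    return [i % n_files for i in range(n_requests)]


def flat_random(n_files, n_requests, rng):
    """Each request picks any file with equal probability."""
    return [rng.randrange(n_files) for _ in range(n_requests)]


def unitary_walk(n_files, n_requests, rng):
    """Each request moves exactly one file up or down from the previous one."""
    current = rng.randrange(n_files)
    sequence = []
    for _ in range(n_requests):
        sequence.append(current)
        current = (current + rng.choice((-1, 1))) % n_files
    return sequence


def gaussian_walk(n_files, n_requests, rng, sigma=3.0):
    """Each request moves by a step drawn from a Gaussian distribution."""
    current = rng.randrange(n_files)
    sequence = []
    for _ in range(n_requests):
        sequence.append(current)
        current = (current + round(rng.gauss(0.0, sigma))) % n_files
    return sequence


rng = random.Random(2002)  # fixed seed so a run can be repeated
print(sequential(5, 8))
print(flat_random(5, 8, rng))
print(unitary_walk(5, 8, rng))
print(gaussian_walk(5, 8, rng))
```

Patterns with locality, such as the two walks, reward keeping neighbouring files nearby, whereas flat random access gives a replication strategy little to exploit.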

18/18 …Optimisation to-do
- Looking at access patterns from SAM for tuning optimisation algorithms.
- Improving OptorSim with a proper simulation of job times.
- More realistic simulation of the eco-model process.
- Application to the real-life Optor module (>=TB2.0).