WP2: Data Management Gavin McCance University of Glasgow November 5, 2001.

Slides:



Advertisements
Similar presentations
Experiences of the Grid… Gavin McCance University of Glasgow NeSC Meeting, 24 October 2001.
Advertisements

1 WP2: Data Management Paul Millar eScience All Hands Meeting September
WP2: Data Management Gavin McCance University of Glasgow.
EU DataGrid TestBed 2 Component Review Paul Millar (University of Glasgow) (slides based on a presentation by Erwin Laure)
Data Grid Management (WP2) W. H. Bell Grid Data Management (WP2) William Bell University of Glasgow.
5-Dec-02D.P.Kelsey, GridPP Security1 GridPP Security UK Security Workshop 5-6 Dec 2002, NeSC David Kelsey CLRC/RAL, UK
Metadata Progress GridPP18 20 March 2007 Mike Kenyon.
WP2 and GridPP UK Simulation W. H. Bell University of Glasgow EDG – WP2.
Andrew McNab - Manchester HEP - 24 May 2001 WorkGroup H: Software Support Both middleware and application support Installation tools and expertise Communication.
Andrew McNab - Manchester HEP - 22 April 2002 EU DataGrid Testbed EU DataGrid Software releases Testbed 1 Job Lifecycle Authorisation at your site More.
Data Management Expert Panel - WP2. WP2 Overview.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
Andrew McNab - Manchester HEP - 2 May 2002 Testbed and Authorisation EU DataGrid Testbed 1 Job Lifecycle Software releases Authorisation at your site Grid/Web.
Author - Title- Date - n° 1 GDMP The European DataGrid Project Team
NIKHEF Testbed 1 Plans for the coming three months.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
Andrew McNab - EDG Access Control - 14 Jan 2003 EU DataGrid security with GSI and Globus Andrew McNab University of Manchester
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
DataGrid is a project funded by the European Union CHEP 2003 – March 2003 – Title – n° 1 Grid Data Management in Action Experience in Running and.
GRID DATA MANAGEMENT PILOT (GDMP) Asad Samar (Caltech) ACAT 2000, Fermilab October , 2000.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Security Mechanisms The European DataGrid Project Team
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
GridPP9 – 5 February 2004 – Data Management DataGrid is a project funded by the European Union GridPP is funded by PPARC WP2+5: Data and Storage Management.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
3 Sept 2001F HARRIS CHEP, Beijing 1 Moving the LHCb Monte Carlo production system to the GRID D.Galli,U.Marconi,V.Vagnoni INFN Bologna N Brook Bristol.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Ákos FROHNER – DataGrid Security Requirements n° 1 Security Group D7.5 Document and Open Issues
MySQL and GRID Gabriele Carcassi STAR Collaboration 6 May Proposal.
Grid Status - PPDG / Magda / pacman Torre Wenaus BNL U.S. ATLAS Physics and Computing Advisory Panel Review Argonne National Laboratory Oct 30, 2001.
MAGDA Roger Jones UCL 16 th December RWL Jones, Lancaster University MAGDA  Main authors: Wensheng Deng, Torre Wenaus Wensheng DengTorre WenausWensheng.
1 WP2: Data Management Gavin McCance RAL Middleware Workshop 24 February 2003.
Tony Doyle & Gavin McCance - University of Glasgow ATLAS MetaData AMI and Spitfire: Starting Point.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Author - Title- Date - n° 1 Partner Logo WP5 Summary Paris John Gordon WP5 6th March 2002.
First attempt for validating/testing Testbed 1 Globus and middleware services WP6 Meeting, December 2001 Flavia Donno, Marco Serra for IT and WPs.
Replica Management Services in the European DataGrid Project Work Package 2 European DataGrid.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
Data Management GridPP and EDG Gavin McCance University of Glasgow May 9, 2002
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
Andrew McNabSecurity Middleware, GridPP8, 23 Sept 2003Slide 1 Security Middleware Andrew McNab High Energy Physics University of Manchester.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
10 May 2001WP6 Testbed Meeting1 WP5 - Mass Storage Management Jean-Philippe Baud PDP/IT/CERN.
Data Management The European DataGrid Project Team
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
DGC Paris Spitfire A Relational DB Service for the Grid Leanne Guy Peter Z. Kunszt Gavin McCance William Bell European DataGrid Data Management.
Grid Status - PPDG / Magda / pacman Torre Wenaus BNL DOE/NSF Review of US LHC Software and Computing Fermilab Nov 29, 2001.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
Site Authorization Service Local Resource Authorization Service (VOX Project) Vijay Sekhri Tanya Levshina Fermilab.
11-May-01D.P.Kelsey, Security Update1 GRID Security Update David Kelsey CLRC/RAL, UK
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
Overview of the New Security Model Akos Frohner (CERN) WP8 Meeting VI DataGRID Conference Barcelone, May 2003.
WP2: Data Management Gavin McCance University of Glasgow.
The EDG Testbed Deployment Details
Gavin McCance University of Glasgow GridPP2 Workshop, UCL
Moving the LHCb Monte Carlo production system to the GRID
Spitfire Overview Gavin McCance.
Data services on the NGS
Viet Tran Institute of Informatics Slovakia
A Web-Based Data Grid Chip Watson, Ian Bird, Jie Chen,
The EU DataGrid Data Management
Presentation transcript:

WP2: Data Management Gavin McCance University of Glasgow November 5, 2001

Overview Deliverables Replication: GDMP Meta-data: Spitfire GridPP effort Future work Query Optimisation

Deliverables EU DataGrid WP2: Major M9 deliverables met GDMP delivered Spitfire delivered Architecture Document

GDMP Generic mirroring tool for any file type (read only replica) Particular plug-ins for Objectivity database files Subscription model for automatic synchronisation of files Automatic update of replica catalogue Currently uses Globus Replica Catalogue

…GDMP BrokerInfo API from WP1 Allows users of GDMP to obtain information from the job scheduler Mass Storage Interface from WP5 e.g. Support for file staging Security is provided via standard GSI (single sign-on) Authorisation via grid mapfile File transfer made using GridFTP Installation: RPM and tarball

…GDMP usage 1. A,B) Start GDMP services (inetd) 2. B) Registers itself with site A gdmp_host_subscribe 3. A) New files Register them gdmp_register_local_file This updates the local (on A) catalogue 4. A) Tell the world (well..all subscribed sites) gdmp_publish_catalogue Will update the import catalogue on all subscribed sites Site ASite B

…GDMP usage 1. B) Get the new files from site A gdmp_replicate_get The new files will be transferred from site A site B Globus replica catalogue updated Filters so you only get files you want CRC checking of file transfer Site ASite B

Spitfire Provides grid enabled access to any relational database SQL Database Service Storage of general meta-data Service Index soon… Secure access via GSI (single sign-on) Installation: RPM and tarball

Allows any HTTP compliant system e.g. Web- browsers / standard C++ HTTP libraries to access any relational database across the grid… …Spitfire = SQL Database Service (Spitfire) Oracle PostgreSQL + Grid Security + Standard communication protocols (XML over HTTPS) JAVA Servlet based

…Spitfire security Authentication is currently provided Standard user & server grid certificates For both application programs and web browsers Authorisation matrix coming soon… Will map grid identity to role(s) Reader, info-update, manager Roles will then map to a given database connection with given permissions on a database Eg. query-only, insert, update, create new tables

…Spitfire Easy to install Good documentation Ready to run examples For grid-based meta-data catalogue needs.. … we need feedback!

WP2 GridPP Effort Based at Glasgow Effort will focus on primarily the query optimisation task of WP2 1 PhD student, 1.5 RA Continuing effort in development of Spitfire and related applications 0.7 RA

Future Spitfire work Look at common ground between WP2 and WP3 Spitfire and R-GMA? Security Authorisation mechanisms Other spitfire applications Service Index, Replica Catalogue Work on scaleable architectures Common with e.g. replica catalogue work

Query Optimisation work Categorise possible areas for optimisation: User oriented: high performance Minimising cost for specific job Grid oriented: high throughput Maximise efficient usage of resources Site oriented: local policy Respond to specific site policies / requirements Much preliminary work done! Workshop in December 2001…

…Query Optimisation Short term: Data Access optimisation Replica Optimiser component How long will it take to get the data here? Developing and evaluating appropriate algorithms for working this out and choosing best replica…

…Query Optimisation Modelling and Simulation Best not to test out the more crazy algorithms on the experiment testbed Work underway with MONARC tool Evaluating suitability as simulation tool for this particular work Integrate into the QO work

Summary Major deliverables for M9 met GDMP and Spitfire GridPP will concentrate effort on Query Optimisation task of WP2 + continued Spitfire development Work already underway