1 WP2: Data Management Paul Millar eScience All Hands Meeting September 2-4 2003.

Slides:



Advertisements
Similar presentations
DataTAG WP4 Meeting CNAF Jan 14, 2003 Interfacing AliEn and EDG 1/13 Stefano Bagnasco, INFN Torino Interfacing AliEn to EDG Stefano Bagnasco, INFN Torino.
Advertisements

Giggle: A Framework for Constructing Scalable Replica Location Services Ann Chervenak, Ewa Deelman, Ian Foster, Leanne Guy, Wolfgang Hoschekk, Adriana.
The Replica Location Service In wide area computing systems, it is often desirable to create copies (replicas) of data objects. Replication can be used.
30-31 Jan 2003J G Jensen, RAL/WP5 Storage Elephant Grid Access to Mass Storage.
WP2: Data Management Gavin McCance University of Glasgow November 5, 2001.
WP2: Data Management Gavin McCance University of Glasgow.
EU DataGrid TestBed 2 Component Review Paul Millar (University of Glasgow) (slides based on a presentation by Erwin Laure)
Metadata Progress GridPP18 20 March 2007 Mike Kenyon.
ATLAS/LHCb GANGA DEVELOPMENT Introduction Requirements Architecture and design Interfacing to the Grid Ganga prototyping A. Soroko (Oxford), K. Harrison.
WP2 and GridPP UK Simulation W. H. Bell University of Glasgow EDG – WP2.
A PPARC funded project AstroGrid Framework Consortium meeting, Dec 14-15, 2004 Edinburgh Tony Linde Programme Manager.
The National Grid Service and OGSA-DAI Mike Mineter
Data Management Expert Panel - WP2. WP2 Overview.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
Andrew McNab - Manchester HEP - 2 May 2002 Testbed and Authorisation EU DataGrid Testbed 1 Job Lifecycle Software releases Authorisation at your site Grid/Web.
EU 2nd Year Review – Jan – Title – n° 1 WP1 Speaker name (Speaker function and WP ) Presentation address e.g.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
DataGrid is a project funded by the European Commission under contract IST WP2 – R2.1 Overview of WP2 middleware as present in EDG 2.1 release.
Andrew McNab - EDG Access Control - 14 Jan 2003 EU DataGrid security with GSI and Globus Andrew McNab University of Manchester
Holding slide prior to starting show. Supporting Collaborative Working of Construction Industry Consortia via the Grid - P. Burnap, L. Joita, J.S. Pahwa,
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
RLS Production Services Maria Girone PPARC-LCG, CERN LCG-POOL and IT-DB Physics Services 10 th GridPP Meeting, CERN, 3 rd June What is the RLS -
N° 1 LCG EDG Data Management Catalogs in LCG James Casey LCG Fellow, IT-DB Group, CERN
1 Grid services based architectures Growing consensus that Grid services is the right concept for building the computing grids; Recent ARDA work has provoked.
Distributed Heterogeneous Data Warehouse For Grid Analysis
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
OSG End User Tools Overview OSG Grid school – March 19, 2009 Marco Mambelli - University of Chicago A brief summary about the system.
GridPP9 – 5 February 2004 – Data Management DataGrid is a project funded by the European Union GridPP is funded by PPARC WP2+5: Data and Storage Management.
C Copyright © 2009, Oracle. All rights reserved. Appendix C: Service-Oriented Architectures.
OSG Middleware Roadmap Rob Gardner University of Chicago OSG / EGEE Operations Workshop CERN June 19-20, 2006.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
DataGrid is a project funded by the European Union CHEP 2003 – March 2003 – Next Generation Data Mgmt... – n° 1 James Casey CERN
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
1 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre The Data Portal Glen Drinkwater.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
1 WP2: Data Management Gavin McCance RAL Middleware Workshop 24 February 2003.
Author - Title- Date - n° 1 Partner Logo WP5 Summary Paris John Gordon WP5 6th March 2002.
Part Four: The LSC DataGrid Part Four: LSC DataGrid A: Data Replication B: What is the LSC DataGrid? C: The LSCDataFind tool.
09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.
Ákos FROHNER – DataGrid Security n° 1 Security Group D7.6 Design Ideas
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Replica Management Services in the European DataGrid Project Work Package 2 European DataGrid.
Overview of Privilege Project at Fermilab (compilation of multiple talks and documents written by various authors) Tanya Levshina.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
Data Management GridPP and EDG Gavin McCance University of Glasgow May 9, 2002
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
AliEn AliEn at OSC The ALICE distributed computing environment by Bjørn S. Nilsen The Ohio State University.
Database authentication in CORAL and COOL Database authentication in CORAL and COOL Giacomo Govi Giacomo Govi CERN IT/PSS CERN IT/PSS On behalf of the.
1 AHM, 2–4 Sept 2003 e-Science Centre GRID Authorization Framework for CCLRC Data Portal Ananta Manandhar.
JAliEn Java AliEn middleware A. Grigoras, C. Grigoras, M. Pedreira P Saiz, S. Schreiner ALICE Offline Week – June 2013.
GRID Security & DIRAC A. Casajus R. Graciani A. Tsaregorodtsev.
DGC Paris Spitfire A Relational DB Service for the Grid Leanne Guy Peter Z. Kunszt Gavin McCance William Bell European DataGrid Data Management.
VOX Project Status T. Levshina. 5/7/2003LCG SEC meetings2 Goals, team and collaborators Purpose: To facilitate the remote participation of US based physicists.
LHCC Referees Meeting – 28 June LCG-2 Data Management Planning Ian Bird LHCC Referees Meeting 28 th June 2004.
Current Globus Developments Jennifer Schopf, ANL.
Storage Element Security Jens G Jensen, WP5 Barcelona, May 2003.
Jean-Philippe Baud, IT-GD, CERN November 2007
Manuel Brugnoli, Elisa Heymann UAB
Gavin McCance University of Glasgow GridPP2 Workshop, UCL
Patrick Dreher Research Scientist & Associate Director
Grid Data Replication Kurt Stockinger Scientific Data Management Group Lawrence Berkeley National Laboratory.
Presentation transcript:

1 WP2: Data Management Paul Millar eScience All Hands Meeting September

2 eScience -All Hands Meeting 2-4 September 2003 Introduction EDG in third and final year of project. 1 st Generation of tools provided a very good base for input. 2 nd Generation designed for modularity and to allow evolution. Java based services (using either Tomcat or Oracle 9iAS) Interface design defined in WSDL Client stubs for Java, C/C++ using AXIS and gSOAP Persistent service data is stored mySQL or Oracle. Replication service framework (RM, RLS, RMC, ROS) Java Security Package

3 eScience -All Hands Meeting 2-4 September 2003 Replication Service Framework User Replica Management Service Core Optimisation Security Collections Sessions Subscriptions Consistency Processing Replica Location Service MetaData Catalogue Transport Replica Selection Access History Replication Initiation

4 eScience -All Hands Meeting 2-4 September 2003 Interaction with services Internal Replica Location Service (RLS) Replica Metadata Catalogue (RMC) Replica Optimisation Service (ROS) External Relational Grid Monitoring Architecture (R-GMA) Globus C-based libraries, as well as CoG EDG network monitoring services. EDG-SE services.

5 eScience -All Hands Meeting 2-4 September 2003 Replica Location Service Maintains a (possibly distributed) catalogue of files: 1 file maps to potentially many replicas. Need to keep track of file location and consistently updated. RLS stores one-to-many relations between GUID and Physical File Names (PFNs). Two-level design: LRC (Local Replica Catalogue) and RLI (Replica Location Index). LRC contains a list of GUID to PFNs. RLI contains GUID to LRC mappings. RLS will operate with just an LRC. EDG2.0 operation LRCs publish Bloom filter objects: compact form of representing a set. May contain false +ve, but not false -ve.

6 eScience -All Hands Meeting 2-4 September 2003 RLS Demo at SC2002

7 eScience -All Hands Meeting 2-4 September 2003 Replica MetaData Catalogue RLS provides GUID to PFN mapping, but GUID isn't user friendly. RMC provides metadata on a per GUID basis. One such metadata is a Logical File Name, LFN. A GUID may have many LFNs associated. RMC is also capable of storing other metadata, such as file size, date of creation, owner... User-defined metadata can also be stored, and searched against.

8 eScience -All Hands Meeting 2-4 September 2003 LFNs, PFNs, GUIDs GUID ASSDF PFN1, Glasgow PFN2, CERN PFN3, Lyon Replica Location Service LFN1 LFN2 LFN3 Replica Meta-data Catalogue

9 eScience -All Hands Meeting 2-4 September 2003 Replica Optimisation Service Early TB1, getBestFile absent. Now available: select the best replica of several available. Light-weight web service gathers information from network monitoring service and Storage Element services. Resource Broker (meta-Scheduler) decides on which CE a job will run. ROS treat files mentioned in JDL as hints, returning an access cost for a given array of potential CEs, allowing RB to rank based on availability of data. Most research-oriented task. OptorSim developed to test replica optimization ideas.

10 eScience -All Hands Meeting 2-4 September 2003 Security Provided by separate Java package. Covers Authentication coarse-grain authorisation. Aim to be as flexible as possible. Investigating collaboration with Liberty Alliance – a consortium developing standards and solutions for federated identity.

11 eScience -All Hands Meeting 2-4 September 2003 Authentication Extends normal Java SSL. Mutual authentication in SSL happens by exchanging public certificates signed by mutually trusted CAs, and crypto challenges Uses proxy certificates. Accepts GSI proxies as the authentication method Supports GSI proxy loading and reloading Supports OpenSSL certificate-private key loading Supports CRLs with periodic reloading Integrates with Tomcat and Jakarta AXIS SOAP framework Proxy doesn't have to be signed by CA, but has to start with DN of the user's certificate.

12 eScience -All Hands Meeting 2-4 September 2003 Coarse grain authorisation Coarse-grain means the server decides what access to grant before the request is processed: role based. Modular design for client-server interaction. SOAP and HTTP web traffic already written. Modular configuration. Currently configuration modules exist for XML and text file (the gridmap file). Integration work with Virtual Organisation Membership Service (VOMS). This allows authorisation on per-VO basis, without gridmap files.

13 eScience -All Hands Meeting 2-4 September 2003 Conclusions The 2 nd generation of data management services has been written based on the Web-services paradigm. We have chosen an extensible service framework. This will allow the adoption of upcoming OGSA standards. Our choice of software is based on our aim of supporting both high- availability commercial products and standard Open-Source solutions. The 2 nd Generation of WP2 software is currently being rolled out in production systems as part of the 2.0 release of EDG Software. Integration of additional services (such as full RLS and VOMS) are being scheduled.