Author - Title- Date - n° 1 GDMP The European DataGrid Project Team

Slides:



Advertisements
Similar presentations
© 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.
Advertisements

WP2: Data Management Gavin McCance University of Glasgow November 5, 2001.
OptorSim: A Replica Optimisation Simulator for the EU DataGrid W. H. Bell, D. G. Cameron, R. Carvajal, A. P. Millar, C.Nicholson, K. Stockinger, F. Zini.
MyProxy Guy Warner NeSC Training.
LHCb(UK) Meeting Glenn Patrick1 LHCb Grid Activities in UK LHCb(UK) Meeting Cambridge, 10th January 2001 Glenn Patrick (RAL)
1 CHEP 2000, Roberto Barbera Tests of data management services in EDG 1.2 ALICE Off-line Week,
NIKHEF Testbed 1 Plans for the coming three months.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
CMS HLT production using Grid tools Flavia Donno (INFN Pisa) Claudio Grandi (INFN Bologna) Ivano Lippi (INFN Padova) Francesco Prelz (INFN Milano) Andrea.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
GRID workload management system and CMS fall production Massimo Sgaravatto INFN Padova.
DataGrid is a project funded by the European Union CHEP 2003 – March 2003 – Title – n° 1 Grid Data Management in Action Experience in Running and.
GRID DATA MANAGEMENT PILOT (GDMP) Asad Samar (Caltech) ACAT 2000, Fermilab October , 2000.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
GRID Workload Management System Massimo Sgaravatto INFN Padova.
Globus activities within INFN Massimo Sgaravatto INFN Padova for the INFN Globus group
Workload Management Massimo Sgaravatto INFN Padova.
First steps implementing a High Throughput workload management system Massimo Sgaravatto INFN Padova
Status of Globus activities within INFN (update) Massimo Sgaravatto INFN Padova for the INFN Globus group
Vladimir Litvin, Harvey Newman Caltech CMS Scott Koranda, Bruce Loftis, John Towns NCSA Miron Livny, Peter Couvares, Todd Tannenbaum, Jamie Frey Wisconsin.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
A Grid Computing Use case Datagrid Jean-Marc Pierson.
File and Object Replication in Data Grids Chin-Yi Tsai.
DataGrid is a project funded by the European Union VisualJob Demonstation EDG 1.4.x 2003 The EU DataGrid How the use of distributed resources can help.
1 Grid Related Activities at Caltech Koen Holtman Caltech/CMS PPDG meeting, Argonne July 13-14, 2000.
The huge amount of resources available in the Grids, and the necessity to have the most up-to-date experimental software deployed in all the sites within.
4/20/02APS April Meeting1 Database Replication at Remote sites in PHENIX Indrani D. Ojha Vanderbilt University (for PHENIX Collaboration)
Data Grid projects in HENP R. Pordes, Fermilab Many HENP projects are working on the infrastructure for global distributed simulated data production, data.
MAGDA Roger Jones UCL 16 th December RWL Jones, Lancaster University MAGDA  Main authors: Wensheng Deng, Torre Wenaus Wensheng DengTorre WenausWensheng.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Author - Title- Date - n° 1 Partner Logo WP5 Summary Paris John Gordon WP5 6th March 2002.
Magda Distributed Data Manager Status Torre Wenaus BNL ATLAS Data Challenge Workshop Feb 1, 2002 CERN.
Quick Introduction to NorduGrid Oxana Smirnova 4 th Nordic LHC Workshop November 23, 2001, Stockholm.
1 Process migration n why migrate processes n main concepts n PM design objectives n design issues n freezing and restarting a process n address space.
Replica Consistency in a Data Grid1 IX International Workshop on Advanced Computing and Analysis Techniques in Physics Research December 1-5, 2003 High.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
Stephen Burke – Data Management - 3/9/02 Partner Logo Data Management Stephen Burke, PPARC/RAL Jeff Templon, NIKHEF.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
UK Grid Meeting Glenn Patrick1 LHCb Grid Activities in UK Grid Prototype and Globus Technical Meeting QMW, 22nd November 2000 Glenn Patrick (RAL)
AliEn AliEn at OSC The ALICE distributed computing environment by Bjørn S. Nilsen The Ohio State University.
Oracle to MySQL synchronization Gianni Pucciani CERN, University of Pisa.
Research data management using Globus ESIP Summer Meeting 2015 Rachana Ananthakrishnan University of Chicago
Data Management The European DataGrid Project Team
Data Management The European DataGrid Project Team
GIIS Implementation and Requirements F. Semeria INFN European Datagrid Conference Amsterdam, 7 March 2001.
An Active Security Infrastructure for Grids Stuart Kenny*, Brian Coghlan Trinity College Dublin.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
Dynamic staging to a CAF cluster Jan Fiete Grosse-Oetringhaus, CERN PH/ALICE CAF / PROOF Workshop,
StoRM: status report A disk based SRM server.
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
INFSO-RI Enabling Grids for E-sciencE Padova site report Massimo Sgaravatto On behalf of the JRA1 IT-CZ Padova group.
WP2: Data Management Gavin McCance University of Glasgow.
ATLAS DDM Developing a Data Management System for the ATLAS Experiment September 20, 2005 Miguel Branco
The EDG Testbed Deployment Details
U.S. ATLAS Grid Production Experience
Workload Management System ( WMS )
The European DataGrid Project Team
LCG middleware and LHC experiments ARDA project
Stephen Burke, PPARC/RAL Jeff Templon, NIKHEF
A Web-Based Data Grid Chip Watson, Ian Bird, Jie Chen,
Initial job submission and monitoring efforts with JClarens
The EU DataGrid Data Management
The EU DataGrid Genius Data Management Services
The GENIUS Security Services
Wide Area Workload Management Work Package DATAGRID project
Presentation transcript:

Author - Title- Date - n° 1 GDMP The European DataGrid Project Team

EDG GDMP Tutorial - n° 2  based on CMS requirements for replicating Objectivity files for High Level Trigger studies  production prototype project for evaluating Grid technologies (especially Globus)  experience for will directly be used in DataGrid n input also for PPDG and GriPhyN   implementation: Asad Samar, Heinz Stockinger

EDG GDMP Tutorial - n° 3 CMS DM Requirements n Data production (currently tens of Terabytes). n Managing the data locally at regional centres. n Replicating this data to other centres. s High speed transfers. s Secure access. s Disk management facilities s Minimizing human interference. s Fault tolerance and error recovery mechanisms. n Data integration on the destination. n Logging and book-keeping. Submit job Replicate data Replicate data Site A Site B Site C " Jobs are executed locally or remotely " Data is always written locally " Data is replicated to remote sites Job writes data locally

EDG GDMP Tutorial - n° 4 Subscription Model n All the sites that subscribe to a particular site get notified whenever there is an update in its catalog. n The sites that don’t subscribe have to poll themselves for any changes in the catalog. n Support both a pull and a push mechanism. Site 1 Site 3 Site 2 Subscriber list Subscriber list subscribe poll for changes

EDG GDMP Tutorial - n° 5 Export / Import Catalogue n Export Catalog s information about the new files produced. s is published n Import Catalog s information about the files which have been published by other sites but not yet transferred locally s As soon as the file is transferred locally, it is removed from the import catalogue. n Possible to pull the information about new files in your import catalogue. Site 1 Site 3 export catalog import catalog Site 2 export catalog 1) publish new files 2) transfer files 1) get info about new files 3) delete files

EDG GDMP Tutorial - n° 6 Usage  gdmp_server n server running at each site  gdmp_host_subscribe n first thing to be done by a site  gdmp_publish_catalogue n send information of newly created files to subscribed hosts (no real data transfer)  gdmp_replicate_file_get n get all the files from the import catalogue  gdmp_get_catalogue n creates a list of missing files; for error recovery

EDG GDMP Tutorial - n° 7 GDMP File Transfer Steps GDMP Site B GDMP Site A Host subscribe(0) Exchange of Globus Certificate Subject(-1) Trigger Publish files(1.1a) Publish Request(1.1b) Publish list of all files(1.2b) Publish list of new files(1.2a) Transfer required files(2) MSS Stage Migrate Staging request Replicate Catalog * Steps <=0 are done once per GDMP server Staging done (3)Post- processing OBJY Catalog

EDG GDMP Tutorial - n° 8 Using GDMP Site 2 Site 1 Site 3 Site 4 Site 5 Data produced at site 1 to be replicated to other sites

EDG GDMP Tutorial - n° 9 Using GDMP 2  Start with subscription n gdmp_host_subscribe –remotehost -remoteport Site 2 Site 1 Site 3 Site 4 Site 5 gdmp_host_subscribe Subscriber list

EDG GDMP Tutorial - n° 10 Using GDMP 3  Publish new files…can combine with filtering n gdmp_publish_catalogue n gdmp_filter_catalog-export Site 2 Site 1 Site 3 Site 4 Site 5 Subscriber list gdmp_publish_catalogue Export catalog Import catalog Import catalog Import catalog

EDG GDMP Tutorial - n° 11 Site 2 Site 1 Site 3 Site 4 Site 5 Subscriber list Export catalog Import catalog Import catalog Import catalog gdmp_get_catalogue Import catalog Using GDMP 4  Poll for change in catalog (pull model)…can combine with filtering…also used for error recovery. n gdmp_get_catalogue –host n gdmp_filter_catalog -import

EDG GDMP Tutorial - n° 12 Site 2 Site 1 Site 3 Site 4 Site 5 Subscriber list Export catalog Import catalog Import catalog Import catalog Import catalog gdmp_replicate_file_get Using GDMP 5  Transfer files…can use the progress meter n gdmp_replicate_file_get n get_progress_meter…produces a progress.log. n replica.log has all files already transferred.