Replica/File Catalog based on RAM SUNY

Slides:



Advertisements
Similar presentations
Dynamic Grid Optimisation TERENA Conference, Lijmerick 5/6/02 A. P. Millar University of Glasgow.
Advertisements

Author - Title- Date - n° 1 GDMP The European DataGrid Project Team
Towards the Design and Implementation of the DAME prototype: OGSA Compliant Grid Services on the White Rose Grid Sarfraz A Nadeem University of Leeds.
Data Management for Physics Analysis in PHENIX (BNL, RHIC) Evaluation of Grid architecture components in PHENIX context Barbara Jacak, Roy Lacey, Saskia.
Magda – Manager for grid-based data Wensheng Deng Physics Applications Software group Brookhaven National Laboratory.
Presented by Mina Haratiannezhadi 1.  publishing, editing and modifying content  maintenance  central interface  manage workflows 2.
GRID COMPUTING: REPLICATION CONCEPTS Presented By: Payal Patel.
Introduction to Interactive Media 05: How the Web Works: Domain names, Hosting and Fundamentals of HTML.
Tech talk 20th June Andrey Grid architecture at PHENIX Job monitoring and related stuff in multi cluster environment.
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center National Partnership for Advanced.
MySQL and GRID Gabriele Carcassi STAR Collaboration 6 May Proposal.
Grid Status - PPDG / Magda / pacman Torre Wenaus BNL U.S. ATLAS Physics and Computing Advisory Panel Review Argonne National Laboratory Oct 30, 2001.
File and Object Replication in Data Grids Chin-Yi Tsai.
PPDG and ATLAS Particle Physics Data Grid Ed May - ANL ATLAS Software Week LBNL May 12, 2000.
PNPI HEPD seminar 4 th November Andrey Shevel Distributed computing in High Energy Physics with Grid Technologies (Grid tools at PHENIX)
LCG Middleware Testing in 2005 and Future Plans E.Slabospitskaya, IHEP, Russia CERN-Russia Joint Working Group on LHC Computing March, 6, 2006.
CHEP Sep Andrey PHENIX Job Submission/Monitoring in transition to the Grid Infrastructure Andrey Y. Shevel, Barbara Jacak,
BaBar Data Distribution using the Storage Resource Broker Adil Hasan, Wilko Kroeger (SLAC Computing Services), Dominique Boutigny (LAPP), Cristina Bulfon.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Datasets on the GRID David Adams PPDG All Hands Meeting Catalogs and Datasets session June 11, 2003 BNL.
HEPD sem 14-Dec Andrey History photos: A. Shevel reports on CSD seminar about new Internet facilities at PNPI (Jan 1995)
MAGDA Roger Jones UCL 16 th December RWL Jones, Lancaster University MAGDA  Main authors: Wensheng Deng, Torre Wenaus Wensheng DengTorre WenausWensheng.
Magda Distributed Data Manager Status Torre Wenaus BNL ATLAS Data Challenge Workshop Feb 1, 2002 CERN.
David Adams ATLAS ADA, ARDA and PPDG David Adams BNL June 28, 2004 PPDG Collaboration Meeting Williams Bay, Wisconsin.
Andrey Meeting 7 October 2003 General scheme: jobs are planned to go where data are and to less loaded clusters SUNY.
Globus Replica Management Bill Allcock, ANL PPDG Meeting at SLAC 20 Sep 2000.
PHENIX and the data grid >400 collaborators Active on 3 continents + Brazil 100’s of TB of data per year Complex data with multiple disparate physics goals.
Common Use Cases for a HEP Common Architecture Layer J. Templon, NIKHEF/WP8.
BNL ATLAS Database service update Yuri Smirnov, Iris Wu BNL, USA LCG Database Deployment and Persistency Workshop, CERN, Geneva October 17-19, 2005.
SkimData and Replica Catalogue Alessandra Forti BaBar Collaboration Meeting November 13 th 2002 skimData based replica catalogue RLS (Replica Location.
1 J. Keller, R. Naues: A Collaborative Virtual Computer Security Lab Amsterdam,Dec 4, 2006 Amsterdam, DEC 4, 2006 Jörg Keller FernUniversität in Hagen,
RUBRIC IP1 Ruben Botero Web Design III. The different approaches to accessing data in a database through client-side scripting languages. – On the client.
PPDG update l We want to join PPDG l They want PHENIX to join NSF also wants this l Issue is to identify our goals/projects Ingredients: What we need/want.
Replica Consistency in a Data Grid1 IX International Workshop on Advanced Computing and Analysis Techniques in Physics Research December 1-5, 2003 High.
January 26, 2003Eric Hjort HRMs in STAR Eric Hjort, LBNL (STAR/PPDG Collaborations)
PHENIX and the data grid >400 collaborators 3 continents + Israel +Brazil 100’s of TB of data per year Complex data with multiple disparate physics goals.
ATLAS Magda Distributed Data Manager Torre Wenaus BNL PPDG Robust File Replication Meeting Jefferson Lab January 10, 2002.
Replica Management Kelly Clynes. Agenda Grid Computing Globus Toolkit What is Replica Management Replica Management in Globus Replica Management Catalog.
Managing Data DIRAC Project. Outline  Data management components  Storage Elements  File Catalogs  DIRAC conventions for user data  Data operation.
Magda Distributed Data Manager Prototype Torre Wenaus BNL September 2001.
Data Management The European DataGrid Project Team
Testing the HEPCAL use cases J.J. Blaising, F. Harris, Andrea Sciabà GAG Meeting April,
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
Grid Status - PPDG / Magda / pacman Torre Wenaus BNL DOE/NSF Review of US LHC Software and Computing Fermilab Nov 29, 2001.
Mag Lev Vehicles Case Study #2 “Magnetic Levitation Transportation” Northern Highlands Regional High School Applied Technology Department Real World Engineering.
10 March Andrey Grid Tools Working Prototype of Distributed Computing Infrastructure for Physics Analysis SUNY.
DB Questions and Answers open session (comments during session) WLCG Collaboration Workshop, CERN Geneva, 24 of April 2008.
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
VO Experiences with Open Science Grid Storage OSG Storage Forum | Wednesday September 22, 2010 (10:30am)
Magda Distributed Data Manager Torre Wenaus BNL October 2001.
BESIII data processing
Pegasus WMS Extends DAGMan to the grid world
U.S. ATLAS Grid Production Experience
Simone Campana CERN IT-ES
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Walter Binder Giovanna Di Marzo Serugendo Jarle Hulaas
Distributed Data Access and Resource Management in the D0 SAM System
Sergio Fantinel, INFN LNL/PD
Building A Web-based University Archive
LQCD Computing Operations
Who Is The Best Player In Major League Baseball?
Status of MC production on the grid
Stephen Burke, PPARC/RAL Jeff Templon, NIKHEF
PBAN’s Use of SharePoint
Final Exam Review CSE487/587.
Software - Operating Systems
ATLAS DC2 & Continuous production
Frieda meets Pegasus-WMS
Introduction To Building a Web Site
Brief Multi-Band Explanation Conversion of Raw Data to FSL/AFNI Format
Presentation transcript:

Replica/File Catalog based on MAGDA @ RAM Data @ SUNY Grid Tools Replica/File Catalog based on MAGDA @ RAM Data @ SUNY Shevel@ram0.i2net.sunysb.edu 10 Dec 2002

Our needs in Replica/File Catalog Our Conditions Relatively small physics team (about 10 persons). Needs to keep some book-keeping to keep information about our files (mainly flat files) in different locations (in our computers, at BNL, etc.). Expected total number of files is about 10**5 – 10**6 (now is about 2*10**4) Needs to keep the catalog more or less up to date. 10 Dec 2002 shevel@ram0.i2net.sunysb.edu

Our needs in Replica/File Catalog (continuation) Our Conditions From time to time we need to transfer a group of files (from about 10**2 to 10**4 files) in between different locations (in between SUNY and BNL). Apparently we need to keep newly copied files in Replica/File Catalog. Some trace of our data transfers is required as well. Finally we need to have program system or several stable systems for all of mentioned tasks. 10 Dec 2002 shevel@ram0.i2net.sunysb.edu

MAGDA Several replication systems are available now for High Energy Physics. We are testing MAGDA. MAnager for Grid-based DAta = MAGDA (http://www.atlasgrid.bnl.gov/magdadoc/userguide.htm). MAGDA consists of many scripts in perl, kernel of the catalog is based on MySQL. 10 Dec 2002 shevel@ram0.i2net.sunysb.edu

General terms REPLICA - a COPY of a file in DIFFERENT PLACE to MAIN LOCATION. MAIN REPLICA (Master COPY, MAIN COPY) – file in MAIN LOCATION. A DIFFERENT PLACE - might mean different directory, different media or different host. VIRTUAL ORGANIZATION (VO) - name of any group: one person, small team, detector group, University team, collaboration. 10 Dec 2002 shevel@ram0.i2net.sunysb.edu

General terms (continuation) FULL LOGICAL FILE NAME - means line in format: lfn:/VO/short_logical_file_name SHORT_LOGICAL_FILE_NAME – name. This name is the same in spite of location. PHYSICAL FILE NAME - means line in format pfn:/HOST/PATH/file-name 10 Dec 2002 shevel@ram0.i2net.sunysb.edu

General terms (continuation) CONSISTENCY - means that File/Replica Catalog does contain up to date information. 10 Dec 2002 shevel@ram0.i2net.sunysb.edu

MAGDA terms COLLECTION – group of files. TASK – operation of replication of the COLLECTION. SITE – the place where data are. LOCATION – precise directory where data are. HOST – host where MAGDA clients might be run 10 Dec 2002 shevel@ram0.i2net.sunysb.edu

RAM Prototype for File/Replica Catalog Now we have to go to see real prototype The prototype is located at http://ram3.chem.sunysb.edu/magda/dyShowMain.pl 10 Dec 2002 shevel@ram0.i2net.sunysb.edu

CONCLUSION We are playing now with the prototype of the File/Replica Catalog. First opinion it is very useful tool understand better what do we need in reality. We discovered that such catalog is very helpful for any kind of files (data, programs, papers, pictures, etc.). 10 Dec 2002 shevel@ram0.i2net.sunysb.edu