Performance measurement of transferring files on the federated SRB

Slides:



Advertisements
Similar presentations
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
Advertisements

OGF-23 iRODS Metadata Grid File System Reagan Moore San Diego Supercomputer Center.
The Storage Resource Broker and.
The Storage Resource Broker and.
1 GridTorrent Framework: A High-performance Data Transfer and Data Sharing Framework for Scientific Computing.
Data Grid: Storage Resource Broker Mike Smorul. SRB Overview Developed at San Diego Supercomputing Center. Provides the abstraction mechanisms needed.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Particle Physics Data Grid PPDG Data Handling System Reagan.
San Diego Supercomputer CenterNational Partnership for Advanced Computational Infrastructure1 Grid Based Solutions for Distributed Data Management Reagan.
High Performance Computing Course Notes Grid Computing.
1 Configuring Internet- related services (April 22, 2015) © Abdou Illia, Spring 2015.
How’s My Network (HMN)? A Java approach to Home Network Measurement Alan Ritacco, Craig Wills, and Mark Claypool Computer Science Department Worcester.
Applying Data Grids to Support Distributed Data Management Storage Resource Broker Reagan W. Moore Ian Fisk Bing Zhu University of California, San Diego.
Understanding Networks I. Objectives Compare client and network operating systems Learn about local area network technologies, including Ethernet, Token.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 7 Configuring File Services in Windows Server 2008.
Data Grid Interactions with Firewalls Michael Wan Reagan Moore SDSC/UCSD/NPACI.
Getting Connected to NGS while on the Road… Donna V. Shaw, NGS Convocation.
11 REVIEWING MICROSOFT ACTIVE DIRECTORY CONCEPTS Chapter 1.
1 SAMBA. 2 Module - SAMBA ♦ Overview The presence of diverse machines in the network environment is natural. So their interoperability is critical. This.
Guide to Linux Installation and Administration, 2e1 Chapter 3 Installing Linux.
IRODS performance test and SRB system at KEK Yoshimi KEK Building data grids with iRODS 27 May 2008.
MCAT: A Metadata Catalog San Diego Supercomputing Center Part of the Storage Resource Broker (SRB)
SRB system at Belle/KEK Yoshimi Iida CHEP 04, Interlaken 29 September 2004.
Linux+ Guide to Linux Certification Chapter Fifteen Linux Networking.
Network Tests at CHEP K. Kwon, D. Han, K. Cho, J.S. Suh, D. Son Center for High Energy Physics, KNU, Korea H. Park Supercomputing Center, KISTI, Korea.
File and Object Replication in Data Grids Chin-Yi Tsai.
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
BZUPAGES.COM. What is a VPN VPN is an acronym for Virtual Private Network. A VPN provides an encrypted and secure connection "tunnel" path from a user's.
BaBar Data Distribution using the Storage Resource Broker Adil Hasan, Wilko Kroeger (SLAC Computing Services), Dominique Boutigny (LAPP), Cristina Bulfon.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Archive for the NSDL Reagan W. Moore Charlie Cowart.
PC clusters in KEK A.Manabe KEK(Japan). 22 May '01LSCC WS '012 PC clusters in KEK s Belle (in KEKB) PC clusters s Neutron Shielding Simulation cluster.
Retina Network Security Scanner
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Introduction to The Storage Resource.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
The Storage Resource Broker and.
ITE PC v4.0 Chapter 8 1 © 2007 Cisco Systems, Inc. All rights reserved.Cisco Public  Networks are systems that are formed by links.  People use different.
Hiroyuki Matsunaga (Some materials were provided by Go Iwai) Computing Research Center, KEK Lyon, March
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
Enabling Grids for E-sciencE EGEE-II INFSO-RI Status of SRB/SRM interface development Fu-Ming Tsai Academia Sinica Grid Computing.
Chapter 7: Using Network Clients The Complete Guide To Linux System Administration.
SRB at KEK Yoshimi Iida, Kohki Ishikawa KEK – CC-IN2P3 Meeting on Grids at Lyon September 11-13, 2006.
Getting Connected to NGS while on the Road…
Status of WLCG FCPPL project
Computing Clusters, Grids and Clouds Globus data service
Guide to Linux Installation and Administration, 2e
Clouds , Grids and Clusters
The Data Grid: Towards an architecture for Distributed Management
File System Implementation
Global Catalog and Flexible Single Master Operations (FSMO) Roles
Status and Plans on GRID related activities at KEK
The transfer performance of iRODS between CC-IN2P3 and KEK
Introduction to CVMFS A way to distribute HEP software on cloud
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
GGF15 – Grids and Network Virtualization
Interoperability of Digital Repositories
Arcot Rajasekar Michael Wan Reagan Moore (sekar, mwan,
Getting Connected to NGS while on the Road…
Information Technology Ms. Abeer Helwa
Global Catalog and Flexible Single Master Operations (FSMO) Roles
GridTorrent Framework: A High-performance Data Transfer and Data Sharing Framework for Scientific Computing.
Creating and Managing Folders
The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets A.Chervenak, I.Foster, C.Kesselman, C.Salisbury,
STATEL an easy way to transfer data
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Performance measurement of transferring files on the federated SRB KEK Computing Research Center Yoshimi Iida

Outline The Belle experiment at KEK What is SRB? HEP Data Grid workshop Performance measurement Transfer measurement between sites in the Belle SRB federation Using WAN emulator Conclusions 29 April 2005 ISGC 2005

The Belle experiment at KEK The Belle experiment at KEK is one of several ongoing large experimental collaborations The accumulated data up to now is about 2PB including simulation data for analysis The large amount of data should be analyzed promptly to get statistically improved data for exploring new physics results Large numbers of files need to be managed consistently and shared easily among the 400 collaborators 29 April 2005 ISGC 2005

What is SRB? SRB provides a uniform interface for connecting to heterogeneous data resources and accessing data sets SRB, in conjunction with the Metadata Catalog (MCAT), provides a way to access data sets and resources based on logical attributes http://www.sdsc.edu/srb/ 29 April 2005 ISGC 2005

SRB federation SRB zone Federated MCAT consist of one or more SRB servers along with one MCAT Federated MCAT allow users to access resources and data sets across zones Server 1.1 MCAT 1 Server 1.2 MCAT 3 Server 3.1 MCAT 2 Server 2.1 Server 2.2 29 April 2005 ISGC 2005

Logical file system Single SRB system / - Zone1/ - container/ - home/ - srbUserA.domain/ - srbUserB.domain/ data.txt - styles/ - trash/ Federated SRB system / - Zone1/ - container/ - home/ - styles/ - trash/ - Zone2/ - Zone3/ : 29 April 2005 ISGC 2005

SRB data management Logical name space Data replication Mapping of each logical name to physical attributes UNIX like API and utilities for collections (directories) and data objects (files) Data replication replicate onto different resources 29 April 2005 ISGC 2005

User management Single Global User Name Space Uniquely identify by their usernames combined with their domain ‘iidayo@kek' Maintained in MCAT Single sign-on, access all resources No need for UNIX account at remote sites 29 April 2005 ISGC 2005

HEP Data Grid workshop Pre-workshop Workshop 1 - 3 December, 2004 Participants from 8 institutes at 7 countries SDSC (US) Australia National U., U. Melbourne (Australia) ASCC (Taiwan) KNU (Korea) IHEP (China) Krakow (Poland) KEK (Japan) Workshop 6 - 7 December, 2004 http://www-conf.kek.jp/hepdg/ 29 April 2005 ISGC 2005

SRB federated MCAT Zone IHEP (China) Zone ASCC (Taiwan) Zone KNU (Korea) 10Mbps Internet 100Mbps 100Mbps Zone ANU (Australia) Zone Krakow (Poland) 622Mbps 100Mbps Zone KEK (Japan) FW FW: Firewall M: MCAT enabled SRB server S: SRB server C: SRB client C S M C 29 April 2005 ISGC 2005

Belle SRB federation Install SRB server, a storage resource and MCAT at each site SRB version 3.2.1p Open the KEK firewall from the fixed IP address of the other sites After the installation, the functionality of the federation was tested and confirmed to be working 29 April 2005 ISGC 2005

Transfer measurement from KEK Copy files from KEK to the other sites 10 files from 0.5 to 100MB in size ‘Scp’ (SRB copy) use parallel I/O by default Configure to allocate one thread per 2MB of the file size up to a maximum of 16 threads (i.e. 32MB) 29 April 2005 ISGC 2005

Transfer rates from KEK Measured by ‘Scp’ 29 April 2005 ISGC 2005

Transfer performance between sites in the Belle SRB federation Source Destination KNU ASCC ANU Krakow IHEP KEK 9.65 5.89 3.06 2.69 0.59 - 5.68 4.45 4.47 0.61 7.54 2.84 5.09 0.64 4.79 2.53 3.44 0.58 1.21 1.64 4.19 0.60 0.70 0.39 0.44 0.25 Results of 100MB data file transfer in MB/s 29 April 2005 ISGC 2005

WAN emulation NIST Net Network emulation package Machine Specification NIST Net allows a single Linux PC set up as a router to emulate a wide variety of network conditions http://www-x.antd.nist.gov/nistnet/ Machine Specification CPU: Pentium4 3.2GHz Memory: 1GB OS: Red Hat Linux 8 NIC: GbE×2 Internet KEK network GbE GbE SRB server SRB server 100Mbps 100Mbps router NIST Net 29 April 2005 ISGC 2005

Transfer measurement with various file size Copy files on WAN emulation 8 files from 10 to 500MB in size Measure as a function of the RTT from 0 to 300ms ‘Scp’ use parallel I/O by default Configure to allocate one thread per 2MB of the file size up to a maximum of 16 threads Measure the bandwidth with iperf set the parallel client threads to 16 29 April 2005 ISGC 2005

Transfer performance with various file size iperf -P 16 Emulated by NIST Net 29 April 2005 ISGC 2005

Transfer performance on WAN (preliminary) BW 100Mbps WAN Emulation (NIST Net) KNU ASCC ANU Krakow IHEP Measured by Scp 29 April 2005 ISGC 2005

Bandwidth The nominal bandwidth between KEK and some sites are same 100Mbps as WAN emulation The actual bandwidth measured by iperf between KEK and other site is lower than WAN emulation e.g. 60Mbps for ASCC with iperf Further investigation is necessary 29 April 2005 ISGC 2005

Conclusions A federation of SRB servers was demonstrated successfully among ANU, KNU, IHEP, ASCC, Krakow and KEK The current results are preliminary and were obtained in the limited time available in the pre-workshop session The functionality of data sharing for analysis was proved to be stable and sufficient to be used by Belle collaboration 29 April 2005 ISGC 2005

Acknowledgements SDSC (San Diego Supercomputer Center) M. Wan ANU (Australia National University) S. J. McMahon University of Melbourne G. Moloney ASCC (Academia Sinica Computing Centre) H. Lin KNU (Kyungpook National University) K. Kwon IHEP (Institute of High Energy Physics) G. Chen Krakow (Institute of Nuclear Physics, Krakow) P. Lason and H. Palka Fujitsu S. Honma and T. Nakajima KEK (High Energy Accelerator Research Organization) Y. Karita, T. Sasaki, S. Y. Suzuki, S. Yashiro and Y. Watase 29 April 2005 ISGC 2005

Backups

‘Sls’ display data objects (file) or collections (directory) ‘SgetD’ display information about SRB data objects ‘SgetR’ display information about SRB resource This is an example. In SRB space, we can get the list of the SRB data object using Sls command. And SgetD command display information about SRB data object. So this is the information of testdata03.txt. We can see the two information with different SRB resources for one object. This is data replication. And user can get the information about the SRB resource using SgetR command. 29 April 2005 ISGC 2005

By the federated MCAT, the SRB zones maintained by each sites can see as a single tree structure directory. 29 April 2005 ISGC 2005