Roberto Barbera Prague, 12.12.2002 ALICE Multi-site Data Transfer Tests on a Wide Area Network Giuseppe Lo Re Roberto Barbera Work in collaboration with:

Presentation transcript:

Roberto Barbera, Prague
ALICE Multi-site Data Transfer Tests on a Wide Area Network
Giuseppe Lo Re, Roberto Barbera
Work in collaboration with: P. Cerello, D. Di Bari, G. Donvito (CMS), E. Fragiacomo, A. Fritz, M. Luvisetto, M. Masera, F. Minafra, D. Mura, S. Piano, M. Sitta, J. Švec, R. Turrisi
Contributions from GARR and INFN NetGroup: C. Allocchio, M. Campanella, L. Gaido, S. Lusso, M. Michelotto, S. Spanu, S. Zani, D. De Girolamo
CHEP 2004, 30 Sep 2004

Outline
- Objectives
- Preparation and benchmark
- Testbed layout and test results
- Conclusions

Objectives
- Check whether the current bandwidths can cope with the ALICE needs
- Spot possible bottlenecks in the point-to-point transfers (I/O → LAN → WAN → LAN → I/O)
- Check, with “real” numbers from “real” use cases, whether the bandwidth allocations foreseen for the near future are adequate
Diagram labels: I/O server, front-end router, front-end router, WAN, LAN
Parameters involved: disk I/O (write/read block size), TCP windows, number of streams, BDP = BW × RTT
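The relation BDP = BW × RTT quoted on this slide can be illustrated with a short calculation. This is only a minimal sketch: the function name and the example bandwidth and round-trip time are hypothetical values chosen for illustration, not measurements from the tests.

```python
def bdp_bytes(bandwidth_mbps: float, rtt_ms: float) -> float:
    """Bandwidth-delay product in bytes: BDP = BW * RTT."""
    bits_in_flight = bandwidth_mbps * 1_000_000 * rtt_ms / 1000.0
    return bits_in_flight / 8.0


# Hypothetical example values (NOT taken from the slides).
example_bw_mbps = 100.0
example_rtt_ms = 40.0

bdp = bdp_bytes(example_bw_mbps, example_rtt_ms)
print(f"BDP for {example_bw_mbps} Mb/s and {example_rtt_ms} ms: {bdp / 1e6:.2f} MB")
```

A TCP window smaller than this value cannot keep the path full with a single stream, which is why the window size and the number of parallel streams appear together among the parameters.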

Preparation and Benchmark
- Standard configuration of both the TCP stack and disk I/O parameters in Linux
- SSH keys exchanged among all machines to “secure” file transfers without typing passwords
- Automatic procedure installed on all machines with both Flat and Multi-Tier configurations

Testbed layout and “numbers”
- BA: 3 servers (2 ALICE, 1 CMS)
- BO: 6 servers
- CA: 2 servers
- CNAF: 2 servers
- CT: 2 servers
- PD: 6 servers
- TO: 2 servers
- TS: 1 server
- Prague: 1 server
- Houston: 1 server
Map labels: CNAF, Padova, Houston, Prague

Disk access measurements (non-reserved access, local disk)
Tools: Bonnie, IOzone

Machine                 Write (MBytes/s)   Read (MBytes/s)
boalice8.bo.infn.it     5                  5
server3.ca.infn.it      45                 61
aliserv10.ct.infn.it    27                 34
alifarm02.to.infn.it    40                 59
alifarm.ts.infn.it      28                 36

Machine                 Write (MBytes/s)   Read (MBytes/s)
boalice8.bo.infn.it     5                  3
server3.ca.infn.it      43                 32
aliserv10.ct.infn.it    57                 25
pcalice19.pd.infn.it    5                  5
alifarm02.to.infn.it    31                 53
alifarm.ts.infn.it      27                 34

Bandwidth measurements
Tools: Iperf, Netperf-2.1
Table columns (one table per tool): Machine, BW1 (Mb/s), BW2 (Mb/s), BW4 (Mb/s), BW8 (Mb/s), BW16 (Mb/s), BW32 (Mb/s)
Machines: boalice8.bo.infn.it, server3.ca.infn.it, aliserv10.ct.infn.it, pcalice19.pd.infn.it, alifarm02.to.infn.it, alifarm.ts.infn.it

GARR network status at the beginning
- Bari: 28 Mb/s (BGA: 16 Mb/s)
- Bologna: 32 Mb/s
- Cagliari: 8 Mb/s
- Catania: 34 Mb/s
- CNAF: 1024 Mb/s
- Padova: 155 Mb/s
- Torino: 155 Mb/s (BGA: 70 Mb/s)
- Trieste: 16 Mb/s

Network bandwidths (after the tests)
- Bari: 28 Mb/s (BGA: 16 Mb/s)
- Bologna: 100 Mb/s (BGA: 32 Mb/s)
- Cagliari: 32 Mb/s
- Catania: 34 Mb/s (direct connection to GARR-G in 6 months, up to 2.5 Gb/s)
- CNAF: 1024 Mb/s
- Padova: 155 Mb/s
- Torino: 155 Mb/s (BGA: 70 Mb/s)
- Trieste: 24 Mb/s

Flat test
Each server transfers files from/to any other server. In each iteration it:
- waits a random time, uniformly chosen between 0 and a customizable maximum (1 min and 5 min tried so far)
- chooses at random one of the other N-1 servers (with a weight proportional to the maximum bandwidth of the site that server belongs to)
- chooses at random one of three files with different sizes (1.6 GB, 0.8 GB, and 0.3 GB)
- sends the file back and forth using bbFTP with a customizable number of parallel streams (16 and 8 tried)
- checks whether any bits got lost and fills a detailed log file
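The selection logic of the flat test can be sketched as follows. This is an illustrative reconstruction only, not the procedure actually installed on the machines: the function name `plan_transfer` and the returned dictionary are hypothetical, the host names and site bandwidths are a subset copied from the other slides, and the bbFTP transfer and the integrity check themselves are not reproduced here.

```python
import random

# Subset of testbed hosts and their sites (names as they appear in the slides).
SERVERS = {
    "boalice8.bo.infn.it": "Bologna",
    "server3.ca.infn.it": "Cagliari",
    "aliserv10.ct.infn.it": "Catania",
    "pcalice19.pd.infn.it": "Padova",
    "alifarm02.to.infn.it": "Torino",
    "alifarm.ts.infn.it": "Trieste",
}

# Maximum site bandwidths in Mb/s ("GARR network status at the beginning").
SITE_BW_MBPS = {
    "Bologna": 32, "Cagliari": 8, "Catania": 34,
    "Padova": 155, "Torino": 155, "Trieste": 16,
}

FILE_SIZES_GB = (1.6, 0.8, 0.3)


def plan_transfer(source, max_wait_s=60.0, streams=16, rng=None):
    """Plan one flat-test iteration for `source`; no data is actually moved."""
    rng = rng or random.Random()
    # Random wait, uniform between 0 and the configured maximum.
    wait_s = rng.uniform(0.0, max_wait_s)
    # One of the other N-1 servers, weighted by the bandwidth of its site.
    others = [host for host in SERVERS if host != source]
    weights = [SITE_BW_MBPS[SERVERS[host]] for host in others]
    target = rng.choices(others, weights=weights, k=1)[0]
    # One of the three test files.
    size_gb = rng.choice(FILE_SIZES_GB)
    return {"source": source, "target": target, "wait_s": wait_s,
            "size_gb": size_gb, "streams": streams}


plan = plan_transfer("server3.ca.infn.it", rng=random.Random(1))
print(plan)
```

With these weights, servers at Padova and Torino are picked far more often than the one at Cagliari, so the load offered to each site scales with its link capacity.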

Selected results (Bologna): saturated! (official GARR NOC statistics)

Selected results (Cagliari): saturated! (official GARR NOC statistics)

Selected results (Catania): heavy traffic! (official GARR NOC statistics)

Multi-tier use case (HBT production, 5000 events, 9 TB)
Diagram labels: Tier-1: CNAF 60%; CT 20%, TO 20%; Tier TB; 1 MB in, 50 MB out; Tier-3/4: BA, BO, CA, PD, TS
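The headline numbers of this use case allow a quick back-of-the-envelope check. The sketch below rests on two assumptions that the slide does not state: that the 60% / 20% / 20% labels are the fractions of the 9 TB associated with CNAF, CT and TO, and that decimal units apply (1 TB = 1000 GB).

```python
TOTAL_TB = 9.0
EVENTS = 5000

# ASSUMPTION: the percentages on the diagram are shares of the total volume.
SHARES = {"CNAF": 0.60, "CT": 0.20, "TO": 0.20}

# Average data volume per event, in GB (decimal units assumed).
per_event_gb = TOTAL_TB * 1000.0 / EVENTS

# Volume per site under the share assumption, in TB.
volume_tb = {site: TOTAL_TB * share for site, share in SHARES.items()}

print(f"average size per event: {per_event_gb:.1f} GB")
for site, tb in volume_tb.items():
    print(f"{site}: {tb:.1f} TB")
```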

Results (official GARR NOC statistics)

Latest developments
TCP tuning to improve throughput
The participation of non-Italian sites (especially those with large RTTs) such as Prague and Houston has been useful to verify the effect of TCP tuning.
Table columns: Site, RTT (ms) from CNAF, BW (Mb/s) from CNAF, BDP (MB)
Table rows: Houston, Prague, Catania

bbFTP vs number of streams and TCP windows
Catania-CNAF, max bandwidth measured (iperf) = 25 Mb/s
Plot panels: 1 stream, 2 streams, 4 streams
Plot annotations: saturated; saturated also for small files

bbFTP vs number of streams and TCP windows
Houston-CNAF, max bandwidth measured (iperf) = 50 Mb/s
Plot panels: 1 stream, 2 streams, 4 streams, 6 streams
Plot annotation: saturated

bbFTP vs number of streams and TCP windows
Prague-CNAF, max bandwidth measured (iperf) = 250 Mb/s
Roberto Barbera, Dipartimento di Fisica dell’Università di Catania and INFN Catania, Italy, ALICE Collaboration
Plot panels: 1 stream, 2 streams
With a very high maximum buffer size (130 KB → 8 MB): 1 stream, 2 streams
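The effect of the maximum buffer size can be made concrete with the standard bound that a single TCP stream cannot deliver more than one window per round trip. The sketch below applies that bound to the two buffer sizes and the 250 Mb/s figure quoted above. The 20 ms round-trip time is a purely hypothetical value (the measured RTT is not available here), buffer sizes are taken as decimal bytes, and the function names are illustrative.

```python
import math


def window_limited_mbps(window_bytes: float, rtt_ms: float, streams: int = 1) -> float:
    """Upper bound on throughput in Mb/s: one window per RTT per stream."""
    return streams * window_bytes * 8.0 * 1000.0 / rtt_ms / 1e6


def streams_to_fill(link_mbps: float, window_bytes: float, rtt_ms: float) -> int:
    """Smallest number of parallel streams whose combined bound reaches the link rate."""
    return math.ceil(link_mbps / window_limited_mbps(window_bytes, rtt_ms))


LINK_MBPS = 250.0           # max bandwidth measured with iperf (from the slide)
HYPOTHETICAL_RTT_MS = 20.0  # ASSUMPTION: illustrative value, not a measurement

for label, window in (("130 KB", 130_000), ("8 MB", 8_000_000)):
    bound = window_limited_mbps(window, HYPOTHETICAL_RTT_MS)
    needed = streams_to_fill(LINK_MBPS, window, HYPOTHETICAL_RTT_MS)
    print(f"buffer {label}: at most {bound:.0f} Mb/s per stream, "
          f"{needed} stream(s) needed for {LINK_MBPS:.0f} Mb/s")
```

Under these assumptions a small window needs several parallel streams to fill the path, while a window well above the BDP lets a single stream do so, which is the trade-off between the number of streams and TCP windows that these plots explore.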

Prague-CNAF with 1, 2, 4 and 6 streams and a very large maximum buffer size (8 MB ≫ BDP)
Plot panels: 1 stream, 2 streams, 4 streams, 6 streams
Plot annotation: bottleneck at I/O level

Conclusions
- First “real” multi-site/multi-server stress test of the Italian GARR network
- Current bandwidths turned out to be strongly inadequate, especially if we consider all ALICE sites “as a whole” and the number of servers already available
- Useful information on the current farm architecture (limits of NFS with many parallel threads and big files)
- Big “perturbation” and interest inside both INFN NetGroup and GARR, with prompt and excellent feedback and support
- Strong and “incredibly” fast bandwidth upgrades at many sites, carried out by the GARR NOC
- Mapping the testbed onto a multi-tier topology does not seem to pose major problems for Tier-3s