Roberto Barbera Prague, 12.12.2002 Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Update on the network.

Slides:



Advertisements
Similar presentations
1 CHEP 2000, Roberto Barbera Grid activities at the ALICE Experiment TNC2002, On behalf of the ALICE GRID Working Group: G. Andronico.
Advertisements

INFN & Globus activities Massimo Sgaravatto INFN Padova.
CHEP 2000, Roberto Barbera Roberto Barbera (*) GENIUS: a Web Portal for the GRID Meeting Grid.it, Bologna, (*) work in collaboration.
Globus FTP Evaluation test Catania – 10/04/2001Antonio Forte – INFN Torino.
Network Resource Broker for IPTV in Cloud Computing Lei Liang, Dan He University of Surrey, UK OGF 27, G2C Workshop 15 Oct 2009 Banff,
Istituto Nazionale di Fisica Nucleare Italy LAL - Orsay April Site Report – R.Gomezel Site Report Roberto Gomezel INFN - Trieste.
Current Testbed : 100 GE 2 sites (NERSC, ANL) with 3 nodes each. Each node with 4 x 10 GE NICs Measure various overheads from protocols and file sizes.
Evaluation of the Globus Toolkit: Status Roberto Cucchi – INFN Cnaf Antonia Ghiselli – INFN Cnaf Giuseppe Lo Biondo – INFN Milano Francesco Prelz – INFN.
1 CHEP 2000, Roberto Barbera Tests of data management services in EDG 1.2 ALICE Off-line Week,
1 CHEP 2000, Roberto Barbera Roberto Barbera (*) Grid monitoring with NAGIOS WP3-INFN Meeting, Naples, (*) Work in collaboration with.
GridPP meeting Feb 03 R. Hughes-Jones Manchester WP7 Networking Richard Hughes-Jones.
ISCSI Performance in Integrated LAN/SAN Environment Li Yin U.C. Berkeley.
Modelli di differenziazione delle prestazioni per il supporto del traffico LHC1 Modelli di differenziazione delle prestazioni per il supporto del traffico.
INFN Testbed status report L. Gaido WP6 meeting CERN - October 30th, 2002.
11 September 2007Milos Lokajicek Institute of Physics AS CR Prague Status of the GRID in the Czech Republic NEC’2007.
Outline Network related issues and thinking for FAX Cost among sites, who has problems Analytics of FAX meta data, what are the problems  The main object.
KEK Network Qi Fazhi KEK SW L2/L3 Switch for outside connections Central L2/L3 Switch A Netscreen Firewall Super Sinet Router 10GbE 2 x GbE IDS.
EGEE is a project funded by the European Union under contract IST ALICE network stress tests Roberto Barbera NA4 Generic Applications Coordinator.
Global Science experiment Data hub Center Oct. 13, 2014 Seo-Young Noh Status Report on Tier 1 in Korea.
Tiziana Ferrari Overview of INFN-GRID WP5: Network 1
1 Robust Transport Protocol for Dynamic High-Speed Networks: enhancing XCP approach Dino M. Lopez Pacheco INRIA RESO/LIP, ENS of Lyon, France Congduc Pham.
ALICE data access WLCG data WG revival 4 October 2013.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.
DATAGRID ConferenceTestbed0 - resources in Italy Luciano Gaido 1 DATAGRID WP6 Testbed0 resources in Italy Amsterdam March,
+ discussion in Software WG: Monte Carlo production on the Grid + discussion in TDAQ WG: Dedicated server for online services + experts meeting (Thusday.
An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.
Jean-Yves Nief CC-IN2P3, Lyon HEPiX-HEPNT, Fermilab October 22nd – 25th, 2002.
LCG Service Challenge Phase 4: Piano di attività e impatto sulla infrastruttura di rete 1 Service Challenge Phase 4: Piano di attività e impatto sulla.
Network Tests at CHEP K. Kwon, D. Han, K. Cho, J.S. Suh, D. Son Center for High Energy Physics, KNU, Korea H. Park Supercomputing Center, KISTI, Korea.
INFN-GRID Testbed Monitoring System Roberto Barbera Paolo Lo Re Giuseppe Sava Gennaro Tortone.
Data transfer over the wide area network with a large round trip time H. Matsunaga, T. Isobe, T. Mashimo, H. Sakamoto, I. Ueda International Center for.
Sejong STATUS Chang Yeong CHOI CERN, ALICE LHC Computing Grid Tier-2 Workshop in Asia, 1 th December 2006.
Status Report of WLCG Tier-1 candidate for KISTI-GSDC Sang-Un Ahn, for the GSDC Tier-1 Team GSDC Tier-1 Team 12 th CERN-Korea.
DataGrid Workshop Oxford, July 2-5 INFN Testbed status report Luciano Gaido 1 DataGrid Workshop INFN Testbed status report L. Gaido Oxford July,
DYNES Storage Infrastructure Artur Barczyk California Institute of Technology LHCOPN Meeting Geneva, October 07, 2010.
Latest news on JXTA and JuxMem-C/DIET Mathieu Jan GDS meeting, Rennes, 11 march 2005.
Testing the UK Tier 2 Data Storage and Transfer Infrastructure C. Brew (RAL) Y. Coppens (Birmingham), G. Cowen (Edinburgh) & J. Ferguson (Glasgow) 9-13.
Roberto Barbera Prague, ALICE Multi-site Data Transfer Tests on a Wide Area Network Giuseppe Lo Re Roberto Barbera Work in collaboration with:
Xrootd Monitoring and Control Harsh Arora CERN. Setting Up Service  Monalisa Service  Monalisa Repository  Test Xrootd Server  ApMon Module.
Status of the Bologna Computing Farm and GRID related activities Vincenzo M. Vagnoni Thursday, 7 March 2002.
A Silvio Pardi on behalf of the SuperB Collaboration a INFN-Napoli -Campus di M.S.Angelo Via Cinthia– 80126, Napoli, Italy CHEP12 – New York – USA – May.
Condor on WAN D. Bortolotti - INFN Bologna T. Ferrari - INFN Cnaf A.Ghiselli - INFN Cnaf P.Mazzanti - INFN Bologna F. Prelz - INFN Milano F.Semeria - INFN.
ALICE DATA ACCESS MODEL Outline 05/13/2014 ALICE Data Access Model 2  ALICE data access model  Infrastructure and SE monitoring.
Andrea Manzi CERN On behalf of the DPM team HEPiX Fall 2014 Workshop DPM performance tuning hints for HTTP/WebDAV and Xrootd 1 16/10/2014.
Italy Report HTASC Meeting DESY, October 8-9, 2001 Francesco Forti, INFN-Pisa.
TCP transfers over high latency/bandwidth networks & Grid DT Measurements session PFLDnet February 3- 4, 2003 CERN, Geneva, Switzerland Sylvain Ravot
1.3 ON ENHANCING GridFTP AND GPFS PERFORMANCES A. Cavalli, C. Ciocca, L. dell’Agnello, T. Ferrari, D. Gregori, B. Martelli, A. Prosperini, P. Ricci, E.
Ideas and test setup for data transfer from CERN to Italian Ground Segment. M. Boschini, A. Favalli, M. Levtchenko CERN – March, 31, 2003.
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
Data transfers and storage Kilian Schwarz GSI. GSI – current storage capacities vobox LCG RB/CE GSI batchfarm: ALICE cluster (67 nodes/480 cores for batch.
Markus Frank (CERN) & Albert Puig (UB).  An opportunity (Motivation)  Adopted approach  Implementation specifics  Status  Conclusions 2.
IT-INFN-CNAF Status Update LHC-OPN Meeting INFN CNAF, December 2009 Stefano Zani 10/11/2009Stefano Zani INFN CNAF (TIER1 Staff)1.
Big Data transfer over computer networks Initial Sergey Khoruzhnikov Vladimir Grudinin Oleg Sadov Andrey Shevel Anatoly Oreshkin Elena Korytko Alexander.
1 June 11/Ian Fisk CMS Model and the Network Ian Fisk.
PRIN STOA-LHC: STATUS BARI BOLOGNA-18 GIUGNO 2014 Giorgia MINIELLO G. MAGGI, G. DONVITO, D. Elia INFN Sezione di Bari e Dipartimento Interateneo.
INFN Site Report R.Gomezel October 9-13, 2006 Jefferson Lab, Newport News.
Dynamic Extension of the INFN Tier-1 on external resources
Workload Management Workpackage
LCG Service Challenge: Planning and Milestones
INFN CNAF TIER1 Network Service
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
Bernd Panzer-Steindel, CERN/IT
Update on Plan for KISTI-GSDC
Clouds of JINR, University of Sofia and INRNE Join Together
The INFN TIER1 Regional Centre
Storage elements discovery
Wide Area Networking at SLAC, Feb ‘03
When to use and when not to use BBR:
Presentation transcript:

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Update on the network stress test performed in ALICE Roberto Barbera Giuseppe Lo Re Work in collaboration with: P. Cerello, D. Di Bari, G. Donvito (CMS), E. Fragiacomo, A. Fritz, M. Luvisetto, M. Masera, F. Minafra, D. Mura, S. Piano, M. Sitta, J. Švec, R. Turrisi Contributions from GARR and INFN NetGroup: C. Allocchio, M. Campanella, L. Gaido, S. Lusso, M. Michelotto, S. Spanu, S. Zani, D. De Girolamo ALICE Off-line Week,

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Outline Objectives Summary of the last tests Results of new tests Conclusions and future developments ALICE Off-line Week,

Prague, Roberto Barbera Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Objectives See if the actual bandwidths can cope with the ALICE needs Spot possible bottle-necks out in the point-to-point transfers (I/O  LAN  WAN  LAN  I/O) Check, with “real” numbers of “real” use cases, if bandwidth attributions foreseen in the next future are adequate ALICE Off-line Week, I/O server Front-end router Front-end router WANLAN disk I/O (W/R block size) TCP windows # streams BDP = BW*RTT

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Preparation and Benchmark Standard configuration of both the TCP stack and disk I/O parameters in Linux SSH keys exchanged among all machines to “secure” file transfers without typing passwords Automatic procedure installed on all machines: waits a random time uniformly choosen between 0 and customizable maximum (1 min and 5 mins tried so far) chooses at random on of the other N-1 servers (with a weight proportional to the maximum bandwith of the site that server belongs to) chooses at random one of three files with different sizes (1.6 GB, 0.8 GB, and 0.3 GB) sends back and forth the file using bbFTP with a customizable number of parallel streams (16 and 8 tried so far) checks if any bits got lost and fills a detailed log file ALICE Off-line Week,

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Testbed layout and “numbers” BA: 3 servers (2 ALICE, 1 CMS) BO: 6 servers CA: 2 servers CNAF: 2 servers CT: 2 servers PD: 6 servers TO: 2 servers TS: 1 server Prague: 1 server Houston: 1 server CNAF Padova Houston Prague ALICE Off-line Week,

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Disk access measurements (non reserved access, local disk) MachineWrite (MBytes/s)Read (MBytes/s) boalice8.bo.infn.it55 server3.ca.infn.it4561 aliserv10.ct.infn.it2734 alifarm02.to.infn.it4059 alifarm.ts.infn.it2836 MachineWrite (MBytes/s)Read (MBytes/s) boalice8.bo.infn.it53 server3.ca.infn.it4332 aliserv10.ct.infn.it5725 pcalice19.pd.infn.it55 alifarm02.to.infn.it3153 alifarm.ts.infn.it2734 Bonnie IOzone ALICE Off-line Week,

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration GARR network status at the beginning Bari: 28 Mb/s (BGA: 16 Mb/s) Bologna: 32 Mb/s Cagliari: 8 Mb/s Catania: 34 Mb/s CNAF: 1024 Mb/s Padova: 155 Mb/s Torino: 155 Mb/s (BGA: 70 Mb/s) Trieste: 16 Mb/s ALICE Off-line Week,

Roberto Barbera Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Selected results (Bologna) saturated ! Official GARR NOC statistics ALICE Off-line Week,

Roberto Barbera Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Selected results (Cagliari) saturated ! Official GARR NOC statistics ALICE Off-line Week,

Roberto Barbera Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Selected results (Catania) heavy traffic ! Official GARR NOC statistics ALICE Off-line Week,

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Network bandwidths now Bari: 28 Mb/s (BGA: 16 Mb/s) Bologna: 100 Mb/s (BGA: 32 Mb/s) Cagliari: 32 Mb/s Catania: 34 Mb/s (direct connection to GARR-G in 6 months, up to 2.5 Gb/s) CNAF: 1024 Mb/s Padova: 155 Mb/s Torino: 155 Mb/s (BGA: 70 Mb/s) Trieste: 24 Mb/s ALICE Off-line Week,

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Bandwidth measurements MachineBW1(Mb/s)BW2(Mb/s)BW4 (Mb/s)BW8(Mb/s)BW16(Mb/s)BW32(Mb/s) boalice8.bo.infn.it server3.ca.infn.it aliserv10.ct.infn.it pcalice19.pd.infn.it alifarm02.to.infn.it alifarm.ts.infn.it Iperf Netperf-2.1 MachineBW1(Mb/s)BW2(Mb/s)BW4 (Mb/s)BW8(Mb/s)BW16(Mb/s)BW32(Mb/s) boalice8.bo.infn.it server3.ca.infn.it aliserv10.ct.infn.it pcalice19.pd.infn.it alifarm02.to.infn.it alifarm.ts.infn.it ALICE Off-line Week,

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Multi-tier use-case (HBT prod., 5000 evts., 9 TB) CNAF 60% Tier-1 CT 20% TO 20% Tier TB 1 MB in 50 MB out Tier-3/4 BABOCAPDTS ALICE Off-line Week,

Roberto Barbera Results (Official GARR NOC stats.) Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration New developments (since last offline week) TCP tuning to improve throughput Since ALICE is a really “geographically” distributed Collaboration, the participation of foreign sites (especially with large RTT’s) would be very welcome. Up to now Houston and Prague have been involved. ALICE Off-line Week, SiteRTT (msec) from CNAFBW (Mb/s) from CNAFBDP (MB) Houston Prague Catania

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration bbFTP vs # streams and TCP windows Catania-CNAF Max bw measured (iperf) = 25 Mb/s 1 streams 2 streams 4 streams ALICE Off-line Week, Saturated also for small files saturated

Houston-CNAF, Max bw measured (iperf) = 50 Mb/s Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration bbFTP vs # streams and TCP windows 1 streams 2 streams 4 streams 6 streams ALICE Off-line Week, saturated

Prague-CNAF, Max bw measured (iperf) = 250 Mb/s Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration bbFTP vs no streams and TCP windows 1 streams 2 streams With an very high maximum buffer size (130KB->8 MB) 1 streams 2 streams ALICE Off-line Week,

Prague-CNAF with 1, 2, 4 and 6 streams with a very large maximum buffer size (8MB » BDP). [Ref: Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration 2 streams 1 streams 4 streams6 streams ALICE Off-line Week, Bottleneck at I/O level

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Conclusions (1/2) First “real” multi-site/multi-server stress-test of the Italian GARR network Actual bandwidths resulted strongly inadequate if we especially consider all ALICE sites “as a whole” and the present number of servers already available by now Useful information on the actual farm architecture (limits of NFS in case of many parallel threads and big files) Big “perturbation” and interest inside both INFN NetGroup and GARR with prompt and excellent feed-back and support Strong and “incredibly” fast bandwith upgrades in many sites made by the GARR NOC Mapping of the testbed on a multi-tier topology does not seem to pose major problems for Tier-3’s ALICE Off-line Week,

Roberto Barbera Prague, Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Conclusions (2/2) Network stress test in now an international activity and involve also sites with high RTT with respect to CNAF Throughput measurements have been obtained for several file sizes as a function of TCP windows and with variable number of streams. Up to now, increasing TCP maximum buffers size does not seem to give evident advantages with respect to multistreaming, but tests are not finished The research of the best set of number of streams and TCP windows for a given file size and network path could help us to optimize our data transfers. In this way we are able to determine our effective capability to use network and then what could be our reasonable requests. It would be really useful the partecipation of USA sites with machines Gbit-connected in order to study TCP tunning in details. A good news is that OSC recently joined the testbed. ALICE Off-line Week,

Roberto Barbera Virtual Organizations’ (read HEP Experiments here) planned and chaotic activities have big impacts on networks and strongly rely on their robustness and reliability. Network not only means the high bandwidth of international links but also, and more importantly, reliable end-to-(many)ends connections (“last mile” problems should be addressed and hopefully solved). The concept of Grid Network Element (emerging in the new grid information schemas) should be pursued and implemented as soon as possible. Scientific “collaboratories” are very dynamical as a function of both space and time so best effort and over-provisioning are not always good solutions. Quality of Service and bandwidth-on-demand will be key issues of future networks. Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration More general conclusions

Roberto Barbera Repeating throughput measurements with different block sizes (using both bbFTP and hdparm) in order to maximize disk I/O Continuing TCP tunning with the collaboration of some US servers with Gbit connection Preparation of an ALICE note and a paper for TNC2004 (due by October, 17). Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy ALICE Collaboration Next