Ideas and test setup for data transfer from CERN to Italian Ground Segment. M. Boschini, A. Favalli, M. Levtchenko CERN – March, 31, 2003.

Presentation transcript:

Ideas and test setup for data transfer from CERN to Italian Ground Segment. M. Boschini, A. Favalli, M. Levtchenko CERN – March, 31, 2003

Outline
- Goals
- Ideas
- Setup
- Tests
- Conclusions
M. Boschini – Ideas and Test Setup for Data Transfer from CERN to Italian Ground Segment CERN – March, 31, 2003

Goals of the system
Within the overall AMS-02 data handling scheme, an Italian Ground Segment (IGS) will be set up to maintain a MASTER COPY of all AMS-02 data (raw and reconstructed).
The system will have to:
- Efficiently transfer data from CERN to the IGS
- Book-keep the data transfer

Ideas...
- Decouple the data transfer protocol from the book-keeping system
- Use standard techniques for both
- Test the solutions separately
- Test the overall solution
...results...

Test system...
- RH box at CERN on a 10 Mb/s line
- RH box at MI-INFN (further tests on the local MI-INFN 100 Mb/s line)
- AMS-01 raw 1 min files and AMS-01 n-tuple.gz

Ideas...data transfer protocol
- Test OpenSCP and bbftp
- OpenSCP is too heavy (test and results presented in 2002)
- bbftp: already tested in Dec with the AMS-SW group (Eline, Klimentov)
...thus, test bbftp in terms of network usage efficiency and reliability

Ideas...bbftp parameters
bbftp has 2 main parameters:
- Number of parallel streams
- TCP window size
The goal is to find the best ratio WIN_SIZE/NUM_STREAMS, bearing in mind that keeping NUM_STREAMS low is preferable (RFC 1323)

Ideas...TCP parameters
Theory: for an RFC 1323 compliant network we can define
  CAPACITY = BANDWIDTH x RTT
  OPT_WIN_SIZE = CAPACITY (the window size that maximizes throughput)
Since NUM_STREAMS ~ CAPACITY / WIN_SIZE, setting WIN_SIZE = OPT_WIN_SIZE gives NUM_STREAMS = 1 with maximum throughput.
BUT...in reality: LFNs (long fat networks) have a high BANDWIDTH x RTT, while “standard” TCP implementations limit the window size to 64K.
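The relation between CAPACITY, WIN_SIZE and NUM_STREAMS above can be checked with a few lines of Python. This is only a sketch: the function names are illustrative, and the round-trip times (30 ms and 120 ms) are assumed values, since the slides do not quote an RTT. The 10 Mb/s and 34 Mb/s bandwidths are the ones named in the test slides.

```python
import math

WIN_LIMIT = 64 * 1024  # 64K window limit of a "standard" TCP implementation


def bdp_bytes(bandwidth_bps: int, rtt_ms: int) -> float:
    """CAPACITY = BANDWIDTH x RTT, converted from bits to bytes."""
    return bandwidth_bps * rtt_ms / 8000


def streams_needed(capacity_bytes: float, win_size_bytes: int) -> int:
    """NUM_STREAMS ~ CAPACITY / WIN_SIZE, rounded up, never below 1."""
    return max(1, math.ceil(capacity_bytes / win_size_bytes))


# 10 Mb/s line with an assumed (hypothetical) 30 ms RTT
print(bdp_bytes(10_000_000, 30))                              # 37500.0
print(streams_needed(bdp_bytes(10_000_000, 30), WIN_LIMIT))   # 1

# 34 Mb/s line with an assumed (hypothetical) 120 ms RTT
print(bdp_bytes(34_000_000, 120))                             # 510000.0
print(streams_needed(bdp_bytes(34_000_000, 120), WIN_LIMIT))  # 8
```

Under these assumed RTTs, the 10 Mb/s link fits in a single 64K window, while the higher-capacity link needs either a larger window or several parallel streams.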

Tests...bbftp parameters 10 Mb/s

Tests...iperf parameters 34 Mb/s (FNAL-MI)

Tests...bbftp conclusions
The AMS-02 scenario will involve an LFN, thus:
1) re-study TCP optimization in a more realistic scenario
2) in general, we can assume that we will need a large WIN_SIZE and a high NUM_STREAMS

Ideas...book-keeping system
We need to keep track of:
- Which files have to be transferred
- How each file has been transferred
- Where each file has been stored

Ideas...book-keeping system
We decided to adopt a database, one at CERN and one in Milano:
- The CERN DB will contain file info and data transfer (DT) info for all files (even those unsuccessfully transferred)
- The Milano DB will contain file info and DT info for all files that arrived at MI
- Automatic consistency has to be set up
- The DBs have to be web-browsable

Tests...book-keeping system
We decided to use MySQL.
- Stress-tested MySQL: concurrent writes/sec have been measured
- rollback/commit available
- Uptime: 18 months

Ideas...The complete system
The complete system has to:
- Find new files at CERN in a “spool” directory
- Insert them in the CERN-DB
- Move the files to Milano
- Update the CERN-DB with the transfer status
- Insert them in the MI-DB
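The CERN-side steps listed above (find, insert, move, update) can be sketched as follows. This is not the authors' Perl5 code: SQLite stands in for MySQL, the table layout and function names are hypothetical, and the transfer itself is an injected callable rather than a real bbftp invocation. The final MI-DB insert is left out because it happens on the Milano side.

```python
import os
import sqlite3
import tempfile


def init_db(conn):
    """Create a minimal book-keeping table (hypothetical layout)."""
    conn.execute("CREATE TABLE IF NOT EXISTS files (name TEXT PRIMARY KEY, status TEXT)")
    conn.commit()


def register_new_files(conn, spool_dir):
    """Insert every spool file that the DB does not know yet."""
    known = {row[0] for row in conn.execute("SELECT name FROM files")}
    new = [n for n in sorted(os.listdir(spool_dir))
           if n not in known and os.path.isfile(os.path.join(spool_dir, n))]
    conn.executemany("INSERT INTO files (name, status) VALUES (?, 'NEW')",
                     [(n,) for n in new])
    conn.commit()
    return new


def transfer_pending(conn, spool_dir, send):
    """Try to send each file not yet marked OK and record the outcome."""
    pending = [row[0] for row in conn.execute(
        "SELECT name FROM files WHERE status != 'OK' ORDER BY name")]
    for name in pending:
        ok = send(os.path.join(spool_dir, name))
        conn.execute("UPDATE files SET status = ? WHERE name = ?",
                     ("OK" if ok else "FAILED", name))
    conn.commit()


def demo():
    conn = sqlite3.connect(":memory:")
    init_db(conn)
    with tempfile.TemporaryDirectory() as spool:
        for name in ("run001.raw", "run002.raw"):
            with open(os.path.join(spool, name), "wb") as f:
                f.write(b"data")
        register_new_files(conn, spool)
        # Stand-in sender: pretend the second file hits a network outage.
        transfer_pending(conn, spool, lambda p: not p.endswith("run002.raw"))
    return dict(conn.execute("SELECT name, status FROM files ORDER BY name"))


print(demo())
```

Because failed files keep a non-OK status, a later call to `transfer_pending` picks them up again, which is one simple way to obtain retransmission after an outage.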

Ideas...The complete system
Hypothesis:
- “Production” rate: (4 + 8) Mb/s
- “Permitted downtime”: 3 days
- Spool dir big enough (~400 GB for 3 days)
- Peak bandwidth = 36 Mb/s

Test...The complete system
Test “reality”:
- “Permitted downtime”: 0.3 days
- Spool dir big enough (~40 GB for 0.3 days)
- Peak bandwidth = 10 Mb/s
...but we already showed that bbftp + adaptive win-size can maximize bandwidth usage...
...our test is ~ hypothesis x scale-factor.
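The spool sizes quoted for the hypothesis and for the scaled-down test follow from rate x downtime. The sketch below reproduces them, taking 1 GB as 10^9 bytes. The `catch_up_days` helper reflects one possible reading of the peak-bandwidth figure (draining the backlog while production continues); that reading is an assumption and is not stated on the slides.

```python
def spool_size_gb(rate_mbps: float, downtime_days: float) -> float:
    """Data accumulated in the spool during a downtime, in GB (10^9 bytes)."""
    seconds = downtime_days * 86400
    return rate_mbps * 1e6 / 8 * seconds / 1e9


def catch_up_days(rate_mbps: float, peak_mbps: float, downtime_days: float) -> float:
    """Days needed to drain the backlog at peak bandwidth while production continues."""
    return rate_mbps * downtime_days / (peak_mbps - rate_mbps)


# Hypothesis: (4 + 8) Mb/s production rate, 3 days of permitted downtime
print(round(spool_size_gb(12, 3), 1))    # 388.8, i.e. ~400 GB
print(catch_up_days(12, 36, 3))          # 1.5

# Scaled-down test: same rate, 0.3 days of permitted downtime
print(round(spool_size_gb(12, 0.3), 2))  # 38.88, i.e. ~40 GB
```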

Test...The complete system
The system at CERN is made of:
- “fake_production” (8 Mb/s)
- Main --> looks for new files in the spool dir
- bbftp_forker --> forked by Main when files are found. Sends files to MI, inserts/updates entries in the CERN-DB, and “tells” MI to update its DB (SSL client/server)
Implementation: Perl5 (SSL + DBI) + C patch to bbftp.

Test...The complete system
The system at MILANO is made of:
- A TCP/IP (SSL) server that waits for “requests for update” from CERN and updates the DB
Implementation: Perl5 scripts (SSL + DBI).

Test...The complete system
- SSL has been adopted for the connection to MI in order to encrypt the connection to the DB (MySQL uses standard sockets).
- We implemented the bbftp private_auth feature.

DB Consistency
DB Consistency is a mechanism which ensures that the DB contents at CERN and MI are consistent with the data that has actually been transferred. It is performed in 2 ways:
1) An hourly check between the DB entries at CERN for files that appear to have been transferred OK and the DB entries at the IGS. This check is based on SQL selections and comparisons. If a difference is found, an alert is sent and the files are re-transferred.
2) A redundancy approach, in which a third host, also located in Milano, keeps a “copy” of both the CERN and IGS DBs. For now the copy is made by DB dump/restore; on-line update is under development.
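The first check (SQL selections at both sites followed by a comparison) might look like the sketch below. It is illustrative only: SQLite replaces MySQL, the `files` table with `name`, `md5` and `status` columns is a hypothetical layout, and comparing on the MD5 value is an assumed choice of comparison key.

```python
import sqlite3


def make_db(rows):
    """Build an in-memory DB with a hypothetical book-keeping table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE files (name TEXT PRIMARY KEY, md5 TEXT, status TEXT)")
    conn.executemany("INSERT INTO files VALUES (?, ?, ?)", rows)
    conn.commit()
    return conn


def files_to_resend(cern, mi):
    """Files marked OK at CERN whose entry at MI is missing or differs."""
    sent_ok = dict(cern.execute(
        "SELECT name, md5 FROM files WHERE status = 'OK'").fetchall())
    arrived = dict(mi.execute("SELECT name, md5 FROM files").fetchall())
    return sorted(name for name, md5 in sent_ok.items() if arrived.get(name) != md5)


cern = make_db([
    ("a.raw", "111", "OK"),
    ("b.raw", "222", "OK"),
    ("c.raw", "333", "FAILED"),   # not yet transferred, so outside this check
    ("d.raw", "444", "OK"),
])
mi = make_db([
    ("a.raw", "111", "OK"),
    ("b.raw", "999", "FAIL"),     # digest differs from the CERN entry
])
print(files_to_resend(cern, mi))  # ['b.raw', 'd.raw']
```

In a periodic job, the returned list would drive the alert and the re-transfer.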

“data integrity”
DATA INTEGRITY is evaluated by means of an MD5 digest:
- md5sum calculated at CERN
- Sent to MI with the “request for update”
- md5sum calculated at MI
- if (md5sum(MI) != md5sum(CERN)), INTEGRITY = ‘FAIL’ is set in the DB and the DB consistency mechanism re-sends the file
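The digest comparison can be sketched with Python's standard `hashlib`. The function names, the chunk size and the ‘OK’ label for a matching digest are illustrative assumptions (the slide only names ‘FAIL’); the original system used md5sum from Perl5 scripts.

```python
import hashlib
import os
import tempfile


def md5sum(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def integrity(md5_cern, md5_mi):
    """Value for the INTEGRITY field: 'FAIL' when the two digests differ."""
    return "OK" if md5_cern == md5_mi else "FAIL"


def demo():
    """Digest of a small temporary file containing b'hello'."""
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "run001.raw")
        with open(path, "wb") as f:
            f.write(b"hello")
        return md5sum(path)


print(demo())                      # 5d41402abc4b2a76b9719d911017c592
print(integrity(demo(), "0" * 32)) # FAIL
```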

“stress simulation”
After 6 months of “normal” running, we started a stress simulation, artificially and randomly stopping services. We stopped:
- Network (total downtime in 3 months: 3 weeks, scattered over 90 different downtimes ranging from 5 min to 8 hr)
- MySQL (total downtime in 3 months: 1 day, scattered over 48 different downtimes ranging from 5 min to 1 hr)
- bbftpd (total downtime in 3 months: 1 day, scattered over 48 different downtimes ranging from 5 min to 1 hr)
- Components of DT (total downtime in 3 months: 3 weeks, scattered over 90 different downtimes ranging from 5 min to 8 hr)

Total running …
- 9 months of uptime
- 380,000 files transferred
- 7.6 TB transferred
- 5% of files needed to be retransmitted (network)
- 0.03% of files needed to be retransmitted (DB consistency checks)
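For scale, the totals above translate into an average file size and approximate retransmission counts. The short calculation below assumes 1 TB = 10^12 bytes and that both percentages refer to the 380,000 transferred files; neither assumption is stated on the slide.

```python
files = 380_000
total_bytes = 7_600_000_000_000            # 7.6 TB, assuming 1 TB = 10^12 bytes

avg_file_mb = total_bytes / files / 1e6    # average file size in MB
resent_network = files * 5 // 100          # 5% of files
resent_consistency = files * 3 // 10_000   # 0.03% of files

print(avg_file_mb, resent_network, resent_consistency)  # 20.0 19000 114
```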

GUIs …
The system also has “graphical” tools:
- DB browser at CERN and MI (Perl5, PHP)
- DT system handling GUI at CERN and MI (GTK, Perl5)

To do
- Implement tests on a 100 Mb/s CERN LAN in order to study and optimize TCP window size and NUM_STREAMS in a scenario more similar to the AMS-02 one
- Implement on-line redundancy with the third host
- Integrate with the High Speed Network hosts at CERN (wacdr001d.cern.ch), as suggested by and discussed with IT Communication Services

Conclusions…
- More than files have been transferred correctly; ~5% needed to be retransmitted because of network outages.
- TCP parameters have been studied and tweaked in order to optimize the bandwidth usage.
- The whole data set is organized in a DB, which acts as a data transfer book-keeping system.
- The DB proved to be robust and fast enough to suit our needs.
- DB Consistency mechanisms recovered a 0.03% data loss.
- The system can be handled through a GUI and monitored via web.
- Other work to do…(higher bandwidth link, on-line consistency)