Rosa Filgueira – University of Edinburgh Iraklis Klamapnos- University of Edinburgh Yusuke Tanimura- AIST, Tsukuba Malcolm Atkinson- University of Edinburgh.

Slides:



Advertisements
Similar presentations
Transfer Content to a Website What is FTP? File Transfer Protocol FTP is a protocol – a set of rules Designed to allow files to be transferred across.
Advertisements

SSH Operation and Techniques - © William Stearns 1 SSH Operation and Techniques The Swiss Army Knife of encryption tools…
Dr. Kalpakis CMSC 621, Advanced Operating Systems. Fall 2003 URL: Distributed System Architectures.
Introduction to Databases
Content Overview Update Process Additional Tools.
GridFTP: File Transfer Protocol in Grid Computing Networks
File Transfer Methods : A Security Perspective. What is FTP FTP refers to the File Transfer Protocol, one of the protocols within the TCP/IP protocol.
COS 420 DAY 25. Agenda Assignment 5 posted Chap Due May 4 Final exam will be take home and handed out May 4 and Due May 10 Latest version of Protocol.
Introduction to the Internet How did the Internet start? Why was the Internet developed? How does Internet handle the traffic? Why WWW changed the Internet.
Electrical Engineering Department Software Systems Lab TECHNION - ISRAEL INSTITUTE OF TECHNOLOGY Meeting recorder Application based on Software Agents.
Webinar: Questions and Answers? July 17, Introduction John Tkaczewski, President and Co-Founder Check out our blog:
Hands-On Microsoft Windows Server 2003 Administration Chapter 6 Managing Printers, Publishing, Auditing, and Desk Resources.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 11 Managing and Monitoring a Windows Server 2008 Network.
Barracuda Networks Confidential1 Barracuda Backup Service Integrated Local & Offsite Data Backup.
Aaron Steele. Project Motivation Alice Bob 50+GB Free.
TNT Microsoft Exchange Server 2003 Disaster Recovery Michael J. Murphy TechNet Presenter
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Application Layer Functionality and Protocols Network Fundamentals – Chapter 3.
File Systems and N/W attached storage (NAS) | VTU NOTES | QUESTION PAPERS | NEWS | VTU RESULTS | FORUM | BOOKSPAR ANDROID APP.
Oracle Recovery Manager (RMAN) 10g : Reloaded
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Data Communications and Networks
IGEL UMS Product Marketing Manager October 2011 Florian Spatz Universal Management Suite.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
GridFS Targeting Data Sharing in Grid Environments Marcelo Nery dos Santos / Renato Cerqueira PUC-Rio, Brazil Presented by: Francisco Silva.
Module 7: Implementing Sites to Manage Active Directory Replication.
1. There are different assistant software tools and methods that help in managing the network in different things such as: 1. Special management programs.
Lec4: TCP/IP, Network management model, Agent architectures
Big Data Bijan Barikbin Denisa Teme Matthew Joseph.

Globus GridFTP and RFT: An Overview and New Features Raj Kettimuthu Argonne National Laboratory and The University of Chicago.
UDT as an Alternative Transport Protocol for GridFTP Raj Kettimuthu Argonne National Laboratory The University of Chicago.
Internet Business Foundations © 2004 ProsoftTraining All rights reserved.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Chapter 15 – Part 2 Networks The Internal Operating System The Architecture of Computer Hardware and Systems Software: An Information Technology Approach.
Information Builders : SmartMart Seon-Min Rhee Visualization & Simulation Lab Dept. of Computer Science & Engineering Ewha Womans University.
Improved Access to RDA from the MSS OSD Executive Meeting April 28, 2009.
Computer Networking From LANs to WANs: Hardware, Software, and Security Chapter 13 FTP and Telnet.
Communicating Security Assertions over the GridFTP Control Channel Rajkumar Kettimuthu 1,2, Liu Wantao 3,4, Frank Siebenlist 1,2 and Ian Foster 1,2,3 1.
Large Scale Parallel File System and Cluster Management ICT, CAS.
Open Source Evaluation - FileZilla Michael Nye ITEC 400 Assignment 14-1 Professor D’Andrea Franklin University April 10, 2008.
1 NETE4631 Working with Cloud-based Storage Lecture Notes #11.
PRAS-DT: Portable, Reliable, and Automatic Streaming Data Transfer Christine Harvey Dr. Rosa Filgueira.
GridFTP GUI: An Easy and Efficient Way to Transfer Data in Grid
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
Web Technologies Lecture 8 Server side web. Client Side vs. Server Side Web Client-side code executes on the end-user's computer, usually within a web.
| nectar.org.au NECTAR TRAINING Module 9 Backing up & Packing up.
ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory.
ANYFILEBACKUP A powerful file backup and synchronization solution for significant number of files located in various folders, FTPs, clouds, and servers.
File Transfer And Access (FTP, TFTP, NFS). Remote File Access, Transfer and Storage Networks For different goals variety of approaches to remote file.
New Development Efforts in GridFTP Raj Kettimuthu Math & Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, U.S.A.
A Sneak Peak of What’s New in Globus GridFTP John Bresnahan Michael Link Raj Kettimuthu (Presenting) Argonne National Laboratory and The University of.
Big Data transfer over computer networks Initial Sergey Khoruzhnikov Vladimir Grudinin Oleg Sadov Andrey Shevel Anatoly Oreshkin Elena Korytko Alexander.
SSH. 2 SSH – Secure Shell SSH is a cryptographic protocol – Implemented in software originally for remote login applications – One most popular software.
Billy Ripple Dr. Oblitey COSC  File-Sharing ◦ What is it? ◦ Uses ◦ History ◦ Types ◦ Dangers  Case Study  Summary  Works Cited.
2016 Global Seminar 按一下以編輯母片標題樣式 Virtualization apps simplify your IoT development Alfred Li.
Everything you've ever wanted to know about using Control-M to integrate any application workload September 9, 2016 David Fernandez Senior Presales Consultant.
Amazon Storage- S3 and Glacier
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
Cloud based Open Source Backup/Restore Tool
File Transfer Olivia Irving and Cameron Foss
File Transfer Issues with TCP Acceleration with FileCatalyst
An Introduction to Cloud Computing
Internet Protocols IP: Internet Protocol
Downloading workshop files to your computer
Beyond FTP & hard drives: Accelerating LAN file transfers
Chapter 7 Network Applications
Sending data to EUROSTAT using STATEL and STADIUM web client
STATEL an easy way to transfer data
Presentation transcript:

Rosa Filgueira – University of Edinburgh Iraklis Klamapnos- University of Edinburgh Yusuke Tanimura- AIST, Tsukuba Malcolm Atkinson- University of Edinburgh

 Introduction ◦ Problem description ◦ Hypothesis ◦ Rock Physics laboratory experiments ◦ Objective ◦ Proposal  Related developments ◦ Data transfer protocols ◦ Data transport systems  FAST ◦ Selecting the best data transfer protocol ◦ Data transfer experiments ◦ Implementation and evaluation  Future work and Questions

 Large number of rock physics (RP) laboratories ◦ Runs many experiments (Experimentalists)  Large number of rock physicists ◦ Develops computational codes (Code builders)  Sharing experimental data among this community is still in its early days ◦ No facilities to transfer experimental data automatically in real time with their associated description (metadata)

 Several tools for providing reliable and high performance data transfer capabilities ◦ Dropbox or Globus Online  Not optimized for the RP requirements

 The RP community will benefit from tool ◦ Transfers data and metadata in near-real time ◦ Repository and DB accessible from a website  For experimentalists ◦ Collection and comparison of experiments from many labs  For code builders ◦ Find test data for running their models

 Laboratory rock property measurements ◦ Properties of the rock sample are studied under different conditions  High-pressure vessels to apply pore pressures and stresses to cylindrical rock sample  Until the sample has failed, different features (e.g stress, porosity, temperature, etc,....) are recorded at several time intervals  In each interval, data transferred to a local computer machine (channel. 1 channel per rock)

Pressure VesselUCL- RP LaboratoryRock Samples

Initial target: 30 months Deploy under the sea- Mediterranean 8 rock samples- different features Different interval of times and data sizes

 Each experiment can record data differently ◦ Events can be written in a new file or appended ◦ Files can be stored in the same directory or not ◦ Intervals for writing data can be shorts or long ◦ Number of rocks samples could be one or several ◦ Duration of an experiments can be short or long  Data intensive problem for transferring the data

 To transfer RP experimental data from one location to another ◦ Automated data transfer until the end-experiment  Transfer experimental data  Near real time and non-real time  Synchronization  Incremental (File) and Directory ◦ Possible interruptions and fails ◦ Record and transfer the metadata

 FAST: Flexible automated synchronization transfer ◦ Data and metadata in real time and non- real time ◦ Incremental (file) and directory sync ◦ Selection of the data-transfer protocol ◦ Compatible with all O.S ◦ Simple to set up and manage ◦ Monitors the transmission, detects errors and recovers from them. ◦ Data collected in a repository, metadata in DB, and web site for accessing them  Proposal is triggered by our work ◦ EFFORT project ◦ Using data provided by the Creep-2 project

 File transfer Protocol (FTP) ◦ Control and data are un-encrypted ◦ Easy to use, lack of security  FTP security extension (FTPS) ◦ Control encrypted (TLS or STLS), but data might not be  Secure Copy (SCP) ◦ SSH for transferring data and authentication (more secure than previous ones) ◦ File transfer only ◦ Ideal for quick transfer of single files  SSH File Transfer Protocol (SFTP) ◦ Based in SSH-2: best for secure access (packet confirmation) ◦ File transfer, creating and delete remote directories and files ◦ Directory synchronization,  Rsync ◦ Incremental file transfer (delta algorithm) ◦ File and directory synchronization ◦ Can provide encrypted transfer by using SSH ◦ On-the-fly compression option ◦ Idea for back-ups

 UDP-(UDT) ◦ UDP protocol for data-intensive applications ◦ UDT can transfer data a higher speed than TCP- based protocols  UDT Enabled Rsync (UDR) ◦ Uses Rsync for the transport mechanism (delta) ◦ Sends data over the UDT protocolIdeal for large data over long distance ◦ Ideal for large data over long distance

 GridFTP: ◦ HP secure, reliable data rate via high bandwidth ◦ many-to-many ◦ difficult to use  Globus Online ◦ Uses GridFTP protocol ◦ Automates the management of files:  monitoring performance, retrying files, recovering from failes ◦ Do not support file synchronization.  Dropbox: ◦ Centralize cloud storage, file and directory synchronization ◦ Rsync-delta protocol ◦ Data stored on the Amazon S3 (Third party) ◦ One-to-one file transfer  BTSync ◦ Decentralized cloud storage, P2P file synchronization (No Third party). ◦ Connecting the devices to communicate with UDP ◦ Many-to-many file transfers  WinSCP ◦ SFTP and FTP client for Windows

from Globus Online Support We recently noticed that you are creating many CLI sessions to cli.globusonline.org, each with a single blocking transfer. This is a suboptimal way to use Globus Online and in fact is causing us some resource usage issues. cli.globusonline.org

 Previous tools ◦ Different data-transfer protocols ◦ Some automated data synchronization  No one ◦ Select the best protocol depending on requirements ◦ Methods for tracking metadata and transferring it  Our work automatically ◦ Selects a protocol among FTPS, SFTP, Rsync, and UDR ◦ Injects a minimum of metadata ◦ GridFTP and P2P discarded: communications 1-to-1 ◦ FTPS instead of using FTP: minimum security level ◦ SFTP derives from SCP

FTPS, SFTP, Rsync and UDR

 Two machines located in Edinburgh ◦ VLAN Network 100MB/s  Synthetic program to generate events  Data size written to files: 50KB, 500KB, 1MB, 10MB, 100MB, 500MB, 1GB and 10GB.  Measures: transfer rate and elapsed time  Repetition: 10 times

SFTP fastest < 500MB Rsync fastest >= 500MB ** without compression Elapsed Time File SizeRsyncUDRSFTPFTPSRsync-cUDR-c 50KB KB MB MB MB GB GB

 UDR has been specially designed ◦ Large data transfer over long distance  UDR vs Rsync by using two machines ◦ Located in different local networks  University of Edinburgh  1GbE  AIST-Tsukuba  10GbE  Generated Files: 1MB, 500MB, 1GB, 10GB and 30GB.

UDR fastest ** without compression Elapsed Time File sizeRsyncUDRRsync-cUDR-c 1MB MB GB GB GB

 Front-end: GUI using Java SWING  Back-end: Decision tree  Data and Metadata ◦ Data stored in a remote repository (NAS) ◦ Metadata collected in remote database (MySQL)  Science gateway (Web tool) connected with the repository and database ◦ Searching ◦ Visualizing ◦ Analyzing ◦ Download

 FAST has been evaluated: ◦ By using synthetic programs for generating data  real time and non-real time  For each type of synchronization  Different data sizes, and different types of network locations  Short and Long term experiments  Stop and restart ◦ For transferring data from a real rock physic experiment  Laboratory- UCL (London) and Edinburgh  Days: 45 days  Interval: Every minute  Rock Samples: 1

 Use FAST in the Creep-2 experiment  Implement FAST policies ◦ Data available in the repository for specific users during a reasonable period  Sharing data from many-to-many locations  Decision-tree ◦ Automating generation and maintenance ◦ Keep up-to-date the by measuring transfers  Use FAST in more rock physics laboratories  Use FAST in other disciplines