1 GridTorrent Framework: A High-performance Data Transfer and Data Sharing Framework for Scientific Computing.

Slides:



Advertisements
Similar presentations
Basic Concepts of a Computer Network
Advertisements

Chapter 17: WEB COMPONENTS
Chapter 17 Networking Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design Principles, 6/E William.
High Performance Computing Course Notes Grid Computing.
Linux Clustering A way to supercomputing. What is Cluster? A group of individual computers bundled together using hardware and software in order to make.
Presented by: Yash Gurung, ICFAI UNIVERSITY.Sikkim BUILDING of 3 R'sCLUSTER PARALLEL COMPUTER.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
A Web Services Based Streaming Gateway for Heterogeneous A/V Collaboration Hasan Bulut Computer Science Department Indiana University.
Introduction to Networks Networking Concepts IST-200 VWCC 1.
Cluster computing facility for CMS simulation work at NPD-BARC Raman Sehgal.
In The Name Of Allah Whose Blessings Are Uncountable.
Data Mining on the Web via Cloud Computing COMS E6125 Web Enhanced Information Management Presented By Hemanth Murthy.
Information Systems Today: Managing in the Digital World TB4-1 4 Technology Briefing Networking.
Stuart Cunningham - Computer Platforms COMPUTER PLATFORMS Network Operating Systems Week 9.
Ali Kaplan Advisor: Prof. Geoffrey C. Fox 2/02/20091.
Ali Kaplan Advisor: Prof. Geoffrey C. Fox 14/27/2009.
Ali Kaplan Advisor: Prof. Geoffrey C. Fox 2/02/20091.
JMS Compliance in NaradaBrokering Shrideep Pallickara, Geoffrey Fox Community Grid Computing Laboratory Indiana University.
IRODS performance test and SRB system at KEK Yoshimi KEK Building data grids with iRODS 27 May 2008.
Globus Striped GridFTP Framework and Server Raj Kettimuthu, ANL and U. Chicago.
Thesis Proposal Ali Kaplan
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Computers Are Your Future Tenth Edition Chapter 8: Networks: Communicating & Sharing Resources Copyright © 2009 Pearson Education, Inc. Publishing as Prentice.
High Performance User-Level Sockets over Gigabit Ethernet Pavan Balaji Ohio State University Piyush Shivam Ohio State University.
Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.
Boosting Event Building Performance Using Infiniband FDR for CMS Upgrade Andrew Forrest – CERN (PH/CMD) Technology and Instrumentation in Particle Physics.
1 Configurable Security for Scavenged Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh with: Samer Al-Kiswany, Matei Ripeanu.
1 Next Few Classes Networking basics Protection & Security.
Secure, Collaborative, Web Service enabled and Bittorrent Inspired High-speed Scientific Data Transfer Framework.
Moving Large Amounts of Data Rob Schuler University of Southern California.
Amy Apon, Pawel Wolinski, Dennis Reed Greg Amerson, Prathima Gorjala University of Arkansas Commercial Applications of High Performance Computing Massive.
A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.
Srihari Makineni & Ravi Iyer Communications Technology Lab
A Comparative Study of Signaling Protocols for Data Management and Synchronization # Department of Computer Engineering, Kocaeli University.
Investigating the Performance of Audio/Video Service Architecture II: Broker Network Ahmet Uyar & Geoffrey Fox Tuesday, May 17th, 2005 The 2005 International.
Framework for Evaluating Distributed Smalltalk Interface Jan Lukeš Czech Technical University.
Increasing Web Server Throughput with Network Interface Data Caching October 9, 2002 Hyong-youb Kim, Vijay S. Pai, and Scott Rixner Rice Computer Architecture.
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
Summary - Part 2 - Objectives The purpose of this basic IP technology training is to explain video over IP network. This training describes how video can.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
Large-scale Virtualization in the Emulab Network Testbed Mike Hibler, Robert Ricci, Leigh Stoller Jonathon Duerig Shashi Guruprasad, Tim Stack, Kirk Webb,
Computer Science and Engineering Copyright by Hesham El-Rewini Advanced Computer Architecture CSE 8383 April 11, 2006 Session 23.
Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ Koo-Min Ahn.
Investigating the Performance of Audio/Video Service Architecture I: Single Broker Ahmet Uyar & Geoffrey Fox Tuesday, May 17th, 2005 The 2005 International.
HIGUCHI Takeo Department of Physics, Faulty of Science, University of Tokyo Representing dBASF Development Team BELLE/CHEP20001 Distributed BELLE Analysis.
GridNEWS: A distributed Grid platform for efficient storage, annotating, indexing and searching of large audiovisual news content Ioannis Konstantinou.
A Grid-enabled Multi-server Network Game Architecture Tianqi Wang, Cho-Li Wang, Francis C.M.Lau Department of Computer Science and Information Systems.
Distributed applications monitoring at system and network level A.Brunengo (INFN- Ge), A.Ghiselli (INFN-Cnaf), L.Luminari (INFN-Roma1), L.Perini (INFN-Mi),
7. Grid Computing Systems and Resource Management
Communications & Networks National 4 & 5 Computing Science.
Communications & Networks National 4 & 5 Computing Science.
Distributed Handler Architecture (DHArch) Beytullah Yildiz Advisor: Prof. Geoffrey C. Fox.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
High-level Interfaces for Scalable Data Mining Ruoming Jin Gagan Agrawal Department of Computer and Information Sciences Ohio State University.
Wide-Area Parallel Computing in Java Henri Bal Vrije Universiteit Amsterdam Faculty of Sciences vrije Universiteit.
Introduction to Networking
W.A.Wojcik/CCIN2P3, Nov 1, CCIN2P3 Site report Wojciech A. Wojcik IN2P3 Computing Center URL:
1.3 ON ENHANCING GridFTP AND GPFS PERFORMANCES A. Cavalli, C. Ciocca, L. dell’Agnello, T. Ferrari, D. Gregori, B. Martelli, A. Prosperini, P. Ricci, E.
January 20, 2000K. Sliwa/ Tufts University DOE/NSF ATLAS Review 1 SIMULATION OF DAILY ACTIVITITIES AT REGIONAL CENTERS MONARC Collaboration Alexander Nazarenko.
IT vocabulary 3.3 Networks By: Flora Bertin.
Pathway to Petaflops A vendor contribution Philippe Trautmann Business Development Manager HPC & Grid Global Education, Government & Healthcare.
Nguyen Thi Thanh Nha HMCL by Roelof Kemp, Nicholas Palmer, Thilo Kielmann, and Henri Bal MOBICASE 2010, LNICST 2012 Cuckoo: A Computation Offloading Framework.
System Optimization Networking
Computer Science and Engineering Copyright by Hesham El-Rewini Advanced Computer Architecture CSE 8383 April 6, 2006 Session 22.
Overview on the work performed during EPIKH Training Faiza MEDJEK /INFN, CATANIA 1.
Performance measurement of transferring files on the federated SRB
Clouds , Grids and Clusters
Hadoop Clusters Tess Fulkerson.
مقدمة في الحاسب الآلي T. Arwa Alsarami.
GridTorrent Framework: A High-performance Data Transfer and Data Sharing Framework for Scientific Computing.
Presentation transcript:

1 GridTorrent Framework: A High-performance Data Transfer and Data Sharing Framework for Scientific Computing

2 Outline Introduction Motivation PTCP GridTorrent Framework Test Results  LAN Test Results  WAN Test Results  Overhead Questions

3 Introduction Today’s computational science is data- intensive  Large Hadron Collider (LHC) experiment at CERN generates petabytes of data Accessibility, replication and creation of the data are made very easy by  Internet  Computational Grid

4 Motivation The data is geographically distributed Users are dispersed Collaboration environments are required Resources should be used in efficient and effective way  CPU  Storage  Network

5 PTCP TCP has a performance problem over WANs It was developed to solve the above problem by using striping technique

6 GridTorrent Framework It is aimed  to provide collaboration environment for dispersed users  to make data transfer, management, and sharing easy via content manager  to use systems resources efficiently and effectively by harnessing P2P (Bittorrent) network structure

7 Collaboration & Content Manager The Content Manager allows users to publish or share their files with selected access control rights The Collaboration Manager permits users to build a virtual sharing environment by managing working groups or friend list ACL enforce access control rights for a given content Task Manager handles the users’ task list

8 GridTorrent Client It is responsible for  initiating actual data publishing  data sharing with other GTF clients  ensuring secure environment for the above activities

9 WS-Tracker WS-Tracker is a WS enabled server It assists in the communication between peers (GridTorrent clients) It delivers task lists which is generated by users to GridTorrent clients It supplies ACL of each shared file to GridTorrent clients

10 Experimental Results PTCP and GridTorrent Framework tests cases were conducted both in LAN and WAN type of computer networks Server and client machines’ specification and location table File size is 300 MB NameSpecificationNetwork Interface InstitutionLocation A Intel(R) Quad-Core Xeon(TM) 4x2.33GHz CPU with 8GB of RAM on Red Hat Enterprise Linux 4.0 Broadcom NetXtreme II BCM Base-T Indiana University Bloomington, IN B Sun Fire V880 8x1.2GHz UltraSPARC III processors with 16GB of RAM on Solaris 9. It has 6x72GB 10K rpm internal HD Gigabit Ethernet and 10/100-BaseT Ethernet Indiana University Bloomington, IN C Dual Pentium III 731MHz CPU with 512MB of RAM on GNU/Linux fc5 Gigabit Ethernet and 10/100-BaseT Ethernet Florida State University Tallahassee, FL

11 LAN Test Setup Server is located at Bloomington, IN Client is at Indianapolis, IN The number of parallel TCP streams between server and client has increased from 1 to 16 (PTCP) The number of seeders increased from 1 to 16 (GTF) Client and server configuration for PTCP GridTorrent test case configuration for LAN test. Regular Java sockets are used for data transfer.

12 LAN Test Result There is no significant improvement in bandwidth for both PTCP and GTF Experimental data transfer ( Mbps) rate is much lower the theoretical (1000Mbps) and measured data transfer rate (857Mbps)

13 WAN Test-I Setup Server is located at Bloomington, IN Client is at Tallahassee, FL The number of parallel TCP streams between server and client has increased from 1 to 16 (PTCP) The number of seeders increased from 1 to 16 (GTF) Client and server layout for PTCP test case GridTorrent test case configuration for wide area network test. Regular Java sockets are used for data transfer.

14 WAN Test-I Result Bandwidth usage is vastly improved in both GTF and PTCP PTCP’s bandwidth utilization rate has risen steadily until fifteen streams  its peak value is 118 Mbps GTF’s bandwidth utilization rate has risen steadily until thirteen streams GridTorrent is performing better than PTCP when parallel streams number is less than five

15 WAN Test-II Setup Server is located at Bloomington, IN Client is at Tallahassee, FL The number of parallel TCP streams between server and client has increased from 1 to 16 (PTCP) Besides Java socket, other data transfer protocols can be exploited in GridTorrent client The number of seeders increased from 1 to 16 (GTF)  Four parallel TCP sockets were used between peer and seeders Client and server layout for PTCP test case GridTorrent test case configuration for wide area network test. GridTorrent client uses four parallel TCP sockets in each connection for every source

16 WAN Test-II Result Using parallel TCP with Bittorrent algorithm demonstrates much better bandwidth usage than standalone GridTorrent and PTCP The maximum attained bandwidth is around 145 Mbps which is %23 higher than PTCP’s result

17 Overhead Both parallel TCP and GridTorrent have overhead due to nature of multiple parallel connections PTCP’s communication channel overhead time can be compared to GridTorrent WS-Tracker client’s overhead time varying between 300 and 600 milliseconds Another overhead of GridTorrent is that control messages exchanged between peers to ensure strictly enforced to all participating peers  The total size of overhead messages is between 148KB to 169 KB

18 Questions

19 Thanks to All.