High Performance Cooperative Data Distribution [J. Rick Ramstetter, Stephen Jenks] [A scalable, parallel file distribution model conceptually based on.

Slides:



Advertisements
Similar presentations
Exercises and Solutions Lecture 1
Advertisements

Packet / Message Switching Concepts
The BitTorrent Protocol. What is BitTorrent?  Efficient content distribution system using file swarming. Does not perform all the functions of a typical.
Incentives Build Robustness in BitTorrent Bram Cohen.
Presented by: Su Yingbin. Outline Introduction SocialSwam Design Notations Algorithms Evaluation Conclusion.
Clayton Sullivan PEER-TO-PEER NETWORKS. INTRODUCTION What is a Peer-To-Peer Network A Peer Application Overlay Network Network Architecture and System.
Lava: A Reality Check of Network Coding in Peer-to-Peer Live Streaming Mea Wang, Baochun Li Department of Electrical and Computer Engineering University.
Esma Yildirim Department of Computer Engineering Fatih University Istanbul, Turkey DATACLOUD 2013.
The TickerTAIP Parallel RAID Architecture P. Cao, S. B. Lim S. Venkatraman, J. Wilkes HP Labs.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 W. Schulte Chapter 5: Inter-VLAN Routing Routing And Switching.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 5: Inter-VLAN Routing Routing & Switching.
Denial-of-Service Resilience in Peer-to-Peer Systems D. Dumitriu, E. Knightly, A. Kuzmanovic, I. Stoica and W. Zwaenepoel Presenter: Yan Gao.
CompSci 356: Computer Network Architectures Lecture 21: Content Distribution Chapter 9.4 Xiaowei Yang
IPlane: An Information Plane for Distributed Services Offence by: Anup Goyal Sagar Vemuri.
Network Coding for Large Scale Content Distribution Christos Gkantsidis Georgia Institute of Technology Pablo Rodriguez Microsoft Research IEEE INFOCOM.
Spotlighting Decentralized P2P File Sharing Archie Kuo and Ethan Le Department of Computer Science San Jose State University.
Presented by Stephen Kozy. Presentation Outline Definition and explanation Comparison and Examples Advantages and Disadvantages Illegal and Legal uses.
Real-Time Geometric and Color Calibration for Multi-Projector Displays Christopher Larson, Aditi Majumder Large-Area High Resolution Displays Motivation.
Information Integration in Medical Databases Chris Trezzo, Professor Chen Li The Clinical Knowledge Gap With the recent explosion of new knowledge in the.
Wide Area High Definition Video Streaming for Tiled Displays Duy-Quoc Lai, Falko Kuester, Stephen Jenks, Zhiyu He · ·
Communication concepts (Continued) Week 2 Lecture 2.
Introduction to client/server architecture
COMPUTER TERMS PART 1. COOKIE A cookie is a small amount of data generated by a website and saved by your web browser. Its purpose is to remember information.
P2P File Sharing Systems
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 5: Inter-VLAN Routing Routing And Switching.
GridFTP Guy Warner, NeSC Training.
Introduction Widespread unstructured P2P network
Distributed Systems Early Examples. Projects NOW – a Network Of Workstations University of California, Berkely Terminated about 1997 after demonstrating.
Open up Evernote, mini whiteboards.  2 minutes – write down as many as you can think of.
By Shobana Padmanabhan Sep 12, 2007 CSE 473 Class #4: P2P Section 2.6 of textbook (some pictures here are from the book)
Peer to Peer Network Anas Hardan. What is a Network? What is a Network? A network is a group of computers and other devices (such as printers) that are.
Chapter 4. After completion of this chapter, you should be able to: Explain “what is the Internet? And how we connect to the Internet using an ISP. Explain.
1 BitTorrent System Efrat Oune Bar-Ilan What is BitTorrent? BitTorrent is a peer-to-peer file distribution system (built for intensive daily use.
ITIS 1210 Introduction to Web-Based Information Systems Chapter 38 How Music Sharing and File Sharing Work.
Performance Concepts Mark A. Magumba. Introduction Research done on 1058 correspondents in 2006 found that 75% OF them would not return to a website that.
Professor OKAMURA Laboratory. Othman Othman M.M. 1.
Bit Torrent A good or a bad?. Common methods of transferring files in the internet: Client-Server Model Peer-to-Peer Network.
1 Towards Cinematic Internet Video-on-Demand Bin Cheng, Lex Stein, Hai Jin and Zheng Zhang HUST and MSRA Huazhong University of Science & Technology Microsoft.
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
David Abarca, Instructor Del Mar College Computer Corner Internet Access Dialup, DSL, or Cable.
Adaptive File Transfers for Diverse Environments Himabindu Pucha, Purdue University David G. Andersen, Carnegie Mellon University Michael Kaminsky, Intel.
Multiclass P2P Networks: Static Resource Allocation for Service Differentiation and Bandwidth Diversity Florence Clévenot-Perronnin, Philippe Nain and.
The Transmission Control Protocol (TCP) Application Services (Telnet, FTP, , WWW) Reliable Stream Transport (TCP) Connectionless Packet Delivery.
Planned AlltoAllv a clustered approach Stephen Booth (EPCC) Adrian Jackson (EPCC)
Othman Othman M.M., Koji Okamura Kyushu University 1.
Kiew-Hong Chua a.k.a Francis Computer Network Presentation 12/5/00.
Computer Security Workshops Networking 101. Reasons To Know Networking In Regard to Computer Security To understand the flow of information on the Internet.
1 Process migration n why migrate processes n main concepts n PM design objectives n design issues n freezing and restarting a process n address space.
B IT T ORRENT T ECHNOLOGY Anthony Pervetich. H ISTORY Bram Cohen Designed the BitTorrent protocol in April 2001 Released July 2, 2001 Concept Late 90’s.
Networks and Distributed Systems Mark Stanovich Operating Systems COP 4610.
GridFTP Richard Hopkins
1 Reforming Software Delivery Using P2P Technology Purvi Shah Advisor: Jehan-François Pâris Department of Computer Science University of Houston Jeffrey.
Ohio State University Department of Computer Science and Engineering Servicing Range Queries on Multidimensional Datasets with Partial Replicas Li Weng,
Networks and Distributed Systems Sarah Diesburg Operating Systems COP 4610.
Bit Torrent Nirav A. Vasa. Topics What is BitTorrent? Related Terms How BitTorrent works Steps involved in the working Advantages and Disadvantages.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
GridFTP Guy Warner, NeSC Training Team.
CONNECTING TO THE INTERNET THROUGH ISP. WHAT IS INTERNET? The Internet is a worldwide collection of computer networks, cooperating with each other to.
Chapter 3.  The advantages and disadvantages of: ◦ Client-Server Networks & ◦ Peer-to-Peer Networks.
Day 13 Intro to MANs and WANs. MANs Cover a larger distance than LANs –Typically multiple buildings, office park Usually in the shape of a ring –Typically.
SURENDRA INSTITUTE OF ENGINEERING & MANAGEMENT PRESENTED BY : Md. Mubarak Hussain DEPT-CSE ROLL
November 19, 2016 Guide:- Mrs. Kale J. S. Presented By:- Hamand Amol Sambhaji. Hamand Amol Sambhaji. Pardeshi Dhananjay Rajendra. Pardeshi Dhananjay Rajendra.
An example of peer-to-peer application
NGS data transmission, A point view from a user
Introduction to BitTorrent
CMS analysis job and data transfer test results
Introduction to client/server architecture
PEER TO PEER BitTorrent.
The BitTorrent Protocol
GridTorrent Framework: A High-performance Data Transfer and Data Sharing Framework for Scientific Computing.
Presentation transcript:

High Performance Cooperative Data Distribution [J. Rick Ramstetter, Stephen Jenks] [A scalable, parallel file distribution model conceptually based on the BitTorrent protocol] · · S ummer U ndergraduate 2 R esearch 0 F ellowship in 0 I nformation 6 T echnology Abstract: Current large file transfer approaches used in high performance cluster computing are point-to-point, normally based on GridFTP or Secure Copy (scp). If the file is to be disseminated to local disks on cluster nodes, either each transfer happens separately or the connection to the front-end becomes a bottleneck. In either case, the overall distribution time is proportional to the number of destinations times a single transfer time. With files in the tens becoming common, this time becomes quite long. Peer-to-peer filesharing offers a solution in the form of BitTorrent, which allows receivers to send data as well, thus multiplying the available bandwidth and decreasing download time of all participants. However, BitTorrent was designed for unreliable, shared, slow home Internet connections and does not perform as well in high-performance environments with gigabit or faster networks. Using concepts from the BitTorrent protocol, a high performance cooperative file distribution method has been developed in C++, allowing data transfer times to multiple destinations to come significantly closer to the data transfer time to a single destination. With continued work on the project, the transfer time to multiple destinations will continue to drop. BitTorrent protocol overview: BitTorrent is a filesharing protocol commonly used for the transfer of large files (such as images of CD-roms or DVDs) across the internet- a relatively slow, unreliable link. BitTorrent is unique in that it takes advantage of the upload speed of all end users downloading a file (called “peers”). This is accomplished by breaking a file into “chunk” and distributing these chunks to peers. Once a peer receives a full chunk of the file, it can begin sharing that chunk with other peers. To initiate a BitTorrent transfer, a (soon to be) peer contacts a tracker, which is a computer responsible for managing the peers. The tracker responds to the contacting peer with a list of other peers interested in the same file. The contacting peer contacts its newfound neighboring peers to see which peers have any needed file chunks. If a neighboring peer has such a file chunk, our peer is said to be “interested” in that peer, and a file transfer begins between the two. Normal file transfer: File is distributed to nodes one at a time Results: The following table illustrates transfer speed results as obtained through the cooperative file transfer method when syncing a 575Mb file between 4 peers, only one of which holds the file at the start. These results are displayed next to the results for a normal transfer from one node to the other three, and also the theoretical maximum speed obtainable by this project. They are bounded by factors including switch throughput and disk I/O. Method used:Time taken for 4 node synchronization to complete Cooperative distribution model – actual results 38 seconds This distribution model – theoretical maximum (same as a 2 node sync) 22 seconds Normal transfer (copy to one node at a time) 65 seconds Conclusion: Speeds faster than normal transfer from the starting node to the other nodes are possible with the cooperative data distribution method. As of now, the speeds achieved do not live up to the project’s original goals. This is most likely due to high overhead and poor programming, things which should be fixed with continued work on the project. Parallel Filesystem: Data is striped across multiple hosts Can result in network bottlenecks as no one host holds the entire file Cooperative Distribution All hosts utilize their upload bandwidth to send file “chunks to other hosts”