Peer-to-Peer Supported Cache System for File Transfer 2003.8.28 Joonbok Lee

Slides:



Advertisements
Similar presentations
Squirrel: A peer-to- peer web cache Sitaram Iyer (Rice University) Joint work with Ant Rowstron (MSR Cambridge) Peter Druschel (Rice University) PODC 2002.
Advertisements

Inktomi Confidential and Proprietary The Inktomi Climate Lab: An Integrated Environment for Analyzing and Simulating Customer Network Traffic Stephane.
Welcome to Middleware Joseph Amrithraj
Squirrel: A peer-to-peer web cache Sitaram Iyer Joint work with Ant Rowstron (MSRC) and Peter Druschel.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Storage management and caching in PAST, a large-scale, persistent peer- to-peer storage utility Antony Rowstron, Peter Druschel.
CHORD – peer to peer lookup protocol Shankar Karthik Vaithianathan & Aravind Sivaraman University of Central Florida.
Xiaoli Zhang P-Grid: A self-organizing access structure for P2P information systems Karl Aberer Department of Communication Systems Swiss Federal Institute.
Cloud Download : Using Cloud Utilities to Achieve High-quality Content Distribution for Unpopular Videos Yan Huang, Tencent Research, Shanghai, China Zhenhua.
“Scalable and Topologically-aware Application-layer Multicast” Yusung Kim Korea Advanced Institute of Science and Technology.
Distributed components
Peer-to-Peer Networks as a Distribution and Publishing Model Jorn De Boever (june 14, 2007)
SCAN: A Dynamic, Scalable, and Efficient Content Distribution Network Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy,
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Peer-to-peer Multimedia Streaming and Caching Service Jie WEI, Zhen MA May. 29.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
FRIENDS: File Retrieval In a dEcentralized Network Distribution System Steven Huang, Kevin Li Computer Science and Engineering University of California,
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
Chord and CFS Philip Skov Knudsen Niels Teglsbo Jensen Mads Lundemann
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Exploiting Content Localities for Efficient Search in P2P Systems Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang 1 1 College of William and Mary,
Protecting Free Expression Online with Freenet Presented by Ho Tsz Kin I. Clarke, T. W. Hong, S. G. Miller, O. Sandberg, and B. Wiley 14/08/2003.
SCALLOP A Scalable and Load-Balanced Peer- to-Peer Lookup Protocol for High- Performance Distributed System Jerry Chou, Tai-Yi Huang & Kuang-Li Huang Embedded.
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
Secure Overlay Services Adam Hathcock Information Assurance Lab Auburn University.
Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
Peer-to-peer Multimedia Streaming and Caching Service by Won J. Jeon and Klara Nahrstedt University of Illinois at Urbana-Champaign, Urbana, USA.
Storage management and caching in PAST PRESENTED BY BASKAR RETHINASABAPATHI 1.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Data Consistency in the Structured Peer-to-Peer Network Cheng-Ying Ou, Polly Huang Network and Systems Lab 台灣大學電機資訊學院電機所.
Content Distribution March 8, : Application Layer1.
Freenet. Anonymity  Napster, Gnutella, Kazaa do not provide anonymity  Users know who they are downloading from  Others know who sent a query  Freenet.
Department of Computer Science Southern Illinois University Edwardsville Dr. Hiroshi Fujinoki and Kiran Gollamudi {hfujino,
Data Management in Distributed Systems Minqi Zhou Software Engineering Institute Office: Room 111 Mathematics Building Phone:
Proxy-assisted Content Sharing Using Content Centric Networking (CCN) for Resource-limited Mobile Consumer Devices Jihoon Lee, Dae Youb Kim IEEE Transactions.
Advanced Network Architecture Research Group 2001/11/149 th International Conference on Network Protocols Scalable Socket Buffer Tuning for High-Performance.
Security Considerations for Structured p2p Peng Wang 6/04/2003.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.
Vincent Matossian September 21st 2001 ECE 579 An Overview of Decentralized Discovery mechanisms.
PSI Peer Search Infrastructure. Introduction What are P2P Networks? The term "peer-to-peer" refers to a class of systems and applications that employ.
A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.
Dr. Yingwu Zhu Summary Cache : A Scalable Wide- Area Web Cache Sharing Protocol.
Advanced Network Architecture Research Group 2001/11/74 th Asia-Pacific Symposium on Information and Telecommunication Technologies Design and Implementation.
DISTRIBUTED COMPUTING Introduction Dr. Yingwu Zhu.
Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan Presented.
1 Peer-to-Peer Technologies Seminar by: Kunal Goswami (05IT6006) School of Information Technology Guided by: Prof. C.R.Mandal, School of Information Technology.
Paper Survey of DHT Distributed Hash Table. Usages Directory service  Very little amount of information, such as URI, metadata, … Storage  Data, such.
Peer to Peer A Survey and comparison of peer-to-peer overlay network schemes And so on… Chulhyun Park
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
A P2P-Based Architecture for Secure Software Delivery Using Volunteer Assistance Purvi Shah, Jehan-François Pâris, Jeffrey Morgan and John Schettino IEEE.
Peer-to-Peer and Collective Intelligence A platform for collaboration Andrew Roczniak Collective Intelligence Lab Multimedia Communications Research Lab.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
ITGS Network Architecture. ITGS Network architecture –The way computers are logically organized on a network, and the role each takes. Client/server network.
Squirrel: A decentralized peer-to- peer web cache Paper by Sitaram Iyer, Antony Rowstron and Peter Druschel (© 2002) Presentation* by Alexander Prohaska.
Peer-to-Peer (P2P) Networks By Bongju Yu. Contents  What is P2P?  Features of P2P systems  P2P Architecture  P2P Protocols  P2P Projects  Reference.
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
Hiearchial Caching in Traffic Server. Hiearchial Caching  A set of techniques and mechanisms to increase the size and performance of network caches.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
Large Scale Sharing Marco F. Duarte COMP 520: Distributed Systems September 19, 2004.
An Analysis of Internet Content Delivery Systems 19 rd November, 2007 Youngsub CSE, SNU.
Brocade: Landmark Routing on Overlay Networks
Magdalena Balazinska, Hari Balakrishnan, and David Karger
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
Early Measurements of a Cluster-based Architecture for P2P Systems
Building Peer-to-Peer Systems with Chord, a Distributed Lookup Service
Chord and CFS Philip Skov Knudsen
MIT LCS Proceedings of the 2001 ACM SIGCOMM Conference
Presentation transcript:

Peer-to-Peer Supported Cache System for File Transfer Joonbok Lee

Contents 1. Motivation 2. Problem Statement 3. Related Work 4. Approach 5. Simulation 6. Conclusion 7. Reference

1. Motivation ► KAIST Netflow Measurement ( )  Analyze the flow data of KAIST Border Router. Fig 2. Cumulative Distribution Function of the files transferred by FTP and HTTP. 1/17 10MB Some Findings: 1) The amount of bandwidth consumed by FTP is similar with the one consumed by HTTP 2) 78% of the FTP traffic is due to the large files which is larger than 10MB. Fig 1. The byte ratio in terms of Protocols

2. Problem Statement ► Unnegligible access to the large multimedia data. [Jung00] ► FTP Traffic:  17% of total traffic.  78% of them are larger than 10MB.  11% of them were failed during transfer. ► The large files transferred by FTP generate much traffic, and many of them takes long time. ► To solve this problem, we propose HTTP/FTP proxy cache which is scalable in terms of bandwidth and storage. 2/17

3. Related Work ► The researches which solve large files’ transfer.  RepliCache: A New Approach to Scalable Networking Storage System for Large Objects [Jung97]  Proactive Web caching with cumulative prefetching [Jung00] ► The researches which has scalable architecture.  Squirrel: A decentralized peer-to-peer web cache [Iyer02]  Peer-to-Peer Caching Scheme to Address Flash Crowds[Stading02] 3/17

4. Approach 4.1 Motivation 4.2 Cache with Peer-to-Peer Storage 4.3 Model 4.4 Detail Design 4/17

4.1 Motivation ► Peer-to-Peer Architecture as a Cache  Scalability (bandwidth, computing power and storage)  Cost  Overhead (to find object and to persist system) ► The Latency  One of the important metric of cache performance.  the lookup time + delivery time  Delivery time is depend on the file size.  Small files: the lookup time dominate Large files: the deliver time dominate 5/17

4.2 Cache with Peer-to-Peer Storage ► Hybrid Approach  Scalability: peer-to-peer storage  Lookup and control: central cache. ► Peer-to-Peer two-layer storage  The storage in central cache ► Expected to be always available, low latency. ► Store small files.  The second tier storages ► can be unavailable. ► Store large files. 6/17

O s1 Connectivity Cloud Peer 1 O S1,O S2 : Small object O L1, O L2 : Large object 4.3 Model HTTP/ FTP Server A Local Area Network Peer 2 Peer n,O s2 O L1 O L2 O L1 Peer-to-Peer Storage O s1 O L1 Web Proxy Cache with FTP supporting module HTTP/ FTP Server B O s1 Fig 3. Cache with two-layer storage 7/17

4.4 Detail Design ► 2 new components to support FTP and large files.  Preserve transparency of File Location ► FTP Cache Daemon  Store the state of FTP connection  Make the URL of files transferred by FTP  Check consistency. ► P2P Storage Manager  Control its own storage.  Managed by object table in central cache. HTTP Cache Daemon FTP Cache Daemon Object Table Storage Manager FTP/HTTP Server FTP/ HTTP Client P2P Storage Manager FTP/ HTTP Client P2P Storage Manager Control Data Fig 4. Control and Data connection between components 8/17

5. Simulation 5.1 Simulation Environment 5.2 Simulation Result 9/17

5.1 Simulation Environment ► Trace  Requested FTP file list  Gather the FTP control (port 21) packet and produce the trace ► ~ ( two weeks )  76,880 (783GB) file requests.  417 clients ► Assumption  Local Network: 100Mbps ► Simulated Caches  Cache A: 100GB Storage, 100Mbps  Cache B: Infinite Storage, 100Mbps  Cache C: Infinite Storage, Infinite Bandwidth  Cache D: Cache with Peer-to-Peer Storage 10/17

5.2 Simulation Result: Hit Ratio Fig 5. Cache Hit Ratio 11/17 Fig 6. Outbound traffic No strict storage control Some peers may have same files in their storage Even though some peers have available storage, the other peers can remove the file from their cache as a victim. degrade the performance of storage, but not much.

5.2 Simulation Result: Latency Fig 7. Average latency of 95~105MB files 12/17 Fig 8. Average latency of 95~105KB files Without the increase of small files’ latency, we can reduce the latency of large files.

5.2 Simulation Result :Cache Hit Ratio degradation by the peer failure Fig 8. Cache hit ratio degradation by the peer failure 13/17 30%

6. Conclusion 1) Shows that much amount of traffic is produced by FTP by the measurement. Among them,78% were occurred by the files larger than 10MB. 2) Propose the cache system which has two-layer storage using peer-to-peer architecture. It is transparent to the location of files. 3) Shows that two layer storage has good performance for the large files as well as small files using trace-driven simulation. 4) Can reduce the outbound traffic and latency by caching using our sistem. ► Other issues  Collaboration between proposed systems.  Load balancing between peers.  Security problem. 15/17

7. Reference ► Jaeyeon Jung, “RepliCache: Enhancing Web Caching Architecture with Replication of Large Objects” ► Jaeyeon Jung, Dongman Lee and Kilnam Chon, "Proactive Web Caching with Cumulative Prefetching for Large Multimedia Data", Computer Networks 33 (2000) pp ► Sitaram Iyer, Ant Rowstron and Peter Druschel, “Squirrel: A decentralized peer-to-peer web cache” In Proceedings of the PODC ’02, Monterey, CA ► Tyron Stading, Petros Maniatis, Mary Baker, “Peer-to-Peer Caching Schemes to Address Flash Crowds”, In Proceedings of the IPTPS ’02, MA, USA ► Hyun-chul Kim, Joonbock Lee, Jungwon Suh, and Kilnam Chon, “Measurements of File-Systems Deployed on High-Performance Research and Education Networks”, Technical Report ► I.Stoica, R. Morris, D. Karger, F.Kaas hoek, and H.Balakrishnan. Chord: A scalable content-addressable network. In Proceedings of the ACM SIGCOMM 2001 Technical Conference, San Diego, CA, USA, August 2001 ► S. Ratnasamy, P. Francis, M. Handley, R. Karp, and S. Shenker. “A scalable content- addressable network.” In Proceedings of the ACM SIGCOMM 2001 Technical Conference, San Diego, CA, USA, August /17

7. Reference ► A. Rowstron and P. Druschel, "Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems". IFIP/ACM International Conference on Distributed Systems Platforms (Middleware), Heidelberg, Germany, pages , November, ► Ian Clarke, Theodore W. Hong, Scott G. Miller, Oskar Sandberg, and Brandon Wiley, "Protecting Free Expression Online with Freenet," IEEE Internet Computing 6(1), January/February ► William J. Bolosky, John R. Douceur, David Ely, and Marvin Theimer, Feasibility of a Serverless Distributed File System Deployed on an Existing Set of Desktop PCs In proceeding of SIGMETRICS 2000 ► Internet RFC 959 File Transfer Protocol 17/17

Request File Check Protocol Lookup Object Table Check Consistency Check Cached Location Open FTP control connections to both peer which has file and peer which requests is. Make FTP data connections between two the peers. HTTP FTP not cached cached inconsistent consistent peer Handle a request like web proxy cache Transfer file Check File Size Central cache opens data connection to client. central server Update Object Table Transfer file Opens data connection between server and client Transfer file Server opens data connection to central cache. Update Object Table small Large Central cache opens data connection to client. Transfer file Update Object Table Appendix A