Early Measurements of a Cluster-based Architecture for P2P Systems

Slides:



Advertisements
Similar presentations
P2P data retrieval DHT (Distributed Hash Tables) Partially based on Hellerstein’s presentation at VLDB2004.
Advertisements

Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Peer to Peer and Distributed Hash Tables
Scalable Content-Addressable Network Lintao Liu
Peer-to-Peer Systems Chapter 25. What is Peer-to-Peer (P2P)? Napster? Gnutella? Most people think of P2P as music sharing.
Massively Distributed Database Systems Distributed Hash Spring 2014 Ki-Joune Li Pusan National University.
University of Cincinnati1 Towards A Content-Based Aggregation Network By Shagun Kakkar May 29, 2002.
Expediting Searching Processes via Long Paths in P2P Systems 05/30 IDEA Lab.
Web Applications: Peer-to-Peer Networks Presentation by Michael Smathers Chapter 7.4 Internet Measurement: Infrastructure, Traffic and Applications by.
Peer to Peer File Sharing Huseyin Ozgur TAN. What is Peer-to-Peer?  Every node is designed to(but may not by user choice) provide some service that helps.
Scalable Application Layer Multicast Suman Banerjee Bobby Bhattacharjee Christopher Kommareddy ACM SIGCOMM Computer Communication Review, Proceedings of.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Distributed Lookup Systems
Overlay Networks EECS 122: Lecture 18 Department of Electrical Engineering and Computer Sciences University of California Berkeley.
Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek and Hari alakrishnan.
SCALLOP A Scalable and Load-Balanced Peer- to-Peer Lookup Protocol for High- Performance Distributed System Jerry Chou, Tai-Yi Huang & Kuang-Li Huang Embedded.
Object Naming & Content based Object Search 2/3/2003.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Freenet A Distributed Anonymous Information Storage and Retrieval System I Clarke O Sandberg I Clarke O Sandberg B WileyT W Hong.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
1 Seminar: Information Management in the Web Gnutella, Freenet and more: an overview of file sharing architectures Thomas Zahn.
Storage management and caching in PAST PRESENTED BY BASKAR RETHINASABAPATHI 1.
Introduction to Peer-to-Peer Networks. What is a P2P network Uses the vast resource of the machines at the edge of the Internet to build a network that.
P2P File Sharing Systems
INTRODUCTION TO PEER TO PEER NETWORKS Z.M. Joseph CSE 6392 – DB Exploration Spring 2006 CSE, UT Arlington.
1 Napster & Gnutella An Overview. 2 About Napster Distributed application allowing users to search and exchange MP3 files. Written by Shawn Fanning in.
Introduction Widespread unstructured P2P network
Popularity-Awareness in Temporal DHT for P2P-based Media Streaming Applications Abhishek Bhattacharya, Zhenyu Yang & Deng Pan IEEE International Symposium.
09/07/2004Peer-to-Peer Systems in Mobile Ad-hoc Networks 1 Lookup Service for Peer-to-Peer Systems in Mobile Ad-hoc Networks M. Tech Project Presentation.
Introduction to Peer-to-Peer Networks. What is a P2P network A P2P network is a large distributed system. It uses the vast resource of PCs distributed.
Content Overlays (Nick Feamster). 2 Content Overlays Distributed content storage and retrieval Two primary approaches: –Structured overlay –Unstructured.
HERO: Online Real-time Vehicle Tracking in Shanghai Xuejia Lu 11/17/2008.
Peer to Peer Research survey TingYang Chang. Intro. Of P2P Computers of the system was known as peers which sharing data files with each other. Build.
Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications Xiaozhou Li COS 461: Computer Networks (precept 04/06/12) Princeton University.
Structuring P2P networks for efficient searching Rishi Kant and Abderrahim Laabid Abderrahim Laabid.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
PSI Peer Search Infrastructure. Introduction What are P2P Networks? The term "peer-to-peer" refers to a class of systems and applications that employ.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
SIGCOMM 2001 Lecture slides by Dr. Yingwu Zhu Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications.
1 Peer-to-Peer Technologies Seminar by: Kunal Goswami (05IT6006) School of Information Technology Guided by: Prof. C.R.Mandal, School of Information Technology.
Scalable Content- Addressable Networks Prepared by Kuhan Paramsothy March 5, 2007.
Peer to Peer A Survey and comparison of peer-to-peer overlay network schemes And so on… Chulhyun Park
15-744: Computer Networking L-22: P2P. Lecture 22: Peer-to-Peer Networks Typically each member stores/provides access to content Has quickly.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
1. Outline  Introduction  Different Mechanisms Broadcasting Multicasting Forward Pointers Home-based approach Distributed Hash Tables Hierarchical approaches.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
Peer to Peer Network Design Discovery and Routing algorithms
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 37 – Introduction to P2P (Part 1) Klara Nahrstedt.
Two Peer-to-Peer Networking Approaches Ken Calvert Net Seminar, 23 October 2001 Note: Many slides “borrowed” from S. Ratnasamy’s Qualifying Exam talk.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 37 – Introduction to P2P (Part 1) Klara Nahrstedt.
NCLAB 1 Supporting complex queries in a distributed manner without using DHT NodeWiz: Peer-to-Peer Resource Discovery for Grids Sujoy Basu, Sujata Banerjee,
Plethora: A Locality Enhancing Peer-to-Peer Network Ronaldo Alves Ferreira Advisor: Ananth Grama Co-advisor: Suresh Jagannathan Department of Computer.
CS Spring 2010 CS 414 – Multimedia Systems Design Lecture 24 – Introduction to Peer-to-Peer (P2P) Systems Klara Nahrstedt (presented by Long Vu)
Distributed Web Systems Peer-to-Peer Systems Lecturer Department University.
Brocade: Landmark Routing on Overlay Networks
Peer-to-Peer Information Systems Week 12: Naming
A Survey of Peer-to-Peer Content Distribution Technologies Stephanos Androutsellis-Theotokis and Diomidis Spinellis ACM Computing Surveys, December 2004.
Controlling the Cost of Reliability in Peer-to-Peer Overlays
CHAPTER 3 Architectures for Distributed Systems
Plethora: Infrastructure and System Design
A Scalable content-addressable network
Building Peer-to-Peer Systems with Chord, a Distributed Lookup Service
Mobile P2P Data Retrieval and Caching
Distributed Hash Tables
MIT LCS Proceedings of the 2001 ACM SIGCOMM Conference
Peer-to-Peer Information Systems Week 12: Naming
Brocade: Landmark Routing on Peer to Peer Networks
Presentation transcript:

Early Measurements of a Cluster-based Architecture for P2P Systems Yinglian Xie Carnegie Mellon University Balachander Krishnamurthy, Jia Wang ATT Labs---Research

Motivation Peer-to-peer(P2P) applications provide us with a new content service model End-hosts self organized into an overlay network and share content with each other For a wide deployment of P2P applications We need a scalable content location and routing scheme in the application layer We need to study and understand P2P traffic patterns 11/8/2018

Recent Work Existing approaches for content location Recent designs Napster: uses a centralized server Gnutella: relies on flooding of queries Recent designs Distributed indexing schemes based on hash functions CAN, Chord, Pastry, Tapestry 11/8/2018

Our Work A Cluster-based architecture (CAP) for P2P systems Example application: distributed search (support keyword searching) Design: using network-aware clustering Early measurements of CAP trace analysis + simulations 11/8/2018

CAP System Design Network-aware clustering B. Krishnamurthy and J.Wang. On Network-Aware Clustering of Web Clients. In proceedings of ACM Sigcomm, August 2000 An effective technique to group clients that are topologically close and under common administrative domain Apply network-aware clustering to P2P applications An additional level in the hierarchy Less dynamism More scalability 11/8/2018

CAP Architecture Three entities Two operations Clustering server client delegate Clustering server Three entities Clustering server Delegate Client Two operations Node join and node leave Query lookup 11/8/2018

Inter-cluster Routing Each query has a maximum search depth Each delegate keeps a neighbor list Assigned randomly when the delegate joins the network Updated gradually based on application requirements Depth-first search among neighbors 11/8/2018

CAP Evaluation Collect Gnutella traces, apply network-aware clustering in trace data analysis To examine the potential advantage of using network-aware clustering Trace-driven simulations Measure CAP system performance based on real deployment (ongoing work) 11/8/2018

Collecting Gnutella Trace A modified open source Gnutella client (gnut) to passively monitor and log all Gnutella messages Location Trace length Number of IP addresses CMU 10 hours 799,386 ATT 14 hours 302,262 ACIRI 6 hours 185,905 Location Trace length Number of IP addresses CMU 89 hours 301,025 ATT 139 hours 261,094 UKY 96 hours 409,084 75 hours 292,759 WPI 10 hours 69,285 Table 1 Traces with unlimited connections Table 2 Traces with limited connections 11/8/2018

Cluster Distribution CMU trace 5/24/2001 – 5/25/2001, 799,386 IP addresses, 45,129 clusters Clustering helps reduce query latency by caching repeated queries 11/8/2018

Client and Cluster Distribution along Time Network-aware clustering helps reduce dynamism in the P2P network 11/8/2018

Simulation Trace-driven simulation Performance metrics Use Gnutella trace to generate “join, leave, search” Assume the query distribution follows the file distribution Performance metrics Hit rate Overhead Search Latency 11/8/2018

Hit Rate Use CMU trace 1,000 node stationary network 311 clusters 4,615search messages 3,793 unique files 11/8/2018

Overhead and Search Latency Messages per search, forward operations per delegate In Gnutella, overhead grows exponentially In CAP, overhead grows linearly Search Latency Application level hop length In CAP, search path length is short 11/8/2018

Summary CAP is promising to increase stability and scalability of distributed applications Ongoing work: We are implementing CAP, deploying it in machines around the world, and measuring the performance 11/8/2018