Peer-to-Peer Information Systems Week 6: Performance

Slides:



Advertisements
Similar presentations
Performance in Decentralized Filesharing Networks Theodore Hong Freenet Project.
Advertisements

Complex Networks Advanced Computer Networks: Part1.
Topics in Database Systems: Data Management in Peer-to-Peer Systems
Peer-to-Peer Systems Chapter 25. What is Peer-to-Peer (P2P)? Napster? Gnutella? Most people think of P2P as music sharing.
Clayton Sullivan PEER-TO-PEER NETWORKS. INTRODUCTION What is a Peer-To-Peer Network A Peer Application Overlay Network Network Architecture and System.
Efficient Search - Overview Improving Search In Peer-to-Peer Systems Presented By Jon Hess cs294-4 Fall 2003.
Improving Search in Peer-to-Peer Networks Beverly Yang Hector Garcia-Molina Presented by Shreeram Sahasrabudhe
Structuring Unstructured Peer-to-Peer Networks Stefan Schmid Roger Wattenhofer Distributed Computing Group HiPC 2007 Goa, India.
Information Networks Small World Networks Lecture 5.
Search in Power-Law Networks Presented by Hakim Weatherspoon CS294-4: Peer-to-Peer Systems Slides also borrowed from the following paper Path Finding Strategies.
LightFlood: An Optimal Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
An Overview of Peer-to-Peer Networking CPSC 441 (with thanks to Sami Rollins, UCSB)
Peer-to-Peer Networks João Guerreiro Truong Cong Thanh Department of Information Technology Uppsala University.
Peer-to-Peer Networks as a Distribution and Publishing Model Jorn De Boever (june 14, 2007)
Building Low-Diameter P2P Networks Eli Upfal Department of Computer Science Brown University Joint work with Gopal Pandurangan and Prabhakar Raghavan.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
presented by Hasan SÖZER1 Scalable P2P Search Daniel A. Menascé George Mason University.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Efficient Search in Peer to Peer Networks By: Beverly Yang Hector Garcia-Molina Presented By: Anshumaan Rajshiva Date: May 20,2002.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
P2P File Sharing Systems
INTRODUCTION TO PEER TO PEER NETWORKS Z.M. Joseph CSE 6392 – DB Exploration Spring 2006 CSE, UT Arlington.
Freenet. Anonymity  Napster, Gnutella, Kazaa do not provide anonymity  Users know who they are downloading from  Others know who sent a query  Freenet.
Peer-to-Peer Computing CS587x Lecture Department of Computer Science Iowa State University.
1 Napster & Gnutella An Overview. 2 About Napster Distributed application allowing users to search and exchange MP3 files. Written by Shawn Fanning in.
Introduction Widespread unstructured P2P network
1 Reading Report 4 Yin Chen 26 Feb 2004 Reference: Peer-to-Peer Architecture Case Study: Gnutella Network, Matei Ruoeanu, In Int. Conf. on Peer-to-Peer.
Searching In Peer-To-Peer Networks Chunlin Yang. What’s P2P - Unofficial Definition All of the computers in the network are equal Each computer functions.
Chapter 2: Application layer
Jonathan Walpole CSE515 - Distributed Computing Systems 1 Teaching Assistant for CSE515 Rahul Dubey.
PSI Peer Search Infrastructure. Introduction What are P2P Networks? The term "peer-to-peer" refers to a class of systems and applications that employ.
Super-peer Network. Motivation: Search in P2P Centralised (Napster) Flooding (Gnutella)  Essentially a breadth-first search using TTLs Distributed Hash.
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
Optimal Content Delivery with Network Coding Derek Leong, Tracey Ho California Institute of Technology Rebecca Cathey BAE Systems CISS 2009 March 19, 2009.
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
1 Peer-to-Peer Technologies Seminar by: Kunal Goswami (05IT6006) School of Information Technology Guided by: Prof. C.R.Mandal, School of Information Technology.
1 A connection management protocol for promoting cooperation in Peer-to-Peer networks Authors: Murat Karakaya, Ibrahim Korpeoglu, and Ozgur Ulusoy Source:
Analyzing the Vulnerability of Superpeer Networks Against Attack Niloy Ganguly Department of Computer Science & Engineering Indian Institute of Technology,
LightFlood: An Efficient Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
Computer Networking P2P. Why P2P? Scaling: system scales with number of clients, by definition Eliminate centralization: Eliminate single point.
Peer-to-Peer (P2P) Networks By Bongju Yu. Contents  What is P2P?  Features of P2P systems  P2P Architecture  P2P Protocols  P2P Projects  Reference.
Peer-to-Peer Video Systems: Storage Management CS587x Lecture Department of Computer Science Iowa State University.
CS 347Notes081 CS 347: Parallel and Distributed Data Management Notes 08: P2P Systems.
P2P Search COP6731 Advanced Database Systems. P2P Computing  Powerful personal computer Share computing resources P2P Computing  Advantages: Shared.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
Advanced Computer Networks: Part 2 Complex Networks, P2P Networks and Swarm Intelligence on Graphs.
Composing Web Services and P2P Infrastructure. PRESENTATION FLOW Related Works Paper Idea Our Project Infrastructure.
Distributed Web Systems Peer-to-Peer Systems Lecturer Department University.
Peer-to-Peer Information Systems Week 12: Naming
An example of peer-to-peer application
Data Center Network Architectures
CSE 486/586 Distributed Systems Distributed Hash Tables
CHAPTER 3 Architectures for Distributed Systems
Peer-to-Peer and Social Networks
Early Measurements of a Cluster-based Architecture for P2P Systems
The Watts-Strogatz model
Peer-to-Peer Video Services
Improving Performance in the Gnutella Protocol
Joydeep Chandra, Santosh Shaw and Niloy Ganguly
Mobile P2P Data Retrieval and Caching
An Overview of Peer-to-Peer
Peer-To-Peer Data Management
Peer-to-Peer Information Systems Week 2: File Sharing
Peer-to-Peer Information Systems Week 6: Assignment #4
Lecture 9: Network models CS 765: Complex Networks
Peer-to-Peer Information Systems Week 2: File Sharing
Peer-to-Peer Information Systems Week 6: Assignment #4
Peer-to-Peer Information Systems Week 12: Naming
Presentation transcript:

Peer-to-Peer Information Systems Week 6: Performance Old Dominion University Department of Computer Science CS 495/595 Fall 2004 Michael L. Nelson <mln@cs.odu.edu> 10/05/04

Regular & Random Graphs vertices: n=4096 neighbors: k=8 Regular Graph average diameter n/2k clustering coefficient (connections present) / (k(k-1)/2) approaches .75 for large n Random Graph average diameter log(n/log k) clustering coefficient k/n approaches 0 for large n =256 =4 =0.002 =0.5 figures 14-4 & 14-5 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Small Worlds, Revisited Duncan J. Watts, Steven H. Strogatz. Collective Dynamics of 'Small-World' Networks. Nature 393, 440-442 (1998) http://dx.doi.org/10.1038/30918 figures 1 & 2 from Watts & Strogatz

Freenet Performance Simulation 1000 identical nodes capacity: n=1000, k=4 diameter = 125, CC=0.5 capacity: 50 data items (all initially empty) 200 references (4 nearest neighbors initially linked) every timestep, randomly pick a node for an insert or request TTL=20 figure 14-8 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Freenet Path Evolution approaching 2 figure 14-9 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Freenet Path Evolution approaching 6; cf. ~50 for random routing (figure 14-13) figures 14-10 & 14-12 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Growing the Freenet Network start with 20 nodes add a new node every 5 timesteps until 1000 nodes results path length 2.2 CC 0.22 50% requests complete in <= 5 hops figure 14-15 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Attacks vs. Failures figure 14-19 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Free Riders in Freenet No real impact: if you don’t share files, you don’t get linked into the network the free riding node can contribute traffic to the network if a node refuses incoming connections, it is considered “down” and the other nodes route around it

Freenet Scalability slope increase probably due to a simulation side-effect figures 14-21 & 14-22 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Gnutella Performance build 1000 empty nodes, randomly add 1500 edges n=1000, k=3 random graph! (cf. Figure 14-5) after initialization, the network does not evolve no adds or joins “publish” 2500 data objects; each exists in 20 locations 50k (2500 * 20) total files in the network perform 300 queries on random files, TTL= halt query after file is found (unlike the real Gnutella)

Gnutella Communication breadth-first search finds the shortest path… figure 3 from Ripeanu, Iamnitchi & Foster, IEEE IC, 6(1), 2002 figures 14-23 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Gnutella Communications …but costs communications figure 14-24 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Gnutella Attacks vs. Failures no real difference between attacks and failures… …but are assumptions still valid? figures 14-29 & 14-26 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Gnutella - Power Law? figures 5&6 from Ripeanu, Iamnitchi & Foster, IEEE IC, 6(1), 2002

Free Riding in Gnutella 73% of Gnutella users share < 10 files! Gnutella has no mechanism to describe or remember “good” hosts free riders increase the path length w/o adding content free riders increase the communications requirements figure 1 from Adar & Huberman, http://www.firstmonday.dk/issues/issue5_10/adar/

Gnutella Scalability cf. figures 14-21 & 14-22 figure 14-31 from Ch. 14 & http://www.doc.ic.ac.uk/~twh1/academic/papers/oreilly.pdf

Super-peers All peers are equal, but some peers are more equal than others Beverly Yang, Hector Garcia-Molina, "Designing a Super-peer Network." In Proceedings of the 19th International Conference on Data Engineering (ICDE), Bangalore, India, March 2003. http://www-db.stanford.edu/~byang/pubs/superpeer.pdf definitions pure P2P: Gnutella, Freenet, Free Haven hybrid P2P: Napster super-peer: a node that acts as a centralized server (hybrid system) to a subset of clients super-peers are connected, and communication between them occurs as a pure P2P for downloads, peer and super-peer distinctions don’t matter

Super-Peer Network

Super-Peer Motivation very heterogeneous capability in P2P systems http://citeseer.nj.nec.com/saroiu02measurement.html combine pure & hybrid capabilities build hierarchical structures? super-super-peer? super-super-super-peer? Super-peer examples: http://www.kazaa.com/ http://www.morpheus.com/ http://www.grokster.com/

Gnutella Reflectors instead of forwarding queries, a Gnutella reflector answers them from its cache resources: http://www.openp2p.com/pub/a/p2p/2001/03/22/truelove.html?page=4 http://web.archive.org/web/20010616055844/http://www.clip2.com/reflector.html also: gnuTellavision http://www.sims.berkeley.edu/~rachna/gtv/ figure from http://web.archive.org/web/20010616055844/http://www.clip2.com/reflector.html

Gnutella Maps from: http://www.cybergeography.org/atlas/more_topology.html; cf. figure 8-3 (p. 109)

Super-Peer Definitions from Yang & Garcia-Molina, 2003 load = f(incoming bandwidth, outgoing bandwidth, processing power) individual load = load of 1 node aggregate load = load of all nodes in system k-redundancy = a super-peer has k backups

Super-Peer Rules of Thumb from Yang & Garcia-Molina, 2003 Increasing cluster size decreases aggregate load, but increases individual load as the cluster size grows, super-peers work harder Super-peer redundancy is good additional complexity, but yields “good aggregate load of a large cluster size, and the good individual load of a smaller cluster size -- in addition to increased reliability” Maximize outdegree of super-peers decreases expected # of hops, provided that all super-peers take on more clients Minimize TTL TTL can be tuned: start high, look at how far away your results are, back down to that…

Meta Services in Dienst Haven’t we been here before… Meta Services in Dienst Master Meta Server Region B Region A Regional Meta Server Regional Meta Server Merged Index Server Merged Index Server Standard Dienst Merged Index Server Backup Server Standard Dienst Standard Dienst Central Lite Site Standard Dienst Lite Site Lite Site