1.Optimizing P2P Networks: Lessons learned from social networking a)Social Networks b)Lessons Learned c)Are P2P Networks Social?? d)Organizing P2P Networks.

Slides:



Advertisements
Similar presentations
Performance in Decentralized Filesharing Networks Theodore Hong Freenet Project.
Advertisements

Peer-to-peer and agent-based computing Scalability in P2P Networks.
1 Grid Computing, 7700 Guest ! Dr Ian Taylor
INF 123 SW ARCH, DIST SYS & INTEROP LECTURE 12 Prof. Crista Lopes.
Clayton Sullivan PEER-TO-PEER NETWORKS. INTRODUCTION What is a Peer-To-Peer Network A Peer Application Overlay Network Network Architecture and System.
P2P Topologies Centralized Ring Hierarchical Decentralized Hybrid.
Peer-to-Peer is Not Always Decentralized …when Centralization is Good Nelson Minar
P2P Topologies CentralizedCentralized RingRing HierarchicalHierarchical DecentralizedDecentralized HybridHybrid.
LightFlood: An Optimal Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
Denial-of-Service Resilience in Peer-to-Peer Systems D. Dumitriu, E. Knightly, A. Kuzmanovic, I. Stoica and W. Zwaenepoel Presenter: Yan Gao.
Expediting Searching Processes via Long Paths in P2P Systems 05/30 IDEA Lab.
An Overview of Peer-to-Peer Networking CPSC 441 (with thanks to Sami Rollins, UCSB)
Peer-to-Peer Networks as a Distribution and Publishing Model Jorn De Boever (june 14, 2007)
Peer-to-Peer Networking By: Peter Diggs Ken Arrant.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Exploiting Content Localities for Efficient Search in P2P Systems Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang 1 1 College of William and Mary,
presented by Hasan SÖZER1 Scalable P2P Search Daniel A. Menascé George Mason University.
1 Client-Server versus P2P  Client-server Computing  Purpose, definition, characteristics  Relationship to the GRID  Research issues  P2P Computing.
Object Naming & Content based Object Search 2/3/2003.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
1 Seminar: Information Management in the Web Gnutella, Freenet and more: an overview of file sharing architectures Thomas Zahn.
Searching in Unstructured Networks Joining Theory with P-P2P.
Improving Data Access in P2P Systems Karl Aberer and Magdalena Punceva Swiss Federal Institute of Technology Manfred Hauswirth and Roman Schmidt Technical.
Peer-to-peer: an overview Selo TE P2P is not a new concept P2P is not a new technology P2P is not a new technology Oct : first transmission.
Review of Free Riding on Gnutella Eytan Adar and Bernardo Huberman Shreeram Sahasrabudhe.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Introduction to Peer-to-Peer Networks. What is a P2P network Uses the vast resource of the machines at the edge of the Internet to build a network that.
P2P File Sharing Systems
INTRODUCTION TO PEER TO PEER NETWORKS Z.M. Joseph CSE 6392 – DB Exploration Spring 2006 CSE, UT Arlington.
Freenet. Anonymity  Napster, Gnutella, Kazaa do not provide anonymity  Users know who they are downloading from  Others know who sent a query  Freenet.
1 Napster & Gnutella An Overview. 2 About Napster Distributed application allowing users to search and exchange MP3 files. Written by Shawn Fanning in.
Introduction Widespread unstructured P2P network
P2P Architecture Case Study: Gnutella Network
1 Reading Report 4 Yin Chen 26 Feb 2004 Reference: Peer-to-Peer Architecture Case Study: Gnutella Network, Matei Ruoeanu, In Int. Conf. on Peer-to-Peer.
Developing Analytical Framework to Measure Robustness of Peer-to-Peer Networks Niloy Ganguly.
Peer Communications. References Many of the slides are from a tutorial “P2P Systems” by Keith Ross and Dan Rubenstein 1.Dejan S. Milojicic, Vana Kalogeraki,
P2P Web Standard IS3734/19/10 Michael Radzin. What is P2P? Peer to Peer Networking (P2P) is a “direct communications initiations session.” Modern uses.
Introduction of P2P systems
Peer-to-Peer Networks University of Jordan. Server/Client Model What?
Jonathan Walpole CSE515 - Distributed Computing Systems 1 Teaching Assistant for CSE515 Rahul Dubey.
Peer-to-Pee Computing HP Technical Report Chin-Yi Tsai.
PSI Peer Search Infrastructure. Introduction What are P2P Networks? The term "peer-to-peer" refers to a class of systems and applications that employ.
Super-peer Network. Motivation: Search in P2P Centralised (Napster) Flooding (Gnutella)  Essentially a breadth-first search using TTLs Distributed Hash.
The Start Shawn Fanning (19-yr-old student nicknamed Napster) developed the original Napster application and service in January 1999 while a freshman.
Peer-to-Peer Network Tzu-Wei Kuo. Outline What is Peer-to-Peer(P2P)? P2P Architecture Applications Advantages and Weaknesses Security Controversy.
FastTrack Network & Applications (KaZaA & Morpheus)
1 Peer-to-Peer Technologies Seminar by: Kunal Goswami (05IT6006) School of Information Technology Guided by: Prof. C.R.Mandal, School of Information Technology.
Peer to Peer A Survey and comparison of peer-to-peer overlay network schemes And so on… Chulhyun Park
Efficient P2P Search by Exploiting Localities in Peer Community and Individual Peers A DISC’04 paper Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang.
Computer Networking P2P. Why P2P? Scaling: system scales with number of clients, by definition Eliminate centralization: Eliminate single point.
ADVANCED COMPUTER NETWORKS Peer-Peer (P2P) Networks 1.
Peer to Peer Network Design Discovery and Routing algorithms
Peer to Peer Computing. What is Peer-to-Peer? A model of communication where every node in the network acts alike. As opposed to the Client-Server model,
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
Peer-to-Peer (P2P) Networks By Bongju Yu. Contents  What is P2P?  Features of P2P systems  P2P Architecture  P2P Protocols  P2P Projects  Reference.
Two Peer-to-Peer Networking Approaches Ken Calvert Net Seminar, 23 October 2001 Note: Many slides “borrowed” from S. Ratnasamy’s Qualifying Exam talk.
INTERNET TECHNOLOGIES Week 10 Peer to Peer Paradigm 1.
P2P Search COP6731 Advanced Database Systems. P2P Computing  Powerful personal computer Share computing resources P2P Computing  Advantages: Shared.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
Topologies and behavioral properties of the network Yvon Kermarrec Based on tml.
Composing Web Services and P2P Infrastructure. PRESENTATION FLOW Related Works Paper Idea Our Project Infrastructure.
Distributed Web Systems Peer-to-Peer Systems Lecturer Department University.
BitTorrent Vs Gnutella.
Peer-to-Peer Data Management
CHAPTER 3 Architectures for Distributed Systems
Unstructured Routing : Gnutella and Freenet
Peer-to-Peer Information Systems Week 6: Performance
Presentation transcript:

1.Optimizing P2P Networks: Lessons learned from social networking a)Social Networks b)Lessons Learned c)Are P2P Networks Social?? d)Organizing P2P Networks 2.Peer Topologies a)Centralized, Ring, Hierarchical & Decentralized b)Hybrid: oCentralized-Ring oCentralized-Centralized oCentralized-Decentralized c)Reflector Nodes 3.Gnutella Case Studies a)3 case studies Scalability 1

“You can’t scale better than by utilising someone else’s computer.” Paul James1 2

Limewire Gnutella Coding 3

Social Networks Stanley Milgram (Harvard professor ) – 1967 social networking experiment How many ‘social hops’ would it take for messages to traverse through the US population (200 million) Posted 160 letters randomly chosen people in Omaha, Nebraska Boston Omaha Asked them to try to pass these letters to a stockbroker working in Boston, Massachusetts Rules: use intermediacies whom they know on a first name basis chosen intelligently make a note at each hop 42 letters made it !! Average of 5.5 hops Demonstrated the ‘small world effect’ Proved that the social network of the United States is indeed connected with a path- length (number of hops) of around 6 – The 6 degrees of separation ! Does this mean that it takes 6 hops to traverse 200 million people?? 4

Lessons Learned from Milgrim’s Experiment Social circles are highly clustered A few members have wide-ranging connections these form a bridge between far-flung social clusters this bridging plays a critical role in bringing the network closer together For example A quarter of all letters passed through a local storekeeper A half were mediated by just 3 people Lessons Learned These people acted as gateways or hubs between the source and the wider world A small number of bridges dramatically reduces the number of hops 5

From Social Networks to Computer Networks… There are a number of similarities to social networks People = peers Intermediaries = Hubs, Gateways or Rendezvous Nodes (JXTA speak...) Number of intermediaries passed through = number of hops Are P2P Networks Special then? P2P networks are more like social networks than other types of computer network because they are often: Self Organizing Ad-Hoc Employ clustering techniques based on prior interactions (like we form relationships) Decentralized discovery and communication (like we form neighbourhoods, villages, cities etc) 6

Problem: how do we organize peers within ad-hoc, multi- hop pervasive P2P networks? network of self-organizing peers organized in a decentralized fashion such networks can rapidly expand from a few hundred peers to several thousand or even millions Peer to Peer: What’s the problem? P2P Environment Recap: Unreliable Environments Peers connecting/disconnecting – network failures to participation Random Failures e.g. power outages, Cable, DSL failure, hackers Personal machines are much more vulnerable than servers algorithms have to cope with this continuous restructuring of the network core. P2P systems need to treat failures as normal occurrences not freak exceptions must be designed in a way that promotes redundancy with the tradeoff of a degradation of performance 7

For P2P This does not mean abstract numerical benchmarks e.g. how many milliseconds will it take to compute this many millions of FFTs? Rather, it means asking question like: How long will it take to retrieve this particular file? How much bandwidth will this query consume? How many hops will it take for my package to get to a peer on the far side of the network? If I add/remove a peer to the network will the network still be fault tolerant? Does the network scale as we add more peers. Such networks can rapidly expand from a few hundred peers to several thousand or even millions So, how do we Organize Networks in Order to Get Optimum Performance? 8

3 main factors that make P2P networks more sensitive to performance issues: Performance Issues in P2P Networks 1.Communication. Fundamental necessity Users connected via different connections speeds Multi-hop 2.Searching No central Control so more effort is needed Each hop adds to total bandwidth – problems: time outs 3.Equal Peers Free Riders – unbalance in the harmonicity of network Degrades performance for others Need to get this right to adjust accordingly 9

Core Centralized Ring Hierarchical Decentralized Hybrid Centralized-Ring Centralized-Centralized Centralized-Decentralized Peer Topologies 10

Centralized Client/server Web servers Databases Napster search Instant Messaging Popular Power 11

Ring Fail-over clusters Simple load balancing Assumption –Single owner 12

Hierarchical Tree structure DNS Usenet (sort of) 13

Decentralized Gnutella Freenet Internet routing 14

Centralized + Ring Robust web applications High availability of servers 15

Centralized + Centralized N-tier apps Database heavy systems Web services gateways Google.com uses this topology to deliver their service 16

Centralized + Decentralized New Wave of P2P Clip2 Gnutella Reflector (next) FastTrack –KaZaA –Morpheus Like Social Networks perhaps ? 17

F1.mp3 – ID0:F1.mp3 … CF1.mp3 F2.mp3 F3.mp Reflector Nodes Known as ‘super peers’ – in JXTA these are Rendezvous peers cache file list of connected users – maintain an index When a query is issued, the Reflector does not retransmit it - it answers the query from its own memory Do they remind you of anything ? 18

Napster Gnutella User Napster.com Gnutella Super Peers: 1. Natural?? 2. Reflector (clip2.com) =? User Napster N2 N3 Napster Duplicated Servers Napster = Gnutella? 19

The figure below is a view of the topology of a Gnutella network as shown on the LimeWire web site, the popular Gnutella file-sharing client. Notice how the power-law or centralized-decentralized structure is demonstrated. The Gnutella Network Today 20

Another View of the Gnutella Network 21

Gnutella Studies 1: Free Riding E. Adar and B.A. Huberman (2000), “Free Riding on Gnutella,” First Monday 5(10), Two types of free riding 1.download files but never provide any files for other to download 2.users that have undesirable content They found 22,084 of the 33,335 peers in the network (66%) of the peers share no files 24,347 or 73% share ten or less files top 1 percent (333 hosts) represent 37 percent of the total files shared 20 percent (6,667 hosts) sharing 98% of the files shows - even without Gnutella Reflector nodes, the Gnutella network naturally converges into a centralized + decentralized topology with the top 20% of nodes acting as super peers or reflectors 22

Gnutella Studies 2: Equal Peers Study on Reflector Nodes [clip] Studied Gnutella for one month Noted an apparent scalability barrier when query rates went above 10 per second. Why?? Gnutella query = 560 bits long and queries make up approximately one quarter of traffic. Each peer is connect to three peers, so: 560 *10 * 3 = 16,800 bytes per second This is a quarter of the traffic so total traffic 67,200 bytes per second. a 56-K link cannot keep up with this amount of traffic one node connected in the incorrect place can grind the whole network to a halt. This is why P2P networks place slower nodes at the edges 23

Gnutella Studies 3: Communication Peer-to-Peer Architecture Case Study: Gnutella Network Matei Ripeanu, on-line at: Studied topology of Gnutella over several months & reported two findings: 1. Gnutella network shares the benefits and drawbacks of a power-law structure - networks that organize themselves so that most nodes have a few links and a small number of nodes have many - found to show an unexpected degree of robustness when facing random node failures. - vulnerable to attacks e.g. by removing a few of the super nodes can have a massive effect on the function of the network as a whole. 2. Gnutella network topology does not match well with the underlying Internet topology leading to inefficient use of network bandwidth. He gave 2 suggestions: 1.use an agent to monitor network and intervene by asking servents to drop/add links to keep the topology optimal. 2.replace the Gnutella flooding mechanism with a smarter routing and group communication mechanism. 24

What about other topologies: The Future? Centralized + Hierarchical? –Back end tree of information –Caching architectures Decentralized + Ring? –P2P network of fail-over clusters More ?? 25

1.Summary a)Centralized + Decentralized – understand from the original Gnutella to the new models b)The role of Reflector nodes 2.Further Information: Distributed Hashtable Models a)Pastry: b)Chord: 26 Closing Remarks