Churn and Selfishness: Two Peer-to-Peer Computing Challenges Invited Talk University of California, Berkeley 380 Soda Hall March, 2006 Stefan Schmid Distributed.

Slides:

Advertisements

Similar presentations

Price Of Anarchy: Routing

Advertisements

Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.

Scalable Content-Addressable Network Lintao Liu

Regret Minimization and the Price of Total Anarchy Paper by A. Blum, M. Hajiaghayi, K. Ligett, A.Roth Presented by Michael Wunder.

A Blueprint for Constructing Peer-to-Peer Systems Robust to Dynamic Worst-Case Joins and Leaves Fabian Kuhn, Microsoft Research, Silicon Valley Stefan.

Fabian Kuhn, Microsoft Research, Silicon Valley

Structuring Unstructured Peer-to-Peer Networks Stefan Schmid Roger Wattenhofer Distributed Computing Group HiPC 2007 Goa, India.

1 Algorithmic Game Theoretic Perspectives in Networking Dr. Liane Lewin-Eytan.

Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao, Christine Lv., Edith Cohen, Kai Li and Scott Shenker ICS 2002.

Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.

Information Dissemination in Highly Dynamic Graphs Regina O’Dell Roger Wattenhofer.

Ad-Hoc Networks Beyond Unit Disk Graphs

CPSC 689: Discrete Algorithms for Mobile and Wireless Systems Spring 2009 Prof. Jennifer Welch.

Routing, Anycast, and Multicast for Mesh and Sensor Networks Roland Flury Roger Wattenhofer RAM Distributed Computing Group.

Dynamic Internet Congestion with Bursts Stefan Schmid Roger Wattenhofer Distributed Computing Group, ETH Zurich 13th International Conference On High Performance.

On the Topologies Formed by Selfish Peers Thomas Moscibroda Stefan Schmid Roger Wattenhofer IPTPS 2006 Santa Barbara, California, USA.

CPSC 689: Discrete Algorithms for Mobile and Wireless Systems Spring 2009 Prof. Jennifer Welch.

Distributed Computing Group On the Impact of Malicious Players in Distributed Systems Stefan Schmid DYNAMO’07: 1st Workshop on Dynamic Networks (Salerno,

Economics of P2P Computing Stefan Schmid Distributed Computing Group.

Taming Dynamic and Selfish Peers “Peer-to-Peer Systems and Applications” Dagstuhl Seminar March 26th-29th, 2006 Stefan Schmid Distributed Computing Group.

Beyond selfish routing: Network Formation Games. Network Formation Games NFGs model the various ways in which selfish agents might create/use networks.

Parallel Routing Bruce, Chiu-Wing Sham. Overview Background Routing in parallel computers Routing in hypercube network –Bit-fixing routing algorithm –Randomized.

Distributed Computing Group A Self-Repairing Peer-to-Peer System Resilient to Dynamic Adversarial Churn Fabian Kuhn Stefan Schmid Roger Wattenhofer IPTPS.

Dynamic Hypercube Topology Stefan Schmid URAW 2005 Upper Rhine Algorithms Workshop University of Tübingen, Germany.

Dynamics, Non-Cooperation, and Other Algorithmic Challenges in Peer-to-Peer Computing Stefan Schmid Distributed Computing Group Oberseminar TU München,

Building Low-Diameter P2P Networks Eli Upfal Department of Computer Science Brown University Joint work with Gopal Pandurangan and Prabhakar Raghavan.

Dynamics, Non-Cooperation, and Other Algorithmic Challenges in Peer-to-Peer Computing Stefan Schmid Distributed Computing Group Visit at Los Alamos National.

A TCP With Guaranteed Performance in Networks with Dynamic Congestion and Random Wireless Losses Stefan Schmid, ETH Zurich Roger Wattenhofer, ETH Zurich.

Aggregating Information in Peer-to-Peer Systems for Improved Join and Leave Distributed Computing Group Keno Albrecht Ruedi Arnold Michael Gähwiler Roger.

Dept. of Computer Science Distributed Computing Group Asymptotically Optimal Mobile Ad-Hoc Routing Fabian Kuhn Roger Wattenhofer Aaron Zollinger.

Algorithmic Models for Sensor Networks Stefan Schmid and Roger Wattenhofer WPDRTS, Island of Rhodes, Greece, 2006.

Localized Self- healing using Expanders Gopal Pandurangan Nanyang Technological University, Singapore Amitabh Trehan Technion - Israel Institute of Technology,

Distributed Combinatorial Optimization

Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:

Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:

On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.

1 Koorde: A Simple Degree Optimal DHT Frans Kaashoek, David Karger MIT Brought to you by the IRIS project.

Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.

COCONET: Co-Operative Cache driven Overlay NETwork for p2p VoD streaming Abhishek Bhattacharya, Zhenyu Yang & Deng Pan.

Developing Analytical Framework to Measure Robustness of Peer-to-Peer Networks Niloy Ganguly.

Network Aware Resource Allocation in Distributed Clouds.

Impact of Neighbor Selection on Performance and Resilience of Structured P2P Networks IPTPS Feb. 25, 2005 Byung-Gon Chun, Ben Y. Zhao, and John Kubiatowicz.

Beyond Routing Games: Network (Formation) Games. Network Games (NG) NG model the various ways in which selfish users (i.e., players) strategically interact.

1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.

Using the Small-World Model to Improve Freenet Performance Hui Zhang Ashish Goel Ramesh Govindan USC.

CS4231 Parallel and Distributed Algorithms AY 2006/2007 Semester 2 Lecture 10 Instructor: Haifeng YU.

Inoculation Strategies for Victims of Viruses and the Sum-of-Squares Partition Problem Kevin Chang Joint work with James Aspnes and Aleksandr Yampolskiy.

Hot Topics in Peer-to-Peer Computing (HOT-P2P 2004) Volendam 08 October 2004 Non-uniform deterministic routing on F-Chord(  ) Gennaro Cordasco, Luisa.

CCAN: Cache-based CAN Using the Small World Model Shanghai Jiaotong University Internet Computing R&D Center.

A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.

On a Network Creation Game PoA Seminar Presenting: Oren Gilon Based on an article by Fabrikant et al 1.

PODC Distributed Computation of the Mode Fabian Kuhn Thomas Locher ETH Zurich, Switzerland Stefan Schmid TU Munich, Germany TexPoint fonts used in.

Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish agents strategically interact in using a network They.

Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish users (i.e., players) strategically interact in using.

1 30 November 2006 An Efficient Nearest Neighbor (NN) Algorithm for Peer-to-Peer (P2P) Settings Ahmed Sabbir Arif Graduate Student, York University.

Towards a Scalable and Robust DHT Baruch Awerbuch Johns Hopkins University Christian Scheideler Technical University of Munich.

Distributed Error- Confinement Shay Kutten (Technion) with Boaz Patt-Shamir (Tel Aviv U.) Yossi Azar (Tel Aviv U.)

Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.

Introduction Wireless Ad-Hoc Network  Set of transceivers communicating by radio.

Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.

Chapter 15 Running Time Analysis. Topics Orders of Magnitude and Big-Oh Notation Running Time Analysis of Algorithms –Counting Statements –Evaluating.

1 HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Christian Schindelhauer Algorithms for Radio Networks Winter Term 2005/2006.

Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications * CS587x Lecture Department of Computer Science Iowa State University *I. Stoica,

Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.

Data Center Network Architectures

Impact of Neighbor Selection on Performance and Resilience of Structured P2P Networks Sushma Maramreddy.

SKIP GRAPHS James Aspnes Gauri Shah SODA 2003.

DHT Routing Geometries and Chord

Introduction Wireless Ad-Hoc Network

Network Formation Games

Presentation transcript:

Churn and Selfishness: Two Peer-to-Peer Computing Challenges Invited Talk University of California, Berkeley 380 Soda Hall March, 2006 Stefan Schmid Distributed Computing Group ETH Zurich, Switzerland

Stefan UC Berkeley, Outline of this Talk Current research of our group at ETH –Based on our papers at IPTPS 2005 and IPTPS 2006 Two challenges related to P2P topologies CHALLENGE 1: Churn dynamics of P2P systems, i.e., joins and leaves of peers (“churn”) our approach to maintain desirable properties in spite of churn CHALLENGE 2: Selfishness impact of selfish behavior on P2P topologies How bad are topologies formed by selfish peers? Stability of topologies formed by selfish peers?

Stefan UC Berkeley, CHALLENGE 1: Fast and Concurrent Joins and Leaves (“Churn”)

Stefan UC Berkeley, Dynamic Peer-to-Peer Systems Properties compared to centralized client/server approach –Availability –Efficiency –Etc. => Peers may join and leave the network at any time! However, P2P systems are –composed of unreliable desktop machines –under control of individual users

Stefan UC Berkeley, Churn How to maintain desirable properties such as –Connectivity, –Network diameter, –Peer degree? Churn: Permanent joins and leaves

Stefan UC Berkeley, Challenge 1: Churn Motivation for adversarial (worst-case) churn Components of our system Assembling the components Results and Conclusion

Stefan UC Berkeley, Motivation Why permanent churn? Saroiu et al.: „A Measurement Study of P2P File Sharing Systems“ Peers join system for one hour on average Hundreds of changes per second with millions of peers in the system! Why adversarial (worst-case) churn? E.g., a crawler takes down neighboring machines (attacks weakest part) rather than randomly chosen peers!

Stefan UC Berkeley, The Adversary Model worst-case faults with an adversary ADV(J,L, ) ADV(J,L, ) has complete visibility of the entire state of the system May add at most J and remove at most L peers in any time period of length Note: Adversary is not Byzantine!

Stefan UC Berkeley, Synchronous Model Our system is synchronous, i.e., our algorithms run in rounds –One round: receive messages, local computation, send messages However: Real distributed systems are asynchronous! - Algorithms can still be used: local synchronizers Notion of time necessary to bound the adversary - E.g. 1 round = max. RTT

Stefan UC Berkeley, A First Approach What if number of peers is not 2 i ? How to prevent degeneration? Where to store data? Idea: Simulate the hypercube! Fault-tolerant hypercube?

Stefan UC Berkeley, Simulated Hypercube System Basic components: Simulation: Node consists of several peers! Such a hypercube can be maintained against ADV(J,L, )! Route peers to sparse areas Adapt dimension Token distribution algorithm! Information aggregation algorithm!

Stefan UC Berkeley, Components: Peer Distribution and Information Aggregation Peer Distribution Goal: Distribute peers evenly among all hypercube nodes in order to balance biased adversarial churn Basically a token distribution problem Counting the total number of peers (information aggregation) Goal: Estimate the total number of peers in the system and adapt the dimension accordingly Tackled next!

Stefan UC Berkeley, Peer Distribution (1) Algorithm: Cycle over dimensions and balance! Perfectly balanced after d steps!

Stefan UC Berkeley, Peer Distribution (2) But peers are not fractional! And an adversary inserts at most J and removes at most L peers per step! Theorem 1: Given adversary ADV(J,L,1), discrepancy never exceeds 2J+2L+d!

Stefan UC Berkeley, Components: Peer Distribution and Information Aggregation Peer Distribution Goal: Distribute peers evenly among all hypercube nodes in order to balance biased adversarial churn Basically a token distribution problem Counting the total number of peers (information aggregation) Goal: Estimate the total number of peers in the system and adapt the dimension accordingly Tackled next!

Stefan UC Berkeley, Information Aggregation (1) Goal: Provide the same (and good!) estimation of the total number of peers presently in the system to all nodes –Thresholds for expansion and reduction Means: Exploit again the recursive structure of the hypercube!

Stefan UC Berkeley, Information Aggregation (2) Algorithm: Count peers in every sub-cube by exchange with corresponding neighbor! Correct number after d steps!

Stefan UC Berkeley, Information Aggregation (3) But again, we have a concurrent adversary! Solution: Pipelined execution! Theorem 2: The information aggregation algorithm yields the same estimation to all nodes. Moreover, this number represents the correct state of the system d steps ago!

Stefan UC Berkeley, Composing the Components Our system permanently runs –Peer distribution algorithm to balance biased churn –Information aggregation algorithm to estimate total number of peers and change dimension accordingly But: How are peers connected inside a node, and how are the edges of the hypercube represented? And: Where is the data of the DHT stored?

Stefan UC Berkeley, Distributed Hash Table Hash function determines node where data item is replicated Problem: Peer which has to move to another node must replace all data items. Idea: Divide peers of a node into core and periphery –Core peers store data, –Peripheral peers are used for peer distribution

Stefan UC Berkeley, Intra- and Interconnections Peers inside a node are completely connected. Peers are connected to all core peers of all neighboring nodes. –May be improved: Lower peer degree by using a matching.

Stefan UC Berkeley, Maintenance Algorithm Maintenance algorithm runs in phases –Phase = 6 rounds In phase i: –Snapshot of the state of the system in round 1 –One exchange to estimate number of peers in sub-cubes (information aggregation) –Balances tokens in dimension i mod d –Dimension change if necessary All based on the snapshot made in round 1, ignoring the changes that have happened in-between!

Stefan UC Berkeley, Results Given an adversary ADV(d+1,d+1,  )... => Peer discrepancy at most 5d+4 (Theorem 1) => Total number of peers with delay d (Theorem 2)... we have, in spite of ADV(O(log n), O(log n), 1): –always at least one core peer per node (no data lost!), –peer degree O(log n) (asymptotically optimal!), –network diameter O(log n).

Stefan UC Berkeley, Discussion Simulated topology: A simple blueprint for dynamic P2P systems! –Requires algorithms for token distribution and information aggregation on the topology. –Straight-forward for skip graphs –Also possible for pancake graphs! ( Diameter = Degree = O(log n / loglog n) ) A lot of future work! –A first step only: dynamics of P2P systems offer many research chellenges! –E.g.: Other dynamics models, self- stabilization after larger changes, etc.! –E.g.: Selfishness => see CHALLENGE 2 –E.g.: also measurment studies are subject to current research: Churn in file sharing systems? Churn in Skype? (=> IPTPS 2006)

Stefan UC Berkeley, eQuus: An Alternative Approach with Low Stretch (1) eQuus –Optimized for random joins/leavs rather than worst-cae –Hypercube too restrictive –Token distribution is expensive –Adding locality awareness! “Simulated Chord” –Local split and merge only –According to constant thresholds –Split operation according to latencies!

Stefan UC Berkeley, eQuus: An Alternative Approach with Low Stretch (2) Split and merge happen seldom –If joins and leave uniformly distributed: balls-into-bins –Small stretches if nodes are uniformly distributed (= roughly direct paths used)

Stefan UC Berkeley, CHALLENGE 2: Selfish Peers

Stefan UC Berkeley, Challenge 1 -> Challenge 2 Simulated hypercube topology is fine… … if peers act according to protocol! However, in practice, peers can perform selfishly!

Stefan UC Berkeley, Motivation –CPU Cycles –Memory –Bandwidth –…–… Power of Peer-to-Peer Computing = Accumulation of Resources of Individual Peers Collaboration is of peers is vital! However, many free riders in practice!

Stefan UC Berkeley, Motivation Free riding –Downloading without uploading –Using storage of other peers without contributing own disk space –Etc. In this talk: selfish neighbor selection in unstructured P2P systems Goals of selfish peer: (1)Maintain links only to a few neighbors (small out-degree) (2)Small latencies to all other peers in the system (fast lookups) What is the impact on the P2P topologies?

Stefan UC Berkeley, Challenge 2: Road-Map Problem statement Game-theoretic tools How good / bad are topologies formed by selfish peers? Stability of topologies formed by selfish peers Conclusion

Stefan UC Berkeley, Problem Statement (1) n peers {  0, …,  n-1 } distributed in a metric space –Metric space defines distances between peers –triangle inequality, etc. –E.g., Euclidean plane Metric Space

Stefan UC Berkeley, Problem Statement (2) Each peer can choose… –to which –and how many –… other peers its connects ii Yields a directed graph G

Stefan UC Berkeley, Problem Statement (3) Goal of a selfish peer: (1)Maintain a small number of neighbors only (out-degree) (2)Small stretches to all other peers in the system - Only little memory used - Small maintenance overhead –Fast lookups! –Shortest distance using edges of peers in G… –… divided by shortest direct distance

Stefan UC Berkeley, Cost of a peer: –Number of neighbors (out-degree) times a parameter  –plus stretches to all other peers –  captures the trade-off between link and stretch cost cost i =  outdeg i +  i  j stretch G (  i,  j ) Problem Statement (4) Goal of a peer: Minimize its cost!

Stefan UC Berkeley, Challenge 2: Road-Map Problem statement Game-theoretic tools How good / bad are topologies formed by selfish peers? Stability of topologies formed by selfish peers Conclusion

Stefan UC Berkeley, Game-theoretic Tools (1) Social Cost –Sum of costs of all individual peers: Cost =  i cost i =  i (  outdeg i +  i  j stretch G (  i,  j )) Social Optimum OPT –Topology with minimal social cost of a given problem instance –=> “topology formed by collaborating peers”! What topologies do selfish peers form? => Concepts of Nash equilibrium and Price of Anarchy

Stefan UC Berkeley, Game-theoretic Tools (2) Nash equilibrium –“Result” of selfish behavior => “topology formed by selfish peers” –Topology in which no peer can reduce its costs by changing its neighbor set –In the following, let NASH be social cost of worst equilibrium Price of Anarchy –Captures the impact of selfish behavior by comparison with optimal solution –Formally: social costs of worst Nash equilibrium divided by optimal social cost PoA = max I {NASH(I) / OPT(I)}

Stefan UC Berkeley, Challenge 2: Road-Map Problem statement Game-theoretic tools How good / bad are topologies formed by selfish peers? Stability of topologies formed by selfish peers Conclusion

Stefan UC Berkeley, Analysis: Social Optimum For connectivity, at least n links are necessary –=> OPT ¸  n Each peer has at least stretch 1 to all other peers –=> OPT ¸ n ¢ (n-1) ¢ 1 =  (n 2 ) Theorem: Optimal social costs are at least OPT 2  n + n 2 )

Stefan UC Berkeley, Analysis: Social Cost of Nash Equilibria In any Nash equilibrium, no stretch exceeds  +1 –Otherwise, it’s worth connecting to the corresponding peer –Holds for any metric space! A peer can connect to at most n-1 other peers Thus: cost i ·  O(n) + (  O(n) => social cost Cost 2 O(  n 2 ) Theorem: In any metric space, NASH 2  n 2 )

Stefan UC Berkeley, Analysis: Price of Anarchy (Upper Bound) Since OPT =  (  n + n 2 )... … and since NASH = O(  n 2 ), we have the following upper bound for the price of anarchy: Theorem: In any metric space, PoA 2  (min{ , n}).

Stefan UC Berkeley, Analysis: Price of Anarchy (Lower Bound) (1) Price of anarchy is tight, i.e., it also holds that Theorem: The price of anarchy is PoA 2  min{ ,n}) This is already true in a 1-dimensional Euclidean space: 11 22 33 44 55  i-1 ii  i+1 nn ½  ½  2 33 ½  4 ½  i-2  i-1 ½i½i ½  n-1 … … … … Peer: Position:

Stefan UC Berkeley, Price of Anarchy: Lower Bound (2) 11 22 33 44 55  i-1 ii  i+1 nn ½  ½  2 33 ½  4 ½  i-2  i-1 ½i½i ½  n-1 … … … … Peer: Position: Note: Social optimum is at most O(  n + n 2 ): To prove: (1) “is a selfish topology” = instance forms a Nash equilibrium (2) “has large costs compared to OPT” = the social cost of this instance is  (  n 2 )

Stefan UC Berkeley, Price of Anarchy: Lower Bound (3) Proof Sketch: Nash? –Even peers: For connectivity, at least one link to a peer on the left is needed With this link, all peers on the left can be reached with an optimal stretch 1 No link to the right can reduce the stretch costs to other peers by more than  –Odd peers: For connectivity, at least one link to a peer on the left is needed With this link, all peers on the left can be reached with an optimal stretch 1 Moreover, it can be shown that all alternative or additional links to the right entail larger costs ½  ½  2 33 ½  4 … … … 6 55

Stefan UC Berkeley, Price of Anarchy: Lower Bound (4) Idea why social cost are  (  n 2 ):  (n 2 ) stretches of size  (  ) The stretches from all odd peers i to a even peers j>i have stretch >  / ½  ½  2 33 ½  4 … … … And also the stretches between even peer i and even peer j>i are >  /2

Stefan UC Berkeley, Price of Anarchy PoA can grow linearly in the total number of peers Theorem: The price of anarchy is PoA 2  min{ ,n}) PoA can grow linearly in the relative importance of degree costs 

Stefan UC Berkeley, Challenge 2: Road-Map Problem statement Game-theoretic tools How good / bad are topologies formed by selfish peers? Stability of topologies formed by selfish peers Conclusion

Stefan UC Berkeley, Stability (1) Peers change their neighbors to improve their individual costs. Theorem: Even in the absence of churn, peer mobility or other sources of dynamism, the system may never stabilize (i.e., P2P system never reaches a pure Nash equilibrium)! How long thus it take until no peer has an incentive to change its neighbors anymore?

Stefan UC Berkeley, Stability (2) Example for  =0.6 Euclidean plane: 11 22 aa bb cc 2-2  1 2+   …arbitrary small number

Stefan UC Berkeley, Stability (3) 11 22 aa bb cc Example sequence: Again initial situation => Changes repeat forever! Generally, it can be shown that there is no set of links for this instance where no peer has an incentive to change.

Stefan UC Berkeley, Stability (4) So far: no Nash equilibrium for  =0.6 11 But example can be extended for  of all magnitudes: - Replace single peers by group of k=n/5 very close peers on a line - No pure Nash equilibrium for  =0.6k 22 cc bb aa k

Stefan UC Berkeley, Challenge 2: Road-Map Problem statement Game-theoretic tools How good / bad are topologies formed by selfish peers? Stability of topologies formed by selfish peers Conclusion

Stefan UC Berkeley, Conclusion Unstructured topologies created by selfish peers Efficiency of topology deteriorates linearly in the relative importance of links compared to stretch costs, and in the number of peers Instable even in static environments Future Work: - Complexity of stability? NP-hard! - Routing or congestion aspects? - Other forms of selfish behavior? - More local view of peers? - Mechanism design?

Stefan UC Berkeley, Churn and Selfishness: Two P2P Challenges Thank you for your attention! Further reading: 1.“A Self-repairing Peer-to-Peer System Resilient to Dynamic Adversarial Churn”, Kuhn, Schmid, Wattenhofer; Ithaca, New York,USA, IPTPS “On the Topologies Formed by Selfish Peers”, Moscibroda, Schmid, Wattenhofer; Santa Barbara, California, USA, IPTPS “eQuus – A Provably Robust and Efficient Peer-to-Peer System”, Locher, Schmid, Wattenhofer; submitted. Questions? Comments?