© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Self-Organized Construction of Distributed Access Structures: A Comparative Evaluation of P-Grid and FreeNet Karl Aberer, Manfred Hauswirth and Magdalena Punceva
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis P2P approach Goal of a P2P system: to support a global search functionality without using central directories. Two fundamental approaches exist: Unstructured P2P systems -> Gnutella Structured P2P systems -> FreeNet, Chord, CAN, Pastry, Tapestry, P-Grid
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Unstructured P2P Systems Search Broadcasting mechanism Advantages Peers are free to choose data they store Search predicates are not limitedProblems High search cost in terms of excessive bandwidth consumption-> do not scale well! High search cost in terms of excessive bandwidth consumption-> do not scale well! No guarantees on search success No guarantees on search success
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Structured P2P Systems Search Distributed scalable access structure-> efficient Coordination is required to establish a distributed access structure. Two fundamentally different approaches on how coordination can be achieved: global identification scheme to decide which part of the search space a peer is associated with the part of the search space is determined by bilateral interactions among peers (P-Grid and FreeNet)
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis P-Grid
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Random Walk-> P-Grid
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Random Walk-> P-Grid : 2 1: 7 K=1 K=0
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Random Walk-> P-Grid : 2 1: 7 K=1 K=0
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Random Walk-> P-Grid : 2 1: 7 K=1 K=0 0: 4 1: 8 K=1 K=0
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Random Walk-> P-Grid : 2 1: 7 K=1 K=0 0: 4 1: 8 K=1 K=0
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Random Walk-> P-Grid : 2 K=1 K=01 0: 4 K=1 K=00 1: 8 01: 2 1: 7 00: 4
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis FreeNet 000: 2 101: 3 Query=
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis FreeNet 000: 2 101: 3 001: 4 111: 5 011:6 110:1 Failed
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis FreeNet 000: 2 101: 3 001: 4 111: 1 011:6 110: : 2 001: 5 011:7 001: 2 001: 6
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis FreeNet 000: 2 101: 3 001: 4 111: 1 011:6 110:5 101: 2 001: 5 011:7 001: 2 001:
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis FreeNet 000: 2 101: 3 001: 4 111: 1 011:6 110:5 101: 2 001: 5 011:7 000: 7 001: 2 001: 6 QueryReply
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Simulations Goal Common framework for FreeNet and P-Grid in order to test them under comparable and varying conditions. Common parameters: Population of n peers. Each peer is identified with a unique ID number Initial topology for communication Each peer has a routing table where peers keep information about the adresses of other peers together with information relevant for routing d data objects are stored in the system and peers have a data store of size s. A necessary condition is r*d<s*n where r is the replication factor.
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Simulations: Experiment 1 Settings: total number of peers is 1000 each peer has at least 3 and at most 6 initial neighbors total number of inserted data objects 5000 replication factor is 20 maximal size of routing tables: 250 for FreeNet and only 35 for P-Grid total number of queries
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Experiment 1 FreeNet: Averaged number of messages per query over 100 trialsP-Grid: Averaged number of messages per query over 100 trials Results Two phases: bootstrapping phase stable phase: average number of messages per query is 4.58 for FreeNet and 4.54 for P-Grid
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Simulations: Experiment 2 Changing the population‘s size total number of peers is 500 each peer has at least 3 and at most 6 initial neighbors total number of inserted data objects 2500 replication factor is 20 maximal size of routing tables: 125 for FreeNet and only 30 for P-Grid total number of queries
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Experiment 2 P-Grid: Averaged number of messages per query over 100 trialsFreeNet: Averaged number of messages per query over 100 trials Results Two phases: stable phase: average number of messages per query is 4.63 for FreeNet and 4.04 for P-Grid
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Simulations: Experiment 3 Impact of changing the routing table sizes in FreeNet a)total number of peers is 1000 maximal size of routing tables is 35 total number of queries b)total number of peers is 500 maximal size of routing tables is 30 total number of queries All other parameters are as in Experiment 1 and 2
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Experiment 3 FreeNet: 500 peersFreeNet: 1000 peers Results 1000 peers: average number of messages per query is 154 and the success rate is 70% 500 peers: average number of messages per query is 109 and the success rate is 80%
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Simulations: Experiment 4 Impact of changing the replication factor for both total number of peers is 500 replication factor is 10 maximal sizes of routing tables are 125 for FreeNet and 35 for P-Grid total number of queries is 75000
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Experiment 4 P-GridFreeNet Results Average number of messages is 5.91 for FreeNet and 4.62 for P-Grid
© 2002, Magdalena Punceva, EPFL-IC, Laboratoire de systèmes d'informations répartis Conclusions Similar performance in terms of messages per query, however P-Grid requires much less resources in terms of size of routing tables FreeNet‘s performances deteriorates significantly if sizes of routing tables are relatively small compared to the total population Changing the routing tables sizes and replication factor affect FreeNet performance while P-Grid retains more stable behavior