Distributed Slicing in Dynamic Systems A. Fernández, V. Gramoli, E. Jiménez, A-M. Kermarrec, M. Raynal.

Slides:



Advertisements
Similar presentations
Sliver: A fast distributed slicing algorithm Ymir Vigfusson Cornell University Ymir Vigfusson Cornell University Vincent Gramoli EPFL & UniNE Switzerland.
Advertisements

CS3771 Today: deadlock detection and election algorithms  Previous class Event ordering in distributed systems Various approaches for Mutual Exclusion.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Scalable Content-Addressable Network Lintao Liu
20/10/2006ALPAGE1 Ordered slicing of very large scale overlay networks Mark Jelasity University of Bologna, Italy Anne-Marie Kermarrec INRIA Rennes/IRISA,
Distributed Shared Memory, Related Issues, and New Challenges in Large-Scale Dynamic Systems Vincent Gramoli 1.
Cognitive Publish/Subscribe for Heterogeneous Clouds Šarūnas Girdzijauskas, Swedish Institute of Computer Science (SICS) Joint work with:
Distributed Slicing in Dynamic Systems A. Fernández, V. Gramoli, E. Jiménez, A-M. Kermarrec, M. Raynal.
Gossip Algorithms and Implementing a Cluster/Grid Information service MsSys Course Amar Lior and Barak Amnon.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Schenker Presented by Greg Nims.
Lecture 4: Elections, Reset Anish Arora CSE 763 Notes include material from Dr. Jeff Brumfield.
PZ13B Programming Language design and Implementation -4th Edition Copyright©Prentice Hall, PZ13B - Client server computing Programming Language.
Thomas ZahnCST1 Seminar: Information Management in the Web Query Processing Over Peer- to-Peer Data Sharing Systems (UC Santa Barbara)
Small-world Overlay P2P Network
Scribe: A Large-Scale and Decentralized Application-Level Multicast Infrastructure Miguel Castro, Peter Druschel, Anne-Marie Kermarrec, and Antony L. T.
P2p, Spring 05 1 Topics in Database Systems: Data Management in Peer-to-Peer Systems March 29, 2005.
1 Amortized Analysis Consider an algorithm in which an operation is executed repeatedly with the property that its running time fluctuates throughout the.
Advanced Topics in Algorithms and Data Structures An overview of the lecture 2 Models of parallel computation Characteristics of SIMD models Design issue.
© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.
On the Construction of Energy- Efficient Broadcast Tree with Hitch-hiking in Wireless Networks Source: 2004 International Performance Computing and Communications.
A Trust Based Assess Control Framework for P2P File-Sharing System Speaker : Jia-Hui Huang Adviser : Kai-Wei Ke Date : 2004 / 3 / 15.
Dynamic Hypercube Topology Stefan Schmid URAW 2005 Upper Rhine Algorithms Workshop University of Tübingen, Germany.
Building Low-Diameter P2P Networks Eli Upfal Department of Computer Science Brown University Joint work with Gopal Pandurangan and Prabhakar Raghavan.
Evaluating Hypotheses
A Hierarchical Energy-Efficient Framework for Data Aggregation in Wireless Sensor Networks IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 55, NO. 3, MAY.
Timed Quorum Systems … for large-scale and dynamic environments Vincent Gramoli, Michel Raynal.
Models of Parallel Computation Advanced Algorithms & Data Structures Lecture Theme 12 Prof. Dr. Th. Ottmann Summer Semester 2006.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
A Local Facility Location Algorithm Supervisor: Assaf Schuster Denis Krivitski Technion – Israel Institute of Technology.
On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.
Correctness of Gossip-Based Membership under Message Loss Maxim Gurevich, Idit Keidar Technion.
Peer-to-peer file-sharing over mobile ad hoc networks Gang Ding and Bharat Bhargava Department of Computer Sciences Purdue University Pervasive Computing.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Algorithms for Self-Organization and Adaptive Service Placement in Dynamic Distributed Systems Artur Andrzejak, Sven Graupner,Vadim Kotov, Holger Trinks.
1 Napster & Gnutella An Overview. 2 About Napster Distributed application allowing users to search and exchange MP3 files. Written by Shawn Fanning in.
COCONET: Co-Operative Cache driven Overlay NETwork for p2p VoD streaming Abhishek Bhattacharya, Zhenyu Yang & Deng Pan.
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
1 IEEE Trans. on Smart Grid, 3(1), pp , Optimal Power Allocation Under Communication Network Externalities --M.G. Kallitsis, G. Michailidis.
Parallel and Distributed IR. 2 Papers on Parallel and Distributed IR Introduction Paper A: Inverted file partitioning schemes in Multiple Disk Systems.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Routing Indices For P-to-P Systems ICDCS Introduction Search in a P2P system –Mechanisms without an index –Mechanisms with specialized index nodes.
Replication Strategies in Unstructured Peer-to-Peer Networks Edith CohenScott Shenker Some slides are taken from the authors’ original presentation.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
Data Mining Algorithms for Large-Scale Distributed Systems Presenter: Ran Wolff Joint work with Assaf Schuster 2003.
CHAN Siu Lung, Daniel CHAN Wai Kin, Ken CHOW Chin Hung, Victor KOON Ping Yin, Bob SPRINT: A Scalable Parallel Classifier for Data Mining.
Association Rule Mining in Peer-to-Peer Systems Ran Wolff Assaf Shcuster Department of Computer Science Technion I.I.T. Haifa 32000,Isreal.
Dual-Region Location Management for Mobile Ad Hoc Networks Yinan Li, Ing-ray Chen, Ding-chau Wang Presented by Youyou Cao.
By Jonathan Drake.  The Gnutella protocol is simply not scalable  This is due to the flooding approach it currently utilizes  As the nodes increase.
DHT-based unicast for mobile ad hoc networks Thomas Zahn, Jochen Schiller Institute of Computer Science Freie Universitat Berlin 報告 : 羅世豪.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
Network Coordinates : Internet Distance Estimation Jieming ZHU
Minimizing Churn in Distributed Systems P. Brighten Godfrey, Scott Shenker, and Ion Stoica UC Berkeley SIGCOMM’06.
Peer to Peer Network Design Discovery and Routing algorithms
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
Submitted by: Sounak Paul Computer Science & Engineering 4 th Year, 7 th semester Roll No:
CS Machine Learning Instance Based Learning (Adapted from various sources)
Repairing Sensor Network Using Mobile Robots Y. Mei, C. Xian, S. Das, Y. C. Hu and Y. H. Lu Purdue University, West Lafayette ICDCS 2006 Speaker : Shih-Yun.
1 Roie Melamed, Technion AT&T Labs Araneola: A Scalable Reliable Multicast System for Dynamic Wide Area Environments Roie Melamed, Idit Keidar Technion.
Distributed Caching and Adaptive Search in Multilayer P2P Networks Chen Wang, Li Xiao, Yunhao Liu, Pei Zheng The 24th International Conference on Distributed.
Adlib : A Self-Tuning Index for Dynamic Peer-to-Peer Systems Proceedings of the 21st International Conference on Data Engineering(ICDE'05) Prasanna Ganesan.
The Biologically Inspired Distributed File System: An Emergent Thinker Instantiation Presented by Dr. Ying Lu.
CSCE 668 DISTRIBUTED ALGORITHMS AND SYSTEMS
An example of peer-to-peer application
Impact of Neighbor Selection on Performance and Resilience of Structured P2P Networks Sushma Maramreddy.
Controlling the Cost of Reliability in Peer-to-Peer Overlays
Distributed Energy Efficient Clustering (DEEC) Routing Protocol
A Case for Mutual Notification
DATA RETRIEVAL IN ADHOC NETWORKS
Presentation transcript:

Distributed Slicing in Dynamic Systems A. Fernández, V. Gramoli, E. Jiménez, A-M. Kermarrec, M. Raynal

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Context Distributed System  Interconnected set of nodes Heterogeneous environment  Resources are heterogeneously spread Bandwidth Processing Power Storage Space Uptime … Large-scale dynamic environment  Nodes leave and join at any time  Large amount of nodes  No global information 1,5 0,

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Problem What means rich/poor in this context?

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Problem What means rich/poor in this context? 20 Am I rich or poor?

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Problem What means rich/poor in this context? rich?

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Problem What means rich/poor in this context? poor?

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Applications Remedy the Gnutella problem in Peer-to-peer  Gnutella performance was limited by poor nodes  Kazaa/Skype sollicitate rich nodes rather than poor nodes Allocating resources/nodes to services  Streaming service needs nodes with highest bandwidth  Non-critical service can run on unstable nodes  File-sharing service requires nodes with many files  …

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Objectives Classifying nodes into categories, slices  Based on individual characteristics: attributes  A slice corresponds to a portion of the system Typically, answering the question:

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Objectives HOW RICH AM I COMPARED TO OTHERS? Classifying nodes into categories, slices  Based on individual characteristics: attributes  A slice corresponds to a portion of the system Typically, answering the question:

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Classifying the system nodes

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Classifying the system nodes …using their attribute values (assume a single attribute for simplicity reason)

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Classifying the system nodes Attribute values ai

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Classifying the system nodes Attribute values ai Normalized Indices pi

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal #4#3#2#1 Classifying the system nodes Normalized Indices pi 0 1 Slices si Attribute values ai

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Model Dynamic system  System of n nodes  Nodes join and leave the system at any time  Nodes may crash too Each node i knows  its attribute value ai,  its position estimate pi’,  the slices (ex: 10 equally sized slices, each containing 10% of the nodes),  a communication view Vi : constant number of neighbors j, their position estimate pj’, their attribute aj, (and their age)

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Underlying Gossip-based Overlay Each node i periodically:  Exchanges information with its neighbors (nodes of its view) Views, values…  Computes its new view and its new state View, value… Dynamic overlay:  Failed nodes are naturally removed from views  Joining nodes are naturally added into views Scalable overlay:  Limited amount of information stored  Limited amount of information exchanged

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Jelasity, Kermarrec 2006 (JK) Each node i sets pi’ as a uniform random value Each node i looks for a misplaced neighbor j A neighbor j is misplaced with i iff (ai – aj)(pi’ – pj’) < 0 Then, nodes i and j exchange pi’ and pj’ The sequence of position estimates matches the sequence of attribute values For any node couple i and j, ai pi’<pj’ Convergence to this ordering is exponentially fast (in the number of execution)

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 1: Ordering Similar to JK Uses Local Disorder Measure LDM  Let lpj’ (resp. lpj ) be the normalized index of j random value (resp. attribute value) among all j of Vi  i  LDM(i) = ∑ j in Vi  i (lpj’ – lpj)² Protocol Do periodically { Update view Vi using an underlying protocol. Choose the neighbor j that minimizes the LDM(i). Exchange random values pi’ and pj’ Update the random value pi’ w/ pj’ if necessary. Update slice assignment si’ := s : pi’ in s. }

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 1: Ordering Similar to JK Uses Local Disorder Measure LDM  Let lpj’ (resp. lpj ) be the normalized index of j random value (resp. attribute value) among all j of Vi  i  LDM(i) = ∑ j in Vi  i (lpj’ – lpj)² Protocol Do periodically { Update view Vi using an underlying protocol. Choose the neighbor j that minimizes the LDM(i). Exchange random values pi’ and pj’. Update the random value pi’ w/ pj’ if necessary. Update slice assignment si’ := s : pi’ in s. } Difference with JK

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 1: Ordering /11 9/11

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 1: Ordering /11 7/11

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 1: Ordering

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 1: Ordering /11 2/11

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 1: Ordering

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 1: Ordering /11 4/11

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 1: Ordering

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Result: Slight convergence speed up n = 10 4 #slices = 10 |V| = 20

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal If random values are not perfectly uniformly distributed …some nodes might never find their slice  e.g. the 3 nodes of S2 in the example above Problem: Wrong Slice Assignment #4#3#2#1 0 1 Slices Normalized indices

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 2: Ranking No random values! Protocol Do periodically { Update/Shuffle view Vi using an underlying protocol. l += #neighbors with lower attribute value. g += #neighbors Sends ai to a randomly chosen neighbor Sends ai to the neighbor that is the closest to a slice boundary Update slice assignment si’ = s such that l/g in s. } Upon reception { Receive aj from j if ( aj < ai ) l += 1; g +=1 ; Update slice assignment si’ = s such that l/g in s. }

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 2: Ranking No random values! Protocol Do periodically { Update/Shuffle view Vi using an underlying protocol. l += #neighbors with lower attribute value. g += #neighbors Sends ai to a randomly chosen neighbor Sends ai to the neighbor that is the closest to a slice boundary Update slice assignment si’ = s such that l/g in s. } Upon reception { Receive aj from j if ( aj < ai ) l += 1; g +=1 ; Update slice assignment si’ = s such that l/g in s. } Same number of messages

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 2: Ranking /11

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 2: Ranking

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 2: Ranking /2

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 2: Ranking

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 2: Ranking /4

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 2: Ranking

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Algorithm 2: Ranking /3

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Result 1: Unlimited convergence n = 10 4 #slices = 100 |V| = 20

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Result 1: Unlimited convergence Ranking precision keeps improving n = 10 4 #slices = 100 |V| = 20

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Result 2: Tolerating Dynamism Churn is correlated with attribute values! e.g. the attribute is the remaining batery lifetime or available storage space. Churn

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Performance Analysis d, is the distance from p i ’ to the closest slice boundary. For confidence coefficient of 99,99%, the required number of attribute value drawn is m i ≥ z p i ’ (1 – p i ’) / d 2, with z <16, a constant.

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Conclusion Churn-tolerant algorithm Gossip-based mechanisms. Slice belongingness re-approximation. Scalable algorithm Limited number of neighbors. Size of the system is unknown. Applications of the distributed slicing Resource allocation Supernodes / Ultrapeers election Future work Can we obtain convergence speed of the first algorithm with the accuracy of the second algorithm?

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal References Ordered Slicing of Very Large-Scale Overlay Networks M. Jelasity and A.-M. Kermarrec In Proc. of the 6 th IEEE Conference on P2P Computing, Randomized Algorithms R. Motwani, P. Raghavan Cambridge University Press, 1995 Time Bounds for Selection M. Blum, R. Floyd, V. Pratt, R. Rivest, and R. Tarjan Journal Computer and System Sciences 7: , 1972

ICDCS 2007 June Fernandez, Gramoli, Jimenez, Kermarrec, Raynal Result 3: Feasibility