
Overlay Network Construction in Highly Decentralized Networks
Melih Onus, PhD Thesis Defense
April 14, 2009, Arizona State University
Committee: Andrea W. Richa (Chair), Goran Konjevod, Rida Bazzi, Christian Scheideler

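Publish/Subscribe (Pub/Sub)
[Diagram: five nodes attached to a message bus, each labeled with its subscription set]
N1: Subscription(N1) = {B,C,D}
N2: {A,B,C,E}
N3: {A,D}
N4: {A,B,X}
N5: {A,X}
Publish(M1, A): message M1 is published on topic A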

Scalability of Pub/Sub
Most traditional pub/sub systems are geared towards small-scale deployment
– E.g., Isis MDS, TIB, MQSeries, Gryphon
New generation of applications…
– Large data centers: Amazon, Google, Yahoo, eBay, …
– RSS, feed/news readers, online stock trading and banking
– Web 2.0, Second Life
…drive dramatic growth in scale
– 10,000s of nodes, 1000s of topics, Internet-wide distribution
Emerging systems address this trend using P2P techniques

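Overlay-Based Pub/Sub
[Diagram: nodes N1 {B,C,D}, N2 {A,B,C,E}, N3 {A,D}, N4 {A,B,X}, N5 {A,X} linked in an overlay; publication (M1, A) is forwarded via a node labeled Relay]
Example systems: SCRIBE, Corona, Feedtree, Sub-2-Sub, TERA, ...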

Overlay Topologies for Pub/Sub
A “good” overlay allows efficient and simple publication routing
– Small routing tables, low load on relays, low latency
Ideally, the overlay is topic-connected, i.e., each topic-induced subgraph forms a single connected component
– Most existing implementations construct topic-connected overlays

Topic-Connectivity
[Diagram: overlay over N1 {B,C,D}, N2 {A,B,C,E}, N3 {A,D}, N4 {A,B,X}, N5 {A,X}]
Topics B, C, X, E are connected
Topics A and D are disconnected
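The topic-connectivity condition can be checked mechanically. The sketch below is an illustration only: the function name `disconnected_topics` and the edge list are assumptions (the edges drawn on the slide are not preserved in the transcript), chosen so that the result matches the slide's statement that A and D are disconnected.

```python
from collections import defaultdict

def disconnected_topics(interest, edges):
    """Return the topics whose subscribers do NOT form one connected component."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    bad = set()
    for t in set().union(*interest.values()):
        members = {n for n, subs in interest.items() if t in subs}
        start = next(iter(members))
        seen, stack = {start}, [start]
        while stack:
            u = stack.pop()
            # Only follow edges that stay inside the topic-induced subgraph.
            for v in adj[u] & members:
                if v not in seen:
                    seen.add(v)
                    stack.append(v)
        if seen != members:
            bad.add(t)
    return bad

# Subscriptions from the slides.
interest = {
    "N1": {"B", "C", "D"},
    "N2": {"A", "B", "C", "E"},
    "N3": {"A", "D"},
    "N4": {"A", "B", "X"},
    "N5": {"A", "X"},
}
# Hypothetical overlay edges (assumption, not taken from the slide).
edges = [("N1", "N2"), ("N2", "N4"), ("N4", "N5")]
print(sorted(disconnected_topics(interest, edges)))
```

An overlay is topic-connected exactly when this function returns the empty set.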

Topic-Connectivity: Simple Solution
[Diagram: overlay over N1 {B,C,D}, N2 {A,B,C,E}, N3 {A,D}, N4 {A,B,X}, N5 {A,X}]
– Node degree grows linearly with the subscription size
– Roughly twice as big as the subscription size for rings/trees
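To see how the node degree follows the subscription size, one can build a separate ring for every topic and count distinct neighbors. This is a minimal sketch under assumptions: the helper name `ring_per_topic` is hypothetical, and ring members are ordered by node name, which the slides do not specify.

```python
from collections import Counter, defaultdict

def ring_per_topic(interest):
    """Connect the subscribers of each topic in a ring; return the union of edges."""
    subscribers = defaultdict(list)
    for node in sorted(interest):
        for t in interest[node]:
            subscribers[t].append(node)
    edges = set()
    for members in subscribers.values():
        if len(members) < 2:
            continue  # a single subscriber needs no edges
        for i, u in enumerate(members):
            v = members[(i + 1) % len(members)]
            edges.add(frozenset((u, v)))  # frozenset removes duplicate edges
    return edges

interest = {
    "N1": {"B", "C", "D"},
    "N2": {"A", "B", "C", "E"},
    "N3": {"A", "D"},
    "N4": {"A", "B", "X"},
    "N5": {"A", "X"},
}
edges = ring_per_topic(interest)
degree = Counter(n for e in edges for n in e)
print(dict(degree), "max:", max(degree.values()))
```

Each ring adds up to two neighbors per subscribed topic, which is why the degree can approach twice the subscription size when rings share few edges.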

Scalability of the Simple Solution
Negative impact on performance due to
– CPU load: neighbor monitoring, message processing
– Connection maintenance and header overhead
– Memory overhead: per-link state associated with routing and/or compression schemes being used, etc.
⇒ Scalability barrier for large systems offering a wide range of subscription choices
Can we do better?

Outline
– Minimum Maximum Degree Publish-Subscribe Overlay Network Design
– Parameterized Maximum and Average Degrees in Publish-Subscribe Overlay Network Design
– Constant Diameter Publish-Subscribe Overlay Network Design

The MinMax-TCO Problem
Minimum Maximum Degree Topic-Connected Overlay (MinMax-TCO) problem:
– Given a set of nodes V, a set of topics T, and Interest: V × T → {true, false}
– Construct a topic-connected overlay G with the minimum possible maximum degree
TCO (decision version):
– Decide whether there is a topic-connected overlay with maximum degree at most k (for a given k)

GM Algorithm
The GM algorithm can produce an overlay with maximum degree Θ(n), even when an overlay with constant maximum degree exists.

Complexity of MinMax-TCO
Lemma: MinMax-TCO(V, T, Interest, k) ∈ NP
Proof: Topic-connectivity is verifiable in polynomial time
Lemma: MinMax-TCO(V, T, Interest, k) is NP-hard
Proof:
1. Define an auxiliary problem, Single Node TCO (SN-TCO): decide whether there is a topic-connected overlay in which the degree of a single given node is ≤ d
2. Set Cover is polynomially reducible to SN-TCO
3. SN-TCO is polynomially reducible to TCO
Theorem: MinMax-TCO is NP-complete

Approximating MinMax-TCO
The idea: exploit subscription overlaps
– Connecting nodes with overlapping interests improves the connectivity of several topics at once
Overlay Design Algorithm (ODA):
– Start from a singleton connected component for each (v, t) ∈ V × T
– At each iteration: among the edges that increase the maximum degree minimally, add the one that reduces the number of connected components for the largest number of topics
– Stop once there is a single connected component for each topic

Overlay Design Algorithm (example run, six animation frames)
Nodes: N1 {B,C,D}, N2 {A,B,C,E}, N3 {A,D}, N4 {A,B,X}, N5 {A,X}
Number of connected components per topic, at the start and after each added edge:

Topic   Start   Edge 1   Edge 2   Edge 3   Edge 4   Edge 5
A       4       3        3        3        2        1
B       3       2        2        1        1        1
C       2       2        2        1        1        1
D       2       2        1        1        1        1
X       2       2        2        2        1        1
E       1       1        1        1        1        1

⇒ Maximum degree of 2 vs. almost 4 for ring-per-topic!
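The greedy rule behind this example can be sketched in a few lines. This is an illustrative reading of the ODA slide, not the authors' implementation: the function names, the per-topic union-find representation, and the tie-breaking (first edge in sorted node order) are assumptions, so the order in which edges are added may differ from the animation frames.

```python
from itertools import combinations

def find(parent, x):
    """Union-find lookup with path halving."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x

def oda(interest):
    nodes = sorted(interest)
    topics = set().union(*interest.values())
    # One union-find forest per topic, covering that topic's subscribers.
    parent = {t: {n: n for n in nodes if t in interest[n]} for t in topics}
    degree = {n: 0 for n in nodes}
    overlay = []

    def benefit(u, v):
        # Number of topics whose component count drops if edge (u, v) is added.
        return sum(1 for t in interest[u] & interest[v]
                   if find(parent[t], u) != find(parent[t], v))

    while True:
        cur_max = max(degree.values())
        best = None
        for u, v in combinations(nodes, 2):
            b = benefit(u, v)
            if b == 0:
                continue
            # Prefer the smallest resulting maximum degree, then the largest benefit.
            key = (max(cur_max, degree[u] + 1, degree[v] + 1), -b)
            if best is None or key < best[0]:
                best = (key, u, v)
        if best is None:
            break  # every topic is connected
        _, u, v = best
        for t in interest[u] & interest[v]:
            parent[t][find(parent[t], u)] = find(parent[t], v)
        degree[u] += 1
        degree[v] += 1
        overlay.append((u, v))
    return overlay, degree

interest = {
    "N1": {"B", "C", "D"},
    "N2": {"A", "B", "C", "E"},
    "N3": {"A", "D"},
    "N4": {"A", "B", "X"},
    "N5": {"A", "X"},
}
overlay, degree = oda(interest)
print(overlay, "max degree:", max(degree.values()))
```

On the five-node example this sketch also ends with five edges and a maximum degree of 2, in line with the figure quoted on the slide.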

ODA Running Time
O(|V|^4 · |T|)
– At most |V|^2 iterations
– At most |V|^2 edges inspected at each iteration
– At most |T| steps to inspect an edge
Can be optimized to run in O(|V|^2 · |T|)
– For each e ∈ V × V, weight(e) = the number of connected components merged by e
– At each iteration, output the heaviest edge and adjust the other edge weights accordingly
– Stop once there are no more edges with weight > 0

Approximability Results
Lemma: The number of edges in the overlay constructed by GM ≤ log(|V| · |T|) · OPT
Proof: Similar to that of the approximation ratio of the greedy algorithm for Set Cover
– Uses Maximum Weighted Matching
– Uses Edge Coloring
Theorem: No polynomial-time algorithm can approximate MinMax-TCO within a constant factor (unless P=NP)
Proof: Such an algorithm would imply a constant-factor approximation for Set Cover, which is known to be impossible (unless P=NP)

Experimental Results I
[Plot: Maximum Node Degree. #topics: 100, #subscriptions: 10, uniform distribution]

Experimental Results II
[Plot: Average Node Degree. #topics: 100, #subscriptions: 10, uniform distribution]

Experimental Results III
[Plot: Maximum Node Degree. #topics: 100, #nodes: 100, uniform distribution]

Outline
– Minimum Maximum Degree Publish-Subscribe Overlay Network Design
– Parameterized Maximum and Average Degrees in Publish-Subscribe Overlay Network Design
– Constant Diameter Publish-Subscribe Overlay Network Design

ODA Algorithm
The ODA algorithm can produce an overlay with average degree Θ(n), even when an overlay with constant average degree exists.
[Diagram: example instance on nodes v_1, v_2, v_3, …, v_{n-1}, v_n]

ODA and GM Algorithms
GM Algorithm: choose the edge with maximum benefit
– Average degree: O(log(nt)) approximation
– Maximum degree: O(n) approximation
ODA Algorithm: choose the edge with maximum benefit among those that increase the maximum degree minimally
– Average degree: O(n) approximation
– Maximum degree: O(log(nt)) approximation
How to approximate both average and maximum degree?

Parameterized Algorithm
e_1: edge with maximum benefit
e_2: edge with maximum benefit among those that increase the maximum degree minimally
If w(e_2) > w(e_1) / k, choose e_2
Otherwise, choose e_1
1 < k < n
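The selection rule on this slide can be written as a small function. This is a sketch under assumptions: the name `choose_edge`, the `(u, v, w)` candidate tuples, and the `degree` dictionary are hypothetical conventions, and "increases the maximum degree minimally" is read as "yields the smallest maximum degree after the edge is added".

```python
def choose_edge(candidates, degree, k):
    """Pick the next edge by the parameterized rule.

    candidates: list of (u, v, w) with benefit w > 0
    degree:     dict mapping node -> current degree
    k:          trade-off parameter, 1 < k < n
    """
    cur_max = max(degree.values())

    def new_max(c):
        u, v, _ = c
        return max(cur_max, degree[u] + 1, degree[v] + 1)

    # e1: the edge with maximum benefit overall (the GM choice).
    e1 = max(candidates, key=lambda c: c[2])
    # e2: the best edge among those with the smallest resulting maximum degree (the ODA choice).
    lowest = min(new_max(c) for c in candidates)
    e2 = max((c for c in candidates if new_max(c) == lowest), key=lambda c: c[2])
    return e2 if e2[2] > e1[2] / k else e1

degree = {"a": 3, "b": 3, "c": 0, "d": 0}
candidates = [("a", "b", 8), ("c", "d", 1)]
print(choose_edge(candidates, degree, k=4))   # 1 > 8/4 is false, so e1 is chosen
print(choose_edge(candidates, degree, k=10))  # 1 > 8/10 is true, so e2 is chosen
```

A larger k favors the degree-preserving edge e_2 more often, while a smaller k behaves more like GM.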

Algorithms

Algorithm   Average degree                   Maximum degree
GM          O(log(nt)) approximation         O(n) approximation
ODA         O(n) approximation               O(log(nt)) approximation
P-ODA       O(k · log(nt)) approximation     O((n/k) · log(nt)) approximation

Outline
– Minimum Maximum Degree Publish-Subscribe Overlay Network Design
– Parameterized Maximum and Average Degrees in Publish-Subscribe Overlay Network Design
– Constant Diameter Publish-Subscribe Overlay Network Design

Constant Diameter Overlays
Constant Diameter Topic-Connected Overlay (CD-TCO) problem:
– Given a set of nodes V, a set of topics T, and Interest: V × T → {true, false}
– Construct a topic-connected, constant-diameter overlay G with the minimum possible average degree
The GM algorithm can produce an overlay with diameter Θ(n), where n is the number of nodes in the pub/sub system.

Constant Diameter Overlay Algorithm
Constant Diameter Overlay Design Algorithm:
– At each iteration:
  Find the number of neighbors of each node
  Add a star that connects the maximum number of nodes
  Remove the topics that are connected by the star
– Stop once there is a single connected component for each topic
Number of neighbors of node u: (formula not preserved in this transcript)
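The star-based loop can be sketched as follows. Because the slide's formula for the number of neighbors is missing from the transcript, this sketch assumes that a node's neighbors are the other nodes sharing at least one not-yet-connected topic with it; the function name `star_overlay` and the tie-breaking by node name are also assumptions.

```python
def star_overlay(interest):
    """Greedy star construction: every topic ends up connected through one star center."""
    nodes = sorted(interest)
    topics = set().union(*interest.values())
    subscribers = {t: {n for n in nodes if t in interest[n]} for t in topics}
    # Topics with a single subscriber are trivially connected.
    remaining = {t for t in topics if len(subscribers[t]) > 1}
    edges = set()

    def neighbors(u):
        # Assumed definition: nodes sharing a remaining topic with u.
        nbrs = set()
        for t in interest[u] & remaining:
            nbrs |= subscribers[t]
        nbrs.discard(u)
        return nbrs

    while remaining:
        center = max(nodes, key=lambda u: len(neighbors(u)))
        for v in neighbors(center):
            edges.add(frozenset((center, v)))
        # The star connects all subscribers of every remaining topic of the center.
        remaining -= interest[center]
    return edges

interest = {
    "N1": {"B", "C", "D"},
    "N2": {"A", "B", "C", "E"},
    "N3": {"A", "D"},
    "N4": {"A", "B", "X"},
    "N5": {"A", "X"},
}
edges = star_overlay(interest)
print(sorted(tuple(sorted(e)) for e in edges))
```

Since all subscribers of a topic attach to the same center, any two of them are at most two hops apart within that topic, which is the sense in which the diameter stays constant in this sketch.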

Constant Diameter Overlay Algorithm I
Constant Diameter Overlay Design Algorithm I:
– At each iteration:
  Find the weight of each node
  Add a star centered at the node with maximum weight
  Remove the topics that are connected by the star
– Stop once there is a single connected component for each topic
Weight of node u: (formula not preserved in this transcript)

Constant Diameter Overlay Algorithm II
Constant Diameter Overlay Design Algorithm II:
– At each iteration:
  Find the number of neighbors of each node
  Add a star centered at the node with maximum density
  Remove the topics that are connected by the star
– Stop once there is a single connected component for each topic
Density of node u: (formula not preserved in this transcript)

Experimental Results I
[Plot: Average Node Degree, varying #nodes. #topics: 100, #subscriptions: 10, uniform distribution]
Only 2.3 times more edges

Experimental Results II
[Plot: Average Node Degree, varying #topics. #nodes: 100, #subscriptions: 20, uniform distribution]
Only 1.9 times more edges

Experimental Results III
[Plot: Average Node Degree, varying #subscriptions. #nodes: 100, #topics: 100, uniform distribution]
Only 1.8 times more edges

Conclusions
Formal study of the problem of designing efficient and scalable overlay topologies for pub/sub
Defined the problem (MinMax-TCO) capturing the cost of constructing topic-connected overlays
– NP-completeness, polynomial approximation, inapproximability results
Empirical evaluation showed the effectiveness of our approximation algorithm on practical inputs
Parameterized algorithm with low maximum and average degree
Defined the problem (CD-TCO), empirical results

Future Directions
Study the dynamic case
Investigate other overlay design problems
Study the distributed case
– Partial knowledge of other nodes' interests
– Dynamically changing interest assignments
Prove the diameter results theoretically

Publications
– Parameterized Maximum and Average Degrees in Topic-based Publish-Subscribe Overlay Network Design, M. Onus and A. W. Richa, submitted to 21st Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA), August
– Minimum Maximum Degree Publish-Subscribe Overlay Network Design, M. Onus and A. W. Richa, 28th Annual IEEE Conference on Computer Communications (INFOCOM), April 2009, Rio De Janeiro, Brazil.
– Distributed Coloring with O(log n) bits, K. Kothapalli, M. Onus, C. Scheideler and C. Schindelhauer, to appear in Journal of Parallel and Distributed Computing (JPDC),
– Linearization: Locally Self Stabilizing Sorting in Graphs, M. Onus, A. W. Richa, C. Scheideler, Workshop on Algorithm Engineering & Experiments (ALENEX), January 2007, New Orleans, Louisiana.
– A Scalable Multilevel Algorithm for Community Structure Detection, H. Djidjev and M. Onus, 4th Workshop on Algorithms and Models for the Web-Graph (WAW), November 2006, Banff, Alberta.
– Heuristics for Minimum Brauer Chain Problem, F. Gelgi and M. Onus, 21st International Symposium on Computer and Information Sciences (ISCIS), Springer LNCS 4263, November 2006, Istanbul, Turkey.
– Distributed Coloring with O(log n) bits, K. Kothapalli, C. Scheideler, M. Onus and C. Schindelhauer, 20th IEEE Parallel & Distributed Processing Symposium (IPDPS), April 2006, Rhodes Island, Greece.
– Efficient Broadcasting and Gathering in Wireless Ad-Hoc Networks, M. Onus, A. W. Richa, K. Kothapalli and C. Scheideler, International Symposium on Parallel Architectures, Algorithms and Networks (I-SPAN), December 2005, Las Vegas, Nevada.
– Constant Density Spanners for Wireless Ad-Hoc Networks, K. Kothapalli, C. Scheideler, M. Onus and A. W. Richa, 17th Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA), July 2005, Las Vegas, Nevada.

Thank You!