Operator Placement for In-Network Stream Query Processing.

Slides:



Advertisements
Similar presentations
Improved Approximation for Orienting Mixed Graphs Iftah Gamzu CS Division, The Open Univ., and CS School, Tel-Aviv University Moti Medina EE School, Tel-Aviv.
Advertisements

Great Theoretical Ideas in Computer Science
Minimum Clique Partition Problem with Constrained Weight for Interval Graphs Jianping Li Department of Mathematics Yunnan University Jointed by M.X. Chen.
Multicast in Wireless Mesh Network Xuan (William) Zhang Xun Shi.
Great Theoretical Ideas in Computer Science for Some.
Lecture 24 Coping with NPC and Unsolvable problems. When a problem is unsolvable, that's generally very bad news: it means there is no general algorithm.
Minimum Energy Mobile Wireless Networks IEEE JSAC 2001/10/18.
How Bad is Selfish Routing? By Tim Roughgarden Eva Tardos Presented by Alex Kogan.
Adaptive Monitoring of Bursty Data Streams Brian Babcock, Shivnath Babu, Mayur Datar, and Rajeev Motwani.
Efficient Query Evaluation on Probabilistic Databases
1 Stochastic Event Capture Using Mobile Sensors Subject to a Quality Metric Nabhendra Bisnik, Alhussein A. Abouzeid, and Volkan Isler Rensselaer Polytechnic.
Great Theoretical Ideas in Computer Science.
The Cache Location Problem IEEE/ACM Transactions on Networking, Vol. 8, No. 5, October 2000 P. Krishnan, Danny Raz, Member, IEEE, and Yuval Shavitt, Member,
June 3, 2015Windows Scheduling Problems for Broadcast System 1 Amotz Bar-Noy, and Richard E. Ladner Presented by Qiaosheng Shi.
Network Correlated Data Gathering With Explicit Communication: NP- Completeness and Algorithms R˘azvan Cristescu, Member, IEEE, Baltasar Beferull-Lozano,
Approximating Sensor Network Queries Using In-Network Summaries Alexandra Meliou Carlos Guestrin Joseph Hellerstein.
1 Maximizing Lifetime of Sensor Surveillance Systems IEEE/ACM TRANSACTIONS ON NETWORKING Authors: Hai Liu, Xiaohua Jia, Peng-Jun Wan, Chih- Wei Yi, S.
1 Data Persistence in Large-scale Sensor Networks with Decentralized Fountain Codes Yunfeng Lin, Ben Liang, Baochun Li INFOCOM 2007.
1 Network-supported Rate Control Mechanism for Multicast Streaming Media Kiyohide NAKAUCHI, Hiroyuki MORIKAWA, and Tomonori AOYAMA, School of Engineering,
Operator Placement for In-Network Stream Query Processing U. Srivastava, K. Mungala, and J. Widom, PODS 2005 ICS280 class presentation by Iosif Lazaridis.
Distributed Regression: an Efficient Framework for Modeling Sensor Network Data Carlos Guestrin Peter Bodik Romain Thibaux Mark Paskin Samuel Madden.
1 Delay-efficient Data Gathering in Sensor Networks Bin Tang, Xianjin Zhu and Deng Pan.
Parallel Routing Bruce, Chiu-Wing Sham. Overview Background Routing in parallel computers Routing in hypercube network –Bit-fixing routing algorithm –Randomized.
Non-myopic Informative Path Planning in Spatio-Temporal Models Alexandra Meliou Andreas Krause Carlos Guestrin Joe Hellerstein.
CPSC 689: Discrete Algorithms for Mobile and Wireless Systems Spring 2009 Prof. Jennifer Welch.
Energy-Efficient Rate Scheduling in Wireless Links A Geometric Approach Yashar Ganjali High Performance Networking Group Stanford University
SMART: A Scan-based Movement- Assisted Sensor Deployment Method in Wireless Sensor Networks Jie Wu and Shuhui Yang Department of Computer Science and Engineering.
Dean H. Lorenz, Danny Raz Operations Research Letter, Vol. 28, No
Flow Algorithms for Two Pipelined Filtering Problems Anne Condon, University of British Columbia Amol Deshpande, University of Maryland Lisa Hellerstein,
Near-Optimal Network Design with Selfish Agents By Elliot Anshelevich, Anirban Dasgupta, Eva Tardos, Tom Wexler STOC’03 Presented by Mustafa Suleyman CIFTCI.
Chain: Operator Scheduling for Memory Minimization in Data Stream Systems Authors: Brian Babcock, Shivnath Babu, Mayur Datar, and Rajeev Motwani (Dept.
Online Data Gathering for Maximizing Network Lifetime in Sensor Networks IEEE transactions on Mobile Computing Weifa Liang, YuZhen Liu.
Distributed Combinatorial Optimization
QoS Routing with Performance- Dependent Costs Funda Ergun ;Rakesh Sinha ;Lisa Zhang INFOCOM 2000.Nineteenth Annual Joint Conference of the IEEE Computer.
Network Optimization Problems: Models and Algorithms
Approximation Algorithms Motivation and Definitions TSP Vertex Cover Scheduling.
CS Dept, City Univ.1 The Complexity of Connectivity in Wireless Networks Presented by LUO Hongbo.
Heuristic Algorithms for Multiconstrained Quality-of-Service Routing Xin Yuan, Member, IEEE IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. 10, VO. 2, APRIL.
CPSC 689: Discrete Algorithms for Mobile and Wireless Systems Spring 2009 Prof. Jennifer Welch.
MAXIMIZING SPECTRUM UTILIZATION OF COGNITIVE RADIO NETWORKS USING CHANNEL ALLOCATION AND POWER CONTROL Anh Tuan Hoang and Ying-Chang Liang Vehicular Technology.
Sharanya Eswaran, Penn State University Matthew Johnson, CUNY Archan Misra, Telcordia Technolgoies Thomas La Porta, Penn State University Utility-driven.
Efficient Gathering of Correlated Data in Sensor Networks
Network Aware Resource Allocation in Distributed Clouds.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
An Algorithm for the Coalitional Manipulation Problem under Maximin Michael Zuckerman, Omer Lev and Jeffrey S. Rosenschein AAMAS’11.
An Algorithm for the Coalitional Manipulation Problem under Maximin Michael Zuckerman, Omer Lev and Jeffrey S. Rosenschein (Simulations by Amitai Levy)
Efficient Algorithms for Maximum Lifetime Data Gathering and Aggregation in Wireless Sensor Networks Selected from Elsevier: Computer Networks Konstantinos.
When In-Network Processing Meets Time: Complexity and Effects of Joint Optimization in Wireless Sensor Networks Department of Computer Science, Wayne State.
« Pruning Policies for Two-Tiered Inverted Index with Correctness Guarantee » Proceedings of the 30th annual international ACM SIGIR, Amsterdam 2007) A.
Minimum Average Routing Path Clustering Problem in Multi-hop 2-D Underwater Sensor Networks Presented By Donghyun Kim Data Communication and Data Management.
A Faster Approximation Scheme for Timing Driven Minimum Cost Layer Assignment Shiyan Hu*, Zhuo Li**, and Charles J. Alpert** *Dept of ECE, Michigan Technological.
An Efficient Linear Time Triple Patterning Solver Haitong Tian Hongbo Zhang Zigang Xiao Martin D.F. Wong ASP-DAC’15.
A Optimal On-line Algorithm for k Servers on Trees Author : Marek Chrobak Lawrence L. Larmore 報告人:羅正偉.
ASSIGNMENT, DISTRIBUTION AND QOS PROVISIONING IN COMMUNICATION NETWORKS.
Collaborative Broadcasting and Compression in Cluster-based Wireless Sensor Networks Anh Tuan Hoang and Mehul Motani National University of Singapore Wireless.
Variations of the Prize- Collecting Steiner Tree Problem Olena Chapovska and Abraham P. Punnen Networks 2006 Reporter: Cheng-Chung Li 2006/08/28.
Network-Aware Query Processing for Stream- based Application Yanif Ahmad, Ugur Cetintemel - Brown University VLDB 2004.
Adaptive Ordering of Pipelined Stream Filters Babu, Motwani, Munagala, Nishizawa, and Widom SIGMOD 2004 Jun 13-18, 2004 presented by Joshua Lee Mingzhu.
Efficient Resource Allocation for Wireless Multicast De-Nian Yang, Member, IEEE Ming-Syan Chen, Fellow, IEEE IEEE Transactions on Mobile Computing, April.
1 Planning Base Station and Relay Station Locations in IEEE j Multi-hop Relay Networks Yang Yu, Seán Murphy, Liam Murphy Department of Computer Science.
Minimizing Energy Expense for Chain-Based Data Gathering in Wireless Sensor Networks Li-Hsing Yen Chung Hua University Taiwan EWSN 05.
A Pseudo Random Coordinated Scheduling Algorithm for Bluetooth Scatternets MobiHoc 2001.
Holistic Twig Joins Optimal XML Pattern Matching Nicolas Bruno Columbia University Nick Koudas Divesh Srivastava AT&T Labs-Research SIGMOD 2002.
Rate-Based Query Optimization for Streaming Information Sources Stratis D. Viglas Jeffrey F. Naughton.
Optimal Relay Placement for Indoor Sensor Networks Cuiyao Xue †, Yanmin Zhu †, Lei Ni †, Minglu Li †, Bo Li ‡ † Shanghai Jiao Tong University ‡ HK University.
Approximation Algorithms based on linear programming.
Dejun Yang (Arizona State University)
Coverage Approximation Algorithms
Brian Babcock, Shivnath Babu, Mayur Datar, and Rajeev Motwani
Polynomial time approximation scheme
Presentation transcript:

Operator Placement for In-Network Stream Query Processing

Outline  Introduction  Preliminaries  Filter placement  Extensions  Conclusions

Introduction  In-network query processing  Consider a video surveillance application Environment Target  Suspicious activity  dark, movement Need  filter for calculating intensity (F1)  filter for detecting sufficient motion (F2)

Introduction  Previous work  push down all filters since CPU cost << communication cost  What if the queries involve expensive predicates ?  Objective  place each filter at the “ best" node based on selectivity and cost  minimize the overall cost

Introduction  Operator placement problem  Tradeoff Lower computational costs  Put on the nodes higher up Lower transmission cost  Put on the nodes lower down  Candidate m-level hierarchy n filters  m n possible solutions  In this paper…  Key idea Model network links as filters  Content define the problem provide a greedy alg. that failed present a polynomial-time optimal alg. extend to multiway stream join …

Preliminaries  Consider a linear chain of nodes  Notation S = data acquired by node N 1 F = { F 1, F 2, …, F n }  Query

Cost Model  Three quantities  Selectivity of filter F : s(F) fraction of the tuples in stream S that are expected to satisfy F  Cost of filter F : c(F, i) per-tuple cost of execution on node N i c(F, i+1) =  i c(F, i)   i ≤ 1 (if  i > 1 )  Cost of network transmission : l i per-tuple cost of transmitting from N i to N i+1 r s( F )r

Cost Model  Notation  P(F) = i if filter F is executed on N i  F i = { F | P(F) = i }  F ’ = F ’ 1, F ’ 2, …, F ’ n ’ c(F ’, i) = the cost per tuple of executing F ’ at node N i  r(F i ) = F i in rank order   Ref. J. Hellerstein and M.Stonebraker. Predicate migration: Optimizing queries with expensive predicates  Cost on a single node  Overall cost 

Example 2.2 c(P) = c(F 1, 1) + s(F 1 ) c(F 2, 1) + s(F 1 ) s(F 2 ) [ l 1 + l 2 + c(F 3, 3) ] + s(F 1 ) s(F 2 ) s(F 3 ) [ l 3 + c(F 4, 4) ] = (½) (½) (½) [ (1/5) (1/2) 1300 ] + (½) (½) (½) [ (1/5) (1/2) (1/4) 2500 ] = = s(F) = 1/2

Filter Placement 1.Greedy algorithm 2.Optimal algorithm

Greedy algorithm  Notation  c(P, i) = part of the total cost c(P) incurred at N i including transmission from N i to N i+1  network link N i to N i+1 : s( ) = 0, c(,1) = l i 

Example 3.3 At N 1, r(F 1 ) = 400, r(F 2 ) = 800, r(F 3 ) = 2600, r(F 4 ) = 5000, F l 1 = 700 > r(F 1 ) At N 2, r(F 2 ) = 160, r(F 3 ) = 520, r(F 4 ) = 1000, F l 2 = 500 > r(F 2 ) At N 3, r(F 3 ) = 260, r(F 4 ) = 500, F l 3 = 300 > r(F 3 ) At N 4, r(F 4 ) c(P) = =

Optimal algorithm  Notation  network link N i to N i+1 :, 

Optimal algorithm  Short-circuiting  Rank  Cost scaleup

Optimal algorithm

Example 3.7 Model links as filters = , r(F 1 ) = 400, r(F 2 ) = 800, r(F 3 ) = 2600, r(F 4 ) = 5000, r(F l 1 ) = 875, r(F l 2,4 ) = r(F 1 ) < r(F 2 ) < r(F l 1 ) < r(F 3 ) < r(F l 2,4 ) < r(F 4 ) c(P) = =

Extensions  Correlated filters  Tree hierarchies  Joins  Other extensions

Correlated filters  Definition  Conditional selecivity s(F|Q) = the fraction of tuples that satisfy F given that they satisfy all the filters in Q  Reference  Optimal ordering of correlated filters at a single node NP-hard guaranteed to find a cost at most 4 times the opt. cost  Approximation ratio of 4 the best possible unless P = NP 

Correlated filters  Definition ,  Short-circuiting  Optimal solution  Tree hierarchy  = Each of the queries operates on different data. There is no sharing computation or transmission among them.

Joins  Problem  k different data streams acquired by N 1  Solution  Reference Sliding-window join MJoin operator  at a single node  join tree is left as future work  Query W 1 and W 2 represent the lengths of the windows (time-pased or tuple- based) on streams S 1 and S 2.

Joins  Joint operator  Illustration  Selectivity s(  ) = the fraction of the cross product that occurs in the join result  Cost r1r1 r2r2 s(  )r 1 r 2

Joins  Notation  F i = filters that can be applied either on S i before the join or after  | F i | = n i  F 12 = filters that can be applied only on after e the join 

Joins  Time complexity : O(n 2 n 1 m(n+m)log(n+m))

Extensions  Constrained nodes   Per-filter cost scaling  c(F, i+1) / c(F, i) may be different for different F.  Modeling network links as filters no longer applies.  It becomes NP-hard.

Conclusion  Environment  Operator placement problem  Tradeoff Lower computational costs  Put on the nodes higher up Lower transmission cost  Put on the nodes lower down  Provide  Greedy alg. & Optimal alg.  Extensions

Lemma 3.1 by (2)

F 1 in P is chosen according to the theorem. ∵ Lemma 3.1 and s(F l 1 )=0 ∴ F’ 1 in P’ s.t. c( P’, 1 ) < c( P, 1 ) ∵ Theorem 2.1 ∴ c( P, 1 ) ≦ c( P’, 1 ) → contradiction Theorem 3.2

Lemma 3.4

F 1 in P is chosen according to the theorem. ∵ Lemma 3.4 ∴ P’ s.t. c( P’, 1 ) < c( P, 1 ) ∵ Theorem 2.1 ∴ c( P, 1 ) ≦ c( P’, 1 ) → contradiction Theorem 3.5

Suppose and the best Moving the filters on node Ni to Ni-1 Moving the filters on node Ni to Ni+1 ∵ P is best plan ∴ c( P) < c( P’), c( P) < c( P”) → implies → contradiction Lemma 3.6