Mario A. Nascimento Univ. of Alberta, Canada http: //www.cs.ualberta.ca/~mn With R. Alencar and A. Brayner. Work partially supported by NSERC and CBIE.

Slides:



Advertisements
Similar presentations
2.6. B OUNDING V OLUME H IERARCHIES Overview of different forms of bounding volume hierarchy.
Advertisements

Using Trees to Depict a Forest
Fast algorithm for detecting community structure in networks M. E. J. Newman Department of Physics and Center for the Study of Complex Systems, University.
An Array-Based Algorithm for Simultaneous Multidimensional Aggregates By Yihong Zhao, Prasad M. Desphande and Jeffrey F. Naughton Presented by Kia Hall.
A Hierarchical Multiple Target Tracking Algorithm for Sensor Networks Songhwai Oh and Shankar Sastry EECS, Berkeley Nest Retreat, Jan
Mining Compressed Frequent- Pattern Sets Dong Xin, Jiawei Han, Xifeng Yan, Hong Cheng Department of Computer Science University of Illinois at Urbana-Champaign.
Query Optimization of Frequent Itemset Mining on Multiple Databases Mining on Multiple Databases David Fuhry Department of Computer Science Kent State.
Multicast in Wireless Mesh Network Xuan (William) Zhang Xun Shi.
~1~ Infocom’04 Mar. 10th On Finding Disjoint Paths in Single and Dual Link Cost Networks Chunming Qiao* LANDER, CSE Department SUNY at Buffalo *Collaborators:
Fast Algorithms For Hierarchical Range Histogram Constructions
Minimizing Seed Set for Viral Marketing Cheng Long & Raymond Chi-Wing Wong Presented by: Cheng Long 20-August-2011.
Constraint Optimization Presentation by Nathan Stender Chapter 13 of Constraint Processing by Rina Dechter 3/25/20131Constraint Optimization.
Aggregate Query Processing in Cache- Aware Wireless Sensor Networks Khaled Ammar University of Alberta.
David Chu--UC Berkeley Amol Deshpande--University of Maryland Joseph M. Hellerstein--UC Berkeley Intel Research Berkeley Wei Hong--Arched Rock Corp. Approximate.
1 HYRISE – A Main Memory Hybrid Storage Engine By: Martin Grund, Jens Krüger, Hasso Plattner, Alexander Zeier, Philippe Cudre-Mauroux, Samuel Madden, VLDB.
Selectivity-Based Partitioning Alkis Polyzotis UC Santa Cruz.
Stabbing the Sky: Efficient Skyline Computation over Sliding Windows COMP9314 Lecture Notes.
DIMENSIONALITY REDUCTION BY RANDOM PROJECTION AND LATENT SEMANTIC INDEXING Jessica Lin and Dimitrios Gunopulos Ângelo Cardoso IST/UTL December
Branch and Bound Searching Strategies
Finite State Machine State Assignment for Area and Power Minimization Aiman H. El-Maleh, Sadiq M. Sait and Faisal N. Khan Department of Computer Engineering.
Part 4 b Forward-Backward Algorithm & Viterbi Algorithm CSE717, SPRING 2008 CUBS, Univ at Buffalo.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
Ph.D. DefenceUniversity of Alberta1 Approximation Algorithms for Frequency Related Query Processing on Streaming Data Presented by Fan Deng Supervisor:
Using Analytic QP and Sparseness to Speed Training of Support Vector Machines John C. Platt Presented by: Travis Desell.
Aggregation in Sensor Networks NEST Weekly Meeting Sam Madden Rob Szewczyk 10/4/01.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract.
1 Branch and Bound Searching Strategies 2 Branch-and-bound strategy 2 mechanisms: A mechanism to generate branches A mechanism to generate a bound so.
Carmine Cerrone, Raffaele Cerulli, Bruce Golden GO IX Sirmione, Italy July
Anonymization of Set-Valued Data via Top-Down, Local Generalization Yeye He Jeffrey F. Naughton University of Wisconsin-Madison 1.
Sensor Networks Storage Sanket Totala Sudarshan Jagannathan.
Subgraph Containment Search Dayu Yuan The Pennsylvania State University 1© Dayu Yuan9/7/2015.
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
AAU A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing Presented by YuQing Zhang  Slobodan Rasetic Jorg Sander James Elding Mario A.
Time-focused density-based clustering of trajectories of moving objects Margherita D’Auria Mirco Nanni Dino Pedreschi.
Loading a Cache with Query Results Laura Haas, IBM Almaden Donald Kossmann, Univ. Passau Ioana Ursu, IBM Almaden.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Optimal Scheduling of File Transfers with Divisible Sizes on Multiple Disjoint Paths Mugurel Ionut Andreica Polytechnic University of Bucharest Computer.
1 Vicky :: Cao Hui Ping Sherman :: Chow Sze Ming CTH :: Chong Tsz Ho Ronald :: Woo Lok Yan Ken :: Yiu Man Lung Implementing Data Cubes Efficiently.
Approximate Encoding for Direct Access and Query Processing over Compressed Bitmaps Tan Apaydin – The Ohio State University Guadalupe Canahuate – The Ohio.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Optimal Partitioning of Fine-Grained Scalable Video Streams Mohamed Hefeeda.
1 Using Tiling to Scale Parallel Datacube Implementation Ruoming Jin Karthik Vaidyanathan Ge Yang Gagan Agrawal The Ohio State University.
Supervisor: Antoine Bagula Students: Mthokozisi Moyo Luis Sa Wireless Sensor Network Repairing.
1 Branch and Bound Searching Strategies Updated: 12/27/2010.
Spectral Sequencing Based on Graph Distance Rong Liu, Hao Zhang, Oliver van Kaick {lrong, haoz, cs.sfu.ca {lrong, haoz, cs.sfu.ca.
BARD / April BARD: Bayesian-Assisted Resource Discovery Fred Stann (USC/ISI) Joint Work With John Heidemann (USC/ISI) April 9, 2004.
Mining Document Collections to Facilitate Accurate Approximate Entity Matching Presented By Harshda Vabale.
Answering Top-k Queries Using Views Gautam Das (Univ. of Texas), Dimitrios Gunopulos (Univ. of California Riverside), Nick Koudas (Univ. of Toronto), Dimitris.
Multi-Query Optimization and Applications Prasan Roy Indian Institute of Technology - Bombay.
Divide and Conquer Optimization problem: z = max{cx : x  S}
QoS Supported Clustered Query Processing in Large Collaboration of Heterogeneous Sensor Networks Debraj De and Lifeng Sang Ohio State University Workshop.
By: Gang Zhou Computer Science Department University of Virginia 1 Medians and Beyond: New Aggregation Techniques for Sensor Networks CS851 Seminar Presentation.
Ohio State University Department of Computer Science and Engineering Servicing Range Queries on Multidimensional Datasets with Partial Replicas Li Weng,
Topology Management -- Power Efficient Spatial Query Presented by Weihang jiang.
An Exact Algorithm for Difficult Detailed Routing Problems Kolja Sulimma Wolfgang Kunz J. W.-Goethe Universität Frankfurt.
Branch and Bound Searching Strategies
Optimal Relay Placement for Indoor Sensor Networks Cuiyao Xue †, Yanmin Zhu †, Lei Ni †, Minglu Li †, Bo Li ‡ † Shanghai Jiao Tong University ‡ HK University.
Yinghui Wu, SIGMOD Incremental Graph Pattern Matching Wenfei Fan Xin Wang Yinghui Wu University of Edinburgh Jianzhong Li Jizhou Luo Harbin Institute.
1 Chapter 5 Branch-and-bound Framework and Its Applications.
Scheduling Jobs Across Geo-distributed Datacenters Chien-Chun Hung, Leana Golubchik, Minlan Yu Department of Computer Science University of Southern California.
Ing-Ray Chen, Member, IEEE, Hamid Al-Hamadi Haili Dong Secure and Reliable Multisource Multipath Routing in Clustered Wireless Sensor Networks 1.
Applying Combinatorial Testing to Data Mining Algorithms
Zhu Han University of Houston Thanks for Professor Dan Wang’s slides
A Study of Group-Tree Matching in Large Scale Group Communications
Spatial Online Sampling and Aggregation
Analysis and design of algorithm
Automatic Physical Design Tuning: Workload as a Sequence
DATA CACHING IN WSN Mario A. Nascimento Univ. of Alberta, Canada
Chapter 11 Limitations of Algorithm Power
Algorithms for Budget-Constrained Survivable Topology Design
Donghui Zhang, Tian Xia Northeastern University
Presentation transcript:

Mario A. Nascimento Univ. of Alberta, Canada http: // With R. Alencar and A. Brayner. Work partially supported by NSERC and CBIE (Canada) and CAPES (Brazil) D ATA C ACHING IN WSN

2 2 /23 Outline Motivation Cache-Aware Query Processing Cache-Aware Query Optimization Query Partitioning Cached Data Selection Cache Maintenance Experimental Results

3 3 /23 (One) Application Scenario

4 4 /23 Using Previous (Cached) Queries Q’ : Q “minus” {P i } { P i } : Set of previous queries Q : Current query

5 5 /23 Query Partitioning Overhead Query processing cost is estimated through an analytical cost-model Query Processing: query is forwarded, locally flooded, results are collected and shipped back

6 6 /23 Overall Architecture Current query Relevant Cached Queries Non-stale subset of P and its dataset Subset of relevant queries and sub-queries (min: query cost) Answer

7 7 /23 “Query Plan” Problem (QSP) Less larger sub-queries vs. more smaller sub-queries For obtaining Q’ we used the General Polygon Clipper library. For partitioning Q’ into the set of sub-queries Θ we used a O(v log v) algorithm which finds a sub-optimal solution (minimizing the number of sub-queries).

8 8 /23 B+B (Heuristic) Solution to QSP For each node Q is “clipped” using a subset of P’’, a set of sub-queries is generated and its cost is obtained. The search stops at a local minimun.

9 9 /23 Other Heuristic Solutions to QSP In addition to the B+B we also used two more aggressive greedy heuristics: GrF (GrE) starts with all (no) cached queries removing (inserting) the smallest (largest) cached query as long as there is some gain. GrF “path”

10 10 /23 Cache Maintenance

11 11 /23 Cache Maintenance

12 12 /23 Losses wrt Optimal Solution B+B is the Branch-and-Bound heuristic. GrF (GrE) is an aggressive greedy heuristic, starting with all (no) cache and removing (inserting) the smallest (largest) cached queries available as long as there is some gain.

13 13 /23 Gains wrt NOT Using Cache By design GrE cannot be any worse that no using any cache.

14 14 /23 Gains wrt Using ALL Cache By design GrF cannot be any worse that using all of the cache.

15 15 /23 Detailed results or skip to main conclusions?skip

16 16 /23 Detailed results We investigate the performance of the proposed approach wrt efficiency (for finding the query plan) and effectiveness (cost of solution) when varying: Number of sensors Size of cache (number of cached queries) Query size (wrt total area) Validity time (of cached results)

17 17 /23 Varying # of Sensors

18 18 /23 Varying Cache Size

19 19 /23 Varying Query Size

20 20 /23 Varying Query Validity Time

21 21 /23 Conclusions The cached query selection, query clipping and sub- queries generation amounts to a fairly complex and combinatorial problem Although a query cost model is needed, our proposal is orthogonal to it If nothing can be done your best shot is to use all of the cache, but …

22 22 /23 Conclusions The Branch-and-Bound heuristic : Finds a “query plan” orders of magnitude faster than the exhaustive search Is typically less than 2% more expensive than the optimal query cost Is robust with respect to a number of different parameters Next stop: Aggregation queries …

23 23 /23 Thanks