Skyline Queries Against Mobile Lightweight Devices in MANETs Zhiyong Huang 1 Christian S. Jensen 2 Hua Lu 1 Beng Chin Ooi 1 1 National University of Singapore,

Slides:



Advertisements
Similar presentations
Dynamic Source Routing (DSR) algorithm is simple and best suited for high mobility nodes in wireless ad hoc networks. Due to high mobility in ad-hoc network,
Advertisements

The Skyline Operator (Stephan Borzsonyi, Donald Kossmann, Konrad Stocker) Presenter: Shehnaaz Yusuf March 2005.
Distributed DBMSPage © 1998 M. Tamer Özsu & Patrick Valduriez Outline Introduction Background Distributed DBMS Architecture Distributed Database.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.6/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control.
Effectively Indexing Uncertain Moving Objects for Predictive Queries School of Computing National University of Singapore Department of Computer Science.
July 29HDMS'08 Caching Dynamic Skyline Queries D. Sacharidis 1, P. Bouros 1, T. Sellis 1,2 1 National Technical University of Athens 2 Institute for Management.
Sharing Aggregate Computation for Distributed Queries Ryan Huebsch, UC Berkeley Minos Garofalakis, Yahoo! Research † Joe Hellerstein, UC Berkeley Ion Stoica,
Localized Techniques for Power Minimization and Information Gathering in Sensor Networks EE249 Final Presentation David Tong Nguyen Abhijit Davare Mentor:
1  Simple Nested Loops Join:  Block Nested Loops Join  Index Nested Loops Join  Sort Merge Join  Hash Join  Hybrid Hash Join Evaluation of Relational.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
Optimization of Spatial Joins on Mobile Devices N. Mamoulis 1, P. Kalnis 2, S. Bakiras 3, X. Li 2 1 Department of Computer Science and Information Systems,
Fault-tolerant Adaptive Divisible Load Scheduling Xuan Lin, Sumanth J. V. Acknowledge: a few slides of DLT are from Thomas Robertazzi ’ s presentation.
1 Continuous k-dominant Skyline Query Processing Presented by Prasad Sriram Nilu Thakur.
Probabilistic Skyline Operator over sliding Windows Wan Qian HKUST DB Group.
A Local Facility Location Algorithm Supervisor: Assaf Schuster Denis Krivitski Technion – Israel Institute of Technology.
Ad-hoc Distributed Spatial Joins on Mobile Devices Panos Kalnis, Xiaochen Li National University of Singapore Nikos Mamoulis The University of Hong Kong.
Abstract Shortest distance query is a fundamental operation in large-scale networks. Many existing methods in the literature take a landmark embedding.
CS401 presentation1 Effective Replica Allocation in Ad Hoc Networks for Improving Data Accessibility Takahiro Hara Presented by Mingsheng Peng (Proc. IEEE.
FLANN Fast Library for Approximate Nearest Neighbors
Hash, Don’t Cache: Fast Packet Forwarding for Enterprise Edge Routers Minlan Yu Princeton University Joint work with Jennifer.
Welcome To. Improving Remote File Transfer Speeds By The Solution For: %
A Highly Adaptive Distributed Routing Algorithm for Mobile Wireless Networks Research Paper By V. D. Park and M. S. Corson.
Itrat Rasool Quadri ST ID COE-543 Wireless and Mobile Networks
09/07/2004Peer-to-Peer Systems in Mobile Ad-hoc Networks 1 Lookup Service for Peer-to-Peer Systems in Mobile Ad-hoc Networks M. Tech Project Presentation.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
1 Fast Failure Recovery in Distributed Graph Processing Systems Yanyan Shen, Gang Chen, H.V. Jagadish, Wei Lu, Beng Chin Ooi, Bogdan Marius Tudor.
Skyline Queries Against Mobile Lightweight Devices in MANETs Zhiyong Huang 1 Christian S. Jensen 2 Hua Lu 1 Beng Chin Ooi 1 1 National University of Singapore,
1 BitHoc: BitTorrent for wireless ad hoc networks Jointly with: Chadi Barakat Jayeoung Choi Anwar Al Hamra Thierry Turletti EPI PLANETE 28/02/2008 MAESTRO/PLANETE.
Leader Election Algorithms for Mobile Ad Hoc Networks Presented by: Joseph Gunawan.
Database Management 9. course. Execution of queries.
Network: Location Management Y. Richard Yang 3/21/2011.
Data Distribution Dynamic Data Distribution. Outline Introductory Comments Dynamic (Value based) Data Distribution: HLA Data Distribution Management –Routing.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Evaluating FERMI features for Data Mining Applications Masters Thesis Presentation Sinduja Muralidharan Advised by: Dr. Gagan Agrawal.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
Bin Cui, Hua Lu, Quanqing Xu, Lijiang Chen, Yafei Dai, Yongluan Zhou ICDE 08 Parallel Distributed Processing of Constrained Skyline Queries by Filtering.
1 Mobile-Assisted Localization in Wireless Sensor Networks Nissanka B.Priyantha, Hari Balakrishnan, Eric D. Demaine, Seth Teller IEEE INFOCOM 2005 March.
Load-Balancing Routing in Multichannel Hybrid Wireless Networks With Single Network Interface So, J.; Vaidya, N. H.; Vehicular Technology, IEEE Transactions.
MEMORY ORGANIZTION & ADDRESSING Presented by: Bshara Choufany.
WIRELESS AD-HOC NETWORKS Dr. Razi Iqbal Lecture 6.
Zhuo Peng, Chaokun Wang, Lu Han, Jingchao Hao and Yiyuan Ba Proceedings of the Third International Conference on Emerging Databases, Incheon, Korea (August.
Lecture 15- Parallel Databases (continued) Advanced Databases Masood Niazi Torshiz Islamic Azad University- Mashhad Branch
Dual-Region Location Management for Mobile Ad Hoc Networks Yinan Li, Ing-ray Chen, Ding-chau Wang Presented by Youyou Cao.
A Method for Distributed Computation of Semi-Optimal Multicast Tree in MANET Eiichi Takashima, Yoshihiro Murata, Naoki Shibata*, Keiichi Yasumoto, and.
CS 484 Load Balancing. Goal: All processors working all the time Efficiency of 1 Distribute the load (work) to meet the goal Two types of load balancing.
PRIN WOMEN PROJECT Research Unit: University of Naples Federico II G. Ferraiuolo
An Optimal Broadcast Algorithm for Content-Addressable Networks Ludovic Henrio Fabrice Huet Justine Rochas 1 18/12/ OPODIS (Nice)
Interactive Data Exploration Using Semantic Windows Alexander Kalinin Ugur Cetintemel, Stan Zdonik.
Parallel and Distributed Simulation Data Distribution II.
Query Execution. Where are we? File organizations: sorted, hashed, heaps. Indexes: hash index, B+-tree Indexes can be clustered or not. Data can be stored.
Ohio State University Department of Computer Science and Engineering Servicing Range Queries on Multidimensional Datasets with Partial Replicas Li Weng,
A Multicast Routing Algorithm Using Movement Prediction for Mobile Ad Hoc Networks Huei-Wen Ferng, Ph.D. Assistant Professor Department of Computer Science.
Movement-Based Check-pointing and Logging for Recovery in Mobile Computing Systems Sapna E. George, Ing-Ray Chen, Ying Jin Dept. of Computer Science Virginia.
Parallelizing Functional Tests for Computer Systems Using Distributed Graph Exploration Alexey Demakov, Alexander Kamkin, and Alexander Sortov
Handling Data Skew in Parallel Joins in Shared-Nothing Systems Yu Xu, Pekka Kostamaa, XinZhou (Teradata) Liang Chen (University of California) SIGMOD’08.
A Flexible Spatio-temporal indexing Scheme for Large Scale GPS Tracks Retrieval Yu Zheng, Longhao Wang, Xing Xie Microsoft Research.
Igor EPIMAKHOV Abdelkader HAMEURLAIN Franck MORVAN
Tian Xia and Donghui Zhang Northeastern University
Parallel Density-based Hybrid Clustering
任課教授:陳朝鈞 教授 學生:王志嘉、馬敏修
Query-Friendly Compression of Graph Streams
Yu Su, Yi Wang, Gagan Agrawal The Ohio State University
A Fault-Tolerant Routing Strategy for Fibonacci-Class Cubes
Outline Introduction Background Distributed DBMS Architecture
Paraskevi Raftopoulou, Euripides G.M. Petrakis
Effective Replica Allocation
Study of performance of regular TCP in MANETs (using simulator).
Spatial Databases: Spatio-Temporal Databases
Automatic and Efficient Data Virtualization System on Scientific Datasets Li Weng.
Efficient Processing of Top-k Spatial Preference Queries
Presentation transcript:

Skyline Queries Against Mobile Lightweight Devices in MANETs Zhiyong Huang 1 Christian S. Jensen 2 Hua Lu 1 Beng Chin Ooi 1 1 National University of Singapore, Singapore 2 Aalborg University, Denmark

Outline Introduction Problem Definition Skyline Queries in MANETs Optimizations on Mobile Devices Experimental Studies Conclusion

Introduction Skyline query –Operator based on dominance MANET –Self-organizing, wireless mobile ad-hoc networks –Physical environment of this work Lightweight devices

Skyline Queries in MANETs Assumptions –Each resource-constrained device holds a portion of the entire dataset –Devices communicate through MANET –A mobile user is only interested in data of a limited geographical area, though the query involves data stored on multiple mobile devices

Example M 1 to M 4 hold different hotel relations M 2 is interested in cheap and good hotels within the circle area M1M1 M2M2 M3M3 M4M4

Outline Introduction Problem Definition Skyline Queries in MANETs Optimizations on Mobile Devices Experimental Studies Conclusion and Future Work

Problem Setting MANET of m mobile devices –{M 1, M 2, …, M m } Local relation R i on each device M i – Skyline issued by a device M org – id: network id of query originator M org pos: position of M org d:distance (from pos) of interest

Technical Challenges Slow and unreliable wireless channels compared to wired connections –To reduce data transferred between devices Resource-constrained devices –Storage and processing saving techniques on mobile devices

Outline Introduction Problem Definition Skyline Queries in MANETs Optimizations on Mobile Devices Experimental Studies Conclusion

Straightforward Strategy Query originator M org –Executing a local skyline query: SK org –Sends query to other mobile devices –Merges results when receiving them A mobile device M i –Executing a local skyline query too –Sends result SK i back to M org –Instead of sending whole R i

Discussion Final skyline result: SK SK≠ U SK i, SK U SK i FSK = U SK i –SK FSK contains all those tuples that are not in SK but sent between devices Identify SK i –SK on device M i –Inspiration of semi-join U|U|

Filtering Strategy Any tuple tp i in SK i –SK is dominated by some tuple(s) tp j in SK Where to find such tp j s? –Pick from M org ’ s local result –Send as query –M i filters out tuples using tp j Which one to pick? –Dominating region

Dominating Region The ability of tp j to dominate others –Tuple value –Data space boundaries Volume of dominating region –VDR j =∏ k (b k -p jk ) Choose from SK org tp flt with max VDR j –Indep. distribution Dominating Region 0 b 1 b 2 p j1 p j2 Max corner of data space p 1 p 2 tp j

Dominating Ability HotelPriceRating h11207 h12405 h13807 h14804 h h HotelPriceRating h21603 h22802 h h h Two hotel relations –Price range ( ) –Smaller rating means better (1..10) Relation R 1 Relation R 2 (M org ) VDR (200-60)*(10-3)=980 (200-80)*(10-2)=960 ( )*(10-1)=720 − −

Estimated Dominating Region Over-estimation –VDR j =∏ k (max k -p jk ) –max k :pre-specified larger value Under-estimation –VDR j =∏ k (h k -p jk ) –h k :local maximum

Dynamic Filtering Tuples HotelPriceRating h31603 h32805 h HotelPriceRating h41802 h h Three hotel relations –M 4 -> M 3 -> M 1 Relation R 3 Relation R 4 (M org ) HotelPriceRating h11207 h12405 h13807 h14804 h h Relation R 1 VDR 31 =980VDR 41 =960

Query Log Mechanism To avoid the same query more than once on any device M i Add a tag cnt to query issued by M org – M i records/checks/updates –Processes and forwards only cnt log <cnt cnt can be a byte to save cost –A device can issue 256 queries –Reset after a period, say one day

Outline Introduction Problem Definition Skyline Queries in MANETs Optimizations on Mobile Devices Experimental Studies Conclusion

Dataset Storage Goals –Space efficient –Local processing efficient Operations –Spatial extent check Distinct coordinates –Attribute value comparison Floats Duplicates

Hybrid Storage Model Spatial coordinates –Real values –MBR i (x max, y max, x min, y min ) Attribute values –Ascending domains –IDs –Sort p 1 xyp1p1 …pnpn … … …j …………… k…3 Relation R i v0v0 v1v1 … vkvk Sorted domains … p1p1 pnpn v0v0 v1v1 … vjvj

Local Skyline Computing Sptial check –mindist(pos org, MBR i ) > d Skyline computing –Comparison of IDs instead of true values of float type –p 2 to p n only Update filtering tuple if necessary –Choose the one with larger VDR value

Assembly on Query Originator When M org receives SK i from others –Duplication elimination –False positive removing A simple nested loop is enough –Comparing coordinates Identify duplicates –Comparing attribute values Identify false positive reports from both SK org and SK i

Outline Introduction Problem Definition Skyline Queries in MANETs Optimizations on Mobile Devices Experimental Studies Conclusion

Experiment Parameters Number of mobile device3 2, 4 2, …, 10 2 Cardinality of global reln100K, 200K, …, 1000K Cardinality of local reln10K, 20K, …, 100K Local storage modelFlat, Hybrid Number of non-spatial attr2, 3, 4, 5 Non-spatial attri range[0.0, 9.9], [0,1000] Spatial extent of global reln1000 X 1000 Attribute distributionIndep., Anti-Correl. Query distance of interest100, 250, 500

Studies on Local Optimization HP iPAQ h6365 pocket PC –MS Windows Mobile 2003 –200MHz TI OMAP1510 processor –64MB SDRAM (55MB user accessible) SuperWaba –Java-based open-source platform for PDA and smartphone applications –

Time vs Local Cardinality Flat Storage vs Hybrid Storage Anti-Correlated vs Independent HS incurs less processing cost

Time vs Local Dimensionality Average of costs on both distributions –Coz they are close to each other HS still performs better

Performance in Simulation Simulated MANET –JiST-SWANS A Jave based MANET simulator –Pentium IV desktop PC MS Windows XP 2.99GHz CPU 1 GB memory

Settings Device setting –Data partitioned and allocated to devices using a grid of m 1/2 by m 1/2 –1-5 queries per device MANET settings –Total simulation time: 2 hours –Speed range: 2 unit/s – 10 unit/s –Holding time: 120 seconds –Wireless routing protocol: AODV

Data Reduction Efficiency Data Reduction Rate – –SK i ’ is the local skyline after filtering Pre-tests in static setting –Forwarding query out recursively –Findings No significant difference between exact VDR and estimated VDRs Dynamic filtering is more powerful

Data Reduction Rate

Response Time - BF Breadth-First query forwarding –Parallel Time receiving answers from 80% other devices –Cannot ensure all devices are always reachable and available in MANETs M org M4M4 M1M1 M5M5 M2M2 M3M3 Query message Result message

Response Time - DF Depth-First forwarding –Serialized Query ends when originator finds all neighbors have processed the query M4M4 M org M1M1 M5M5 M2M2 M3M3 Query message Result message

Response Time

Query Message Count Only mobile device number affects the query message count obviously Better performance of BF is not free

Outline Introduction Problem Definition Skyline Queries in MANETs Optimizations on Mobile Devices Experimental Studies Conclusion

Conclusion Problem setting –MANET of lightweight devices –Skyline queries with spatial constraints Solution highlights –Filtering based distributed query processing strategy to reduce communication cost –Specialized local storage and algorithm to speed up local processing –Experimentally verified performance

Q & A Thanks!