1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by.

Slides:



Advertisements
Similar presentations
Efficient Evaluation of k-Range Nearest Neighbor Queries in Road Networks Jie BaoChi-Yin ChowMohamed F. Mokbel Department of Computer Science and Engineering.
Advertisements

Supporting top-k join queries in relational databases Ihab F. Ilyas, Walid G. Aref, Ahmed K. Elmagarmid Presented by Rebecca M. Atchley Thursday, April.
Bio Michel Hanna M.S. in E.E., Cairo University, Egypt B.S. in E.E., Cairo University at Fayoum, Egypt Currently is a Ph.D. Student in Computer Engineering.
Query Execution, Concluded Zachary G. Ives University of Pennsylvania CIS 550 – Database & Information Systems November 18, 2003 Some slide content may.
Danzhou Liu Ee-Peng Lim Wee-Keong Ng
A Generic Framework for Monitoring Continuous Spatial Queries over Moving Objects.
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Part C Part A:  Index Definition in SQL  Ordered Indices  Index Sequential.
CPSC 335 Dr. Marina Gavrilova Computer Science University of Calgary Canada.
Chapter 11 Indexing and Hashing (2) Yonsei University 2 nd Semester, 2013 Sanghyun Park.
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B+-Tree Index Files B-Tree.
Join Processing in Databases Systems with Large Main Memories
Mohamed F. Mokbel University of Minnesota
Department of Computer Science Spatio-Temporal Histograms Hicham G. Elmongui*Mohamed F. Mokbel + Walid G. Aref* *Purdue University, Department of Computer.
Spatio-temporal Databases Time Parameterized Queries.
On Reducing Communication Cost for Distributed Query Monitoring Systems. Fuyu Liu, Kien A. Hua, Fei Xie MDM 2008 Alex Papadimitriou.
Spatio-Temporal Databases
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Spatio-Temporal Databases. Introduction Spatiotemporal Databases: manage spatial data whose geometry changes over time Geometry: position and/or extent.
Chapter 10: Stream-based Data Management Title: Design, Implementation, and Evaluation of the Linear Road Benchmark on the Stream Processing Core Authors:
Approximate querying about the Past, the Present, and the Future in Spatio-Temporal Databases Jimeng Sun, Dimitris Papadias, Yufei Tao, Bin Liu.
Temporal Databases. Outline Spatial Databases Indexing, Query processing Temporal Databases Spatio-temporal ….
1 Continuous k-dominant Skyline Query Processing Presented by Prasad Sriram Nilu Thakur.
Chapter 3: Data Storage and Access Methods
1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept.
Scalable Network Distance Browsing in Spatial Database Samet, H., Sankaranarayanan, J., and Alborzi H. Proceedings of the 2008 ACM SIGMOD international.
CSCI 5708: Query Processing I Pusheng Zhang University of Minnesota Feb 3, 2004.
Indexing Spatio-Temporal Data Warehouses Dimitris Papadias, Yufei Tao, Panos Kalnis, Jun Zhang Department of Computer Science Hong Kong University of Science.
Evaluation of Top-k OLAP Queries Using Aggregate R-trees Nikos Mamoulis (HKU) Spiridon Bakiras (HKUST) Panos Kalnis (NUS)
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Sorting and Query Processing Zachary G. Ives University of Pennsylvania CIS 550 – Database & Information Systems November 29, 2005.
Image Based Positioning System Ankit Gupta Rahul Garg Ryan Kaminsky.
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
Processing Monitoring Queries on Mobile Objects Lecture for COMS 587 Department of Computer Science Iowa State University.
Dept. of Electrical Engineering and Computer Science, Northwestern University Context-Aware Optimization of Continuous Query Maintenance for Trajectories.
Optimizing Query Execution Zachary G. Ives University of Pennsylvania CIS 650 – Implementing Data Management Systems September 18, 2008 Content on hashing.
Systems and Internet Infrastructure Security (SIIS) LaboratoryPage Systems and Internet Infrastructure Security Network and Security Research Center Department.
Efficient Processing of Top-k Spatial Preference Queries
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
Bin Yao (Slides made available by Feifei Li) R-tree: Indexing Structure for Data in Multi- dimensional Space.
Supporting Top-k join Queries in Relational Databases Ihab F. Ilyas, Walid G. Aref, Ahmed K. Elmagarmid Presented by: Z. Joseph, CSE-UT Arlington.
Wei-Shinn Ku Slide 1 Auburn University Computer Science and Software Engineering Query Integrity Assurance of Location-based Services Accessing Outsourced.
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
Monitoring k-NN Queries over Moving Objects Xiaohui Yu University of Toronto Joint work with Ken Pu and Nick Koudas.
Presentation Template KwangSoo Yang Florida Atlantic University College of Engineering & Computer Science.
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept. of Electronic.
February 4, Location Based M-Services Soon there will be more on-line personal mobile devices than on-line stationary PCs. Location based mobile-services.
Page 1 A Platform for Scalable One-pass Analytics using MapReduce Boduo Li, E. Mazur, Y. Diao, A. McGregor, P. Shenoy SIGMOD 2011 IDS Fall Seminar 2011.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
1 Complex Spatio-Temporal Pattern Queries Cahide Sen University of Minnesota.
CS 440 Database Management Systems Lecture 5: Query Processing 1.
Chapter 9: Web Services and Databases Title: NiagaraCQ: A Scalable Continuous Query System for Internet Databases Authors: Jianjun Chen, David J. DeWitt,
AegisDB: Integrated realtime geo-stream processing and monitoring system Chengyang Zhang Computer Science Department University of North Texas.
CS 540 Database Management Systems
Spatio-Temporal Databases. Term Project Groups of 2 students You can take a look on some project ideas from here:
1 Overview of Query Evaluation Chapter Outline  Query Optimization Overview  Algorithm for Relational Operations.
Presenters : Virag Kothari,Vandana Ayyalasomayajula Date: 04/21/2010.
Dense-Region Based Compact Data Cube
Database Performance Tuning and Query Optimization
Spatio-temporal Pattern Queries
Join Processing in Database Systems with Large Main Memories (part 2)
On Spatial Joins in MapReduce
Spatio-Temporal Databases
Presented by: Mahady Hasan Joint work with
Chapter 11 Database Performance Tuning and Query Optimization
Spatio-Temporal Histograms
Efficient Processing of Top-k Spatial Preference Queries
Presentation transcript:

1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by Nilu Thakur Prasad Sriram SIGMOD 2004 June 13-18, Paris, France.

2 Outline Introduction Problem definition Contributions Key Concepts Shared Execution Hashing Invalidation Joining Validations Assumptions Rewrite Today

3 Introduction (1/3) Moving query on stationary objects Find the nearest gas station(s) within 1 miles of moving red car

4 Introduction (2/3) Another Example…. Moving query on moving objects Continuously find all police cars within 3 miles of the moving red car

5 Introduction (3/3) Another Example…. Stationary query on moving objects Continuously find all vehicles within 1 miles of my house My House

6 Problem Definition Input Given a large number of mobile/stationary objects and continuous spatio-temporal queries Output Produce fast, complete and correct results Objective Continuous evaluation Scalability in terms of number of queries Report only updates to previous answer Constraints Any delay in query response might result in outdated answer Limited Network bandwidth

7 Contributions  Shared execution paradigm  Groups similar queries in a query table  Spatial join between moving queries and moving objects  Differ from previous approaches of using R-tree and Q index structure for moving query on moving object (Instead uses spatial join assuming no indexing structure)  Incremental evaluation (Most Significant)  Maintains an in-memory table to store positive and negative updates  Negative updates may cancel previous positive update & vice versa  Sends a set of updates to queries every ‘T’ time

8 Spatial join between moving objects & moving queries Key concept: Shared execution

9 Shared Spatio- temporal Join. Q1Q1 +/- Split. Q2Q2 +/-. QNQN +/- Stream of Moving Objects Stream of Moving Queries Shared Operator Shared Memory Buffer among all C. Queries Stationary Range +/-... +/-... Q1Q1 Q2Q2 QNQN Stream of Moving Objects Moving kNN Moving Range Shared Execution Slides Courtesy [Mokbel et al]

10 Shared Execution Spatial join can use R-tree index for stationary objects Q-index can be used for stationary queries No index structure when both query and object are moving Incremental Evaluation: Hashing Invalidation Joining Key concepts: (continued)

11 State diagram of SINA In- Memory Hashing Stream of moving objects & moving queries DISK Memory Full or Timeout Incremental Result Invalidatio n Memory Full or Timeout Memory- disk Join Send Incremental results to queries Negative & positive update Q1 Q2………Qn-1Qn Done Negative update Positive update HASHINGINVALIDATIONJOINING

12 Key concept: An example to understand Q1-Q5 represents 5 continuous Range Queries P1-p9 represents objects, White circle: Moving objects (p1,p2,p3,p4) black circle: Non-moving objects dashed line represents moving queries(q1, q3, q5)

13 Key concept: Step I-Hashing  Two in-memory hash table with N buckets for storing moving objects & moving queries  One in-memory query table to keep track of upper left and lower right corners of query region  Hashing --> probing -->storing --> (q3,+p2) reported

14 Key Concept: Step II-Invalidation Map objects and queries to one or more disk-based N*N grid cells Flush out the buckets containing moved objects and queries If object maps to same grid then the object has not moved Else Add the object entry in this grid cell Look for queries that contain this object. Remove these objects from the queries by sending negative updates. Repeat the same procedure for invalidating queries. Query entry Object entry

15 Key concept: Step III- Joining No additional data structure Two spatial join operations for each grid cell Join in-memory objects with in-disk queries Join in-memory moving queries with in-disk objects Send updated answers to clients Clear all memory data structures

16 Performance Analysis (1) Answer size Impact of Grid Size N

17 Performance Analysis (2) Scalability with number of objects Scalability with number of queries

18 Performance Analysis (3) % of moving objects Scalability with update rates

19 Extensibility Querying the future K-Nearest Neighbor queries Aggregate queries Out-of-Sync clients

20 Assumptions  No computational capabilities on the Client side  No Storage capabilities on the client side  Both the assumptions are fair considering that many times client uses cheap, low battery and passive devices that do not have computational or storage capabilities.  No velocity Assumptions.  Optimal time interval for sending updates to queries set to 10 seconds.

21 Validations Methodology  Experiments performed on synthetic data  Used Network-Based Generator of Moving Objects  Input to generator is road map of city of Oldenburg, Germany  Theorem-Proving Validation criteria  Comparison with other non-incremental algorithms based on  Size of the results  Impact of grid size  Scalability with number of objects  Performance in terms of CPU and I/O time Advantages  Very much appropriate to check correctness & efficiency of proposed algorithm where rich datasets with various problem features are not available. Disadvantages  Real world conditions might differ from experimental results

22 Rewrite today Assumptions: No unreasonable assumptions made. In fact, removes some previous assumptions made by other related techniques Preservations Incremental way of sending updates Shared execution Not having assumptions about computational capabilities of client Improvements Incorporate some techniques to determine the optimal ‘T’ i.e., time between sending updates Through experiments Learning based on the past statistics about how valid the previous updates were Extend to handle queries involving huge object histories