Effectively Indexing Uncertain Moving Objects for Predictive Queries School of Computing National University of Singapore Department of Computer Science.

Slides:



Advertisements
Similar presentations
Clustering k-mean clustering Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Advertisements

The A-tree: An Index Structure for High-dimensional Spaces Using Relative Approximation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa.
State Estimation and Kalman Filtering CS B659 Spring 2013 Kris Hauser.
Ranking Outliers Using Symmetric Neighborhood Relationship Wen Jin, Anthony K.H. Tung, Jiawei Han, and Wei Wang Advances in Knowledge Discovery and Data.
Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
Fast Algorithms For Hierarchical Range Histogram Constructions
Partially Observable Markov Decision Process (POMDP)
TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.
Indexing and Range Queries in Spatio-Temporal Databases
Similarity Search on Bregman Divergence, Towards Non- Metric Indexing Zhenjie Zhang, Beng Chi Ooi, Srinivasan Parthasarathy, Anthony K. H. Tung.
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Dynamic Bayesian Networks (DBNs)
Continuous Intersection Joins Over Moving Objects Rui Zhang University of Melbourne Dan Lin Purdue University Kotagiri Ramamohanarao University of Melbourne.
1 Efficient Subgraph Search over Large Uncertain Graphs Ye Yuan 1, Guoren Wang 1, Haixun Wang 2, Lei Chen 3 1. Northeastern University, China 2. Microsoft.
Indexing the imprecise positions of moving objects Xiaofeng Ding and Yansheng Lu Department of Computer Science Huazhong University of Science & Technology.
Probabilistic Threshold Range Aggregate Query Processing over Uncertain Data Wenjie Zhang University of New South Wales & NICTA, Australia Joint work:
A Crowd-Enabled Approach for Efficient Processing of Nearest Neighbor Queries in Incomplete Databases Samia Kabir, Mehnaz Tabassum Mahin Department of.
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Quantile-Based KNN over Multi- Valued Objects Wenjie Zhang Xuemin Lin, Muhammad Aamir Cheema, Ying Zhang, Wei Wang The University of New South Wales, Australia.
Approximating Sensor Network Queries Using In-Network Summaries Alexandra Meliou Carlos Guestrin Joseph Hellerstein.
ACM GIS An Interactive Framework for Raster Data Spatial Joins Wan Bae (Computer Science, University of Denver) Petr Vojtěchovský (Mathematics,
Probabilistic Similarity Search for Uncertain Time Series Presented by CAO Chen 21 st Feb, 2011.
Single Point of Contact Manipulation of Unknown Objects Stuart Anderson Advisor: Reid Simmons School of Computer Science Carnegie Mellon University.
1 Rasit Onur Topaloglu and Alex Orailoglu University of California, San Diego Computer Science and Engineering Department.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
ICDE A Peer-to-peer Framework for Caching Range Queries Ozgur D. Sahin Abhishek Gupta Divyakant Agrawal Amr El Abbadi Department of Computer Science.
Ensemble Learning (2), Tree and Forest
Particle Filters++ TexPoint fonts used in EMF.
Speed and Direction Prediction- based localization for Mobile Wireless Sensor Networks Imane BENKHELIFA and Samira MOUSSAOUI Computer Science Department.
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
Efficient Gathering of Correlated Data in Sensor Networks
Brian Renzenbrink Jeff Robble Object Tracking Using the Extended Kalman Particle Filter.
General Principle of Monte Carlo Fall 2013 By Yaohang Li, Ph.D.
Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach Wenjie Zhang, Xuemin Lin The University of New South Wales & NICTA Ming Hua,
Searching for Extremes Among Distributed Data Sources with Optimal Probing Zhenyu (Victor) Liu Computer Science Department, UCLA.
Probabilistic Reasoning for Robust Plan Execution Steve Schaffer, Brad Clement, Steve Chien Artificial Intelligence.
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
Efficient Processing of Top-k Spatial Preference Queries
A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION Yan Song, Sheng Tang, Yan-Tao Zheng, Tat-Seng Chua, Yongdong Zhang, Shouxun Lin.
Dual-Region Location Management for Mobile Ad Hoc Networks Yinan Li, Ing-ray Chen, Ding-chau Wang Presented by Youyou Cao.
Dr. Sudharman K. Jayaweera and Amila Kariyapperuma ECE Department University of New Mexico Ankur Sharma Department of ECE Indian Institute of Technology,
1 On Optimal Worst-Case Matching Cheng Long (Hong Kong University of Science and Technology) Raymond Chi-Wing Wong (Hong Kong University of Science and.
Clustering of Uncertain data objects by Voronoi- diagram-based approach Speaker: Chan Kai Fong, Paul Dept of CS, HKU.
Facility Location in Dynamic Geometric Data Streams Christiane Lammersen Christian Sohler.
Monitoring k-NN Queries over Moving Objects Xiaohui Yu University of Toronto Joint work with Ken Pu and Nick Koudas.
Approximate NN queries on Streams with Guaranteed Error/performance Bounds Nick AT&T labs-research Beng Chin Ooi, Kian-Lee Tan, Rui National.
Location Privacy Protection for Location-based Services CS587x Lecture Department of Computer Science Iowa State University.
Week Aug-24 – Aug-29 Introduction to Spatial Computing CSE 5ISC Some slides adapted from the book Computing with Spatial Trajectories, Yu Zheng and Xiaofang.
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
1 Complex Spatio-Temporal Pattern Queries Cahide Sen University of Minnesota.
Spatial Range Querying for Gaussian-Based Imprecise Query Objects Yoshiharu Ishikawa, Yuichi Iijima Nagoya University Jeffrey Xu Yu The Chinese University.
Cost Modeling of Spatial Query Operators Using Nonparametric Regression Songtao Jiang Department of Computer Science University of Vermont October 10,
RELIABLE DYNAMIC ANALYSIS OF TRANSPORTATION SYSTEMS Mehdi Modares, Robert L. Mullen and Dario A. Gasparini Department of Civil Engineering Case Western.
1 Queryy Sampling Based High Dimensional Hybrid Index Junqi Zhang, Xiangdong Zhou Fudan University.
Håkan L. S. YounesDavid J. Musliner Carnegie Mellon UniversityHoneywell Laboratories Probabilistic Plan Verification through Acceptance Sampling.
A Binary Linear Programming Formulation of the Graph Edit Distance Presented by Shihao Ji Duke University Machine Learning Group July 17, 2006 Authors:
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
Kalman Filter and Data Streaming Presented By :- Ankur Jain Department of Computer Science 7/21/03.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
Advanced Database Aggregation Query Processing
SIMILARITY SEARCH The Metric Space Approach
Probabilistic Data Management
Sameh Shohdy, Yu Su, and Gagan Agrawal
Probabilistic Data Management
Spatio-Temporal Databases
Probabilistic Data Management
DATABASE HISTOGRAMS E0 261 Jayant Haritsa
Lecture 4 - Monte Carlo improvements via variance reduction techniques: antithetic sampling Antithetic variates: for any one path obtained by a gaussian.
Efficient Processing of Top-k Spatial Preference Queries
CISE-301: Numerical Methods Topic 1: Introduction to Numerical Methods and Taylor Series Lectures 1-4: KFUPM CISE301_Topic1.
Presentation transcript:

Effectively Indexing Uncertain Moving Objects for Predictive Queries School of Computing National University of Singapore Department of Computer Science Aalborg University Meihui Zhang, Su Chen, Christian S. Jensen, Beng Chin Ooi, Zhenjie Zhang

Outline Introduction Uncertain Moving Object Model Movement Inference Model Indexing and Query Processing –Index structure –Probabilistic Range Query –k-PNN Query Experiments Conclusion

Outline Introduction Uncertain Moving Object Model Movement Inference Model Indexing and Query Processing –Index structure –Probabilistic Range Query –k-PNN Query Experiments Conclusion

Introduction Trends and applications –Positioning Systems, Wireless Communication, etc. –Intelligent Transport Systems, Location-Based Services, etc.

Motivation Existing work on moving object management Assumption: deterministic movement Real world –Limited accuracy –Complex and stochastic movement –…–… School bus in Athens metropolitan

Motivation Problem –Information gap between real movement and deterministic models Solution –Introduce an uncertainty model

Our contributions Uncertain moving object model –Take into account the uncertainties of both location and velocity Movement inference model –Infer the location distribution at t Ease of integration into existing index structures –Indexing –Query processing

Outline Introduction Uncertain Moving Object Model Movement Inference Model Indexing and Query Processing –Index structure –Probabilistic Range Query –k-PNN Query Experiments Conclusion

Uncertain Moving Object Model Discrete timestamps 2-D moving objects Uncertain moving object representation –Distributions –Instead of exact values

Distribution Representation Distribution –Domain discretization –Probability assigned to each cell –Uniform distribution assumption in each cell Location distributionVelocity distribution  Cells with non-zero probabilities

Outline Introduction Uncertain Moving Object Model Movement Inference Model Indexing and Query Processing –Index structure –Probabilistic Range Query –k-PNN Query Experiments Conclusion

Movement Inference Model Location distribution prediction –given the location and velocity distributions of o i, update time t u, query time t –derive a new location distribution for object o i at near- future time t Solutions –Rectangle inference –Monte Carlo simulation

Rectangle Inference Location distributionVelocity distribution

Rectangle Inference Location distributionVelocity distribution

Rectangle Inference tutu t u t u IR IR IR IR IR IR

Monte Carlo Simulation Randomized method to simulate the motion –Error rate , confidence , simulation number N –For each simulation Initial step: selects a random location following steps: pick up a velocity final step: returns location –Estimate the location distribution with the simulation results

Outline Introduction Uncertain Moving Object Model Movement Inference Model Indexing and Query Processing –Index structure –Probabilistic Range Query –k-PNN Query Experiments Conclusion

B x -Tree Use B + -tree to index moving objects Space filling curve –2-D location  1-D key value –2-D range query  Several 1-D range queries

Indexing Uncertain Moving Objects Index structure –Time domain partition –Two sub-trees –Reference time –Sub-trees roll over –Index each location cell with non-zero probability along with the probability and velocity distribution info

Indexing Uncertain Moving Objects Index update –Insertion Identify the sub-tree in which to insert Infer the location distribution at t ref Insert the spatial cells with non-zero probabilities –Deletion Locate the record in data file Identify the sub-tree Infer the location distribution Delete from the index

Velocity-Based Partitioning Uncertain  larger query expansion Tighten velocity bound s.t. decrease query expansion Velocity Minimal Bounding Rectangle (VMBR) Partition each sub-tree into K logical sub-trees –reduce VMBRs recorded at each sub-tree root

Query Processing Probabilistic Range Query –Given a spatial range R, a query time t, and a threshold θ, the probabilistic range query returns all uncertain moving objects falling into R with probability no smaller than θ at time t Top-k Probabilistic NN Query(k-PNN) –Given a query location q and a query time t, the k-PNN query returns k uncertain moving objects with the highest probabilities of being the nearest neighbor of q

Probabilistic Range Query Growing step –Issue range query on index –Construct a candidate object list Verification step –Rectangle inference works as a filter –Monte Carlo Simulation verifies

k-PNN Query Issue a series of circular region range queries Maintain –lower bound –upper bound –accumulated probability Terminate –k th highest lower bound > any other upper bound –accumulated probability = 1 q 

k-PNN Query q  PC(o 1,q,3  ) = 0.8 PC(o i,q,r) Probability of o i belonging to circle centered at q with radius r PR(o i,q,r 1,r 2 ) (r 1 < r 2 ) Probability of o i belonging to ring centered at q with radius between r 1 and r 2 o1o1 PR(o 1,q, ,2  ) = 0.2 PR(o 1,q,2 ,3  ) = 0.6

k-PNN Query q  iterationobjectacc i low i up i iterationobjectacc i low i up i iteration 2 iterationobjectacc i low i up i iteration 2 o1o iterationobjectacc i low i up i iteration 2 o1o o2o iterationobjectacc i low i up i iteration 2 o1o o2o o3o3 00 iterationobjectacc i low i up i iteration 2 o1o o2o o3o3 00 iteration 3 o1o o2o o3o iterationobjectacc i low i up i iteration 2 o1o o2o o3o3 00 iteration 3 o1o o2o o3o iteration 4 o1o o2o o3o PC(o 1,q,2  ) = 0.2 PC(o 2,q,2  ) =  1   1  1 0  0.8   1  1 PR(o 1,q,3 ,4  ) = 0.2 PC(o 1,q,3  ) =  0.4   0.4  PC(o 1,q,2  ) = 0.81-PC(o 1,q,3  ) = 0.2 PR(o 1,q, ,2  ) = 0.2 o1o1 PR(o 1,q,2 ,3  ) = 0.6 PR(o 2,q,2 ,3  ) = PC(o 2,q,2  ) = 11-PC(o 3,q,2  ) = 1 o3o3 PR(o 3,q,2 ,3  ) = PC(o 2,q,3  ) = 0.41-PC(o 3,q,3  ) = 0.9 o2o2 PR(o 2,q,3 ,4  ) = 0.3PR(o 3,q,3 ,4  ) = 0.7

Outline Introduction Uncertain Moving Object Model Movement Inference Model Indexing and Query Processing –Index structure –Probabilistic Range Query –k-PNN Query Experiments Conclusion

Experiments Synthetic data –Uniformly distribute locations –Randomly select directions and speeds –Model uncertainty by Gaussian distribution Performance study –Certain model vs. uncertain model –Efficiency tests

Certain Model vs. Uncertain Model Certain model –Simple linear motion function –Average velocity and location Measurement –Recall –Precision

Certain Model vs. Uncertain Model Varying probability threshold (a) Recall(b) Precision

Efficiency Tests Range query size –NP-tree: index w./o. velocity partition –VP-tree: index w. velocity partition (a) I/O cost(b) CPU cost

Efficiency Tests Range query time –NP-tree: index w./o. velocity partition –VP-tree: index w. velocity partition (a) I/O cost(b) CPU cost

Conclusion Inferring current/near-future uncertain locations from past uncertain velocity and location information Indexing the uncertain moving objects by means of an adapted B x -tree Processing probabilistic range and nearest neighbor queries

Thank You!