A Flexible Spatio-temporal indexing Scheme for Large Scale GPS Tracks Retrieval Yu Zheng, Longhao Wang, Xing Xie Microsoft Research.

Slides:



Advertisements
Similar presentations
Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
Advertisements

An Interactive-Voting Based Map Matching Algorithm
Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
03/20/2003Parallel IR1 Papers on Parallel IR Agenda Introduction Paper 1:Inverted file partitioning schemes in multiple disk systems Paper 2: Parallel.
1 Top-k Spatial Joins
Mining Mobile Group Patterns: A Trajectory-based Approach San-Yih Hwang, Ying-Han Liu, Jeng-Kuen Chiu NSYSU, Taiwan Ee-Peng Lim NTU, Singapore.
TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.
Indexing and Range Queries in Spatio-Temporal Databases
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
An Efficient Multi-Dimensional Index for Cloud Data Management Xiangyu Zhang Jing Ai Zhongyuan Wang Jiaheng Lu Xiaofeng Meng School of Information Renmin.
Query Processing of Massive Trajectory Data based on MapReduce Qiang Ma, Bin Yang (Fudan University) Weining Qian, Aoying Zhou (ECNU) Presented By: Xin.
Yoshiharu Ishikawa (Nagoya University) Yoji Machida (University of Tsukuba) Hiroyuki Kitagawa (University of Tsukuba) A Dynamic Mobility Histogram Construction.
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
Continuous Intersection Joins Over Moving Objects Rui Zhang University of Melbourne Dan Lin Purdue University Kotagiri Ramamohanarao University of Melbourne.
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
Travel Time Estimation of a Path using Sparse Trajectories
Indexing Network Voronoi Diagrams*
Stabbing the Sky: Efficient Skyline Computation over Sliding Windows COMP9314 Lecture Notes.
2-dimensional indexing structure
--Presented By Sudheer Chelluboina. Professor: Dr.Maggie Dunham.
Trajectories Simplification Method for Location-Based Social Networking Services Presenter: Yu Zheng on behalf of Yukun Cheng, Kai Jiang, Xing Xie Microsoft.
Spatio-Temporal Databases
Computer Science Spatio-Temporal Aggregation Using Sketches Yufei Tao, George Kollios, Jeffrey Considine, Feifei Li, Dimitris Papadias Department of Computer.
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Approximate querying about the Past, the Present, and the Future in Spatio-Temporal Databases Jimeng Sun, Dimitris Papadias, Yufei Tao, Bin Liu.
Chapter 3: Data Storage and Access Methods
Spatio-Temporal Databases. Introduction Spatiotemporal Databases: manage spatial data whose geometry changes over time Geometry: position and/or extent.
Probabilistic Skyline Operator over sliding Windows Wan Qian HKUST DB Group.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web Yu Zheng, Like Liu, Xing Xie Microsoft Research.
FLANN Fast Library for Approximate Nearest Neighbors
1 Indexing Large Trajectory Data Sets With SETI V.Prasad Chakka Adam C.Everspaugh Jignesh M.Patel University of Michigan Presented by Guangyue Jia.
Mining Interesting Locations and Travel Sequences From GPS Trajectories Yu Zheng and Xing Xie Microsoft Research Asia March 16, 2009.
Sensor Networks Storage Sanket Totala Sudarshan Jagannathan.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
Practical Database Design and Tuning. Outline  Practical Database Design and Tuning Physical Database Design in Relational Databases An Overview of Database.
AAU A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing Presented by YuQing Zhang  Slobodan Rasetic Jorg Sander James Elding Mario A.
Join-Queries between two Spatial Datasets Indexed by a Single R*-tree Join-Queries between two Spatial Datasets Indexed by a Single R*-tree Michael Vassilakopoulos.
1 SD-Rtree: A Scalable Distributed Rtree Witold Litwin & Cédric du Mouza & Philippe Rigaux.
Skyline Queries Against Mobile Lightweight Devices in MANETs Zhiyong Huang 1 Christian S. Jensen 2 Hua Lu 1 Beng Chin Ooi 1 1 National University of Singapore,
AAU Novel Approaches to the Indexing of Moving Object Trajectories Presented by YuQing Zhang  Dieter Pfoser Christian S. Jensen Yannis Theodoridis.
ECO-DNS: Expected Consistency Optimization for DNS Chen Stephanos Matsumoto Adrian Perrig © 2013 Stephanos Matsumoto1.
DIST: A Distributed Spatio-temporal Index Structure for Sensor Networks Anand Meka and Ambuj Singh UCSB, 2005.
Reporter : Yu Shing Li 1.  Introduction  Querying and update in the cloud  Multi-dimensional index R-Tree and KD-tree Basic Structure Pruning Irrelevant.
Efficient Processing of Top-k Spatial Preference Queries
VLDB 2006, Seoul1 Indexing For Function Approximation Biswanath Panda Mirek Riedewald, Stephen B. Pope, Johannes Gehrke, L. Paul Chew Cornell University.
An Efficient Linear Time Triple Patterning Solver Haitong Tian Hongbo Zhang Zigang Xiao Martin D.F. Wong ASP-DAC’15.
Observer Relative Data Extraction Linas Bukauskas 3DVDM group Aalborg University, Denmark 2001.
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
AQWA Adaptive Query-Workload-Aware Partitioning of Big Spatial Data Dimosthenis Stefanidis Stelios Nikolaou.
Attribute Allocation in Large Scale Sensor Networks Ratnabali Biswas, Kaushik Chowdhury, and Dharma P. Agrawal International Workshop on Data Management.
Spatio-Temporal Databases. Term Project Groups of 2 students You can take a look on some project ideas from here:
黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Exploring Spatial-Temporal Trajectory Model for Location.
Rethinking Choices for Multi-dimensional Point Indexing You Jung Kim and Jignesh M. Patel University of Michigan.
Spatial Approximate String Search. Abstract This work deals with the approximate string search in large spatial databases. Specifically, we investigate.
Spatio-Temporal Databases
T-Share: A Large-Scale Dynamic Taxi Ridesharing Service
Urban Sensing Based on Human Mobility
Mining Spatio-Temporal Reachable Regions over Massive Trajectory Data
Spatio-Temporal Databases
Efficient Cost Models for Spatial Queries Using R-Trees
Continuous Motion Pattern Query
Efficient Processing of Top-k Spatial Preference Queries
Presentation transcript:

A Flexible Spatio-temporal indexing Scheme for Large Scale GPS Tracks Retrieval Yu Zheng, Longhao Wang, Xing Xie Microsoft Research Asia

Outline Introduction Modeling user behavior Index design Experimental results Conclusion

Outline Introduction Modeling user behavior Index design Experimental results Conclusion

Introduction Background – GPS-enabled devices become prevalent – Large amount of GPS logs have been accumulated – Quite a few GPS-data-sharing applications appeared Spatio-temporal index is necessary – For system: to manage the potentially large-scale data – For users: to explore the GPS data interested them

Introduction Problem Definition – Retrieve the GPS trajectories across a given region and intersecting a given time span Present techniques are not optimized to these applications Spatial queryTemporal query

Introduction Our contributions – A stochastic process model: simulating user behavior of uploading GPS tracks Users prefer to upload data they created recently The insert frequency of different parts of index are skewed – A novel indexing scheme: optimized to the user behavior of uploading GPS tracks Smaller index size Minimal update efforts Satisfactory retrieval performance

Outline Introduction Modeling user behavior Index design Experimental results Conclusion

Modeling User Behavior A GPS track Duration of a GPS track Interval between trajectory created and uploaded

Modeling User Behavior Upload log file to server at time Tup Users’ arrival can be modeled as Poisson process T dur follows Gaussian distribution The interval between uploading time and end time of trajectory T int = Tup -Te Can be modeled as Rayleigh distribution Summarized from photos uploaded by multiple users over a period of 3 months on Flickr Ts Te T dur = Te -Ts GPS Log File

Modeling User Behavior A (Ts, Te) represents a GPS track

Outline Introduction Modeling user behavior Index design Experimental results Conclusion

Index Design Architecture – Partition space into disjoint grids – Maintain a temporal index for each grid – The temporal index (CSE-Tree) is special

Temporal Index (CSE-Tree) A GPS segment can be represented by a pair (Ts, Te) A point on two dimensional plane A temporal query is a time span (Time min, Time max )

Temporal index Structure – Partition the points into groups by Te – Build a start time index (B+ Tree) to index points of each group – Build a end time index (B+ Tree) to index groups Ts Te t1 t2t2 ti ti+1

Temporal Index (CSE-Tree) Three operations – Insert – Compress – Search

Temporal Index (CSE-Tree) Compress operation – Occur when update frequency drops to some extent – Convert B+ tree to dynamic array dynamic array B+ Tree

Temporal Index (CSE-Tree) Search operation – Te> Time min : Search End Time index to get the corresponding start time indexes – Ts< Time max : Look up each start time index candidate to find the correct points

Outline Introduction Modeling user behavior Index design Experimental results Conclusion

Experimental Settings Platform – PC with 3.00 GHz Intel Pentium 4 CPU, Windows XP SP2 platform, and 0.99 GB RAM Parameters – B+ tree: Inner node size is 64 bytes Leaf size 1024 bytes – Poisson process: 100, 300, 500 and 700 – Total duration of the process is 2400 hours (100 days) – Rayleigh distribution: T int is – Normal distribution of Tdur: mean (0.42), variance (0.98).

Experimental Results The compress operation saves index size – No overlap between nodes – B+ tree  Dynamic array Index size comparison

Experimental Results Insert efforts – Less node access than both SEB-tree and R-tree – Most inserts occur in the area surrounded by the broken line – Few node access in End Time Tree Mean number of node access in one insertion

Experimental Results Query performance Mean number of node access in one query

Conclusion A model simulating user behavior of upload data – Based on stochastic process theory – statistical analysis on the data collection in real world CSE-Tree – Smaller index size – Less node access in insertion – Slightly more node access than SEB-tree in query

Thanks! Q&A