Download presentation
Presentation is loading. Please wait.
1
Managing Massive Trajectories on the Cloud
Jie Bao, Ruiyuan Li, Xiuwen Yi, Yu Zheng Urban Computing Group Microsoft Research, China
2
Motivation Bridge the gap between massive trajectory data and
urban computing applications Urban Computing Applications Massive Trajectory Data Enabling large scale storage & analysis real-time service providing Cloud-based Trajectory Data Management Platform (Our Work)
3
Key functionalities Cloud computing platforms do not support trajectory queries well ID-temporal query: trajectory ID + time period -> trajectory segments ST-Range query: Spatio-temporal range -> partial trajectory segments Map-Matching: trajectory -> road segments Reverse trajectory access: road segments -> trajectories ID-Temporal Queries ST-Range Queries Trajectory Map-Matching Call for spatial and spatio-temporal indexing structures and retrieval algorithms
4
Azure Preliminary Azure Storage Azure Parallel Computing - HDinsght
Azure Blob (Azure Files) Azure Table (Key-Value Storage) Efficient for key-based access Efficient for range access within the same partition Azure Parallel Computing - HDinsght Azure Hadoop Azure Spark Azure Storm Distributed streaming system
5
System Overview
6
Trajectory Storage Step 1: Pre-Processing Step 2: Trajectory Store
7
Trajectory ST-indexing
An extra spatio-temporal copy It may incur multiple accesses to multiple trajectory table (not efficient) Storage pricing is Cheap in Microsoft Azure (less than 0.1 USD per 100TB/month) Storage Schema Spatial Partition -> Table Temporal Range -> Partition Keys
8
Trajectory Map-Matching
Important function for many urban applications Traffic inference, route recommendations, and … Challenges: Huge Volume Complex Computation Real-time requirement
9
Experiments Data Sets Real-time trajectory feed from ~7,000 taxi cabs in City of Guiyang, China
10
Experiments Trajectory Storage
11
Experiments Parallel Map-Matching Limitation on degree of parallelism
12
Case Study 1 Taxi Trajectory Data Management
Plate-temporal Query ID-temporal Query Spatio-temporal Range Query OD-based Trajectory Query
13
Conclusion We provide a first attempt on building a holistic cloud-based system on managing massive trajectories. It is used extensively in internally in Urban Computing group We learnt a lot of lessons during the system implementation We will add more functions on querying trajectories, add more indexing structures, e.g., R-tree, Quad-tree We hope it will be included as a standard component in Microsoft Azure to facilitate more spatio- temporal applications
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.