Presentation is loading. Please wait.

Presentation is loading. Please wait.

Managing Massive Trajectories on the Cloud

Similar presentations


Presentation on theme: "Managing Massive Trajectories on the Cloud"— Presentation transcript:

1 Managing Massive Trajectories on the Cloud
Jie Bao, Ruiyuan Li, Xiuwen Yi, Yu Zheng Urban Computing Group Microsoft Research, China

2 Motivation Bridge the gap between massive trajectory data and
urban computing applications Urban Computing Applications Massive Trajectory Data Enabling large scale storage & analysis real-time service providing Cloud-based Trajectory Data Management Platform (Our Work)

3 Key functionalities Cloud computing platforms do not support trajectory queries well ID-temporal query: trajectory ID + time period -> trajectory segments ST-Range query: Spatio-temporal range -> partial trajectory segments Map-Matching: trajectory -> road segments Reverse trajectory access: road segments -> trajectories ID-Temporal Queries ST-Range Queries Trajectory Map-Matching  Call for spatial and spatio-temporal indexing structures and retrieval algorithms

4 Azure Preliminary Azure Storage Azure Parallel Computing - HDinsght
Azure Blob (Azure Files) Azure Table (Key-Value Storage) Efficient for key-based access Efficient for range access within the same partition Azure Parallel Computing - HDinsght Azure Hadoop Azure Spark Azure Storm Distributed streaming system

5 System Overview

6 Trajectory Storage Step 1: Pre-Processing Step 2: Trajectory Store

7 Trajectory ST-indexing
An extra spatio-temporal copy It may incur multiple accesses to multiple trajectory table (not efficient) Storage pricing is Cheap in Microsoft Azure (less than 0.1 USD per 100TB/month) Storage Schema Spatial Partition -> Table Temporal Range -> Partition Keys

8 Trajectory Map-Matching
Important function for many urban applications Traffic inference, route recommendations, and … Challenges: Huge Volume Complex Computation Real-time requirement

9 Experiments Data Sets Real-time trajectory feed from ~7,000 taxi cabs in City of Guiyang, China

10 Experiments Trajectory Storage

11 Experiments Parallel Map-Matching Limitation on degree of parallelism

12 Case Study 1 Taxi Trajectory Data Management
Plate-temporal Query ID-temporal Query Spatio-temporal Range Query OD-based Trajectory Query

13 Conclusion We provide a first attempt on building a holistic cloud-based system on managing massive trajectories. It is used extensively in internally in Urban Computing group We learnt a lot of lessons during the system implementation We will add more functions on querying trajectories, add more indexing structures, e.g., R-tree, Quad-tree We hope it will be included as a standard component in Microsoft Azure to facilitate more spatio- temporal applications


Download ppt "Managing Massive Trajectories on the Cloud"

Similar presentations


Ads by Google