Mining Spatio-Temporal Reachable Regions over Massive Trajectory Data

Slides:



Advertisements
Similar presentations
Incremental Clustering for Trajectories
Advertisements

An Interactive-Voting Based Map Matching Algorithm
PRESS: A Novel Framework of Trajectory Compression in Road Networks
Urban Computing with Taxicabs
Spatial Database Systems. Spatial Database Applications GIS applications (maps): Urban planning, route optimization, fire or pollution monitoring, utility.
User Oriented Trajectory Similarity Search ACM SIGKDD International Workshop on Urban Computing (UrbComp 2012), August 12, Beijing, China Haibo.
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Constructing Popular Routes from Uncertain Trajectories Authors of Paper: Ling-Yin Wei (National Chiao Tung University, Hsinchu) Yu Zheng (Microsoft Research.
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman.
Travel Time Estimation of a Path using Sparse Trajectories
Critical Analysis Presentation: T-Drive: Driving Directions based on Taxi Trajectories Authors of Paper: Jing Yuan, Yu Zheng, Chengyang Zhang, Weilei Xie,
T-Drive : Driving Directions Based on Taxi Trajectories Microsoft Research Asia University of North Texas Jing Yuan, Yu Zheng, Chengyang Zhang, Xing Xie,
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
CAPS RoutePro Routing Environment. Solution Methods. Backhauls. Dispatcher Interface. Demonstration.
Dieter Pfoser, LBS Workshop1 Issues in the Management of Moving Point Objects Dieter Pfoser Nykredit Center for Database Research Aalborg University, Denmark.
Route Planning Vehicle navigation systems, Dijkstra’s algorithm, bidirectional search, transit-node routing.
Growing Charging Station Networks with Trajectory Data Analytics Yanhua Li 1, Jun Luo 2, Chi-Yin Chow 3, Kam-Lam Chan 3, Ye Ding 4, and Fan Zhang 2 1 U.
Trip Planning Queries F. Li, D. Cheng, M. Hadjieleftheriou, G. Kollios, S.-H. Teng Boston University.
Data Mining – Intro.
LAMI 1. User Oriented Trajectory Similarity Search 2. Calibration-free Localization using Relative Distance Estimations 3. From GPS Traces to a Routable.
Bei Pan (Penny), University of Southern California
Assembler Efficient Discovery of Spatial Co-evolving Patterns in Massive Geo-sensory Data Sheng QIAN SIGKDD 2015.
KNR-tree: A novel R-tree-based index for facilitating Spatial Window Queries on any k relations among N spatial relations in Mobile environments ANIRBAN.
Knowledge Discovery and Delivery Lab (ISTI-CNR & Univ. Pisa)‏ www-kdd.isti.cnr.it Anna Monreale Fabio Pinelli Roberto Trasarti Fosca Giannotti A. Monreale,
Energy-Aware Scheduling with Quality of Surveillance Guarantee in Wireless Sensor Networks Jaehoon Jeong, Sarah Sharafkandi and David H.C. Du Dept. of.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Siyuan Liu *#, Yunhuai Liu *, Lionel M. Ni *# +, Jianping Fan #, Minglu Li + * Hong Kong University of Science and Technology # Shenzhen Institutes of.
Traffic Prediction in a Bike-Sharing System
DISSECTING REGIONAL WEATHER-TRAFFIC SENSITIVITY THROUGHOUT A CITY Ye Ding 1, Yanhua Li 2, Ke Deng 3, Haoyu Tan 1, Mingxuan Yuan 4, Lionel M. Ni 5 1 Guangzhou.
Trajectory Data Mining
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
U of Minnesota DIWANS'061 Energy-Aware Scheduling with Quality of Surveillance Guarantee in Wireless Sensor Networks Jaehoon Jeong, Sarah Sharafkandi and.
Urban Traffic Simulated From A Dual Perspective Hu Mao-Bin University of Science and Technology of China Hefei, P.R. China
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
2013 IEEE 14th International Conference on Mobile Data Management Authors: 1. Jiansu Pu 2. Siyuan Liu 3. Ye Ding 4. Huamin Qu 5. Lionel Ni By: Farah Kamw.
TRACE ANALYSIS AND MINING FOR SMART CITIES By G. Pan Zhejiang Univ., Hangzhou, China G. Qi ; W. Zhang ; S. Li ; Z. Wu ; L. T. Yang.
Introduction to Vehicle Trajectory Processing Tools.
Click to edit Present’s Name AP-Tree: Efficiently Support Continuous Spatial-Keyword Queries Over Stream Xiang Wang 1*, Ying Zhang 2, Wenjie Zhang 1, Xuemin.
A Flexible Spatio-temporal indexing Scheme for Large Scale GPS Tracks Retrieval Yu Zheng, Longhao Wang, Xing Xie Microsoft Research.
Managing Massive Trajectories on the Cloud
Effective and Efficient: Large-scale Dynamic City Express
Prof. Yu-Chee Tseng Department of Computer Science
Lessons learned from Metro Vancouver
Presented by: Mi Tian, Deepan Sanghavi, Dhaval Dholakia
Data Mining – Intro.
Strategies for Spatial Joins
T-Share: A Large-Scale Dynamic Taxi Ridesharing Service
Introduction to Spatial Statistical Analysis
Pagerank and Betweenness centrality on Big Taxi Trajectory Graph
Urban Sensing Based on Human Mobility
1st November, 2016 Transport Modelling – Developing a better understanding of Short Lived Events Marcel Pooke – Operational Modelling & Visualisation Manager.
Network-Wide Bike Availability Clustering Using the College Admission Algorithm: A Case Study of San Francisco Bay Area Hesham Rakha, Ph.D., P.Eng. Samuel.
Clustering Uncertain Taxi data
Mining the Most Influential k-Location Set from Massive Trajectories
Spatio-temporal Pattern Queries
Spatial Online Sampling and Aggregation
Chao Zhang1, Yu Zheng2, Xiuli Ma3, Jiawei Han1
Divide Areas Algorithm For Optimal Multi-Robot Coverage Path Planning
Efficient Evaluation of k-NN Queries Using Spatial Mashups
Predicting Traffic Dmitriy Bespalov.
MATS Quantitative Methods Dr Huw Owens
Showcasing work by Jing Yuan, Yu Zheng, Xing Xie, Guangzhou Sun
Continuous Density Queries for Moving Objects
Topological Signatures For Fast Mobility Analysis
Efficient Processing of Top-k Spatial Preference Queries
Donghui Zhang, Tian Xia Northeastern University
Visually Analyzing Latent Accessibility Clusters of Urban POIs
Presentation transcript:

Mining Spatio-Temporal Reachable Regions over Massive Trajectory Data Guojun Wu#, Yichen Ding#, Yanhua Li#, Jie Bao†, Yu Zheng†, Jun Luo*,$ #Worcester Polytechnic Institute (WPI) †Microsoft Research *Shenzhen Institutes of Advanced Technologies $Lenovo Research at Hong Kong 1

Imagine This… Location-based Advertising Aim to arrange special activities for business and need to decide spatial regions to distribute coupons The reachable region is time-varying 6 p.m. 1 p.m. In today’s talk, I am focusing on managing and analyzing big trajectory data from urban networks. In an urban network, vehicles, mobile phone users serve as urban roving sensors delivering a big trajectory data. These trajectories characterize versatile moving patterns, and reflect the urban issues, such as traffic jam, malicious events, etc. location-based advertising, where some business owner finds out the potential spatial regions to arrange special activities, such as distributing coupons and sales discount Orange Road means traffic jam, while green road means clear. 2

Real-world Problems ( a ) Location-based Recommendation ( b ) Location-based Advertising (a) A shopping mall distributes its special activities within reachable area. (b) Friends arrange their entertainment activities as much as possible. (c) Two delivery stations manage their affiliated delivery sites and dispatch ( c ) Business Coverage Analysis ( d ) Emergency Response Analysis

State-of-the-art works Related Work 10 miles 15 min 10 miles ST Reachability Our work Free space: circle 10 mile radius Road Network: road length 10 mile in all directions, with different time periods ST-Method: spatio-temporal reachable region, consider time aspect, with a given time period Free Space Road Network State-of-the-art works

Spatio-Temporal Reachability Query Definition: Find the reachable area in a spatial network from a location in a given time period. Example: Start from my home at 8:00 AM, where I can reach in 30 minutes One starting location Multiple starting locations

Naïve Method: Exhaustive Search Start from the querying location S and time T, then search the neighboring road segments throughout the whole road network.

Query Processing Framework

Stage 1: Preprocessing Road Re-segmentation Map Matching Partition the original road segments based on the given spatial granularity Map Matching Map trajectory point to real road network Chop based on intersection Road segment length differs a lot, 1mile to several feet. Map points to where it should be, based on distance and direction.

Stage 2: Indexing Structure ST-Index B-tree temporal R-tree spatial Nodetime list (different dates) item in time listdisk page

Stage 2: Indexing Structure Connection Index Based on speed information from trajectory data, we have two list for one segment ,at least where you can get starting from this segment and at most where you can get.

Stage 3: Query Processing Single Location Maximum bounding region search Trace back search With the ST-Index and Connection Index, now we are able to process queries from the user. In this processing, we employ 2 main techniques. One is the maximum/minimum bounding region search which use our index structures to generate a rough estimation of the upper bound of the reachable region based on the query parameters. The other is the Trace Back Search which uses connection index to refine the region from the rough bounding region. For the single location spatio-temporal reachability queries, we first estimate the maximum bounding region and then trace back search starts from the maximum bounding region to minimum one until satisfy the probability requirement. In this figure, the 2 solid circles indicate the maximum and minimum bounding region, respectively. And the dashed circle represents the prob-reachable region with respect to Probability requirement. As the result, all road segments within this dashed circle is to answer the single location query. maximum bounding region

Stage 3: Query Processing Multiple Locations Unified maximum bounding region search Trace back search r1 r2 r2 r4 r3 Compared with single location, multiple locations query has multiple starting locations. The difference in query processing is to estimate the unified maximum bounding region to skip the overlapping region search. As shown in left figure, road segment r1 and r2 are starting locations. Solid line indicates the outer boundary of the unified maximum bounding region of both r1 and r2 while dashed line indicates the boundary of overlapping region. And r3 is on the boundary of r2 while r4 on the r1. However, r3 is not on the outer-most boundary while r4 is. The right figure indicates the trace back search which is similar to single location starts from maximum bounding region to inner minimum one. r1 Unified bounding region Unified maximum bounding region

Evaluation Dataset: 194 GB real human mobility data in a large city in eastern China Baseline Algorithm Exhaustive search Evaluation metric Processing Time & Reachable Road Length Statistics Value City Size 400 square miles City Population Size three million people Duration 30 days in November, 2014 Number of taxis 21,385 unique taxis Number of trajectories 400 million (407, 040, 083)

Evaluation Effects on query probability Prob Exhaustive search and duration L = {10,15} min Processing Time vs. Reachable Road Length Decrease 50% Time

Visualization (a) Prob = 20% (b) Prob = 60% (c) Prob = 80% (d) Prob = 100%

Evaluation Effects on Start Time T Duration L = {5,10} min Processing Time vs. Reachable Road Length Same trend => consistent with each other

Visualization

Evaluation Comparison of s-query and m-query Duration L = {5,10,…,35}min Number of locations # = {1,2,…,9} Up to 70% Time Up to 90% Time

Visualization

Summary Spatio-Temporal Reachability Query Index Construction Spatio-Temporal Index Connection Index Query Processing Single-location ST Reachability Query via SQMB algorithm + TBS algorithm Multi-location ST Reachability Query via MQMB algorithm + TBS algorithm

Thank you! Contact: Yanhua Li (yli15@wpi.edu) Website: http://wpi.edu/~yli15/