Showcasing work by Jing Yuan, Yu Zheng, Xing Xie, Guangzhou Sun

Slides:



Advertisements
Similar presentations
15th CTI Workshop, July 26, Smart Itinerary Recommendation based on User-Generated GPS Trajectories Hyoseok Yoon 1, Y. Zheng 2, X. Xie 2 and W.
Advertisements

Problem solving with graph search
ADAPTIVE FASTEST PATH COMPUTATION ON A ROAD NETWORK: A TRAFFIC MINING APPROACH Hector Gonzalez, Jiawei Han, Xiaolei Li, Margaret Myslinska, John Paul Sondag.
An Interactive-Voting Based Map Matching Algorithm
Driving with Knowledge from the Physical World Jing Yuan, Yu Zheng Microsoft Research Asia.
Urban Computing with Taxicabs
On Map-Matching Vehicle Tracking Data
NUS CS5247 Motion Planning for Camera Movements in Virtual Environments By Dennis Nieuwenhuisen and Mark H. Overmars In Proc. IEEE Int. Conf. on Robotics.
Yu Stephanie Sun 1, Lei Xie 1, Qi Alfred Chen 2, Sanglu Lu 1, Daoxu Chen 1 1 State Key Laboratory for Novel Software Technology, Nanjing University, China.
Constructing Popular Routes from Uncertain Trajectories Authors of Paper: Ling-Yin Wei (National Chiao Tung University, Hsinchu) Yu Zheng (Microsoft Research.
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
Travel Time Estimation of a Path using Sparse Trajectories
Adaptive Fastest Path Computation on a Road Network : A Traffic Mining Approach Hector Gonzalez Jiawei Han Xiaolei Li Margaret Myslinska John Paul Sondag.
Critical Analysis Presentation: T-Drive: Driving Directions based on Taxi Trajectories Authors of Paper: Jing Yuan, Yu Zheng, Chengyang Zhang, Weilei Xie,
T-Drive : Driving Directions Based on Taxi Trajectories Microsoft Research Asia University of North Texas Jing Yuan, Yu Zheng, Chengyang Zhang, Xing Xie,
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Trajectories Simplification Method for Location-Based Social Networking Services Presenter: Yu Zheng on behalf of Yukun Cheng, Kai Jiang, Xing Xie Microsoft.
August 7, 2003 Virtual City : A Heterogeneous System Model of an Intelligent Road Navigation System Incorporating Data Mining Concepts Mike Kofi Okyere.
1 Techniques for Efficient Road- Network-Based Tracking of Moving Objects Speaker : Jia-Hui Huang Date : 2006/10/23.
Tracking Moving Objects in Anonymized Trajectories Nikolay Vyahhi 1, Spiridon Bakiras 2, Panos Kalnis 3, and Gabriel Ghinita 3 1 St. Petersburg State University.
Route Planning Vehicle navigation systems, Dijkstra’s algorithm, bidirectional search, transit-node routing.
Map Analysis with Networks Francisco Olivera, Ph.D., P.E. Department of Civil Engineering Texas A&M University Some of the figures included in this presentation.
Trip Planning Queries F. Li, D. Cheng, M. Hadjieleftheriou, G. Kollios, S.-H. Teng Boston University.
The Shortest Path Problem
Reducing Uncertainty of Low-sampling-rate Trajectories Kai Zheng, Yu Zheng, Xing Xie, Xiaofang Zhou University of Queensland & Microsoft Research Asia.
Sensys 2009 Speaker:Lawrence.  Introduction  Overview & Challenges  Algorithm  Travel Time Estimation  Evaluation  Conclusion.
Bei Pan (Penny), University of Southern California
Mining Interesting Locations and Travel Sequences from GPS Trajectories IDB & IDS Lab. Seminar Summer 2009 강 민 석강 민 석 July 23 rd,
1 Optimal Cycle Vida Movahedi Elder Lab, January 2008.
The Application of The Improved Hybrid Ant Colony Algorithm in Vehicle Routing Optimization Problem International Conference on Future Computer and Communication,
Efficient Route Computation on Road Networks Based on Hierarchical Communities Qing Song, Xiaofan Wang Department of Automation, Shanghai Jiao Tong University,
Elastic Pathing: Your Speed Is Enough to Track You Presented by Ali.
Finding Top-k Shortest Path Distance Changes in an Evolutionary Network SSTD th August 2011 Manish Gupta UIUC Charu Aggarwal IBM Jiawei Han UIUC.
Group 8: Denial Hess, Yun Zhang Project presentation.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Trajectory Data Mining
Trends: Spatio-temporal graphs Introduction to Spatial Computing.
Capstone Project Fall Course Information Instructor Ye Zhao –Office: MSB 220 – Fall 2015 (MSB162) –Time: Tue, Thu 10:45am.
Trajectory Data Mining
DYNAMICALLY COMPUTING FASTEST PATHS FOR INTELLIGENT TRANSPORTATION SYSTEMS MEERA KRISHNAN R.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
1 Travel Times from Mobile Sensors Ram Rajagopal, Raffi Sevlian and Pravin Varaiya University of California, Berkeley Singapore Road Traffic Control TexPoint.
C ROWD P LANNER : A C ROWD -B ASED R OUTE R ECOMMENDATION S YSTEM Han Su, Kai Zheng, Jiamin Huang, Hoyoung Jeung, Lei Chen, Xiaofang Zhou.
REU 2009-Traffic Analysis of IP Networks Daniel S. Allen, Mentor: Dr. Rahul Tripathi Department of Computer Science & Engineering Data Streams Data streams.
Traffic Models Alaa Hleihel Daniel Mishne /32.
Presented by: Shahab Helmi Spring 2016
T-Share: A Large-Scale Dynamic Taxi Ridesharing Service
Constraint-Based Routing
Pagerank and Betweenness centrality on Big Taxi Trajectory Graph
Computing and Compressive Sensing in Wireless Sensor Networks
Network Analysis with ArcGIS Online
Mining Spatio-Temporal Reachable Regions over Massive Trajectory Data
Parametric calibration of speed–density relationships in mesoscopic traffic simulator with data mining Adviser: Yu-Chiang Li Speaker: Gung-Shian Lin Date:2009/10/20.
Motion Planning for Multiple Autonomous Vehicles
Design and Analysis of Algorithms (07 Credits / 4 hours per week)
ECE 544 Protocol Design Project 2016
Statistical Optimal Hash-based Longest Prefix Match
1 Department of Engineering, 2 Department of Mathematics,
CS 548 Sequence Mining Showcase By Bian Du, Wa Gao, and Cam Jones
Efficient Evaluation of k-NN Queries Using Spatial Mashups
Finding Fastest Paths on A Road Network with Speed Patterns
Predicting Traffic Dmitriy Bespalov.
1 Department of Engineering, 2 Department of Mathematics,
Unit 4: Dynamic Programming
1 Department of Engineering, 2 Department of Mathematics,
CS & CS Capstone Project & Software Development Project
Continuous Density Queries for Moving Objects
Topological Signatures For Fast Mobility Analysis
A Better Approximation for Minimum Total Routing Path Clustering Problem in 2-D Underwater Sensor Networks Wei Wang, Donghyun Kim, and Weili Wu, A Better.
Design and Analysis of Algorithms (04 Credits / 4 hours per week)
Presentation transcript:

CS548 Fall 2018 Clustering Showcase by Yingxue Zhang, Mo Cheng, Ashish Gurung, jake scheide Showcasing work by Jing Yuan, Yu Zheng, Xing Xie, Guangzhou Sun on “T-Drive: Enhancing Driving Directions with Taxi Drivers’ Intelligence”

References [1] J. Yuan, Y. Zheng, C. Zhang, W. Xie, G. Sun, H. Yan, and X. Xie, “T-Drive: Driving Directions Based on Taxi Trajectories,” Proc.18th SIGSPATIAL Int’l Conf. Advances in Geographic Information Systems (GIS), 2010. [2] T. Hunter, R. Herring, P. Abbeel, and A. Bayen, “Path and Travel Time Inference from GPS Probe Vehicle Data,” Proc. Neural Information Processing Systems (NIPS), 2009. [3] Y. Lou, C. Zhang, Y. Zheng, X. Xie, W. Wang, and Y. Huang, “Map-Matching for Low-Sampling-Rate GPS Trajectories,” Proc. Int’l Conf. Advances in Geographic Information Systems (GIS), 2009. [4] J. Yuan, Y. Zheng, C. Zhang, and X. Xie, “An Interactive-Voting Based Map Matching Algorithm,” Proc. Int’l Conf. Mobile Data Management (MDM), 2010. [5] E. Kanoulas, Y. Du, T. Xia, and D. Zhang, “Finding Fastest Paths on a Road Network with Speed Patterns,” Proc. Int’l Conf. Data Eng. (ICDE), 2006. [6] A. Orda and R. Rom, “Shortest-Path and Minimum-Delay Algorithms in Networks with Time- Dependent Edge-Length,” J. ACM, vol. 37, no. 3, pp. 607-625, 1990. [7] N. Leibowitz, B. Baum, G. Enden, and A. Karniel, “The Exponential Learning Equation as a Function of Successful Trials Results in Sigmoid Performance,” J. Math. Psychology, vol. 54, no. 3, pp. 338-340, 2010.

Motivation Find fast driving routes for drivers A good routing service Physical feature of a route Time-dependent traffic flow Driving behavior Cloud-based cyber-physical system Use taxi trajectories Compute practically fast routes

Process Introduction Taxis are used as mobile sensors probing the traffic rhythm Aggregate and mine the info from taxis as well as other sources from Internet in cloud The knowledge in the cloud is used in turn to serve Internet users and ordinary drivers. A mobile client accepts a user’s query, communicates with the cloud, and presents the result to the user

Data Collection Road network of Beijing Taxi trajectories: 4.96 million trajectories generated by over 33,000 taxis over three months. A taxi trajectory is a sequence of GPS points pertaining to one trip. Each point consists of a longitude, latitude, and a time stamp

Data Preprocessing Convert the GPS points to trajectories representing individual trips according to the taximeter’s transaction records Convert the trajectories to road segments sequences

Landmark Graph Landmark: top-k rod segments that are frequently traversed by taxi drivers according to the trajectories Convert the road segments sequences to landmark sequences

VE-Clustering The travel time of an landmark edge was found to be dependent upon the following factors: number of traffic lights on the route different routes chosen by different drivers travelling the landmark edge drivers personal behavior, skill and preferences Making the road network dynamic in nature. Earlier papers[5][6] based their assumptions on the fact that travel time is dependent on time of the day. A new approach that took into account the Variance and Entropy of the data for clustering was used.

V-Clustering Variance Clustering: The travel times are clustered into several categories based on their variance. Fig(b) shows the clustering of time taken to traverse a specific landmark edge.

V-Clustering Arrange the travel times taken to traverse a landmark edge in ascending order Partition the list into multiple sub-lists in a binary recursive way  Calculate the variance of all the travel times minimal weighted average variance (WAV) is used to find the best split point Formula, Recursion stopping condition: [maximum decrease in variance < threshold value]

E-Clustering Entropy Clustering: Information gain was used to determine a time partition for each landmark edge. E-Clustering aims to split time of day into several time slots such that travel time has a stable distribution. E-Clustering used the same approach as the one used for V clustering. Instead of WAV we have Weighted Average Entropy(WAE) The clusters generated in V-Clustering is used to determine the proportion during the calculation of information entropy.

VE-Clustering Here, figure (a) shows the travel time for a landmark edge against time of the day for various drivers Figure (b) is the visualization of the data after V-clustering Figure(c) shows the final result after E-clustering is implemented

Route Computing Drivers take a different amount of time to drive the same stretch of road at the same time A custom factor between 0 and 1 models driver speed through an identical route A route is computed by searching for the 3 nearest landmarks to the start node, and creating m x m landmark pairs Fastest route is determined through Dijkstra's algorithm generalization For example, a custom factor = 0.7 travels a route in 272 seconds, whereas a custom factor of 0.3 travels the identical route at the same time in 197 seconds.

Refining routes Sampling rate is low (taxis only report location every 2-5 minutes) Algorithm accuracy is < 70%[4] “Nonroundabout routes” follow three principles 1. Source farther principle (the next node in the sequence is further away from the start than the current node) 2. Destination closer principle (The next node in the sequence is closer to the destination than the current node) 3. Next nearest principle (every node after the next node should be further away from the current node than the next node) Nonroundabout routes are ideal. Any route that violates one of these principles is smoothed using sorting algorithms.

Route Smoothing Global smoothing is solved in O(n log n) by finding the longest increasing subsequence of distance pairs (node 1, source), (node 2, source) Local smoothing uses a custom algorithm which finds the longest subsequence from a globally smoothed sequence to satisfy the next-nearest principle. Typically this is completed in exponential time, but the team’s optimizations reduce this to polynomial time.

Evaluation Evaluating Landmark Graph Top-k frequent road segment: from 500 to 13000 Time of traversed by taxis: 900 times Maximum time : 30 min Evaluation on Travel Time Estimation 𝑡( 𝑅 𝑖 ) : real travel time 𝑡 ( 𝑅 𝑖 ): estimated travel time

Evaluation Routing Evaluation A: T-Drive system method B: Baseline method In-the-field Evaluation

Summary T-Drive: a system to find out the practically fastest route for a particular user at a given departure time. Estimate the time: VE-Clustering Routing: custom factor & smoothing Data: A real system with real-time GPS trajectories generated by over 33,000 taxis in a period of three months. Evaluation: more than 60% of the route found by the paper’s method are faster than that of the existing online map services, and 50% of these routes are at least 20% faster than the latter. On average, the paper’s method can save about 16% of time for a trip.

Thank you!