ADAPTIVE FASTEST PATH COMPUTATION ON A ROAD NETWORK: A TRAFFIC MINING APPROACH Hector Gonzalez, Jiawei Han, Xiaolei Li, Margaret Myslinska, John Paul Sondag.

Slides:



Advertisements
Similar presentations
Meet Hadoop Doug Cutting & Eric Baldeschwieler Yahoo!
Advertisements

CSE 211 Discrete Mathematics
Xiaolei Li, Zhenhui Li, Jiawei Han, Jae-Gil Lee. 1. Motivation 2. Anomaly Definitions 3. Algorithm 4. Experiments 5. Conclusion.
Incremental Clustering for Trajectories
Social network partition Presenter: Xiaofei Cao Partick Berg.
Urban Computing with Taxicabs
Mining Compressed Frequent- Pattern Sets Dong Xin, Jiawei Han, Xifeng Yan, Hong Cheng Department of Computer Science University of Illinois at Urbana-Champaign.
gSpan: Graph-based substructure pattern mining
Surface Simplification Using Quadric Error Metrics Speaker: Fengwei Zhang September
Efficient Keyword Search for Smallest LCAs in XML Database Yu Xu Department of Computer Science & Engineering University of California, San Diego Yannis.
Decision Tree Approach in Data Mining
On Map-Matching Vehicle Tracking Data
CS171 Introduction to Computer Science II Graphs Strike Back.
Adaptive Fastest Path Computation on a Road Network : A Traffic Mining Approach Hector Gonzalez Jiawei Han Xiaolei Li Margaret Myslinska John Paul Sondag.
Critical Analysis Presentation: T-Drive: Driving Directions based on Taxi Trajectories Authors of Paper: Jing Yuan, Yu Zheng, Chengyang Zhang, Weilei Xie,
實驗室 : 先進網路技術與服務實驗室 報告者 : 黃福銘 (Angus F.M. Huang) Adaptive Fastest Path Computation on a Road Network: A Traffic Mining Approach TMSG
C++ Programming: Program Design Including Data Structures, Third Edition Chapter 21: Graphs.
Computability and Complexity 23-1 Computability and Complexity Andrei Bulatov Search and Optimization.
CSE 222 Systems Programming Graph Theory Basics Dr. Jim Holten.
Randomized Planning for Short Inspection Paths Tim Danner Lydia E. Kavraki Department of Computer Science Rice University.
1 Techniques for Efficient Road- Network-Based Tracking of Moving Objects Speaker : Jia-Hui Huang Date : 2006/10/23.
Tracking Moving Objects in Anonymized Trajectories Nikolay Vyahhi 1, Spiridon Bakiras 2, Panos Kalnis 3, and Gabriel Ghinita 3 1 St. Petersburg State University.
SubSea: An Efficient Heuristic Algorithm for Subgraph Isomorphism Vladimir Lipets Ben-Gurion University of the Negev Joint work with Prof. Ehud Gudes.
Scalable Network Distance Browsing in Spatial Database Samet, H., Sankaranarayanan, J., and Alborzi H. Proceedings of the 2008 ACM SIGMOD international.
Computational Data Modeling and Query Processing in Road Networks Irina Aleksandrova, Augustas Kligys, Laurynas Speičys 4-th WIM meeting, Aalborg 2002.
Route Planning Vehicle navigation systems, Dijkstra’s algorithm, bidirectional search, transit-node routing.
Computational Data Modeling and Queries for Location-Based Services in Road Networks Irina Aleksandrova, Augustas Kligys, Laurynas Speičys.
Computational Methods for Management and Economics Carla Gomes
Minimum Spanning Trees What is a MST (Minimum Spanning Tree) and how to find it with Prim’s algorithm and Kruskal’s algorithm.
FAST FREQUENT FREE TREE MINING IN GRAPH DATABASES Marko Lazić 3335/2011 Department of Computer Engineering and Computer Science,
JUTS JSim Urban Traffic Simulator 1 J-Sim Urban Traffic Simulator J-Sim based, XML using grafical and console simulation tool. David Hartman ZČU-FAV-KIV.
Geometric clustering for line drawing simplification
1 CS 4396 Computer Networks Lab Dynamic Routing Protocols - II OSPF.
Lecture 13 Graphs. Introduction to Graphs Examples of Graphs – Airline Route Map What is the fastest way to get from Pittsburgh to St Louis? What is the.
Towards Robust Indexing for Ranked Queries Dong Xin, Chen Chen, Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign VLDB.
Graph Indexing: A Frequent Structure- based Approach Alicia Cosenza November 26 th, 2007.
1/52 Overlapping Community Search Graph Data Management Lab, School of Computer Science
Efficient Route Computation on Road Networks Based on Hierarchical Communities Qing Song, Xiaofan Wang Department of Automation, Shanghai Jiao Tong University,
Finding Top-k Shortest Path Distance Changes in an Evolutionary Network SSTD th August 2011 Manish Gupta UIUC Charu Aggarwal IBM Jiawei Han UIUC.
Group 8: Denial Hess, Yun Zhang Project presentation.
LOGO 1 Mining Templates from Search Result Records of Search Engines Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Hongkun Zhao, Weiyi.
1 Knowledge Discovery from Transportation Network Data Paper Review Jiang, W., Vaidya, J., Balaporia, Z., Clifton, C., and Banich, B. Knowledge Discovery.
CCR = Connectivity Residue Ratio = Pr. [ node pair connected by an edge are together in a common page on computer disk drive.] “U of M Scientists were.
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
APEX: An Adaptive Path Index for XML data Chin-Wan Chung, Jun-Ki Min, Kyuseok Shim SIGMOD 2002 Presentation: M.S.3 HyunSuk Jung Data Warehousing Lab. In.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Chapter 20: Graphs. Objectives In this chapter, you will: – Learn about graphs – Become familiar with the basic terminology of graph theory – Discover.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Sheng-Hsuan Wang Author : Sanghamitra.
Graph Indexing From managing and mining graph data.
Bayesian Hierarchical Clustering Paper by K. Heller and Z. Ghahramani ICML 2005 Presented by David Williams Paper Discussion Group ( )
1 Discovering Web Communities in the Blogspace Ying Zhou, Joseph Davis (HICSS 2007)
Construction of Optimal Data Aggregation Trees for Wireless Sensor Networks Deying Li, Jiannong Cao, Ming Liu, and Yuan Zheng Computer Communications and.
Proof of correctness of Dijkstra’s algorithm: Basically, we need to prove two claims. (1)Let S be the set of vertices for which the shortest path from.
Data Structures and Algorithm Analysis Lecture 5
Presented by: Mi Tian, Deepan Sanghavi, Dhaval Dholakia
Data Transformation: Normalization
CS522 Advanced database Systems
Intra-Domain Routing Jacob Strauss September 14, 2006.
Jiawei Han Department of Computer Science
Visualization of query processing over large-scale road networks
Finding Fastest Paths on A Road Network with Speed Patterns
Predicting Traffic Dmitriy Bespalov.
DATA MINING Introductory and Advanced Topics Part II - Clustering
Louisiana Travels.
Switch controller: Routing
Chapter 6 Network Flow Models.
Continuous Density Queries for Moving Objects
CSE572: Data Mining by H. Liu
Presentation transcript:

ADAPTIVE FASTEST PATH COMPUTATION ON A ROAD NETWORK: A TRAFFIC MINING APPROACH Hector Gonzalez, Jiawei Han, Xiaolei Li, Margaret Myslinska, John Paul Sondag Department of Computer Science University of Illinois at Urbana-Champaign VLDB 2007

Outline  Introduction  Algorithms  Offline… Road Network Partitioning Speed & Driving Pattern Mining Pre-computation & Upgrades  Online… Fastest Path Computation  Experiment

Introduction: Application  Route Planning System  Fastest Path Computation on Road Network  Consider different factors such as time/weather/safety “Learn from history” (mine patterns)

Introduction: Contributions  The hierarchy of roads can be used to partition the road network into area, and different path pre- computation strategies can be used at the area level  We can limit our route search strategy to edges and path segments that are actually frequently travelled in the data  Drivers usually traverse the road network through the largest roads available given the distance of the trip, except if there are small roads with a significant speed advantage over the large ones. (Small Road Upgrades)

Introduction: Problem Definition I  Definition 2.1. A road network is a directed graph G(V,E), where…(omitted)

Introduction: Problem Definition II  Definition 2.2. A speed pattern is a tuple of the form, where edge_id is an edge, (t_start, t_end) is a time interval, each d i is a value for speed factor D i, and m is an aggregate function computed on edge speed.

Introduction: Problem Definition III  Definition 2.3. A driving pattern is a sequence s of edges e (1), e (2),..., e (l) that appears more than min_sup times in the path database, and that is a valid path in the road network graph G(V,E). We define support(s) as the number of paths that contain the sequence s. We define the length of the sequence, length(s), as the number of edges that it contains.

Introduction: Problem Definition IV  Definition 2.4. An edge forecast model F(edge_id, t), returns a tuple (d 1, d 2, …, d k ) with the expected driving conditions for edge edge_id at time t.  Example: “At 5 pm [time], for highway 74 between Champaign and Normal [edge], Weather = rain, and Construction = no [conditions]".  Problem Statement. Given a road network G(V,E), a set of speed patterns S, an edge forecast model F, and a query q  (s, e, start_time), compute a fast route q r between nodes s and e starting from s at time start time, such that q r contains a large number of frequent driving patterns.

Introduction: Traffic Database (“history”)  In the example  support(e 1 )=3, support(e 2 )=3, support(e 3 )=1…

Outline  Introduction  Algorithms  Offline… Road Network Partitioning Speed & Driving Pattern Mining Pre-computation & Upgrades  Online… Fastest Path Computation  Experiment

Road Network Partitioning I  Edge Class  Node Class  Partitioning class(e)=1 class(e)=2

Road Network Partitioning II  Edge Class  Node Class  Definition 4.1. Given a road network G(V,E), with pre- defined edges classes class(e) for each edge e, the class of a node n denoted class(n), is defined as the biggest (lowest class number) of any incoming or outgoing edge to/from n.

Road Network Partitioning III  Node Class  Partitioning  Definition 4.2. Given edges of class k, a partition P(k) of a road network G(V,E) divides nodes into areas V 1 k,…, V n k, with V =  i V i k. Areas are defined as all sets of strongly connected components after the removal of nodes with class(n)<k from G. A node n, with class(n)>k in strongly connected component i, belongs to area V i k, and it is said to be interior to the area. A node n, with class(n)≤k belongs to all areas V i k such that there is an edge e, with class(e)>k, connecting n to n’ and n’  V i k, such nodes are said to be border nodes of all the areas they connect to.

Road Network Partitioning IV  The Algorithm  Iteratively, start from a “seed” and grow the area to all reachable nodes

Outline  Introduction  Algorithms  Offline… Road Network Partitioning Speed & Driving Pattern Mining Pre-computation & Upgrades  Online… Fastest Path Computation  Experiment

Traffic Mining: Speed Pattern Mining  Goal  Input: Traffic tuples of the form  Output: rules such as “if area = a1 and weather = icy and time = rush hour then speed = 1/4 x base speed".  Method (the paper does not provide detail)  Pre-processing: discretize speed factors via clustering  Decision tree induction

Traffic Mining: Driving Pattern Mining  “Frequent edges” or “frequent routes”?  In this paper: frequent edges  minimal support relative to the traffic volume of each edge class in the area

Outline  Introduction  Algorithms  Offline… Road Network Partitioning Speed & Driving Pattern Mining Pre-computation & Upgrades  Online… Fastest Path Computation  Experiment

Area Level Pre-computation  Goal  Pre-compute important local fastest paths with given time interval and road condition  Method  For an area in level m, look at pairs of nodes of level m+1 in that area  “Guide the pre-computation by the set of speed rules mined for the area, and limit the analysis paths involving edges with few speed rules.”

Small road upgrades  Example  In traffic hours, a small (low level) road may have higher speed than highway in the area. In such case, upgrade the small road.  Algorithm  Bottom-up  Scan all edges

Outline  Introduction  Algorithms  Offline… Road Network Partitioning Speed & Driving Pattern Mining Pre-computation & Upgrades  Online… Fastest Path Computation  Experiment

Fastest Path Computation I  Algorithm: A* (maintain a heap; Best first search)  From 1:7:5 (level 3)  expand to a 1:7 edge node (level 2)  then to 1  to 1:3  to 1:3:5

Fastest Path Computation II  Continue the algorithm  When expand to node n, update expected path cost to cost(start, n) + h(n, end) h(n, end) = distance(n, end)/max_speed  Online path re-computation  When condition changes (i.e., start raining), re-compute fastest path from current position to end node

Experiment Setting  Maps – road are in 2 levels  San Francisco Bay Area (175K nodes, 223K edges)  Illinois (831K nodes, 1M edges)  San Joaquin CA (18K nodes, 24K edges)  A few hundred megabytes traffic data & 100 queries: stimulated  Algorithms  A* (without hierarchical search; correct answer)  Hier (hierarchical search; without pre-computation and small road upgrade)  Adapt (this paper)

Experiment: Query Length

Experiment: Upgraded Paths

Experiments: others