黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica TrajPattern: Mining Sequential Patterns from Imprecise Trajectories.

Slides:



Advertisements
Similar presentations
Mustafa Cayci INFS 795 An Evaluation on Feature Selection for Text Clustering.
Advertisements

Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.
Fast Algorithms For Hierarchical Range Histogram Constructions
1. Algorithms for Inverse Reinforcement Learning 2
Fingerprint Minutiae Matching Algorithm using Distance Histogram of Neighborhood Presented By: Neeraj Sharma M.S. student, Dongseo University, Pusan South.
Constructing Popular Routes from Uncertain Trajectories Authors of Paper: Ling-Yin Wei (National Chiao Tung University, Hsinchu) Yu Zheng (Microsoft Research.
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
Data Mining Sangeeta Devadiga CS 157B, Spring 2007.
Oklahoma State University Generative Graphical Models for Maneuvering Object Tracking and Dynamics Analysis Xin Fan and Guoliang Fan Visual Computing and.
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Data Mining Techniques So Far: Cluster analysis K-means Classification Decision Trees J48 (C4.5) Rule-based classification JRIP (RIPPER) Logistic Regression.
A Novel Scheme for Video Similarity Detection Chu-Hong Hoi, Steven March 5, 2003.
Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.
Chapter 16 Parallel Data Mining 16.1From DB to DW to DM 16.2Data Mining: A Brief Overview 16.3Parallel Association Rules 16.4Parallel Sequential Patterns.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Cascading Spatio-Temporal Pattern Discovery P. Mohan, S.Shekhar, J. Shine, J. Rogers CSci 8715 Presented by: Atanu Roy Akash Agrawal.
1 Abstract This paper presents a novel modification to the classical Competitive Learning (CL) by adding a dynamic branching mechanism to neural networks.
Topic 7 Sampling And Sampling Distributions. The term Population represents everything we want to study, bearing in mind that the population is ever changing.
Temporal Pattern Matching of Moving Objects for Location-Based Service GDM Ronald Treur14 October 2003.
1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.
Mining Long Sequential Patterns in a Noisy Environment Jiong Yang, Wei Wang, Philip S. Yu, Jiawei Han SIGMOD 2002.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
ROBOT MAPPING AND EKF SLAM
A Platform for Location Aware Service -- with human computation Ling-Jyh Chen, Meng Chang Chen Ming-Syan Chen, Sheng-Wei Chen, Jan-Ming Ho, Wang-Chien.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
USpan: An Efficient Algorithm for Mining High Utility Sequential Patterns Authors: Junfu Yin, Zhigang Zheng, Longbing Cao In: Proceedings of the 18th ACM.
Efficient Gathering of Correlated Data in Sensor Networks
Time-focused density-based clustering of trajectories of moving objects Margherita D’Auria Mirco Nanni Dino Pedreschi.
Presented by Tienwei Tsai July, 2005
Modul 7: Association Analysis. 2 Association Rule Mining  Given a set of transactions, find rules that will predict the occurrence of an item based on.
Tracking with Unreliable Node Sequences Ziguo Zhong, Ting Zhu, Dan Wang and Tian He Computer Science and Engineering, University of Minnesota Infocom 2009.
Efficient Data Mining for Calling Path Patterns in GSM Networks Information Systems, accepted 5 December 2002 SPEAKER: YAO-TE WANG ( 王耀德 )
Presented by, Lokesh Chikkakempanna Authoritative Sources in a Hyperlinked environment.
Trajectory Pattern Mining
Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 Probabilistic Continuous Update.
Spatial-Temporal Models in Location Prediction Jingjing Wang 03/29/12.
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
Comp. Genomics Recitation 3 The statistics of database searching.
Shape-based Similarity Query for Trajectory of Mobile Object NTT Communication Science Laboratories, NTT Corporation, JAPAN. Yutaka Yanagisawa Jun-ichi.
Properties of OLS How Reliable is OLS?. Learning Objectives 1.Review of the idea that the OLS estimator is a random variable 2.How do we judge the quality.
1 Discovering Robust Knowledge from Databases that Change Chun-Nan HsuCraig A. Knoblock Arizona State UniversityUniversity of Southern California Journal.
1 Murat Ali Bayır Middle East Technical University Department of Computer Engineering Ankara, Turkey A New Reactive Method for Processing Web Usage Data.
AAAI 2011, San Francisco Trajectory Regression on Road Networks Tsuyoshi Idé (IBM Research – Tokyo) Masashi Sugiyama (Tokyo Institute of Technology)
1 Efficient Algorithms for Incremental Update of Frequent Sequences Minghua ZHANG Dec. 7, 2001.
MINING COLOSSAL FREQUENT PATTERNS BY CORE PATTERN FUSION FEIDA ZHU, XIFENG YAN, JIAWEI HAN, PHILIP S. YU, HONG CHENG ICDE07 Advisor: Koh JiaLing Speaker:
MMAC: A Mobility- Adaptive, Collision-Free MAC Protocol for Wireless Sensor Networks Muneeb Ali, Tashfeen Suleman, and Zartash Afzal Uzmi IEEE Performance,
PatternHunter: A Fast and Highly Sensitive Homology Search Method Bin Ma Department of Computer Science University of Western Ontario.
黃福銘 (Angus). Angus Fuming Huang Academia Sinica, Institute of Information Science, ANTS Lab Jae-Gil Lee Jiawei Han UIUC Kyu-Young Whang KAIST ACM SIGMOD’07.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Wang-Chien Lee 2, Tz-Chiao Weng 1 and Vincent S. Tseng 1 1 Department of Computer.
Mixture Kalman Filters by Rong Chen & Jun Liu Presented by Yusong Miao Dec. 10, 2003.
Data Mining and Decision Support
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
Paging Area Optimization Based on Interval Estimation in Wireless Personal Communication Networks By Z. Lei, C. U. Saraydar and N. B. Mandayam.
Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.
CLUSTERING HIGH-DIMENSIONAL DATA Elsayed Hemayed Data Mining Course.
Path Planning Based on Ant Colony Algorithm and Distributed Local Navigation for Multi-Robot Systems International Conference on Mechatronics and Automation.
An Energy-Efficient Approach for Real-Time Tracking of Moving Objects in Multi-Level Sensor Networks Vincent S. Tseng, Eric H. C. Lu, & Kawuu W. Lin Institute.
Graph Indexing From managing and mining graph data.
Imran N. Junejo, Omar Javed and Mubarak Shah University of Central Florida, Proceedings of the 17th International Conference on Pattern Recognition, pp ,
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
Using category-Based Adherence to Cluster Market-Basket Data Author : Ching-Huang Yun, Kun-Ta Chuang, Ming-Syan Chen Graduate : Chien-Ming Hsiao.
黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Exploring Spatial-Temporal Trajectory Model for Location.
8 Sept 2006, DEMA2006Slide 1 An Introduction to Computer Experiments and their Design Problems Tony O’Hagan University of Sheffield.
CS 9633 Machine Learning Support Vector Machines
CARPENTER Find Closed Patterns in Long Biological Datasets
Discovery of Significant Usage Patterns from Clickstream Data
Accurate Video Localization
Mole: Motion Leaks through Smartwatch Sensors
Presentation transcript:

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica TrajPattern: Mining Sequential Patterns from Imprecise Trajectories of Mobile Objects TMSG- Paper Reading

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Agenda Authors & Publication Paper Presentation My Comments 2

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Authors & Publication Jiong Yang ( 杨炯 ) – Meng Hu ( 胡萌 ) – Case Western Reserve University – EECS – EDBT 2006 (International Conference on Extending Database Technology) – Springer LNCS 3896 – 3

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Introduction Digital trajectory appearance – Mobile devices – Global positioning system – Trajectories of mobile objects Moving pattern applications – Location prediction – Location-based commerce advertisement – Animals migration 4

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Introduction (contd.) Trajectory data is often imprecise – Energy consumption reducing – GPS miss – It is hard to formulate the pattern Goal – Not a particular prediction model, but rather a general data mining framework – Can be applied to existing location prediction methods 5

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Introduction (contd.) Sequential pattern model – Practical and popular in trajectory mining Method – Normalized match measure (NM) Apriori property (x) Min-max property (o) A new mining algorithm: TrajPattern – Growing process – Pruning process Pattern groups 6

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Preliminaries and Problem Statement Problem – Identifying sequential patterns of imprecise trajectories of mobile devices Assumption – A server and a set of mobile devices – Be capable to know their own locations (e.g. via GPS) – Asynchronously report locations 7

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Location Reporting Scheme A general pattern mining framework – Various different location inference models Location prediction method property – Given time, predicted location, actual location, certain distribution Predicted position – Without loss of generality… 8

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Location and Velocity Trajectories Synchronous snapshots – A consistent view of all objects Location trajectory – Mean & Standard deviation of the distribution of the true location of object o at ith snapshot Difference of the mean & standard deviation Velocity trajectory – The same as the above concepts 9

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Model of Trajectory Pattern Trajectory pattern, – p i is location Indifferent parameter, δ – Object is at most δ away from coordinate (x, y) Indifferent probability, – How likely an object is truly very close to a position p Match, between a pattern P and a trajectory T’ Normalized Match, 10

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Definition of Pattern Group Definition 1: Similar patterns – No larger than a pre-defined value, ϒ – ϒ, the maximum similar pattern distance Definition 2: Pattern group Problem Statement – For a given set of imprecise trajectories – To find k patterns with the most normalized match – Represented via the concept of pattern groups 11

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Properties of Trajectory Patterns Definition 3: Super-pattern – Contain relationship – Super-pattern, sub-pattern, proper super-pattern, proper sub-pattern Definition 4: i-trajectory pattern – Include i positions Property 1: Min-max property – Proof… 12

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica TrajPattern Algorithm Observations – Discovered trajectory patterns is usually shorter – Dynamically increasing NM threshold – Seed patterns based on the min-max property Procedures 1.Grid partition Singular patterns 2.Pattern generation Low and high patterns 3.Pruning 1-extension property 4.Pattern groups discovery Length clustering 13

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica 1-extension property 14

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica 1-extension property (contd.) The reason that we only need to retain the set of low patterns satisfying the 1-extension property… – So! Let’s remove all low patterns that do not satisfy this property 15

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Example of the Pattern Groups Discovery S1: (p 1, p 3, p 4, p 5 ), (p 2, p 6 ) S2: (p’ 1, p’ 3, p’ 6 ), (p’ 2, p’ 4 ), (p’ 5 ) – The smallest snapshot group: (p’ 5 ) – Extract 1-pattern: (p 5 ) S1: (p 1, p 3, p 4 ), (p 2, p 6 ) S2: (p’ 1, p’ 3, p’ 6 ), (p’ 2, p’ 4 ) – The smallest snapshot group: (p 2, p 6 ) – Extract 1-pattern: (p 2 ), (p 6 ), (p 4 ) S1: (p 1, p 3 ) S2: (p’ 1, p’ 3 ) – Extract 2-pattern: (p 1, p 3 ) Pattern groups: (p 2 ), (p 4 ), (p 5 ), (p 6 ), (p 1, p 3 ) 16

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Correctness Analysis 17

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Complexity Analysis Time complexity = O(k 2 M 2 NG) Space complexity = O(kMG) – k, user needed top patterns – G, the number of grids – M, the maximum length of a pattern – N, the size of the input trajectory data set 18

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Effectiveness of the NM Model Comparison of three prediction modules – Linear model (LM) – Linear Kalman Filter (LMF) – Recursive motion function (RMF) Real data set – 50 buses, 500 traces, 100 snapshots, weekday – Training: 450 ; Testing: 50 19

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Effectiveness of the NM Model (contd.) 20% to 40% vs. 10% to 20% 20

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Scalability and Sensitivity Baseline : InfoMiner – a projection based approach (PB) Synthetic data set – ZebraNet project (Kenya) Modification – Simulate some groups – Simulate some leaving – Simulate some moving distance and direction 21

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Scalability and Sensitivity (contd.) Scalability – The number of patterns needed, K – The number of sequences, S – The average length of a sequence, L – The various number of grids, G 22 Sensitivity – The effect of the indifferent threshold, δ

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Conclusion Mining trajectory patterns from a set of imprecise trajectories – Min-max property – TrajPattern algorithm – Pattern group Demonstrating the usefulness and efficiency – Real and synthetic data sets 23

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica My Comments Nice~ – The formal definitions are beautiful – Resources of experiments are various and living – Breakthrough of his previous research Weak~ – Do not explain the setting and applying of the three compared model clearly – Research scope is a little rough and inconsistent General DM framework vs. Specific sequential pattern 24

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica My Comments (contd.) Others~ – [Yang 2001] is worth to read InfoMiner: Mining Surprising Periodic Patterns 57 cites vs. 33 cites – If no certain distribution !!! What will happen ??? (pp.8) 25

黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Thanks for your listening……….. 26