Part III – Trajectory Data Mining Vania Bogorny Universidade Federal de Santa Catarina, Brazil Department of Informatics and Statistics www.inf.ufsc.br/~vania.

Slides:



Advertisements
Similar presentations
Density-Based Clustering Math 3210 By Fatine Bourkadi.
Advertisements

Incremental Clustering for Trajectories
ADAPTIVE FASTEST PATH COMPUTATION ON A ROAD NETWORK: A TRAFFIC MINING APPROACH Hector Gonzalez, Jiawei Han, Xiaolei Li, Margaret Myslinska, John Paul Sondag.
Swarm: Mining Relaxed Temporal Moving Object Clusters
Mining Frequent Spatio-temporal Sequential Patterns
Fundamental tools: clustering
A Framework for Clustering Evolving Data Streams Charu C. Aggarwal, Jiawei Han, Jianyong Wang, Philip S. Yu Presented by: Di Yang Charudatta Wad.
VLDB  Motivation  TraClass: Trajectory Feature Generation  Trajectory Partitioning  Region-Based Clustering  Trajectory-Based.
9/15/2008 CTBTO Data Mining/Data Fusion Workshop 1 Spatiotemporal Stream Mining Applied to Seismic+ Data Margaret H. Dunham CSE Department Southern Methodist.
Alla Petrakova.  Becoming familiar with Motion Pattern algorithms described in: Similarity Invariant Classification of Events by KL Divergence Minimization.
Computational Movement Analysis Lecture 4: Movement patterns Joachim Gudmundsson.
Clustering Trajectories of Moving Objects in an Uncertain World
DMiST- Data Mining in Spatio-Temporal sets
Trajectory Pattern Mining ACMGIS’2011 Hoyoung Jeung† Man Lung Yiu‡ Christian S. Jensen* † Ecole Polytechnique F´ed´erale de Lausanne (EPFL) ‡ Hong Kong.
Marc van Kreveld (and Giri Narasimhan) Department of Information and Computing Sciences Utrecht University.
Spatio-Temporal Data Mining Vania Bogorny Universidade Federal do Rio Grande do Sul Shashi Shekhar University.
Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping Trajectory Pattern Mining Reporter : Hsieh, Hsun-Ping 解巽評 (R ) Fosca Giannotti Mirco Nanni Dino.
Yu Zheng, Lizhu Zhang, Xing Xie, Wei-Ying Ma Microsoft Research Asia
Cluster Analysis.
CENTRE Cellular Network’s Positioning Data Generator Fosca GiannottiKDD-Lab Andrea MazzoniKKD-Lab Puntoni SimoneKDD-Lab Chiara RensoKDD-Lab.
Trajectory Simplification
On Discovering Moving Clusters in Spatio-temporal Data Panos Kalnis National University of Singapore Nikos Mamoulis University of Hong Kong Spiridon Bakiras.
Median trajectories: define and compute a trajectory composed of the input trajectories and that is somehow in the middle Marc van Kreveld Department of.
CS 590M Fall 2001: Security Issues in Data Mining Lecture 5: Association Rules, Sequential Associations.
University of Athens, Greece Pervasive Computing Research Group Predicting the Location of Mobile Users: A Machine Learning Approach 1 University of Athens,
Computational Geometry and Spatial Data Mining Marc van Kreveld Department of Information and Computing Sciences Utrecht University.
Spatial Data Mining: Three Case Studies For additional details Shashi Shekhar, University of Minnesota Presented.
1 A DATA MINING APPROACH FOR LOCATION PREDICTION IN MOBILE ENVIRONMENTS* by Gökhan Yavaş Feb 22, 2005 *: To appear in Data and Knowledge Engineering, Elsevier.
© NICTA 2007 Joachim Gudmundsson Detecting Movement Patterns Among Trajectory Data.
Geographic Data Mining Marc van Kreveld Seminar for GIVE Block 1, 2003/2004.
Tokyo Research Laboratory © Copyright IBM Corporation 2009 | 2009/04/03 | SDM 09 / Travel-Time Prediction Travel-Time Prediction using Gaussian Process.
GPS Trajectories Analysis in MOPSI Project Minjie Chen SIPU group Univ. of Eastern Finland.
GeoPKDD Geographic Privacy-aware Knowledge Discovery and Delivery Kick-off meeting Pisa, March 14, 2005.
Intrusion Detection Jie Lin. Outline Introduction A Frame for Intrusion Detection System Intrusion Detection Techniques Ideas for Improving Intrusion.
1 Data Mining Books: 1.Data Mining, 1996 Pieter Adriaans and Dolf Zantinge Addison-Wesley 2.Discovering Data Mining, 1997 From Concept to Implementation.
Bei Pan (Penny), University of Southern California
Mirco Nanni, Roberto Trasarti, Giulio Rossetti, Dino Pedreschi Efficient distributed computation of human mobility aggregates through user mobility profiles.
Knowledge Discovery and Delivery Lab (ISTI-CNR & Univ. Pisa)‏ www-kdd.isti.cnr.it Anna Monreale Fabio Pinelli Roberto Trasarti Fosca Giannotti A. Monreale,
 Larger datasets are becoming available from GPS, GSM, RFID, and other sensors.  Interest in movement has shifted from raw movement data analysis to.
Developing space-time data models and representations for GIS
Trajectory Pattern Mining
Trajectory Pattern Mining Fosca Giannotti, Mirco Nanni, Dino Pedreschi, Fabio Pinelli KDD Lab (ISTI-CNR & Univ. Pisa) Presented by: Qiming Zou.
Spatial-Temporal Models in Location Prediction Jingjing Wang 03/29/12.
Garrett Poppe, Liv Nguekap, Adrian Mirabel CSUDH, Computer Science Department.
Classification and Novel Class Detection in Data Streams Classification and Novel Class Detection in Data Streams Mehedy Masud 1, Latifur Khan 1, Jing.
Semantic Aspects in Spatial Data Mining Vania Bogorny.
Trajectory Outlier Detection: A Partition-and-Detect Framework1 04/08/08 April 8, 2007 Trajectory Outlier Detection: A Partition-and-Detect Framework Jae-Gil.
黃福銘 (Angus). Angus Fuming Huang Academia Sinica, Institute of Information Science, ANTS Lab Jae-Gil Lee Jiawei Han UIUC Kyu-Young Whang KAIST ACM SIGMOD’07.
Towards Semantic Trajectory Outlier Detection Artur Ribeiro de Aquino 1 Luis Otavio Alvares 1 Chiara Renso 2 Vania Bogorny Dep. de Matemática e Estatística.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Wang-Chien Lee 2, Tz-Chiao Weng 1 and Vincent S. Tseng 1 1 Department of Computer.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Trajectory Data Mining
D-skyline and T-skyline Methods for Similarity Search Query in Streaming Environment Ling Wang 1, Tie Hua Zhou 1, Kyung Ah Kim 2, Eun Jong Cha 2, and Keun.
Trajectory Simplification: On Minimizing the Direction-based Error
Eick: Papers, Articles, and Other Material to be used in COSC 7363 COSC 7363 Overview Advanced AI Hyla-Tree Frog 1994 “AI” Turing Award Lectures AI and.
Trajectory Data Mining
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
CSCE 5073 Section 001: Data Mining Spring Overview Class hour 12:30 – 1:45pm, Tuesday & Thur, JBHT 239 Office hour 2:00 – 4:00pm, Tuesday & Thur,
2IMA20 Algorithms for Geographic Data Spring 2016 Lecture 3: Movement Patterns.
Term Project Proposal By J. H. Wang Apr. 7, 2017.
Presented by: Shahab Helmi Spring 2016
Chapter Two Trajectory Indexing and Retrieval
Yongli Zhang and Christoph F. Eick University of Houston, USA
A conceptual Data Model for Trajectory Data Mining
2IMG15 Algorithms for Geographic Data
Spatial Data Mining: Three Case Studies
CSCE 4143 Section 001: Data Mining Spring 2019.
Chapter Two Trajectory Indexing and Retrieval
Promising “Newer” Technologies to Cope with the
Presentation transcript:

Part III – Trajectory Data Mining Vania Bogorny Universidade Federal de Santa Catarina, Brazil Department of Informatics and Statistics Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM – 2010) 7/5/20161 of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Trajectory Data (Giannotti 2007 – Spatio-temporal Data Represented as a set of points, located in space and time T=(x 1,y 1, t 1 ), …, (x n, y n, t n ) => position in space at time t i was (x i,y i ) Tid position (x,y) time (t) : : : : : : : /5/20162 of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Trajectory Patterns Explicar o processo de KDD em geral 7/5/20163 of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Mining Trajectories: Clustering Group together similar trajectories For each group produce a summary = cell Fosca Giannotti 2007 – 7/5/20164 of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Mining Trajectories : Frequent patterns Fosca Giannotti 2007 – Frequent followed paths = cell 7/5/20165 of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Mining Trajectories: classification models Extract behaviour rules from history Use them to predict behaviour of future users 60% 7% 8% 5% 20% ? Fosca Giannotti 2007 – = cell 7/5/20166 of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Trajectory Data Mining Methods 7/5/20167 of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Spatio-Temporal Data Mining Methods Two approaches: Geometry-based spatio-temporal data mining: Density-based clustering methods Focus on physical similarity Consider only geometrical properties of trajectories (space and time) Semantic-based spatio-temporal data mining Deal with sparse data also Patterns are computed based on the semantics of the data Trajectories are pre-processed to enrich the data 7/5/20168 of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Geometry-based Trajectory Data Mining Methods 7/5/20169 of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

General Geometric Trajectory Patterns 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Relative Motion Patterns ( Laube 2004 ) T1T1 T2T2 T3T3 T4T4 T5T5 convergence Proposed 5 kinks of trajectory patterns based on movement, direction, and location: convergence, encounter, flock, leadership, and recurrence Convergence: At least m entities pass through the same circular region of radius r, not necessarily at the same time (e.g. people moving to train station) 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Relative Motion Patterns ( Laube 2004) Flock pattern: At least m entities are within a region of radius r and move in the same direction during a time interval >= s (e.g. traffic jam) Leadership: At least m entities are within a circular region of radius r, they move in the same direction, and at least one of the entities is heading in that direction for at least t time steps. (e.g. bird migration, traffic accident) Encounter: At least m entities will be concurrently inside the same circular region of radius r, assuming they move with the same speed and direction. (e.g. traffic jam at some moment if cars keep moving in the same direction) T2T2 T3T3 T1T1 Flock Leadership Encounter 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Relative Motion Patterns ( Laube 2004) F1F1 F1 F1F1 Recurrence Recurrence: at least m entities visit a circular region at least k times 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Extension of the work proposed by [Laube 2004, 2005] Gudmundsson(2006) Computes the longest duration flock patterns The longest pattern has the longest duration And has at least a minimal number of trajectories Gudmundsson (2007) proposes approximate algorithms for computing the patterns leadership, encounter, convergence, and flock Focus relies on performance issues 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Frequent Trajectory Patterns 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Frequent Mobile Group Patterns (Hwang, 2005) A group pattern is a set of trajectories close to each other (with distance less than a given minDist) for a minimal amount of time (minTime) Direction is not considered Frequent groups are computed with the algorithm Apriori Group pattern: time, distance, and minsup 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Co-Location Patterns (Cao 2006) Co-location episoids in spatio-temporal data Trajectories are spatially close in a time window and move together w1 w2 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Traclus (Han, 2007) Clustering algorithm (TraClus-Trajectory Clustering) Group sub-trajectories Density-based Partition-and-group method 1) each trajectory is partitioned into a set of line segments (sub- trajectories) with lenght L defined by the user 2) similar segments (close segments) are grouped Similarity is based on a distance function Interesting approach for trajectories of hurricanes Main drawback: Clustering is based on spatial distance time is not considerd 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Trajectory Sequential Patterns 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

T-Patterns ( Giannotti, 2007) Sequential Trajectory Pattern Mining Considers both space and time Objective is to describe frequent movement Considering visited regions of interest During movements and the duration of movements Steps: 1. Compute or find regions of interest, based on dense spatial regions (no time is considered) 2. Select trajectories that intersect two or more regions in a sequence, annotating travel time from one region to another 3. Compute sequences of regions visited in same time intervals 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Fix a set of pre-defined regions time A B C Map each (x,y) of the trajectory to its region Sample pattern: T-Patterns ( Giannotti, 2007) B A .min20 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Detect significant regions thru spatial clustering time Map each (x,y) of the trajectory to its region Sample pattern: around(x 1,y 1 ) around(x 2,y 2 ) T-Patterns ( Giannotti, 2007) 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Trajectory Classification The idea is to classify types of trajectories 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

TraClass Algorithm (Lee 2008) Two main steps algorithm: First: region – based clustering Second: trajectory-clustering Time is not considered 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

TraClass Algorithm (Lee 2008) Classify subtrajectories instead of whole trajectories Examples:  Red trajectories move from Port A to Container Port and then to Port B  Blue trajectories move from Port A to Refinery and then to Port B Classifying whole trajectory would classify all trajectories as moving from Port A to Port B 25 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Trajectory Outlier Detection 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

The objective is to find trajectories that have different behavior in relation to other trajectories For instance: – A fishing vessel that has a behaviour different from other fishing vessels in the same area – A hurricane that may change behaviour in certain parts of its trajectory – Cars or pedestrians with suspishious behaviour Trajectory Outlier Detection 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Partition trajectories into subtrajectories Compare subtrajectories based on: – distance and length If a subtrajectory is not close to other trajectories for a minimal lenght – It is an outlier TraOD - Trajectory Outlier Detection (Lee 2008) 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Example: – Looking to the whole trajectory, TR 3 is not detected as an outlier since its overall behavior is similar to neigbouhr trajectories Looking at the subtrajectories, T3 can be an outlier 29 TR 5 TR 1 TR 4 TR 3 TR 2 An outlying sub-trajectory TraOD - Trajectory Outlier Detection (Lee 2008) 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Two phases: partitioning and detection TR 5 TR 1 TR 4 TR 3 TR 2 A set of trajectories (1) Partition (2) Detect TR 3 A set of trajectory partitions An outlier Outlying trajectory partitions TraOD - Trajectory Outlier Detection (Lee 2008) 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

31 13 Outliers from Hurricane Data TraOD - Trajectory Outlier Detection (Lee 2008) 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

Summary Tid geometry timest : : : : : : : : : : : : These data mining approaches deal with Trajectory Samples 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

References Laube, P. and Imfeld, S. (2002). Analyzing relative motion within groups of trackable moving point objects. In Egenhofer, M. J. and Mark, D. M., editors, GIScience, volume 2478 of Lecture Notes in Computer Science, pages 132–144. Springer. Laube, P., Imfeld, S., and Weibel, R. (2005a). Discovering relative motion patterns in groups of moving point objects. International Journal of Geographical Information Science, 19(6):639–668. Laube, P., van Kreveld, M., and Imfeld, S. (2005b). Finding REMO: Detecting Relative Motion Patterns in Geospatial Lifelines. Springer. Lee, J.-G., Han, J., and Whang, K.-Y. (2007). Trajectory clustering: a partition-and-group framework. In Chan, C. Y., Ooi, B. C., and Zhou, A., editors, SIGMOD Conference, pages 593–604. ACM. Li, Y., Han, J., and Yang, J. (2004). Clustering moving objects. In KDD ’04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 617–622, New York, NY, USA. ACM Press. Nanni, M. and Pedreschi, D. (2006). Time-focused clustering of trajectories of moving objects. Journal of Intelligent Information Systems, 27(3):267–289. 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

References Verhein, F. and Chawla, S. (2006). Mining spatio-temporal association rules, sources, sinks, stationary regions and thoroughfares in object mobility databases. In Lee, M.- L., Tan, K.-L., and Wuwongse, V., editors, DASFAA, volume 3882 of Lecture Notes in Computer Science, pages 187–201. Springer. Gudmundsson, J. and van Kreveld, M. J. (2006). Computing longest duration flocks in trajectory data. In [de By and Nittel 2006], pages 35–42. Gudmundsson, J., van Kreveld, M. J., and Speckmann, B. (2007). Efficient detection of patterns in 2d trajectories of moving points. GeoInformatica, 11(2):195–215. Hwang, S.-Y., Liu, Y.-H., Chiu, J.-K., and Lim, E.-P. (2005). Mining mobile group patterns: A trajectory-based approach. In Ho, T. B., Cheung, D. W.-L., and Liu, H., editors, PAKDD, volume 3518 of Lecture Notes in Computer Science, pages 713–718. Springer. Cao, H., Mamoulis, N., and Cheung, D. W. (2006). Discovery of collocation episodes in spatiotemporal data. In ICDM, pages 823–827. IEEE Computer Society. Zhenhui Li, Jae-Gil Lee, Xiaolei Li, Jiawei Han: Incremental Clustering for Trajectories. DASFAA (2) 2010: 32-46Xiaolei Li 7/5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)

More... Huiping Cao, Nikos Mamoulis, David W. Cheung: Discovery of Periodic Patterns in Spatiotemporal Sequences. IEEE Trans. Knowl. Data Eng. 19(4): (2007) Panos Kalnis, Nikos Mamoulis, Spiridon Bakiras: On Discovering Moving Clusters in Spatio- temporal Data. SSTD, (2005) Florian Verhein, Sanjay Chawla: Mining spatio-temporal patterns in object mobility databases. Data Min. Knowl. Discov. 16(1): 5-38 (2008) Florian Verhein, Sanjay Chawla: Mining Spatio-temporal Association Rules, Sources, Sinks, Stationary Regions and Thoroughfares in Object Mobility Databases. DASFAA, (2006) Cao, H., Mamoulis, N., and Cheung, D. W. (2005). Mining frequent spatio-temporal sequential patterns. In ICDM ’05: Proceedings of the Fifth IEEE International Conference on Data Mining, pages 82–89, Washington, DC, USA. IEEE Computer Society. Jae-Gil Lee, Jiawei Han, Xiaolei Li, and Hector Gonzalez, “TraClass: Trajectory Classification Using Hierarchical Region-Based and Trajectory-Based Clustering”, Proc Int. Conf. on Very Large Data Base (VLDB'08), Auckland, New Zealand, Aug Jae-Gil Lee, Jiawei Han, and Xiaolei Li, "Trajectory Outlier Detection: A Partition-and-Detect Framework", Proc Int. Conf. on Data Engineering (ICDE'08), Cancun, Mexico, April /5/ of 90 Tutorial on Spatial and Spatio-Temporal Data Mining (ICDM 2010)