VLDB 2008. 2008-08-28 2  Motivation  TraClass: Trajectory Feature Generation  Trajectory Partitioning  Region-Based Clustering  Trajectory-Based.

Slides:



Advertisements
Similar presentations
Incremental Clustering for Trajectories
Advertisements

Ziming Zhang, Yucheng Zhao and Yiwen Wan.  Introduction&Motivation  Problem Statement  Paper Summeries  Discussion and Conclusions.
Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
Sensor-Based Abnormal Human-Activity Detection Authors: Jie Yin, Qiang Yang, and Jeffrey Junfeng Pan Presenter: Raghu Rangan.
Adapting Ocean Surveys to the Observed Fields Characteristics Maria-João Rendas I3S, CNRS-UNSA.
Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee.
Generated Waypoint Efficiency: The efficiency considered here is defined as follows: As can be seen from the graph, for the obstruction radius values (200,
Visual Event Detection & Recognition Filiz Bunyak Ersoy, Ph.D. student Smart Engineering Systems Lab.
1 A scheme for racquet sports video analysis with the combination of audio-visual information Visual Communication and Image Processing 2005 Liyuan Xing,
SOMM: Self Organizing Markov Map for Gesture Recognition Pattern Recognition 2010 Spring Seung-Hyun Lee G. Caridakis et al., Pattern Recognition, Vol.
Clustering short time series gene expression data Jason Ernst, Gerard J. Nau and Ziv Bar-Joseph BIOINFORMATICS, vol
Co-Training and Expansion: Towards Bridging Theory and Practice Maria-Florina Balcan, Avrim Blum, Ke Yang Carnegie Mellon University, Computer Science.
Iterative Optimization of Hierarchical Clusterings Doug Fisher Department of Computer Science, Vanderbilt University Journal of Artificial Intelligence.
DNA Barcode Data Analysis: Boosting Assignment Accuracy by Combining Distance- and Character-Based Classifiers Bogdan Paşaniuc, Sotirios Kentros and Ion.
Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.
Minimum Classification Error Networks Based on book chapter 9, by Shigeru Katagiri Jaakko Peltonen, 28 th February, 2002.
Lecture 9 Hidden Markov Models BioE 480 Sept 21, 2004.
Feature Screening Concept: A greedy feature selection method. Rank features and discard those whose ranking criterions are below the threshold. Problem:
Predicting Matchability - CVPR 2014 Paper -
05/06/2005CSIS © M. Gibbons On Evaluating Open Biometric Identification Systems Spring 2005 Michael Gibbons School of Computer Science & Information Systems.
Learning and Recognizing Activities in Streams of Video Dinesh Govindaraju.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
嵌入式視覺 Pattern Recognition for Embedded Vision Template matching Statistical / Structural Pattern Recognition Neural networks.
M ULTIFRAME P OINT C ORRESPONDENCE By Naseem Mahajna & Muhammad Zoabi.
Presented by Tienwei Tsai July, 2005
Classification. An Example (from Pattern Classification by Duda & Hart & Stork – Second Edition, 2001)
BINF6201/8201 Hidden Markov Models for Sequence Analysis
黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica TrajPattern: Mining Sequential Patterns from Imprecise Trajectories.
1 Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine Chenghai Xue, Fei Li, Tao He,
Towards Coastal Threat Evaluation Decision Support Presentation by Jacques du Toit Operational Research University of Stellenbosch 3 December 2010.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Tracking People by Learning Their Appearance Deva Ramanan David A. Forsuth Andrew Zisserman.
Cluster-specific Named Entity Transliteration Fei Huang HLT/EMNLP 2005.
Trajectory Outlier Detection: A Partition-and-Detect Framework1 04/08/08 April 8, 2007 Trajectory Outlier Detection: A Partition-and-Detect Framework Jae-Gil.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Supervised Learning Resources: AG: Conditional Maximum Likelihood DP:
黃福銘 (Angus). Angus Fuming Huang Academia Sinica, Institute of Information Science, ANTS Lab Jae-Gil Lee Jiawei Han UIUC Kyu-Young Whang KAIST ACM SIGMOD’07.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
1 An Efficient Classification Approach Based on Grid Code Transformation and Mask-Matching Method Presenter: Yo-Ping Huang.
1 Neighboring Feature Clustering Author: Z. Wang, W. Zheng, Y. Wang, J. Ford, F. Makedon, J. Pearlman Presenter: Prof. Fillia Makedon Dartmouth College.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Effective Anomaly Detection with Scarce Training Data Presenter: 葉倚任 Author: W. Robertson, F. Maggi, C. Kruegel and G. Vigna NDSS
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine 朱林娇 14S
Extracting stay regions with uncertain boundaries from GPS trajectories a case study in animal ecology Haidong Wang.
Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
9.913 Pattern Recognition for Vision Class9 - Object Detection and Recognition Bernd Heisele.
6.S093 Visual Recognition through Machine Learning Competition Image by kirkh.deviantart.com Joseph Lim and Aditya Khosla Acknowledgment: Many slides from.
Rate Distortion Theory. Introduction The description of an arbitrary real number requires an infinite number of bits, so a finite representation of a.
Tracking Groups of People for Video Surveillance Xinzhen(Elaine) Wang Advisor: Dr.Longin Latecki.
Meeting 8: Features for Object Classification Ullman et al.
Next, this study employed SVM to classify the emotion label for each EEG segment. The basic idea is to project input data onto a higher dimensional feature.
1 Discriminative Frequent Pattern Analysis for Effective Classification Presenter: Han Liang COURSE PRESENTATION:
Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons.
Mustafa Gokce Baydogan, George Runger and Eugene Tuv INFORMS Annual Meeting 2011, Charlotte A Bag-of-Features Framework for Time Series Classification.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Date of download: 7/8/2016 Copyright © 2016 SPIE. All rights reserved. A scalable platform for learning and evaluating a real-time vehicle detection system.
Feature learning for multivariate time series classification Mustafa Gokce Baydogan * George Runger * Eugene Tuv † * Arizona State University † Intel Corporation.
Hidden Markov Models BMI/CS 576
IMAGE PROCESSING RECOGNITION AND CLASSIFICATION
DIGITAL SIGNAL PROCESSING
Supervised Time Series Pattern Discovery through Local Importance
Tremor Detection Using Motion Filtering and SVM Bilge Soran, Jenq-Neng Hwang, Linda Shapiro, ICPR, /16/2018.
REMOTE SENSING Multispectral Image Classification
Handwritten Characters Recognition Based on an HMM Model
A Block Based MAP Segmentation for Image Compression
Visual Recognition of American Sign Language Using Hidden Markov Models 문현구 문현구.
Presentation transcript:

VLDB 2008

 Motivation  TraClass: Trajectory Feature Generation  Trajectory Partitioning  Region-Based Clustering  Trajectory-Based Clustering  Classification Strategy  Performance Evaluation  Related Work  Conclusions

Feature Generation Classifier Class label Training data Features Prediction Unseen data (Jeff, Professor, 4, ?) Tenured = Yes Scope of this paper

 A trajectory is a sequence of the location and timestamp of a moving object Hurricanes Turtles Vessels Vehicles

 Definition: The process of predicting the class labels of moving objects based on their trajectories and other features  Applications: Homeland security, weather forecast, law enforcement, etc.  Example: Detection of vessel types (e.g., container ships, tankers, and fishing boats) from satellite images

 Several trajectory classification methods have been proposed mainly in the fields of pattern recognition, bioengineering, and video surveillance  A common characteristic of earlier methods is that they use the shapes of whole trajectories to do classification, e.g., by using the HMM Note: Although a few methods partition trajectories, the purpose of their partitioning is just to approximate or smooth trajectories

 Problem Statement: Given a set of labeled trajectories, generate discriminative trajectory features that make a specific class distinguishable from other classes  Observations: (1) Discriminative features are likely to appear at parts of trajectories, not at whole trajectories; (2) Discriminative features appear not only as common movement patterns, but also as regions

 Observation 1: Parts of trajectories near the container port and near the refinery enable us to distinguish between container ships and tankers even if they share common long paths  Observation 2: Those in the fishery enable us to recognize fishing boats even if they have no common path there Region Sub-trajectory

 The classification accuracy of earlier methods might not be high since the overall shapes of whole trajectories are similar to each other  Our framework TraClass aims at discovering both region and sub- trajectory features Overall shape

 Extract features in a top-down fashion, first by region-based clustering and then by trajectory- based clustering Trajectory partitions in non-homogeneous regions Region-based and Trajectory-based clusters Trajectory partitions Recursively quantize non-homogeneous regions Repeatedly find finer-granularity clusters

 Achieve high classification accuracy owing to the collaboration between the two types of clustering  Region features ← Region-based clustering  Sub-trajectory features ← Trajectory partitioning and trajectory-based clustering

Trajectory partitions in non-homogeneous regions Region-based and Trajectory-based clusters Trajectory partitions Recursively quantize non-homogeneous regions Repeatedly find finer-granularity clusters

Trajectories are partitioned based on their shapes as in the partition-and-group framework [12] 2. Trajectory partitions are further partitioned by the class labels  The real interest here is to guarantee that trajectory partitions do not span the class boundaries Additional partitioning points Non-discriminative Discriminative Class A Class B

 If the most prevalent class around one endpoint is different from that around the other endpoint, further partition it  Example: Class A Class B Prevalent class = Class A Prevalent class = Class B Need to be further partitioned

Trajectory partitions in non-homogeneous regions Region-based and Trajectory-based clusters Trajectory partitions Recursively quantize non-homogeneous regions Repeatedly find finer-granularity clusters

 Discover regions that have trajectories mostly of one class regardless of their movement patterns  The region-based cluster is a set of trajectory partitions of the same class within a rectangular region regardless of their movement patterns (1)(2)

 Homogeneity: The class distribution in each region should be as homogeneous as possible  Conciseness: The number of regions should be as small as possible Note: Two properties are contradictory to each other  Need to find a good tradeoff between the properties One large regionMany small regions homogeneity conciseness

 The minimum description length (MDL) cost consists of the description cost and the code cost  The former measures conciseness, and the latter homogeneity  The best hypothesis is the one that minimizes the sum of the description cost and the code cost  Finding a good quantization translates to finding the best hypothesis using the MDL principle

 Progressively find a better partitioning alternately for the X axis and for the Y axis as long as the MDL cost decreases  Select the partition that has the maximum code cost and divide it into two parts in order to decrease the MDL cost (1) (2) (3) (4)

Trajectory partitions in non-homogeneous regions Region-based and Trajectory-based clusters Trajectory partitions Recursively quantize non-homogeneous regions Repeatedly find finer-granularity clusters

 Discover sub-trajectories that indicate common movement patterns of each class  The trajectory-based cluster is a set of trajectory partitions of the same class which share a common movement pattern (3)(4)

 Similar to our trajectory clustering algorithm [12], but incorporate the class labels into clustering  The algorithm is based on DBSCAN [5]  If an ε-neighborhood contains trajectory partitions mostly of the same class, it is used for clustering; otherwise, it is discarded immediately Non-homogeneous Homogeneous ε-neighborhood ε-neighborhood L1L1 L2L2 XO

 After trajectory-based clusters are found, discriminative clusters are selected for effective classification  If the average distance to other clusters of different classes is high, the discriminative power of the cluster is high  Example: C1C1 C2C2 Class A Class B C 1 is more discriminative than C 2

 A cluster link is a sequence of connectable (i.e., consecutive) trajectory-based clusters  Two clusters are connectable if they share enough trajectories (more formally, the ratio of common trajectories is higher than χ)  The benefit of cluster links is to derive also whole- trajectory features  Cluster links are added to the set of trajectory-based clusters for use in classification

Partition trajectories by considering the class labels 2. Perform region-based clustering 3. Perform trajectory-based clustering 4. Select discriminative trajectory-based clusters 5. Find cluster links from trajectory-based clusters 6. Convert each trajectory into a feature vector  Each feature is either a region-based cluster or a trajectory-based cluster  The i-th entry of a feature vector is the frequency that the i-th feature occurs in the trajectory 7. Feed the feature vectors to the SVM

 Use three real trajectory data sets  Animal movement data set  Movements of elk, deer, and cattle for the years 1993 through 1996  Three classes: Elk, Deer, and Cattle  Number of trajectories (points): 38 (7117), 30 (4333), and 34 (3540)  Vessel navigation data set  Navigation paths of two vessels in August 2000  Two classes: Point Lobos and Point Sur  Number of trajectories (points): 600 (65500) and 550 (125750)  Hurricane track data set  Atlantic Hurricanes for the years 1950 through 2006  Two classes: Category 2 and Category 3  Number of trajectories (points): 61 (2459) and 72 (3126)  Randomly select 20% of trajectories for the test set

 Measure classification accuracy, training time, and prediction time for the three data sets  Compare two versions of the algorithm  TB-ONLY: Perform trajectory-based clustering only  RB-TB: Perform both types of clustering  TB-ONLY is expected to be no worse than earlier methods since it discovers also whole-trajectory features by cluster- link generation Classification accuracy = # of test trajectories correctly classified total # of test trajectories

Data SetAnimalVesselHurricane VersionTB-ONLYRB-TBTB-ONLYRB-TBTB-ONLYRB-TB Accuracy (%) Training Time (ms) Prediction Time (ms)  The classification accuracy of RB-TB is much higher than that of TB-ONLY  The training time of RB-TB is much shorter than that of TB-ONLY

Data: Three classes Features: 10 region-based clusters 37 trajectory-based clusters Red: Elk Blue: Deer Black: Cattle Accuracy = 83.3%

Gulf of Mexico 1 region-based cluster 15 trajectory-based clusters Red: Category 2 Blue: Category 3 Stronger hurricanes tend to go further than weaker ones These hurricanes entered the Gulf of Mexico and thus stayed longer at sea before landfall than others; They are likely to get strong because hurricanes gain energy from the evaporation of warm ocean water

 Effect of region-based clustering  Effect of the data size (scalability test)

 Pattern recognition [1] e.g., speech, handwriting, signature, and gesture recognition  Classifying human motion trajectories  Employing the hidden Markov model (HMM)  Bioengineering [16]  Classifying biological motion trajectories  Video surveillance [15]  Detecting suspicious behaviors of pedestrians  Time-series classification [20,21]  Moving-object anomaly detection [14]

 A novel and comprehensive feature generation framework for trajectories has been proposed  The primary advantage is the high classification accuracy owing to the collaboration between the two types of clustering  Various real-world applications, e.g., vessel classification, can benefit from our framework