Clustering of Trajectory Data obtained from Soccer Game Record -A First Step to Behavioral Modeling Shoji Hirano Shusaku Tsumoto

Slides:



Advertisements
Similar presentations
Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Advertisements

Fast Algorithms For Hierarchical Range Histogram Constructions
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Patch to the Future: Unsupervised Visual Prediction
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Personalized Abstraction of Broadcasted American Football Video by Highlight Selection Noboru Babaguchi (Professor at Osaka Univ.) Yoshihiko Kawai and.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
DMEC Neurons firing Black trace is the rat’s trajectory. Red dots are spikes recorded from one neuron. Eventually a hexagonal activity pattern emerges.
Unsupervised Feature Selection for Multi-Cluster Data Deng Cai et al, KDD 2010 Presenter: Yunchao Gong Dept. Computer Science, UNC Chapel Hill.
ICME 2008 Huiying Liu, Shuqiang Jiang, Qingming Huang, Changsheng Xu.
A New Force-Directed Graph Drawing Method Based on Edge- Edge Repulsion Chun-Cheng Lin and Hsu-Chen Yen Department of Electrical Engineering, National.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Temporal Pattern Matching of Moving Objects for Location-Based Service GDM Ronald Treur14 October 2003.
SSCP: Mining Statistically Significant Co-location Patterns Sajib Barua and Jörg Sander Dept. of Computing Science University of Alberta, Canada.
CVPR 2006 New York City Granularity and Elasticity Adaptation in Visual Tracking Ming Yang, Ying Wu NEC Laboratories America Cupertino, CA 95014
An Experimental Evaluation on Reliability Features of N-Version Programming Xia Cai, Michael R. Lyu and Mladen A. Vouk ISSRE’2005.
Tracking Video Objects in Cluttered Background
COMP 290 Computer Vision - Spring Motion II - Estimation of Motion field / 3-D construction from motion Yongjik Kim.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Data Mining – Intro.
1 Dot Plots For Time Series Analysis Dragomir Yankov, Eamonn Keogh, Stefano Lonardi Dept. of Computer Science & Eng. University of California Riverside.
Chapter 1: Introduction to Statistics
Unit 2: Engineering Design Process
Building a Visual Summary of Multiple Trajectories Natalia Andrienko & Gennady Andrienko
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
Presented by Tienwei Tsai July, 2005
FlowString: Partial Streamline Matching using Shape Invariant Similarity Measure for Exploratory Flow Visualization Jun Tao, Chaoli Wang, Ching-Kuang Shene.
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
3D SLAM for Omni-directional Camera
Timothy Reeves: Presenter Marisa Orr, Sherrill Biggers Evaluation of the Holistic Method to Size a 3-D Wheel/Soil Model.
Evaluating a Research Report
An Introduction to Measurement and Evaluation Emily H. Wughalter, Ed.D. Summer 2010 Department of Kinesiology.
Texture. Texture is an innate property of all surfaces (clouds, trees, bricks, hair etc…). It refers to visual patterns of homogeneity and does not result.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
Temporal Analysis of Platelet Data in Chronic Viral Hepatitis Dataset Shoji HiranoShusaku Tsumoto Department of Medical.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
ICDM 2003 Review Data Analysis - with comparison between 02 and 03 - Xindong Wu and Alex Tuzhilin Analyzed by Shusaku Tsumoto.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
Robustness of complex networks with the local protection strategy against cascading failures Jianwei Wang Adviser: Frank,Yeong-Sung Lin Present by Wayne.
Mingyang Zhu, Huaijiang Sun, Zhigang Deng Quaternion Space Sparse Decomposition for Motion Compression and Retrieval SCA 2012.
Motion Analysis using Optical flow CIS750 Presentation Student: Wan Wang Prof: Longin Jan Latecki Spring 2003 CIS Dept of Temple.
Cluster-specific Named Entity Transliteration Fei Huang HLT/EMNLP 2005.
21 June 2009Robust Feature Matching in 2.3μs1 Simon Taylor Edward Rosten Tom Drummond University of Cambridge.
The E ngineering Design Process Foundations of Technology The E ngineering Design Process © 2013 International Technology and Engineering Educators Association,
Extracting features from spatio-temporal volumes (STVs) for activity recognition Dheeraj Singaraju Reading group: 06/29/06.
Measurement and Scaling
Segmentation of Vehicles in Traffic Video Tun-Yu Chiang Wilson Lau.
School of Computer Science 1 Information Extraction with HMM Structures Learned by Stochastic Optimization Dayne Freitag and Andrew McCallum Presented.
A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.
Pairwise sequence alignment Lecture 02. Overview  Sequence comparison lies at the heart of bioinformatics analysis.  It is the first step towards structural.
Data Mining and Decision Support
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
AQUARadar Identification of temporally stable Z-R relationships using measurements of micro-rain radars M. Clemens (1), G. Peters (1), J. Seltmann (2),
Tracking Groups of People for Video Surveillance Xinzhen(Elaine) Wang Advisor: Dr.Longin Latecki.
Introduction to Scale Space and Deep Structure. Importance of Scale Painting by Dali Objects exist at certain ranges of scale. It is not known a priory.
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Exploring Spatial-Temporal Trajectory Model for Location.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Event Tactic Analysis Based on Broadcast Sports Video Guangyu Zhu, Changsheng Xu, Senior Member, IEEE, Qingming Huang, Member, IEEE, Yong Rui, Senior Member,
Scale-Space Representation of 3D Models and Topological Matching
An Introduction to Correlational Research
CHAPTER 7: Information Visualization
Group 9 – Data Mining: Data
Presented by Xu Miao April 20, 2005
Low-Rank Sparse Feature Selection for Patient Similarity Learning
Presentation transcript:

Clustering of Trajectory Data obtained from Soccer Game Record -A First Step to Behavioral Modeling Shoji Hirano Shusaku Tsumoto Dept of Medical Informatics, Shimane Univ. School of Medicine, Japan

Outline Introduction Data Structure Method Experimental Results Conclusions and Future Work

Introduction Clustering of Spatio-temporal Data Provides a way to discover interesting characteristics about the motion of targets Related field: meteorology, medical image analysis, sports, crime research etc. Approaches Spatial clustering + temporal continuity trace (e.g. tracking of moving object) Spatial clustering based on temporal correlation (e.g. fMRI analysis) Spatial clustering + observation of the temporal changes of the clusters (e.g. Observation of the climate regimes)

Objective Development of a clustering method for trajectories with multiscale structural comparison scheme Compare trajectories according to both local and global views. Visualize common characteristics of trajectories Application: Clustering of trajectories of passes in soccer game records Discovery of interesting spatio-temporal patterns of passes which may reflect the strategy and tactics of the team Globally similar passes: strategy of the team -ex. Attack from right side Locally similar passes: tactics of the ream -x. Frequent use of one-two passes

Data Structure Soccer game records ( provided for research purpose by DataStadium Inc., Japan)

Data Structure Field geometry and Pass sequence X Y PASS start IN GOAL t

Pass sequence clustering: Problems Irregularly-sampled spatio-temporal sequence Data point is generated when a player takes an interaction with a ball High interaction -> Dense Data Low interaction -> Sparse Data Need for Multiscale Observation Strategy -> global pass feature Tactics -> local pass feature Both exist concurrently It is required to partly change comparison scale according to the granularity of data and type of events Sparse Dense

Trajectory Mining Preprocessing Segmentation and Generation of Multiscale Trajectories Segment Hierarchy Trace and Matching Calculation of Dissimilarities Clustering of Trajectories

Method: Multiscale Matching A pattern matching method that compares structural similarity of planar curves across multiple observation scales Able to compare objects by partly changing observation scales Simultaneously compare both global and local similarities Sequence A Sequence B Scale  Matched Pairs segment

Multiscale Description (Witkin et al 1984, Mokhatan et al. 1986) Describe convex/concave structure at multiple scales Sequence description : t : course parameter Sequence x(t) at scale  Scale  controls the degree of smoothing  = small : local feature,  large : global feature Scale 

Multiscale Matching based on Convex/Concave Structure of Segments (Ueda et al. 1990) Segment: Partial sequence between adjacent inflection points Curvature K (t,  ) at scale  Inflection point: Represent a sequence as a set of segments Scale 

Matching Procedure Sequence A Sequence B Inflection Points Scale 0Scale 1Scale 2 IN GOAL A 0 (0) A 1 (0) A 2 (0) A 3 (0) A 4 (0) B 2 (0) B 0 (0) B 5 (0) B 6 (0) B 3 (0) B 4 (0) B 1 (0) B 0 (1) B 1 (1) B 2 (1) B 3 (1) B 4 (1) B 0 (2) B 1 (2) B 2 (2) A 0 (1) A 2 (1) A 1 (1) A 0 (2) A 2 (2) A 1 (2) t

Segment Dissimilarity Dissimilarity of Segments Dissimilarity of sequences Length Segment ai(k) Segment bi(j) Rotation Angle Max(, ) P: the number of matched pairs

Indiscernibility-based Clustering: Overview 1.Assignment of initial equivalence relations (ERs) Assign an initial ER to each of the N objects. An ER independently performs binary classification, similar or dissimilar, based on the relative proximity. Indiscernible objects under all of the N ERs form a cluster. 2.Iterative refinement of initial ERs For each pair of objects, count the ratio of ERs that have ability to discriminate them (indiscernibility degree) If the number is small, assume that these ERs give too fine classification and disable their discrimination ability Iterate step2 until the clusters become stable

Experiments Data Game records of FIFA WorldCup 2002 (64 games, including all heats and finals) Number of goals: 168 (own goals excluded) Procedure Select series containing ‘ IN GOAL ’ event, and generate a total of 168 trajectories of 2-D ball location. For every possible pair of the trajectories, calculate dissimilarity by using multiscale matching. Group the trajectories by using the obtained dissimilarities and indiscernibility-based clustering

Experimental Results Cluster Constitution ClusterCases ClusterCases Note: 55.2% (7839/14196) of triplet in the dissimilarity matrix did not satisfy the triangular inequality due to matching failure

Experimental Results (cont ’ d) Cluster 1 (87 cases) Turkey vs Japan Italy vs Korea IN GOAL Matching Result Corner Kick – Goal Europe: 45, South America: 24, Asia: 9

Experimental Results (cont ’ d) Cluster 2 (24 cases) Poland vs Portugal Germany vs Cameroon IN GOAL Matching Result Complex Pass – Side attack- Goal Europe: 13, South America: 7, Asia: 3

Experimental Results (cont ’ d) Cluster 4 (16 cases) Slovenia vs Paraguay IN GOAL China vs Turkey Matching Result Side Change – Centering/Dribble – Goal Europe: 10, South America: 4, Africa: 2

Experimental Results (cont ’ d) Cluster 3 (17 cases) Side Change – Centering/Dribble – Goal (Intermediate cases between Cluster 2 and 4) Europe: 10, South America: 2, Africa: 2 Asia 2

Summary of Experimental Results Goal success patterns can be classified into 4 major groups (with 8 minor patterns) Patterns: complexity of pass sequences With additional information Dribble/Centering/Side change: European Style However, the differences are not statistically significant. Key is “Side Change” Players (Defenders) should take care of the other side of the ball movement. The higher complexity of pass transactions, the higher rate of goal success gains by side change.

Conclusions Presented a new scheme of spatio-temporal data mining Grouped similar patterns using multiscale comparison and indiscernibility-based clustering techniques. Visualized similar patterns using matching results. Application to real World Cup data: Grouping and visualization of interesting pass patterns: ex. Complex pass -> side attack -> goal

Future Work Technical Issues Numerical Evaluation Validation and improvement of segment dissimilarity measure; inclusion of event type to dissimilarity Apply the proposed method to all path series including non- ‘ IN GOAL ’ series Differences between success and failure are very small. This suggests that the patterns of soccer attack are simple. Apply the proposed method to medical environment Trajectories of Laboratory Examinations (IEEE ICDM06) Trajectories of Patients’ Movement: Patient Safety

Criteria for determining the best set of segment pairs Complete match; original sequence should be correctly formed by concatenating the selected segments without any overlaps or gaps Minimization of total segment difference Matching Criteria Overlap Gap P : Number of matched segment pairs A B a1 a2 a3 a4 a5 b1 b2 b3 b4 b5 : dissimiarity of segments

Matching Failure Problem in MSM Theoretically, any sequence can finally become a single segment at enough high scales. Therefore, any pair of sequences should be successfully matched. Practically, there should be an upper limit of scales in order to reduce computational complexity. Therefore, the number of segments can be different even at the highest scales. If matching is not successful, the method should return infinite dissimilarity or a magic value that indicates matching failure. Scale 1 Scale 2 Scale n no-match match

Trajectory Mining Preprocessing Segmentation and Generation of Multiscale Trajectories Segment Hierarchy Trace and Matching Calculation of Dissimilarities Clustering of Trajectories