Tremor Detection Using Motion Filtering and SVM Bilge Soran, Jenq-Neng Hwang, Linda Shapiro, ICPR, 2012 11/16/2018.

Slides:



Advertisements
Similar presentations
Face Recognition Sumitha Balasuriya.
Advertisements

Bayesian Decision Theory Case Studies
CVPR2013 Poster Modeling Actions through State Changes.
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
RGB-D object recognition and localization with clutter and occlusions Federico Tombari, Samuele Salti, Luigi Di Stefano Computer Vision Lab – University.
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.
1 Challenge the future HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences Omar Oreifej Zicheng Liu CVPR 2013.
Clustering & image segmentation Goal::Identify groups of pixels that go together Segmentation.
Lecture 07 Segmentation Lecture 07 Segmentation Mata kuliah: T Computer Vision Tahun: 2010.
Object Inter-Camera Tracking with non- overlapping views: A new dynamic approach Trevor Montcalm Bubaker Boufama.
Automatic in vivo Microscopy Video Mining for Leukocytes * Chengcui Zhang, Wei-Bang Chen, Lin Yang, Xin Chen, John K. Johnstone.
Motion Detection And Analysis Michael Knowles Tuesday 13 th January 2004.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Segmentation Divide the image into segments. Each segment:
Rodent Behavior Analysis Tom Henderson Vision Based Behavior Analysis Universitaet Karlsruhe (TH) 12 November /9.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
On the Use of Computable Features for Film Classification Zeeshan Rasheed,Yaser Sheikh Mubarak Shah IEEE TRANSCATION ON CIRCUITS AND SYSTEMS FOR VIDEO.
Highlights Lecture on the image part (10) Automatic Perception 16
Multi-camera Video Surveillance: Detection, Occlusion Handling, Tracking and Event Recognition Oytun Akman.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
The Recognition of Human Movement Using Temporal Templates Liat Koren.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
ICBV Course Final Project Arik Krol Aviad Pinkovezky.
Chapter 2. Image Analysis. Image Analysis Domains Frequency Domain Spatial Domain.
Machine Vision for Robots
8D040 Basis beeldverwerking Feature Extraction Anna Vilanova i Bartrolí Biomedical Image Analysis Group bmia.bmt.tue.nl.
Presented by Tienwei Tsai July, 2005
8D040 Basis beeldverwerking Feature Extraction Anna Vilanova i Bartrolí Biomedical Image Analysis Group bmia.bmt.tue.nl.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
1 Recognition by Appearance Appearance-based recognition is a competing paradigm to features and alignment. No features are extracted! Images are represented.
Multimodal Information Analysis for Emotion Recognition
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
1 Webcam Mouse Using Face and Eye Tracking in Various Illumination Environments Yuan-Pin Lin et al. Proceedings of the 2005 IEEE Y.S. Lee.
資訊工程系智慧型系統實驗室 iLab 南台科技大學 1 A Static Hand Gesture Recognition Algorithm Using K- Mean Based Radial Basis Function Neural Network 作者 :Dipak Kumar Ghosh,
ECE 8443 – Pattern Recognition EE 3512 – Signals: Continuous and Discrete Objectives: Spectrograms Revisited Feature Extraction Filter Bank Analysis EEG.
Motion Analysis using Optical flow CIS750 Presentation Student: Wan Wang Prof: Longin Jan Latecki Spring 2003 CIS Dept of Temple.
Human pose recognition from depth image MS Research Cambridge.
Motivation Increase bandwidth of BCI. Reduce training time Use non invasive technique.
Extracting features from spatio-temporal volumes (STVs) for activity recognition Dheeraj Singaraju Reading group: 06/29/06.
Segmentation of Vehicles in Traffic Video Tun-Yu Chiang Wilson Lau.
Counting How Many Words You Read
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Lecture 9 Feature Extraction and Motion Estimation Slides by: Michael Black Clark F. Olson Jean Ponce.
Course 5 Edge Detection. Image Features: local, meaningful, detectable parts of an image. edge corner texture … Edges: Edges points, or simply edges,
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Over the recent years, computer vision has started to play a significant role in the Human Computer Interaction (HCI). With efficient object tracking.
Bayesian Decision Theory Case Studies CS479/679 Pattern Recognition Dr. George Bebis.
IMAGE PROCESSING APPLIED TO TRAFFIC QUEUE DETECTION ALGORITHM.
Student Gesture Recognition System in Classroom 2.0 Chiung-Yao Fang, Min-Han Kuo, Greg-C Lee, and Sei-Wang Chen Department of Computer Science and Information.
Video object segmentation and its salient motion detection using adaptive background generation Kim, T.K.; Im, J.H.; Paik, J.K.;  Electronics Letters 
Course : T Computer Vision
Face Detection EE368 Final Project Group 14 Ping Hsin Lee
Distinctive Image Features from Scale-Invariant Keypoints
Action Recognition in the Presence of One
Motion Detection And Analysis
Dynamical Statistical Shape Priors for Level Set Based Tracking
Computer Vision Lecture 5: Binary Image Processing
A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.
In summary C1={skin} C2={~skin} Given x=[R,G,B], is it skin or ~skin?
Context-Aware Modeling and Recognition of Activities in Video
Presented by :- Vishal Vijayshankar Mishra
Object Recognition Today we will move on to… April 12, 2018
Parallelization of Sparse Coding & Dictionary Learning
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
Marian Stewart Bartlett, Gwen C. Littlewort, Mark G. Frank, Kang Lee 
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
Sign Language Recognition With Unsupervised Feature Learning
Presentation transcript:

Tremor Detection Using Motion Filtering and SVM Bilge Soran, Jenq-Neng Hwang, Linda Shapiro, ICPR, 2012 11/16/2018

Tremor Detection Using Motion Filtering and SVM [Soran et al Tremor Detection Using Motion Filtering and SVM [Soran et al., ICPR’12] Problem: Automatic hand tremor detection using a static camera Motivation: Providing doctors objective information for diagnosis The problem we are interested in this work is automatic hand detection using a static camera. We believe that such as system can be very beneficial to doctors by providing objective information for diagnosis purposes. 11/16/2018

Related Work [Garg et al., 2009] categorize the hand gesture recognition approaches into 3D model-based and appearance-based. [Murthy and Jadon, 2009] adopted an appearance based approach based on low-level hand features from intensity images. [Uhrikova et al., 2009] measures the hand tremor frequencies. They compared their calculated frequencies with those measured by an accelerometer. [Aggarwal and Ryoo, 2011] reduces the problem to object matching when actions are represented as 3-D (X, Y, T) volumes. The most similar work to ours was done by Uhrikova et al., who also worked on hand tremors, but their purpose was to … A straightforward method used in appearance-based approaches is to extract skin-colored regions from the image. Another method is to use the eigenspace to produce a compact description of a large set of high-dimensional data using a small set of eigenvectors [44]. Aggarwal et al. suggests that if each action is represented by 3-D (X, Y, T) volumes, they can be recognized by solving an object matching problem. Uhrikova instead of classifying tremor instances from non-tremors 11/16/2018

Method Personalized Skin Detection & Blob Extraction Optical Flow Calculation & Feature Extraction Classification with SVM Background and Shadow Removal -> 11/16/2018

Video of a patient waving his hand for ~3 sec Method: Personalized Skin Detection & Blob Extraction Optical Flow Calculation & Feature Extraction Classification with SVM Input: Video of a patient waving his hand for ~3 sec Method: Apply skin color thresholds Compute optical-flow and extract moving regions Merge regions and compute color histograms Fit Gaussian function to use as personalized skin model Perform Blob Detection, discard small regions. Maybe converted to a chart Blob Detection is performed, regions smaller than some size are discarded. 11/16/2018

Optical Flow Calculation & Feature Extraction Personalized Skin Detection & Blob Extraction Optical Flow Calculation & Feature Extraction Classification with SVM For each blob compute optical-flow direction and direction change using 3 consecutive frames: 85 ° 87° +2 60° 77° +17 … … Calculate sum of motion direction change: +19 The sign of the sum shows the general direction change of the flow: up or down. In a tremor movement, the main discriminative features are not the motion directions themselves, but the motion directional changes between consecutive frames. While directional features are sensitive to rotation, directional change features are more Rotation invariant. Note that while one part of the hand can move upward, another part can move downward, and a tremor can exist in any region of a hand. We assume (and have observed through experiments) that if a tremor occurs, the directional change of the tremor dominates all directional changes that can exist in any region of the hand. Moreover, not all hand regions exhibit tremor behavior, and averaging the MDC features over the whole hand can smooth the values while decreasing the discriminative power of the features. Therefore, we need to identify the hand regions that exhibit significant MDCs related to the tremor. For this purpose, the MDC values in the hand blob are sorted from the most positive to the most negative. If the sum of the MDC in the hand blob is positive, we take the average of the top P% of the sorted values as our representative MDC; otherwise, if it is negative, we take the average of the bottom P% of the sorted values. The result of this process gives us the representative feature component of one frame. The procedure is repeated until all the frames in the video are processed, to get a 118-dimensional feature vector, out of 120 frames, which represents the temporal evolution of dominating motion directional changes. To detect tremor, the frequency of the MDC is more useful. Therefore, we apply the Discrete Cosine Transform (DCT), as defined in Equation 1, to the feature vectors.

This becomes our feature vector from one video +32.28 -25.41 … Sort the direction changes descending. If sum is (-) use average direction change of the last 25% points, else use that of the first 25% points as the feature component. Start the process from the next frame with a new blob extraction, since optical flow accumulates error. Apply Discrete Cosine Transform Apply MDL Discretization [Fayyad and Irani, 1993] obtaining final Motion Direction Change (MDC) features. The feature vector from each video has one number for each three frames. DCT because, frequency of MDC is more helpful for discriminating tremors from other periodic movements. MDL: recursive entropy minimization heuristic and couple this with minimum description length criterion to control the number of intervals produced over continuous space. Minimum description length of an object is defined to be the minimum number of bits required to uniquely specify that object out of the universe of all objects.

Dataset 90 positive (simulating tremor), 83 negative Static camera, 30 fps Indoors, static background 6 human subjects Negative (ordinary movements) Some videos contain challenging movements or very subtle motions 11/16/2018

% of MDC Features from Hand Region Personalized Skin Detection & Blob Extraction Optical Flow Calculation & Feature Extraction Classification with SVM % of MDC Features from Hand Region LOOCV 10 % 79.2 % 10 % with MDL discretization 89.6 % 25 % 86.7 % 25 % with MDL discretization 95.4 % 11/16/2018

Discussion False negative rate is close to zero Requires a single short video, minimizes diagnostic discomfort Automatically detects hand tremors with 95.4% accuracy using LOOCV Distinguishes very subtle motions and tremor-like movements from tremors False negative rate is close to zero , such as tapping fingers or waving hand very quickly, from tremor movements. Although tremor detection is a hard problem, in this pilot study, with the described methodology a A dataset of simulated tremor/non-tremor hand motions is prepared and the methodology tested on that. False negatives are close to zero, which means, tremors are detected almost 100%. The error usually results from false positives, which means false signaling of tremor. The negative dataset consists of many challenging videos. This is good for handling real cases. It’s probable that the highly accurate segmentation of the videos has also contributed to this high accuracy. 11/16/2018

11/16/2018