Head Tracking Using Video Analytics Xuan Wang 1, Yuhen Hu 1, Robert G. Radwin 2, John D. Lee 2 University of Wisconsin – Madison 1 Dept. Electrical and.

Slides:



Advertisements
Similar presentations
Slide 1 Bayesian Model Fusion: Large-Scale Performance Modeling of Analog and Mixed- Signal Circuits by Reusing Early-Stage Data Fa Wang*, Wangyang Zhang*,
Advertisements

Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
FixtureFinder: Discovering the Existence of Electrical and Water Fixtures Vijay Srinivasan*, John Stankovic, Kamin Whitehouse University of Virginia *(Currently.
Electrical & Computer Engineering Dept. University of Patras, Patras, Greece Evangelos Skodras Nikolaos Fakotakis.
Patch to the Future: Unsupervised Visual Prediction
Ivan Laptev IRISA/INRIA, Rennes, France September 07, 2006 Boosted Histograms for Improved Object Detection.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Learning to estimate human pose with data driven belief propagation Gang Hua, Ming-Hsuan Yang, Ying Wu CVPR 05.
Robust Object Tracking via Sparsity-based Collaborative Model
Learning Semantic Scene Models From Observing Activity in Visual Surveillance Dimitios Makris and Tim Ellis (2005) Presented by Steven Wilson.
Oklahoma State University Generative Graphical Models for Maneuvering Object Tracking and Dynamics Analysis Xin Fan and Guoliang Fan Visual Computing and.
The Viola/Jones Face Detector Prepared with figures taken from “Robust real-time object detection” CRL 2001/01, February 2001.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Tracking Objects with Dynamics Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/21/15 some slides from Amin Sadeghi, Lana Lazebnik,
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.
Cluster Analysis.  What is Cluster Analysis?  Types of Data in Cluster Analysis  A Categorization of Major Clustering Methods  Partitioning Methods.
1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.
Student: Hsu-Yung Cheng Advisor: Jenq-Neng Hwang, Professor
Presented by Zeehasham Rasheed
Background Subtraction for Urban Traffic Monitoring using Webcams Master Graduation Project Final Presentation Supervisor: Rein van den Boomgaard Mark.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Non-invasive Techniques for Human Fatigue Monitoring Qiang Ji Dept. of Electrical, Computer, and Systems Engineering Rensselaer Polytechnic Institute
Non-invasive Techniques for Human Fatigue Monitoring Qiang Ji Dept. of Electrical, Computer, and Systems Engineering Rensselaer Polytechnic Institute
Introduction to Machine Learning Approach Lecture 5.
Face Recognition and Retrieval in Video Basic concept of Face Recog. & retrieval And their basic methods. C.S.E. Kwon Min Hyuk.
Real Time Abnormal Motion Detection in Surveillance Video Nahum Kiryati Tammy Riklin Raviv Yan Ivanchenko Shay Rochel Vision and Image Analysis Laboratory.
DVMM Lab, Columbia UniversityVideo Event Recognition Video Event Recognition: Multilevel Pyramid Matching Dong Xu and Shih-Fu Chang Digital Video and Multimedia.
Computer vision: models, learning and inference Chapter 6 Learning and Inference in Vision.
Optimal Placement and Selection of Camera Network Nodes for Target Localization A. O. Ercan, D. B. Yang, A. El Gamal and L. J. Guibas Stanford University.
Facial Feature Detection
Tracking Pedestrians Using Local Spatio- Temporal Motion Patterns in Extremely Crowded Scenes Louis Kratz and Ko Nishino IEEE TRANSACTIONS ON PATTERN ANALYSIS.
BraMBLe: The Bayesian Multiple-BLob Tracker By Michael Isard and John MacCormick Presented by Kristin Branson CSE 252C, Fall 2003.
Tracking by Sampling Trackers Junseok Kwon* and Kyoung Mu lee Computer Vision Lab. Dept. of EECS Seoul National University, Korea Homepage:
LMDD : L ight-weight M agnetic-based D oor D etection with Your Smartphone Yiyang Zhao Tsinghua University Chen Qian University of Kentucky Liangyi Gong.
A General Framework for Tracking Multiple People from a Moving Camera
3D SLAM for Omni-directional Camera
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
An Information Fusion Approach for Multiview Feature Tracking Esra Ataer-Cansizoglu and Margrit Betke ) Image and.
Supervised Learning of Edges and Object Boundaries Piotr Dollár Zhuowen Tu Serge Belongie.
A Comparative Evaluation of Three Skin Color Detection Approaches Dennis Jensch, Daniel Mohr, Clausthal University Gabriel Zachmann, University of Bremen.
Using Inactivity to Detect Unusual behavior Presenter : Siang Wang Advisor : Dr. Yen - Ting Chen Date : Motion and video Computing, WMVC.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
Stable Multi-Target Tracking in Real-Time Surveillance Video
Face Detection Ying Wu Electrical and Computer Engineering Northwestern University, Evanston, IL
Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.
Sparse Bayesian Learning for Efficient Visual Tracking O. Williams, A. Blake & R. Cipolloa PAMI, Aug Presented by Yuting Qi Machine Learning Reading.
Paper Reading Dalong Du Nov.27, Papers Leon Gu and Takeo Kanade. A Generative Shape Regularization Model for Robust Face Alignment. ECCV08. Yan.
Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.
Data Mining, ICDM '08. Eighth IEEE International Conference on Duy-Dinh Le National Institute of Informatics Hitotsubashi, Chiyoda-ku Tokyo,
Delivering Business Value through IT Face feature detection using Java and OpenCV 1.
Improved Object Detection
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Person Following with a Mobile Robot Using Binocular Feature-Based Tracking Zhichao Chen and Stanley T. Birchfield Dept. of Electrical and Computer Engineering.
Classification Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 24, 2015.
SUMMERY 1. VOLUMETRIC FEATURES FOR EVENT DETECTION IN VIDEO correlate spatio-temporal shapes to video clips that have been automatically segmented we.
Face Detection 蔡宇軒.
Shape2Pose: Human Centric Shape Analysis CMPT888 Vladimir G. Kim Siddhartha Chaudhuri Leonidas Guibas Thomas Funkhouser Stanford University Princeton University.
Date of download: 7/8/2016 Copyright © 2016 SPIE. All rights reserved. A scalable platform for learning and evaluating a real-time vehicle detection system.
Experience Report: System Log Analysis for Anomaly Detection
Evaluating Classifiers
Attention Tracking Tool
Interactive Offline Tracking for Color Objects
Collective Network Linkage across Heterogeneous Social Platforms
estimated tracklet partition
Video Summarization via Determinantal Point Processes (DPP)
Approaching an ML Problem
Report 2 Brandon Silva.
Presentation transcript:

Head Tracking Using Video Analytics Xuan Wang 1, Yuhen Hu 1, Robert G. Radwin 2, John D. Lee 2 University of Wisconsin – Madison 1 Dept. Electrical and Computer Engineering 2 Dept. Industrial and Systems Engineering GLOBALSIP 2015, Orlando, FL

Backgrounds Related works Proposed approach –Spurious face elimination –Head tracking Experiment result Conclusion 2 Outline

Driver Distraction: –Texting, phone calling, dozing, passenger interruption, eating, drinking, maneuvering gadgets, etc. A major cause of highway accidents! 3 Driver Distraction

Program: Naturalistic Driving Study (NDS) Sponsor: US Federal Highway Administration Goal: To study driver distraction and attentiveness during driving using continuous video monitoring. Tool: Over two petabytes of video recordings of 3,400 people driving vehicles has been collected A typical video frame: 4 Naturalistic Driving Study

Big data Identify factors of distraction-related crashes Manually coding cannot scale up Task –Video analytic algorithms Goals 5 Video Analytics: Quantifying Driver Distraction Distraction Level: 9 Causal relation between distraction & accidents Driver status: Head turned back

Camera fixed: Challenge: Video quality is often low! 6 Specific Task: Driver’s Head Tracking

Related Works “Visual Tracking Decomposition” – Junseok Kwon, 2010 “Structured Output Tracking” – Sam Hare, 2011 “Sparsity-based Collaborative Model” – Wei Zhong, 2012 “Tracking by Sampling Trackers” – Kyoung Mu Lee, 2011 “HyHope System”- Murphy-chutorian, 2008 The prior observation, requirement of video quality, computation load required by these methods motivate us to propose our video analytics approach. 7

Features of our approach: –Uses statistical information of head detections of whole video –Uses temporal correlations between successive frames Implementation of our approach: –A modular based, multi-step video analytic work-flow 8 Proposed Approach Video Face Detection Head tracking Spurious Face Elimination Anno- tation 2 Anno- tation 1 Anno- tation 3 Head Pose Estimation

Viola-Jones Frontal Face Detection Video Face Detectio n Head tracking Spurious Face Eliminatio n Annot ation 2 Annot ation 1 Annot ation 3 Head Pose Estimatio n 9

Approach: –It adopts adaboost to construct Haar feature classifier and build cascaded classifiers from training dataset to detect objects. Advantages: –Good performance –Existing implementation Shortcomings: –Only detect frontal face –Performance not easily estimated An example of different types of failures: 10 Viola-Jones Face Detector

Spurious Face Elimination Video Face Detectio n Head tracking Spurious Face Eliminatio n Annot ation 2 Annot ation 1 Annot ation 3 Head Pose Estimatio n 11

Spurious faces: An un-supervised learning problem Clustering is based on 2 key features: –Distribution model of head position of the whole video –Gaussian distribution of head positions displacement 12 Automated Spurious Face Elimination

Driver’s Head Position Analysis Head position distribution –Clustered –Not circular symmetric –k-means not suitable –DBSCAN feasible Changes of head positions –Gaussian distributed 13

Density-Based Spatial Clustering of Applications with Noise Basics of algorithm: Advantages: –No requirement of k –No requirement of shape "DBSCAN-Illustration" by Chire - Own work. Licensed under CC BY-SA 3.0 via Commons - Illustration.svg#/media/File:DBSCAN-Illustration.svg 14 DBSCAN

15 Spurious Face Elimination: DBSCAN vs Kmeans

Head Tracking Video Face Detectio n Head tracking Spurious Face Eliminatio n Annot ation 2 Annot ation 1 Annot ation 3 Head Pose Estimatio n 16

Video Analysis –Head position displacement –Template changes little between successive frames 17 Head Tracking Template Matching Approach -- A Bayesian Estimation – Search range (Prior) – Template matching (max. likelihood)

24 challenging-quality video clips are collected. 18 Experiment Data

Experiment Evaluation Criteria Precision = TP/(TP + FP) –(a.k.a. positive predicted value) –It measures fraction of positively identified driver’s head positions being true (correctness). Recall = TP/(TP + FN) –(a.k.a. sensitivity) –It measures fraction of driver’s head positions detected out of all true driver’s head positions (Completeness). 19

Experiment Results P GT HeadNot Head Head Not Head 2391 P GT HeadNot Head Head Not Head 0114 P GT HeadNot Head Head Not Head 0114 Face Detection Spurious Face Elimination Head Tracking Precision: 99.27% Recall: 28.09% Precision: 100% Recall: 31.50% Precision: 100% Recall: 88.24% Confusion Matrix of selected 24 challenging-quality video clips. P is for prediction, GT is for ground truth. 20

Conclusion Our algorithm handles most of challenges of the NDS video. Computationally efficient for statistical inference in a batch mode for video clips with variable quality. This algorithm is new and outstanding to handle big video databases. Further work: improve recall by modified template matching. 21

22 Thank you!