Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu

Slides:

Advertisements

Similar presentations

Real-Time Detection, Alignment and Recognition of Human Faces

Advertisements

1 Hierarchical Part-Based Human Body Pose Estimation * Ramanan Navaratnam * Arasanathan Thayananthan Prof. Phil Torr * Prof. Roberto Cipolla * University.

EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.

Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.

Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.

Database-Based Hand Pose Estimation CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.

Online Multiple Classifier Boosting for Object Tracking Tae-Kyun Kim 1 Thomas Woodley 1 Björn Stenger 2 Roberto Cipolla 1 1 Dept. of Engineering, University.

Joint Eye Tracking and Head Pose Estimation for Gaze Estimation

Face Alignment at 3000 FPS via Regressing Local Binary Features

AdaBoost & Its Applications

Forward-Backward Correlation for Template-Based Tracking Xiao Wang ECE Dept. Clemson University.

Robust Object Tracking via Sparsity-based Collaborative Model

Face detection Many slides adapted from P. Viola.

EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.

Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.

Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1, Lehigh University.

Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.

Detecting Pedestrians by Learning Shapelet Features

Computer and Robot Vision I

Robust Moving Object Detection & Categorization using self- improving classifiers Omar Javed, Saad Ali & Mubarak Shah.

Real-Time Non-Rigid Shape Recovery via AAMs for Augmented Reality Jackie Zhu Oct. 24, 2006.

Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson

Viewpoint Tracking for 3D Display Systems A look at the system proposed by Yusuf Bediz, Gözde Bozdağı Akar.

Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.

Computer vision: models, learning and inference Chapter 6 Learning and Inference in Vision.

Person Detection and Tracking using Binocular Lucas-Kanade Feature Tracking and K-means Clustering Chris Dunkel Committee: Dr. Stanley Birchfield, Committee.

Autonomous Learning of Object Models on Mobile Robots Xiang Li Ph.D. student supervised by Dr. Mohan Sridharan Stochastic Estimation and Autonomous Robotics.

EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.

Automatic Registration of Color Images to 3D Geometry Computer Graphics International 2009 Yunzhen Li and Kok-Lim Low School of Computing National University.

KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.

A General Framework for Tracking Multiple People from a Moving Camera

“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)

A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The.

Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.

Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.

Face detection Slides adapted Grauman & Liebe’s tutorial

Pedestrian Detection and Localization

Supervised Learning of Edges and Object Boundaries Piotr Dollár Zhuowen Tu Serge Belongie.

A Comparative Evaluation of Three Skin Color Detection Approaches Dennis Jensch, Daniel Mohr, Clausthal University Gabriel Zachmann, University of Bremen.

ECE738 Advanced Image Processing Face Detection IEEE Trans. PAMI, July 1997.

BAGGING ALGORITHM, ONLINE BOOSTING AND VISION Se – Hoon Park.

MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.

Real-Time Detection, Alignment and Recognition of Human Faces Rogerio Schmidt Feris Changbo Hu Matthew Turk Pattern Recognition Project June 12, 2003.

Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions Bangpeng Yao and Li Fei-Fei Computer Science Department, Stanford.

The Viola/Jones Face Detector A “paradigmatic” method for real-time object detection Training is slow, but detection is very fast Key ideas Integral images.

HIGH PERFORMANCE OBJECT DETECTION BY COLLABORATIVE LEARNING OF JOINT RANKING OF GRANULES FEATURES Chang Huang and Ram Nevatia University of Southern California,

Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.

Notes on HW 1 grading I gave full credit as long as you gave a description, confusion matrix, and working code Many people’s descriptions were quite short.

Max-Confidence Boosting With Uncertainty for Visual tracking WEN GUO, LIANGLIANG CAO, TONY X. HAN, SHUICHENG YAN AND CHANGSHENG XU IEEE TRANSACTIONS ON.

Face detection Many slides adapted from P. Viola.

ICCV 2007 Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1,

Face Detection 蔡宇軒.

Learning Image Statistics for Bayesian Tracking Hedvig Sidenbladh KTH, Sweden Michael Black Brown University, RI, USA

Computer vision: models, learning and inference

2. Skin - color filtering.

Guillaume-Alexandre Bilodeau

Krishna Kumar Singh, Yong Jae Lee University of California, Davis

Nearest-neighbor matching to feature database

Compositional Human Pose Regression

LOCUS: Learning Object Classes with Unsupervised Segmentation

Paper Presentation: Shape and Matching

Real-Time Human Pose Recognition in Parts from Single Depth Image

Unsupervised Face Alignment by Robust Nonrigid Mapping

Object detection as supervised classification

A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.

Nearest-neighbor matching to feature database

Efficient Deformable Template Matching for Face Tracking

Brief Review of Recognition + Context

Outline Background Motivation Proposed Model Experimental Results

Lecture 29: Face Detection Revisited

Presentation transcript:

Human Upper Body Pose Recognition Using Adaboost Template for Natural Human Robot Interaction Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu Institute for Infocomm Research (I2R), Singapore

Outline Introduction Related Works The Method The Problem Template Modeling Adaboost Template Recognition & Segmentation Experiments and Evaluations Conclusions

Introduction Motivations Difficulties for approaches on 2D images Upper body pose is one of important clues of human social behavior in natural conversation, especially multiple persons are involved in the conversation; A social robot has to be aware of various clues from human body for intelligent and natural human-robot-interaction; An important clues in our social robots for attention estimation and engagement management (direction, distance, motion state, upper body pose, face pose, gaze, etc.) Difficulties for approaches on 2D images Pose ambiguity due to the lost depth information and self-occlusion; Limited view of human objects when engaged in face-to-face interaction; Variations of human shapes, scales, clothes, poses, etc. Complexity of visual features due to lighting conditions, cluttered backgrounds, and crowded scenes.

Related Works Human Body Pose Recognition in Computer Vision 2D silhouette based approaches (e.g., Gavirla and Philomin, ICCV’09, Mittal, et al, IEEE AVSS’03, Dimitrijevic, et al, ICCV Workshop’05) 2D pictorial models (e.g., Ju, et al, FG’96, Felzenszwalb and Huttenlocher, IJCV 2005, Andriluka, et al, CVPR’09, Ferrari, et al, CVPR’09) 3D structure models (e.g., Taylor, CVIU 2000, Lee and Cohen, ECCV’04) Template Matching Deformable template matching (e.g., Cootes, Edwards, Taylor, Active Appearance Models) Object tracking (Yilmaz, et al, ACM CS 2006 (Survey)) Face detection (Yang, et al, IEEE T-PAMI 2002 (Survey)) Image registration (Zitova and Flusser, IVC 2003 (Survey)) Adaboost Learning in Vision Face detection (Viola & Jones, CVPR’01 (Cascade Classifiers)) Multi-view face detection (Huang, et al., IEEE T-PAMI 2007 (Vector Boosting Algorithm)) Multiclass object detection with shared features (Torralba, et al, CVPR’04 (Joint Boosting Algorithm)) Online tracking (e.g., Avidan, “Ensemble Tracking,” IEEE T-PAMI 2007)

Method: The Problem Problem formulation Challenges Classify the upper body poses into seven categories: views of 0°, ±30°, ±60°, and ±90° to the camera. Challenges The depth measures from disparity images are not accurate; Inter-class variations due to variations of human sizes, shapes, poses, and clothes; Inter-class variations due to human positions to the camera; Incompletion of disparity measures from body due to the lack of texture features.

Method: Template Modeling Learning the basic templates Learning the mean template for each category Learning the variance template for each category Learning the percentage template for each category

Method: Adaboost Template Definition of positive and negative regions Design of weak classifiers R+ R−

Method: Learning Adaboost learning algorithm Given Nc training samples for category c. Initialize: For t=1,…,T For each pixel x in the template Compute the error with respect to the distribution Dt Choose Tune the template boundary Update the distribution

Method: Recognition & Segmentation Adaptive model-driven segmentation: Quality level of disparity measurements Adaptively compensate for the missing disparity measurements

Experiments and Evaluations A New Benchmarking Data Set Camera: Videre Design STOC stereo camera. Data Set: 430 images from 19 individuals. Training samples: Randomly select 93 images of 8 persons from the data set, among them, 28 for 0° view, 13 for +30° view, 10 for -30° view, 11 for +60° view, 10 for -60° view, 11 for +90° view, and 10 for -90° view. Baseline Algorithm: Template matching: 3D surface template matching (Breitenstein, et al, “Real-Time Face Pose Estimation from Single Range Images”, CVPR’08) Distance: Let T(x) be a normalized input sample Recognition

Experiments and Evaluations Results on recognition: On average, the accuracy rate increased from 67.4% to 90.7%. Template Matching -90° -60° -30° 0° +30° +60° +90° 100% 60% 37.1% 2.9% 2.6% 31.6% 65.8% 1.75% 14.3% 78.6% 3.6% 61.0% 9.7% 29.3% 27.8% 72.2% Adaboost Template -90° -60° -30° 0° +30° +60° +90° 100% 3.4% 88% 6.9% 1.7% 3.3% 81.7% 15% 9.5% 87.3% 3.2% 5.2% 77.6% 17.2% 98.3%

Experiments and Evaluations Results on segmentation: Pose recognition Quality estimation Top-down segmentation

Application: Attention Estimation Deployed in a robot receptionist for attention estimation

Conclusions A new approach of human upper body pose recognition for human robot interaction A new template model: Adaboost template Easy for training (no need of negative samples) Achieve good balance between generality and specialties of training samples Both recognition and segmentation Deployed and tested on a robot receptionist for attention estimation and the management of engagement in dialogs which may involve multiple participants.

Thank You!