DDDAS: Stochastic Multicue Tracking of Objects with Many Degrees of Freedom PIs: D. Metaxas, A. Elgammal and V. Pavlovic Dept of CS, Rutgers University.


DDDAS: Stochastic Multicue Tracking of Objects with Many Degrees of Freedom
PIs: D. Metaxas, A. Elgammal and V. Pavlovic, Dept. of CS, Rutgers University
C. Neidle, Linguistics, Boston Univ.
C. Vogler, Gallaudet University

Goals and Objectives
Develop a vision-based system for the automated detection and analysis of nonverbal communication:
–ASL Analysis
–Biometrics Applications
–Medical Applications

DDDAS: Technical Approach
Approach:
a) Analysis of video from CCD cameras
b) Creation of a database for face, upper body and gait
c) Stochastic and deformable model-based algorithms for face and upper body analysis
d) Combination of continuous and discrete trackers
e) Data-driven model adaptability (co-training methods for model refinement and data improvement)
f) Data-driven error analysis for model switching

Technical Approach: Our System
[Figure: diagram of the developed system. Labels: Images, PC, Face and body analysis, Gait Analysis]

1) Dynamic Data-Driven 3D Hand Tracking
2D features:
–Edges
–Color
–Contour (boundaries)
–Optical flow
Continuous approach:
–Edges, optical flow
–Model-based: 2D => 3D forces, model fitting, articulation and refinement
–Use of a physics-based dynamic estimation approach
Discrete approach:
–Skin color, edges and contour, integrated into multi-frame descriptors
–Appearance-based: find the best-matching hand configuration in a database and use it as the solution for the current (input) frame
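The appearance-based lookup in the discrete approach can be pictured as a nearest-neighbor search over stored descriptors. The sketch below is only an illustration under assumptions that are not in the slides: the function name `nearest_configuration`, the Euclidean distance, the three-element descriptors and the configuration labels are all hypothetical.

```python
from math import sqrt

def nearest_configuration(descriptor, database):
    """Return (label, distance) of the stored entry closest to `descriptor`.

    `database` is a list of (label, descriptor) pairs. Euclidean distance is
    an assumption; the slides do not say which similarity measure is used.
    """
    best_label, best_dist = None, float("inf")
    for label, stored in database:
        dist = sqrt(sum((a - b) ** 2 for a, b in zip(descriptor, stored)))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label, best_dist

# Hypothetical database of hand configurations with made-up descriptors.
database = [
    ("open_palm", [0.9, 0.1, 0.4]),
    ("fist", [0.2, 0.8, 0.5]),
    ("point", [0.6, 0.3, 0.9]),
]

# The descriptor of the current frame is matched against every stored entry.
label, dist = nearest_configuration([0.25, 0.75, 0.5], database)
```

A linear scan is enough for a toy database; a large database of training views would presumably need an indexing structure, which is outside the scope of this sketch.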

Dynamic Data-Driven 3D Hand Tracking (cont.)
Track “continuously” for fast results and track “discretely” for model re-initialization
–We don’t lose track when continuous tracking fails under strong rotations and occlusions
–We can still track the hand faster than any discrete approach
Dynamic Data-Driven detection of switching between trackers
–Learn the mapping between the 2D error and the actual 3D error, using a database of continuous tracking results and Support Vector Regression
–Given the continuous tracking result for an input frame, extract the 2D error and infer the 3D error from the SV regression
–The system uses the error of fit to the data to make the switching decision dynamically
–The system updates its 2D-3D mapping based on new data (co-training methods), i.e., it updates the results of the SV regression
–Lagrangian Dynamics Inverse Nonlinear Problems
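The switching rule can be sketched as: learn a map from the observable 2D error to the 3D error, then fall back to the discrete tracker when the predicted 3D error is too large. In the sketch below a one-variable least-squares line stands in for Support Vector Regression so that the example stays dependency-free. It is not the regressor named in the slides, and the names `fit_line` and `choose_tracker`, the threshold and the training pairs are all hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares line y = slope * x + intercept.

    Stand-in for Support Vector Regression (an assumption made to keep the
    sketch self-contained). Returns a predictor function.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept

def choose_tracker(error_2d, predict_3d_error, max_3d_error):
    """Pick the discrete tracker when the predicted 3D error exceeds the limit."""
    if predict_3d_error(error_2d) > max_3d_error:
        return "discrete"
    return "continuous"

# Made-up training pairs: 2D fit error and the corresponding actual 3D error,
# as would be collected from a database of continuous tracking results.
errors_2d = [1.0, 2.0, 3.0, 4.0]
errors_3d = [2.0, 4.0, 6.0, 8.0]
predict = fit_line(errors_2d, errors_3d)

decision_small = choose_tracker(1.5, predict, max_3d_error=5.0)
decision_large = choose_tracker(3.5, predict, max_3d_error=5.0)
```

Updating the mapping with new data, as the last bullets describe, would amount to appending new error pairs and refitting the predictor.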

DDDAS: 3D Hand Tracking Examples

2) Dynamic Data-Driven 3D Face Tracking
Face detection
–Skin color database to learn the color distribution (data driven)
–Facial pixel detection based on the learned distribution
–Extract the face bounding box
Facial feature extraction in 2D
–ASM + KLT = discrete + continuous tracking in 2D = fast and never loses track
Dynamic Data-Driven facial model-based tracking and refinement
–Previous 3D solution + current 2D solution = current 3D solution
–Based on the current 2D features, refine the face model
–Based on the current 2D features, decide which ASM model is the right one to use
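The face detection steps (learn a skin color distribution, label pixels, extract a bounding box) can be sketched as follows. The color model here is an assumption: a per-channel mean and standard deviation with a tolerance band, chosen only for simplicity. The function names, the tolerance `k` and the sample values are hypothetical.

```python
from statistics import mean, pstdev

def learn_skin_model(samples):
    """Per-channel (mean, std) learned from example skin pixels (R, G, B)."""
    return [(mean(channel), pstdev(channel)) for channel in zip(*samples)]

def is_skin(pixel, model, k=2.5):
    """True if every channel lies within k standard deviations of its mean."""
    return all(abs(v - m) <= k * max(s, 1.0) for v, (m, s) in zip(pixel, model))

def face_bounding_box(image, model):
    """Return (top, left, bottom, right) of all skin pixels, or None if none."""
    coords = [(r, c) for r, row in enumerate(image)
              for c, px in enumerate(row) if is_skin(px, model)]
    if not coords:
        return None
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return min(rows), min(cols), max(rows), max(cols)

# Made-up skin samples standing in for the skin color database.
skin_samples = [(200, 150, 130), (210, 160, 140), (190, 140, 120)]
model = learn_skin_model(skin_samples)

# Tiny 4x4 test image: dark background with a 2x2 skin-colored patch.
background, skin = (20, 20, 20), (205, 155, 135)
image = [[background] * 4 for _ in range(4)]
for r in (1, 2):
    for c in (1, 2):
        image[r][c] = skin

box = face_bounding_box(image, model)
```

A single box around all skin pixels only works when the face is the only skin-colored region in view; separating the face from the hands would need connected-component analysis or similar, which this sketch omits.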

DDDAS: 3D Face Tracking Results

Data Collection and Applications so Far
ASL data collection in collaboration with linguists (Carol Neidle, BU)
–Based on single and multiple cameras
–Manual annotation of ASL data
Automated detection of faces
Automated model initialization (faces and hands)
Automated tracking and annotation of events (e.g., raising eyebrows)
Validation based on annotated data

Future Work
ASL
–Automated segmentation of fingerspelling vs. non-fingerspelling
–Linguistically important relationships between hand movement and facial expressions
–Eventual attempt to deal with analysis involving large vocabularies
Medical
–Automated analysis of stress in people
–Analysis of movement in autistic children
Theory
–Develop novel stochastic, learning and deformable modeling methods that can deal with bad data, moving backgrounds and very fast motions; in short, DDDAS models that can deal with nonlinear and complex movements