Kernel-based tracking and video patch replacement Igor Guskov

Slides:

Advertisements

Similar presentations

CSCE643: Computer Vision Mean-Shift Object Tracking Jinxiang Chai Many slides from Yaron Ukrainitz & Bernard Sarel & Robert Collins.

Advertisements

The fundamental matrix F

Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.

Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.

Model base human pose tracking. Papers Real-Time Human Pose Tracking from Range Data Simultaneous Shape and Pose Adaption of Articulated Models using.

Forward-Backward Correlation for Template-Based Tracking Xiao Wang ECE Dept. Clemson University.

Robust Object Tracking via Sparsity-based Collaborative Model

AAM based Face Tracking with Temporal Matching and Face Segmentation Dalong Du.

Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.

Image alignment Image from

EE 7730 Image Segmentation.

A Study of Approaches for Object Recognition

Recognising Panoramas

Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.

Real-time Combined 2D+3D Active Appearance Models Jing Xiao, Simon Baker,Iain Matthew, and Takeo Kanade CVPR 2004 Presented by Pat Chan 23/11/2004.

Computing motion between images

Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2005 with a lot of slides stolen from Steve Seitz and.

Face Recognition from Face Motion Manifolds using Robust Kernel RAD Ognjen Arandjelović Roberto Cipolla Funded by Toshiba Corp. and Trinity College, Cambridge.

Augmented Reality: Object Tracking and Active Appearance Model

Presented by Pat Chan Pik Wah 28/04/2005 Qualifying Examination

CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.

Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.

Dorin Comaniciu Visvanathan Ramesh (Imaging & Visualization Dept., Siemens Corp. Res. Inc.) Peter Meer (Rutgers University) Real-Time Tracking of Non-Rigid.

CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.

Introduction --Classification Shape ContourRegion Structural Syntactic Graph Tree Model-driven Data-driven Perimeter Compactness Eccentricity.

CSC 589 Lecture 22 Image Alignment and least square methods Bei Xiao American University April 13.

Projective Texture Atlas for 3D Photography Jonas Sossai Júnior Luiz Velho IMPA.

Computer Vision James Hays, Brown

CSE554Laplacian DeformationSlide 1 CSE 554 Lecture 8: Laplacian Deformation Fall 2012.

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

Mean-shift and its application for object tracking

Automatic Registration of Color Images to 3D Geometry Computer Graphics International 2009 Yunzhen Li and Kok-Lim Low School of Computing National University.

1 Mean shift and feature selection ECE 738 course project Zhaozheng Yin Spring 2005 Note: Figures and ideas are copyrighted by original authors.

A General Framework for Tracking Multiple People from a Moving Camera

The Brightness Constraint

Digital Face Replacement in Photographs CSC2530F Project Presentation By: Shahzad Malik January 28, 2003.

ALIGNMENT OF 3D ARTICULATE SHAPES. Articulated registration Input: Two or more 3d point clouds (possibly with connectivity information) of an articulated.

Multiscale Moment-Based Painterly Rendering Diego Nehab and Luiz Velho

Imaging Geometry for the Pinhole Camera Outline: Motivation |The pinhole camera.

CSE 185 Introduction to Computer Vision Pattern Recognition 2.

CSCE 643 Computer Vision: Structure from Motion

CS 4487/6587 Algorithms for Image Analysis

Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #16.

Introduction --Classification Shape ContourRegion Structural Syntactic Graph Tree Model-driven Data-driven Perimeter Compactness Eccentricity.

AS-RIGID-AS-POSSIBLE SHAPE MANIPULATION

Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.

1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.

Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,

Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.

Temporally Coherent Completion of Dynamic Shapes AUTHORS:HAO LI,LINJIE LUO,DANIEL VLASIC PIETER PEERS,JOVAN POPOVIC,MARK PAULY,SZYMON RUSINKIEWICZ Presenter:Zoomin(Zhuming)

Real-Time Tracking with Mean Shift Presented by: Qiuhua Liu May 6, 2005.

AAM based Face Tracking with Temporal Matching and Face Segmentation Mingcai Zhou 1 、 Lin Liang 2 、 Jian Sun 2 、 Yangsheng Wang 1 1 Institute of Automation.

Features, Feature descriptors, Matching Jana Kosecka George Mason University.

CSE 185 Introduction to Computer Vision Feature Matching.

Local features: detection and description

776 Computer Vision Jan-Michael Frahm Spring 2012.

Person Following with a Mobile Robot Using Binocular Feature-Based Tracking Zhichao Chen and Stanley T. Birchfield Dept. of Electrical and Computer Engineering.

Recognizing specific objects Matching with SIFT Original suggestion Lowe, 1999,2004.

776 Computer Vision Jan-Michael Frahm Spring 2012.

Nearest-neighbor matching to feature database

A segmentation and tracking algorithm

Image Stitching Slides from Rick Szeliski, Steve Seitz, Derek Hoiem, Ira Kemelmacher, Ali Farhadi.

Nearest-neighbor matching to feature database

The Brightness Constraint

Combining Geometric- and View-Based Approaches for Articulated Pose Estimation David Demirdjian MIT Computer Science and Artificial Intelligence Laboratory.

Brief Review of Recognition + Context

Mesh Parameterization: Theory and Practice

CSE 185 Introduction to Computer Vision

Image Stitching Linda Shapiro ECE P 596.

Presentation transcript:

Kernel-based tracking and video patch replacement Igor Guskov

Overview Research areas –Geometry processing Compression of geometry Feature-based matching –Template matching in video

Projects Geometry processing –Semi-regular remeshing Parameterization Add structure to meshes Do wavelet compression –Dynamic mesh compression Soft-body animations Extract Do wavelet compression

Projects Matching –3D matching Automatic scan alignment Shape recognition –Tracking non-rigid geometry in video For geometry reconstruction –Real-time reconstruction For video editing and surveillance

Approximate Surface Alignment Approximate alignment Find approximate alignment automatically Registration: ICP Optimal alignment Joint work with Xinju Li

Video tracking Feature tracking –Classical approach: Lucas&Kanade tracker Based on mean-square error minimization We want to track larger patches

Tracking features Point features –Given point-to-point correspondences Can do reconstruction of 3D geometry, many other things Linear features –Track stick figures: limbs Reconstruct articulated characters –Recognize activity –Silhouettes Patch features –Active appearance models (AAMs) Geometry + texture + appearance –Face tracking –Video editing: monet from Imagineer Systems

Error-based tracking Mean-square error Image I(x) Template T(y) Warp map x=W[z](y) –For instance: W[p](y) = y+p –Small patch translated around W[( ,t)](y) =  y+t –Translation + uniform scaling W[h](y) = h(y) –Homography h min p || I(W[p](y)) – T(y) || 2 T(y) I(x)

Quad-marked surface tracking Collection of quads SCA 2003 –Real-time tracking and reconstruction –Four cameras

Mean-shift tracking Formulate tracking as mean-shift problem –Comaniciu, Ramesh, Meer CVPR 2000 Replace a pixel by the distribution of color values in a neighborhood –Histogram –Best match of a histogram Robust to noisy data Very fast algorithm

Histogram matching Bhattacharya coefficient  (p,q) –Given two distributions p(z) and q(z) –Related to bounds on the probability of classification error between these two distributions P(error) ≤  (p,q) –For matching, we want P(error)=1

Distance between distributions Metric space of histograms Not that important in the original paper Implement as a simple sum

Where is mean-shift? The way the histograms are computed –Weighted histograms Pixels at the blob center contribute more Setting the gradient of Bhattacharya coefficient to zero one gets –Each pixel contributes its opinion on how relevant it is to be the center of the blob

Mean-shift clustering Comaniciu, Meer PAMI 2002 –Kernel density estimation –Sum of bumps of width h

Extensions Previous work –Translation + scale [Collins 03] –Particle-tracking [Perez et al 02] –Multiple collaborating trackers [Hager et al 04] Template alignment –More general warps –Warp is the key Translation does not really warp Need to account for that properly

Templates I Multiple blobs tracked together –Each has its own histogram p k [t] Easy to do by considering squared sum of distances

Templates II: warp Where is that weighted histogram coming from? –Random variable X Displacement from the blob’s center –Histogram bin p a –With translation –General warp

Triangles Affine warps –Six parameters –Cannot account for perspective distortion Okay for weak perspective Multiple triangles needed –Relations among the collection of triangles Multiscale

A formula Histogram bin value All the pixels y in the image which fall into bin a Warp the pixel position back into canonical space and take its probability density Jacobian of the inverse warp

Simple illumination model Cannot rely on colors being constant –Illumination changes Outdoors: clouds etc. –Shadowing –Cameras set on automatic exposure Always collect relative colors –Average illumination locally L(x) –Histogram of I(W(X))-L(W(X)) This requires some texture to be present Roll-ball video

Optimization Bhattacharya coefficient Take the gradient w.r.t. z –Explicit formula Feed to the optimization library

Implementation YUV video –Histogram in two channels out of three –Y is luminance Higher resolution –UV is color Histograms 16x16 bins Templates have 120 blobs (16*15/2)

Results Videos –About one second per frame Extend to masked template

Video augmentation Previous work –Bartoli & Zisserman 2004 RBF estimation & grid –Pilet et al Keypoint features Real-time detection –Lin 2005 Near-regular textures

User input Masks for tracking and replacement Tracking of the templates –Warping of the replacement grid Poisson edit on the replacement region

Warping the grid Blend affine transformations –Warping of the replacement grid

Masks and grids

Replacement image Select replace Poisson edit

Motion blur Necessary for visual quality Smear the replacement region –Perform Poisson gradient fitting in a larger region

Results Videos

Conclusions Basic tracking procedure –Imperfect match –Non-rigid patches –Large areas Replacement in videos –Simple user input –Warping and Poisson edit

Better tracking of noisy videos Coming up… –Multiple primitives Multilevel within each primitive –Editing pipeline –Texture replacement