ICCV 2007 Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1,

Slides:



Advertisements
Similar presentations
Example 1 Generating Random Photorealistic Objects Umar Mohammed and Simon Prince Department of Computer Science, University.
Advertisements

POSE–CUT Simultaneous Segmentation and 3D Pose Estimation of Humans using Dynamic Graph Cuts Mathieu Bray Pushmeet Kohli Philip H.S. Torr Department of.
Bayesian Belief Propagation
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Optimizing and Learning for Super-resolution
Automatic Image Annotation Using Group Sparsity
Lena Gorelick Joint work with Frank Schmidt and Yuri Boykov Rochester Institute of Technology, Center of Imaging Science January 2013 TexPoint fonts used.
Patch-based Image Deconvolution via Joint Modeling of Sparse Priors Chao Jia and Brian L. Evans The University of Texas at Austin 12 Sep
Patch to the Future: Unsupervised Visual Prediction
Reducing Drift in Parametric Motion Tracking
Object Recognition & Model Based Tracking © Danica Kragic Tracking system.
Su-A Kim 3 rd June 2014 Danhang Tang, Tsz-Ho Yu, Tae-kyun Kim Imperial College London, UK Real-time Articulated Hand Pose Estimation using Semi-supervised.
GrabCut Interactive Image (and Stereo) Segmentation Carsten Rother Vladimir Kolmogorov Andrew Blake Antonio Criminisi Geoffrey Cross [based on Siggraph.
Forward-Backward Correlation for Template-Based Tracking Xiao Wang ECE Dept. Clemson University.
Robust Object Tracking via Sparsity-based Collaborative Model
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
Learning With Dynamic Group Sparsity Junzhou Huang Xiaolei Huang Dimitris Metaxas Rutgers University Lehigh University Rutgers University.
Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1, Lehigh University.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
1 Robust Video Stabilization Based on Particle Filter Tracking of Projected Camera Motion (IEEE 2009) Junlan Yang University of Illinois,Chicago.
Robust Higher Order Potentials For Enforcing Label Consistency
1 Distributed localization of networked cameras Stanislav Funiak Carlos Guestrin Carnegie Mellon University Mark Paskin Stanford University Rahul Sukthankar.
Speaker Adaptation for Vowel Classification
ON THE IMPROVEMENT OF IMAGE REGISTRATION FOR HIGH ACCURACY SUPER-RESOLUTION Michalis Vrigkas, Christophoros Nikou, Lisimachos P. Kondi University of Ioannina.
Interactive Matting Christoph Rhemann Supervised by: Margrit Gelautz and Carsten Rother.
University of MarylandComputer Vision Lab 1 A Perturbation Method for Evaluating Background Subtraction Algorithms Thanarat Horprasert, Kyungnam Kim, David.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Xinqiao LiuRate constrained conditional replenishment1 Rate-Constrained Conditional Replenishment with Adaptive Change Detection Xinqiao Liu December 8,
Bootstrapping a Heteroscedastic Regression Model with Application to 3D Rigid Motion Evaluation Bogdan Matei Peter Meer Electrical and Computer Engineering.
Noise Estimation from a Single Image Ce Liu William T. FreemanRichard Szeliski Sing Bing Kang.
EE392J Final Project, March 20, Multiple Camera Object Tracking Helmy Eltoukhy and Khaled Salama.
Webcam-synopsis: Peeking Around the World Young Ki Baik (CV Lab.) (Fri)
Generating panorama using translational movement model.
Feature and object tracking algorithms for video tracking Student: Oren Shevach Instructor: Arie nakhmani.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
A General Framework for Tracking Multiple People from a Moving Camera
Multi-Output Learning for Camera Relocalization Abner Guzmán-Rivera UIUC Pushmeet Kohli Ben Glocker Jamie Shotton Toby Sharp Andrew Fitzgibbon Shahram.
What Does Motion Reveal About Transparency ? Moshe Ben-Ezra and Shree K. Nayar Columbia University ICCV Conference October 2003, Nice, France This work.
Yu-Wing Tai, Hao Du, Michael S. Brown, Stephen Lin CVPR’08 (Longer Version in Revision at IEEE Trans PAMI) Google Search: Video Deblurring Spatially Varying.
#MOTION ESTIMATION AND OCCLUSION DETECTION #BLURRED VIDEO WITH LAYERS
Supervised Learning of Edges and Object Boundaries Piotr Dollár Zhuowen Tu Serge Belongie.
Markov Random Fields Probabilistic Models for Images
Learning the Appearance and Motion of People in Video Hedvig Sidenbladh, KTH Michael Black, Brown University.
Fields of Experts: A Framework for Learning Image Priors (Mon) Young Ki Baik, Computer Vision Lab.
Figure ground segregation in video via averaging and color distribution Introduction to Computational and Biological Vision 2013 Dror Zenati.
Hilbert Space Embeddings of Conditional Distributions -- With Applications to Dynamical Systems Le Song Carnegie Mellon University Joint work with Jonathan.
Lecture 9 Feature Extraction and Motion Estimation Slides by: Michael Black Clark F. Olson Jean Ponce.
Multi-view Synchronization of Human Actions and Dynamic Scenes Emilie Dexter, Patrick Pérez, Ivan Laptev INRIA Rennes - Bretagne Atlantique
Edge Preserving Spatially Varying Mixtures for Image Segmentation Giorgos Sfikas, Christophoros Nikou, Nikolaos Galatsanos (CVPR 2008) Presented by Lihan.
Learning Image Statistics for Bayesian Tracking Hedvig Sidenbladh KTH, Sweden Michael Black Brown University, RI, USA
A. M. R. R. Bandara & L. Ranathunga
Motion Segmentation with Missing Data using PowerFactorization & GPCA
Tracking Objects with Dynamics
Computer Vision, Robotics, Machine Learning and Control Lab
René Vidal and Xiaodong Fan Center for Imaging Science
Deconvolution , , Computational Photography
Compositional Human Pose Regression
A Unified Algebraic Approach to 2D and 3D Motion Segmentation
Segmentation of Dynamic Scenes from Image Intensities
Optical Flow Estimation and Segmentation of Moving Dynamic Textures
Modeling and Segmentation of Dynamic Textures
Learning With Dynamic Group Sparsity
Real-Time Human Pose Recognition in Parts from Single Depth Image
Dynamical Statistical Shape Priors for Level Set Based Tracking
Combining Sketch and Tone for Pencil Drawing Production
Rob Fergus Computer Vision
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Anomaly Detection in Crowded Scenes
Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu
Robust Feature Matching and Fast GMS Solution
Presentation transcript:

ICCV 2007 Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1, Lehigh University 2

ICCV 2007 Outline Background Goals & Problems Related Works Proposed Method Experiment Results Discussion & Conclusion

ICCV 2007 Background Dynamic textures (DT) –static camera, exhibits a certain stationary Moving Dynamic textures (MDT) –dynamic textures captured by a moving camera DT, [Kwatra et al. SIGGRAPH’03]MDT, [Fitzgibbon ICCV’01]

ICCV 2007 Background Video registration –Required by many video analysis applications Traditional assumption –Static, rigid, brightness constancy –Bergen et al. ECCV’92, Black et al. ICCV’93 Relaxing rigid assumption –Dynamic textures –Doretto et al. IJCV’03, Yuan at al. ECCV’04, Chan et al. NIPS’05, Lin et al. PAMI’07, Rav-Acha at al. Workshop at ICCV’05

ICCV 2007 Our Goals Registration of MDT –Recover the camera motion and register the image sequences including moving dynamic textures Left TranslationRight Translation

ICCV 2007 Complex Optimization Problems Complex optimization –Camera motion, dynamic texture model –Chicken-and-Egg Problems Challenges –About the mean images –About LDS model –About the camera motion?

ICCV 2007 Related Works Fitzgibbon, ICCV’01 –Pioneering attempt –Stochastic rigidity –Non-linear optimization Vidal et al. CVPR’05 –Time varying LDS model –Static assumption in small time window –Simple and general framework but under estimation

ICCV 2007 Formulation Registration of MDT –I(t), the video frame – camera motion parameters –y 0, the desired average image of the video –y(t), related with appearance of DT –x(t), related with dynamics of DT

ICCV 2007 Generative Model Generative image model for a MDT

ICCV 2007 First Observation Good registration –a good registration according to the accurate camera motion should simplify the dynamic texture model while preserving all useful information –Used by Fitzgibbon, ICCV’01, Minimizing the entropy function of an auto regressive process –Used by Vidal, CVPR’05, optimizing time varying LDS model by optimizing piecewise LDS model

ICCV 2007 Second Observation Good registration –A good registration according to the accurate camera motion should lead to a sharp average image whose statistics of derivative filters are similar to those of the input image frames. Image statistics –Student-t distribution / heavy tailed image priors – Huang et al. CVPR’99, Roth et al. CVPR’05

ICCV 2007 Prior Models The Average image priors The motion priors The dynamic priors

ICCV 2007 Average Image Priors Student-t distribution –Model parameters / contrastive divergence method (a) Before registration, (b) in the middle of registration (c) after registration

ICCV 2007 Motion / Dynamic Priors Gaussian Perturbation (Motion) –U ncertainty in the motion modeled by a Gaussian perturbation about the mean estimation M 0 / the covariance matrix S ( a diagonal matrix.) –Motivated by the work [Pickup et al. NIPS’06] GPDM / MAR model (Dynamic) –M arginalizing over all possible mappings between appearance and dynamics –Motivated by the work [Wang et al. NIPS’05] [Moon et al. CVPR’06]

ICCV 2007 Joint Optimization Generative image model Optimization –Final marginal likelihood –Scaled conjugate gradients algorithm (SCG)

ICCV 2007 Procedures Obtaining image derivative prior model Dividing the long sequence into many short image sequences Initialization for video registration Performing model optimization with the proposed prior models until model convergence. With estimated y0, Y and X, the camera motion is then obtained

ICCV 2007 Obtaining Data Three DT video sequences –DT data, [Kwatra et al. SIGGRAPH’03] Synthesized MDT video sequence –60 frames each, no motion from 1 st to 20 th frame and from 41 st to 60 th –Camera motions with speed [1, 0] from 21 st to 40 th

ICCV 2007 Grass MDT Video The average image (a) One frame, (b) the average image after registration, (c) before registration

ICCV 2007 Grass MDT Video The statistics of derivative filter responses

ICCV 2007 Evaluation / Comparison False Estimation Fraction Comparison with two classical methods –Hybrid method, [Bergen et al. ECCV’92] [Black et al. ICCV’93] –Vidal’method, [Vidal et al. CVPR’05]

ICCV 2007 Waterfall MDT Video Motion estimation (a) Ground truth, (b) by hybrid method, (c) by Vidal’s, (d) proposed

ICCV 2007 Waterfall MDT Video The average Image and its statistics The average image and related distribution after registration by (a) proposed method, (b) Vidal’s method, (c) hybrid method

ICCV 2007 FEF Comparisons On three synthesized MDT video

ICCV 2007 Real MDT Video Moving flower bed video Ours –554 frames totally –Ground truth 110 pixels –Estimation pixels ( FEF 4.98%) Vidal’s –250 frames –Ground truth 85 pixels –Estimation 60 pixels ( FEF 29.41%)

ICCV 2007 Conclusions What proposing: –Powerful priors for MDT registration What getting out: –Camera motions –Average image –Dynamic texture model What learning? –Registration simplify DT model while preserving useful information –Better registration lead to sharper average image

ICCV 2007 Thanks !

ICCV 2007 Thanks !

ICCV 2007 Future Works More complex camera motions Different Metric functions for evaluation Multiple dynamic texture segmentation