Image Congealing (batch/multiple) image (alignment/registration) Advanced Topics in Computer Vision (048921) Boris Kimelman.

Slides:



Advertisements
Similar presentations
Part 2: Unsupervised Learning
Advertisements

Joint Face Alignment The Recognition Pipeline
Overview Definition of Norms Low Rank Matrix Recovery Low Rank Approaches + Deformation Optimization Applications.
A Robust Super Resolution Method for Images of 3D Scenes Pablo L. Sala Department of Computer Science University of Toronto.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Multi-Label Prediction via Compressed Sensing By Daniel Hsu, Sham M. Kakade, John Langford, Tong Zhang (NIPS 2009) Presented by: Lingbo Li ECE, Duke University.
Structured Sparse Principal Component Analysis Reading Group Presenter: Peng Zhang Cognitive Radio Institute Friday, October 01, 2010 Authors: Rodolphe.
Introduction To Tracking
Proportion Priors for Image Sequence Segmentation Claudia Nieuwenhuis, etc. ICCV 2013 Oral.
Robust Object Tracking via Sparsity-based Collaborative Model
A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
1 Robust Video Stabilization Based on Particle Filter Tracking of Projected Camera Motion (IEEE 2009) Junlan Yang University of Illinois,Chicago.
A Study of Approaches for Object Recognition
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University.
An Iterative Optimization Approach for Unified Image Segmentation and Matting Hello everyone, my name is Jue Wang, I’m glad to be here to present our paper.
Visual Recognition Tutorial
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
AN ANALYSIS OF SINGLE- LAYER NETWORKS IN UNSUPERVISED FEATURE LEARNING [1] Yani Chen 10/14/
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Computer Science Department Detection, Alignment and Recognition of Real World Faces Erik Learned-Miller with Vidit Jain, Gary Huang, Andras Ferencz, et.
CS 376b Introduction to Computer Vision 04 / 02 / 2008 Instructor: Michael Eckmann.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Recognition and Matching based on local invariant features Cordelia Schmid INRIA, Grenoble David Lowe Univ. of British Columbia.
1 Mean shift and feature selection ECE 738 course project Zhaozheng Yin Spring 2005 Note: Figures and ideas are copyrighted by original authors.
Computer Science Department Distribution Fields A Unifying Representation for Low-Level Vision Problems Erik Learned-Miller with Laura Sevilla Lara, Manju.
A General Framework for Tracking Multiple People from a Moving Camera
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
BrainStorming 樊艳波 Outline Several papers on icml15 & cvpr15 PALM Information Theory Learning.
Particle Filters for Shape Correspondence Presenter: Jingting Zeng.
Combined Central and Subspace Clustering for Computer Vision Applications Le Lu 1 René Vidal 2 1 Computer Science Department, Johns Hopkins University,
Face Recognition: An Introduction
Introduction to Video Background Subtraction 1. Motivation In video action analysis, there are many popular applications like surveillance for security,
MACHINE LEARNING 8. Clustering. Motivation Based on E ALPAYDIN 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2  Classification problem:
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
Levels of Image Data Representation 4.2. Traditional Image Data Structures 4.3. Hierarchical Data Structures Chapter 4 – Data structures for.
A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER
Segmentation of Vehicles in Traffic Video Tun-Yu Chiang Wilson Lau.
ACADS-SVMConclusions Introduction CMU-MMAC Unsupervised and weakly-supervised discovery of events in video (and audio) Fernando De la Torre.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Probability and Statistics in Vision. Probability Objects not all the sameObjects not all the same – Many possible shapes for people, cars, … – Skin has.
Large-Scale Matrix Factorization with Missing Data under Additional Constraints Kaushik Mitra University of Maryland, College Park, MD Sameer Sheoreyy.
Implicit Active Shape Models for 3D Segmentation in MR Imaging M. Rousson 1, N. Paragio s 2, R. Deriche 1 1 Odyssée Lab., INRIA Sophia Antipolis, France.
Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.
Jianchao Yang, John Wright, Thomas Huang, Yi Ma CVPR 2008 Image Super-Resolution as Sparse Representation of Raw Image Patches.
1 Kernel Machines A relatively new learning methodology (1992) derived from statistical learning theory. Became famous when it gave accuracy comparable.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching Link: singhashwini.mesinghashwini.me.
Deeply learned face representations are sparse, selective, and robust
Traffic Sign Recognition Using Discriminative Local Features Andrzej Ruta, Yongmin Li, Xiaohui Liu School of Information Systems, Computing and Mathematics.
Article Review Todd Hricik.
René Vidal and Xiaodong Fan Center for Imaging Science
LOCUS: Learning Object Classes with Unsupervised Segmentation
Paper Presentation: Shape and Matching
Unsupervised Face Alignment by Robust Nonrigid Mapping
A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.
Supervised Classification
Probabilistic Models with Latent Variables
Unsupervised Learning of Models for Recognition
Announcements more panorama slots available now
Paper Reading Dalong Du April.08, 2011.
Announcements more panorama slots available now
Recognition and Matching based on local invariant features
Learning Incoherent Sparse and Low-Rank Patterns from Multiple Tasks
EM Algorithm and its Applications
“Traditional” image segmentation
Robust Feature Matching and Fast GMS Solution
Presentation transcript:

Image Congealing (batch/multiple) image (alignment/registration) Advanced Topics in Computer Vision (048921) Boris Kimelman

Introduction Dramatic increase in popularity of image and video sharing sites Hard to measure image similarity: – Illumination – Occlusion – Misalignment 2

Problem Definition 3

Applications Batch image alignment (congealing) Identification pre-processing Video stabilization Background segmentation Facial contour detection Inpainting 4

Congealing example Input imagesInput images realigned using the transformations computed by RASL 5

Unsupervised Joint Alignment of Complex Images Gary B Huang, Vidit Jain, Erik Learned-Miller ICCV

Basic assumptions Input images have similar structure and shape Thus, low variability of pixel values at specific location Distribution Field: empirical density function at each pixel 7 Pixel stack

Basic algorithm 8 Each stage increases image likelihood

Funneling: new image alignment Add to training set and re-run Instead, save sequence of distribution fields and increase likelihood of new image at each iteration New ImageAligned Image Image Funnel 9

Congealing Color Images 10

Congealing with SIFT descriptor (1) Cluster SIFT descriptors using k-means Congealing on hard assignments forces pixels to take relatively small number of values Use soft assignment of pixels to clusters (GMM EM) Analogy with grayscale using binary alphabet 11

Congealing with SIFT descriptor (2) Window around pixel SIFT vector and clusters Posterior distribution 12

Mathematical formulation 13

Labeled Faces in the Wild database images Size: 250X250X16MB 5749 people 1680 people with two or more images 14

Results on faces 15

Align for identification 16 Hyper feature based identifier

Results on cars 17

Evaluation LFW database contribution Novel: Information theory point of view Funneling process  Demo code available Results:  No measure of alignment accuracy  Comparison only against face alignment algorithm Writing level: convincing  illustrations would help 18

RASL: Robust Alignment by Sparse and Low- Rank Decomposition for Linearly Correlated Images Yigang Peng, Arvind Balasubramanian, John Wright, Ma Yi CVPR

How to measure image similarity? 20 Least Squares Learned-Miller Generalize: lower rank as much as possible

Basic Assumptions 21

Mathematical Formulation 22

Graphical Explanation Matrix of corrupted observations Underlying low-rank matrixSparse error matrix 23

Modeling Misalignment 24

Optimization Formulation (1) 25

Optimization Formulation (2) 26

Nuclear norm 27

Constraint Linearization 28

RASL Algorithm 29

Region of Attraction 30

Results on controlled data set 100 misaligned images Vedaldi CVPR 08 direct/gradient RASL 31

Results on LFW 32

Stabilizing video frames 33

Aligning planar surfaces despite occlusions 34

Evaluation Novel: Unifying framework for image congealing Rank minimization as image similarity Code available Results: Comprehensive algorithm assessment  Compare only against one algorithm Extensive site about rank minimization Writing level: Convincing  Advanced mathematics required (optimization) 35

Comparison of Papers Learned-MillerRASL Align similar imagesYes Align different images YesNo Trained to specific object No Robust to variationsYes Robust to occlusions Yes-Yes Occlusion removalNoYes Run timeLow?Low PerformanceHigh 36

Future issues Multi sensor congealing: complex relationship between corresponding pixels Learned Miller – occlusion removal by interspace alignment RASL – mix between image spaces – funneling 37

Thank You