Learning Jigsaws for clustering appearance and shape. John Winn, Anitha Kannan and Carsten Rother. NIPS 2006.

Learning jigsaws. Aim: cluster regions in images with similar appearance and shape. Examples of clusters (jigsaw pieces): eyes, noses, cheeks, eyebrows.

Road map
• Clustering image patches
• The Jigsaw model
• Results on toy and real images
• Learning jigsaw pieces
• Discussion and conclusions

Clustering image patches [Figure: patches grouped into clusters; Leibe & Schiele, BMVC 2003]

Clustering image patches: should a patch join a cluster when
• the patch has the wrong shape, or
• the part is occluded?
We need to adapt the patch shape depending on the image.

Aims of jigsaw model
Learn clusters (jigsaw pieces) so that:
1. Clustered patches have similar shape and appearance.
2. Patches are as large as possible.
3. Every image pixel belongs to exactly one patch (i.e. the images are segmented into patches).

The Jigsaw model [Figure: jigsaw J; images I_1, I_2, ..., I_N; offset maps L_1, L_2, ..., L_N; one region of constant offset is marked]
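
A region of constant offset in the figure is what the aims call a patch. As a minimal sketch of reading an offset map as a segmentation, the following labels connected regions that share one offset. The storage format (a 2D list of `(dy, dx)` tuples), the use of 4-connectivity, and the function name `constant_offset_regions` are assumptions made for illustration and do not come from the slides.

```python
from collections import deque


def constant_offset_regions(offsets):
    """Label 4-connected regions of equal offset.

    offsets: 2D list (rows x cols) of (dy, dx) tuples, one per image pixel.
    Returns (labels, n_regions); labels has the same shape as offsets.
    """
    rows, cols = len(offsets), len(offsets[0])
    labels = [[-1] * cols for _ in range(rows)]
    n_regions = 0
    for r in range(rows):
        for c in range(cols):
            if labels[r][c] != -1:
                continue  # pixel already belongs to a region
            # Breadth-first flood fill from an unlabelled pixel.
            labels[r][c] = n_regions
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and labels[ny][nx] == -1
                            and offsets[ny][nx] == offsets[y][x]):
                        labels[ny][nx] = n_regions
                        queue.append((ny, nx))
            n_regions += 1
    return labels, n_regions
```

Because every pixel receives exactly one label, the result is a segmentation of the image into patches, which matches the third aim above.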

The Jigsaw model
• Jigsaw J: mean μ(z) and inverse variance λ(z) for each jigsaw pixel z.
• Appearance model: links image I to jigsaw J through the offset map L, using the offset at pixel i.
• Offset map prior (Potts model): charges a cost for each patch boundary.
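
The equations on this slide are not part of the transcript, so the following is only a sketch of a log joint probability that is consistent with the labels above: a Gaussian appearance term with mean μ(z) and inverse variance λ(z), plus a Potts penalty for every pair of neighbouring pixels whose offsets differ. The grayscale image, the wrap-around mapping from image pixel to jigsaw pixel, the 4-connected neighbourhood, and the names `log_joint` and `gamma` are assumptions.

```python
import math


def log_joint(image, offsets, mean, precision, gamma):
    """Unnormalised log p(I, L | J) for one grayscale image.

    image:     2D list of pixel intensities.
    offsets:   2D list of (dy, dx) tuples, same shape as image.
    mean:      2D list, jigsaw mean mu(z).
    precision: 2D list, jigsaw inverse variance lambda(z).
    gamma:     assumed cost per patch boundary (Potts penalty).
    """
    rows, cols = len(image), len(image[0])
    jig_rows, jig_cols = len(mean), len(mean[0])
    total = 0.0
    for y in range(rows):
        for x in range(cols):
            dy, dx = offsets[y][x]
            # Assumed mapping: subtract the offset and wrap around the jigsaw.
            zy, zx = (y - dy) % jig_rows, (x - dx) % jig_cols
            lam = precision[zy][zx]
            diff = image[y][x] - mean[zy][zx]
            # Gaussian log-density with precision lam.
            total += 0.5 * math.log(lam / (2 * math.pi)) - 0.5 * lam * diff * diff
            # Potts term: count each right and down neighbour pair once.
            if x + 1 < cols and offsets[y][x + 1] != offsets[y][x]:
                total -= gamma
            if y + 1 < rows and offsets[y + 1][x] != offsets[y][x]:
                total -= gamma
    return total
```

A larger `gamma` makes boundaries more expensive, which favours fewer and larger patches, in line with the aim that patches be as large as possible.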

Toy example. Learned by iteratively maximising the joint probability w.r.t. the jigsaw and the offset maps (see paper for details). [Figure panels: image with segmentation; jigsaw mean; jigsaw variance]
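
To make one half of that alternation concrete, here is a minimal sketch of re-estimating the jigsaw mean while the offset maps are held fixed. It uses a plain per-pixel average, which is an assumption: the slides defer the actual update to the paper, and the offset-map step is not shown. The function name `update_jigsaw_mean`, the `default` value for unused jigsaw pixels, and the wrap-around mapping are also illustrative.

```python
def update_jigsaw_mean(images, offset_maps, jig_rows, jig_cols, default=0.0):
    """Average all image pixels that the offset maps send to each jigsaw pixel.

    images:      list of 2D lists of grayscale intensities.
    offset_maps: list of 2D lists of (dy, dx) tuples, one map per image.
    Jigsaw pixels that no image pixel maps to receive `default`.
    """
    sums = [[0.0] * jig_cols for _ in range(jig_rows)]
    counts = [[0] * jig_cols for _ in range(jig_rows)]
    for image, offsets in zip(images, offset_maps):
        for y, row in enumerate(image):
            for x, value in enumerate(row):
                dy, dx = offsets[y][x]
                # Assumed mapping from image pixel to jigsaw pixel.
                zy, zx = (y - dy) % jig_rows, (x - dx) % jig_cols
                sums[zy][zx] += value
                counts[zy][zx] += 1
    return [
        [sums[r][c] / counts[r][c] if counts[r][c] else default
         for c in range(jig_cols)]
        for r in range(jig_rows)
    ]
```

Alternating an update like this with a step that improves the offset maps for the current jigsaw gives the kind of iterative maximisation the slide describes.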

Comparison: Mixture of Gaussians
• fixed patch shape
[Figure: cluster centres]
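
For contrast with the jigsaw, a fixed-shape baseline can be sketched as follows: cut the image into square patches of one size and cluster them. The code uses hard-assignment k-means as a simplified stand-in for the mixture of Gaussians named on the slide. The non-overlapping grid and the function names are assumptions.

```python
def extract_patches(image, size):
    """Cut a 2D list into non-overlapping size x size patches (flattened)."""
    rows, cols = len(image), len(image[0])
    patches = []
    for y in range(0, rows - size + 1, size):
        for x in range(0, cols - size + 1, size):
            patches.append([image[y + dy][x + dx]
                            for dy in range(size) for dx in range(size)])
    return patches


def squared_distance(a, b):
    return sum((p - q) ** 2 for p, q in zip(a, b))


def kmeans_patches(patches, initial_centres, iterations=10):
    """Hard-assignment k-means; returns (centres, assignment)."""
    centres = [list(c) for c in initial_centres]
    assignment = []
    for _ in range(iterations):
        # Assign each patch to its nearest centre.
        assignment = [
            min(range(len(centres)),
                key=lambda k: squared_distance(p, centres[k]))
            for p in patches
        ]
        # Move each centre to the mean of its assigned patches.
        for k in range(len(centres)):
            members = [p for p, a in zip(patches, assignment) if a == k]
            if members:
                centres[k] = [sum(v) / len(members) for v in zip(*members)]
    return centres, assignment
```

Every patch here has the same square shape regardless of image content, which is the limitation the jigsaw model is meant to remove.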

Comparison: Epitome [Jojic et al., ICCV 2003]
• fixed patch shape
• translation invariant
[Figure: epitome]

Comparison: Jigsaw
• learned patch shape
• translation invariant
• non-overlapping patches
[Figure: jigsaw]

Comparison: all methods [Figure panels: Original; Jigsaw; Epitome; MoG. Errors: Error = ..., Error = ..., MoG Error = 0.103]

Faces example. Source: Olivetti face database. [Figure panels: face images with segmentations; 128 × 128 jigsaw mean]

Learning the jigsaw pieces [Figure: jigsaw J; images I_1, I_2, ..., I_N; offset maps L_1, L_2, ..., L_N]

Shape clustering on faces [Figure panels: jigsaw showing pieces; commonly used pieces]

Jigsaw applications
• Can be used as a ‘plug-and-play’ replacement for the fixed-shape patch model in existing systems.
• Applications include:
  • Object recognition/detection
  • Object segmentation
  • Stereo matching
  • Texture synthesis
  • Super-resolution
  • Motion segmentation
  • Image/video compression

Future work
• Allow rotation/scaling/deformation of the patches.
• Incorporate shape clustering into the probabilistic model.
• Incorporate additional invariances, e.g. to illumination.
• Apply to other domains: audio, biology.

Conclusions
• The jigsaw model allows learning the shape and appearance of recurring regions in images.
• The jigsaw performs unsupervised discovery of object parts.

Thank you Jigsaw paper (compressed)

Comparison: Epitome [Jojic et al., ICCV 2003]
• fixed patch shape
• translation invariant
• overlapping patches
[Figure: epitome]

Patch averaging. Error = 0.071; Error = ... [Figure panels: Epitome; MoG]
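
The slides do not spell out the averaging step. As a hedged sketch, assuming "patch averaging" means that each reconstructed pixel is the average of the predictions of all overlapping patches that cover it, the computation could look like this. The function name and the `(top, left, patch)` input format are hypothetical.

```python
def average_overlapping_patches(rows, cols, placed_patches):
    """Reconstruct an image by averaging overlapping patch predictions.

    placed_patches: list of (top, left, patch), where patch is a 2D list
    placed with its top-left corner at (top, left).
    Pixels that no patch covers are returned as None.
    """
    sums = [[0.0] * cols for _ in range(rows)]
    counts = [[0] * cols for _ in range(rows)]
    for top, left, patch in placed_patches:
        for dy, patch_row in enumerate(patch):
            for dx, value in enumerate(patch_row):
                sums[top + dy][left + dx] += value
                counts[top + dy][left + dx] += 1
    return [
        [sums[y][x] / counts[y][x] if counts[y][x] else None
         for x in range(cols)]
        for y in range(rows)
    ]
```

Where two patches disagree, the average blends their values. The jigsaw's non-overlapping patches avoid this step, since each pixel is explained by exactly one patch.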