Accurate Binary Image Selection From Inaccurate User Input
Kartic Subr, Sylvain Paris, Cyril Soler, Jan Kautz
University College London, Adobe Research, INRIA-Grenoble
Selection is a common operation on images; for example, separating a foreground object from its background before editing it.
Related work: available tools
– simple brush and lasso
  – magnetic lasso [MB95]
  – edge-aware brush [CPD07, OH08]
  – for multi-touch screens [BWB06]
– user indication
  – bounding box [RKB04]
  – scribbles [BJ01, ADA*04, LSS09, LAA08]
Accurate marking can be tedious.
Precise input requires skill… and patience.
Related work: robust approaches
– soft selection
  – AppProp [AP08]
  – instant propagation [LJH10]
– interactive segmentation
  – dynamic, iterative graph cut [SUA12]
Summary of related work
– accurate methods: require precise input, unstable
– robust methods: not accurate, marking is not intuitive
Ours: intuitive, robust, fast
Input: a foreground scribble and a background scribble.
Output: a binary foreground/background selection.
Bird’s-eye view: formulate selection as an image-labeling problem
– labels: foreground, background, void
– input: a histogram of probabilities for each label at every pixel
– output: each pixel assigned a foreground/background label
Formulate as a labeling problem: each pixel in the grid carries a probability for each label (f, b, v); labeling assigns one label per pixel.
Histogram of probabilities computed from the input scribbles (foreground, background, void).
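As a concrete (and deliberately simple) illustration of this step, the sketch below builds per-pixel probabilities over {foreground, background, void} from colour histograms of the scribbled pixels; the estimator, the bin count, and the fixed void weight are illustrative assumptions rather than the paper's exact formulation.

```python
import numpy as np

def scribble_label_probabilities(image, fg_mask, bg_mask,
                                 bins=8, void_weight=0.1):
    """Per-pixel probabilities over (foreground, background, void).

    image   : (H, W, 3) float array in [0, 1]
    fg_mask : (H, W) bool array, True where the user scribbled foreground
    bg_mask : (H, W) bool array, True where the user scribbled background
    Returns an (H, W, 3) array whose last axis sums to 1.
    """
    H, W, _ = image.shape
    # Quantise each pixel's colour into a joint RGB bin index.
    q = np.clip((image * bins).astype(int), 0, bins - 1)
    idx = (q[..., 0] * bins + q[..., 1]) * bins + q[..., 2]

    n_bins = bins ** 3
    # Colour histograms of the scribbled regions (Laplace-smoothed).
    fg_hist = np.bincount(idx[fg_mask], minlength=n_bins) + 1.0
    bg_hist = np.bincount(idx[bg_mask], minlength=n_bins) + 1.0
    fg_hist /= fg_hist.sum()
    bg_hist /= bg_hist.sum()

    # Likelihood of each pixel's colour under the two scribble models.
    p_fg = fg_hist[idx]
    p_bg = bg_hist[idx]

    probs = np.stack([p_fg, p_bg], axis=-1)
    probs /= probs.sum(axis=-1, keepdims=True)
    probs *= (1.0 - void_weight)              # leave mass for the void label
    void = np.full((H, W, 1), void_weight)
    return np.concatenate([probs, void], axis=-1)
```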
Labeling: MAP inference in a dense CRF [KK2011]
– approximate and fast: 0.2 s for 50K variables
– reduces to bilateral filtering
– low memory requirement: does not store the full distance matrix between pixels
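For intuition, here is a brute-force sketch of one mean-field update in a fully-connected CRF with a single Gaussian kernel and Potts compatibility; it makes explicit that the expensive step is a Gaussian filtering of the label distribution Q, which [KK2011] replace with fast bilateral filtering. The variable names and the single-kernel simplification are my own, not the paper's implementation.

```python
import numpy as np

def mean_field_step(unary, features, Q, sigma=1.0, potts_weight=1.0):
    """One mean-field update for a fully-connected CRF (brute force, O(N^2)).

    unary    : (N, L) negative log-probabilities per pixel and label
    features : (N, D) per-pixel feature vectors (e.g. position + colour)
    Q        : (N, L) current label distribution (rows sum to 1)
    Returns the updated (N, L) distribution.
    """
    # Dense Gaussian kernel over the feature space, self-connections removed.
    d2 = ((features[:, None, :] - features[None, :, :]) ** 2).sum(-1)
    K = np.exp(-0.5 * d2 / sigma ** 2)
    np.fill_diagonal(K, 0.0)

    # Message passing: Gaussian filtering of Q (the step that [KK2011]
    # replace with a fast bilateral filter).
    msg = K @ Q

    # Potts compatibility: a label is penalised by the mass of *other* labels.
    pairwise = potts_weight * (msg.sum(axis=1, keepdims=True) - msg)

    logits = -unary - pairwise
    logits -= logits.max(axis=1, keepdims=True)      # numerical stability
    Q_new = np.exp(logits)
    return Q_new / Q_new.sum(axis=1, keepdims=True)
```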
Fast inference assumes a Gaussian kernel [KK2011]
– pairwise potential (between pixels) is a linear sum of Gaussians
– the Gaussians are defined over a Euclidean feature space
We generalize to arbitrary kernels!
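For reference, the kind of kernel this assumes is the appearance + smoothness pair of Gaussians from [KK2011]; a small sketch with illustrative default weights and bandwidths:

```python
import numpy as np

def kk_pairwise_kernel(pos_i, pos_j, rgb_i, rgb_j,
                       w_app=1.0, w_smooth=1.0,
                       theta_alpha=60.0, theta_beta=20.0, theta_gamma=3.0):
    """Sum-of-Gaussians pairwise kernel between two pixels, as in [KK2011].

    pos_* : 2-vectors of pixel coordinates
    rgb_* : 3-vectors of pixel colours
    The weights and theta_* bandwidths are illustrative defaults.
    """
    d_pos = np.sum((pos_i - pos_j) ** 2)
    d_rgb = np.sum((rgb_i - rgb_j) ** 2)
    appearance = w_app * np.exp(-d_pos / (2 * theta_alpha ** 2)
                                - d_rgb / (2 * theta_beta ** 2))
    smoothness = w_smooth * np.exp(-d_pos / (2 * theta_gamma ** 2))
    return appearance + smoothness
```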
Generalizing MAP inference to arbitrary kernels
Deep in the details (see paper) lurks a Gaussian kernel over a Euclidean feature space:
K(f_i, f_j) = exp(−½ (f_i − f_j)ᵀ Σ⁻¹ (f_i − f_j)), where f_i, f_j are feature vectors in Euclidean space.
What if the input is an arbitrary dissimilarity measure D(p_i, p_j) between pixels?
Need a Euclidean embedding!
Map each pixel to a point so that Euclidean distances between points reproduce D(p_i, p_j); the Gaussian kernel K then applies to the embedded points.
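In other words, once an embedding q is available, the generalized kernel is simply a Gaussian evaluated on the embedded coordinates; a minimal sketch (an isotropic bandwidth sigma stands in for the full covariance Σ):

```python
import numpy as np

def generalized_kernel(q_i, q_j, sigma=1.0):
    """Gaussian kernel on embedded coordinates q_i, q_j (1-D arrays)."""
    return np.exp(-0.5 * np.sum((q_i - q_j) ** 2) / sigma ** 2)
```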
Contributions
– generalized kernel for approximate mean-field inference in a fully-connected CRF
– application: interactive binary image selection, robust to inaccurate input
Overview
input (1. image, 2. scribbles, 3. a dissimilarity measure D(p_i, p_j) between pixels) → Euclidean embedding (embedded pixels) → dense CRF + inference [KK11] → output
Approximately-Euclidean pixel embedding
Given the dissimilarity matrix with entries D(p_i, p_j), find for each pixel p_i a point q_i such that D(p_i, p_j) ≈ ‖q_i − q_j‖₂ holds for all pixel pairs.
Landmark multidimensional scaling (LMDS) [dST02]
– the full distance matrix would be huge: ~10¹² elements for a 1 MPix image
– stochastic sampling approach (Nyström approximation)
– complexity: time O(N(c + p²) + c³), space O(Nc)
  (N: # pixels, c: # stochastic samples, p: dimensionality of the embedding)
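A compact sketch of landmark MDS in this spirit: classical MDS on c randomly chosen landmark pixels, then distance-based triangulation (Nyström-style) of all remaining pixels. The callable dissim(i, js) interface and the parameter defaults are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def landmark_mds(n_pixels, dissim, n_landmarks=300, dim=6, rng=None):
    """Embed pixels into R^dim so Euclidean distances approximate `dissim`.

    n_pixels    : total number of pixels N
    dissim      : dissim(i, js) -> dissimilarities D(p_i, p_j) for j in js
    n_landmarks : number of stochastic samples c
    Returns an (N, dim) array of embedded coordinates.
    """
    rng = np.random.default_rng(rng)
    landmarks = rng.choice(n_pixels, size=n_landmarks, replace=False)

    # Squared dissimilarities among the landmarks (c x c).
    D2_ll = np.array([dissim(i, landmarks) for i in landmarks]) ** 2

    # Classical MDS on the landmarks: double centring + eigendecomposition.
    J = np.eye(n_landmarks) - np.full((n_landmarks, n_landmarks), 1.0 / n_landmarks)
    B = -0.5 * J @ D2_ll @ J
    evals, evecs = np.linalg.eigh(B)
    order = np.argsort(evals)[::-1][:dim]
    lam = np.maximum(evals[order], 1e-12)   # guard against tiny/negative modes
    V = evecs[:, order]                     # landmark coords would be V * sqrt(lam)

    # Distance-based triangulation (Nystrom-style extension) of every pixel.
    pinv = V / np.sqrt(lam)
    mean_d2 = D2_ll.mean(axis=0)            # mean squared distance to each landmark
    X = np.empty((n_pixels, dim))
    for i in range(n_pixels):
        d2_i = dissim(i, landmarks) ** 2
        X[i] = -0.5 * (d2_i - mean_d2) @ pinv
    return X
```

The embedded coordinates can then be fed directly to the Gaussian kernel and the fast inference of [KK11].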
Overview (recap): input → Euclidean embedding → dense CRF + inference [KK11] → output
Thank you
You can’t be serious! What about results?
– importance of the embedding
– role of the fully-connected CRF (FC-CRF)
– validation
– comparison with related work
– examples
The embedding allows use of arbitrary dissimilarities
[comparison: input | Euclidean distance in RGB [KK11] | chi-squared distance on local histograms + FC-CRF]
The embedding alone is not sufficiently accurate
[comparison: input | chi-squared distance (local histograms) + nearest-neighbour labeling | Euclidean distance in RGB [KK11] | chi-squared distance on local histograms + FC-CRF]
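For completeness, a sketch of the chi-squared dissimilarity on local histograms used in these comparisons; the grey-level quantisation, patch radius, and bin count are illustrative choices, not the paper's exact settings.

```python
import numpy as np

def local_histogram(gray, x, y, radius=7, bins=16):
    """Normalised grey-level histogram of the patch centred at (x, y)."""
    patch = gray[max(0, y - radius): y + radius + 1,
                 max(0, x - radius): x + radius + 1]
    hist, _ = np.histogram(patch, bins=bins, range=(0.0, 1.0))
    return hist / max(hist.sum(), 1)

def chi_squared_dissimilarity(gray, p_i, p_j, radius=7, bins=16, eps=1e-10):
    """Chi-squared distance between the local histograms of two pixels.

    gray     : (H, W) float image in [0, 1]
    p_i, p_j : (x, y) pixel coordinates
    """
    h_i = local_histogram(gray, *p_i, radius, bins)
    h_j = local_histogram(gray, *p_j, radius, bins)
    return 0.5 * np.sum((h_i - h_j) ** 2 / (h_i + h_j + eps))
```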
Validation: accurate output even for high input errors [plot: output precision against input precision]
Color-dominant vs. texture-dominant selection [examples of each]
Qualitative comparison: ours vs. [LJH10], [CLT12], [FFL10]
Quantitative comparison: ours vs. prior methods [plots]
Summary: our selection is robust; it relies only on a relative indication of foreground versus background.
Conclusion
– most selection algorithms require precise input; ours is relatively robust
– generalising dissimilarities is powerful in the context of pairwise potentials for FC-CRFs
– two distance metrics stood out: RGB distance (colour-dominant images) and chi-squared distance on local histograms (texture-dominant images)
Thank you
Qualitative comparison with human scribbles: ours vs. [LJH10], [CLT12], [FFL10]