1 Segmentation with Scene and Sub-Scene Categories Joseph Djugash Input Image Scene/Sub-Scene Classification Segmentation.

Slides:



Advertisements
Similar presentations
Top-Down & Bottom-Up Segmentation
Advertisements

Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Learning to Combine Bottom-Up and Top-Down Segmentation Anat Levin and Yair Weiss School of CS&Eng, The Hebrew University of Jerusalem, Israel.
Clustering with k-means and mixture of Gaussian densities Jakob Verbeek December 3, 2010 Course website:
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
Biased Normalized Cuts 1 Subhransu Maji and Jithndra Malik University of California, Berkeley IEEE Conference on Computer Vision and Pattern Recognition.
Grouping and Segmentation Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/01/10.
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Fitting: The Hough transform
Effective Image Database Search via Dimensionality Reduction Anders Bjorholm Dahl and Henrik Aanæs IEEE Computer Society Conference on Computer Vision.
Recognition: A machine learning approach
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Image segmentation. The goals of segmentation Group together similar-looking pixels for efficiency of further processing “Bottom-up” process Unsupervised.
Algorithms & Applications in Computer Vision Lihi Zelnik-Manor Lecture 11: Structure from Motion.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Segmentation. Terminology Segmentation, grouping, perceptual organization: gathering features that belong together Fitting: associating a model with observed.
Fast and Compact Retrieval Methods in Computer Vision Part II A. Torralba, R. Fergus and Y. Weiss. Small Codes and Large Image Databases for Recognition.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Lecture 28: Bag-of-words models
Learning from Observations Chapter 18 Section 1 – 4.
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
Normalized Cuts Demo Original Implementation from: Jianbo Shi Jitendra Malik Presented by: Joseph Djugash.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Bag-of-features models
WORD-PREDICTION AS A TOOL TO EVALUATE LOW-LEVEL VISION PROCESSES Prasad Gabbur, Kobus Barnard University of Arizona.
Perceptual Organization: Segmentation and Optical Flow.
Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
1 Outline Overview Integrating Vision Models CCM: Cascaded Classification Models Learning Spatial Context TAS: Things and Stuff Descriptive Querying of.
Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.
Graph-based Segmentation
Image Segmentation Image segmentation is the operation of partitioning an image into a collection of connected sets of pixels. 1. into regions, which usually.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Segmentation and Grouping Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/23/10.
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.
Computer Vision James Hays, Brown
CSE 185 Introduction to Computer Vision Pattern Recognition.
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
Recognition using Regions (Demo) Sudheendra V. Outline Generating multiple segmentations –Normalized cuts [Ren & Malik (2003)] Uniform regions –Watershed.
1 Contours and Junctions in Natural Images Jitendra Malik University of California at Berkeley (with Jianbo Shi, Thomas Leung, Serge Belongie, Charless.
Multiscale Symmetric Part Detection and Grouping Alex Levinshtein, Sven Dickinson, University of Toronto and Cristian Sminchisescu, University of Bonn.
CS 558 C OMPUTER V ISION Lecture VI: Segmentation and Grouping Slides adapted from S. Lazebnik.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
CSE 185 Introduction to Computer Vision Pattern Recognition 2.
EFFICIENT ROAD MAPPING VIA INTERACTIVE IMAGE SEGMENTATION Presenter: Alexander Velizhev CMRT’09 ISPRS Workshop O. Barinova, R. Shapovalov, S. Sudakov,
SVM-KNN Discriminative Nearest Neighbor Classification for Visual Category Recognition Hao Zhang, Alex Berg, Michael Maire, Jitendra Malik.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
Fitting: The Hough transform
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
Context-based vision system for place and object recognition Antonio Torralba Kevin Murphy Bill Freeman Mark Rubin Presented by David Lee Some slides borrowed.
Segmentation & Grouping Tuesday, Sept 23 Kristen Grauman UT-Austin.
CS654: Digital Image Analysis
Machine Learning ICS 178 Instructor: Max Welling Supervised Learning.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
Lecture 30: Segmentation CS4670 / 5670: Computer Vision Noah Snavely From Sandlot ScienceSandlot Science.
Image segmentation.
Segmentation slides adopted from Svetlana Lazebnik.
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
SIFT.
SIFT Scale-Invariant Feature Transform David Lowe
A Personal Tour of Machine Learning and Its Applications
Nearest-neighbor matching to feature database
Context-based vision system for place and object recognition
Finding Clusters within a Class to Improve Classification Accuracy
Nearest-neighbor matching to feature database
Learning to Combine Bottom-Up and Top-Down Segmentation
Prepared by: Mahmoud Rafeek Al-Farra
Image Segmentation CS 678 Spring 2018.
“Traditional” image segmentation
Presentation transcript:

1 Segmentation with Scene and Sub-Scene Categories Joseph Djugash Input Image Scene/Sub-Scene Classification Segmentation

2 Problem Statement Goal:  Accurate segmentation of salient objects/regions in any image. Problems/Issues:  What is a salient object(s)?  How do we identify the presence of these objects? What metric/feature/cue do we use? Is this consistent over all images?

3 Outline Normalized Cut  Learning segmentation Methods  The Naïve Approach  Affinity Matching  Minimizing Segmentation Error Discussion of Results

4 Bottom-Up Segmentation Normalized Cuts, Mean Shift, etc. Drawbacks  Parameters Number of Clusters/Segments Cluster Size (Sigma values)  Performance drops when dealing with a wide variety of images 1. Jianbo Shi; Malik, J. Normalized cuts and image segmentation. PAMI (2000) 2. D. Comaniciu, P. Meer. Mean Shift: A Robust Approach toward Feature Space Analysis. PAMI (2002)

5 Class Specific Segmentation Image patches from training images are fit to the input image Patch consistency checks correct inaccurate matching and segmentation Only known object classes can be segmented correctly Large database required to encompass all possible objects 3. Eran Borenstein, Shimon Ullman. Class-Specific, Top-Down Segmentation. ECCV (2002)

6 Outline Normalized Cut  Learning segmentation Methods  The Naïve Approach  Affinity Matching  Minimizing Segmentation Error Discussion of Results

7 The Naïve Approach Scene Categories Parameter Learning Parameter Look-Up using Bag- Of-Words Scene Categories Training Phase Image Segmentation Testing Phase Learnt Parameters: Cluster Range {min & max # of clusters} Sigma Values {controls size of cluster/segment} The parameters are learnt using a supervised reinforcement learning algorithm (Feedback +/ o /–)

8 Results: The Naïve Approach

9 Outline Normalized Cut  Learning segmentation Methods  The Naïve Approach  Affinity Matching  Minimizing Segmentation Error Discussion of Results

10 Affinity Matching Parameter Learning Nearest Neighbor Classification& Parameter Look-Up Training Phase Image Segmentation Testing Phase Parameters Learnt using reinforcement learning Nearest Neighbor Matching performed on a set of stored Affinity matrices Query images are classified and learnt parameters are used to Segments Image

11 Results: Affinity Matching

12 Outline Normalized Cut  Learning segmentation Methods  The Naïve Approach  Affinity Matching  Minimizing Segmentation Error Discussion of Results

13 Minimizing Segmentation Error LabelMe Data Gist Features Kmeans Clustering Image Segmentation Training Phase Parameter Learning Minimizing Segmentation Cost to Human Labeled Data Parameter Learning Minimizing Segmentation Cost to Human Labeled Data Segmentation Error J =  i ( J N + J S + J O ) Similarity to Non-Objects J N =  j W( px In, px Out ) Cluster Cost J S =  j e ( γ *|#Seg – #Obj|) i є Images in Cluster j є Labeled Objects Testing Phase Dissimilarity within Objects J N =  j e ( ) px In W( px In, px In ) 4. A. Torralba, K. P. Murphy, W. T. Freeman and M. A. Rubin. Context-based vision system for place and object recognition. AIM (2003) Object Pixels

14 Results: Minimizing Segmentation Error

15 Results: Minimizing Segmentation Error

16 Discussion The Naïve Approach  Uses intuitive categories that humans can identify with  Human feedback can be unreliable and inconsistent  Broad category labels lead to segmentation inaccuracy Affinity Matching  Affinity matrix provides a good feature space for segmentation  Closeness in the affinity matrix might not necessarily imply similarity in segmentation parameters Minimizing Segmentation Error  A qualitative measure for evaluating the validity of a segmentation  Human segmentation should try to realize image cues NCut segmentation prefers edge contours in the image

17 Questions?

18 Outline – Detailed The Naïve Approach  Reinforcement Learning  Bag-of-words scene classification Affinity Matching  NN classification on Affinity Matrix  Learning similar image structure Human Segmentation & Minimizing Cost  LabelMe Data  Gist Features

19

20 Learning Scene Categories – Bag-of-Words Scene Categories learnt from clustering on image codewords Feature Detection and Representation can be done using various Image Cues Codewords stored in the database need to cover a wide spectrum 3. Fei-Fei and P. Perona. A Bayesian hierarchical model for learning natural scene categories. CVPR (2005)