An Additive Latent Feature Model

Slides:

Advertisements

Similar presentations

Topic models Source: Topic models, David Blei, MLSS 09.

Advertisements

Weakly supervised learning of MRF models for image region labeling Jakob Verbeek LEAR team, INRIA Rhône-Alpes.

Aggregating local image descriptors into compact codes

Automatic Image Annotation Using Group Sparsity

Integrated Instance- and Class- based Generative Modeling for Text Classification Antti PuurulaUniversity of Waikato Sung-Hyon MyaengKAIST 5/12/2013 Australasian.

1 Challenge the future HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences Omar Oreifej Zicheng Liu CVPR 2013.

Simultaneous Image Classification and Annotation Chong Wang, David Blei, Li Fei-Fei Computer Science Department Princeton University Published in CVPR.

Multi-layer Orthogonal Codebook for Image Classification Presented by Xia Li.

1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.

Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.

Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.

Object Recognition & Model Based Tracking © Danica Kragic Tracking system.

Semi-Supervised Hierarchical Models for 3D Human Pose Reconstruction Atul Kanaujia, CBIM, Rutgers Cristian Sminchisescu, TTI-C Dimitris Metaxas,CBIM, Rutgers.

DISCRIMINATIVE DECORELATION FOR CLUSTERING AND CLASSIFICATION ECCV 12 Bharath Hariharan, Jitandra Malik, and Deva Ramanan.

1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

Agenda Introduction Bag-of-words model Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.

Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson

Object Recognition. So what does object recognition involve?

Latent Dirichlet Allocation a generative model for text

CS292 Computational Vision and Language Visual Features - Colour and Texture.

Global and Efficient Self-Similarity for Object Classification and Detection CVPR 2010 Thomas Deselaers and Vittorio Ferrari.

Exercise Session 10 – Image Categorization

Multiclass object recognition

Shape-Based Human Detection and Segmentation via Hierarchical Part- Template Matching Zhe Lin, Member, IEEE Larry S. Davis, Fellow, IEEE IEEE TRANSACTIONS.

Bayesian Hierarchical Clustering Paper by K. Heller and Z. Ghahramani ICML 2005 Presented by HAO-WEI, YEH.

Svetlana Lazebnik, Cordelia Schmid, Jean Ponce

Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.

Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.

#MOTION ESTIMATION AND OCCLUSION DETECTION #BLURRED VIDEO WITH LAYERS

Object Detection with Discriminatively Trained Part Based Models

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Representations for object class recognition David Lowe Department of Computer Science University of British Columbia Vancouver, Canada Sept. 21, 2006.

Deformable Part Model Presenter ： Liu Changyu Advisor ： Prof. Alex Hauptmann Interest ： Multimedia Analysis April 11 st, 2013.

Unsupervised Learning of Visual Sense Models for Polysemous Words Kate Saenko Trevor Darrell Deepak.

Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.

Histograms of Oriented Gradients for Human Detection(HOG)

Latent Dirichlet Allocation

Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.

Week 10 Emily Hand UNR.

Towards Total Scene Understanding: Classiﬁcation, Annotation and Segmentation in an Automatic Framework N 工科所錢雅馨 2011/01/16 Li-Jia Li, Richard.

Convolutional Restricted Boltzmann Machines for Feature Learning Mohammad Norouzi Advisor: Dr. Greg Mori Simon Fraser University 27 Nov

Categorical Perception 강우현. Introduction Scalable representations for visual categorization Representation for functional and affordance-based categorization.

1.Learn appearance based models for concepts 2.Compute posterior probabilities or Semantic Multinomial (SMN) under appearance models. -But, suffers from.

CVC, June 4, 2012 Image categorization using Fisher kernels of non-iid image models Gokberk Cinbis, Jakob Verbeek and Cordelia Schmid LEAR team, INRIA,

Topic Modeling for Short Texts with Auxiliary Word Embeddings

Presented by David Lee 3/20/2006

Learning Mid-Level Features For Recognition

Performance of Computer Vision

Unsupervised Riemannian Clustering of Probability Density Functions

Paper Presentation: Shape and Matching

Trevor Savage, Bogdan Dit, Malcom Gethers and Denys Poshyvanyk

Object detection as supervised classification

Using Transductive SVMs for Object Classification in Images

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

A Tutorial on HOG Human Detection

Action Recognition in Temporally Untrimmed Videos

CS 1674: Intro to Computer Vision Scene Recognition

CVPR 2014 Orientational Pyramid Matching for Recognizing Indoor Scenes

The Open World of Micro-Videos

Brief Review of Recognition + Context

Matching Words with Pictures

Object Classes Most recent work is at the object level We perceive the world in terms of objects, belonging to different classes. What are the differences.

Word Embedding Word2Vec.

Creating Data Representations

Resource Recommendation for AAN

Michal Rosen-Zvi University of California, Irvine

Topic Models in Text Processing

AHED Automatic Human Emotion Detection

Unsupervised learning of visual sense models for Polysemous words

Presentation transcript:

An Additive Latent Feature Model for Mario Fritz UC Berkeley Michael Black Brown University Gary Bradski Willow Garage Sergey Karayev UC Berkeley Trevor Darrell UC Berkeley

Motivation Transparent objects are ubiquitous in domestic environments Relevant to domestic service robots Traditional local feature approach inappropriate Full physical model intractable

Related Work Material recognition: Recognition by Specularities: Finding Glass [McHenry@CVPR05/06] Detecting Specular Surfaces in Natural Images [DelPozo@CVPR07] Classifying Materials from their Reflectance Properties [Nillius@ECCV04] Low-Level Image Cues in the Perception of Translucent Materials [Fleming, Transactions on Applied Perception ’05] Recognition by Specularities: Using Specularities for Recognition [Osadchy@ICCV03] Transparent Motion and Layered Phenomena E.g. [Roth@CVPR06], [Ben-Ezra@ICCV03], [Darrell@CVPR93] … Acquisition and rendering of refractive patterns Environment Matting and Composition [Zongker@Siggraph99]

Related Work Material recognition: Recognition by Specularities: Finding Glass [McHenry@CVPR05/06] Detecting Specular Surfaces in Natural Images [DelPozo@CVPR07] Classifying Materials from their Reflectance Properties [Nillius@ECCV04] Low-Level Image Cues in the Perception of Translucent Materials [Fleming, Transactions on Applied Perception ’05] Recognition by Specularities: Using Specularities for Recognition [Osadchy@ICCV03] Transparent Motion and Layered Phenomena E.g. [Roth@CVPR06], [Ben-Ezra@ICCV03], [Darrell@CVPR93] … Acquisition and rendering of refractive patterns Environment Matting and Composition [Zongker@Siggraph99]

Related Work Material recognition: Recognition by Specularities: Finding Glass [McHenry@CVPR05/06] Detecting Specular Surfaces in Natural Images [DelPozo@CVPR07] Classifying Materials from their Reflectance Properties [Nillius@ECCV04] Low-Level Image Cues in the Perception of Translucent Materials [Fleming, Transactions on Applied Perception ’05] Recognition by Specularities: Using Specularities for Recognition [Osadchy@ICCV03] Transparent Motion and Layered Phenomena E.g. [Roth@CVPR06], [Ben-Ezra@ICCV03], [Darrell@CVPR93] … Acquisition and rendering of refractive patterns Environment Matting and Composition [Zongker@Siggraph99]

Related Work Material recognition: Recognition by Specularities: Finding Glass [McHenry@CVPR05/06] Detecting Specular Surfaces in Natural Images [DelPozo@CVPR07] Classifying Materials from their Reflectance Properties [Nillius@ECCV04] Low-Level Image Cues in the Perception of Translucent Materials [Fleming, Transactions on Applied Perception ’05] Recognition by Specularities: Using Specularities for Recognition [Osadchy@ICCV03] Transparent Motion and Layered Phenomena E.g. [Roth@CVPR06], [Ben-Ezra@ICCV03], [Darrell@CVPR93] … Acquisition and rendering of refractive patterns Environment Matting and Composition [Zongker@Siggraph99]

Related Work Material recognition: Recognition by Specularities: Finding Glass [McHenry@CVPR05/06] Detecting Specular Surfaces in Natural Images [DelPozo@CVPR07] Classifying Materials from their Reflectance Properties [Nillius@ECCV04] Low-Level Image Cues in the Perception of Translucent Materials [Fleming, Transactions on Applied Perception ’05] Recognition by Specularities: Using Specularities for Recognition [Osadchy@ICCV03] Transparent Motion and Layered Phenomena E.g. [Roth@CVPR06], [Ben-Ezra@ICCV03], [Darrell@CVPR93] … Acquisition and rendering of refractive patterns Environment Matting and Composition [Zongker@Siggraph99] Non of these approaches addresses transparent objects recognition in real-world conditions

Traditional Local Feature-based Recognition Codebook: quantize histogram Codebook clusters assume prototypical global patch appearance classifier

SIFT-type Descriptors SIFT is popular choice for local feature computation It performs spatial binning of orientation quantized gradient information Unnormalized distribution over local gradient statistics We will use the a particular visualization as proposed for the related HOG method

The Problem of Transparency Significant variation in patch appearance Often gradient energy is dominated by background

The Problem of Transparency Significant variation in patch appearance Often gradient energy is dominated by background ... but common latent structure

The Problem of Transparency Codebook: quantize histogram Codebook clusters assume prototypical global patch appearance classifier

The Problem of Transparency Codebook: quantize histogram Codebook clusters assume prototypical global patch appearance classifier

Key Idea: Local Latent Factorization Components: Latent component model latent histogram Codebook is replaced by a set of latent components classifier

Local Additive Feature Model Factor gradient descriptor into Unknown non-negative mixture weights Unknown mixture components Regularize with sparsity assumption Advantages vs. e.g. VQ, PCA: Additive model allows for superimposed structures Appropriate model for factorizing local gradient distribution No reliance on global patch appearance ….. …..

LDA-SIFT Factor SIFT descriptor into latent components using LDA/sLDA [Blei03,Griffiths04,Blei07]: additivity is realized as multinomial mixture model sparsity assumption is implemented as Dirichlet priors Graphical model Document = Patch Dirichlet prior …. … Image evidence Latent topics Transparent visual words inference Different from projections like PCA -> Inhibition effects Different from clustering -> Decomposition into substructures

LDA-SIFT Factor SIFT descriptor into latent components using LDA/sLDA [Blei03,Griffiths04,Blei07]: additivity is realized as multinomial mixture model sparsity assumption is implemented as Dirichlet priors Learnt mixture components Graphical model Document = Patch Dirichlet prior …. … Image evidence Latent topics Transparent visual words inference Different from projections like PCA -> Inhibition effects Different from clustering -> Decomposition into substructures

Comparison to previous SIFT/LDA

Transparent Visual Words Average occurrence on train Latent component Occurrences on test

Recognition Architecture Infer transparent visual words … T glass X Y Classifier LDA T background … X Y

Experiments

Evaluation Data

Results vs. baseline Training on 4 different glasses in front of screen Testing on 49 glass instances in home environment Sliding window linear SVM-detection

Recognition Architecture … T glass X Y Classifier LDA T background … X Y

Results: general vocabulary Training on 4 different glasses in front of screen Testing on 49 glass instances in home environment Sliding window linear SVM-detection

Recognition Architecture … T glass X Y Classifier sLDA T background … X Y

Results: sLDA Training on 4 different glasses in front of screen Testing on 49 glass instances in home environment Sliding window linear SVM-detection

Conclusion Traditional local feature models (VQ, NN) are poorly suited for transparent object recognition Proposed additive local feature models can detect superimposed image structures Developed statistical approach to learn such representations using probabilistic topic models Sparse factorization of local gradient statistics Encouraging results on real-world data

Future Work Different feature representations; extend model in hierarchical fashion Investigate addition of material property cues; discriminative inverse local light transport models Explore benefits for opaque object recognition; understand relationship to sparse image coding as well as to biological motivated models

Thank you for your attention.