The chains model for detecting parts by their context Leonid Karlinsky, Michael Dinerstein, Daniel Harari, and Shimon Ullman.

Slides:

Advertisements

Similar presentations

Advertisements

EuroCondens SGB E.

Shape Matching and Object Recognition using Low Distortion Correspondence Alexander C. Berg, Tamara L. Berg, Jitendra Malik U.C. Berkeley.

Summative Math Test Algebra (28%) Geometry (29%)

Object Recognition Using Locality-Sensitive Hashing of Shape Contexts Andrea Frome, Jitendra Malik Presented by Ilias Apostolopoulos.

突破信息检索壁垒－SciFinder Scholar 介绍

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

The basics for simulations

The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.

Weakly supervised learning of MRF models for image region labeling Jakob Verbeek LEAR team, INRIA Rhône-Alpes.

Learning Shared Body Plans Ian Endres University of Illinois work with Derek Hoiem, Vivek Srikumar and Ming-Wei Chang.

2011 WINNISQUAM COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=1021.

2011 FRANKLIN COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=332.

DTU Informatics Introduction to Medical Image Analysis Rasmus R. Paulsen DTU Informatics TexPoint fonts.

Static Equilibrium; Elasticity and Fracture

Lial/Hungerford/Holcomb/Mullins: Mathematics with Applications 11e Finite Mathematics with Applications 11e Copyright ©2015 Pearson Education, Inc. All.

Chart Deception Main Source: How to Lie with Charts, by Gerald E. Jones Dr. Michael R. Hyman, NMSU.

Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.

Learning to Combine Bottom-Up and Top-Down Segmentation Anat Levin and Yair Weiss School of CS&Eng, The Hebrew University of Jerusalem, Israel.

Human Action Recognition across Datasets by Foreground-weighted Histogram Decomposition Waqas Sultani, Imran Saleemi CVPR 2014.

Patch to the Future: Unsupervised Visual Prediction

Database-Based Hand Pose Estimation CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.

Human-Computer Interaction Human-Computer Interaction Segmentation Hanyang University Jong-Il Park.

USING LINKING FEATURES IN LEARNING NON-PARAMETRIC PART MODELS * Ammar Kamal Hattab ENGN2560 Final Project Presentation May 17, 2013 * Leonid Karlinsky,

Groups of Adjacent Contour Segments for Object Detection Vittorio Ferrari Loic Fevrier Frederic Jurie Cordelia Schmid.

Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.

Herding: The Nonlinear Dynamics of Learning Max Welling SCIVI LAB - UCIrvine.

Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.

1 Jun Wang, 2 Sanjiv Kumar, and 1 Shih-Fu Chang 1 Columbia University, New York, USA 2 Google Research, New York, USA Sequential Projection Learning for.

1 Face Tracking in Videos Gaurav Aggarwal, Ashok Veeraraghavan, Rama Chellappa.

The Layout Consistent Random Field for Recognizing and Segmenting Partially Occluded Objects By John Winn & Jamie Shotton CVPR 2006 presented by Tomasz.

Carnegie Mellon AISTATS 2009 Jonathan Huang Carlos Guestrin Carnegie Mellon University Xiaoye Jiang Leonidas Guibas Stanford University.

Computer vision: models, learning and inference Chapter 10 Graphical Models.

Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.

Face Detection using the Viola-Jones Method

1 Naïve Bayes Models for Probability Estimation Daniel Lowd University of Washington (Joint work with Pedro Domingos)

Building Face Dataset Shijin Kong. Building Face Dataset Ramanan et al, ICCV 2007, Leveraging Archival Video for Building Face DatasetsLeveraging Archival.

Professor: S. J. Wang Student : Y. S. Wang

Chapter 14: SEGMENTATION BY CLUSTERING 1. 2 Outline Introduction Human Vision & Gestalt Properties Applications – Background Subtraction – Shot Boundary.

80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.

Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.

Kevin Cherry Robert Firth Manohar Karki. Accurate detection of moving objects within scenes with dynamic background, in scenarios where the camera is.

Class-Specific Hough Forests for Object Detection Zhen Yuan Hsu Advisor：S.J.Wang Gall, J., Lempitsky, V.: Class-specic hough forests for object detection.

Graph-based Text Classification: Learn from Your Neighbors Ralitsa Angelova ， Gerhard Weikum : Max Planck Institute for Informatics Stuhlsatzenhausweg.

Sampling for Part Based Object Models Daniel Huttenlocher September, 2006.

Efficient Discriminative Learning of Parts-based Models M. Pawan Kumar Andrew Zisserman Philip Torr

CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.

Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.

School of Computer Science 1 Information Extraction with HMM Structures Learned by Stochastic Optimization Dayne Freitag and Andrew McCallum Presented.

KNN & Naïve Bayes Hongning Wang Today’s lecture Instance-based classifiers – k nearest neighbors – Non-parametric learning algorithm Model-based.

CVPR 2012 POSTER Mobile Object Detection through Client-Server based Vote Transfer.

KNN & Naïve Bayes Hongning Wang

Detecting Occlusion from Color Information to Improve Visual Tracking

Predictive Automatic Relevance Determination by Expectation Propagation Y. Qi T.P. Minka R.W. Picard Z. Ghahramani.

Lecture 26 Hand Pose Estimation Using a Database of Hand Images

Tracking Objects with Dynamics

Alan Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani

Intelligent Information System Lab

Markov Networks.

Learning to Combine Bottom-Up and Top-Down Segmentation

Learning Markov Networks

Vessel Extraction in X-Ray Angiograms Using Deep Learning

“The Truth About Cats And Dogs”

The topic discovery models

Learning complex visual concepts

Chap 8. Instance Based Learning

Nearest Neighbors CSC 576: Data Mining.

Introduction to Object Tracking

Learning complex visual concepts

Image Morphing using mesh warp and feature based warp

Presentation transcript:

The chains model for detecting parts by their context Leonid Karlinsky, Michael Dinerstein, Daniel Harari, and Shimon Ullman

Detects a “difficult” part by extending from an “easy” reference part Detect by marginalizing over chains of intermediate features Use the face to disambiguate the hand Goals & Intuition

Examples & Generality Same approach for any parts of any object!

Pre-processing: Detect the face Extract features Modeling the posterior: Model the probability that hand is in L h T – ordered subset of M features Exact inference: Sum over all simple chains in the graph Approximate inference: Sum over all chains (limit the max. length) Can be easily computed by matrix multiplication The chains model M,T Chains model

Inference M,T Chains model object vs. background likelihood ratios = Entry i-j is the marginal over all chains between F i and F j = voting probabilities =

Inference Instead of full marginalization, MAP inference over C ∙ S part is done: For each feature we compute the best chain leading to this feature from head This feature votes for nearby features

Training & probability estimationTraining: Get example images with detected reference part and target part marked by single point. Non-parametric probability estimation in a nutshell: Store training image features and pairs of nearby features* in an efficient ANN search structures (KD-trees) Compute the estimated probability by approximate KDE E.g.: - search for K nearest neighbors of, compute * use a feature selection strategy (see paper) to reduce the number of features and pairs

Chains & Stars Limitations of the star model: 1.Far away features do not contribute. 2.Features contribute to detection without testing for connectivity with the reference part. Star model – a special case of “chains” of length one star resultchains resultchains graph

Feature graph examples chains graphchains result chains graph

Results

Buehler et al 2008 – sign language dataset Training: 600 frames of a news sequenceTesting: 195 frames from 5 other sequences Our result: 84.9% detection Best in the literature: 79% [Kumar et al, 2009] (avg. over 6 body parts)

Ferrari et al Buffy dataset Training: movies shot in our labTesting: 308 frames / 3 episodes Our result: 47% detection Best in the literature: 46.1% [Andriluka et al, 2009]

Buehler et al – BBC news dataset (96.8%) Best in the literature: 95.6% [Buehler et al, 2008] Self trained scenario (87%) Person cross generalization scenario (86.9%) Full generalization scenario (73.2%)

UIUC cars dataset Training: 550 positive / 500 neg.Testing: 200 cars in 170 images Our result: 99.5% detection Best in the literature: 99.9% [Mutch & Lowe, 2006]

Summary: Same approach (same params) works for other object types (out of the box) A single non-parametric model for any object Detect difficult parts extending from the ‘easy’ parts Near features vote directly, far features vote through chains Can be used without the reference part and for full objects Our result: Our result: 63.1% - 1 st max only 90.3% - 1 st & 2 nd max detailed numerical results …

ChainsStar Scenario + Dataset1 max2 max3 max4 max1 max2 max3 max4 max self trained (ST) self trained (Horse) person cross-generalization (PCG) full generalization (ST) Chains [Buehler et al., 2008] [Kumar et al., 2009] [Ferrari et al., 2008] [Andriluka et al., 2009] Setting1 max2 max3 max4 max1 max BBC news signers Buffy % chains model (trained on Caltech-101 cars) 99.5% chains model (trained on UIUC) 98.5%Gall and Lempitsky, %Lampert et al, %Leibe et al, %Mutch and Lowe, 2006 Precision-Recall at EER - cars Methods chains model Gall and Lempitsky, 2009 Shotton et al, 2008 Methods 97.5  1% 93.9% 95.7% Precision-Recall at EER - horses 1 – average on 6 upper body parts all except Andriluka et al. and us use tracking all except us use color back …

Feature graph examples * weights – shown in logarithmic color scale chains graph*chains result chains graph*