Strong Supervision From Weak Annotation Interactive Training of Deformable Part Models ICCV /05/23
Abstract Provide a framework for large scale learning and annotation of structured models 1.Interactive labeling 2.Online learning
Interactive Part Localization: model and notation Define score of a part configuration: part configuration, image x, part tree model T = (V, E) location (x p, y p ), scale s p, orientation r p, aspect v p : value from part detector (HOG feature) : quadratic function on the relative displacement between parts Maximum likelihood solution:
Interactive Part Localization: incorporating user input score incorporating user input sequence of user input operations up to time step t. j(t): index of the part annotated by the user in time step t is the user’s label of the part location unary potential Simple extensions include allowing imperfect user responses
Interactive Part Localization: Dynamic Programming(1) The goal is to compute cache tables which are indexable by pixel location :an array storing the maximum likelihood solutions for part q at time step t ※ p->q the optimal solution conditioned on placing p at position :
Interactive Part Localization: Dynamic Programming(2) : distance transform operation : cost associated with part q being at an offset of from p
Interactive Part Localization: Dynamic Programming(3) Using the pre-computed cache table
Interactive Part Localization: Algorithm
Online Structured Learning maximum margin structured learning problem (structured SVM): search optimal vector of weights w* error function: vector of HOG features + squared distances between adjacent parts x i image, y i = ɵ, penalty stochastic gradient descent (SGD): gradient:
Online Structured Learning: Algorithm
Experiments