Detecting Categories in News Video Using Image Features Slav Petrov, Arlo Faria, Pascal Michaillat, Alex Berg, Andreas Stolcke, Dan Klein, Jitendra Malik.

Slides:



Advertisements
Similar presentations
Shape Matching and Object Recognition using Low Distortion Correspondence Alexander C. Berg, Tamara L. Berg, Jitendra Malik U.C. Berkeley.
Advertisements

Object Recognition Using Locality-Sensitive Hashing of Shape Contexts Andrea Frome, Jitendra Malik Presented by Ilias Apostolopoulos.
Content-Based Image Retrieval
1 Challenge the future HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences Omar Oreifej Zicheng Liu CVPR 2013.
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Classification using intersection kernel SVMs is efficient Joint work with Subhransu Maji and Alex Berg Jitendra Malik UC Berkeley.
Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification Computer Vision, ICCV IEEE 11th International.
MIT CSAIL Vision interfaces Approximate Correspondences in High Dimensions Kristen Grauman* Trevor Darrell MIT CSAIL (*) UT Austin…
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features Kristen Grauman Trevor Darrell MIT.
Spatial Histograms for Head Tracking Sriram Rangarajan Department of Electrical and Computer Engineering, Clemson University, Clemson, SC
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
Image and video descriptors
Ziming Zhang *, Ze-Nian Li, Mark Drew School of Computing Science, Simon Fraser University, Vancouver, B.C., Canada {zza27, li, Learning.
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
SUPER: Towards Real-time Event Recognition in Internet Videos Yu-Gang Jiang School of Computer Science Fudan University Shanghai, China
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
Fast intersection kernel SVMs for Realtime Object Detection
São Paulo Advanced School of Computing (SP-ASC’10). São Paulo, Brazil, July 12-17, 2010 Looking at People Using Partial Least Squares William Robson Schwartz.
DISCRIMINATIVE DECORELATION FOR CLUSTERING AND CLASSIFICATION ECCV 12 Bharath Hariharan, Jitandra Malik, and Deva Ramanan.
A presentation by Modupe Omueti For CMPT 820:Multimedia Systems
Computer Vision Group University of California Berkeley Shape Matching and Object Recognition using Shape Contexts Jitendra Malik U.C. Berkeley (joint.
One-Shot Multi-Set Non-rigid Feature-Spatial Matching
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
A Study of Approaches for Object Recognition
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
CVR05 University of California Berkeley 1 Familiar Configuration Enables Figure/Ground Assignment in Natural Scenes Xiaofeng Ren, Charless Fowlkes, Jitendra.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2005 with a lot of slides stolen from Steve Seitz and.
Supervised Distance Metric Learning Presented at CMU’s Computer Vision Misc-Read Reading Group May 9, 2007 by Tomasz Malisiewicz.
A New Correspondence Algorithm Jitendra Malik Computer Science Division University of California, Berkeley Joint work with Serge Belongie, Jan Puzicha,
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
DVMM Lab, Columbia UniversityVideo Event Recognition Video Event Recognition: Multilevel Pyramid Matching Dong Xu and Shih-Fu Chang Digital Video and Multimedia.
Overview Introduction to local features
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
Computer vision.
Action recognition with improved trajectories
Problem Statement A pair of images or videos in which one is close to the exact duplicate of the other, but different in conditions related to capture,
Recognition and Matching based on local invariant features Cordelia Schmid INRIA, Grenoble David Lowe Univ. of British Columbia.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
SVM-KNN Discriminative Nearest Neighbor Classification for Visual Category Recognition Hao Zhang, Alex Berg, Michael Maire, Jitendra Malik.
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
Group Sparse Coding Samy Bengio, Fernando Pereira, Yoram Singer, Dennis Strelow Google Mountain View, CA (NIPS2009) Presented by Miao Liu July
Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.
Evaluation of Research Theme CogB. Objectives LEAR: LEArning and Recognition in vision Visual recognition and scene understanding –Particular objects.
Advanced Multimedia Image Content Analysis Tamara Berg.
Computer Vision Group University of California Berkeley On Visual Recognition Jitendra Malik UC Berkeley.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,
Histograms of Oriented Gradients for Human Detection(HOG)
Deconstruction: Discriminative learning of local image descriptors Samantha Horvath Learning Based Methods in Vision 2/14/2012.
Advanced Multimedia Image Content Analysis Tamara Berg.
Content-Based Image Retrieval QBIC Homepage The State Hermitage Museum db2www/qbicSearch.mac/qbic?selLang=English.
Goggle Gist on the Google Phone A Content-based image retrieval system for the Google phone Manu Viswanathan Chin-Kai Chang Ji Hyun Moon.
Categorical Perception 강우현. Introduction Scalable representations for visual categorization Representation for functional and affordance-based categorization.
Image Features (I) Dr. Chang Shu COMP 4900C Winter 2008.
Learning Mid-Level Features For Recognition
Nonparametric Semantic Segmentation
Recognition using Nearest Neighbor (or kNN)
Paper Presentation: Shape and Matching
Geometric Blur Descriptors for Point Correspondence
Digit Recognition using SVMS
Self-Organizing Maps for Content-Based Image Database Retrieval
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
A Tutorial on HOG Human Detection
Detecting Artifacts and Textures in Wavelet Coded Images
CS 1674: Intro to Computer Vision Scene Recognition
Brief Review of Recognition + Context
Recognition and Matching based on local invariant features
Presentation transcript:

Detecting Categories in News Video Using Image Features Slav Petrov, Arlo Faria, Pascal Michaillat, Alex Berg, Andreas Stolcke, Dan Klein, Jitendra Malik

System Overview Video Category correlation Sequential context Audio Images ASR TFIDF MFCC GB Feature extraction SVM GMM SVM Primary systems Source combination 1-best selection Higher-level systems

Image Features in TrecVid ’05  IBM:  Color Histogram  Color Correlogram  Color Moments  CMU (local):  Color Histograms (in different color spaces)  Texture Histograms  Columbia (part based model):  Color  Texture  Tsinghua (local and global):  Color Auto-Correlograms  Color Coherence Vectors  Color Histograms  Co-occurence Texture  Wavelet Texture Grid  Edge Histogram Layout  Edge Histograms  Size  Spatial Relation  Color Moments  Edge Histograms  Wavelet Texture

Image Features in TrecVid ’05 IBM CMU Columbia Tsinghua Berkeley Color Histograms  Moments  Correlograms  Texture Histograms  Wavelets  Edge Histograms  Shape 

Exemplars for Recognition  Use exemplars for recognition  Compare query image and each exemplar using shape cues Query Image Database of Exemplars

Finding similar patches Exemplar Query

Compute sparse channels from image Geometric Blur (Local Appearance Descriptor) Geometric Blur Descriptor ~ Apply spatially varying blur and sub-sample Extract a patch in each channel Idealized signal Descriptor is robust to small affine distortions [Berg & Malik, CVPR’01]

Horizontal Channel Vertical Channel Increasing Blur GB in Practice  In practice compute discrete blur levels for whole image and sample as needed for each feature location.

Comparing Images  Sample 200 GB features from edge points  Dissimilarity from A to B is where the F x are the GB features. [Berg, Berg & Malik, CVPR’05]

Caltech 101 Dataset  Object Recognition Benchmark  101 Categories:  Stereotypical pose  Little clutter  Objects centered  One object per image

Caltech 101 Results [Zhang, Berg, Maire & Malik, CVPR’06] uses GB features

Primal features for SVM  Compare to 50 prototypes from each class  Use distances as feature vector for an SVM …….. Query Prototype s Featur e Vector … 0.1 … 0.8 … ………. 0.7

SVM features interpretation  Slices of the Kernel Matrix:  Fixed-points in a higher dimensional vector space: q q titi titi tjtj tktk tktk tjtj

SVM Specifics  SVM light package  Same parameters for all categories:  Linear kernel  Default regularization parameter  Asymmetric cost doubling the weight of positive examples

Results ’06 Sports Meeting Computer- TV-Screen Car mAP = 0.11 Results ’05 Berkeley-Shape mAP = 0.38 Best ’05 (IBM) mAP = 0.34 Best Berkeley-Shape Median

Limitations  Several objects per image:  Features do not capture:  Different Scales  Color

Conclusions  Shape is an important cue for object recognition.  System that uses shape features only can have competitive performance.  Shape features are orthogonal to features used in the past.

Thank You!