Spatio-chromatic image content descriptors and their analysis using Extreme Value theory Vasileios Zografos and Reiner Lenz

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Distinctive Image Features from Scale-Invariant Keypoints David Lowe.
Office of SA to CNS GeoIntelligence Introduction Data Mining vs Image Mining Image Mining - Issues and Challenges CBIR Image Mining Process Ontology.
Relevance Feedback and User Interaction for CBIR Hai Le Supervisor: Dr. Sid Ray.
Aggregating local image descriptors into compact codes
Presented by Xinyu Chang
Evaluating Color Descriptors for Object and Scene Recognition Koen E.A. van de Sande, Student Member, IEEE, Theo Gevers, Member, IEEE, and Cees G.M. Snoek,
Content-Based Image Retrieval
Presented By: Vennela Sunnam
Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction.
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.
CS324e - Elements of Graphics and Visualization Color Histograms.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
The Statistics of Fingerprints A Matching Algorithm to be used in an Investigation into the Reliability of the Use of Fingerprints for Identification Bob.
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
A presentation by Modupe Omueti For CMPT 820:Multimedia Systems
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
A Study of Approaches for Object Recognition
SWE 423: Multimedia Systems
1 Visual Information Extraction in Content-based Image Retrieval System Presented by: Mian Huang Weichuan Dong Apr 29, 2004.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
Jeff Howbert Introduction to Machine Learning Winter Machine Learning Feature Creation and Selection.
Information Retrieval in Practice
Introduction --Classification Shape ContourRegion Structural Syntactic Graph Tree Model-driven Data-driven Perimeter Compactness Eccentricity.
Computer vision.
July 11, 2001Daniel Whiteson Support Vector Machines: Get more Higgs out of your data Daniel Whiteson UC Berkeley.
The MPEG-7 Color Descriptors
Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
Boris Babenko Department of Computer Science and Engineering University of California, San Diego Semi-supervised and Unsupervised Feature Scaling.
Texture analysis Team 5 Alexandra Bulgaru Justyna Jastrzebska Ulrich Leischner Vjekoslav Levacic Güray Tonguç.
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
Like.com vs. Ugmode Non-infringement arguments *** CONFIDENTIAL *** Prepared by Ugmode, Inc.
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
Understanding The Semantics of Media Chapter 8 Camilo A. Celis.
Non-Photorealistic Rendering and Content- Based Image Retrieval Yuan-Hao Lai Pacific Graphics (2003)
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Kylie Gorman WEEK 1-2 REVIEW. CONVERTING AN IMAGE FROM RGB TO HSV AND DISPLAY CHANNELS.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Content-Based Image Retrieval QBIC Homepage The State Hermitage Museum db2www/qbicSearch.mac/qbic?selLang=English.
Visual Computing Computer Vision 2 INFO410 & INFO350 S2 2015
CSE 185 Introduction to Computer Vision Feature Matching.
Methods for 3D Shape Matching and Retrieval
Miguel Tavares Coimbra
Distinctive Image Features from Scale-Invariant Keypoints
3D Reconstruction Using Image Sequence
A Genetic Algorithm-Based Approach to Content-Based Image Retrieval Bo-Yen Wang( 王博彥 )
Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.
Why Is It There? Chapter 6. Review: Dueker’s (1979) Definition “a geographic information system is a special case of information systems where the database.
1 C.A.L. Bailer-Jones. Machine Learning. Data exploration and dimensionality reduction Machine learning, pattern recognition and statistical data modelling.
Heechul Han and Kwanghoon Sohn
A. M. R. R. Bandara & L. Ranathunga
Learning Mid-Level Features For Recognition
Data Preparation for Deep Learning
Machine Learning Feature Creation and Selection
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Image Segmentation Techniques
Contrasts & Statistical Inference
SIFT keypoint detection
Contrasts & Statistical Inference
CSE 185 Introduction to Computer Vision
Contrasts & Statistical Inference
Presentation transcript:

Spatio-chromatic image content descriptors and their analysis using Extreme Value theory Vasileios Zografos and Reiner Lenz Computer Vision Laboratory, Linköping University, Sweden 1. Introduction Challenges for Content based image retrieval (CBIR):  Increase in online visual information  Large variation in content, appearance and quality  Images indexed by simple and erroneous textual tags  Complex, sophisticated, slow descriptors are not suited for large scale CBIR tasks Our proposal:  Fast spatio-chromatic descriptors suited for fast search over large image databases  Low dimensional representation using models derived from Extreme Value theory Garnics 2. Spatio-chromatic descriptors 3. Extreme value theory (EVT) 4. Our approach 5. Experiments – The scale-shape space 6. Experiments – classification and retrieval 7. Conclusions “The limiting distribution of the extrema of a large number of i.i.d. random variables, is one of the three parametric forms:” Weibull:, Frechet: Gumbel: (1)  Our filters are essentially sums of differences of correlated variables [3]. This also leads to the EVT forms (1)  We can use (1) as analytical models of the spatio-chromatic filtered image distribution. Method:  Filter each image with the 48 spatio-chromatic filters organized in 24 vectors  Represent the magnitude of each filter vector as: model type + 3 parameters from (1)  Parameter estimation: ML estimation using Newton-Raphson descent  Model type selection: Residual based goodness-of-fit (g.o.f.) with the coeff. of determination R 2 Result:  We can do analysis and classification in a low dimensional space 24x3  Additional benefits of the EVT models compared to histograms: Continuous; natural clustering in scale-shape space; semantic information (saliency) isolation How well do the EVT models explain our filtered data?  2 image databases (1100 colour photos thumbnails) – natural and synthetic  Tested all 3 models in (1)  Various g.o.f. measures (K-S test, g-test, chi-square, R 2 ) Results:  The EVT models provide a good fit to over 80% of the filtered images  Especially suited for natural images  The R 2 test is the most robust measure than other typical statistical measures Fig 4. The intensity and colour filters also have a natural, distinct distribution in this space. Fig 2. Trajectories of model parameters in scale-shape space of an image under linear and nonlinear transformations (left) and increase in noise and smoothing (right)  The filters and EVT models can be used for very fast classification and retrieval.  Trained an SVM on the 24x3 parameters  4 class classification example of scenes and painting styles (abstract classes) Fig 5. Top ranked results from the 4 classes. Overall All-to-All classification score 40.5%. Fig 3. Original, downscaled image (left) and a filtered result (middle). The filter responses at the tails (i.e. extrema) of the distribution are shown on the right. We can see that extrema typically correspond to salient features such as edges and corners. The scale-shape space is the space spanned by the two parameters of the models in (1). We can analyse the location and dispersion of filtered images in that space and their trajectories as their properties vary. It turns out that the images occupy different portions of that space depending on their texture properties and intensity variation. Fig 1. Samples from a photo database distributed in scale- shape space. This behaviour generalises to other datasets.  Presented a set of spatio-chromatic descriptors well suited for fast image retrieval  We have used the EVT models to describe the filter output distributions  More flexible, more descriptive and more compact than other competing representations such as histograms and fragmentation theory. References: [1] R. Lenz. “Investigation of receptive fields using representations of dihedral groups” JVCIR 6 (1995) [2] R. Lenz et al. “A group theoretical toolbox for color image operators” ICIP 3. (2005) [3] E. Bertin et al. “Generalized extreme value statistics and sum of correlated variables” J. Phys. A: Math. Gen , (2006) This research was funded by the EU FP7/ programme, under grant agreement No – GARNICS. Symmetry groups and filter design:  Filter systems should be adapted to transformations of the image grid properties of the RGB color space  Digital Images are defined on grids (square or hexagonal) their symmetry groups are the dihedral groups D(4) and D(6). (See [1]).  RGB channels are on average interchangeable the RGB symmetry group is the permutation group equal to the dihedral group D(3). (See [2]).  The representation theory of the dihedral groups is used to construct filter systems with: simple transformation properties under grid and color transformations uncorrelated filter responses minimum-mean-squared error encoding are generalizations of the FFT for spatial RGB distributions Symmetry groups D(4) and D(3)