An Image Database Retrieval Scheme Based Upon Multivariate Analysis and Data Mining Presented by C.C. Chang Dept. of Computer Science and Information.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Aggregating local image descriptors into compact codes
Input Space versus Feature Space in Kernel- Based Methods Scholkopf, Mika, Burges, Knirsch, Muller, Ratsch, Smola presented by: Joe Drish Department of.
1er. Escuela Red ProTIC - Tandil, de Abril, 2006 Principal component analysis (PCA) is a technique that is useful for the compression and classification.
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
Principal Component Analysis
Pattern Recognition Topic 1: Principle Component Analysis Shapiro chap
吳家宇 吳明翰 Face Detection Based on Template Matching and 2DPCA Algorithm 2009/01/14.
Face Recognition using PCA (Eigenfaces) and LDA (Fisherfaces)
Fast Image Replacement Using Multi-Resolution Approach Chih-Wei Fang and Jenn-Jier James Lien Robotics Lab Department of Computer Science and Information.
Implementing a reliable neuro-classifier
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
1 Embedded colour image coding for content-based retrieval Source: Journal of Visual Communication and Image Representation, Vol. 15, Issue 4, December.
Fast vector quantization image coding by mean value predictive algorithm Authors: Yung-Gi Wu, Kuo-Lun Fan Source: Journal of Electronic Imaging 13(2),
Summarized by Soo-Jin Kim
Presented By Wanchen Lu 2/25/2013
Presented by Tienwei Tsai July, 2005
ECE 8443 – Pattern Recognition LECTURE 03: GAUSSIAN CLASSIFIERS Objectives: Normal Distributions Whitening Transformations Linear Discriminants Resources.
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
Date: Advisor: Jian-Jung Ding Reporter: Hsin-Hui Chen.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
Unsupervised Learning Motivation: Given a set of training examples with no teacher or critic, why do we learn? Feature extraction Data compression Signal.
Design of PCA and SVM based face recognition system for intelligent robots Department of Electrical Engineering, Southern Taiwan University, Tainan County,
Reduces time complexity: Less computation Reduces space complexity: Less parameters Simpler models are more robust on small datasets More interpretable;
A Fast LBG Codebook Training Algorithm for Vector Quantization Presented by 蔡進義.
Content Based Color Image Retrieval vi Wavelet Transformations Information Retrieval Class Presentation May 2, 2012 Author: Mrs. Y.M. Latha Presenter:
Image Compression Using Address-Vector Quantization NASSER M. NASRABADI, and YUSHU FENG Presented by 蔡進義 P IEEE TRANSACTIONS ON COMMUNICATIONS,
Unsupervised Learning II Feature Extraction
Unsupervised Learning II Feature Extraction
Background on Classification
ARTIFICIAL NEURAL NETWORKS
LECTURE 10: DISCRIMINANT ANALYSIS
Data Mining and Its Applications to Image Processing
Chapter 3 向量量化編碼法.
Principal Component Analysis (PCA)
A new data transfer method via signal-rich-art code images captured by mobile devices Source: IEEE Transactions on Circuits and Systems for Video Technology,
Principal Component Analysis
REMOTE SENSING Multispectral Image Classification
REMOTE SENSING Multispectral Image Classification
A Color Image Hiding Scheme Based on SMVQ and Modulo Operator
Chair Professor Chin-Chen Chang Feng Chia University
指導教授: Chang, Chin-Chen (張真誠)
A Data Hiding Scheme Based Upon Block Truncation Coding
PCA is “an orthogonal linear transformation that transfers the data to a new coordinate system such that the greatest variance by any projection of the.
Introduction to Statistical Methods for Measuring “Omics” and Field Data PCA, PcoA, distance measure, AMOVA.
Introduction PCA (Principal Component Analysis) Characteristics:
第七章 資訊隱藏 張真誠 國立中正大學資訊工程研究所.
Foundation of Video Coding Part II: Scalar and Vector Quantization
A Study of Digital Image Coding and Retrieving Techniques
Advisor: Chin-Chen Chang1, 2 Student: Yi-Pei Hsieh2
X.1 Principal component analysis
Generally Discriminant Analysis
第 四 章 VQ 加速運算與編碼表壓縮 4-.
LECTURE 09: DISCRIMINANT ANALYSIS
Density-Based Image Vector Quantization Using a Genetic Algorithm
Color Image Retrieval based on Primitives of Color Moments
Principal Component Analysis
Chair Professor Chin-Chen Chang Feng Chia University
第 九 章 影像邊緣偵測 9-.
A Self-Reference Watermarking Scheme Based on Wet Paper Coding
A Color Image Hiding Scheme Based on SMVQ and Modulo Operator
Hiding Information in VQ Index Tables with Reversibility
A Virtual Image Cryptosystem Based upon Vector Quantization
EM Algorithm and its Applications
A Self-Reference Watermarking Scheme Based on Wet Paper Coding
NON-NEGATIVE COMPONENT PARTS OF SOUND FOR CLASSIFICATION Yong-Choon Cho, Seungjin Choi, Sung-Yang Bang Wen-Yi Chu Department of Computer Science &
Random Neural Network Texture Model
Color Image Retrieval based on Primitives of Color Moments
Presentation transcript:

An Image Database Retrieval Scheme Based Upon Multivariate Analysis and Data Mining Presented by C.C. Chang Dept. of Computer Science and Information Engineering, National Chung Cheng University

Outline Introduction Image Retrieval The Proposed Scheme Based Upon PCA and Data Mining Image Feature Extraction Data Mining for Image Features Illustration Future Works Conclusions

Introduction Image database Query image The ability to develop an efficient and effective image retrieval system to access desired images in the depth of the database has been a more and more interesting and challenging topic of research

Introduction Image retrieval system Text-based retrieval Text-based Content-based Text-based retrieval Query by keywords Keywords: setting sun, mountain, ocean, purple,… The ability to develop an efficient and effective image retrieval system to access desired images in the depth of the database has been a more and more interesting and challenging topic of research

Introduction Content-based image retrieval Images are indexed by their content, color, shape, texture, features and so on. Feature extraction methods Histogram Neural network (NN) Support vector machines (SVM) Genetic algorithm (GA) Principal component analysis (PCA) … The ability to develop an efficient and effective image retrieval system to access desired images in the depth of the database has been a more and more interesting and challenging topic of research

The proposed scheme based upon PCA and data mining If a digital image can be transformed into a transaction database, then we can use its corresponding derived association rules as its main features to filter out all the undesired digital images for a query image.

Principal component analysis (PCA) Given a set of points Y1, Y2, …, and YM where every Yi is characterized by a set of variables X1, X2, …, and XN. We want to find a direction D = (d1, d2, …, dN), where such that the variance of points projected onto D is maximized.

Principal component analysis (PCA) Algorithm of PCA Start by coding the variables Y = (Y1, Y2, …YN) to have zero means and unit variances. Calculate the covariance matrix C of the samples. Find the eigenvalues λ1, λ2, …, λN, for C, where λi λi+1, i = 1, 2, …, N-1. Let D1, D2, … DN denote the corresponding eigenvectors. D1 is the first principal component direction, D2 is the second principal component direction, … , DN is the Nth principal component direction .

Principal component analysis (PCA) Let A be a n*n covariance matrix. is an eigenvalue of A, and x is an eigenvector associated with the eigenvalue x = Ix, where I is an n*n identity matrix The characteristic polynomial of the matrix A

Principal component analysis (PCA) For example, Let A be a 2*2 matrix.

PCA For example, 40 samples with 2 variables, X1 and X2 Covariance matrix λ1 =1160.139 λ2 =36.780

Principal component analysis (PCA) D1 = [0.710 0.703] D2 = [-0.703 0.710]

Image Feature Extraction -PCA Gray level value M = Next, we shall illustrate how PCA can be used to extract features from images. There is a example image M with 10 * 10 pixels.

Image Feature Extraction -PCA 10*10 pixels Each block with 4 pixels We partition the image into 5 * 5 blocks each with 4 pixels. Where NB is the number of blocks which is 25. Number of blocks (NB) is 25

Image Feature Extraction -PCA Let matrix A be a matrix, which collects blocks of the image.

Image Feature Extraction -PCA (1) Compute the covariance matrix of an image C1 C4 CM = Next, we construct a variance covariance matrix (VCM) for A. Each column can be regarded as a variable, which means the number of variables is N. Let Ck denote a variable that is the kth column of A. Here we given two variables Cs and Ct, and are the means of Cs and Ct, respectively. Equation shows the formula of covariance between any two variables. Var (Ck) = Cov(Ck,Ck)

Image Feature Extraction -PCA (1) Compute the covariance matrix of an image CM = This slide shows the variance covariance matrix of A.

Image Feature Extraction -PCA (2) Determine eigenvalues and eigenvectors =21860, =1743, =877.335, and =393.73, The EValues of M are =21860, =1743, =877.335, and =393.73. Eigenvalues

Image Feature Extraction -PCA (2) Determine eigenvalues and eigenvectors CM = =21860, =1743, =877.335, and =393.73, Eigenvectors Each EVector corresponds to an EValue; therefore, there are as many EVectors as EValues. Each EVector can be seen as a direction of an axis.

Image Feature Extraction -PCA (3) Form the principal components (PCs) M = 23.9 = 20 * 0.419 + 8 * 0.488 + 15 * 0.57 + 6 * 0.511

Image Feature Extraction -PCA (4) Normalize the projected values

Image Feature Extraction -PCA (4) Normalize the projected values

Principal component analysis (PCA) PCA is a popular multivariate analysis technique, which can be used to extract features from images and to filter candidate images from image database. Nerveless, the number of candidate images offered by PCA is usually very large for a huge image database. Therefore, data mining technique is applied to speed up the retrieving speed and increase the accuracy rate.

Data Mining – Association Rules Candidate 1-itemsets I = {A, B, C, D} Frequent 1-itemsets Minimum Support = 3

Data Mining – Association Rules Candidate 2-itemsets I = {A, B, C, D} Frequent 2-itemsets Minimum Support = 3

Data Mining – Association Rules Minimum Confidence = 100% Frequent 2-itemsets Association Rules

Data Mining for Image Features

Data Mining for Image Features

Data Mining for Image Features Database for Normalization Projected Image(NPIDB) In Horizontal Direction

Data Mining for Image Features Minimum Support = 3 Candidate 1-itemsets Candidate 2-itemsets Frequent 1-itemsets

Data Mining for Image Features Minimum Confidence = 75% Frequent 2-itemsets

Data Mining for Image Features Association Rules in Horizontal Direction

Data Mining for Image Features Database for Normalization Projected Image(NPIDB) In Vertical Direction

Data Mining for Image Features Association Rules in Vertical Direction

Data Mining for Image Features Database for Normalization Projected Image(NPIDB) In Diagonal Direction

PCA and data mining

Illustration 450 full-color images 300 blocks for each image 4*4 pixels for a block

Illustration A query image Q The set of eigenvalues of Q is {0, 2, 4, 6, 8}

Illustration Rules of Q are File name is “SW003.JPG.”

Future works - VQ and PCA Vector Quantization (VQ) An image is separated into a set of input vectors Each input vector is matched with a codeword of the codebook

Vector Quantization (VQ) Definition of vector quantization (VQ): , where Y is a finite subset of Rk. VQ is composed of the following three parts: Codebook generation process, Encoding process, and Decoding process.

Vector Quantization (VQ) Image Index table

Vector Quantization (VQ) Codebook generation 1 . N-1 N Training Images Training set Separating the image to vectors

Vector Quantization (VQ) Codebook generation 1 . 1 . 254 255 N-1 N Initial codebook Training set Codebook initiation

Vector Quantization (VQ) 1 . Index sets 1 . 254 255 (1, 2, 5, 9, 45, …) (101, 179, 201, …) (8, 27, 38, 19, 200, …) N-1 N (23, 0, 67, 198, 224, …) Codebook Ci Training set 1 . Compute mean values 254 255 Replace the old vectors New Codebook Ci+1 Training using iteration algorithm

Example Codebook To encode an input vector, for example, v = (150,145,121,130) (1) Compute the distance between v with all vectors in codebook d(v, cw1) = 114.2 d(v, cw2) = 188.3 d(v, cw3) = 112.3 d(v, cw4) = 124.6 d(v, cw5) = 122.3 d(v, cw6) = 235.1 d(v, cw7) = 152.5 d(v, cw8) = 63.2 (2) So, we choose 8 to replace the input vector v.

The Encoding algorithm using PCA Codebook The covariance matrix

The Encoding algorithm using PCA From the covariance matrix, we compute D1: (0.5038, 0.4904, 0.4788, 0.5259), λ1=19552, D2: (-0.4915, -0.5126, 0.4293, 0.5580), λ2=151, D3: (-0.0294, -0.0292, 0.7658, -0.6418), λ3=86 and D4: (0.7098, -0.7042, -0.0108, -0.0134), λ4=6. D1: (0.5038, 0.4904, 0.4788, 0.5259) is a coordinate D1 reserves 98.77% information of the variance of the codewords.

The Encoding algorithm using PCA The new sorted codebook and the corresponding projected value of codewords Codebook The sorted codewords The projected values D1: (0.5038, 0.4904, 0.4788, 0.5259)

The Encoding algorithm using PCA Encode an input vector v = (150, 145, 121, 130) Transform v to α=D1*v α= (0.5038, 0.4904, 0.4788, 0.5259) * (150, 145, 121, 130)T= 272.98 321.93 is the closet value to 272.98 For 321.93, d(v, cw’5) = 63.2 For 162.60, d(v, cw’4) = 122.3 For 382.84, d(v, cw’6) = 114.2 So, we choose cw’5 to replace the v.

VQ and PCA for image retrieval Association Rules:

VQ and PCA for image retrieval Association Rules: ~

Query image Projected image ~

Conclusions An efficient image retrieval scheme based upon multivariate analysis technique and a data mining technique. PCA – extracting image features Association rules - matching the candidate images. VQ and PCA for similar image retrieval.