1 Storage of images for Efficient Retrieval  Representing IDB as relations  straightforward  Representing IDB with spatial data structures  represent.

Slides:



Advertisements
Similar presentations
1 DATA STRUCTURES USED IN SPATIAL DATA MINING. 2 What is Spatial data ? broadly be defined as data which covers multidimensional points, lines, rectangles,
Advertisements

Multidimensional Data Structures
Indexing DNA Sequences Using q-Grams
The A-tree: An Index Structure for High-dimensional Spaces Using Relative Approximation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa.
Content-Based Image Retrieval
Multidimensional Indexing
Multidimensional Data
Similarity Search in High Dimensions via Hashing
3D Shape Histograms for Similarity Search and Classification in Spatial Databases. Mihael Ankerst,Gabi Kastenmuller, Hans-Peter-Kriegel,Thomas Seidl Univ.
Multimedia DBs. Multimedia dbs A multimedia database stores text, strings and images Similarity queries (content based retrieval) Given an image find.
Preferential top-k search over local data dissertation thesis RNDr. Martin Šumák supervisor: doc. RNDr. Stanislav Krajči, PhD. consultant: RNDr. Peter.
Spatial Mining.
Multidimensional Data. Many applications of databases are "geographic" = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Group 3 Akash Agrawal and Atanu Roy 1 Raster Database.
Young Deok Chun, Nam Chul Kim, Member, IEEE, and Ick Hoon Jang, Member, IEEE IEEE TRANSACTIONS ON MULTIMEDIA,OCTOBER 2008.
Spring 2008MMDB--MIRS1 Multimedia Information Retrieval Systems.
Multimedia DBs.
Multiple-key indexes Index on one attribute provides pointer to an index on the other. If V is a value of the first attribute, then the index we reach.
Indexing Time Series. Time Series Databases A time series is a sequence of real numbers, representing the measurements of a real variable at equal time.
1 Image Databases  Conventional relational databases, the user types in a query and obtains an answer in response  It is different in image databases.
Spatio-Temporal Databases
CS 128/ES Lecture 5b1 Vector Based Data. CS 128/ES Lecture 5b2 Spatial data models 1.Raster 2.Vector 3.Object-oriented Spatial data formats:
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
ISP 433/633 Week 5 Multimedia IR. Goals –Increase access to media content –Decrease effort in media handling and reuse –Improve usefulness of media content.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatial Indexing I Point Access Methods.
An Incremental Refining Spatial Join Algorithm for Estimating Query Results in GIS Wan D. Bae, Shayma Alkobaisi, Scott T. Leutenegger Department of Computer.
1 Geometric index structures April 15, 2004 Based on GUW Chapter , [Arge01] Sections 1, 2.1 (persistent B- trees), 3-4 (static versions.
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept.
COMP 451/651 Multiple-key indexes
1Bloom Filters Lookup questions: Does item “ x ” exist in a set or multiset? Data set may be very big or expensive to access. Filter lookup questions with.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatial and Temporal Databases Efficiently Time Series Matching by Wavelets (ICDE 98) Kin-pong Chan and Ada Wai-chee Fu.
Multidimensional Data Many applications of databases are ``geographic'' = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
Dimensionality Reduction: Principal Components Analysis Optional Reading: Smith, A Tutorial on Principal Components Analysis (linked to class webpage)
Multimedia and Time-series Data
An Extension to XML Schema for Structured Data Processing Presented by: Jacky Ma Date: 10 April 2002.
Efficient Algorithms for Robust Feature Matching Mount, Netanyahu and Le Moigne November 7, 2000 Presented by Doe-Wan Kim.
Spatial Data Management Chapter 28. Types of Spatial Data Point Data –Points in a multidimensional space E.g., Raster data such as satellite imagery,
Mutlidimensional Indices Instructor: Randal Burns Lecture for 29 November 2005 Computer Science Johns Hopkins University.
IMAGE DATABASES Prof. Hyoung-Joo Kim OOPSLA Lab. Computer Engineering Seoul National University.
Multidimensional Indexes Applications: geographical databases, data cubes. Types of queries: –partial match (give only a subset of the dimensions) –range.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Fast Subsequence Matching in Time-Series Databases Author: Christos Faloutsos etc. Speaker: Weijun He.
Pattern Recognition April 19, 2007 Suggested Reading: Horn Chapter 14.
Answering Similar Region Search Queries Chang Sheng, Yu Zheng.
Spatial and Geographic Databases ADVANCED DATABASES Khawaja Mohiuddin Assistant Professor Department of Computer Sciences Bahria University (Karachi Campus)
INTERACTIVELY BROWSING LARGE IMAGE DATABASES Ronald Richter, Mathias Eitz and Marc Alexa.
Similarity Searching in High Dimensions via Hashing Paper by: Aristides Gionis, Poitr Indyk, Rajeev Motwani.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Spatial Database 2/5/2011 Reference – Ramakrishna Gerhke and Silbershatz.
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept. of Electronic.
Spatial and Geographic Databases. Spatial databases store information related to spatial locations, and support efficient storage, indexing and querying.
Content-Based Image Retrieval (CBIR) By: Victor Makarenkov Michael Marcovich Noam Shemesh.
Database Seminar The Gauss-Tree: Efficient Object Identification in Databases of Probabilistic Feature Vectors Authors : Christian Bohm, Alexey Pryakhin,
Indexing OLAP Data Sunita Sarawagi Monowar Hossain York University.
Indexing Time Series. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases Time Series databases Text databases.
Spatial Range Querying for Gaussian-Based Imprecise Query Objects Yoshiharu Ishikawa, Yuichi Iijima Nagoya University Jeffrey Xu Yu The Chinese University.
Multimedia Alicja Wieczorkowska Multimedia database systems and GIS.
FastMap : Algorithm for Indexing, Data- Mining and Visualization of Traditional and Multimedia Datasets.
BIT 3193 MULTIMEDIA DATABASE CHAPTER 4 : QUERING MULTIMEDIA DATABASES.
Oracle Spatial Extension of the RDBMS Oracle by spatial data types and operations –introduced in version 8 (current version.
Multidimensional Access Structures COMP3017 Advanced Databases Dr Nicholas Gibbins –
1 Image Databases  Conventional relational databases, the user types in a query and obtains an answer in response  It is different in image databases.
Spatial Data Management
Spatial Indexing I Point Access Methods.
Histogram—Representation of Color Feature in Image Processing Yang, Li
Presentation transcript:

1 Storage of images for Efficient Retrieval  Representing IDB as relations  straightforward  Representing IDB with spatial data structures  represent the image as rectangles  Representing IDB using image transformations  can be represented by a vector of length k (k is between 100 and 200)  However,  different photographs of the same person may vary, depending on a variety of factors F time of the day at which the photographs were taken F the lighting conditions F the camera used F the exact position of the subject’s head and facial expression F etc.

2 Representing IDB with Relations  (1) Create a relation called images having the scheme:  (Image, ObjID, XLB,XUB,YLB,YUB) F where Image is the name of the image file and (XLB,XUB,YLB,YUB) represent the rectangle in question  Suppose R is a rectangle specified by (XLB, XUB,YLB,YUB) and R is in Rec(I), the there exists a tuple F (I, newid, XLB XUB,YLB,YUB)  (2) For each property p, create a relation R p having the scheme:  (Image, XLB, XUB,YLB,YUB, Value) F where Image is the image file and the (XLB, XUB,YLB,YUB) denote a rectangular cell in the image  Properties can be F Pixel level properties (RGB values) F Object/region level properties (Name, age) F Image level properties (when image was captured, where, by whom)

3 Querying IDB Relations  Eliciting the contents of an image is done using image processing algorithms  Image processing algorithms are only partially accurate  This implies that tuples placed in an IDB relation by an image processing program has certain associated probabilistic attributes  The probability that John Lee is the name of o 2 is 0.75  The probability that Ken Yip is the name of o 2 is 0.15  There is a 10% missing probability ObjId Name Prob o 1 Jim Hatch 0.8 o 2 John Lee 0.75 o 2 Ken Yip 0.15 o 3 John Lee 1

4 Complex Queries  Query: What is the probability that pic1.gif contains both Jim Hatch and Ken Yip? (assume o 1 and o 2 are in pic1.gif)  Is it 0.8*0.15 = 0.12?  In general, the answer is NO ObjId Name Prob o 1 Jim Hatch 0.8 o 2 John Lee 0.75 o 2 Ken Yip 0.15 o 3 John Lee 1

5 Complex Queries  Consider pic8.gif with o 10 and o 11  Four possibilities  possibility 1: o 10 is Ken Yip and o 11 is John Hatch  possibility 2: o 10 is Ken Yip and o 11 is not John Hatch  possibility 3: o 10 is not Ken Yip but o 11 is John Hatch  possibility 4: o 10 is not Ken Yip and o 11 is not John Hatch  Suppose p i denotes the probability of possibility i  p 1 + p 2 = 0.5  p 3 + p 4 = 0.5  p 1 + p 3 = 0.8  p 2 + p 4 = 0.2  p 1 + p 2 + p 3 + p 4 = 1 ObjId Name Prob o 10 Ken Yip 0.5 o 10 Jim Hatch 0.4 o 11 Jim Hatch 0.8 o 11 John Lee 0.1

6 Interval Probability  Need to solve these linear equations to determine the probability that pic8.gif contains both Ken Yip and John Hatch  The result is an interval of probabilities  p 1 between 0.3 and 0.5  Requires the use of Interval probabilities!!!  Interval probabilities allow us to represent the margin of error of image processing algorithms in identifying the object  e.g. with 3% error ObjId Name Prob(l) Prob(u) o 1 Jim Hatch o 2 John Lee o 2 Ken Yip o 3 John Lee

7 Representing image DBs with R-Trees  Create a relation with two attributes (ImageId, ObjId)  Create an R-tree that stores rectangles  if the same rectangle appears in two images, then we have an overflow list  Each rectangle has an associated set of fields that specifies the object/region level properties  Not good for nearest neighbor queries as the tree is constructed with only two dimensions out of the n+2 dimensions of the image  Generalized R-Trees use all the n+2 dimensions to construct the tree

8 Implementations  Many use object-oriented implementation  Support methods such as  rotate, segment, edit  Most implementations assume the whole image to compare  Perform feature extraction and thus represent an image as a vector of n fields  an index is created on an n-dimensional vector  multidimensional extension of point quadtree  R-Tree  To perform similarity search, they compute the Euclidian distance between the vector representing the query image and those of all images