SWE 423: Multimedia Systems

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Image Retrieval: Current Techniques, Promising Directions, and Open Issues Yong Rui, Thomas Huang and Shih-Fu Chang Published in the Journal of Visual.
Wavelets Fast Multiresolution Image Querying Jacobs et.al. SIGGRAPH95.
電腦視覺 Computer and Robot Vision I
Content-Based Image Retrieval
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 4 – Digital Image Representation Klara Nahrstedt Spring 2009.
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
Chapter 3 Image Enhancement in the Spatial Domain.
Computer Vision Lecture 16: Texture
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
IDEAL 2005, 6-8 July, Brisbane Multiresolution Analysis of Connectivity Atul Sajjanhar, Deakin University, Australia Guojun Lu, Monash University, Australia.
PHP-based Image Recognition and Retrieval of Late 18th Century Artwork Ben Goodwin Handouts are available for students writing summaries for class assignments.
Lecture 12 Content-Based Image Retrieval
ICIP 2000, Vancouver, Canada IVML, ECE, NTUA Face Detection: Is it only for Face Recognition?  A few years earlier  Face Detection Face Recognition 
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
SWE 423: Multimedia Systems
Image Filtering CS485/685 Computer Vision Prof. George Bebis.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
SWE 423: Multimedia Systems Chapter 4: Graphics and Images (4)
3. Introduction to Digital Image Analysis
RETRIEVAL OF MULTIMEDIA OBJECTS USING COLOR SEGMENTATION AND DIMENSION REDUCTION OF FEATURES Mingming Lu, Qiyu Zhang, Wei-Hung Cheng, Cheng-Chang Lu Department.
Processing Digital Images. Filtering Analysis –Recognition Transmission.
Texture Reading: Chapter 9 (skip 9.4) Key issue: How do we represent texture? Topics: –Texture segmentation –Texture-based matching –Texture synthesis.
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Project IST_1999_ ARTISTE – An Integrated Art Analysis and Navigation Environment Review Meeting N.1: Paris, C2RMF, November 28, 2000 Workpackage.
Content-Based Retrieval in Image Databases. References Ze-Nian Li & Mark S. Drew, Fundamentals of Multimedia, ISBN , Prentice-Hall, 2004.
Content-based Image Retrieval (CBIR)
Information Retrieval in Practice
Introduction --Classification Shape ContourRegion Structural Syntactic Graph Tree Model-driven Data-driven Perimeter Compactness Eccentricity.
Introduction to Image Processing Grass Sky Tree ? ? Review.
Spatial-based Enhancements Lecture 3 prepared by R. Lathrop 10/99 updated 10/03 ERDAS Field Guide 6th Ed. Ch 5: ;
Computer vision.
Multimedia and Time-series Data
The MPEG-7 Color Descriptors
Image and Video Retrieval INST 734 Doug Oard Module 13.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Multimedia Information Retrieval
Image Retrieval Part I (Introduction). 2 Image Understanding Functions Image indexing similarity matching image retrieval (content-based method)
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
Image Processing Edge detection Filtering: Noise suppresion.
Data Extraction using Image Similarity CIS 601 Image Processing Ajay Kumar Yadav.
Content-Based Image Retrieval Using Fuzzy Cognition Concepts Presented by Tienwei Tsai Department of Computer Science and Engineering Tatung University.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
CSC508 Convolution Operators. CSC508 Convolution Arguably the most fundamental operation of computer vision It’s a neighborhood operator –Similar to the.
Autonomous Robots Vision © Manfred Huber 2014.
Content-Based Image Retrieval QBIC Homepage The State Hermitage Museum db2www/qbicSearch.mac/qbic?selLang=English.
Visual Computing Computer Vision 2 INFO410 & INFO350 S2 2015
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
Yixin Chen and James Z. Wang The Pennsylvania State University
Query by Image and Video Content: The QBIC System M. Flickner et al. IEEE Computer Special Issue on Content-Based Retrieval Vol. 28, No. 9, September 1995.
CS Spring 2010 CS 414 – Multimedia Systems Design Lecture 4 – Audio and Digital Image Representation Klara Nahrstedt Spring 2010.
Machine Vision Edge Detection Techniques ENT 273 Lecture 6 Hema C.R.
VISUAL INFORMATION RETRIEVAL Presented by Dipti Vaidya.
Image features and properties. Image content representation The simplest representation of an image pattern is to list image pixels, one after the other.
Image Features (I) Dr. Chang Shu COMP 4900C Winter 2008.
1 Review and Summary We have covered a LOT of material, spending more time and more detail on 2D image segmentation and analysis, but hopefully giving.
1. 2 What is Digital Image Processing? The term image refers to a two-dimensional light intensity function f(x,y), where x and y denote spatial(plane)
Visual Information Retrieval
REMOTE SENSING Digital Image Processing Radiometric Enhancement Geometric Enhancement Reference: Chapters 4 and 5, Remote Sensing Digital Image Analysis.
Color-Texture Analysis for Content-Based Image Retrieval
Image Segmentation Techniques
Text Detection in Images and Video
Computer Vision Lecture 16: Texture II
Multimedia Information Retrieval
Grouping/Segmentation
Fourier Transform of Boundaries
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
Presentation transcript:

SWE 423: Multimedia Systems Chapter 4: Graphics and Images (3)

Outline Image Processing Basics Textbook Section 4.3 Image Features Image Segmentation Textbook Section 4.3 Additional Reference: Wasfi Al-Khatib, Y. Francis Day, Arif Ghafoor, and P. Bruce Berra. Semantic modeling and knowledge representation in multimedia databases. IEEE Transactions on Knowledge and Data Engineering, 11(1):64-80, 1999.

Image Processing Image processing involves the analysis of scenes or the reconstruction of models from images representing 2D or 3D objects. Image Analysis Identifying Image Properties (Image Features) Image Segmentation Image Recognition We will look at image processing from a database perspective. Objective: Design of robust image processing and recognition techniques to support semantic modeling, knowledge representation, and querying of images.

Semantic Modeling and Knowledge Representation in Image Databases Feature Extraction. Salient Object Identification. Content-Based Indexing and Retrieval. Query Formulation and Processing.

Multi-Level Abstraction Semantic Modeling And Knowledge Representation Layer Semantic Specification Knowledge Base Semantic Identification Process Object Recognition Layer Object Models Object Recognition Process Feature Extraction Layer Feature Specification Feature Extraction Process Image Data Still Video Frames Multimedia Data

Feature Extraction Layer Image features: Colors, Textures, Shapes, Edges, ...etc. Features are mapped into a multi-dimensional feature space allowing similarity-based retrieval. Features can be classified into two types: Global and Local.

Global Features Generally emphasize coarse-grained pattern matching techniques. Transform the whole image into a functional representation. Finer details within individual parts of the image are ignored. Examples: Color histograms and coherence vectors, Texture, Fast Fourier Transform, Hough Transform, and Eigenvalues. What are some of the example queries?

Color Histogram How many pixels of the image take a specific color In order to control the number of colors, the domain is discretized E.g. consider the value of the two leftmost bits in each color channel (RGB). In this case , the number of different colors is equal to __________ How can we determine whether two images are similar using the color histogram?

Color Coherence Vector Based on the color histogram Each pixel is checked as to whether it is within a sufficiently large one-color environment or not. i.e. in a region related by a path of pixels of the same color If so, the pixel is called coherent, otherwise incoherent For each color j, compute the number of coherent and incoherent pixels (j , j), j = 1, ..., J When comparing two images with color coherence vectors (j , j) and (j , j), j = 1, ..., J, we may use the expression

Texture Texture is a small surface structure Examples include Natural or artificial Regular or irregular Examples include Wood barks Knitting patterns The surface of a sponge

Texture Examples Artificial/periodic Artificial/non-periodic Photographic/pseudo-periodic Photographic/random Photographic/structured Inhomogeneous (non-texture)

Texture Two basic approaches to study texture Structural analysis searches for small basic components and an arrangement rule Statistical analysis describes the texture as a whole based on specific attributes (local gray-level variance, regularity, coarseness, orientation, and contrast. Either done in the spatial domain or the spatial frequency domain

Global Features Advantages: Disadvantages: Simple. Low computational complexity. Disadvantages: Low accuracy

Local Features Images are segmented into a collection of smaller regions, with each region representing a potential object of interest (fine-grained). An object of interest may represent a simple semantic object (e.g. a round object). Choice of features is domain specific: X-ray imaging, GIS, ...etc require spatial features (e.g. shapes [may be calculated through edges] and dimensions.) Paintings, MMR imaging, ...etc may use color features in specific regions of the image.

Edge Detection A given input image E is used to gradually compute a (zero-initialized) output image A. A convolution mask runs across E pixel by pixel and links the entries in the mask at each position that M occupies in E with the gray value of the underlying image dots. The result of the linkage (and the subsequent sum across all products from the mask entry and the gray value of the underlying image pixel) is written to the output image A.

Convolution Convolution is a simple mathematical operation which is fundamental to many common image processing operators. Convolution provides a way of `multiplying together' two arrays of numbers, generally of different sizes, but of the same dimensionality, to produce a third array of numbers of the same dimensionality. This can be used in image processing to implement operators whose output pixel values are simple linear combinations of certain input pixel values. The convolution is performed by sliding the kernel over the image, generally starting at the top left corner, so as to move the kernel through all the positions where the kernel fits entirely within the boundaries of the image.

Convolution Computation If the image E has M rows and N columns, and the kernel K has m rows and n columns, then the size of the output image A will have M - m + 1 rows, and N - n + 1 columns and is given by: Example page 60. http://homepages.inf.ed.ac.uk/rbf/HIPR2/sobel.htm

Similarity Metrics Minkowski Distance Weighted Distance Average Distance Color Histogram Intersection

Prototype Systems QBIC (http://www.hermitagemuseum.org) Uses color, shape, and texture features Allows queries by sketching features and providing color information Chabot (Cypress) Uses color and textual annotation. Improved performance due to textual annotation (Concept Query) KMeD Uses shapes and contours as features. Features are extracted automatically in some cases and manually in other cases.

Demo (Andrew Berman & Linda G. Shapiro ) http://www.cs.washington.edu/research/imagedatabase/demo/seg/ http://www.cs.washington.edu/research/imagedatabase/demo/edge/ http://www.cs.washington.edu/research/imagedatabase/demo/fids/