Distance-Reciprocal Distortion Measure for Binary Document Images IEEE Signal Processing Letters, vol.11,No.2, Feb. 2004.

Slides:



Advertisements
Similar presentations
QR Code Recognition Based On Image Processing
Advertisements

2013/12/10.  The Kendall’s tau correlation is another non- parametric correlation coefficient  Let x 1, …, x n be a sample for random variable x and.
Computational Biology, Part 23 Biological Imaging II Robert F. Murphy Copyright  1996, 1999, All rights reserved.
Spatial Filtering (Chapter 3)
Integration of sensory modalities
Outline/Coverage Terms for reference Introduction
Middle Term Exam 03/04, in class. Project It is a team work No more than 2 people for each team Define a project of your own Otherwise, I will assign.
Lecture 4 Linear Filters and Convolution
Connecting with Computer Science, 2e
5. Halftoning Newspaper photographs simulate a greyscale, despite the fact that they have been printed using only black ink. A newspaper picture is, in.
Introduction to Image Quality Assessment
Analysis of Algorithms1 Estimate the running time Estimate the memory space required. Time and space depend on the input size.
Losslessy Compression of Multimedia Data Hao Jiang Computer Science Department Sept. 25, 2007.
CS443: Digital Imaging and Multimedia Filters Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University Spring 2008 Ahmed Elgammal Dept.
Weights of Observations
Number System Review This section reviews binary numbers, hexadecimal numbers, and binary arithmetic © 2014 B. Wilkinson Modification date: Dec 29,
Connecting with Computer Science 2 Objectives Learn why numbering systems are important to understand Refresh your knowledge of powers of numbers Learn.
VEHICLE NUMBER PLATE RECOGNITION SYSTEM. Information and constraints Character recognition using moments. Character recognition using OCR. Signature.
Chapter 5 Data representation.
Modeling and Finding Abnormal Nodes (chapter 2) 駱宏毅 Hung-Yi Lo Social Network Mining Lab Seminar July 18, 2007.
Entropy and some applications in image processing Neucimar J. Leite Institute of Computing
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Machine Vision ENT 273 Image Filters Hema C.R. Lecture 5.
Effect of Mutual Coupling on the Performance of Uniformly and Non-
Research Methods in Psychology
QNT 531 Advanced Problems in Statistics and Research Methods
1 Chapter 11 Analysis of Variance Introduction 11.2 One-Factor Analysis of Variance 11.3 Two-Factor Analysis of Variance: Introduction and Parameter.
Huijuan Yang, Alex C. Kot, IEEE Fellow IEEE Transactions on Multimedia, Vol. 9, No. 3, Apr Multimedia Security Final Project R 葉容瑜 R
Fundamentals of Data Analysis Lecture 10 Management of data sets and improving the precision of measurement pt. 2.
What is Image Quality Assessment?
Texture. Texture is an innate property of all surfaces (clouds, trees, bricks, hair etc…). It refers to visual patterns of homogeneity and does not result.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
HP-PURDUE-CONFIDENTIAL Final Exam May 16th 2008 Slide No.1 Outline Motivations Analytical Model of Skew Effect and its Compensation in Banding and MTF.
Digital Image Processing CCS331 Relationships of Pixel 1.
Image Compression Fasih ur Rehman. Goal of Compression Reduce the amount of data required to represent a given quantity of information Reduce relative.
Image processing Fourth lecture Image Restoration Image Restoration: Image restoration methods are used to improve the appearance of an image.
Digital Image Processing Lecture 3: Image Display & Enhancement March 2, 2005 Prof. Charlene Tsai.
1 Nonparametric Statistical Techniques Chapter 17.
REECH ME: Regional Energy Efficient Cluster Heads based on Maximum Energy Routing Protocol Prepared by: Arslan Haider. 1.
Just Noticeable Difference Estimation For Images with Structural Uncertainty WU Jinjian Xidian University.
1 8. One Function of Two Random Variables Given two random variables X and Y and a function g(x,y), we form a new random variable Z as Given the joint.
Data Representation The storage of Text Numbers Graphics.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
An Improved Method Of Content Based Image Watermarking Arvind Kumar Parthasarathy and Subhash Kak 黃阡廷 2008/12/3.
CS654: Digital Image Analysis
Number Systems by Dr. Amin Danial Asham. References  Programmable Controllers- Theory and Implementation, 2nd Edition, L.A. Bryan and E.A. Bryan.
Computer simulation Sep. 9, QUIZ 2 Determine whether the following experiments have discrete or continuous out comes A fair die is tossed and the.
Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different.
Measurement and Scaling: Fundamentals and Comparative Scaling.
Spatial Smoothing and Multiple Comparisons Correction for Dummies Alexa Morcom, Matthew Brett Acknowledgements.
Analysis of Experimental Data; Introduction
1 Marco Carli VPQM /01/2007 ON BETWEEN-COEFFICIENT CONTRAST MASKING OF DCT BASIS FUNCTIONS Nikolay Ponomarenko (*), Flavia Silvestri(**), Karen.
Course 5 Edge Detection. Image Features: local, meaningful, detectable parts of an image. edge corner texture … Edges: Edges points, or simply edges,
ESTIMATING WEIGHT Course: Special Topics in Remote Sensing & GIS Mirza Muhammad Waqar Contact: EXT:2257 RG712.
1 Nonparametric Statistical Techniques Chapter 18.
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
Spatial Filtering (Chapter 3) CS474/674 - Prof. Bebis.
April 21, 2016Introduction to Artificial Intelligence Lecture 22: Computer Vision II 1 Canny Edge Detector The Canny edge detector is a good approximation.
Virtual University of Pakistan
Linear Algebra Review.
Analytic Hierarchy Process (AHP)
CS100: Discrete structures
Pei Qi ECE at UW-Madison
Presenter by : Mourad RAHALI
Ch2: Data Representation
Descriptive Analysis and Presentation of Bivariate Data
Chapter 2 Data Representation.
Hippocampal “Time Cells”: Time versus Path Integration
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Distance-Reciprocal Distortion Measure for Binary Document Images IEEE Signal Processing Letters, vol.11,No.2, Feb. 2004

2 I. Introduction It is inevitable to introduce visible artifacts when a Binary Image has been watermark embedded or data hided. Therefore, an effective visual distortion measure is a must for performance comparison or evaluation of such an application.

3 Subjective measure: –costly but important since a human is the ultimate viewer! Objective measure: –repeatable and easier to implement. Objective measure does not always agree with the subjective one  Objective measure based on Human Visual Perception Model is the preferred one.

4 The peak signal-to-noise ratio (PSNR) is a popular distortion measure used in image and video processing. (1) Where M and N are the dimensions of the image, and P is the maximum peak-to- peak signal swing, e.g., P=255 for 8-bit image Processing f(x,y) g(x,y)

5 For Binary Images, PSNR does not match well with subjective assessment, since it is a point-based measurement, and mutual relations between pixels are not taken into account

6 Natural ImagesBinary Images levelsContinuous multiple-tune Two level objectsPhysical objects Invented symbols and/or signals (such as documents)

7 The perception of distortion in (document) binary images is very different from that in natural images In a particular “Language”, such as English, people know very well what a certain alphabetic character should look like.  –(i) Distortion in document images could be more obtrusive than distortion in natural images –(ii) The distortion measures proposed for color/gray- level images are not often applicable to binary images

8 II. Distance Reciprocal Distance Measure A number of single-letter images are used to study distortion in binary document images. Each single-letter image is converted from a letter typed in Microsoft Word with a font size of 10 or 12, including both uppercase and lowercase, using Adobe Acrobat 5.0 with a resolution of 150 dots per inch (dpi).

9 Observation For a binary document image, the distance between two pixels play a major role in their mutual interference perceived by human eyes. Since readers are so familiar with alphabetic characters that even a single- pixel distortion can be perceived easily.  the main factor in distortion perception is focusing, i.e., whether the distortion is in a viewer’s focus.

10 The distortion (flipping) of one pixel is more visible when the field of view of the pixel is in focus. The nearer the two pixels are, the more sensitive it is to change one pixel when focusing on the other. Further, from a magnified viewing, each pixel is essentially a black or white square. Therefore, a diagonal neighbor pixel is considered to be further away from a pixel in focus than a horizontal or vertical neighbor one. Hence, diagonal neighbors have less effect on a center pixel in focus than horizontal or vertical neighbors.

11 The distortion of a processed image g(x,y) compared with the original image f(x,y) is measured by using a weighted matrix with each of its weights determined by the reciprocal of a distance measured from the center pixel.  Distance-Reciprocal Distortion Measure. Assume the weight matrix W m is of size m x m, where m = 2n+1 and. The center element of this matrix is at where

12 (2) The normalized weight matrix is defined as (3)

13

14 Suppose that there are S flipped pixels in g(x,y). Each pixel will have a distance- reciprocal distortion DRD k, k=1,2,…,S. For the k-th flipped pixel at (x,y) k in the output image g(x,y), the resulted distortion is calculated from an m x m block B k in f(x,y) that is centered at (x,y) k. The Distortion DRD k measured for this flipped pixel is given by (4)

15 Where the (i,j)-th element of the different matrix D k is given by (5) Thus, DRD k equals to the weighted sum of the pixels in the block B k of the original image that differ from the flipped pixel in the processed image.

16 The pixel does not contribute directly to DRD k,since its weight is always zero. For the possibly flipped pixels near the image edge or corner, where an m x m neighborhood may not exist, it is possible to expand the rest of the m x m neighborhood with the same value as which is equivalent to just ignoring the rest of the neighbors.

17 After walks over all the S flipped pixel positions, we sum the distortion as seen from each flipped pixel visited to get the distortion in g(x,y) as (6)

18 Where NUBN is to estimate the valid (nonempty) area in the image and it is defined as the number of non-uniform (not all black or white pixels) 8x8 blocks in f(x,y) The total pixel numbers M x N is not used in the denominator because uniform areas (e.g., all white pixel blocks) are common in binary document images and they may have a significant effect on the distortion value if used.

19 III. Experimental Results The original image is of size 198x109 and there are 122 (≈39%) non-uniform 8x8 blocks out of total 312 blocks

20 The design criteria for generating independent test images: –the number of flipped pixels is the same in each test image, test image generated through this distortion generator should have a wide variety in term of “how noticeable the flipping is” There are 1763 black pixels in Fig.2. We flip 40 pixels in the original image with some randomness to generate the test images with various amount of visual distortion.

21 1) The positions of all 1763 black pixels are recorded in a 2x1763 matrix. 2) Forty black pixels out of 1763 are randomly chosen using a random number generator with uniform distribution. 3) For each black pixel, one pixel is flipped in its neighboring area. As shown in Fig. 3, the pixel to be flipped is randomly selected from the Band 1 pixel (the black pixel itself), or eight Band 2 pixels, or 16 Band 3 pixels, with probability of p1, p2, and p3, respectively, and p1+p2+p3=1. For the Band 2 and Band 3 pixels, one neighbor is randomly chosen among the band pixels.

22 4) A total of test images are generated in the experiment by running the generator 10,000 times for each of the six sets of p1, p2, and p3, with p3=0, 0.2, 0.4, 0.6, 0.18 and 1, p1=(1- p3)/10, and p2=9 ·p1. 5) The images generated with the number of flipped pixels less than 40 are ignored. That is, the cases where at least one pixel is flipped more than once are dropped.

23

24 Since all the test images are generated from the same original image and they have the same number of flipped pixels, they have the same PSNR of dB according to (1). One set of the test images generated is shown in Fig.4.

25

26 Next, we divided all the test images generated into four groups according to the DRD values computed, with group 1 having smallest values, group 4 having largest values, and so on. The subjective assessment is done by 60 observers. Each observer is given the original image and four sets of test images, which are printed on a piece of 80 GSM quality paper using a HP LaserJet 4100 printer.

27 Each set of test images consists of four test images randomly chosen from the four groups. The observers are asked to rank the visual quality of the four images in each set according to the visual distortion that he or she perceives when he or she views the images at a comfortable distance under normal indoor lighting conditions in labs. A smaller ranking score indicates less distortion. There are four rankings (1, 2, 3, and 4) with score 1 for the least distortion and 4 for the most distortion perceived.

28 The ranking scores collected from the 60 observers are analyzed and compared with the rankings according to the average DRD values (with m=5 ), as shown in Table II. Although the PSNR is the same for all the test images, the DRD values obtained are different for different distorted images and their average values for the 4 groups have a normalized correlation of with the mean subjective rankings, indicating a very good match between our objective measure and the subjective evaluation.

29

30 The distribution of the subjective ranking scores for each group is shown in a subfigure in Fig.5. In each subfigure, the abscissa represents 4 ranking scores (1, 2, 3, and 4), and the ordinate shows the counts of the corresponding ranking scores given by the 60 human evaluators. Since each of the 60 observers is given 4 sets of test images, there are 240 scores in total for each group.

31