Entropy and some applications in image processing Neucimar J. Leite Institute of Computing
Outline Introduction –Intuitive understanding Entropy as global information Entropy as local information –edge detection, texture analysis Entropy as minimization/maximization constraints –global thresholding –deconvolution problem
Information Entropy (Shannon´s entropy) An information theory concept closely related to the following question: - What is the minimum amount of data needed to represent an information content? For images (compression problems): - How few data are sufficient to completely describe an images without (much) loss of information?
Intuitive understanding: - relates the amount of uncertainty about an event with a given probability distribution Event: randomly draw out a ball high uncertainty low uncertainty no uncertainty entropy = max min (uncertainty)
Example 1: Event: a coin flipping = { heads, tails } Probability: P(heads) = P(tails) = 1/2 0 heads 1 tails self-information: inversely related to the probability of E Self-information: - Units of information used to represent an event E
Example 2: amount of conveyed information of event E Entropy : average information
coding the balls (3 bits/ball) Entropy: = 3bits/ball Degree of information compression: equal length binary code for independent data:
H= -( 5/8 log 2 (5/8) + 1/8 log 2 (1/8) + 1/8 log 2 (1/8) + 1/8 log 2 (1/8) ) = 1.54 H = -8log 2 1 = 0 medium uncertainty: no uncertainty: code
H= -( 5/8 log 2 (5/8) + 1/8 log 2 (1/8) + 1/8 log 2 (1/8) + 1/8 log 2 (1/8) ) = 1.54 medium uncertainty: code bits/ball > 1.54 bit/ball code redundancy !!! and We need an encoding method for eliminating this code redundancy 22%
BallProbabilityReduction 1Reduction 2 red black blue green 5/8 1/8 The Huffman encoding:
BallProbabilityReduction 1Reduction 2 red black blue green 5/8 1/8 5/8 2/8 1/8
BallProbabilityReduction 1Reduction 2 red black blue green 5/8 1/8 5/8 2/8 1/8 5/8 3/8
BallProbabilityReduction 1Reduction 2 red black blue green 5/8 1/8 5/8 2/8 1/8 5/8 3/8 (1) (0)
BallProbabilityReduction 1Reduction 2 red black blue green 5/8 1/8 5/8 2/8 1/8 5/8 3/8 (1) (0) (01) (00) (1)
BallProbabilityReduction 1Reduction 2 red black blue green 5/8 1/8 5/8 2/8 1/8 5/8 3/8 (1) (0) (01) (00) (1) (011) (010) (1) (00) variable length code red black blue green ball and (18,6%)
Entropy: x bit image: After Huffman encoding: bits/pixel Variable length coding does not take advantage of the high images pixel-to-pixel correlation: a pixel can be predicted from the values of its neighbors more redundancy lower entropy (bits/pixel)
Entropy: 7.45 After Huffman encoding: Entropy: 7.35 After Huffman encoding:
Coding the interpixel difference highlighting redundancies: Entropy: 4.73 instead of 7.45 After Huffman encoding: Entropy: 5.97 instead of 7.35 After Huffman encoding: instead of 1.07 instead of 1.08
Entropy as a local information: the edge detection example
Edge detection examples:
Entropy-based edge detection Low entropy values low frequencies uniform image regions High entropy values high frequencies image edges
Binary entropy function: Entropy H p 1.0
Entropy H p 1.0
Entropy H p 1.0
Entropy H p 1.0
Entropy H p 1.0
Entropy H p 1.0
Binary entropy function: Isotropic edge detection
H in a 3x3 neighborhood:
5x5 neighborhood:
7x7 neighborhood:
9x9 neighborhood:
Texture Analysis Similarity grouping based on brightness, colors, slopes, sizes etc The perceived patterns of lightness, directionality, coarseness, regularity, etc can be used to describe and segment an image
Texture description: statistical approach Characterizes textures as smooth, coarse, periodic, etc - Based on the intensity histogram prob. density function Descriptors examples: z i = random variable denoting gray levels p(z i ) = the intensity histogram in a region Mean: a measure of average intensity
Other moments of different orders: - e.g., standard deviation: a measure of average contrast Entropy: a measure of randomness
TextureAverage IntensityAverage contrastEntropy smooth coarse periodic smooth coarse periodic
Descriptors and segmentation: ?
Gray-level co-occurrence matrix: Haralick´s descriptors Conveys information about the positions of pixels having similar gray level values d= M d (a,b)
For the descriptor H: large empty spaces in M little information content cluttered areas large information content M d = the probability that a pixel with gray level i will have a pixel with level j a distance of d pixels away in a given direction d = 2, horizontal direction
Obviously, more complex texture analysis based on statistical descriptors should consider combination of information related to image scale, moments, contrast, homogeneity, directionality, etc
Entropy as minimization/maximization constraints
Global thresholding examples: mean histogram peaks
For images with levels 0-255: The probability that a given pixel will have value less than or equal t is: Now considering: Class A: Class B:
The optimal threshold is the value of t that maximizes where
Examples:
Entropy as a fuzziness measure In fuzzy set theory an element x belongs to a set S with a certain probability p x defined by a membership function p x (x) Example of a membership function for a given threshold t: p x (x) gives the degree to which x belongs to the object or background with gray-level average and, respectively.
How can the degree of fuzziness be measured? Example: t = 0 for a a binary image fuzziness = 0
Using the Shannon´s function (for two classes): the entropy of an entire fuzzy set of dimension MxN is and for segmentation purpose, the threshold t is such that E(t) is minimum t minimizes fuzziness
Segmentation examples
Maximum Entropy Restoration: the deconvolution problem
The image degradation model: f(x,y) h(x,y) + noise degraded image g(x,y)
The restoration problem: Given g, h, and we can find an estimate such that the residual Since there may exist many functions such that the above constraint is satisfied, we can consider the maximization entropy as an additional constraint for “optimum” restoration originaldegradedrestored
Wiener Lucy-Richardson Entropy h Degraded Other restoration methods:
Conclusions The entropy information has been extensively used in various image processing applications. Other examples concern distortion prediction, images evaluation, registration, multiscale analysis, high-level feature extraction and classification, etc