Debarati Kundu and Brian L. Evans The University of Texas at Austin

Slides:



Advertisements
Similar presentations
Image Registration  Mapping of Evolution. Registration Goals Assume the correspondences are known Find such f() and g() such that the images are best.
Advertisements

CSCE 643 Computer Vision: Template Matching, Image Pyramids and Denoising Jinxiang Chai.
Evaluation of Reconstruction Techniques
Visualization and graphics research group CIPIC May 25, 2004Realistic Image Synthesis1 Tone Mapping Presented by Lok Hwa.
(JEG) HDR Project Boulder meeting January 2014 Phil Corriveau-Patrick Le Callet- Manish Narwaria.
Patch-based Image Deconvolution via Joint Modeling of Sparse Priors Chao Jia and Brian L. Evans The University of Texas at Austin 12 Sep
Computer graphics & visualization HDRI. computer graphics & visualization Image Synthesis – WS 07/08 Dr. Jens Krüger – Computer Graphics and Visualization.
H. R. Sheikh, A. C. Bovik, “Image Information and Visual Quality,” IEEE Trans. Image Process., vol. 15, no. 2, pp , Feb Lab for Image and.
黃文中 Preview 2 3 The Saliency Map is a topographically arranged map that represents visual saliency of a corresponding visual scene. 4.
Guillaume Lavoué Mohamed Chaker Larabi Libor Vasa Université de Lyon
VQEG Rennes meeting june 2012
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Introduction to Image Quality Assessment
Object Detection and Tracking Mike Knowles 11 th January 2005
1 Blind Image Quality Assessment Based on Machine Learning 陈 欣
P ERCEPTUAL E VALUATION OF M ULTI -E XPOSURE I MAGE F USION A LGORITHMS Kai Zeng, Kede Ma, Rania Hassen and Zhou Wang Department of Electrical and Computer.
Perceived video quality measurement Muhammad Saqib Ilyas CS 584 Spring 2005.
Introduction of Saliency Map
1.  Introduction  Gaussian and Laplacian pyramid  Application Salient region detection Edge-aware image processing  Conclusion  Reference 2.
Machine Vision ENT 273 Image Filters Hema C.R. Lecture 5.
1 Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur, , India
Introduction to Visible Watermarking IPR Course: TA Lecture 2002/12/18 NTU CSIE R105.
بسمه تعالی IQA Image Quality Assessment. Introduction Goal : develop quantitative measures that can automatically predict perceived image quality. 1-can.
(JEG) HDR Project: update from IRCCyN July 2014 Patrick Le Callet-Manish Narwaria.
ALIGNMENT OF 3D ARTICULATE SHAPES. Articulated registration Input: Two or more 3d point clouds (possibly with connectivity information) of an articulated.
What is Image Quality Assessment?
Texture. Texture is an innate property of all surfaces (clouds, trees, bricks, hair etc…). It refers to visual patterns of homogeneity and does not result.
1 Security and Robustness Enhancement for Image Data Hiding Authors: Ning Liu, Palak Amin, and K. P. Subbalakshmi, Senior Member, IEEE IEEE TRANSACTIONS.
黃文中 Introduction The Model Results Conclusion 2.
R. Ray and K. Chen, department of Computer Science engineering  Abstract The proposed approach is a distortion-specific blind image quality assessment.
MDDSP Literature Survey Presentation Eric Heinen
Robert Edge  Dr. James Walker  Mathematics  University of Wisconsin-Eau Claire Universal Image Quality PSNR IQI BEST IQI BOATASWDRJPEGASWDRJPEG 1:
Digital Image Processing Definition: Computer-based manipulation and interpretation of digital images.
Digital Image Processing Lecture 10: Image Restoration March 28, 2005 Prof. Charlene Tsai.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
Digital Image Processing Lecture 10: Image Restoration
Just Noticeable Difference Estimation For Images with Structural Uncertainty WU Jinjian Xidian University.
Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,
Department of computer science and engineering Evaluation of Two Principal Image Quality Assessment Models Martin Čadík, Pavel Slavík Czech Technical University.
NO-REFERENCE SYNTHETIC IMAGE QUALITY ASSESSMENT USING SCENE STATISTICS Debarati Kundu and Brian L. Evans Embedded Signal Processing Laboratory (ESPL) Department.
Spatio-temporal saliency model to predict eye movements in video free viewing Gipsa-lab, Grenoble Département Images et Signal CNRS, UMR 5216 S. Marat,
A Low-Complexity Universal Architecture for Distributed Rate-Constrained Nonparametric Statistical Learning in Sensor Networks Avon Loy Fernandes, Maxim.
COMPARATIVE STUDY OF HEVC and H.264 INTRA FRAME CODING AND JPEG2000 BY Under the Guidance of Harshdeep Brahmasury Jain Dr. K. R. RAO ID MS Electrical.
1 Marco Carli VPQM /01/2007 ON BETWEEN-COEFFICIENT CONTRAST MASKING OF DCT BASIS FUNCTIONS Nikolay Ponomarenko (*), Flavia Silvestri(**), Karen.
Performance Measurement of Image Processing Algorithms By Dr. Rajeev Srivastava ITBHU, Varanasi.
(JEG) HDR & WCG? Project: September 2015 Patrick Le Callet.
Objective Quality Assessment Metrics for Video Codecs - Sridhar Godavarthy.
IMAGE PROCESSING is the use of computer algorithms to perform image process on digital images   It is used for filtering the image and editing the digital.
No-reference Image Quality Assessment for High Dynamic Range Images
Structure Similarity Index
PERFORMANCE ANALYSIS OF VISUALLY LOSSLESS IMAGE COMPRESSION
CPSC 6040 Computer Graphics Images
Digital Image Processing Lecture 10: Image Restoration
Lossy Compression of Stochastic Halftones with JBIG2
IIS for Image Processing
Nikolay Ponomarenkoa, Vladimir Lukina, Oleg I. Ieremeieva,
Detecting Artifacts and Textures in Wavelet Coded Images
Pei Qi ECE at UW-Madison
Image Segmentation Techniques
Tuning JPEG2000 Image Compression for Graphics Regions
A Review in Quality Measures for Halftoned Images
Presented by: Yang Yu Spatiotemporal GMM for Background Subtraction with Superpixel Hierarchy Mingliang Chen, Xing Wei, Qingxiong.
Digital Visual Effects, Spring 2006 Yung-Yu Chuang 2006/3/8
Digital television systems (DTS)
Image and Video Processing
Reduction of blocking artifacts in DCT-coded images
A Block Based MAP Segmentation for Image Compression
Gradient Domain Salience-preserving Color-to-gray Conversion
Review and Importance CS 111.
Lark Kwon Choi, Alan Conrad Bovik
Presentation transcript:

Debarati Kundu and Brian L. Evans The University of Texas at Austin September 26, 2016 Visual Attention Guided Quality Assessment for Tone Mapped Images Using Scene Statistics Debarati Kundu and Brian L. Evans The University of Texas at Austin

Introduction Scene luminance varying from 10-4 to 106 cd/m2 [Narwaria2013] High dynamic range (HDR) images preserve more detail HDR picture capture (e.g. smart phones and DSLR cameras) HDR video displays for home (e.g. Samsung) HDR streaming content (e.g. Amazon Video and Netflix) HDR graphics rendering (e.g. Unreal and CryEngine)

Tonemapping Operators [Larson1997] Uniformly spaced quantization of luminance overexposes the view through the window World luminance values for a window office in candelas per meter squared Luminance mapped to preserve visibility of both indoor & outdoor features using non-linear tonemapping

Tonemapping from HDR to SDR Different tonemapping operators produce different SDR images Estimate radiance map by merging pixels from different exposures Tonemap floating point irradiance map to SDR Registered SDR exposure stack Creating tonemapped images Reference image – floating-point HDR image is reference and tonemapped SDR image is test/processed image Propose three image quality assessment (IQA) algorithms Evaluate HDR radiance map and tonemapped SDR image

Tone Mapped Quality Index [Yeganeh2013] Overall Tone Mapped Quality Index a = 0.8012, g = 0.3046 and d = 0.7088 Structural fidelity (S) of HDR and tonemapped SDR image Structural similarity with penalty for large change in signal strength Pooling: Average modified SSIM on 11 x 11 windows Combine structural fidelity at each of five scales Naturalness (N) of tonemapped SDR image Compute its global mean m and global standard deviation d Pm and Pd are fits for global means & standard deviations for 3000 SDR natural images Tonemapping meant to change local intensity and contrast Pm and Pd are obtained from statistically analyzing a corpus 3000 SDR images. Pm is a fit to the global means of all of the images Pd is a fit to the global standard deviations of all of the images Luminance and contrast are statistically independent Pm(m) argument m is the mean for a test image Pd(d) argument d is the global deviation for contrast More Info

Change #1: Naturalness Measure Natural scene statistics (NSS) approach for IQA Statistics of pristine images occur irrespective of content Statistics of images with distortions deviate from scene statistics Mean subtracted contrast normalized pixels for image At pixel (i, j), use 11x11 window and uniform Gaussian filter (s = 1.17) is weighted mean is weighted standard deviation MSCN models divisive normalization in retina [Ruderman1993] In our work, we propose a different model of NSS to be used with the structural similarity term. MSCN coefficients have been used widely to predict the perceptual quality of SDR images inflicted with distortions like blur, Gaussian noise, interpolation, compression etc. 11 x 11 windows. Weights are uniformly Gaussian (std. dev. 7/6 = 1.17) from BRISQUE. Results don’t change.

Tonemappings of Same Scene MSCN coefficient distribution and σ-field distribution for different tonemapping operators

Proposed Naturalness Measure TMQI combines structural fidelity (S) and naturalness (N) Proposed naturalness measure based on scene statistics β: Exponent of generalized Gaussian fit of MSCN pixels of tonemapped SDR image ϕ: Standard deviation of σ-field of tonemapped SDR image a = 0.8012, g = 0.3046 and d1 = d2 = d = 0.7088 (same as in TMQI) Used in all three proposed IQA algorithms For well-lit natural images, the empirical distribution of the MSCN coefficients is approximately Gaussian. Hence, the shape and scale parameters of the GGD fit indicates the degree of naturalness of the image. Sensitivity parameters: delta1 = delta2 = delta, and gamma and a from Zhou Wang’s work

Change #2: Pooling Approach Average pooling gives same importance to every pixel Information Maximization TMQI [Nasrinpour2015] Propose non-uniform pooling strategies using scene statistics σ-map gives measures of edge magnitude and high contrast regions Local entropy indicates local randomness (contrast) Itti and Koch's saliency approach generalized for HDR images [Petit2009] Proved to be very useful for IQA in SDR images. Not used before for IQA of tonemapped images. Structural fidelity map – modified SSIM between floating-point HDR and the tonemapped image Structural fidelity map – take SSIM map multiplied by delta-map Tonemapped image Structural fidelity map Structural fidelity with pooling

TMQI Database [Yeganeh2013] 15 HDR source images, each mapped to SDR w/ 8 tonemaps Subjects ranked 8 SDR images for every HDR source image Correlated predicted and subjective ranks of tonemapped images Median of correlation computations shown below Full Reference IQA Algorithm SROCC PLCC KCC Time (s) Proposed TMQI-NSS-s pooling 0.8810 0.9439 0.7857 0.32 Proposed TMQI-NSS-Entropy pooling 0.9438 0.7143 1.28 Proposed SHDR-TMQI pooling from [Petit2009] 0.9346 0.80 FSITM-TMQI [Nafchi2014] 0.8571 0.9230 0.94 STMQI [Nasrinpour2015] 0.8503 0.9382 0.7638 1.54 TMQI-II [Ma2015] 0.8333 0.8790 0.20 Feature Similarity Index for Tone-Mapped Images (FSITM) [Nafchi2014] 0.8948 0.47 TMQI [Yeganeh2013] 0.8095 0.9082 0.6429 0.52 Execution time (in seconds) for each algorithm on a Linux desktop having 12 GB RAM, Intel Xeon CPU, 3.33 GHz clock

HDR-JPEG Database [Narwaria2013] 10 source HDR images, each has 14 degraded versions JPEG encoding at 7 different bit rates SSIM and MSE used to design HDR->SDR and SDR->HDR mappings 27 subjects rated individual HDR images on HDR displays on 1-5 scale Full Reference IQA Algorithm SROCC PLCC KCC Time (s) Proposed SHDR-TMQI pooling from [Petit2009] 0.8510 0.8533 0.6700 3.00 Proposed TMQI-NSS-s pooling 0.8485 0.8520 0.6659 1.65 Proposed TMQI-NSS-Entropy pooling 0.8454 0.8645 0.6719 6.74 TMQI [Yeganeh2013] 0.7947 0.8057 0.6127 3.45 FSITM-TMQI [Nafchi2014] 0.6300 0.6584 0.4762 8.35 TMQI-II [Ma2015] 0.5096 0.5137 0.3642 1.34 Feature Similarity Index for Tone-Mapped Images (FSITM) [Nafchi2014] 0.4720 0.5167 0.3422 5.26 STMQI [Nasrinpour2015] 0.3464 0.3244 0.2449 12.00 We show that in addition to just tonemapping artifacts, we also do well on tonemapped + JPEG compressed images. Execution time (in seconds) for each algorithm on a Linux desktop having 12 GB RAM, Intel Xeon CPU, 3.33 GHz clock.

Conclusion More Recent Work Perceptually-guided pooling boosts correlation with human subjective ratings vs. average pooling Pooling using σ-map has good correlation vs. runtime tradeoff Software: http://signal.ece.utexas.edu/~bevans/HDRImaging/ More Recent Work ESPL-LIVE HDR Image Database of 1800+ HDR pictures http://signal.ece.utexas.edu/~debarati/ESPL_LIVE_HDR_Database Crowdsourced study with 5000 observers and 300,000 opinion scores Proposed and evaluated no-reference IQA algorithms for HDR images Joint effort with D. Ghadiyaram and A. C. Bovik, UT Austin

References [Larson1997] G. W. Larson, H. Rushmeier, and C. Piatko. “A Visibility Matching Tone Reproduction Operator for High Dynamic Range Scenes”, IEEE Trans. on Visualization and Computer Graphics 3, 4, Oct. 1997, pp. 291-306. [Ma2015] Kede Ma; Yeganeh, H.; Kai Zeng; Zhou Wang, "High Dynamic Range Image Compression by Optimizing Tone Mapped Image Quality Index,” IEEE Trans. on Image Processing, vol.24, no.10, pp.3086-3097, Oct. 2015 [Nafchi2014] H. Ziaei Nafchi, A. Shahkolaei, R. Farrahi Moghaddam, and M. Cheriet, “Fsitm: A feature similarity index for tone-mapped images,” IEEE Signal Processing Letters, vol. 22, no. 8, pp. 1026-1029, Aug. 2015. [Narwaria2013] M. Narwaria, M. Perreira Da Silva, P. Le Callet, and R. Pepion, “Tone mapping-based high-dynamic-range image compression: study of optimization criterion and perceptual quality,” Optical Engineering, vol. 52, no. 10, Oct 2013. [Nasrinpour2015] 1 H. R. Nasrinpour and N. D. Bruce, “Saliency weighted quality assessment of tone-mapped images,” Proc. IEEE Int. Conf. Image Proc., Sep. 2015. [Petit2009] J. Petit, R. Brémond, and J.-P. Tarel, “Saliency maps of high dynamic range images,” Proc. ACM Symp. Appl. Perception in Graphics & Visualization, 2009. [Ruderman1993] D. L. Ruderman and W. Bialek, “Statistics of natural images: Scaling in the woods,” Proc. Neural Info. Processing Sys. Conf. and Workshops, 1993. [Yeganeh2013] H. Yeganeh and Z. Wang, “Objective quality assessment of tone-mapped images,” IEEE Trans. on Image Processing , vol. 22, no. 2, pp. 657-667, Feb 2013.

Questions?

Multi-Exposure Fusion HDR but in SDR format Merge exposure stack directly to get fused image ith pixel index kth exposure image Xk(i) luminance Wk(i) weight for perceptual importance of exposure level k SDR Registered exposure stack of K images Standard dynamic range (SDR) images Requires camera calibration and motion compensation

Distorted Image Statistics Different distortions affect scene statistics characteristically Used for distortion classification and blind quality prediction Replot MSCN Coefficients Steerable Pyramid Wavelet Coefficients Curvelet Coefficients Back

Tone Mapped Quality Index [Yeganeh2013] Tonemapping meant to change local intensity & contrast Structural fidelity modifies Structural Similarity (SSIM) Penalizes large change in strength in HDR vs. SDR image patch Local standard deviations nonlinearly mapped via Gaussian CDF Significant signal strength mapped to 1 Insignificant signal strength mapped to 0 Structural fidelity computation over five scales Naturalness measure of tonemapped SDR image Distribution of global means in 3000 natural images Distribution of global standard deviations in 3000 natural images Back

Itti and Koch’s Saliency Back Different scales Implemented as Gaussian Pyramid Center Surround mechanism Implemented with DoG LPF repeated over multiple scales 3 scales, 4 orientations used

Generalized Gaussian Density GGD includes the special cases = 1 (Laplacian density) = 2 (Gaussian density) = (uniform density) Many authors observe GGD behavior of bandpass image signals Wavelet coefficients DCT coefficients Usually reported that b » 1 but varies (0.8 < b < 1.4) [A. C. Bovik, EE381V Digital Video, UT Austin, Spring 2015]

Calculating Correlations Spearman’s Rank-Order Correlation Coefficient (SRCC) di is difference between ith image’s ranks is subjective and objective evaluations N is number of rankings Kendall’s correlation coefficient (KCC) Nc and Nd are the number of concordant (of consistent rank order) and discordant (of inconsistent rank order) pairs in the data set respectively Pearson’s Linear Correlation Coefficient (PLCC) Back