Image Priors and Optimization Methods for Low-level Vision Dilip Krishnan TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.:

Slides:



Advertisements
Similar presentations
Removing blur due to camera shake from images. William T. Freeman Joint work with Rob Fergus, Anat Levin, Yair Weiss, Fredo Durand, Aaron Hertzman, Sam.
Advertisements

Optimizing and Learning for Super-resolution
Object Specific Compressed Sensing by minimizing a weighted L2-norm A. Mahalanobis.
Various Regularization Methods in Computer Vision Min-Gyu Park Computer Vision Lab. School of Information and Communications GIST.
L1 sparse reconstruction of sharp point set surfaces
Lecture 2: Convolution and edge detection CS4670: Computer Vision Noah Snavely From Sandlot ScienceSandlot Science.
Digital Photography with Flash and No-Flash Image Pairs By: Georg PetschniggManeesh Agrawala Hugues HoppeRichard Szeliski Michael CohenKentaro Toyama,
Patch-based Image Deconvolution via Joint Modeling of Sparse Priors Chao Jia and Brian L. Evans The University of Texas at Austin 12 Sep
Approaches for Retinex and Their Relations Yu Du March 14, 2002.
Wangmeng Zuo, Deyu Meng, Lei Zhang, Xiangchu Feng, David Zhang
Shaojie Zhuo, Dong Guo, Terence Sim School of Computing, National University of Singapore CVPR2010 Reporter: 周 澄 (A.J.) 01/16/2011 Key words: image deblur,
IMAGE UPSAMPLING VIA IMPOSED EDGE STATISTICS Raanan Fattal. ACM Siggraph 2007 Presenter: 이성호.
Unnatural L 0 Representation for Natural Image Deblurring Speaker: Wei-Sheng Lai Date: 2013/04/26.
1 Removing Camera Shake from a Single Photograph Rob Fergus, Barun Singh, Aaron Hertzmann, Sam T. Roweis and William T. Freeman ACM SIGGRAPH 2006, Boston,
Rob Fergus Courant Institute of Mathematical Sciences New York University A Variational Approach to Blind Image Deconvolution.
Personal Photo Enhancement using Example Images Neel Joshi Wojciech Matusik, Edward H. Adelson, and David J. Kriegman Microsoft Research, Disney Research,

ECE 472/572 - Digital Image Processing Lecture 8 - Image Restoration – Linear, Position-Invariant Degradations 10/10/11.
Image Denoising via Learned Dictionaries and Sparse Representations
New Results in Image Processing based on Sparse and Redundant Representations Michael Elad The Computer Science Department The Technion – Israel Institute.
SUSAN: structure-preserving noise reduction EE264: Image Processing Final Presentation by Luke Johnson 6/7/2007.
Texture Reading: Chapter 9 (skip 9.4) Key issue: How do we represent texture? Topics: –Texture segmentation –Texture-based matching –Texture synthesis.
Image Deblurring with Optimizations Qi Shan Leo Jiaya Jia Aseem Agarwala University of Washington The Chinese University of Hong Kong Adobe Systems, Inc.
Lecture 2: Image filtering
Understanding and evaluating blind deconvolution algorithms
Dark Flash Photography Dilip Krishnan Rob Fergus Dept. of Computer Science Courant Institute, New York University.
Retinex by Two Bilateral Filters Michael Elad The CS Department The Technion – Israel Institute of technology Haifa 32000, Israel Scale-Space 2005 The.
Advanced Image Processing Image Relaxation – Restoration and Feature Extraction 02/02/10.
Fast Bilateral Filtering
ELE 488 F06 ELE 488 Fall 2006 Image Processing and Transmission ( ) Wiener Filtering Derivation Comments Re-sampling and Re-sizing 1D  2D 10/5/06.
Introduction to Image Processing Grass Sky Tree ? ? Review.
High dynamic range imaging. Camera pipeline 12 bits8 bits.
© by Yu Hen Hu 1 ECE533 Digital Image Processing Image Restoration.
Computer Vision - Restoration Hanyang University Jong-Il Park.
Chapter 7 Case Study 1: Image Deconvolution. Different Types of Image Blur Defocus blur --- Depth of field effects Scene motion --- Objects in the scene.
EE369C Final Project: Accelerated Flip Angle Sequences Jan 9, 2012 Jason Su.
Selective Block Minimization for Faster Convergence of Limited Memory Large-scale Linear Models Kai-Wei Chang and Dan Roth Experiment Settings Block Minimization.
Sparse Matrix Factorizations for Hyperspectral Unmixing John Wright Visual Computing Group Microsoft Research Asia Sept. 30, 2010 TexPoint fonts used in.
Yu-Wing Tai, Hao Du, Michael S. Brown, Stephen Lin CVPR’08 (Longer Version in Revision at IEEE Trans PAMI) Google Search: Video Deblurring Spatially Varying.
Image Processing Edge detection Filtering: Noise suppresion.
Image Processing Jitendra Malik. Different kinds of images Radiance images, where a pixel value corresponds to the radiance from some point in the scene.
Why is computer vision difficult?
Digital Image Processing Lecture 10: Image Restoration March 28, 2005 Prof. Charlene Tsai.
Image Restoration.
Image deconvolution, denoising and compression T.E. Gureyev and Ya.I.Nesterets
Learning to Perceive Transparency from the Statistics of Natural Scenes Anat Levin School of Computer Science and Engineering The Hebrew University of.
Image Denoising Using Wavelets
Digital Image Processing Lecture 10: Image Restoration
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
Vincent DeVito Computer Systems Lab The goal of my project is to take an image input, artificially blur it using a known blur kernel, then.
1 Markov random field: A brief introduction (2) Tzu-Cheng Jen Institute of Electronics, NCTU
Biointelligence Laboratory, Seoul National University
Image Decomposition, Inpainting, and Impulse Noise Removal by Sparse & Redundant Representations Michael Elad The Computer Science Department The Technion.
1Ellen L. Walker 3D Vision Why? The world is 3D Not all useful information is readily available in 2D Why so hard? “Inverse problem”: one image = many.
Vincent DeVito Computer Systems Lab The goal of my project is to take an image input, artificially blur it using a known blur kernel, then.
ICCV 2007 National Laboratory of Pattern Recognition Institute of Automation Chinese Academy of Sciences Half Quadratic Analysis for Mean Shift: with Extension.
Using Neumann Series to Solve Inverse Problems in Imaging Christopher Kumar Anand.
Jianchao Yang, John Wright, Thomas Huang, Yi Ma CVPR 2008 Image Super-Resolution as Sparse Representation of Raw Image Patches.
RECONSTRUCTION OF MULTI- SPECTRAL IMAGES USING MAP Gaurav.
Noise Filtering in Monte Carlo Rendering
Digital Image Processing Lecture 10: Image Restoration
Image Deblurring and noise reduction in python
A Neural Approach to Blind Motion Deblurring
Deconvolution , , Computational Photography
Digital 2D Image Basic Masaki Hayashi
Fast image deconvolution using Hyper-Laplacian Prior
Optimal sparse representations in general overcomplete bases
Deblurring Shaken and Partially Saturated Images
Lecture 7 Patch based methods: nonlocal means, BM3D, K- SVD, data-driven (tight) frame.
Presentation transcript:

Image Priors and Optimization Methods for Low-level Vision Dilip Krishnan TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A AA A A A

Introduction We consider underconstrained image processing problems such as denoising, deblurring and image inpainting. Also called Linear Inverse Problems. To overcome ill-posed nature and to introduce robustness to noise, image priors (regularization) are needed.

Introduction Canonical problem for prior-based methods: There are 2 issues one seeks to address: Develop better priors: lead to higher quality results (visual and/or SNR). Develop fast numerical algorithms to solve the resulting problems (to handle large image sizes) fast and accurately.

Existing Approaches to Denoising Prior-based Methods: Nonlinear Total Variation based Noise Removal Algorithms, Rudin, Osher, Fatemi, Multiscale (wavelet) Decomposition Methods: Image Denoising using Scale Mixtures of Gaussians in the Wavelet Domain, Portilla, Strela, Wainwright, Simoncelli, 2003.

Existing Approaches to Denoising Dictionary-based Methods: Learning Multiscale Sparse Representations for Image and Video Restoration, Mairal, Shapiro, Elad, Filtering Methods: A Fast Approximation of the Bilateral Filter using a Signal Processing Approach, Paris, Durand, Extension: called non-local means which provides state-of-the art results.

Existing Approaches to Non-Blind Deblurring In non-blind deblurring, we assume that the blurring kernel (blur filter) is known. Wiener Filter: Classical method; tries to find an ‘inverse blur filter’ based on noise characteristics – requires knowledge of unknown signal and noise. Richardson-Lucy: Simple iterative method – causes ringing.

Existing Approaches to Non-Blind Deblurring Prior-based methods: l_1: Linearized Bregman Methods for Frame-Based Image Deblurring, Cai, Osher, Shen, Total Variation: A Nonlinear Primal-Dual Method for Total Variation-Based Image Restoration, Chan, Golub, Mulet, 1999: inspired by earlier idea of Conn and Overton on joint primal- dual minimization for minimizing sum of Euclidean norms. Sub l_1 on gradients: Fast Image Deconvolution using Hyper-Laplacian Priors, Krishnan, Fergus 2009.

Existing Approaches to Blind Deconvolution Blurring kernel is unknown. Difficult problem as blurring kernel and image are all unknown, and there is noise in the image. Prior-based methods: l_1:, Blind Motion Deblurring from a single image using Sparse Approximation Priors, Cai et. al Gradient Based: Removing Camera Shake from a Single Photograph, Fergus et. al Sub l_1 on gradients: Blind Image Deblurring using Image Statistics, Levin, 2006.

Spatially Varying Deconvolution Most generalized (and therefore, difficult) problem where the blur kernel is spatially varying. Potentially, there could be a different blur kernel at every point in the image. Only gradient-based priors used currently.

Commonly Used Priors and Related Assumptions prior : Image to be recovered is sparse under some transformation (e.g. wavelet); problem formulation: : inverse wavelet transform; : blur kernel; Other possibilities are DFT, curvelets etc.

Commonly Used Priors and Related Assumptions Total Variation: Gradient of image is sparse i.e. image is piecewise smooth (many flat areas, sharp transitions): Content aware prior: Cho et. al. 2010: Spatially varying prior that depends on image content:

Commonly Used Priors and Related Assumptions Sparsity under learned filters: Natural Image Statistics and Efficient Coding, Olshausen and Field Filters are learnt using a neural network; and sparsity is imposed on the coefficients of the image convolved with these filters.

Commonly Used Priors and Related Assumptions Examples of such learned filters (from the paper):

Comments on Priors These priors work reasonably well for denoising; and other approaches are also available for denoising problem. Not so well for deblurring or blind deconvolution. Cost of prior is higher for unblurred images than for blurry images. This motivates the search for new priors.

Optimization Techniques for We consider commonly used techniques for solving problems of the type: Problems arise in deblurring, denoising and compressed sensing. Large size of images makes efficient techniques a necessity. Even an average sized 1000 x 1000 image means ~1 million unknowns. Second order methods such as interior point are infeasible for large number of unknowns.

Optimization Techniques for Reference: L1-L2 Optimization in Signal and Image Processing, Zibulevsky and Elad, Wide variety of techniques are used: mostly first- order or iterative methods. Iterative Shrinkage Methods (ISTA) are of the type: S is a component-wise shrinkage operation. Derived from a quadratic approximation to the original problem.

Optimization Techniques for This basic shrinkage was recently accelerated using Nesterov methods: A Method for Solving the Convex Programming Problem with Convergence Rate O(1/k^2), Y. E. Nesterov, Resulting algorithm is called FISTA:

Optimization Techniques for Bregman-type methods: Developed recently by Osher and fellow-researchers at UCLA. Motivation: use a different approximation of the objective function based on Bregman distance. Interestingly, the resulting algorithm actually ends up very similar to Iterative Shrinkage Methods:

Optimization Techniques for Now consider TV-regularized problems: Previous methods for can be extended by using splitting techniques. Introduce dummy variables and make a new problem: Then solve alternately for x,w. Problem in x is quadratic – often, can be solved very fast (using FFT’s). Problem in w is of type. The Split Bregman method for L1 Regularized Problems, Goldstein et. al., 2009.

Optimization Techniques for Problems of the type: D(x) is identity or gradient operator. Reweighted least squares can be used: slow, but accurate: Iteratively Re-weighted Least Squares Minimization for Sparse Recovery, Daubechies et. al., Splitting methods can also be used in conjuction with look-up tables to solve the seperable shrinkage problem: very effective in practice.

Previous Work

Overview Dark Flash Photography: SIGGRAPH Enables dazzle-free photography in low-light conditions. Fast Image Deconvolution using Hyper Laplacian Priors: NIPS Fast scheme to solve deconvolution/denoising problems that use nonconvex priors.

Overview Dark Flash Photography: SIGGRAPH Enables dazzle-free photography in low-light conditions. Fast Image Deconvolution using Hyper Laplacian Priors: NIPS Fast scheme to solve deconvolution/denoising problems that use nonconvex priors.

Our Camera & Dark Flash Dark Flash Emits Ultraviolet (UV) and Infrared (IR) light just outside visible wavelength range

Dark Flash Photography Dark flash is ~200 times dimmer than conventional Ground truthReconstruction2. Ambient image1. Dark Flash image

Key Challenges 1.How to add light to the scene without it being perceived by people. 2.How to obtain an image with correct colors.

Human Retina Spectral Sensitivity (Vos 1978)

Normal Camera Spectral Sensitivity Spectral response curves for an unmodified camera

Modify standard camera: 1. Remove IR-block sensor filter 2. Add lens filter to remove IR >800nm  Some sensitivity in UV & IR just outside visible Dark Flash Camera Spectral Sensitivity IR

Comparison with Human Sensitivity Camera’s response is significantly broader Possible to add light without being easily perceived Human Response Curve IR

Dark Flash Emission Spectrum Camera Spectral Sensitivity Dark Flash Emission

Key Challenges 1.How to add light to the scene without it being perceived by people. 2.How to obtain an image with correct colors.

Two Images: Five Spectral Bands In Dark Flash image: – “Blue” channel records UV – “Red” channel records IR Assumptions 1.Little ambient UV and IR light 2.UV/IR flash dominates ambient visible light 2. Ambient RGB 1. Dark Flash IR UV

Reconstruction of Red Channel Ambient RedGround Truth Red Intensities Compare to ground truth reference Ambient red has correct intensities but is noisy

Intensity Constraint Ambient RedOutput Red Intensities Gradients ?? ≈

Spectral Constraint Gradients in Red and IR are similar at most pixels But differ when material changes Ground Truth Gradients in Red Gradients in IR Need to allow occasional disagreement between gradients

Spectral Constraint (Also for UV flash) Gradients of Output Red Gradients in IR ? Minimize:

Final Cost Function Sum over all pixels. = 0.7. is IR channel; is UV. 3 cost functions – one for each channel. and are weighting terms. - mask to handle specularities/highlights UV Spectral Term IR Spectral Term Likelihood Sparsity

Implies that spectral reflectances are the same in UV/IR and visible NOT TRUE Effect of Varying α in Spectral Term Non-convex optimization Models actual distribution Later part of talk focuses on fast solution scheme. Convex optimization Fast: ~2 mins in Matlab for 1.4 megapixel image Can be easily be implemented on GPU Best results Acceptable results Unacceptable results

Results

Ambient : 1/20 th secReconstructionLong exposure: 4 sec

1/64 th 1/90 th 1/256 th Reconstruction Ambient (Fraction of Normal Illumination)

Limitations Rudimentary handling of shadows and highlights – Currently use simple mask (as in existing flash/no-flash work) Need some ambient illumination – Won’t work in complete darkness Color drift & loss of detail at very high noise levels Lack of UV/IR edges causes reconstruction to suffer Taking Photos in Low Light Levels

Overview of Talk Dark Flash Photography: SIGGRAPH Enables dazzle-free photography in low-light conditions. Fast Image Deconvolution using Hyper Laplacian Priors: NIPS Fast scheme to solve deconvolution problems that use nonconvex priors.

Hyper-Laplacian Priors Good models of gradient distributions in natural images. Heavy tails are well modeled. Used in deblurring, denoising, super-resolution etc. Drawback: cause problems to become non-convex.

Non-blind Deconvolution We consider the standard non-blind deconvolution problem: Likelihood Prior For >= 1, problem is convex and a global minimum can be found. We consider the solution of this problem when < 1. We want, given blurry/noise image and kernel. are gradient filters.

Existing Methods Iterative Reweighted Least Squares/Conjugate Gradients Very slow for large image sizes. IRLS takes ~ 1 hour for a 1 megapixel image. E.g.: Levin et. al. SIGGRAPH For = 1, Bregman iterations/iterative shrinkage techniques work well. Second-order methods are infeasible.

Our Algorithm We use a splitting method. Introduce a new auxiliary variable for each derivative filter. Old cost function: New cost function:

Our Algorithm We use a splitting method. Introduce a new auxiliary variable for each derivative filter. Old cost function: New cost function: Split

Our Algorithm As, the solution to this problem approaches that of the original problem. Now we can use an alternating mechanism to first minimize for, and then minimize for each. Coupled with a continuation method on.

x sub-problem is kept fixed from previous iteration. sub-problem is quadratic: Can be solved using FFTs (assuming circular boundary conditions). Need to solve: For 2 derivative filters, we require only 3 FFT’s at every iteration.

w sub-problem sub-problem is component-wise separable; is fixed from previous iteration. For a fixed pixel and filter, it reduces to: Simplified: Can be solved very accurately using Lookup Tables. For the problem can be solved analytically.

Overall Algorithm A : Start with a small value of. Alternate between and sub-problems till convergence. Increase the value of by a small amount. GOTO A. Achieves over 2 orders of magnitude speedup over IRLS with similar SNR quality. Timing for de-blurring with a 13 x 13 kernel: Image SizeIRLSOurs 1024 x x >

Original Blurred SNR=7.31 Ours SNR=18.96 t=1.2 IRLS SNR=19.05 t=483.9

Dark Flash Result Long Exposure Ambient 2.5 Lux IRLS ~ 1 hour Ours ~ 1 minute

Summary and Drawbacks Fast method for nonconvex nonblind deconvolution Orders of magnitude faster than IRLS. Can be used for other applications. – E.g. super-resolution, dark flash, denoising. FFTs and LUTs are well suited for hardware implementation. Circular boundary conditions cause boundary artefacts – negligible as resolution increases.