Massey University Image Resolution Improvement from Multiple Images Donald Bailey Institute of Information Sciences and Technology Massey University Palmerston.

Slides:



Advertisements
Similar presentations
Digital Image Processing
Advertisements

Applications of one-class classification
Bayesian Belief Propagation
A Crash Course in Radio Astronomy and Interferometry: 4
Shapelets Correlated with Surface Normals Produce Surfaces Peter Kovesi School of Computer Science & Software Engineering The University of Western Australia.
CSCE 643 Computer Vision: Template Matching, Image Pyramids and Denoising Jinxiang Chai.
3-D Computational Vision CSc Image Processing II - Fourier Transform.
QR Code Recognition Based On Image Processing
Object Specific Compressed Sensing by minimizing a weighted L2-norm A. Mahalanobis.
Wavelets Fast Multiresolution Image Querying Jacobs et.al. SIGGRAPH95.
11/11/02 IDR Workshop Dealing With Location Uncertainty in Images Hasan F. Ates Princeton University 11/11/02.
Topic 6 - Image Filtering - I DIGITAL IMAGE PROCESSING Course 3624 Department of Physics and Astronomy Professor Bob Warwick.
Transforming images to images
Digital Signal Processing
EE4H, M.Sc Computer Vision Dr. Mike Spann
Unit 9 IIR Filter Design 1. Introduction The ideal filter Constant gain of at least unity in the pass band Constant gain of zero in the stop band The.
Computer Vision Optical Flow
Active Calibration of Cameras: Theory and Implementation Anup Basu Sung Huh CPSC 643 Individual Presentation II March 4 th,
Motion Tracking. Image Processing and Computer Vision: 82 Introduction Finding how objects have moved in an image sequence Movement in space Movement.
CSCE 641 Computer Graphics: Image Sampling and Reconstruction Jinxiang Chai.
CSCE 641 Computer Graphics: Image Sampling and Reconstruction Jinxiang Chai.
Digital Image Processing Chapter 5: Image Restoration.
Object Detection and Tracking Mike Knowles 11 th January 2005
Image Enhancement.
Signal Analysis and Processing for SmartPET D. Scraggs, A. Boston, H Boston, R Cooper, A Mather, G Turk University of Liverpool C. Hall, I. Lazarus Daresbury.
Scale-Invariant Feature Transform (SIFT) Jinxiang Chai.
Spatio-Temporal Quincunx Sub-Sampling.. and how we get there David Lyon.
Digital to Analogue Conversion Natural signals tend to be analogue Need to convert to digital.
Chapter 4: Sampling of Continuous-Time Signals
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5409 T-R 10:30am – 11:50am.
Topic 7 - Fourier Transforms DIGITAL IMAGE PROCESSING Course 3624 Department of Physics and Astronomy Professor Bob Warwick.
Lecture 1 Signals in the Time and Frequency Domains
Image Preprocessing: Geometric Correction Image Preprocessing: Geometric Correction Jensen, 2003 John R. Jensen Department of Geography University of South.
Robust global motion estimation and novel updating strategy for sprite generation IET Image Processing, Mar H.K. Cheung and W.C. Siu The Hong Kong.
EDGE DETECTION IN COMPUTER VISION SYSTEMS PRESENTATION BY : ATUL CHOPRA JUNE EE-6358 COMPUTER VISION UNIVERSITY OF TEXAS AT ARLINGTON.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Digital Image Processing CSC331 Image Enhancement 1.
Lecture 7: Sampling Review of 2D Fourier Theory We view f(x,y) as a linear combination of complex exponentials that represent plane waves. F(u,v) describes.
Digital Imaging. Digital image - definition Image = “a two-dimensional function, f(x,y), where x and y are spatial coordinates, and the amplitude of f.
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
Digital Image Processing Lecture 10: Image Restoration March 28, 2005 Prof. Charlene Tsai.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm.
23 November Md. Tanvir Al Amin (Presenter) Anupam Bhattacharjee Department of Computer Science and Engineering,
Digital Image Processing Lecture 10: Image Restoration
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
Segmentation of Vehicles in Traffic Video Tun-Yu Chiang Wilson Lau.
2D Sampling Goal: Represent a 2D function by a finite set of points.
Spatial Smoothing and Multiple Comparisons Correction for Dummies Alexa Morcom, Matthew Brett Acknowledgements.
2D Fourier Transform.
Machine Vision Edge Detection Techniques ENT 273 Lecture 6 Hema C.R.
Projects Project 1a due this Friday Project 1b will go out on Friday to be done in pairs start looking for a partner now.
RECONSTRUCTION OF MULTI- SPECTRAL IMAGES USING MAP Gaurav.
- photometric aspects of image formation gray level images
CS 591 S1 – Computational Audio
… Sampling … … Filtering … … Reconstruction …
Spectral Analysis Spectral analysis is concerned with the determination of the energy or power spectrum of a continuous-time signal It is assumed that.
Motion Detection And Analysis
Image Analysis Image Restoration.
General Functions A non-periodic function can be represented as a sum of sin’s and cos’s of (possibly) all frequencies: F() is the spectrum of the function.
Fundamentals Data.
CAP 5415 Computer Vision Fall 2012 Dr. Mubarak Shah Lecture-5
CSCE 643 Computer Vision: Thinking in Frequency
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
Image Processing Today’s readings For Monday
Distributed Ray Tracing
Filtering Part 2: Image Sampling
Digital Image Processing
Apparent Subdiffusion Inherent to Single Particle Tracking
Resampling.
Review and Importance CS 111.
Presentation transcript:

Massey University Image Resolution Improvement from Multiple Images Donald Bailey Institute of Information Sciences and Technology Massey University Palmerston North NEW ZEALAND

Overview Describe the resolution improvement process Describe the results of my investigations Investigations in 1 dimensions –Super-resolution of bar codes Registration in 2 dimensions –Comparison of registration methods –Detailed description of predictive interpolation

Description of the Problem Given a set of related independent low resolution images, combine these together to construct a single high resolution image –output has more detail than any of the input images

Resolution Improvement Resolution limited by number of pixels –Resolution depends on sampling density An ensemble of images: –Each image provides separate samples –Potentially higher sampling density Reconstruction steps: –Register images –Resample ensemble –Inverse filter

Sampling Requirements Images must be sub-sampled If sample rate is greater than Nyquist rate –Can reconstruct the image at any desired resolution –A single image contains all information –Can only improve the signal to noise ratio If sample rate is less than Nyquist rate –Each individual image is aliased –Cannot obtain higher real resolution from single image –Resampling the ensemble untangles the aliased information

One Dimension Example “Super-Resolution of Bar Codes”, D.G. Bailey, Journal of Electronic Imaging, 10 (1), pp ( January 2001). Problem: How can we read this bar code?

Information content of UPC Bar Codes 12 digits, each 7 units wide Guard bands Total width is 95 units Each bar or space is 1-4 units wide Broadband frequency spectrum Centre of main lobe contains required data 6 digits guard bands

Super-resolution procedure A tilted 2D bar code image provides an ensemble of independently sampled 1D images Register low-resolution images –gives relationship between individual images Resample the ensemble at a higher rate –creates a high resolution image Remove system effects –reduces the sampling blur and effect of camera electronics

Registration Determines the offset between rows Phase shift in frequency domain proportional to linear offset and spatial frequency Procedure: –Fourier Transform each row, keep phase –Subtract phase of first row from each row –Unwrap phase image –Discard higher frequencies –Least squares fit to calculate offset per row Input image Phase image

Resampling By interleaving samples from different rows increase the sample rate. Original image samples New image samples Selected sample rows

Resampling By interleaving samples from different rows increase the sample rate. Original image samples New image samples Selected sample rows

Resampling Increase sample rate by an integer multiple of original sample rate Select rows with offsets nearest the desired sample positions 4 x sample rate

Practical limitations From synthetic image From actual image 0 Spatial frequency Amplitude  0 /2  0  0  0 /2 0 Spatial frequency Amplitude  0 /2  0  0  0 /2

Practical limitations Real bar code limitations –Ink smearing means bar and space widths not exact –Smears the envelope in the frequency domain Image capture degradations Object Camera angle Lens system Sensor Digital Image Video signal Video frame grabber Camera electronics

Practical limitations Image distortions –Perspective distortion from camera angle –Lens distortion Lens point spread function –Spatially variant low pass filter Image sensor –area sampling - low pass filter –aliasing Camera electronics –low pass filter, perhaps with high frequency emphasis Frame grabber –sampling (more aliasing)

Removing system effects Aliasing is not a problem –it is actually necessary for higher resolution reconstruction –resampling the ensemble untangles the aliased information Main effects are the low pass filter characteristics –lens point spread function –area sampling in sensor –smoothing filter camera electronics –anti-alias filter in frame grabber

Removing system effects 0 Spatial frequency Amplitude  0 /2  0  0  0 /2 Assume no distortion, and no spatial variation in the low pass filter characteristic Estimate the system response by comparing synthetic and actual reconstructed images Remove using an inverse linear filter

Results Resampled ensemble System response removed Straightened and averaged Thresholded Original image

Bar Code Conclusions A two-dimensional image of a bar code tilted slightly provides an ensemble of related one-dimensional images The low resolution images must be aliased It is necessary to compensate for limitations in the image capture system Analogue video cameras make more complex –Image sampled twice –Additional analogue filters Modest gains in resolution are achievable

Extending to 2 Dimensions Problem is considerably more complex Require multiple 2 dimensional images –Captured at different times –Motion is a limitation Registration more complex –Bar code images all had constant offset per row –In 2D every image is independent, with 2D offset Resampling more complex –Need more images for same improvement 4 images to improve resolution by 2

Registration in 2 Dimensions Requirements –Accurate sub-pixel offset between images –Work directly on low resolution images –Insensitive to aliasing –Tolerates a low level of noise –Does not rely on particular objects in the image –For practical use, must be fast Conventional approach to sub-pixel registration –Interpolate images to chosen high resolution Increased data volume slows this method down –Perform pixel accuracy registration Requires a search

Sub-Pixel Registration Methods Phase based methods –Similar to 1D case, but extended to 2D Determine fit surface on integer grid –Interpolate this to find optimum fit –Correlation methods –Difference methods Predictive interpolation –A new method that turns problem around Other approaches –Rely on locating objects or edges within the image

Phase Methods An offset in the image domain corresponds to a phase shift in the frequency domain Procedure –Window the image and reference –FFT and keep the phases –Unwrap the phase difference –Weighted least squares fit of a plane to the phase

Correlation - Pixel Accuracy Multiply an offset image by a reference Accumulate product in the overlap region Normalise by the average pixel value –Prevents bias if the image has a gradient Frequency domain correlation

Correlation - Sub-pixel Accuracy i Correlation peak c(i) i pk i 0 i 1 i  1 c  1 c 1 c 0 Perform pixel level correlation first Interpolate to find peak to sub-pixel accuracy Expect correlation peak to be a pyramid –Only strictly true with regions of uniform value –Approximately true if there are step edges Width of pyramid is twice smallest feature width –Only local information should be used

Can be shown to be related to correlation Subtract an offset image from a reference Accumulate difference in the overlap region Minimum gives offset to nearest pixel For sub-pixel accuracy –Expect minimum to be an inverted pyramid (locally) –Interpolate to find minimum using previous method Difference Methods

Other Registration Methods Centre of gravity –Segment objects from background –Centre of gravity of objects to sub-pixel accuracy Line fitting –Detect lines or edges –Fit a line or curve to detected points Requires knowledge of contents of image Accuracy limited by size of object / edge and accuracy of segmentation / detection

Predictive Interpolation Turns the problem around –Predicts the pixel values as a function of those in a reference image –Uses the bilinear interpolation equation as a linear predictor –Subject to the constraint: –Then offset is:

Predictive Interpolation Procedure –Requires the image to be pre-registered to nearest pixel Can use a search, or hierarchical registration to do this –Determine coefficients A xx that minimise the error Uses weighted least squares Weight each point with standard deviation of its 4 references –From coefficients, get offset directly Properties –Very fast - single pass if registered to nearest pixel –Very good accuracy - typically better than 5% of a pixel

Evaluating the Registration Requires a set of images with known offsets –Start with a single high resolution image –Simulate capturing with a lower resolution camera by filtering and subsambling –Add random Gaussian noise to each low resolution image

Evaluation Procedure Measure offset between each pair of images –36 offset measurements for 9 images Enforce consistency between measurements –Only 8 independent offsets –Will reduce errors up to Error is difference between expected offset and measured offset –Average to give the RMS registration error

Results Structured image No noise: –All worked well Noise sensitivity: –Phase method only few frequencies –Predictive method weighting gives only few measurement A

Results Low detail image No noise –Phase is best –Others adequate Noise sensitivity: –Predictive method sensitive to noise B

Results Medium detail image No noise –Phase and Predictive significantly better Noise sensitivity: –Correlation noise insensitive –Difference noise insensitive C

Results Low detail text No noise –Phase and predictive significantly better Noise sensitivity: –Predictive method improves with low noise D

Results High detail text –Significant aliasing No noise –Correlation and Difference are significantly poorer –Phase and Predictive give excellent results E

Summary of Results Correlation and Difference methods –Similar results –Performance deteriorates with increasing detail –Relatively insensitive to noise Phase and Predictive methods –Best overall with % pixel accuracy Includes enforcing consistency Predictive method –Results almost independent of image –Improved with addition of small amounts of noise

Reconstruction Low resolution Resampled Inverse filtered

Conclusions of Comparison Correlation and difference methods –Insensitive to noise –Perform poorly in presence of high detail –Pyramidal interpolation method breaks down Phase and predictive methods –More sensitive to noise –Suitable for registration for resolution improvement –Predictive method less expensive than phase

Detailed Properties of Predictive Method Two components to the errors –Systematic component –Random component A B C D E A B C D E

Effect of Match Window Size Systematic bias –No significant change above 20 x 20 pixels –Bias is therefore inherent in the method used Random component –Depends significantly on window size for small windows –When random component has approximately same magnitude as the bias, it stops improving –No further improvement above about 100 x 100 pixels –Above this, the error has distinct double peak –Random component is therefore limited by systematic bias

Summary of Predictive Registration Very fast –Requires only a single pass through the image –(2 passes if not already registered to nearest pixel) Accuracy is 4 - 5% of pixel –This is between pairs of images –May be improved by enforcing consistency Optimum match window size is about 100 x 100 pixels Robust to moderate levels of noise Relatively insensitive to aliasing Suitable for images captured in identical conditions –Sensitive to contrast and brightness

Overall Summary Described resolution improvement –Reconstruction steps: registration, resampling, inverse filtering –Examined some of the preconditions: aliasing –Discussed some of the limitations: lens and camera blurring Presented results from 1D –Super-resolution of bar code images Presented results from 2D registration –Comparison of registration methods –Description of a new fast method Predictive Interpolation method

References “Super-resolution of bar-codes”, D.G. Bailey, Journal of Electronic Imaging, vol 10 (1), pp (2001) “Predictive Interpolation for Registration”, D.G. Bailey, Proceedings of Image and Vision Computing Conference NZ, pp (November 2000) “Image Registration Methods for Resolution Improvement”, D.G. Bailey and T.H. Lill, Proceedings of Image and Vision Computing NZ, pp (August 1999) “Super-Resolution of Bar Codes”, D.G. Bailey, SPIE Proceedings, Vol 3521 Machine Vision Systems for Inspection and Metrology VII, Boston, pp (November 1998)