Superresolution of Texts from Nonideal Video

Slides:

Advertisements

Similar presentations

Bayesian Belief Propagation

Advertisements

The fundamental matrix F

INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, ICT '09. TAREK OUNI WALID AYEDI MOHAMED ABID NATIONAL ENGINEERING SCHOOL OF SFAX New Low Complexity.

Patch-based Image Deconvolution via Joint Modeling of Sparse Priors Chao Jia and Brian L. Evans The University of Texas at Austin 12 Sep

Computer Vision Optical Flow

X From Video - Seminar By Randa Khayr Eli Shechtman, Yaron Caspi & Michal Irani.

A Multicamera Setup for Generating Stereo Panoramic Video Tzavidas, S., Katsaggelos, A.K. Multimedia, IEEE Transactions on Volume: 7, Issue:5 Publication.

Light Field Compression Using 2-D Warping and Block Matching Shinjini Kundu Anand Kamat Tarcar EE398A Final Project 1 EE398A - Compression of Light Fields.

Volkan Cevher, Marco F. Duarte, and Richard G. Baraniuk European Signal Processing Conference 2008.

Boundary matting for view synthesis Samuel W. Hasinoff Sing Bing Kang Richard Szeliski Computer Vision and Image Understanding 103 (2006) 22–32.

EE465: Introduction to Digital Image Processing

Robust Super-Resolution Presented By: Sina Farsiu.

Textual Information Access for the Visually Impaired Ramani Duraiswami.

Very Low Resolution Face Recognition Problem

Probabilistic video stabilization using Kalman filtering and mosaicking.

Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.

Milanfar et al. EE Dept, UCSC 1 “Locally Adaptive Patch-based Image and Video Restoration” Session I: Today (Mon) 10:30 – 1:00 Session II: Wed Same Time,

ON THE IMPROVEMENT OF IMAGE REGISTRATION FOR HIGH ACCURACY SUPER-RESOLUTION Michalis Vrigkas, Christophoros Nikou, Lisimachos P. Kondi University of Ioannina.

Static Image Mosaicing

CCU VISION LABORATORY Object Speed Measurements Using Motion Blurred Images 林惠勇中正大學電機系

Preprocessing to enhance recognition performance in the presence of: -Illumination variations -Pose/expression/scale variations - Resolution enhancement.

Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.

EE465: Introduction to Digital Image Processing Copyright Xin Li 1 Introduction What is image segmentation?  Technically speaking, image segmentation.

Xinqiao LiuRate constrained conditional replenishment1 Rate-Constrained Conditional Replenishment with Adaptive Change Detection Xinqiao Liu December 8,

Authors: Wagner, Waagen and Cassabaum Presented By: Mukul Apte

Super-Resolution Dr. Yossi Rubner

What Does the Scene Look Like From a Scene Point? Donald Tanguay August 7, 2002 M. Irani, T. Hassner, and P. Anandan ECCV 2002.

Jitter Camera: High Resolution Video from a Low Resolution Detector Moshe Ben-Ezra, Assaf Zomet and Shree K. Nayar IEEE CVPR Conference June 2004, Washington.

1 Remote Sensing Laboratory Dept. of Information Engineering and Computer Science University of Trento Via Sommarive, 14, I Povo, Trento, Italy 2.

Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.

The Brightness Constraint

Tour Guide Image Compression Image Manipulation Image Analysis Image Acquisition Image Perception Image Display Image Generation D.I.P. Theme Park.

Expression-invariant Face Recognition using Geodesic Distance Isometries Kerry Widder A Review of ‘Robust expression-invariant face recognition from partially.

Robust global motion estimation and novel updating strategy for sprite generation IET Image Processing, Mar H.K. Cheung and W.C. Siu The Hong Kong.

Scientific Writing Abstract Writing. Why ? Most important part of the paper Number of Readers ! Make people read your work. Sell your work. Make your.

Image stitching Digital Visual Effects Yung-Yu Chuang with slides by Richard Szeliski, Steve Seitz, Matthew Brown and Vaclav Hlavac.

Yu-Wing Tai, Hao Du, Michael S. Brown, Stephen Lin CVPR’08 (Longer Version in Revision at IEEE Trans PAMI) Google Search: Video Deblurring Spatially Varying.

1 Digital Image Processing Dr. Saad M. Saad Darwish Associate Prof. of computer science.

Fast Direct Super-Resolution by Simple Functions

Motion Deblurring Using Hybrid Imaging Moshe Ben-Ezra and Shree K. Nayar Columbia University IEEE CVPR Conference June 2003, Madison, USA.

Mitsubishi Electric Research Labs (MERL) Super-Res from Single Motion Blur PhotoAgrawal & Raskar Amit Agrawal and Ramesh Raskar Mitsubishi Electric Research.

Esmaeil Faramarzi, Member, IEEE, Dinesh Rajan, Senior Member, IEEE, and Marc P. Christensen, Senior Member, IEEE Unified Blind Method for Multi-Image Super-Resolution.

Image Restoration Chapter 5.

CS654: Digital Image Analysis Lecture 22: Image Restoration - II.

Xianwu Ling Russell Keanini Harish Cherukuri Department of Mechanical Engineering University of North Carolina at Charlotte Presented at the 2003 IPES.

Vehicle Segmentation and Tracking From a Low-Angle Off-Axis Camera Neeraj K. Kanhere Committee members Dr. Stanley Birchfield Dr. Robert Schalkoff Dr.

Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.

Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.

Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.

Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.

Digital Image Processing, 3rd ed. © 1992–2008 R. C. Gonzalez & R. E. Woods Gonzalez & Woods Chapter 2 Digital Image Fundamentals.

Journal of Visual Communication and Image Representation

Whiteboard Scanning Using Super-resolution Wode Ni Advisor: John MacCormick COMP 491 Final Presentation Dec

SuperResolution (SR): “Classical” SR (model-based) Linear interpolation (with post-processing) Edge-directed interpolation (simple idea) Example-based.

Comparison of Image Registration Methods David Grimm Joseph Handfield Mahnaz Mohammadi Yushan Zhu March 18, 2004.

Speaker Min-Koo Kang March 26, 2013 Depth Enhancement Technique by Sensor Fusion: MRF-based approach.

Jianchao Yang, John Wright, Thomas Huang, Yi Ma CVPR 2008 Image Super-Resolution as Sparse Representation of Raw Image Patches.

Super-Resolution for Images and Video Ryan Prendergast and Prof. Truong Nguyen Video Processing Group University of California at San Diego

1. 2 What is Digital Image Processing? The term image refers to a two-dimensional light intensity function f(x,y), where x and y denote spatial(plane)

Super-resolution MRI Using Finite Rate of Innovation Curves Greg Ongie*, Mathews Jacob Computational Biomedical Imaging Group (CBIG) University of Iowa.

MAIN PROJECT IMAGE FUSION USING MATLAB

Compressive Coded Aperture Video Reconstruction

Guillaume-Alexandre Bilodeau

Khallefi Leïla © esa Supervisors: J. L. Vazquez M. Küppers

COSC579: Image Align, Mosaic, Stitch

Super-resolution Image Reconstruction

Vehicle Segmentation and Tracking in the Presence of Occlusions

The Brightness Constraint

Toward Drone Privacy via Regulating Altitude and Payload

Presentation transcript:

Superresolution of Texts from Nonideal Video Xin Li Lane Dept. of CSEE West Virginia University Morgantown, WV 26506-6109 This work is partially supported by NASA WV EPSCoR Award 2005-2006

Outline Introduction SR of texts from nonideal video Conclusions What is SR? Why SR? How to achieve SR? A general framework for SR: registration + restoration Understand the boundary of formulating SR as an inverse problem SR of texts from nonideal video Problem statement: why texts and nonideal video? Analyze error accumulation in multiframe registration Address the issue of quality/PSF consistency in restoration Experimental Results Conclusions

Image Resolution W H    Gonzalez “Digital Image Processing” Chip size  Field-Of-View: HW Pixel size  Sampling Distance 

Why Higher Resolution? Improved objective fidelity Natural scene is seldom band-limited Higher resolution implies smaller representation errors Improved subjective quality Attention enhances spatial resolution Spatial resolution enhances attention? Improved measuration/recognition Law enforcement, forensics/biometrics: face recognition grand challenge (FRGC), iris recognition, vehicle license plate recognition

Towards Gigapixel: Artistic Approach Mega-pel Giga-pel Photographers and artists have manually or semi-automatically stitched hundreds of mega-pel pictures together to demonstrate how a giga-pel picture looks like  the power of pixels http://triton.tpd.tno.nl/gigazoom/Delft2.htm

Scientific Solutions Sensor-based Computational (Super-resolution) Reduce pixel size: limit – 0.40m2 for a 0.35 m CMOS process Increase chip size: ineffective due to increased capacitance (bad for speeding up a charge transfer rate) Computational (Super-resolution) Exploit the tradeoff between space and time: obtain a HR from multiple LR copies Physical principles of imaging plays the fundamental role in defining the relationship between LR and HR Hybrid: the convergence of the camera and the computer Computational cameras: catadioptric camera, jitter camera (Ben-Ezra, Zomet and Nayar)

SR: A General Framework S.C. Park et al., “Super-resolution image reconstruction: a technical overview”, IEEE Signal Processing Magazine, pp. 21-36, May 2003 SR can be formulated as an inverse problem, assuming a mathematical model linking LR to HR images is known

SR: At the Intersection of SP and CV Registration problem Translational models Subpixel accuracy phase correlation (Foroosh, Zerubia and Berthod’1996) Subspace methods in the frequency domain (Vandewallea, Sbaiza, S̈usstrunka and Vetterli) Projective models or planar homography (Capel and Zisserman’2003) Images of a planar surface under arbitrary camera motion or images of a scene under fixed camera Restoration problem Model-based: regularized deblurring, robust SR (Farsiu, Elad and Milanfar’2004) Learning-based: exemplar-based SR (Freeman, Jones and Pasztor’2002), video epitome (Cheung, Frey and Jojic’2005)

Understand the Boundary of SR as an Inverse Problem Limited modeling capability Fixed enhancement ratio specified by the down-sampling operation We formulate scalable (progressive) SR: as more data become available, higher resolution can be achieved Inevitable approximation when warping gets complex We advocate nonuniform interpolation based forward approach in the case of arbitrary camera motion Sensor PSF is often unknown and time-varying We propose to adaptively select a subset of LR images

Outline Introduction What is SR? Why SR? How to achieve SR? A general framework for SR: registration + restoration Understand the boundary of formulating SR as an inverse problem SR of texts from nonideal video Problem statement: why texts and nonideal video? Analysis of error accumulation in multiframe registration Issue of phase/PSF consistency in restoration : NOT all LR images are useful Experimental Results Conclusions

SR-of-Texts from Nonideal Video HR image of license plate SR Problem Statement Given a segment of video clip that contains some texts that are illegible due to the limited resolution, how to produce a HR image in which the texts become clearly readable (by human)?

Defining the Boundary of Problem Why texts? Texts represent an important class of visual information (e.g., law enforcement applications) Relatively easy assessment of SR results by human observers Texts are often printed to a planar surface, which facilitates the registration What do we mean by nonideal video? Uncontrolled real-world acquisition conditions: handheld camera (arbitrary camera motion), unfavorable illumination, unknown PSF, inevitable compression artifacts, and so on

Our Practical Approach Consistency-guided Preprocessing Not all LR images are used in our SR scheme Homography-based Registration Accuracy is guaranteed by planar surface assumption Nonuniform Interpolation Search for an appropriate magnifying ratio and phase Diffusion-aided Blind Deconvolution Tailored for bimodal textual images

Human vision helps the selection of consistent LR images LR Image Consistency Quality consistency PSF consistency Human vision helps the selection of consistent LR images

Homography-based Multiframe Registration Sequential image 2 image 1 image K Parallel image 1 image 2 image K or Homography matrix Mosaicing: slightly-overlapped FOV  sequential Superresolution: severely-overlapped FOV  parallel

Nonuniform Interpolation phase of HR lattice distance of HR lattice Data grid : Fused data points from registered LR images Lattice : targeted data points at HR Target HR lattice: min d(, ) over two parameters: distance and phase

Experimental Results (I): SR Comparison on Benchmark Data Input: 20 LR images Before deblurring … … After deblurring Thanks to Prof. Milanfar for providing us the UCSC-SR software UCSC-SR Ours

Experimental Results (II): SR Results Comparison on Nonideal Video Input: 4 LR images UCSC-SR Ours

Experimental Results (II): SR Results Comparison on Nonideal Video Input: 4 LR images Ours UCSC-SR After deblurring

Experimental Results (III): Impact of Error Accumulation K=4 parallel sequential K=8 parallel sequential Error accumulation in sequential registration degrades image quality when K is large

Conclusions and Perspectives SR of texts from nonideal video A class of SR problems whose boundary can be well defined An example supporting a practical, forward approach towards SR To have a better understanding of SR techniques We need to look at the problem from a perceptual perspective New applications such as video compression, distributed coding, iris recognition, biomedical imaging will help us define the boundary of SR Spatial vs. temporal SR: fundamental space-time tradeoff