Anisotropic Double Cross Search Algorithm using Multiresolution-Spatio-Temporal Context for Fast Lossy In-Band Motion Estimation Yu Liu and King Ngi Ngan.

Slides:



Advertisements
Similar presentations
Multimedia Data Compression
Advertisements

University of Ioannina - Department of Computer Science Wavelets and Multiresolution Processing (Background) Christophoros Nikou Digital.
Coherent Multiscale Image Processing using Quaternion Wavelets Wai Lam Chan M.S. defense Committee: Hyeokho Choi, Richard Baraniuk, Michael Orchard.
1 Image Transcoding in the block DCT Space Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur,
Yung-Lin Huang, Yi-Nung Liu, and Shao-Yi Chien Media IC and System Lab Graduate Institute of Networking and Multimedia National Taiwan University Signal.
K.-S. Choi and S.-J. Ko Sch. of Electr. Eng., Korea Univ., Seoul, South Korea IEEE, Electronics Letters Issue Date : June Hierarchical Motion Estimation.
Temporal Video Denoising Based on Multihypothesis Motion Compensation Liwei Guo; Au, O.C.; Mengyao Ma; Zhiqin Liang; Hong Kong Univ. of Sci. & Technol.,
{ Fast Disparity Estimation Using Spatio- temporal Correlation of Disparity Field for Multiview Video Coding Wei Zhu, Xiang Tian, Fan Zhou and Yaowu Chen.
An Improved 3DRS Algorithm for Video De-interlacing Songnan Li, Jianguo Du, Debin Zhao, Qian Huang, Wen Gao in IEEE Proc. Picture Coding Symposium (PCS),
Wavelet-Domain Video Denoising Based on Reliability Measures Vladimir Zlokolica, Aleksandra Piˇzurica and Wilfried Philips Circuits and Systems for Video.
Young Deok Chun, Nam Chul Kim, Member, IEEE, and Ick Hoon Jang, Member, IEEE IEEE TRANSACTIONS ON MULTIMEDIA,OCTOBER 2008.
Yen-Lin Lee and Truong Nguyen ECE Dept., UCSD, La Jolla, CA Method and Architecture Design for Motion Compensated Frame Interpolation in High-Definition.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Natan Jacobson, Yen-Lin Lee, Vijay Mahadevan, Nuno Vasconcelos, Truong Q. Nguyen IEEE, ICME 2010.
PREDICTIVE 3D SEARCH ALGORITHM FOR MULTI-FRAME MOTION ESTIMATION Lim Hong Yin, Ashraf A. Kassim, Peter H.N de With IEEE Transaction on Consumer Electronics,2008.
Novel Point-Oriented Inner Searches for Fast Block Motion Lai-Man Po, Chi-Wang Ting, Ka-Man Wong, and Ka-Ho Ng IEEE TRANSACTIONS ON MULTIMEDIA, VOL.9,
Fifth International Conference on Information
1 Robust Video Stabilization Based on Particle Filter Tracking of Projected Camera Motion (IEEE 2009) Junlan Yang University of Illinois,Chicago.
Efficient Motion Vector Recovery Algorithm for H.264 Based on a Polynomial Model Jinghong Zheng and Lap-Pui Chau IEEE TRANSACTIONS ON MULTIMEDIA, June.
Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.
Frederic Payan, Marc Antonini
Multi-Frame Reference in H.264/AVC 卓傳育. Outline Introduction to Multi-Frame Reference in H.264/AVC Multi-Frame Reference Problem Two papers propose to.
Image (and Video) Coding and Processing Lecture: Motion Compensation Wade Trappe Most of these slides are borrowed from Min Wu and KJR Liu of UMD.
Adjustable Partial Distortion Search Algorithm for Fast Block Motion Estimation Chun-Ho Cheung and Lai-Man Po Department of Electronic Engineering, City.
Wavelet-based Coding And its application in JPEG2000 Monia Ghobadi CSC561 project
Motion Estimation Using Low- Band-Shift Method for Wavelet- Based Moving Picture Hyun-Wook Park, Senior Member, IEEE, and Hyung-Sun Kim IEEE Transactions.
Fundamentals of Multimedia Chapter 8 Lossy Compression Algorithms (Wavelet) Ze-Nian Li and Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈.
A Low-Power VLSI Architecture for Full-Search Block-Matching Motion Estimation Viet L. Do and Kenneth Y. Yun IEEE Transactions on Circuits and Systems.
An Introduction to H.264/AVC and 3D Video Coding.
Video Compression Concepts Nimrod Peleg Update: Dec
MULTITEMP 2005 – Biloxi, Mississippi, USA, May 16-18, 2005 Remote Sensing Laboratory Dept. of Information and Communication Technology University of Trento.
國立屏東商業技術學院 資訊工程系 ( 所 ) 多媒體技術發展實驗室 Laboratory of Multimedia Technology Development Department of Computer Science and Information Engineering Nation Pingtung.
Robust global motion estimation and novel updating strategy for sprite generation IET Image Processing, Mar H.K. Cheung and W.C. Siu The Hong Kong.
A 3D Model Alignment and Retrieval System Ding-Yun Chen and Ming Ouhyoung.
Wavelet-based Coding And its application in JPEG2000 Monia Ghobadi CSC561 final project
Low-Power H.264 Video Compression Architecture for Mobile Communication Student: Tai-Jung Huang Advisor: Jar-Ferr Yang Teacher: Jenn-Jier Lien.
Team 5 Wavelets for Image Fusion Xiaofeng “Sam” Fan Jiangtao “Willy” Kuang Jason “Jingsu” West.
MOTION ESTIMATION IMPLEMENTATION IN RECONFIGURABLE PLATFORMS
Wavelets and Multiresolution Processing (Wavelet Transforms)
Guillaume Laroche, Joel Jung, Beatrice Pesquet-Popescu CSVT
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇.
Journal of Visual Communication and Image Representation
Page 11/28/2016 CSE 40373/60373: Multimedia Systems Quantization  F(u, v) represents a DCT coefficient, Q(u, v) is a “quantization matrix” entry, and.
Block-based coding Multimedia Systems and Standards S2 IF Telkom University.
3-D Direction Aligned Wavelet Transform for Scalable Video Coding Yu Liu 1, King Ngi Ngan 1, and Feng Wu 2 1 Department of Electronic Engineering The Chinese.
1 Yu Liu 1, Feng Wu 2 and King Ngi Ngan 1 1 Department of Electronic Engineering, The Chinese University of Hong Kong 2 Microsoft Research Asia, Beijing,
3-D WAVELET BASED VIDEO CODER By Nazia Assad Vyshali S.Kumar Supervisor Dr. Rajeev Srivastava.
Video Coding Presented By: Dr. S. K. Singh Department of Computer Engineering, Indian Institute of Technology (B.H.U.) Varanasi
EE591f Digital Video Processing
Motion Estimation Multimedia Systems and Standards S2 IF Telkom University.
BLOCK BASED MOTION ESTIMATION. Road Map Block Based Motion Estimation Algorithms. Procedure Of 3-Step Search Algorithm. 4-Step Search Algorithm. N-Step.
Hierarchical Systolic Array Design for Full-Search Block Matching Motion Estimation Noam Gur Arie,August 2005.
Principles of Video Compression Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06
1שידור ווידיאו ואודיו ברשת האינטרנט Dr. Ofer Hadar Communication Systems Engineering Department Ben-Gurion University of the Negev URL:
Fast disparity motion estimation in MVC based on range prediction Xiao Zhong Xu, Yun He ICIP 2008.
Date of download: 7/7/2016 Copyright © 2016 SPIE. All rights reserved. Candidate sets of the proposed modified three-block median predictor for (a) the.
Dr. Ofer Hadar Communication Systems Engineering Department
Compressive Coded Aperture Video Reconstruction
Injong Rhee ICMCS’98 Presented by Wenyu Ren
Wavelets : Introduction and Examples
SoC and FPGA Oriented High-quality Stereo Vision System
MOTION ESTIMATION AND VIDEO COMPRESSION
Sum of Absolute Differences Hardware Accelerator
Fully Scalable Multiview Wavelet Video Coding
Improving Retrieval Performance of Zernike Moment Descriptor on Affined Shapes Dengsheng Zhang, Guojun Lu Gippsland School of Comp. & Info Tech Monash.
Instructor: Professor Yu Hen Hu
Embedded Image Coding Based on Context Classification and
Yu Liu and King Ngi Ngan Department of Electronic Engineering
HALO-FREE DESIGN FOR RETINEX BASED REAL-TIME VIDEO ENHANCEMENT SYSTEM
Presentation transcript:

Anisotropic Double Cross Search Algorithm using Multiresolution-Spatio-Temporal Context for Fast Lossy In-Band Motion Estimation Yu Liu and King Ngi Ngan Department of Electronic Engineering, The Chinese University of Hong Kong PCS2006, April 24-26, Beijing, China

Outline Introduction Background Proposed Algorithm Experimental Results Conclusion

Introduction Motion Estimation in Critically-Sampled Wavelet Domain Pro: basically free form the blocking effects Con: inefficient in high bands Motion Estimation in Shift-Invariant Wavelet Domain Pro: perform ME more precisely and efficiently Con: computational complexity e.g. low-band-shift (LBS) and complete-to-overcomplete DWT (CODWT) ME/MC in wavelet domain has received much attention due to its superior performance by comparing to conventional ME/MC in spatial domain. ME/MC in wavelet domain is basically free form the blocking effects due to the global nature of wavelet transform. However, ME/MC in critically sampled wavelet domain is very inefficient in high bands because of the shift-variant property of wavelet decomposition. To overcome the shift-variant property of wavelet transform, LBS and CODWT are proposed. These methods avoid the shift-variant property of the wavelet transform and perform ME more precisely and efficiently. However, a major disadvantage of these methods is the computational complexity which mainly comes from full search algorithm.

Background Motion Estimation in Shift-Invariant Wavelet Domain (1) Two Level Shift-Invariant Wavelet Decomposition by using Low-Band-Shift (LBS) or Complete-to-Overcomplete DWT (CODWT) First, let’s look at some background. This is an example of two level SIDWT by using LBS or CODWT. For LBS method, these other shifted subbands are obtained by shifting the LL Band of each level, then followed by normal wavelet transform. For CODWT method, these other shifted subbands are obtained by making a direct link between the critically-sampled subbands and shift-invariant subbands using the complete-to-overcomplete prediction filters.

Background Motion Estimation in Shift-Invariant Wavelet Domain (2) Generation of Wavelet Blocks For ME in shift-invariant wavelet domain, the coefficients of each wavelet tree rooted in the lowest subband are rearranged to form a wavelet block. The purpose of the wavelet block is to provide a direct association between the wavelet coefficients and what they represent spatially in the image. Related coefficients at all scales and orientations are included in each wavelet block. The v-pixel-shifted or {dx,dy}-pixel-shifted coefficient of the pth wavelet block of the reference frame t’ can be represented by (dx%2l, dy%2l) means which shifted subband it is, (i+[dx/2l],j+[dy/2l]) means where the pixel is located for the shifted subband. l denotes decomposition level, k denotes subband type, such as LL/HL/LH/HH subband type. The coefficient of the pth wavelet block of the current frame t can be represented by The difference between these two representations of the reference and current frames is that, for ME/MC, we just want to use the shift-invariant wavelet coefficients of the reference frame to predict the critically-sampled wavelet coefficients of the current frame. The v-pixel-shifted or {dx,dy}-pixel-shifted coefficient of the pth wavelet block of reference frame t’ can be represented by The coefficient of the pth wavelet block of current frame t can be represented by

Background Motion Estimation in Shift-Invariant Wavelet Domain (3) The sum of absolute difference (SAD) of the pth wavelet block for the motion vector v is computed as follows: The wavelet blocks in the search window in the reference frame are compared to current wavelet block, and a reference wavelet block that leads to the best match is selected. The sum of absolute difference (SAD) of the pth wavelet block for the displacement vector v is computed as follows: The optimum motion vector v∗ of the pth wavelet block, which has minimum displacement error, is given by: However, computational complexity of full search ME in SIWD is so time consuming because of the extra shifted subbands that limits its practical applications. For this reason, alternative and faster techniques should be developed. The optimum motion vector v∗ of the pth wavelet block, which has minimum displacement error, is given by:

Background Anisotropic Motion Model in Wavelet Domain Traditional 2D ME in spatial domain suffers from the aperture problem 2D ME in wavelet domain the aperture problem can in fact be exploited as an advantage. Another background is the Anisotropic Motion Model in Wavelet Domain. Traditional 2D ME in spatial domain, specially for optical flow estimation techniques, suffers from the aperture problem, due to the ill-posed nature and small observation window. However, the aperture problem can in fact be exploited as an advantage of motion estimation in wavelet domain. Since wavelet transform structures the image into the subbands with different orientations, the subbands contain different edges with different normal flow direction. This suggests that the 2D ME problem can be approximated by a 1D ME along the normal flow direction for the vertical/horizontal subbands. We can use this property to develop the fast lossy motion estimation algorithm in shift-invariant wavelet domain. (a) Aperture problem in spatial domain, (b) Anisotropic motion model in wavelet domain

Proposed Algorithm Multiresolution-Spatio-Temporal Context (1) Traditional MRME algorithms Multiresolution context Not enough for reducing the risk of getting trapped into a local minimum. The proposed algorithm Multiresolution-spatio-temporal Context Consists of one multiresolution context, four spatial contexts, and five temporal contexts. In the traditional MRME algorithms, the ME is first performed in the coarser resolution to find an initial MV, and then the MV of finer resolution will be refined based on the MV obtained at coarser resolution. However, the coarse-scaled MV for some blocks may not be accurate enough and could cause some errors which propagate along the hierarchical structure. Therefore, using only multiresolution context is not enough for reducing the risk of getting trapped into a local minimum. In the proposed approach, we will exploit the context information from multiresolution-spatial-temporal adjacent block to select a set of initial MV candidates. The candidate predictors for the current block consist of one multiresolution context, four spatial contexts, and five temporal contexts. The set of the initial MV candidates can be expressed as {c0, c1, ..., c9}. (a) multiresolution context, (b) spatial context, (c) temporal context

Proposed Algorithm Multiresolution-Spatio-Temporal Context (2) For LL subband Initialization: spatio-temporal context, plus the candidate points in shifted LL subband, where the median predictor is located Refinement: diamond search algorithm For other levels Initialization: multiresolution-spatio-temporal context Refinement: anisotropic double cross search algorithm For LL subband, in the initialization stage, besides the candidate predictors from spatio-temporal context, all the candidate points in the shifted LL subband, where the median predictor is located, are also checked. The best predictor among these candidate predictors is further refined by using diamond search algorithm to obtain the best motion vector in LL subband. For other levels, In the initialization stage, all the candidate predictors from multiresolution-spatio-temporal context will be checked. The best predictor among these candidate predictors is further refined by using our proposed anisotropic double cross search algorithm to obtain the best motion vector in the corresponding level.

Proposed Algorithm Anisotropic Double Cross Search Algorithm (1) Anisotropic motion model suggests that the 2D ME problem in wavelet domain can be approximated by 1D ME along the normal flow direction for the vertical/horizontal subbands. During the 1D window searching, only the coefficients in the corresponding subbands and LL subband are computed. Since anisotropic motion model suggests that the 2D ME problem can be approximated by a 1D ME along the normal flow direction for the vertical/horizontal subbands, we make use of the anisotropic property of the motion field in wavelet domain to develop an anisotropic double cross search algorithm During the 1D window searching refinement, only the coefficients in the corresponding subbands and LL subband are computed. And we define the SAD from the corresponding subbands and LL subband as anisotropic SAD (ASAD), as follows

Proposed Algorithm Anisotropic Double Cross Search Algorithm (2) The proposed search algorithm initially considers all possible motion vector predictor candidates from multiresolution-spatio-temporal context and uses the best motion vector predictor candidate {dxp; dyp} as the center of the search. Then, the search starts from the two different subbands in different routes: one route is from HL subbands to LH subbands, another route is from LH subbands to HL subbands, The circle denotes This ellipse denotes This rectangle denotes The solid ellipse and solid rectangle These two search routes form an anisotropic double cross search pattern. However, there may exist two best matching points for each cross search route. If the two best matching points from those two cross search routes are not the same, as shown in Fig.4(a) Case I, then the coefficients in HH subbands will be used to judge which one is the best matching point. And the winner is selected as the new center of the search; meanwhile the search stepsize keeps unchanged. If the two best matching points are the same but not the center of the search, as shown in Fig.4(b) Case II, then the best matching point is selected as the new center of the search; meanwhile the search stepsize is reduced by half until it is equal to 1. when the two best matching points from each cross search step are both located in the center of the search, as shown in Fig.4(c) Case III., then, the final step is reached

Experimental Results (1) Simulation results are reported in the following ways: PSNR MAD operation number speed-up ratio For performance comparison Full Search Algorithm (FSA) FMRME [6] FIBME [7] proposed MR-STC-ADCS Simulation results are reported in the following ways: PSNR for the quality measure between the original and the motion-compensated reconstructed frames. MAD for the distortion measure between the original and the motion-compensated wavelet frames. operation number per block used to compute the partial distortion; execution time per frame for motion estimation including the required overheads for comparison. For the performance comparison, we tested the following algorithms: FSA, FMRME, FIBME, MR-STC-ADCS. Here are Experimental Results of Test Video Sequences with Full Search Algorithm.

Experimental Results (2) Comparison of the Tested Algorithms for QCIF Video Sequences This table lists the Comparison Results of the Tested Algorithms for QCIF Video Sequences

Experimental Results (3) Comparison of the Tested Algorithms for CIF Video Sequences And, this table lists the Comparison Results of the Tested Algorithms for CIF Video Sequences

Experimental Results (4) Comparison of the Tested Algorithms for 4CIF Video Sequences On average, for all sequences examined in the experimental tests: MR-STC-ADCS is roughly 11.5 and 2.6 times faster whereas its PSNR is approximately 1.46 dB and 0.6 dB higher than FMRME and FIBME; and its MAD is approximately 0.426 and 0.165 lower than FMRME and FIBME. MR-STC-ADCS is about 271 times faster than FSA for QCIF, 667 times for CIF, and 1313 times for 4CIF, while having an average PSNR loss of only 0.04 dB or an average MAD increase of only 0.018 compared to the FSA. And, here is for CIF Video Sequences From the Experimental Results of all the Tested Video Sequences, we can see, On average, MR-STC-ADCS is roughly 11.5 and 2.6 times faster whereas its PSNR is approximately 1.46dB and 0.6dB higher than FMRME and FIBME algorithms, respectively. and its MAD is approximately 0.426 and 0.165 lower than FMRME and FIBME. On average, MR-STC-ADCS is about 271 times faster than FSA for search window (SW) 15, 667 times for SW 31, and 1313 times for SW 63 while having an average PSNR loss of only 0.04dB or an average MAD increase of only 0.018 compared to FSA or an average MAD increase of only 0.018 compared to the FSA.

Conclusion Fast Lossy In-Band Motion Estimation Algorithm Anisotropic property of the motion field in shift-invariant wavelet domain Multiresolution-spatio-temporal Context Anisotropic Double Cross Search In this paper, we proposed a fast lossy In-Band motion estimation algorithm,MR-STC-ADCS. The proposed algorithm is based on the anisotropic property of motion field in shift-invariant wavelet domain. And we use this property to propose an Anisotropic Double Cross Search Algorithm using Multiresolution-spatio-temporal Context.