Robust global motion estimation and novel updating strategy for sprite generation, IET Image Processing, Mar. 2007. H.K. Cheung and W.C. Siu, The Hong Kong Polytechnic University.

Presentation transcript:

Robust global motion estimation and novel updating strategy for sprite generation, IET Image Processing, Mar. 2007. H.K. Cheung and W.C. Siu, The Hong Kong Polytechnic Univ. (香港理工大學)

Outline
- Overview / Introduction
- Proposed system
  - New global motion estimation
    - Combining short- and long-term estimation
    - Dynamic reference frame
  - Two-pass sprite blending
    - Preventing frame resolution loss
  - Sprite updating
    - Overcoming illumination variations and background object changes
- Experimental results
- Conclusions

Overview

Sprite
- A high-resolution image
- Composed of information belonging to an object visible throughout a video sequence
- Typically the background of a scene

Overview  Sprite background of frame 1 (Dimension: 352x288) background of frame 20 Sprite (Dimension: 2670x1072)

Overview  Core of sprite generation Global motion estimation (GME)  Finding a set of parameters representing camera motion between frames Image registration Iterative minimization Blending  Temporal (weighted) averaging, median, updating

Introduction

Global motion estimation (image registration)
- Short-term motion estimation
  - Estimation between consecutive frames
  - Easy and accurate
- Long-term motion estimation
  - Estimation between frames with a large temporal distance
  - Harder
  - Required for sprite coding: a single sprite for all frames in the sequence

Introduction  Global motion estimation (cont.) Short- to long-term estimation  Converting short-term motion parameters to long- term parameters  Error propagation Directly long-term estimation  Estimation every frames directly to a specified base frame (reference frame) No error propagation  Search range may be huge  Hard to find overlapping area

Introduction  Global motion estimation (cont.) Hierarchical estimation  Rough estimation to find coarse parameters  Refining parameters Using coarse parameters as initials Iterative minimization  Some existing methods Dufaux and Konrad Szeliski Smolic et. al. Lu et. al.

Introduction  Restrictions Background must be really static  Background objects must be still  No illumination variations Dynamic sprite

Introduction  Classification Static sprite  Build offline before coding individual frames  Quality degradation as frame increases Motion estimation errors Illumination variations Background object changes Dynamic sprite  Built dynamically online in both encoder and decoder while coding individual frames Sprite is updated using reconstructed frame  Short-term estimation is employed Error accumulated

Introduction  Proposed system New global motion estimation  Directly estimating the relative motion between current image and a chosen reference frame Give accurate, stable and robust estimation Alleviate error accumulation  Hierarchical 3-levels approach Coarse-to-fine approach Sprite updating  Updating sprite only if necessary Sprite update frames are generated and sent

Proposed system

Short-term GME to long-term GME
- Frame 1 is the reference frame; A_{m1} denotes the motion parameters relating frame m to frame 1
- The long-term parameters are obtained by concatenating the short-term estimates:
  A_{(m+1)1} = A_{(m+1)m} ∘ A_{m1} = A_{(m+1)m} ∘ A_{m(m-1)} ∘ … ∘ A_{21}
- Every short-term estimate carries a registration error, so the registration errors are ACCUMULATED: the more frames are concatenated, the larger the error (a toy illustration follows)
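As a toy illustration of this accumulation (not code from the paper), the short-term motion parameters can be modelled as 3x3 homographies, each mapping the later frame's coordinates into the previous frame's; under that assumed convention the long-term transform is the running matrix product, and a small registration error in every step drifts away from the true pan:

    import numpy as np

    # Toy example: concatenate noisy short-term homographies into long-term parameters.
    rng = np.random.default_rng(0)
    true_step = np.array([[1.0, 0.0, 2.0],    # ideal motion: 2-pixel pan per frame
                          [0.0, 1.0, 0.0],
                          [0.0, 0.0, 1.0]])

    A_long = np.eye(3)                        # A_{(m+1)1}, built by concatenation
    for m in range(100):
        step = true_step.copy()
        step[:2, 2] += rng.normal(scale=0.05, size=2)   # small registration error per step
        A_long = A_long @ step                # concatenate with the next short-term estimate

    drift = A_long[:2, 2] - np.array([200.0, 0.0])      # deviation from the ideal 200-pixel pan
    print("accumulated translation error:", drift)      # grows with the number of steps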

Proposed system  Directly measure to reference frame GME Frame 1 A 11 Frame m A m1 Frame m+1 reference frame …… A (m+1)1 A m1 initial guess Registration Error Registration errors are COMPENSATED

Proposed system  Weakness Reference frame is temporally far from current frame  Frame contents may change largely Background objects activities Lighting conditions changes Overlapping area could be smaller  Unfavorable to GME

Proposed system  Combining the advantages Dividing video into groups of consecutive frames 1st frame of each group is selected as reference  Frames in a group Each frame is directly measured to the 1st frame  Smaller registration error  Merging groups GMEs of reference frames of all groups are merged  Registration error is slightly increased R1R2R3 …… ++ A (R1)(R1) A (R2)(R1) A (R3)(R2) A (R2)(R1) A (R3)(R1)

Proposed system  Proposed GME structure Motion Estimation Frame k A k1 Frame m A mk A m1 Frame m+1 Frame z Chosen to be reference frame …… A (m+1)k A mk

Proposed system  Dynamic reference frame 1st frame is the initial reference frame Assigning current frame as new reference frame if  Displaced frame difference between registered current frame and the reference frame it large Reference frame is not like current frame  Relative displacement between current frame and the reference frame is large Overlapping area is too small or where Nr is a parameter between 0 and 1 (Nr=0.1 in practical)

Proposed system  Advantages Accuracy  Accurate than short-term and directly long-term estimation Very few memory usage  Estimations are performed frame-to-frame  Sprite building is not necessary

Proposed system  GME Reference frame (frame k) Frame z Three step search Block-based partial distortion search Fast gradient method A (m+1)k A mk +

Proposed system  Motion model Perspective motion model  8 motion parameters to be determined  Three-step matching 3-level pyramids for frame k and z are built using Gaussian down-sampling filter [ ¼, ½, ¼ ] frame k: reference frame frame z: transformed current frame m+1

Proposed system  Block-matching Affine parameters are estimated by solving over- fitting equations Results of block-based motion estimation are used to construct the equations  Parameter estimation Fast gradient descent method by Keller and Averbuch where

Proposed system  Two-passed blending to avoid resolution loss First pass: 1st frame as base frame  All frames are projected into 1st frame  Frame with minimal area of projected frame is selected as new base frame Avoiding resolution loss  No real pixel blending applied Second pass: new base frame  All frames are projected into new base frame  Simple temporal average blending With bilinear interpolation

Proposed system  Dynamic sprite updating Overcoming illumination variations  Single value in sprite can not represent intensity variations over the time Accumulation of GME error blurring the frame  GME error in a reference frame will inherit into all of frames in the group

Proposed system  Studying the generated intensity error an edge pixel a pixel from homogeneous area a pixel from texture area translation in x-direction # of pixel with significant error

Proposed system  Distribution of intensity error correlates roughly to the panning motion Errors tends to be clustered in the temporal domain  Errors of homogeneous and texture regions are tend to randomly around zero

Proposed system  Sprite updating Selecting frames with significant change in panning direction/speed

Proposed system  Sprite updating (cont.) Reconstruct next N frame from the sprite Blend the N error frames into a sprite-sized buffer (the sprite update frame) Compute the N error frames Encode and send the sprite update frame to the decoder MPEG4 I-VOP frame

Experimental results

Testing
- Construct the sprite
- Reconstruct the frames from the sprite
- Compute the PSNR (see the sketch below)
Comparison
- Short-term motion estimation: estimating between the current and the previous frame
- Long-term motion estimation: estimating between the current frame and the sprite, without parameter prediction
- Long-term motion estimation by the MPEG-4 VM
- Long-term motion estimation by Smolic et al.
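For reference, the PSNR measure used in such evaluations is the standard one, assuming 8-bit luminance frames; `reconstruct_from_sprite` in the commented usage line is a hypothetical helper.

    import numpy as np

    def psnr(original, reconstructed, peak=255.0):
        mse = np.mean((original.astype(np.float64) - reconstructed.astype(np.float64)) ** 2)
        if mse == 0:
            return float('inf')
        return 10.0 * np.log10(peak ** 2 / mse)

    # Usage sketch: average PSNR over all frames of a sequence.
    # avg = np.mean([psnr(f, reconstruct_from_sprite(sprite, A, f.shape))
    #                for f, A in zip(frames, transforms)])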

Experimental results
[Figure: results obtained with short-term vs. long-term motion estimation]

Experimental results
[Figure: results obtained with the MPEG-4 VM vs. the proposed method]

Experimental results  PSNR Proposed MPEG-4 Short-term Long-term Smolic et. al.

Experimental results  Average PSNR (dB) Short- term Long- term MPEG-4 VM Smolic et. al. Proposed (affine) Proposed (per- spective) Stefan (150) Foreman (150) Coast Guard (150) Stefan (300) Failure

Experimental results  Selecting threshold Nr Proposed method is better than simple short-term and long-term estimation Short-term 0.1 Long-term

Experimental results  Performance of sprite updating SequenceUpdate framesAverage PSNR (dB)Size of updates (kB) stefan stefan0,51,108,174,206 * stefan0,60,120,180, stefan0,51,108,106 * stefan0,80,160, coast guard coast guard0,76 * foreman foreman0,10,25,64,110 * foreman0,30,60,90, * Update frames is figured out from the major camera operations of the sequences

Conclusions

New global motion estimation method
- Direct estimation from the current frame to a chosen reference frame
- Combines the advantages of short-term and long-term estimation
  - Error accumulation is prevented
  - The reference frame is kept close to the current frame
Sprite updating
- Encoding and sending sprite update frames, which carry the errors of a group of reconstructed frames
- Reduces sprite blurring