Video Motion Interpolation for Special Effect Applications Timothy K. Shih, Senior Member, IEEE, Nick C. Tang, Joseph C. Tsai, and Jenq-Neng Hwang, Fellow,

Slides:



Advertisements
Similar presentations
People Counting and Human Detection in a Challenging Situation Ya-Li Hou and Grantham K. H. Pang IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART.
Advertisements

Evaluating Color Descriptors for Object and Scene Recognition Koen E.A. van de Sande, Student Member, IEEE, Theo Gevers, Member, IEEE, and Cees G.M. Snoek,
M.S. Student, Hee-Jong Hong
Real-Time Accurate Stereo Matching using Modified Two-Pass Aggregation and Winner- Take-All Guided Dynamic Programming Xuefeng Chang, Zhong Zhou, Yingjie.
Image Compositing and Matting. Introduction Matting and compositing are important operations in the production of special effects. These techniques enable.
Tracking Multiple Occluding People by Localizing on Multiple Scene Planes Professor :王聖智 教授 Student :周節.
Robust Object Tracking via Sparsity-based Collaborative Model
Video Inpainting Under Constrained Camera Motion Kedar A. Patwardhan, Student Member, IEEE, Guillermo Sapiro, Senior Member, IEEE, and Marcelo Bertalm.
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
Adviser : Ming-Yuan Shieh Student ID : M Student : Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
{ Fast Disparity Estimation Using Spatio- temporal Correlation of Disparity Field for Multiview Video Coding Wei Zhu, Xiang Tian, Fan Zhou and Yaowu Chen.
ICME 2008 Huiying Liu, Shuqiang Jiang, Qingming Huang, Changsheng Xu.
Broadcast Court-Net Sports Video Analysis Using Fast 3-D Camera Modeling Jungong Han Dirk Farin Peter H. N. IEEE CSVT 2008.
Video Coding with Spatio-temporal Texture Synthesis and Edge-based inpainting Chunbo Zhu, Xiaoyan Sun, Feng Wu, and Houqiang Li ICME 2008.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Exchanging Faces in Images SIGGRAPH ’04 Blanz V., Scherbaum K., Vetter T., Seidel HP. Speaker: Alvin Date: 21 July 2004.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Video Segmentation Based on Image Change Detection for Surveillance Systems Tung-Chien Chen EE 264: Image Processing and Reconstruction.
Rodent Behavior Analysis Tom Henderson Vision Based Behavior Analysis Universitaet Karlsruhe (TH) 12 November /9.
Region-Level Motion- Based Background Modeling and Subtraction Using MRFs Shih-Shinh Huang Li-Chen Fu Pei-Yung Hsiao 2007 IEEE.
Example-Based Color Transformation of Image and Video Using Basic Color Categories Youngha Chang Suguru Saito Masayuki Nakajima.
Detecting Image Region Duplication Using SIFT Features March 16, ICASSP 2010 Dallas, TX Xunyu Pan and Siwei Lyu Computer Science Department University.
Region Filling and Object Removal by Exemplar-Based Image Inpainting
Effective Gaussian mixture learning for video background subtraction Dar-Shyang Lee, Member, IEEE.
Illumination Normalization with Time-Dependent Intrinsic Images for Video Surveillance Yasuyuki Matsushita, Member, IEEE, Ko Nishino, Member, IEEE, Katsushi.
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Background Estimation Mehdi Ghayoumi, MD Iftakharul Islam, Muslem Al-Saidi Department of Computer Science Kent State University, Kent, OH
Jason Li Jeremy Fowers Ground Target Following for Unmanned Aerial Vehicles.
Joint Histogram Based Cost Aggregation For Stereo Matching Dongbo Min, Member, IEEE, Jiangbo Lu, Member, IEEE, Minh N. Do, Senior Member, IEEE IEEE TRANSACTION.
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
3D Fingertip and Palm Tracking in Depth Image Sequences
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
Olga Zoidi, Anastasios Tefas, Member, IEEE Ioannis Pitas, Fellow, IEEE
1 Efficient Reference Frame Selector for H.264 Tien-Ying Kuo, Hsin-Ju Lu IEEE CSVT 2008.
Mean-shift and its application for object tracking
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Shape-Based Human Detection and Segmentation via Hierarchical Part- Template Matching Zhe Lin, Member, IEEE Larry S. Davis, Fellow, IEEE IEEE TRANSACTIONS.
1 Mean shift and feature selection ECE 738 course project Zhaozheng Yin Spring 2005 Note: Figures and ideas are copyrighted by original authors.
Introduction to Visible Watermarking IPR Course: TA Lecture 2002/12/18 NTU CSIE R105.
Methods of Video Object Segmentation in Compressed Domain Cheng Quan Jia.
Video Tracking Using Learned Hierarchical Features
December 4, 2014Computer Vision Lecture 22: Depth 1 Stereo Vision Comparing the similar triangles PMC l and p l LC l, we get: Similarly, for PNC r and.
Marco Pedersoli, Jordi Gonzàlez, Xu Hu, and Xavier Roca
1 Webcam Mouse Using Face and Eye Tracking in Various Illumination Environments Yuan-Pin Lin et al. Proceedings of the 2005 IEEE Y.S. Lee.
A Region Based Stereo Matching Algorithm Using Cooperative Optimization Zeng-Fu Wang, Zhi-Gang Zheng University of Science and Technology of China Computer.
Hierarchical Method for Foreground DetectionUsing Codebook Model Jing-Ming Guo, Yun-Fu Liu, Chih-Hsien Hsia, Min-Hsiung Shih, and Chih-Sheng Hsu IEEE TRANSACTIONS.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
Implementation, Comparison and Literature Review of Spatio-temporal and Compressed domains Object detection. By Gokul Krishna Srinivasan Submitted to Dr.
CSSE463: Image Recognition Day 23 Midterm behind us… Midterm behind us… Foundations of Image Recognition completed! Foundations of Image Recognition completed!
Journal of Visual Communication and Image Representation
Vehicle Detection in Aerial Surveillance Using Dynamic Bayesian Networks Hsu-Yung Cheng, Member, IEEE, Chih-Chia Weng, and Yi-Ying Chen IEEE TRANSACTIONS.
Automatic Caption Localization in Compressed Video By Yu Zhong, Hongjiang Zhang, and Anil K. Jain, Fellow, IEEE IEEE Transactions on Pattern Analysis and.
May 2003 SUT Color image segmentation – an innovative approach Amin Fazel May 2003 Sharif University of Technology Course Presentation base on a paper.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
Trajectory-Based Ball Detection and Tracking with Aid of Homography in Broadcast Tennis Video Xinguo Yu, Nianjuan Jiang, Ee Luang Ang Present by komod.
Local Stereo Matching Using Motion Cue and Modified Census in Video Disparity Estimation Zucheul Lee, Ramsin Khoshabeh, Jason Juang and Truong Q. Nguyen.
Video object segmentation and its salient motion detection using adaptive background generation Kim, T.K.; Im, J.H.; Paik, J.K.;  Electronics Letters 
Video Motion Interpolation for Special Effect Applications
CSSE463: Image Recognition Day 21
A segmentation and tracking algorithm
CSSE463: Image Recognition Day 23
Presented by: Yang Yu Spatiotemporal GMM for Background Subtraction with Superpixel Hierarchy Mingliang Chen, Xing Wei, Qingxiong.
CSSE463: Image Recognition Day 23
A Block Based MAP Segmentation for Image Compression
CSSE463: Image Recognition Day 23
Presentation transcript:

Video Motion Interpolation for Special Effect Applications Timothy K. Shih, Senior Member, IEEE, Nick C. Tang, Joseph C. Tsai, and Jenq-Neng Hwang, Fellow, IEEE IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART C: APPLICATIONS AND REVIEWS, VOL. 41, NO. 5, SEPTEMBER 2011

Outline Introduction System Overview Motion Layer Segmentation and Tracking Motion Interpolation Using Video Inpainting Experimental Results Conclusion

Introduction

Background Video forgery (video falsifying): A technique for generating fake videos by altering, combining, or creating new video contents For instance, the outcome of a 100 m race in the olympic game is changed.

Introduction Example of video forgery : Original video frame Falsifying result

Objective To create a forged video, which is almost indistinguishable from the original video To create special effects in video editing applications

Introduction To change the content of video, the following techniques are commonly used: object tracking motion interpolation video inpainting video layer fusing

Introduction Contributions of this paper: 1) It is the first time that video forgery is attempted based on video inpainting techniques. 2) A new concept called guided inpainting for motion interpolation of video objects is proposed. 3) A guided quasi-3-D (i.e., X, Y, and time) video inpainting mechanism is proposed.

System Overview

1) Motion Layer Segmentation: Separates background and tracked object 2) Motion Prediction: Finds Reference Stick-Figure to predict cycle of motion 3) Motion Interpolation: Motion analysis Patch assertion Motion completion via inpainting.

System Overview 4) Background Inpainting: Inpaints background of different camera motions 5) Layer Fusion: Merges an object layer and a background layer

Motion Layer Segmentation and Tracking

Separate target objects from the background Adopt Mean Shift Feature Space Analysis Algorithm [2] for color region segmentation [2] D. Comaniciu and P.Meer, “Mean shift: A robust approach toward feature space analysis,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 5, pp. 603– 619, May 2002.

Initial segmentation of objects from their background

ALGORITHM: REFERENCE STICK FIGURE TRACKING Manually Selected Mean Shift Algorithm C0C0 C0’C0’ Frame 0

ALGORITHM: REFERENCE STICK FIGURE TRACKING Revised Fast Tracking Mechanism [6] Bounding Box B 1 Frame 1 [6] K. Hariharakrishnan and D. Schonfeld, “Fast object tracking using adaptive blockmatching,” IEEE Trans.Multimedia, vol. 7, no. 5, pp. 853–859, Oct

ALGORITHM: REFERENCE STICK FIGURE TRACKING Mean Shift Algorithm B1’B1’ Comparing Color Segments of C 0 ’ and B 1 ’ C1’C1’ Frame 1 Frame 0

ALGORITHM: REFERENCE STICK FIGURE TRACKING Applying Dilation C1’C1’ C1*C1* Frame 1

ALGORITHM: REFERENCE STICK FIGURE TRACKING Frame 0 CoCo C1*C1* Comparing corresponding pixel p If (p in C 1 *) - (p in C 0 ) > L 2 Exclude p in C 1 ’ Else Keep p in C 1 ’ Frame 1 ‧ Set L 2 = 2 ‧ Using L in LUV color space

Motion Segmentation Different parts of the target may move in different directions Decomposing an object into different regions Using revised block searching algorithm [10] to compute motion map [10] J. Jia, Y.-W. Tai, T.-P.Wu, and C.-K. Tang, “Video repairing under variable illumination using cyclic motions,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 28, no. 5, pp. 832–839, May 2006.

Motion Segmentation The Mean Shift color segmentation can also be revised to deal with motion segmentation: Based on blocks, not pixels Important for video inpainting Ghost shadows can be eliminated

Original Video Frame Corresponding result of color segmentation by using [2] The example of tracked object and estimated vectors

Motion Interpolation Using Video Inpainting

Motion Interpolation : Motions of the target object need to be interpolated. Video Inpainting : In order to obtain the interpolated figures Motion interpolation may create background holes.

Motion Interpolation of Target Objects A target object can be segmented into a layer. Motion interpolation is required to produce a slow motion of the target layer. Original Interpolated tntn t n+1 t n+2 t n+3

General Inpainting Strategy In order to obtain the interpolated figures Using a rule-based thinning algorithm [1] : To obain the stick figures of target objects Stick figures: Used to guide the selection of patches Copied from the original video [1] M. Ahmed and R. Ward, “A rotation invariant rule-based thinning algorithm for character recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 12, pp. 1672–1678, Dec

General Inpainting Strategy Quasi-3-D video space (2-D plus tIme) Using 3-D patches in quasi-3-D video inpainting produce a smooth movement

Prediction and Interpolation of Cyclic Motion Consider the following scenario: 1) It’s common for target objects to perform actions in a repeated cycle. 2) A stick figure can be used to estimate the relative positions of patches. (e.g., head, body, and legs).

Prediction and Interpolation of Cyclic Motion Stick figures and the contours of target objects can be used to predict repeated cycles.

Prediction and Interpolation of Cyclic Motion Missing stick figure can be reproduced by: 1) Searching for similar reference stick figures in a repeated motion cycle 2) Interpolation of two known stick figures

ALGORITHM: REFERENCE STICK FIGURE SEARCHING

x …… x+r r : the number of frames in a repeated cycle index range function of a given frame number x as idx(x) = [x + r − 2, x + r − 1, x + r, x + r + 1, x + r + 2] ∪ [x − r − 2, x − r − 1, x − r, x − r + 1, x − r + 2]. x-r

STICK FIGURE INTERPOLATION OaOa Union of O a and O b ObOb Thinning result

Motion Interpolation Alogorithm extending our image inpainting algorithm for motion interpolation consider a video as a 2-D plus time domain

Motion Interpolation Alogorithm I 3 = Φ 3 ∪ Ω 3 Φ 3 is a source space Ω 3 is a target space Φ 3 ∩ Ω 3 = ∅ (an empty set)

ALGORITHM: PATCH ASSERTION

Example for patch assertion: ALGORITHM: PATCH ASSERTION Patches on stick figure Contour ω of (a) (searched in the nearby motion cycle) Result of patch assertion

The main algorithm Let ∂Ω 3 be: a front surface on Ω 3 adjacent to Φ 3 ALGORITHM: MOTION INTERPOLATION Source Region Target Region

Given a 3-D patch Ψ p centered at the point p Let Ψ p ‘s priority P(p) = C(p) × D(p) ALGORITHM: MOTION INTERPOLATION Source Region Target Region

C(p) : Confidence term: The percentage of useful information inside a patch centered at p the size of 3-D patch is denoted as |Ψ 3 | = 27 pixels ALGORITHM: MOTION INTERPOLATION

D(p) : Data term Compute the percentage of edge pixels in the patch ( Instead of computing the isophote[3] ) var(Ψ p ) : the color variation of the patch [21] ALGORITHM: MOTION INTERPOLATION 3 [3] A. Criminisi, P. Perez, and K. Toyama, “Region filling and object removal by exemplar-based image inpainting,” IEEE Trans. Image Process., vol. 13, no. 9, pp. 1200–1212, Sep [21] T. K. Shih, N. C. Tang,W.-S. Yeh, T.-J. Chen, andW. Lee, “Video inpainting and implant via diversified temporal continuations,” in Proc ACM Multimedia Conf., Santa Barbara, CA, Oct. 23–27, 2006, pp. 133–136.

Example of motion interpolation via inpainting ALGORITHM: MOTION INTERPOLATION

Inpainting Camera Motions Using mechanism proposed in [22] Ensures that there is no “ghost shadows” created in the background Segment motions into different regions The inpainted area in the previous frame needs to be incorporated. [22] T. K. Shih, N. C. Tang, and J.-N. Hwang, “Ghost shadow removal in multilayered video inpainting,” in proc. IEEE 2007 Int. Conf.Multimedia Expo, Beijing, China, Jul. 2–5, pp. 1471–1474.

Layer Fusion Need to merge video layers to produce forged video. The fusion process merges an object layer and a background layer. (With contour of object layer computed based on the object tracking)

ALGORITHM: LAYER FUSION

Object layer obj Dilation area of obj δ ob j Corresponding area on background layer δ bkg δ ob j pixel intensity → υ L ObjectBackground

ALGORITHM: LAYER FUSION (-2,70) Histogram of the intensity difference in δ ob j and δ bkg υLυL Intensity Difference Count

Experimental Results

Block size used in Patch Assertion: 10-by-10 pixels Patch size used in Motion Interpolation: 3-by-3-by-3 pixels Hardware: CPU 2.1G with 2G RAM

Evaluation Motion Layer Segmentation Motion Prediction Motion Interpolation Background Inpainting Layer Fusion

Evaluation

Limitations of the Proposed Mechanism Shadows cannot be tracked precisely. Only use intensity to merge layers (without considering the chrominance information) A sophisticated 3-D reconstruction mechanism needs to be investigated. Only produces actions two times slower for slow motion.

Conclusion

This paper Proposes an interesting technology to alter the behavior of moving objects in a video Effectively extends the inpainting technique to a quasi-3-D space Allows a video to be separated into several layers, played in different speeds, and then merged

Conclusion A series of difficult problems are solved. Solutions are successfully integrated. Mostly, it is a subjective feeling of how a fake video looks real. Necessary to develop an authoring tool to allow the users to specify the spatiotemporal fluctuation property