University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent.

Slides:



Advertisements
Similar presentations
Part 2: Unsupervised Learning
Advertisements

Bayesian Belief Propagation
2010 Winter School on Machine Learning and Vision Sponsored by Canadian Institute for Advanced Research and Microsoft Research India With additional support.
CVPR2013 Poster Modeling Actions through State Changes.
Probabilistic Inference and Learning in Computer Vision An extract from the BMVC2000 pre-conference tutorial given by: Prof. Andrew Blake Microsoft Research.
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Patch to the Future: Unsupervised Visual Prediction
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
LOCUS (Learning Object Classes with Unsupervised Segmentation) A variational approach to learning model- based segmentation. John Winn Microsoft Research.
Video Inpainting Under Constrained Camera Motion Kedar A. Patwardhan, Student Member, IEEE, Guillermo Sapiro, Senior Member, IEEE, and Marcelo Bertalm.
Dale & Lewis Chapter 3 Data Representation. Representing color Similarly to how color is perceived in the human eye, color information is encoded in combinations.
Joint Estimation of Image Clusters and Image Transformations Brendan J. Frey Computer Science, University of Waterloo, Canada Beckman Institute and ECE,
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
LOCUS Demo Stefan Zickler. Two “different” classes Class “Car Side Views” Class “Car Rears”
Noise Estimation from a Single Image Ce Liu William T. FreemanRichard Szeliski Sing Bing Kang.
A Brief Overview of Computer Vision Jinxiang Chai.
CS 1308 Computer Literacy and the Internet. Creating Digital Pictures  A traditional photograph is an analog representation of an image.  Digitizing.
Image Recognition using Hierarchical Temporal Memory Radoslav Škoviera Ústav merania SAV Fakulta matematiky, fyziky a informatiky UK.
Graphite 2004 Statistical Synthesis of Facial Expressions for the Portrayal of Emotion Lisa Gralewski Bristol University United Kingdom
Tzu ming Su Advisor : S.J.Wang MOTION DETAIL PRESERVING OPTICAL FLOW ESTIMATION 2013/1/28 L. Xu, J. Jia, and Y. Matsushita. Motion detail preserving optical.
 In electrical engineering and computer science image processing is any form of signal processing for which the input is an image, such as a photograph.
INDEPENDENT COMPONENT ANALYSIS OF TEXTURES based on the article R.Manduchi, J. Portilla, ICA of Textures, The Proc. of the 7 th IEEE Int. Conf. On Comp.
Detecting Pedestrians Using Patterns of Motion and Appearance Paul Viola Microsoft Research Irfan Ullah Dept. of Info. and Comm. Engr. Myongji University.
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
Introduction EE 520: Image Analysis & Computer Vision.
University of Toronto Aug. 11, 2004 Learning the “Epitome” of a Video Sequence Information Processing Workshop 2004 Vincent Cheung Probabilistic and Statistical.
Computer Vision Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications –building representations.
1 Digital Image Processing Dr. Saad M. Saad Darwish Associate Prof. of computer science.
Compression and Analysis of Very Large Imagery Data Sets Using Spatial Statistics James A. Shine George Mason University and US Army Topographic Engineering.
Using Inactivity to Detect Unusual behavior Presenter : Siang Wang Advisor : Dr. Yen - Ting Chen Date : Motion and video Computing, WMVC.
An introduction to audio/video compression Dr. Malcolm Wilson.
Image Processing Basics. What are images? An image is a 2-d rectilinear array of pixels.
Face Detection Ying Wu Electrical and Computer Engineering Northwestern University, Evanston, IL
Generative Models for Image Understanding Nebojsa Jojic and Thomas Huang Beckman Institute and ECE Dept. University of Illinois.
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
Synthesizing Natural Textures Michael Ashikhmin University of Utah.
AAM based Face Tracking with Temporal Matching and Face Segmentation Mingcai Zhou 1 、 Lin Liang 2 、 Jian Sun 2 、 Yangsheng Wang 1 1 Institute of Automation.
Learning Jigsaws for clustering appearance and shape John Winn, Anitha Kannan and Carsten Rother NIPS 2006.
Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge.
© 2009 Robert Hecht-Nielsen. All rights reserved. 1 Andrew Smith University of California, San Diego Building a Visual Hierarchy.
CISC 110 Day 3 Introduction to Computer Graphics.
Text From Corners: A Novel Approach to Detect Text and Caption in Videos Xu Zhao, Kai-Hsiang Lin, Yun Fu, Member, IEEE, Yuxiao Hu, Member, IEEE, Yuncai.
Journal of Visual Communication and Image Representation
Epitome Ji Soo Yi and Woo Young Kim Instructor: Prof. James Rehg April 27, Spring 2004, CS7636 Computational Perception.
Visual and auditory scene analysis using graphical models Nebojsa Jojic
Jianchao Yang, John Wright, Thomas Huang, Yi Ma CVPR 2008 Image Super-Resolution as Sparse Representation of Raw Image Patches.
Motion tracking TEAM D, Project 11: Laura Gui - Timisoara Calin Garboni - Timisoara Peter Horvath - Szeged Peter Kovacs - Debrecen.
Learning Image Statistics for Bayesian Tracking Hedvig Sidenbladh KTH, Sweden Michael Black Brown University, RI, USA
Copyright ©2008, Thomson Engineering, a division of Thomson Learning Ltd.
Introduction to Skin and Face Detection
A Personal Tour of Machine Learning and Its Applications
IMAGE PROCESSING RECOGNITION AND CLASSIFICATION
Spatio-temporal Segmentation of Video by Hierarchical Mean Shift Analysis Daniel DeMenthon SMVP 2002.
A practical guide to learning Autoencoders
Can computers match human perception?
Super-Resolution Image Reconstruction
Multiscale Feature Identification in the Solar Atmosphere
Kocaeli University Introduction to Engineering Applications
RGB-D Image for Scene Recognition by Jiaqi Guo
Transformation-invariant clustering using the EM algorithm
Introduction to Pattern Recognition
Patch-Based Image Classification Using Image Epitomes
A Block Based MAP Segmentation for Image Compression
The EM Algorithm With Applications To Image Epitome
Background Task Fashion image inpainting Some conceptions
Week 7 Presentation Ngoc Ta Aidean Sharghi
Presentation transcript:

University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent Cheung Probabilistic and Statistical Inference Group Electrical & Computer Engineering University of Toronto Toronto, Ontario, Canada Advisor: Dr. Brendan J. Frey

ML Group Meeting, Oct. 18, 2004 Cheung1 / 18 Outline ●Image epitome ► What? ► Why? ► How? ●Implementation computation issues ► Efficiently implementing the learning algorithm ●Video epitome ► Extension to videos ► Filling-in missing information

ML Group Meeting, Oct. 18, 2004 Image Learning Video Cheung2 / 18 Image Epitome ●Jojic, N., Frey, B., & Kannan, A. (2003). Epitomic analysis of appearance and shape. In Proc. IEEE ICCV. ●Miniature, condensed version of the image ●Models the image’s shape and textural components. ●Applications ► object detection ► texture segmentation ► image retrieval ► compression

ML Group Meeting, Oct. 18, 2004 Image Learning Video Cheung3 / 18 Image Epitome Examples

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung4 / 18 Learning the Image Epitome (1)

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung5 / 18 Learning the Image Epitome (2) Epitome Input image Training Set Sample Patches Unsupervised Learning e Z1Z1 Z2Z2 ZMZM … TMTM T2T2 T1T1 Bayesian network e – epitome T k – mapping Z k – image patch E: M:

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung6 / 18 Shifted Cumulative Sum Algorithm

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung7 / 18 Collecting Sufficient Statistics

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung8 / 18 Extending Epitomes to Videos ●Desire a miniature, condensed version of a video sequence ●Want the epitome to model the basic shape, textural, and motion patterns of the video ●Applications ► optic flow ► segmentation ► texture transfer ► layer separation ► compression ► noise reduction ► fill-in / inpainting

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung9 / 18 Input Video Frame 1 Frame 2 Frame 3 Training Set Sample Patches Video Epitome Unsupervised Learning Video Epitome

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung10 / 18 Video Epitome Example Temporally Compressed Spatially Compressed

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung11 / 18 Video Inpainting (1) ●Fill in missing portions of a video ► damaged films ► occluding objects ●Reconstruct the missing pixels from the video epitome

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung12 / 18 Video Inpainting (2)

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung13 / 18 Filling-in Missing Data Likelihood Joint Variational Approx E-Step M-Step

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung14 / 18 Missing Channels ●Generalization of the video inpainting problem ●Inpainting ► Missing entire pixels ●Missing Channels ► Missing one or more of the red, green, or blue (RGB) components of a given pixel ●Epitome must consolidate multiple patches together to piece together the missing channel information ► No training patch contains all the channel information ► Use the epitome to fill-in the missing data

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung15 / 18 Image Missing Channels Fill-in

Image Learning Video ML Group Meeting, Oct. 18, 2004 Cheung16 / 18 Video Missing Channels Fill-in

ML Group Meeting, Oct. 18, 2004 Cheung17 / 18 Conclusion ●Improved the efficiency of learning image epitomes ●Extended the concept of epitomes to video sequences ●Demonstrated the ability of video epitomes to model motion patterns through video inpainting

ML Group Meeting, Oct. 18, 2004 Cheung18 / 18 Future Work ●Determining the size of the epitome ► Dependent on the complexity of the image / video ■Minimum description length ■Variational Bayesian ●Optimal patch size(s) ► Problem specific ●Additional transformations into the epitome ► Rotation ► Scale ●Additional video epitome applications ► Super-resolution ► Layer separation ► Object recognition