Style/Content separation Evgeniy Bart, Dan Levi April 13, 2003.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Filling Algorithms Pixelwise MRFsChaos Mosaics Patch segments are pasted, overlapping, across the image. Then either: Ambiguities are removed by smoothing.
Active Appearance Models
Active Shape Models Suppose we have a statistical shape model –Trained from sets of examples How do we use it to interpret new images? Use an “Active Shape.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction.
Amir Hosein Omidvarnia Spring 2007 Principles of 3D Face Recognition.
Introduction to Data-driven Animation Jinxiang Chai Computer Science and Engineering Texas A&M University.
Overview of Texture Synthesis Ganesh Ramanarayanan Cornell Graphics Seminar.
A Study of Approaches for Object Recognition
Model-Based Stereo with Occlusions
Segmentation Divide the image into segments. Each segment:
CONTENT BASED FACE RECOGNITION Ankur Jain 01D05007 Pranshu Sharma Prashant Baronia 01D05005 Swapnil Zarekar 01D05001 Under the guidance of Prof.
Face detection and recognition Many slides adapted from K. Grauman and D. Lowe.
Face Recognition using PCA (Eigenfaces) and LDA (Fisherfaces)
Real-time Combined 2D+3D Active Appearance Models Jing Xiao, Simon Baker,Iain Matthew, and Takeo Kanade CVPR 2004 Presented by Pat Chan 23/11/2004.
Curve Analogies Aaron Hertzmann Nuria Oliver Brain Curless Steven M. Seitz University of Washington Microsoft Research Thirteenth Eurographics.
Face Recognition Based on 3D Shape Estimation
Face Recognition Jeremy Wyatt.
Lecture 9: Image alignment CS4670: Computer Vision Noah Snavely
Active Appearance Models Computer examples A. Torralba T. F. Cootes, C.J. Taylor, G. J. Edwards M. B. Stegmann.
3D Geometry for Computer Graphics
Camera parameters Extrinisic parameters define location and orientation of camera reference frame with respect to world frame Intrinsic parameters define.
CSCE 641 Computer Graphics: Image Registration Jinxiang Chai.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
CS 485/685 Computer Vision Face Recognition Using Principal Components Analysis (PCA) M. Turk, A. Pentland, "Eigenfaces for Recognition", Journal of Cognitive.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Image Analogies Aaron Hertzmann (1,2) Charles E. Jacobs (2) Nuria Oliver (2) Brian Curless (3) David H. Salesin (2,3) 1 New York University 1 New York.
SVD(Singular Value Decomposition) and Its Applications
Manifold learning: Locally Linear Embedding Jieping Ye Department of Computer Science and Engineering Arizona State University
Recognition Part II Ali Farhadi CSE 455.
Face Recognition and Feature Subspaces
Face Recognition and Feature Subspaces
Need volunteers… From Monday’s paper: A simple story about representations Input signal: a moving edge. Model it using an auto-regressive model, Using.
1 Graph Embedding (GE) & Marginal Fisher Analysis (MFA) 吳沛勳 劉冠成 韓仁智
Multimodal Interaction Dr. Mike Spann
1 ECE 738 Paper presentation Paper: Active Appearance Models Author: T.F.Cootes, G.J. Edwards and C.J.Taylor Student: Zhaozheng Yin Instructor: Dr. Yuhen.
INDEPENDENT COMPONENT ANALYSIS OF TEXTURES based on the article R.Manduchi, J. Portilla, ICA of Textures, The Proc. of the 7 th IEEE Int. Conf. On Comp.
1 Recognition by Appearance Appearance-based recognition is a competing paradigm to features and alignment. No features are extracted! Images are represented.
Course 13 Curves and Surfaces. Course 13 Curves and Surface Surface Representation Representation Interpolation Approximation Surface Segmentation.
Computer Vision Lab. SNU Young Ki Baik Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
CSE 185 Introduction to Computer Vision Face Recognition.
Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.
Edge Detection and Geometric Primitive Extraction Jinxiang Chai.
Separating Style and Content with Bilinear Models Joshua B. Tenenbaum, William T. Freeman Computer Examples Barun Singh 25 Feb, 2002.
Image Classification for Automatic Annotation
Multimodal Interaction Dr. Mike Spann
2D Texture Synthesis Instructor: Yizhou Yu. Texture synthesis Goal: increase texture resolution yet keep local texture variation.
Supervisor: Nakhmani Arie Semester: Winter 2007 Target Recognition Harmatz Isca.
ELE 488 Fall 2006 Image Processing and Transmission ( )
Distinctive Image Features from Scale-Invariant Keypoints
Texture Synthesis by Image Quilting CS766 Class Project Fall 2004 Eric Robinson.
Irfan Ullah Department of Information and Communication Engineering Myongji university, Yongin, South Korea Copyright © solarlits.com.
Statistical Models of Appearance for Computer Vision 主講人:虞台文.
Irena Váňová. B A1A1. A2A2. A3A3. repeat until no sample is misclassified … labels of classes Perceptron algorithm for i=1...N if then end * * * * *
Face detection and recognition Many slides adapted from K. Grauman and D. Lowe.
Recognizing specific objects Matching with SIFT Original suggestion Lowe, 1999,2004.
Hough Transform CS 691 E Spring Outline Hough transform Homography Reading: FP Chapter 15.1 (text) Some slides from Lazebnik.
Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
University of Ioannina
Lecture 8:Eigenfaces and Shared Features
Face Recognition and Feature Subspaces
Recognition: Face Recognition
Outline Multilinear Analysis
In summary C1={skin} C2={~skin} Given x=[R,G,B], is it skin or ~skin?
Object Modeling with Layers
Turning to the Masters: Motion Capturing Cartoons
Separating Style and Content with Bilinear Models Joshua B
Separating Style and Content with Bilinear Models Joshua B
Presentation transcript:

Style/Content separation Evgeniy Bart, Dan Levi April 13, 2003

Artistic styles Photograph

Artistic styles Impressionist

Artistic styles Expressionist

Artistic styles Pointillist

Photographic styles *Pictures by Aya Aner-Wolf

Fonts ABCDE ABCDE ABCDE ABCDE Style Content

Faces *Images from FERET database

Tasks Extrapolation: Extrapolation of familiar style to new content

Tasks Extrapolation: Extrapolation of familiar content to new style

Tasks Translation:

Task specification by analogy : : ? Image analogies, Hertzmann et al.

Wei&Levoy Ashikhmin

Region growing Somewhat similar to quilting *Picture from presentation by Tal and Zeev

Region growing

Combining the two … : :

… hierarchically : :

: :

: :

Selecting best match

Arbitrating ? yes no Use Wei&Levoy value Use Ashikhmin value

Arbitrating ? yes no Use Wei&Levoy value Use Ashikhmin value

Arbitrating ? yes no Use Wei&Levoy value Use Ashikhmin value

: : Results – artistic filters

: :

: :

Super-resolution : Training 1

Super-resolution : Training 2

Super-resolution : Training 3

Super-resolution Results :

Super-resolution Results :

A lonely pine is standing In the North where high winds blow. He sleeps; and the whitest blanket wraps him in ice and snow. He dreams - dreams of a palm-tree that far in an Orient land Languishes, lonely and drooping, Upon the burning sand. H. Heine, translated by L. Untermeyer

Texture by numbers : :

Parameters annEpsilon: [float] = ashLastLevel: [bool] = false biasPenalty: [float] = cheesyBoundaries: [bool] = true coherenceEps: [float] = coherencePow: [float] = createSrcLocHisto: [bool] = false decayWeight: [double] = filterColorspace: [enum] = {Lab, Luv, RGB, XYZ} filterMM: [string] = (none!) filterModeMask: [string] = (none!) filterProcedure: [enum] = {Copy, Synthesize} filteredFeatureType: [enum] = {Difference, Raw} filteredPyramidType: [enum] = {Gaussian, Laplacian} finalSourceFac: [float] = gainPenalty: [float] = heurAnnEpsilon: [float] = heurMaxTSVQDepth [int] = 7 histogramEq: [bool] = false levelWeighting: [float] = matchBtoA: [bool] = false matchGrayHistogram: [bool] = false matchMeanVariance: [bool] = false maxTSVQDepth: [int] = 20 maxTSVQError: [float] = modeMaskWeight: [float] = neighborhoodWidth: [int] = 5 pyramidType: [enum] = {Gaussian, Laplacian, Steerable} samplerEpsilon: [float] = searchMethod: [enum] = {ANN, Ash, HeurANN, HeurTSVQ, Image, MLP, TSVQ, TSVQR, Vector} sourceColorspace: [enum] = {Lab, Luv, RGB, XYZ} srcWeight: [float] = targetMM: [string] = (none!) targetModeMask: [string] = (none!) useBias: [bool] = false useFilter: [bool] = true useFilterModeMask: [bool] = false useGain: [bool] = false useInterface: [bool] = true useRandomStart: [bool] = true useSigmoidalDecay: [bool] = false useSplineWeights: [bool] = true useTargetModeMask: [bool] = false useYIQ: [bool] = false numHiddenNeurons: [int] = 20 numLevels: [int] = 2 numPasses: [int] = 1 numTSVQBacktracks: [int] = 8 onePixelSource: [bool] = false oneway: [bool] = false pyramidHeight: [int] = 4

3D rotation : :

: :

: :

: :

: :

What went wrong? There is some structure, but not simple correspondence Need more knowledge about objects

Rectangular parallelepipeds (cuboids)

Representation by 3D point coordinates Linear classes, Vetter&Poggio

May combine linearly += + =

Only 3 dimensions ++= = Call it linear class

Linear operators Linear operator L

Example: rotation ++= =

Rotation If Then

Example: projection ++= =

Projection If Then

Example: projection + rotation ++= =

Rotation + projection If Then But also We may work entirely in 2D domain!

+ + Working in 2D : : + +

Results : :

: :

Can we apply this idea to faces? Linear class assumption: Object may be represented as linear combination of other (similar) objects ++ = Use raw images as basis Reconstruction quality will be poor PCA reconstruction is much better Can we use PCA?

+ + Using linear classes : : + +

Eigenfaces do not correspond Cannot use the same coefficients

So far: Solving Specific Tasks (Image analogies), Linear Classes Learn Interaction Generalize To New Examples Goal : General Style/Content Framework

Motivation : Linear Models Faces Images Form A linear subspace Illumination variations (of same face) can be modeled by a low-dimensional linear space (Hallinan ‘94) Eigenvector Basis Reconstruct faces Model: Linear In Style And In Content eigenfaces (Turk, Pentland ‘91)

Bilinear Models (Tenenbaum, Freeman 2000,97) is bilinear if : Linear in x Linear in y y constant

Bilinear Forms : Example x,y ∈ Real f(x,y) = xy Bilinear Forms To Model Style And Content 2 Models : Symmetric, And Asymmetric

K Image in Style s Content c Style vector I J Content vector IXJ Interaction Matrix Symmetric Bilinear Model

K Image in Style s Content c J Content vector Symmetric Bilinear Model Pixel Style vector J Style Matrix AsAs JXK

Symmetric Bilinear Model : basis vectors

Toy Example: Symmetric Model Images: Style: Color Content: Shape (0,0,1,1) (2,2,2,0) (0,3,0,3) Content: (0,0,1,1) (1,0,1,0) (0,1,1,1) Style:

Face Example - Symmetric Style: Pose, Content: Person

K Image in Style s Content c Style vector I J Content vector IXJ Interaction Matrix Asymmetric Bilinear Model WksWks

K Image in Style s Content c J Content vector Asymmetric Bilinear Model Pixel Style vector J Style Matrix AsAs JXK A s : Content  Images

K Image in Style s Content c J Content vector Asymmetric Bilinear Model J

……… J K Image in Style s Content c A style specific basis Mixed by content coefficients

Content: (0,0,1,1) (1,0,1,0) (0,1,1,1) Toy Example: Asymmetric Model Style:

Face Example - Asymmetric Style: Pose, Content: Person

Training Person Illumination Style Content Image Matrix:

Training – Model Fitting Problem : Given {y sc (t) } t = 1..T find model parameters Error Minimization: Asymmetric: Closed SVD solution or Quasi-Newton methods  A s, b c Free parameter: J – content vector dimension. Symmetric: Iterative solution using SVD  a s, W k, b c

Content: person Style: illumination Translation - Faces Asymmetric Model Cannot handle translation!

Problem: C = 23 (faces), S = 3 (illuminations) Training : Fit a symmetric model using iterative SVD with I = S, J = C  a s, W k, b c Generalization: find a s`, b c ` that minimize E* =  k | y k s`c` - a s` W k b c` | 2 Alternating Iterative Linear Solution Translation : Produce a s` W k b c, a s W k b c` for each s and c. Translation – Symmetric Model

Translation - Results

Extrapolation Content: Letter, Style: Font Main Problem : Image Representation Linear combinations of letters should look like a letter.

A Displacement Vector Warp Map

Coulomb Warp Map For unique mapping: physical model of electrostatic forces. Linear combination of letters looks like a letter

Extrapolation Scheme Fit An Asymmetric bilinear model S = 5 training fonts (styles), C = 62 characters (content), K=2888 data dim. closed-form SVD  A s, b c C={c 1,…,c M } letters in a new style s’  find best fitting A s’ Minimize : E* =  c ║y s’c - A s’ b c ║  ∂E*/ ∂ A s’ = 0  Set J High(~60) Overfitting on Test Data (173,280 degrees of freedom!)

Constraint: Close To Symmetric A OLC =  s α s A s (Optimal Linear Combination) α s - style parameters (symmetric) Minimize: E* =  c ║y s’c - A s’ b c ║ + λ║ A s’ - A OLC ║ ∂E*/ ∂ A s’ = 0 Extrapolate missing letters by: y s’c = A s’ b c

Results

Symmetric vs. Aymmetric Can reduce dimensionality of factors Learns the structure of factor interactions : handles translation More Flexible Too flexible: overfitting Cannot handle translation Can be overcome by combining both

Bilinear Models (Tenenbaum, Freeman 2000,97) General framework for two factor problems Explicit parameterized representations of each factor and their interaction Natural generalization for extrapolation and translation tasks Fast algorithms (SVD) Pros: Cons: Assumes Linearity In Each Factor Find Clever Input Representations Decompose To Sub Problems

Example-Based Style Synthesis ( Ido Drori Hezi Yeshurun Daniel Cohen-Or 03)

Algorithm Outline 1.Divide Image To Overlapping Tiles

Algorithm Outline 1.Divide Image To Overlapping Tiles 2. Find Best Match In Each Scene

Algorithm Outline 1.Divide Image To Overlapping Tiles 2. Find Best Match In Each Scene 3.Synthesize tiles By Bilinear Model

Algorithm Outline 1.Divide Image To Overlapping Tiles 2. Find Best Match In Each Scene 3.Synthesize tiles By Bilinear Model 4. Image Quilting

Algorithm Outline 1.Decompose Image To Tiles 2. Find Best Match In Each Scene 3.Synthesize tiles By Bilinear Model 4. Image Quilting 5. Image Analogies

Create Gaussian Pyramids For Examples And Input Images Apply Algorithm To Each Level From Coarse To Fine

Finding Best Matching Fragment Similar GeometryAgreeing Boundaries V search = (,,,,, ) Gradient Laplacian Luminance For Each Training Scene : Create V In every Position And Orientation Search For Nearest Neighbor