Segmentation of Building Facades using Procedural Shape Priors

Slides:

Advertisements

Similar presentations

A. Criminisi, J. Shotton, S. Bucciarelli and K. Siddiqui

Advertisements

Applications of one-class classification

Bayesian Belief Propagation

O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.

1 Architecture based modeling of buildings Filip Van den Borre.

The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.

Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.

Learning to Combine Bottom-Up and Top-Down Segmentation Anat Levin and Yair Weiss School of CS&Eng, The Hebrew University of Jerusalem, Israel.

Clustering & image segmentation Goal::Identify groups of pixels that go together Segmentation.

Robust Object Tracking via Sparsity-based Collaborative Model

GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.

Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1, Lehigh University.

Special Topic on Image Retrieval Local Feature Matching Verification.

Contextual Classification with Functional Max-Margin Markov Networks Dan MunozDrew Bagnell Nicolas VandapelMartial Hebert.

Models for Scene Understanding – Global Energy models and a Style-Parameterized boosting algorithm (StyP-Boost) Jonathan Warrell, 1 Simon Prince, 2 Philip.

Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,

Robust Higher Order Potentials For Enforcing Label Consistency

Schedule Introduction Models: small cliques and special potentials Tea break Inference: Relaxation techniques:

Real-Time Non-Rigid Shape Recovery via AAMs for Augmented Reality Jackie Zhu Oct. 24, 2006.

Self-Supervised Segmentation of River Scenes Supreeth Achar *, Bharath Sankaran ‡, Stephen Nuske *, Sebastian Scherer *, Sanjiv Singh * * ‡

Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.

Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2005 with a lot of slides stolen from Steve Seitz and.

Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.

K-means Based Unsupervised Feature Learning for Image Recognition Ling Zheng.

Matthew Brown University of British Columbia (prev.) Microsoft Research [ Collaborators: † Simon Winder, *Gang Hua, † Rick Szeliski † =MS Research, *=MS.

Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.

Computer Vision James Hays, Brown

Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)

1 Mean shift and feature selection ECE 738 course project Zhaozheng Yin Spring 2005 Note: Figures and ideas are copyrighted by original authors.

Procedural Modeling of Architectures towards 3D Reconstruction Nikos Paragios Ecole Centrale Paris / INRIA Saclay Ile-de-France Joint Work: P. Koutsourakis,

Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.

CSE 185 Introduction to Computer Vision Pattern Recognition 2.

ALIP: Automatic Linguistic Indexing of Pictures Jia Li The Pennsylvania State University.

Texture We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.

80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.

Supervised Learning of Edges and Object Boundaries Piotr Dollár Zhuowen Tu Serge Belongie.

INTRODUCTION Heesoo Myeong and Kyoung Mu Lee Department of ECE, ASRI, Seoul National University, Seoul, Korea Tensor-based High-order.

MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.

MACHINE LEARNING 8. Clustering. Motivation Based on E ALPAYDIN 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2  Classification problem:

Peter Henry1, Michael Krainin1, Evan Herbst1,

Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis Kei Hashimoto, Yoshihiko Nankaku, and Keiichi.

Topic Models Presented by Iulian Pruteanu Friday, July 28 th, 2006.

Associative Hierarchical CRFs for Object Class Image Segmentation

Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.

Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.

ICCV 2007 Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1,

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

Jianchao Yang, John Wright, Thomas Huang, Yi Ma CVPR 2008 Image Super-Resolution as Sparse Representation of Raw Image Patches.

Edge Preserving Spatially Varying Mixtures for Image Segmentation Giorgos Sfikas, Christophoros Nikou, Nikolaos Galatsanos (CVPR 2008) Presented by Lihan.

1 Kernel Machines A relatively new learning methodology (1992) derived from statistical learning theory. Became famous when it gave accuracy comparable.

Learning Image Statistics for Bayesian Tracking Hedvig Sidenbladh KTH, Sweden Michael Black Brown University, RI, USA

Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.

? Shape Grammar and Image based 3D reconstruction of buildings

Course : T Computer Vision

Hybrid Deep Learning for Reflectance Confocal Microscopy Skin Images

Krishna Kumar Singh, Yong Jae Lee University of California, Davis

Nonparametric Semantic Segmentation

Machine Learning Basics

Mean Shift Segmentation

Dynamical Statistical Shape Priors for Level Set Based Tracking

Edges/curves /blobs Grammars are important because:

Learning to Combine Bottom-Up and Top-Down Segmentation

A Bayesian Estimation of Building Shape using MCMC

Image Parsing & DDMCMC. Alan Yuille (Dept. Statistics. UCLA)

PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD

Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.

Shashi Shekhar Weili Wu Sanjay Chawla Ranga Raju Vatsavai

Outline Background Motivation Proposed Model Experimental Results

“Traditional” image segmentation

Using simple machine learning for image segmentation

Presentation transcript:

Segmentation of Building Facades using Procedural Shape Priors Olivier Teboul, Loïc Simon Panagiotis Koutsourakis and Nikos Paragios

Introduction 3D Urban Modeling is of increasing interest Navigation systems, online applications, video games Huge amount of images available online (bing maps, google maps, flickr) Facade analysis plays a central role  semantic segmentation

Problem Rectified Facade image Segmentation into architectural classes

Problem Rectified Facade image Segmentation into architectural classes windows

Problem Rectified Facade image Segmentation into architectural classes windows walls

Problem Rectified Facade image Segmentation into architectural classes windows walls balconies

Problem Rectified Facade image Segmentation into architectural classes windows walls balconies doors

Problem Rectified Facade image Segmentation into architectural classes windows walls balconies doors roofs

Problem Rectified Facade image Segmentation into architectural classes windows walls balconies doors roofs sky

Problem Rectified Facade image Segmentation into architectural classes windows walls balconies doors roofs sky shops

Problem Rectified Facade image Segmentation into architectural classes windows walls balconies doors roofs sky shops Enforce architectural constraints

Problem Rectified Facade image Segmentation into architectural classes windows walls balconies doors roofs sky shops Enforce architectural constraints alignment

Problem Rectified Facade image Segmentation into architectural classes windows walls balconies doors roofs sky shops Enforce architectural constraints alignment consistent topology

Problem Rectified Facade image Segmentation into architectural classes windows walls balconies doors roofs sky shops Enforce architectural constraints alignment consistent topology procedural shape prior : The segmentation should be the result of a shape grammar derivation

Related Work Classic Segmentation methods Mean Shift, Level Set, MRF-based methods, Normalized Cut  Lack semantic information Semantic Segmentation methods He et al. CVPR 04, Shotton et al. ECCV 06  Do not guarantee architectural consistency ! Grammar-based methods Image-driven (bottom-up): Dick et al. ICCV 01, Müller et al. 07, Koutsourakis et al. 09  Based on self-similarity measure Grammar-driven (top-down): Alegre & Dellaert 04 , Ripperda et al. DAGM 06  Exploits the input grammar Recent Work: Vanegas et al. CVPR10, Toshev et al. CVPR10

Shape grammar [Stiny 72] Dictionary of shapes window wall balcony door roof sky shop facade floor groundfloor attic image tile

Shape grammar [Stiny 72] Dictionary of shapes Set of Replacement Rules window wall balcony door roof sky shop facade floor groundfloor attic image tile facade

Split Grammars [Wonka 03] A rule can only split a shape into several one along an axis Example: replace a floor by a sequence of walls and tiles A rule is characterized by the parameter vector W The dimension of W depends on the rule floor wall tile w1 w2 w3 w4 w5

Procedural Modeling of Facades Start from an axiom (Image) Sequentially apply replacement rules The derivation tree keeps track of the building structure

Procedural Modeling of Facades Start from an axiom (Image) Sequentially apply replacement rules The derivation tree keeps track of the building structure

Procedural Modeling of Facades Start from an axiom (Image) Sequentially apply replacement rules The derivation tree keeps track of the building structure

Procedural Modeling of Facades Start from an axiom (Image) Sequentially apply replacement rules The derivation tree keeps track of the building structure

Procedural Modeling of Facades Start from an axiom (Image) Sequentially apply replacement rules The derivation tree keeps track of the building structure To be optimized : rule selection and split parameters

Segmentation energy

Segmentation energy Single Pixel x Feature vector

Segmentation energy Single Pixel x

Segmentation energy Single Pixel x Single Region R R

Segmentation energy Single Pixel x Single Region R Segmentation π

Segmentation energy

Pixel-wise MAP classification Supervised Learning Goal : learning a discriminative model of p(c|fx) Feature vector fx is a patch around x Randomized Forest classifiers [Breiman ML Journal 01, Lepetit & Fua PAMI 06] input Pixel-wise MAP classification Window probability Wall probability (red= 0 blue = 1) … Ground truth (color = class)

Optimization : grammar factorization Independent sampling of rules may lead to inconsistent buildings Idea : tie together the derivation of some architectural classes Factorization Independent derivation

Optimization : grammar factorization Reduces the dimension of the space of shapes Factorization is a natural way to fight the curse of dimensionality Allows a fixed representation of facades (independently from the layout topology). A segmentation is described by a fixed sequence of rules : 1 rule to split the image into sub-regions  r1 1 rule to split a façade into floors  r2 1 rule to split a floor into walls and tiles  r3 1 rules to split a tile into window and balcony  r4 … Segmentation π = (r1, …, rM)

Optimization: algorithm Start from an initial seed π0 = (r10, r20, …, rM0) π0

Optimization: algorithm Start from an initial seed π0 = (r10, r20, …, rM0) Gaussian perturbations of all the rules πi = (r1i, r2i, …, rMi) π0

Optimization: algorithm Start from an initial seed π0 = (r10, r20, …, rM0) Gaussian perturbations of all the rules πi = (r1i, r2i, …, rMi) Keep the best segmentation as new seed v π1

Optimization: algorithm Start from an initial seed π0 = (r10, r20, …, rM0) Gaussian perturbations of all the rules πi = (r1i, r2i, …, rMi) Keep the best segmentation as new seed Iterate N times

Segmentation with Procedural Shape Priors Quantitative Results Pixel-wise MAP 20 training images 10 test images Segmentation with Procedural Shape Priors

Analysis The shape grammar introduces a context : each class has fewer challengers Disambiguation thanks to grammar consistency : take advantage of the repetitions over the facade

Qualitative Results Architecturally consistent Robust to illumination conditions, hard cast shadow, reflections

Conclusion Procedural Shape Prior for Semantic Segmentation Grammar Factorization Source code for Randomized Forest http://www.mas.ecp.fr/vision/Personnel/teboul/source_code.html Database of Parisian facades http://www.mas.ecp.fr/vision/Personnel/teboul/data.html

Future Work Exploring more efficient optimization techniques Optimize the rule selection Robust algorithms inspired from grammar parsing Coupling with multiple views towards finer modeling Image-based modeling Other grammars and architectures Thank you!

Questions ?