Unfolding an Indoor Origami World David Fouhey, Abhinav Gupta, Martial Hebert 1.

Slides:



Advertisements
Similar presentations
Indoor Segmentation and Support Inference from RGBD Images Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus.
Advertisements

Automatic Photo Pop-up Derek Hoiem Alexei A.Efros Martial Hebert Carnegie Mellon University.
HOPS: Efficient Region Labeling using Higher Order Proxy Neighborhoods Albert Y. C. Chen 1, Jason J. Corso 1, and Le Wang 2 1 Dept. of Computer Science.
Generating Classic Mosaics with Graph Cuts Y. Liu, O. Veksler and O. Juan University of Western Ontario, Canada Ecole Centrale de Paris, France.
Putting Objects in Perspective Derek Hoiem Alexei A. Efros Martial Hebert Carnegie Mellon University Robotics Institute.
3rd Workshop On Semantic Perception, Mapping and Exploration (SPME) Karlsruhe, Germany,2013 Semantic Parsing for Priming Object Detection in RGB-D Scenes.
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Analysis of Contour Motions Ce Liu William T. Freeman Edward H. Adelson Computer Science and Artificial Intelligence Laboratory Massachusetts Institute.
A generic model to compose vision modules for holistic scene understanding Adarsh Kowdle *, Congcong Li *, Ashutosh Saxena, and Tsuhan Chen Cornell University,
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Shape Sharing for Object Segmentation
SPONSORED BY SA2014.SIGGRAPH.ORG Annotating RGBD Images of Indoor Scenes Yu-Shiang Wong and Hung-Kuo Chu National Tsing Hua University CGV LAB.
Structured Hough Voting for Vision-based Highway Border Detection
Face Detection, Pose Estimation, and Landmark Localization in the Wild
1 Building a Dictionary of Image Fragments Zicheng Liao Ali Farhadi Yang Wang Ian Endres David Forsyth Department of Computer Science, University of Illinois.
2D Human Pose Estimation in TV Shows Vittorio Ferrari Manuel Marin Andrew Zisserman Dagstuhl Seminar July 2008.
Patch Based Synthesis for Single Depth Image Super-Resolution (ECCV 2012) Oisin Mac Aodha, Neill Campbell, Arun Nair and Gabriel J. Brostow Presented By:
1 Hierarchical Image-Motion Segmentation using Swendsen-Wang Cuts Adrian Barbu Siemens Corporate Research Princeton, NJ Acknowledgements: S.C. Zhu, Y.N.
We propose a successive convex matching method to detect actions in videos. The proposed scheme does not need foreground/background separation, works in.
Silhouettes in Multiview Stereo Ian Simon. Multiview Stereo Problem Input: – a collection of images of a rigid object (or scene) – camera parameters for.
Computer Vision Group University of California Berkeley 1 Learning Scale-Invariant Contour Completion Xiaofeng Ren, Charless Fowlkes and Jitendra Malik.
Robust Higher Order Potentials For Enforcing Label Consistency
Abstract We present a model of curvilinear grouping using piecewise linear representations of contours and a conditional random field to capture continuity.
The plan for today Camera matrix
WORD-PREDICTION AS A TOOL TO EVALUATE LOW-LEVEL VISION PROCESSES Prasad Gabbur, Kobus Barnard University of Arizona.
An Experimental Evaluation on Reliability Features of N-Version Programming Xia Cai, Michael R. Lyu and Mladen A. Vouk ISSRE’2005.
Graph Cut based Inference with Co-occurrence Statistics Ľubor Ladický, Chris Russell, Pushmeet Kohli, Philip Torr.
1 Occlusions – the world is flat without them! : Learning-Based Methods in Vision A. Efros, CMU, Spring 2009.
© 2006 by Davi GeigerComputer Vision April 2006 L1.1 Binocular Stereo Left Image Right Image.
Cue Integration in Figure/Ground Labeling Xiaofeng Ren, Charless Fowlkes and Jitendra Malik, U.C. Berkeley We present a model of edge and region grouping.
Integral Photography A Method for Implementing 3D Video Systems.
3D Scene Models Object recognition and scene understanding Krista Ehinger.
Face Alignment Using Cascaded Boosted Regression Active Shape Models
Evolving Curves/Surfaces for Geometric Reconstruction and Image Segmentation Huaiping Yang (Joint work with Bert Juettler) Johannes Kepler University of.
1 Contours and Junctions in Natural Images Jitendra Malik University of California at Berkeley (with Jianbo Shi, Thomas Leung, Serge Belongie, Charless.
Jifeng Dai 2011/09/27.  Introduction  Structural SVM  Kernel Design  Segmentation and parameter learning  Object Feature Descriptors  Experimental.
Recovering Surface Layout from a Single Image D. Hoiem, A.A. Efros, M. Hebert Robotics Institute, CMU Presenter: Derek Hoiem CS 598, Spring 2009 Jan 29,
Detecting Curved Symmetric Parts using a Deformable Disc Model Tom Sie Ho Lee, University of Toronto Sanja Fidler, TTI Chicago Sven Dickinson, University.
INTRODUCTION Heesoo Myeong and Kyoung Mu Lee Department of ECE, ASRI, Seoul National University, Seoul, Korea Tensor-based High-order.
 Discovery process  Step 1 – Make Observation - Qualitative: non-numerical - Quantitative: numerical.
Acquiring 3D Indoor Environments with Variability and Repetition Young Min Kim Stanford University Niloy J. Mitra UCL/ KAUST Dong-Ming Yan KAUST Leonidas.
The 18th Meeting on Image Recognition and Understanding 2015/7/29 Depth Image Enhancement Using Local Tangent Plane Approximations Kiyoshi MatsuoYoshimitsu.
A New Method for Automatic Clothing Tagging Utilizing Image-Click-Ads Introduction Conclusion Can We Do Better to Reduce Workload?
Context-based vision system for place and object recognition Antonio Torralba Kevin Murphy Bill Freeman Mark Rubin Presented by David Lee Some slides borrowed.
Ch.8 Efficient Coding of Visual Scenes by Grouping and Segmentation Bayesian Brain Tai Sing Lee and Alan L. Yuille Heo, Min-Oh.
Chapter 7 Nonlinear Optimization Models. Introduction The objective and/or the constraints are nonlinear functions of the decision variables. Select GRG.
Object Recognition by Discriminative Combinations of Line Segments and Ellipses Alex Chia ^˚ Susanto Rahardja ^ Deepu Rajan ˚ Maylor Leung ˚ ^ Institute.
Data-driven Image Processing Fubo Han Images in computer graphics IMAGE: the most engaging visual content in the internet. Image Superiority.
Unsupervised Visual Representation Learning by Context Prediction
Coherent Scene Understanding with 3D Geometric Reasoning Jiyan Pan 12/3/2012.
Motion Segmentation at Any Speed Shrinivas J. Pundlik Department of Electrical and Computer Engineering, Clemson University, Clemson, SC.
Fast Human Detection in Crowded Scenes by Contour Integration and Local Shape Estimation Csaba Beleznai, Horst Bischof Computer Vision and Pattern Recognition,
Gaussian Conditional Random Field Network for Semantic Segmentation
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
Modeling Perspective Effects in Photographic Composition Zihan Zhou, Siqiong He, Jia Li, and James Z. Wang The Pennsylvania State University.
1 Resolving the Generalized Bas-Relief Ambiguity by Entropy Minimization Neil G. Alldrin Satya P. Mallick David J. Kriegman University of California, San.
Perceptual real-time 2D-to-3D conversion using cue fusion
Summary of “Efficient Deep Learning for Stereo Matching”
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Semi-Global Stereo Matching with Surface Orientation Priors
Compositional Human Pose Regression
Nonparametric Semantic Segmentation
Wei Liu, Chaofeng Chen and Kwan-Yee K. Wong
Learning to Combine Bottom-Up and Top-Down Segmentation
What is Science?.
Adarsh Kowdle*, Congcong Li*, Ashutosh Saxena, and Tsuhan Chen
A Graph-Matching Kernel for Object Categorization
The Image The pixels in the image The mask The resulting image 255 X
Multiple DAGs Learning with Non-negative Matrix Factorization
Occlusion and smoothness probabilities in 3D cluttered scenes
Presentation transcript:

Unfolding an Indoor Origami World David Fouhey, Abhinav Gupta, Martial Hebert 1

2

3

Local Evidence 4 Hoiem et al. 2005, Saxena et al. 2005, Fouhey et al. 2013, etc.

Constraints 5

Constraints for Single Image 3D 6 Local Smoothness Low Level, Generic Hoiem et al. 2005, Saxena et al. 2005, 2008, Munoz et al., 2009, etc.

Constraints for Single Image 3D 7 Local Smoothness Low Level, Generic Hoiem et al. 2005, Saxena et al. 2005, 2008, Munoz et al., 2009, etc.

Constraints for Single Image 3D 8 Low Level, Generic High Level, Physical

High Level, Physical Constraints for Single Image 3D 9 Local Smoothness Low Level, Generic Coughlan and Yuille 2000, etc.

High Level, Physical Constraints for Single Image 3D 10 Local Smoothness Low Level, Generic Hedau et al. 2009, Del Pero et al., 2011, Wang et al., 2012, Schwing et al. 2012, etc.

High Level, Physical Constraints for Single Image 3D 11 Local Smoothness Low Level, Generic Lee et al. 2010, Xiao et al. 2012, Zhao et al. 2013, Schwing et al., 2013, etc.

High Level, Physical Constraints for Single Image 3D 12 Low Level, Generic

Mid-level in the Past 13 Huffman 71, Clowes 71, Kanade 80, 81 Sugihara 86, Malik 87, etc.

Our Mid-Level Constraints 14

This Work 15 Input: Single Image Output: Discrete Scene Parse

Overview 16 ParameterizationFormulation Experimental Results

Overview 17 ParameterizationFormulation Experimental Results

Parameterization 18

Parameterization 19 vp 1 vp 2 vp 3 VP Estimator from Hedau et al., 2009

Parameterization 20 Two VPs give grid cell

Encoding Surface Normals 21

Encoding Surface Normals 22

Encoding Surface Normals 23

Encoding Surface Normals 24 x 1,…, x 400 x 401,…, x 800 x 801,…, x 1200

Related Parameterizations 25 vp 1 vp 3 x2x2 x1x1 x3x3 x4x4 Hedau et al., 2009; Wang et al. 2010, Schwing et al., 2012, 2013 vp 2

Overview 26 ParameterizationFormulation Experimental Results

Parameterization 27

Formulation 28

Unaries 29

Unaries 30 Low c Any 3D Evidence High c

Unaries 31 … Input3D Primitive Bank Local: Data-Driven 3D Primitives Fouhey, Gupta, Hebert, 2013

Unaries 32 Hedau, Hoiem, Forsyth, 2009 Global: Cuboid Fit + Clutter Mask InputPredicted WallsClutter Mask

Binaries 33

Convex/Concave Constraints 34 Convex (+) Concave (-)

Convex/Concave Constraints 35 Detected Concave (-)

Convex/Concave Constraints 36 Detected Concave (-)

Convex/Concave Constraints 37 Detected Concave (-)

Convex/Concave Constraints 38 Detected Concave (-)

Convex/Concave Constraints 39 Detected Concave (-)

Detecting Convex/Concave 40 Ground-Truth Discontinuities similar to Gupta, Arbelaez, Malik, DP from Fouhey, Gupta, Hebert, 2013 Input3D Primitive Bank … Use 3DP to Transfer Discontinuities

Smoothness 41

Constraints 42

Solving the Model 43

Overview 44 ParameterizationFormulation Experimental Results

Dataset NYU Depth v2: 795 Train, 654 Test 45

Qualitative Results 46

Qualitative Results 47

Qualitative Results 48

Qualitative Results 49

Surface Connection Graphs 50 ConvexConcave

Baseline Primary Baseline: 3D Primitives (3DP) 51 Fouhey, Gupta, Hebert, 2013 OutputInput

Quantitative Results 52 Summary Stats (⁰) (Lower Better) % Good Pixels (Higher Better) 11.25⁰22.5⁰30⁰MeanMedian 3DP Hedau et al RMSE Lee et al Proposed

Quantitative Results 53

Failure Modes Mistaken but Confident Evidence 54

Failure Modes 55 Missing High-Level Modeling

Conclusions 56 Single Image Parameterization Formulation Discrete Parse

Thank You 57