Structure Recovery by Part Assembly Chao-Hui Shen 1 Hongbo Fu 2 Kang Chen 1 Shi-Min Hu 1 1 Tsinghua University 2 City University of Hong Kong.

Slides:



Advertisements
Similar presentations
Indoor Segmentation and Support Inference from RGBD Images Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus.
Advertisements

A Search-Classify Approach for Cluttered Indoor Scene Understanding Liangliang Nan 1, Ke Xie 1, Andrei Sharf 2 1 SIAT, China 2 Ben Gurion University, Israel.
Ying Cao Antoni B. ChanRynson W.H. Lau City University of Hong Kong.
For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
Yunhai Wang 1 Minglun Gong 1,2 Tianhua Wang 1,3 Hao (Richard) Zhang 4 Daniel Cohen-Or 5 Baoquan Chen 1,6 5 Tel-Aviv University 4 Simon Fraser University.
Creating Consistent Scene Graphs Using a Probabilistic Grammar Tianqiang LiuSiddhartha ChaudhuriVladimir G. Kim Qi-Xing HuangNiloy J. MitraThomas Funkhouser.
SPONSORED BY SA2014.SIGGRAPH.ORG Annotating RGBD Images of Indoor Scenes Yu-Shiang Wong and Hung-Kuo Chu National Tsing Hua University CGV LAB.
Hongliang Li, Senior Member, IEEE, Linfeng Xu, Member, IEEE, and Guanghui Liu Face Hallucination via Similarity Constraints.
Automatic scene inference for 3D object compositing Kevin Karsch (UIUC), Sunkavalli, K. Hadap, S.; Carr, N.; Jin, H.; Fonte, R.; Sittig, M., David Forsyth.
SIGGRAPH Paper Reading 2011 Huang Haibin
3D Shape Representation Tianqiang 04/01/2014. Image/video understanding Content creation Why do we need 3D shapes?
A Probabilistic Model for Component-Based Shape Synthesis Evangelos Kalogerakis, Siddhartha Chaudhuri, Daphne Koller, Vladlen Koltun Stanford University.
Creating Consistent Scene Graphs Using a Probabilistic Grammar Tianqiang LiuSiddhartha ChaudhuriVladimir G. Kim Qi-Xing HuangNiloy J. MitraThomas Funkhouser.
1 Building a Dictionary of Image Fragments Zicheng Liao Ali Farhadi Yang Wang Ian Endres David Forsyth Department of Computer Science, University of Illinois.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
A New Suffix Tree Similarity Measure for Document Clustering Hung Chim, Xiaotie Deng City University of Hong Kong WWW 2007 Session: Similarity Search April.
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
PARAMETRIC RESHAPING OF HUMAN BODIES IN IMAGES SIGGRAPH 2010 Shizhe Zhou Hongbo Fu Ligang Liu Daniel Cohen-Or Xiaoguang Han.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Introduction to Data-driven Animation Jinxiang Chai Computer Science and Engineering Texas A&M University.
Example Based 3D Shape Completion Mark Pauly 1,2, Niloy J. Mitra 1, Joachim Giesen 2, Markus Gross 2, Leonidas J. Guibas 1 1 Stanford University 2 ETH,
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
A Study of Approaches for Object Recognition
High-Quality Video View Interpolation
Abstract Overall Algorithm Target Matching Error Checking: By comparing what we transform from Kinect Camera coordinate to robot coordinate with what we.
Prior Knowledge for Part Correspondence Oliver van Kaick 1, Andrea Tagliasacchi 1, Oana Sidi 2, Hao Zhang 1, Daniel Cohen-Or 2, Lior Wolf 2, Ghassan Hamarneh.
Learning Table Extraction from Examples Ashwin Tengli, Yiming Yang and Nian Li Ma School of Computer Science Carnegie Mellon University Coling 04.
Salient Object Detection by Composition
The Three R’s of Vision Jitendra Malik.
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
Image Recognition using Hierarchical Temporal Memory Radoslav Škoviera Ústav merania SAV Fakulta matematiky, fyziky a informatiky UK.
Context-based Surface Completion Andrei Sharf, Marc Alexa, Daniel Cohen-Or.
An Algebraic Model for Parameterized Shape Editing Martin Bokeloh, Stanford Univ. Michael Wand, Saarland Univ. & MPI Hans-Peter Seidel, MPI Vladlen Koltun,
REAL-TIME VIDEO EFFECTS USING A DEPTH SENSOR Robbie Fraser and Ridho Jeftha.
Content-Based Image Retrieval
ALIGNMENT OF 3D ARTICULATE SHAPES. Articulated registration Input: Two or more 3d point clouds (possibly with connectivity information) of an articulated.
Image-based Plant Modeling Zeng Lanling Mar 19, 2008.
Recovering Surface Layout from a Single Image D. Hoiem, A.A. Efros, M. Hebert Robotics Institute, CMU Presenter: Derek Hoiem CS 598, Spring 2009 Jan 29,
SPONSORED BY Data-driven Segmentation and Labeling of Freehand Sketches Zhe Huang, Hongbo Fu, Rynson W.H. Lau City University of Hong Kong.
Detecting Curved Symmetric Parts using a Deformable Disc Model Tom Sie Ho Lee, University of Toronto Sanja Fidler, TTI Chicago Sven Dickinson, University.
Disclosure risk when responding to queries with deterministic guarantees Krish Muralidhar University of Kentucky Rathindra Sarathy Oklahoma State University.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
IIIT Hyderabad Learning Semantic Interaction among Graspable Objects Swagatika Panda, A.H. Abdul Hafez, C.V. Jawahar Center for Visual Information Technology,
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
Acquiring 3D Indoor Environments with Variability and Repetition Young Min Kim Stanford University Niloy J. Mitra UCL/ KAUST Dong-Ming Yan KAUST Leonidas.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
The 18th Meeting on Image Recognition and Understanding 2015/7/29 Depth Image Enhancement Using Local Tangent Plane Approximations Kiyoshi MatsuoYoshimitsu.
Temporally Coherent Completion of Dynamic Shapes AUTHORS:HAO LI,LINJIE LUO,DANIEL VLASIC PIETER PEERS,JOVAN POPOVIC,MARK PAULY,SZYMON RUSINKIEWICZ Presenter:Zoomin(Zhuming)
2D Texture Synthesis Instructor: Yizhou Yu. Texture synthesis Goal: increase texture resolution yet keep local texture variation.
Sponsored by Deformation-Driven Topology-Varying 3D Shape Correspondence Ibraheem Alhashim Kai Xu Yixin Zhuang Junjie Cao Patricio Simari Hao Zhang Presenter:
Journal of Visual Communication and Image Representation
SA2014.SIGGRAPH.ORG SPONSORED BY Automatic Semantic Modeling of Indoor Scenes from Low-quality RGB-D Data using Contextual Information Kang Chen 1 Yu-Kun.
Change Blindness Images Li-Qian Ma 1, Kun Xu 1, Tien-Tsin Wong 2, Bi-Ye Jiang 1, Shi-Min Hu 1 1 Tsinghua University 2 The Chinese University of Hong Kong.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Outline ● Introduction – What is the problem ● Generate stochastic textures ● Improve realism ● High level approach - Don't just jump into details – Why.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Coherent Scene Understanding with 3D Geometric Reasoning Jiyan Pan 12/3/2012.
Outline ● Introduction – What is the problem ● Generate stochastic textures ● Improve realism ● High level approach - Don't just jump into details – Why.
CENG 789 – Digital Geometry Processing 10- Data-Driven Shape Analysis
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
Automated Geo-referencing of Images Dr. Ronald Briggs Yan Li GeoSpatial Information Sciences The University.
Course: Structure-Aware Shape Processing Introduction to Geometric ‘Structure’ Extracting Structures –analysis of Individual Models –analysis of Shape.
Structure Recovery by Part Assembly Chao-Hui Shen, Hongbo Fu, Kang Chen and Shi-Min Hu Tsinghua University City University of Hong Kong Presented by: Chenyang.
Student: Shunan Shi Professor: Hao Zhang CMPT888.
Image Retrieval and Ranking using L.S.I and Cross View Learning Sumit Kumar Vivek Gupta
Modeling Perspective Effects in Photographic Composition Zihan Zhou, Siqiong He, Jia Li, and James Z. Wang The Pennsylvania State University.
Fast Preprocessing for Robust Face Sketch Synthesis
RGB-D Image for Scene Recognition by Jiaqi Guo
Knowledge-based event recognition from salient regions of activity
Y2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by Joint Reconstruction and Prediction of View and Word Sequences 1, Zhizhong.
Presentation transcript:

Structure Recovery by Part Assembly Chao-Hui Shen 1 Hongbo Fu 2 Kang Chen 1 Shi-Min Hu 1 1 Tsinghua University 2 City University of Hong Kong

Background Consumer level scanning devices Capture both RGB and depth Reconstruction is challenging – Low resolution – Noise – Missing data – …

Example-based Scan Completion Global-to-local and top-down [Kraevoy and Sheffer 2005; Pauly et al. 2005] Rely on the availability of suitable template model However … No suitable model! shape retrieval

Assembly-based 3D Modeling Data-drive suggestion and interaction [Chaudhuri and Koltun 2010; Chaudhuri et al. 2011] – Retrieve suitable parts to match user intent – Aim to support open-ended 3D modeling – Quite different goal from ours Automatic shape synthesis by part composition [Kalogerakis et al. 2012; Jain et al. 2012; Xu et al. 2012] – Result in database that grows exponentially – Significantly enlarge the existing database – But make storage and retrieval challenging

Our solution: Recover the Structure by Part Assembly Structure recovery instead of geometry reconstruction Do NOT prepare a large database Retrieve and assemble suitable parts on the fly

Problem Setup Input Point cloud + Image (Single view) Pre-segmented Repository Models (Parts + Labels) …… Goal: Recover high-level structure Assembly close to geometry Output …… Session: Acquiring and Synthesizing Indoor Scenes An Interactive Approach to Semantic Modeling of Indoor Scenes with an RGBD Camera [Shao et al. 2012] A Search-Classify Approach for Cluttered Indoor Scene Understanding [Nan et al. 2012] Acquiring 3D Indoor Environments with Variability and Repetition [Kim et al. 2012]

Directly searching is computationally prohibitive Need a quick way to explore meaningful structures guided by: – Spatial layout of the parts in the repository models – Acquired data Observations

Complementary characteristics of point cloud & image 3D, more accurate cues for geometry & structure Incomplete and noisy Lack depth information Capture the complete object

Algorithm Overview Candidate Parts SelectionStructure CompositionPart Conjoining ……

Algorithm Overview Candidate Parts SelectionStructure CompositionPart Conjoining ……

Candidate Parts Selection Goal: select a small set of candidates for each category Achieved by retrieving parts that fit well some regions

Straightforward Solution Search for the best-fit parts over the entire domain – Disregards the semantics associated with each part and the interaction between different parts Unlikely to produce good results! X X X X XXX X

Key Fact Man-made objects lie in a low dimensional space – Defined with respect to the relative sizes and positions of shape parts [Ovsjanikov et al. 2011] Employ 3D repository model as a global context – Globally align the models with the input scan first Search in a 3D offset window around the part

Part Matching Scheme Geometric fidelity score Geometric contribution score 3D2D edge map Total matching score 3D offset window

Candidate Parts Select top K parts with highest score for each category Seat Back Arm Front leg ……

Algorithm Overview Candidate Parts SelectionStructure CompositionPart Conjoining ……

Structure Composition Goal: compose the underlying structure by identifying a subset of candidate parts

Constraints for Promising Compositions Geometric fidelityProximityOverlap   having high scoreno isolated partsminimized intersection

Search and Evaluate Search for promising compositions under constraints Globally Evaluate the compositions average geometry fidelity of parts total geometry fidelity total geometry contribution …… optimal composition

Algorithm Overview Candidate Parts SelectionStructure CompositionPart Conjoining ……

Part Conjoining Problem: the parts are loosely placed together Goal: generate a consistent & complete model

Identification of Contact Points Refer to their parent models [Jain et al. 2012]

Matching of Contact Points Greedily match nearby contact points Generate auxiliary contact points when necessary auxiliary contact points

i j identity scale Global Optimization transformed contact points

Results: Chairs 70 repository models, 11 part categories

Results: Tables 61 repository models, 4 part categories

Results: Bicycles 38 repository models, 9 part categories

Results: Airplanes 70 repository models, 6 part categories

Results: Creating New Structures

Results: Impact of Dataset input data Randomly picking some repository models

Summary A bottom-up structure recovery approach – Effectively reuse limited repository models – Automatically compose new structure – Handle single-view inputs by the Kinect system Future work – Multi-view inputs – Include style/functional constraints – Recover Indoor scenes

Thank you! Project Page:

User-Assisted Preprocessing GrabCut to extract foreground object [Rother et al. 2004]

Performance Preprocessing step – 3 minutes user interaction Candidate part selection – 1 minutes for 70 models Structure composition – 2 minutes Part conjoining – 1 second

Limitations Lack of suitable parts in the repository

Limitations Input with severely missing geometry – The global alignment becomes unreliable