Structure Recovery by Part Assembly Chao-Hui Shen, Hongbo Fu, Kang Chen and Shi-Min Hu Tsinghua University City University of Hong Kong Presented by: Chenyang.

Slides:



Advertisements
Similar presentations
Discontinuity Preserving Stereo with Small Baseline Multi-Flash Illumination Rogerio Feris 1, Ramesh Raskar 2, Longbin Chen 1, Karhan Tan 3 and Matthew.
Advertisements

Ying Cao Antoni B. ChanRynson W.H. Lau City University of Hong Kong.
Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features.
Hybrid Context Inconsistency Resolution for Context-aware Services
A generic model to compose vision modules for holistic scene understanding Adarsh Kowdle *, Congcong Li *, Ashutosh Saxena, and Tsuhan Chen Cornell University,
www-video.eecs.berkeley.edu/research
3D Shape Representation Tianqiang 04/01/2014. Image/video understanding Content creation Why do we need 3D shapes?
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
Structure Recovery by Part Assembly Chao-Hui Shen 1 Hongbo Fu 2 Kang Chen 1 Shi-Min Hu 1 1 Tsinghua University 2 City University of Hong Kong.
PARAMETRIC RESHAPING OF HUMAN BODIES IN IMAGES SIGGRAPH 2010 Shizhe Zhou Hongbo Fu Ligang Liu Daniel Cohen-Or Xiaoguang Han.
3D Video Generation and Service Based on a TOF Depth Sensor in MPEG-4 Multimedia Framework IEEE Consumer Electronics Sung-Yeol Kim Ji-Ho Cho Andres Koschan.
Example Based 3D Shape Completion Mark Pauly 1,2, Niloy J. Mitra 1, Joachim Giesen 2, Markus Gross 2, Leonidas J. Guibas 1 1 Stanford University 2 ETH,
Object Recognition Using Genetic Algorithms CS773C Advanced Machine Intelligence Applications Spring 2008: Object Recognition.
A Study of Approaches for Object Recognition
Supervised Distance Metric Learning Presented at CMU’s Computer Vision Misc-Read Reading Group May 9, 2007 by Tomasz Malisiewicz.
Abstract Overall Algorithm Target Matching Error Checking: By comparing what we transform from Kinect Camera coordinate to robot coordinate with what we.
Convergence of vision and graphics Jitendra Malik University of California at Berkeley Jitendra Malik University of California at Berkeley.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Exploration of Ground Truth from Raw GPS Data National University of Defense Technology & Hong Kong University of Science and Technology Exploration of.
My Research Experience Cheng Qian. Outline 3D Reconstruction Based on Range Images Color Engineering Thermal Image Restoration.
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING Automatic Posing of a Meshed Human Model Using Point Clouds Lei Wang Joint work with Tamal K. Dey, Huamin.
A Brief Overview of Computer Vision Jinxiang Chai.
The Three R’s of Vision Jitendra Malik.
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
Automatic Registration of Color Images to 3D Geometry Computer Graphics International 2009 Yunzhen Li and Kok-Lim Low School of Computing National University.
Use and Re-use of Facial Motion Capture M. Sanchez, J. Edge, S. King and S. Maddock.
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
Xiaoguang Han Department of Computer Science Probation talk – D Human Reconstruction from Sparse Uncalibrated Views.
3D SLAM for Omni-directional Camera
ALIGNMENT OF 3D ARTICULATE SHAPES. Articulated registration Input: Two or more 3d point clouds (possibly with connectivity information) of an articulated.
Image-based Plant Modeling Zeng Lanling Mar 19, 2008.
Recovering Surface Layout from a Single Image D. Hoiem, A.A. Efros, M. Hebert Robotics Institute, CMU Presenter: Derek Hoiem CS 598, Spring 2009 Jan 29,
SPONSORED BY Data-driven Segmentation and Labeling of Freehand Sketches Zhe Huang, Hongbo Fu, Rynson W.H. Lau City University of Hong Kong.
Detecting Curved Symmetric Parts using a Deformable Disc Model Tom Sie Ho Lee, University of Toronto Sanja Fidler, TTI Chicago Sven Dickinson, University.
Organizing Heterogeneous Scene Collections through Contextual Focal Points Kai Xu, Rui Ma, Hao Zhang, Chenyang Zhu, Ariel Shamir, Daniel Cohen-Or, Hui.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
Multiple Mapping Method with Multiple Templates (M4T): optimizing sequence-to-structure alignments and combining unique information from multiple templates.
A Face processing system Based on Committee Machine: The Approach and Experimental Results Presented by: Harvest Jang 29 Jan 2003.
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
The 18th Meeting on Image Recognition and Understanding 2015/7/29 Depth Image Enhancement Using Local Tangent Plane Approximations Kiyoshi MatsuoYoshimitsu.
Competence Centre on Information Extraction and Image Understanding for Earth Observation 29th March 2007 Category - based Semantic Search Engine 1 Mihai.
Temporally Coherent Completion of Dynamic Shapes AUTHORS:HAO LI,LINJIE LUO,DANIEL VLASIC PIETER PEERS,JOVAN POPOVIC,MARK PAULY,SZYMON RUSINKIEWICZ Presenter:Zoomin(Zhuming)
Layered Object Detection for Multi-Class Image Segmentation UC Irvine Yi Yang Sam Hallman Deva Ramanan Charless Fowlkes.
August 17, 2005Question Answering Passage Retrieval Using Dependency Parsing 1/28 Question Answering Passage Retrieval Using Dependency Parsing Hang Cui.
2D Texture Synthesis Instructor: Yizhou Yu. Texture synthesis Goal: increase texture resolution yet keep local texture variation.
Extracting Depth and Matte using a Color-Filtered Aperture Yosuke Bando TOSHIBA + The University of Tokyo Bing-Yu Chen National Taiwan University Tomoyuki.
Journal of Visual Communication and Image Representation
SA2014.SIGGRAPH.ORG SPONSORED BY Automatic Semantic Modeling of Indoor Scenes from Low-quality RGB-D Data using Contextual Information Kang Chen 1 Yu-Kun.
Enhancing Image-Based Aging Approaches Olivier Clément Eric Paquette.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Outline ● Introduction – What is the problem ● Generate stochastic textures ● Improve realism ● High level approach - Don't just jump into details – Why.
Multi-view Traffic Sign Detection, Recognition and 3D Localisation Radu Timofte, Karel Zimmermann, and Luc Van Gool.
Outline ● Introduction – What is the problem ● Generate stochastic textures ● Improve realism ● High level approach - Don't just jump into details – Why.
Stackabilization Honghua Li, Ibraheem Alhashim, Hao Zhang, Ariel Shamir, Daniel Cohen-Or.
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
Course: Structure-Aware Shape Processing Introduction to Geometric ‘Structure’ Extracting Structures –analysis of Individual Models –analysis of Shape.
1 Double-Patterning Aware DSA Template Guided Cut Redistribution for Advanced 1-D Gridded Designs Zhi-Wen Lin and Yao-Wen Chang National Taiwan University.
Student: Shunan Shi Professor: Hao Zhang CMPT888.
Processing visual information for Computer Vision
WSRec: A Collaborative Filtering Based Web Service Recommender System
Rogerio Feris 1, Ramesh Raskar 2, Matthew Turk 1
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Structure from motion Input: Output: (Tomasi and Kanade)
Image Based Modeling and Rendering (PI: Malik)
Rob Fergus Computer Vision
إستراتيجيات ونماذج التقويم
Y2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by Joint Reconstruction and Prediction of View and Word Sequences 1, Zhizhong.
Resource Allocation for Distributed Streaming Applications
Structure from motion Input: Output: (Tomasi and Kanade)
CVPR 2019 Tutorial on Map Synchronization
Presentation transcript:

Structure Recovery by Part Assembly Chao-Hui Shen, Hongbo Fu, Kang Chen and Shi-Min Hu Tsinghua University City University of Hong Kong Presented by: Chenyang Zhu

Outline Background Overview Method Conclusion

Background Consumer level scanning devices Capture both RGB and depth Reconstruction is challenging ◦ Low resolution ◦ Noise ◦ Missing data ◦…◦…

Example-based Scan Completion Global-to-local and top-down [Kraevoy and Sheffer 2005; Pauly et al. 2005] Rely on the availability of suitable template model However … No suitable model! shape retrieval

Solution Recover the Structure by Part Assembly ◦ Structure recovery instead of geometry reconstruction ◦ Do NOT prepare a large database ◦ Retrieve and assemble suitable parts on the fly

Problem setup Input Point cloud + Image (Single view) Pre-segmented Repository Models (Parts + Labels) … Output ……

Algorithm Overview Candidate Parts SelectionStructure CompositionPart Conjoining …

Candidate Parts Employ 3D repository model as a global context ◦ Globally align the models with the input scan first Search in a 3D offset window around the part

Geometric contribution score Candidate Parts Employ 3D repository model as a global context ◦ Globally align the models with the input scan first Search in a 3D offset window around the part Geometric fidelity score 3D2D edge map 2D 3D

Candidate Parts Employ 3D repository model as a global context ◦ Globally align the models with the input scan first Select top K parts with highest score for each category Seat Back Arm Front leg … …… ………………

Structure Composition Search for promising compositions under constraints … Optimal composition average geometry fidelity of parts total geometry fidelity total geometry contribution Globally Evaluate the compositions

Part Conjoining Problem: the parts are loosely placed together Goal: generate a consistent & complete model

Part Conjoining identity scale i j transformed contact points

Conclusion A bottom-up structure recovery approach ◦ Effectively reuse limited repository models ◦ Automatically compose new structure ◦ Handle single-view inputs by the Kinect system Future work ◦ Multi-view inputs ◦ Include style/functional constraints ◦ Recover Indoor scenes

THANK YOU! Q&A