Relationship Template for Creating Scene Variations

Slides:

Advertisements

Similar presentations

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Advertisements

Animosaics Kaleigh Smith, Yunjun Liu, and Allison Klein McGill University Eurographics Symposium on Computer Animation 2005.

Ying Cao Antoni B. ChanRynson W.H. Lau City University of Hong Kong.

Image Retrieval: Current Techniques, Promising Directions, and Open Issues Yong Rui, Thomas Huang and Shih-Fu Chang Published in the Journal of Visual.

Ming-Ming Cheng 1 Ziming Zhang 2 Wen-Yan Lin 3 Philip H. S. Torr 1 1 Oxford University, 2 Boston University 3 Brookes Vision Group Training a generic objectness.

Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.

Indexing 3D Scenes Using the Interaction Bisector Surface Xi Zhao He Wang Taku Komura University of Edinburgh.

Patch to the Future: Unsupervised Visual Prediction

SIGGRAPH Paper Reading 2011 Huang Haibin

Robust Object Tracking via Sparsity-based Collaborative Model

Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.

Structure Recovery by Part Assembly Chao-Hui Shen 1 Hongbo Fu 2 Kang Chen 1 Shi-Min Hu 1 1 Tsinghua University 2 City University of Hong Kong.

Reverse Engineering Niloy J. Mitra.

Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.

Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.

Iterative closest point algorithms

Recovering Articulated Object Models from 3D Range Data Dragomir Anguelov Daphne Koller Hoi-Cheung Pang Praveen Srinivasan Sebastian Thrun Computer Science.

Interactive Hairstyle Modeling Using a Sketching Interface Xiaoyang Mao Kouichi Kashio Hiroyuki Kato Atsumi Imamiya CGGM 2002.

CS292 Computational Vision and Language Visual Features - Colour and Texture.

Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.

AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.

Context-based Surface Completion Andrei Sharf, Marc Alexa, Daniel Cohen-Or.

SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.

Shape Matching for Model Alignment 3D Scan Matching and Registration, Part I ICCV 2005 Short Course Michael Kazhdan Johns Hopkins University.

FlowString: Partial Streamline Matching using Shape Invariant Similarity Measure for Exploratory Flow Visualization Jun Tao, Chaoli Wang, Ching-Kuang Shene.

Semantic Decomposition and Reconstruction of Residential Scenes from LiDAR Data Hui Lin 1 *, Jizhou Gao 1 *, Yu Zhou 2, Guiliang Lu 2, Mao Ye 1, Ligang.

OPTIMIZATION OF FUNCTIONAL BRAIN ROIS VIA MAXIMIZATION OF CONSISTENCY OF STRUCTURAL CONNECTIVITY PROFILES Dajiang Zhu Computer Science Department The University.

Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.

Organizing Heterogeneous Scene Collections through Contextual Focal Points Kai Xu, Rui Ma, Hao Zhang, Chenyang Zhu, Ariel Shamir, Daniel Cohen-Or, Hui.

Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.

Ground Truth Free Evaluation of Segment Based Maps Rolf Lakaemper Temple University, Philadelphia,PA,USA.

Non-Photorealistic Rendering and Content- Based Image Retrieval Yuan-Hao Lai Pacific Graphics (2003)

Scene Completion Using Millions of Photographs James Hays, Alexei A. Efros Carnegie Mellon University ACM SIGGRAPH 2007.

INTERACTIVELY BROWSING LARGE IMAGE DATABASES Ronald Richter, Mathias Eitz and Marc Alexa.

Acquiring 3D Indoor Environments with Variability and Repetition Young Min Kim Stanford University Niloy J. Mitra UCL/ KAUST Dong-Ming Yan KAUST Leonidas.

CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.

INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-

2D Texture Synthesis Instructor: Yizhou Yu. Texture synthesis Goal: increase texture resolution yet keep local texture variation.

A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.

CSE 185 Introduction to Computer Vision Feature Matching.

Outline ● Introduction – What is the problem ● Generate stochastic textures ● Improve realism ● High level approach - Don't just jump into details – Why.

Outline ● Introduction – What is the problem ● Generate stochastic textures ● Improve realism ● High level approach - Don't just jump into details – Why.

1 A Methodology for automatic retrieval of similarly shaped machinable components Mark Ascher - Dept of ECE.

CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.

A Methodology for automatic retrieval of similarly shaped machinable components Mark Ascher - Dept of ECE.

Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.

Structure Recovery by Part Assembly Chao-Hui Shen, Hongbo Fu, Kang Chen and Shi-Min Hu Tsinghua University City University of Hong Kong Presented by: Chenyang.

Deformation Modeling for Robust 3D Face Matching Xioguang Lu and Anil K. Jain Dept. of Computer Science & Engineering Michigan State University.

Modeling Perspective Effects in Photographic Composition Zihan Zhou, Siqiong He, Jia Li, and James Z. Wang The Pennsylvania State University.

Graphcut Textures:Image and Video Synthesis Using Graph Cuts

INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM

Data Driven Attributes for Action Detection

Recognizing Deformable Shapes

Nonparametric Semantic Segmentation

Fast Preprocessing for Robust Face Sketch Synthesis

Hansheng Xue School of Computer Science and Technology

Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science

Context-Aware Modeling and Recognition of Activities in Video

Local Feature Extraction Using Scale-Space Decomposition

Image Segmentation Techniques

Dingding Liu* Yingen Xiong† Linda Shapiro* Kari Pulli†

CSc4730/6730 Scientific Visualization

Brief Review of Recognition + Context

An Introduction to Software Architecture

Scale-Space Representation for Matching of 3D Models

Y2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by Joint Reconstruction and Prediction of View and Word Sequences 1, Zhizhong.

CSE 185 Introduction to Computer Vision

Recognizing Deformable Shapes

One-shot learning and generation of dexterous grasps

Presentation transcript:

Relationship Template for Creating Scene Variations Xi Zhao Xi’an Jiaotong University Ruizhen Hu Shenzhen University Paul Guerrero Niloy Mitra University College London Taku Komura Edinburgh University

Relationships in a Scene There are many types of spatial relationships in a daily life scene, some are simple, such as the toys on the chest of drawers, pictures hang on the wall; But some are more complex, such as the hat on the stand, the flower in the vase;

How to make variations of complex relationship? it’s time consuming to manually make such scenes, because the objects has to be in specific relative configurations to form the relationship; [Objective] To save the labor, we propose a method to creating variations for such scenes Given a exemplar, our method can synthesize object pairs in a relationship similar to the exemplar For example, put different flowers into different vases, tuck different chairs under different desks…

Existing Methods L.-F. Yu, S.-K. Yeung, C.-K. Tang, D. Terzopoulos, T. F. Chan, and S. J. Osher, “Make it home” SIGGRAPH 2011 Y.-T. Yeh, L. Yang, M. Watson, N. D. Goodman, and P. Hanrahan, “Synthesizing open worlds with constraints using locally annealed reversible jump MCMC,” SIGGRAPH 2012 M. Fisher, D. Ritchie, M. Savva, T. Funkhouser, and P. Hanrahan, “Example-based Synthesis of 3D Object Arrangements” SIGGRAPH ASIA2012 L. Majerowicz, A. Shamir, A. Sheffer, and H. H. Hoos, “Filling your shelves: Synthesizing diverse style-preserving artifact arrangements,” TVCG 2014. These methods provide effective systems to synthesize indoor scenes. They cannot handle complex spatial relations due to their simple centroid-based relationship representation. Our work focus on handing complex relations. Additionally, several methods require an extensive scene database as input, while we only work with a single example scene. about the previous method -> their representation -> their problem -> the representation we make use of -> our solution

Limitation of Previous Representations example scene new scene example scene new scene example scene new Traditional methods: Relative vector / individual shape matching (example 1) They do not work for complex relations (example 2, might be ok if we add orientation constraints; example 3, totally don’t work) Because “switching individual object” does not consider spatial relationship between the two parts And “relative vector" abstract the object to be a single point, it contains too little information of spatial relationships

The Representation We Use: IBS We use IBS to synthesis new scenes Emphasize: we don't want to replace a full synthesis pipeline, but instead provide a component for a synthesis pipeline that can be used to synthesize these complex relationships X. Zhao, H. Wang, and T. Komura, “Interaction Bisector Surface,” TOG2014. R. Hu, C. Zhu, O. van Kaick, L. Liu, A. Shamir, and H. Zhang, “Interaction Context (ICON)” SIGGRAPH2015

Our Method

Overview Example scene 1.Template construction

Overview 1.Template construction 2. Object fitting Novel object 1

Overview Result 1.Template construction 2. Object fitting 3. Scene synthesis

Relationship Template: Abstraction of The Open Space We are using open space as a template The open space is like the board of the puzzle. we build the template from the open space instead of the objects themselves. 1.Template construction 2. Object fitting 3. Scene synthesis

Template Construction: IBS Interaction Bisector Surface(IBS) We want to build the template from the open space of the scene (to abstract the spatial relationship). To compute the template we use IBS. The IBS of a scene is a surface consisting of the set of points equidistant from neighboring objects in a scene. (in 2D… in 3D…) The IBS divides the open space of the scene into N subspaces, each of which corresponds to one object in the scene 1.Template construction 2. Object fitting 3. Scene synthesis

Template Construction: Cells and Features (The motivation of using SCF) From point of template to the closest point on the desk, we can build a vector (the red arrows) Feature1: The size of the vector can describe the distance from the template point to the object Feature2: The direction of the vector can describe the relative direction between the two parts (p2 shows the surface part of the desk is above the template) But how to tell the difference between p1 and p3? (the distance and direction are not descriptive enough) 1.Template construction 2. Object fitting 3. Scene synthesis

Shape Coverage Feature (SCF) The definition and computation of SCF: To compute the SCF at the red point, we sample the boundary distance around the point and normalize it to get the volume diameter function. SCF coefficients are the power of this function in each frequency band. 1.Template construction 2. Object fitting 3. Scene synthesis

SCF Coefficients 1.Template construction Chair(1) Chair(2) Desk SCF coefficients in the open space around different objects (two chairs and a desk – sideview) are shown as color maps. Values are clamped to 1 to preserve contrast for smaller coefficients. In each row, we show coefficient values of bands 0 to 4 for the object shown in the foreground. Why good? The distributions are different for different types of models (chair vs. desk), also robust to the change of local geometry(chair1 vs. chair2) Desk 1.Template construction 2. Object fitting 3. Scene synthesis

Object Fitting: the idea Example scene Template Template Template Novel object Template The open space of a scene is defined as the free space outside the objects/template mesh. The idea here is to fit the object to the template: find a good configuration of the object around the template so that the spatial relationship is kept Fitting object to template = fitting template to object For computation convenience, we move the template in the object’s open space to find the best fit 1.Template construction 2. Object fitting 3. Scene synthesis 1.Template construction 2. Object fitting 3. Scene synthesis

Initial Matching ROI 2. Object fitting Geometric hashing To find the best fitting, we use geometric hashing method to match the template points and the candidate points This fitting process only produces an approximate match due to the finite sampling density of open space points. 1.Template construction 2. Object fitting 3. Scene synthesis

Reduce the Search Space Find the region of interest (ROI) Candidate points sliding window We reduce our search space to regions that are likely to contain good candidates. This is similar to the bag of visual words concept, where the goal is to search for a volume that contains point sets with high matching score. More specifically, we first do the Feature points clustering, then pre-compute features values at a uniform grid of points in the open space around the novel object Then we use a “sliding window” to test all different sub-regions of the open space After finding the ROI, we ignore other points in the open space and only consider the points in the ROI Within the ROI, we do the geometric hashing 1.Template construction 2. Object fitting 3. Scene synthesis

Refinement 2. Object fitting ICP style refinement We iteratively refine the coarse matching results by maximizing the similarity function 1.Template construction 2. Object fitting 3. Scene synthesis

Larger Scenes 3. Scene synthesis Scene hierarchy Combine with other scene synthesis system M. Fisher, D. Ritchie, M. Savva, T. Funkhouser, and P. Hanrahan, “Example-based Synthesis of 3D Object Arrangements” SIGGRAPH ASIA2012 For scenes composed of more than two objects, directly placing objects into the template cells can be overly restrictive. By using a scene hierarchy, we decompose the synthesis to a sequence of pairwise synthesis (describe the example). For the bathroom scene we combine our approach with Fisher’s method to show that our method can be used as a component of existing scene synthesis pipelines. To create the variations for the bathroom scene, objects in complex relationships, such as the baby in the bathtub, toothbrush in the cup, are synthesized by our method, then merged and placed as single objects into the scene using The arrangement and layout models of Fisher’s method. 1.Template construction 2. Object fitting 3. Scene synthesis

Capturing Scene Semantics Without semantic constraints With semantic constraints Realistic scenes usually contain constraints that are not captured purely by the geometry of relationships. Resolving placement ambiguity, Contact constraints, Upright constraints, Floor constraints, Collision constraints 1.Template construction 2. Object fitting 3. Scene synthesis

Results and Evaluations

Pairwise Experiment: Our Method vs. ShapeSPH* Input The input of our experiments are four example scenes and a database of unlabeled objects We use both our method and the ShapeSPH method to synthesize all possible combinations of retrieved object pairs and rank them based on their combined fitting score. Baseline method: ShapeSPH Objects in the example pair are replaced separately with database objects using global shape alignment *T. Funkhouser et al., “Modeling by example,” in ACM Transactions on Graphics (TOG), 2004, vol. 23, pp. 652–663.

Pairwise Experiment: Results The incorrectly synthesized pairs are marked with purple background Here our method performs better (there are less incorrect results), because we describe and optimize for the geometric relationship between these objects The baseline method produces a large number of false positives, because it only takes into account the geometry of individual example objects

Pairwise Experiment: Results

Pairwise Experiment: Results

Pairwise Experiment: Results

Larger Scene Experiment Input scene Results - Ours We compare our results against results obtained from Fisher et al.[2012] using the same input scene. Results - [Fisher et al. 2012]

Larger Scene Experiment Input scene Results - Ours Results - [Fisher et al. 2012]

Larger Scene Experiment Input scene Results - Ours Results – [Fisher et al. 2012]

Larger Scene Experiment: Evaluation User study interface We rated plausibility of each synthesized scene via a user study. Users are shown images of two randomly picked scenes and asked to judge which object arrangement is more realistic. The study was performed in Mechanical Turk.

Larger Scene Experiment: Evaluation Using the BradleyTerry Model [Hunter 2004], we obtained a ‘realism’ or ‘plausibility’ score for each image from the results of these pairwise comparisons, as well as probabilities for each image to be judged more realistic than any other image in the dataset.

Spatially Repeated Objects For synthesizing scenes where the same spatial relations can be found at various locations, such as hats hung at multiple hatholders, we find multiple candidate locations around the stand and do the object fitting several times. By doing this we produce scenes with different hatholders and hats. Similarly we can produce these objects-on-shelf scenes

Conclusion We propose a method for creating variations of scenes that include complex relationships. We propose a novel feature “SCF” to encode the geometry of the open space. Our method can augment existing scene synthesis methods.

Limitations and Future Work Only consider rigid IBS Future work: Add flexibility to the relationship template Learn a parametric model of the relationship template

Acknowledgement Anonymous reviewers Anonymous Mechanical Turk users Ylab in XJTU China Postdoctoral Science Foundation National Science Foundation of China Guangdong Science and Technology Program Shenzhen Innovation Program The ERC starting Grant SmartGeometry Marie Curie CIG EPSRC Grant FP7 TOMSY Thanks!