Presentation is loading. Please wait.

Presentation is loading. Please wait.

Yunhai Wang 1 Minglun Gong 1,2 Tianhua Wang 1,3 Hao (Richard) Zhang 4 Daniel Cohen-Or 5 Baoquan Chen 1,6 5 Tel-Aviv University 4 Simon Fraser University.

Similar presentations


Presentation on theme: "Yunhai Wang 1 Minglun Gong 1,2 Tianhua Wang 1,3 Hao (Richard) Zhang 4 Daniel Cohen-Or 5 Baoquan Chen 1,6 5 Tel-Aviv University 4 Simon Fraser University."— Presentation transcript:

1 Yunhai Wang 1 Minglun Gong 1,2 Tianhua Wang 1,3 Hao (Richard) Zhang 4 Daniel Cohen-Or 5 Baoquan Chen 1,6 5 Tel-Aviv University 4 Simon Fraser University 1 Shenzhen Institutes of Advanced Technology 6 Shandong University 3 Jilin University 2 Memorial University of Newfoundland

2 2/40  One of the most fundamental tasks in shape analysis  Low-level cues (minimal rule; convexity) alone insufficient

3 3/40 3 Learning segmentation [Kalograkis et al. 10] Active co-analysis [Wang et al. 2012] Unsupervised co-analysis [Sidi et al. 2011] Joint segmentation [Huang et al. 2011] Keys to success: amount & quality of labelled or unlabelled 3D data

4 4/40 380 labeled meshes over 19 object categories  How many 3D models of strollers, golf carts, gazebos, …?  Not enough 3D models = insufficient knowledge  Labeling 3D shapes is also a non-trivial task

5 5/40 About 14 million images across almost 22,000 object categories Labeling images is quite a bit easier than labeling 3D shapes

6 6/40 6 Incomplete Real-world 3D models (e.g., those from Tremble Warehouse) are often imperfect Self-intersecting; non-manifold

7 7/40 Treat a 3D shape as a set of projected binary images  Alleviate various data artifacts in 3D, e.g., self- intersections  Then propagate the image labels to the 3D shape  Label these images by learning from vast amount of image data

8 8/40 Joint image-shape analysis via projective analysis for semantic 3D segmentation  Utilize vast amount of available image data  Allowing us to analyze imperfect 3D shapes

9 9/40 Bi-class Symmetric Hausdorff distance = BiSH  Designed for matching 1D binary images  More sensitive to topology changes (holes)  Caters to our needs: part-aware label transfer

10 10/40 10 Image-guided 3D modeling [Xu et al.11] Many works on 2D-3D fusion, e.g., for reconstruction [Li et al.11]

11 11/40 11 Light field descriptor for 3D shape retrieval [Chen et al.03] Image-space simplification error [Lindstrom and Turk 10] We deal with the higher-level and more delicate task of semantic 3D segmentation

12 12/40 PSA for 3D shape segmentation Region-based binary shape matching Results and conclusion

13 13/40 Labeling involves GrabCut and some user assistance

14 14/40 Assume all objects are upright oriented; they mostly are! Project an input 3D shape from multiple pre-set viewpoints

15 15/40 For each projection of the input 3D shape, retrieve top matches from the set of labelled images

16 16/40 Select top (non-adjacent) projections with the smallest average matching costs for label transfer

17 17/40 Label transfer is done per corresponding horizontal slabs Pixel correspondence straightforward Later …

18 18/40 Label transfer is weighted by a confidence value per pixel  Three terms based on image-level, slab-level, and pixel-level similarity: more similar = higher confidence

19 19/40 Probabilistic map over input 3D shape: computed by integrating per-pixel confidence values over each shape primitive  One primitive projects to multiple pixels in multiple images  Per-pixel confidence gathered over multiple retrieved images

20 20/40 Final labeling of 3D shape: multi-label alpha expansion graph cuts based on the probabilistic map

21 21/40 PSA for 3D shape segmentation Region-based binary shape matching Results and conclusion

22 22/40 Projections of input 3D shape … Database of (labeled) images … Characteristics of the data to be matched  Possibly complex topology (lots of holes), not just a contour  All upright orientated: to be exploited Goal: find shapes most suitable for label transfer and FAST!  Not a global visual similarity based retrieval  Want part-aware label transfer but cannot reliably segment Classical descriptors, e.g., shape context, interior distance shape context (IDSC), GIST, Zenike moments, Fourier descriptors, etc., do not quite fulfill our needs

23 23/40 Takes advantage of upright orientation

24 24/40 Classical choice for distance: symmetric Hausdorff (SH) But not sensitive to topology changes; not part-aware Cluster scan-lines into smaller number of slabs --- efficiency! Hierarchical clustering by a distance between adjacent slabs

25 25/40 SH for only one class may not be topology- sensitive A bi-class SH distance is! A B C B SH(A,B)=2, SH(A c, B c )=10 SH(C,B)=2, SH(C c, B c )=2

26 26/40 A B C B SH(A,B)=2, SH(A c, B c )=10 SH(C,B)=2, SH(C c, B c )=2 BiSH(C,B) = 2 BiSH(A,B) = 10

27 27/40 BiSH SH BiSH is more part-aware: new slabs near part boundaries

28 28/40 Slabs are scaled/warped vertically for better alignment Another measure to encourage part-aware label transfer Slabs of labeled image warped to better align with slabs in projected image Warp Slabs recolored: many-to-one slab matching possible Recolor

29 29/40 Dissimilarity between slabs: BiSH scaled by slab height Slab matching allows linear warp: optimized by a dynamic time warping (DTW) algorithm Dissimilarity between images: sum over slab dissimilarity after warped slab matching

30 30/40 PSA for 3D shape segmentation Region-based binary shape matching Results and conclusion

31 31/40 Same inputs, training data (we project), and experimental setting Models in [K 2010]: manifold, complete, no self- intersections PSA allows us to handle any category and imperfect shapes

32 32/40 11 object categories; about 2600 labeled images All input 3D shapes tested have self-intersections as well as other data artifacts

33 33/40 Pavilion (465 pieces) Bicycle (704 pieces)

34 34/40

35 35/40 Matching two images (512 x 512) takes 0.06 seconds Label transfer (2D-to-2D then to 3D): about 1 minute for a 20K-triangle mesh  Number of selected projections: 5 – 10  Number of retrieved images per projection: 2

36 36/40 Projective shape analysis (PSA): semantic 3D segmentation by learning from labeled 2D images 36 Demonstrated potential in labeling 3D models: imperfect, complex topology, over any category

37 37/40 No strong requirements on quality of 3D model Utilize the rich availability and ease of processing of photos for 3D shape analysis

38 38/40 Inherent limitation of 2D projections: they do not fully capture 3D info Inherent to data-driven: knowledge has to be in data Assuming upright; not designed for articulated shapes Relying on spatial and not feature-space analysis

39 39/40 Labeling 2D images is still tedious: unsupervised projective analysis Additional cues from images and projections, e.g., color, depth, etc. Apply PSA for other knowledge-driven analyses

40 40/40 40 More results and data can be found from http://web.siat.ac.cn/~yunhai/psa.html http://web.siat.ac.cn/~yunhai/psa.html


Download ppt "Yunhai Wang 1 Minglun Gong 1,2 Tianhua Wang 1,3 Hao (Richard) Zhang 4 Daniel Cohen-Or 5 Baoquan Chen 1,6 5 Tel-Aviv University 4 Simon Fraser University."

Similar presentations


Ads by Google