Visual Perception of 3D Shape Roland W. Fleming Manish Singh Max Planck Institute for Biological Cybernetics Rutgers University – New Brunswick.

Slides:

Advertisements

Similar presentations

1 Orientation fields and 3D shape estimation Roland W. Fleming Max Planck Institute for Biological Cybernetics.

Advertisements

Unit 4(G): Perceptual Organization and Interpretation

A Projective Framework for Radiometric Image Analysis CVPR 2009 Date: 2010/3/9 Reporter : Annie lin.

Seeing 3D from 2D Images. How to make a 2D image appear as 3D! ► Output and input is typically 2D Images ► Yet we want to show a 3D world! ► How can we.

Fleming, Breidt & Bülthoff Supplementary Images The following slides contain a number of example photographs to show how the claims in the main submission.

Perception Chapter 4.

PERCEPTION Our ________________ of the stimuli coming in from the world around us.

Low-Level Vision. Low Level Vision--outline Problem to be solved Example of one computation—lines How simple computations yield more complex information.

Chapter 23 Mirrors and Lenses. Notation for Mirrors and Lenses The object distance is the distance from the object to the mirror or lens Denoted by p.

Outline Sensation, Perception, Behavior Process of sensation Perceived vs. “real” world Properties of perceptual processes - Adaptation, pattern coding.

Visibility Subspaces: Uncalibrated Photometric Stereo with Shadows Kalyan Sunkavalli, Harvard University Joint work with Todd Zickler and Hanspeter Pfister.

Perception of illumination and shadows Lavanya Sharan February 14th, 2011.

MSU CSE 803 Stockman1 CV: Perceiving 3D from 2D Many cues from 2D images enable interpretation of the structure of the 3D world producing them.

Perception of illumination and shadows Lavanya Sharan February 14th, 2011.

Color, lightness & brightness Lavanya Sharan February 7, 2011.

Human Visual System Lecture 3 Human Visual System – Recap

1 Lecture 11 Scene Modeling. 2 Multiple Orthographic Views The easiest way is to project the scene with parallel orthographic projections. Fast rendering.

Theories of Vision: a swift overview : Advanced Machine Perception A. Efros, CMU, Spring 2006 Most slides from Steve Palmer.

CS292 Computational Vision and Language Visual Features - Colour and Texture.

Inflating an Artist’s Sketch Chapter 2: Hoffman Nicole J., Victor W., and Nicole T.

Image Statistics and the Perception of 3D Shape Roland W. Fleming Max Planck Institute for Biological Cybernetics Yuanzhen Li Edward H. Adelson Massachusetts.

1B50 – Percepts and Concepts Daniel J Hulme. Outline Cognitive Vision –Why do we want computers to see? –Why can’t computers see? –Introducing percepts.

Computer Vision Spring ,-685 Instructor: S. Narasimhan PH A18B T-R 10:30am – 11:50am Lecture #13.

Perception Illusion A false representation of the environment

CAP4730: Computational Structures in Computer Graphics 3D Concepts.

Computer Graphics Psychophysics Heinrich H. Bülthoff Max-Planck-Institute for Biological Cybernetics Tübingen, Germany Heinrich H. Bülthoff Max-Planck-Institute.

Analysis of Lighting Effects Outline: The problem Lighting models Shape from shading Photometric stereo Harmonic analysis of lighting.

Y. Moses 11 Combining Photometric and Geometric Constraints Yael Moses IDC, Herzliya Joint work with Ilan Shimshoni and Michael Lindenbaum, the Technion.

Module 6 Perception.

Lecture 2b Readings: Kandell Schwartz et al Ch 27 Wolfe et al Chs 3 and 4.

VIEWING THE WORLD IN COLOR. COLOR A psychological interpretation Based on wavelength, amplitude, and purity Humans can discriminate among c. 10 million.

1 Perception, Illusion and VR HNRS 299, Spring 2008 Lecture 8 Seeing Depth.

Chapter 6 Section 2: Vision. What we See Stimulus is light –Visible light comes from sun, stars, light bulbs, & is reflected off objects –Travels in the.

Visual motion Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys.

Course 9 Texture. Definition: Texture is repeating patterns of local variations in image intensity, which is too fine to be distinguished. Texture evokes.

Understanding the effect of lighting in images Ronen Basri.

Goal and Motivation To study our (in)ability to detect inconsistencies in the illumination of objects in images Invited Talk! – Hany Farid: Photo Forensincs:

CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.

Photo-realistic Rendering and Global Illumination in Computer Graphics Spring 2012 Material Representation K. H. Ko School of Mechatronics Gwangju Institute.

Perception 1. Inattentional Blindness Challenge: Count the number of passes the white shirts pass! VideoVideo (2mins) Video Type of selective attention.

Perception The process of organizing and interpreting information, enabling us to recognize meaningful objects and events.

Fundamentals of Sensation and Perception RECOGNIZING VISUAL OBJECTS ERIK CHEVRIER NOVEMBER 23, 2015.

1 Perception and VR MONT 104S, Fall 2008 Lecture 12 Illusions.

Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.

Perception How do we define it?

Ch.9 Bayesian Models of Sensory Cue Integration (Mon) Summarized and Presented by J.W. Ha 1.

Sensation and Perception

1Ellen L. Walker 3D Vision Why? The world is 3D Not all useful information is readily available in 2D Why so hard? “Inverse problem”: one image = many.

Perception and VR MONT 104S, Fall 2008 Lecture 8 Seeing Depth

Visual Perception There are two categories of cognitive processes that we use when we assign meaning to incoming information. What are they?

Visual Perception. What is Visual Perception? Visual perception are rules we apply to visual information to assist our organisation and interpretation.

Theories of Vision: a swift overview : Learning-based Methods in Vision A. Efros, CMU, Spring 2007 Most slides from Steve Palmer.

VISUAL PERCEPTION PRINCIPLES By Mikayla. VISUAL PERCEPTION PRINCIPLES  Gestalt principles 1.Closure 2.Proximity 3.Similarity 4.Figure-ground  Depth.

Perception  How do we define it? How we recognize and interpret stimuli How we recognize and interpret stimuli Top down processing… Top down processing…

Perception. The means by which information acquired from the environment via the sense organs is transformed into experiences of objects, events, sounds,

From local motion estimates to global ones - physiology:

MAN-522 Computer Vision Spring

Unit 4: Perceptual Organization and Interpretation

Perception The process of organizing and interpreting information, enabling us to recognize meaningful objects and events.

Prof. Riyadh Al_Azzawi F.R.C.Psych

R. C. James Photograph.

Visual Organization and Interpretation

A preference for global convexity in local shape perception

Prof. Riyadh Al_Azzawi F.R.C.Psych

Visual Motion and the Perception of Surface Material

Module 19 – Visual Organization and Interpretation

Prof. Riyadh Al_Azzawi F.R.C.Psych

Optical flow and keypoint tracking

Shape from Shading and Texture

Presentation transcript:

Visual Perception of 3D Shape Roland W. Fleming Manish Singh Max Planck Institute for Biological Cybernetics Rutgers University – New Brunswick

The problem of 3D perception Bishop Berkeley ( ): "It is I think agreed by all that distance of itself, and immediately, cannot be seen. For distance being a line directed end-wise to the eye, it projects only one point in the fund of the eye, which point remains invariably the same whether the distance be longer or shorter." P1P1 P2P2 P

The optics of the eye project the 3D world onto a 2D image plane on the retina. What we as behaving organisms care about is the 3D structure of the world. Unfortunately the projection from 3D to 2D is not invertible. The problem of 3D perception Image [2D] World [3D]

Multiple surfaces are consistent with any given image, so 3D shape perception is fundamentally ambiguous It is an inference from incomplete information The problem of 3D perception

Ambiguities in 3D Perception Necker Cube 2 dominant interpretations

Ambiguities in 3D Perception 2 dominant interpretations Only a handful of legal interpretations are generally experienced. Why? Note that neither of these two interpretations are correct perspective projections!

Philosophical Schools Constructivism (e.g. Helmholtz, Gregory, Rock) – vision is ill-posed: sensory data are impoverished – the world we see is a construction – perception is a process of inductive inference – Extra-retinal information and assumptions about the world play a central role Direct Perception (e.g. Gibson) – “ambient optic array” contains sufficient information to support action – we perceive the world directly, through active interaction – the relevant information is global and comparative

Philosophical Schools Gestalt Perception (e.g. Koffka, Metzger, Kohler) – vision is all about structure – the interpretation that we experience is determined by the interaction of simple rules describing the organization of the interpretation – The simplest interpretation is favoured: Prägnanz time

Explaining the Necker Cube 2 dominant interpretations Constructivism: the percepts are the most probable interpretations Direct Perception: the relevant image information specifies these interpretations, but such ambiguous images are rarely encountered in the real world, and we normally resolve the ambiguity through interaction Gestalt: the percepts are the simplest, ‘most orderly’ interpretations.

Perception Pipeline image

Perception Pipeline cues image shading texture

Perception Pipeline cues image shading texture shape estimate shape estimate

Perception Pipeline cues priors image shading texture shape estimate shape estimate “Surfaces are generally smooth” “Texture tends to be isotropic” “Light usually comes from above”

Generic Viewpoint Assumption Koenderink & van Doorn (1979). Binford (1981). Freeman (1994).

Image-based material editing Kahn, Reinhard, Fleming & Bülthoff (2006). Transactions on Graphics: Proceedings of SIGGRAPH 06. © ACM SIGGRAPH. transparencyre-textured  Given single photograph as input, modify material appearance of object.  Physically correct solution not possible: aim for ‘perceptually correct’ solution.  Exploit assumptions of human vision to develop heuristics.

Crude Shape Reconstruction Light from the side: shadows and intensity gradient leads to substantial distortions of the face original reconstructed depths

Importance of viewpoint Substantial errors in depth reconstruction are not visible in transformed image transformed image correct viewpoint

Importance of viewpoint

Seen from Above

Hollow Mask Illusion Convexity and familiarity combine to yield a strong sense that the mask is convex, even when it is concave. But note that the apparent lighting and shape is different. convexconcavetransition

Bas-Relief Ambiguity Scenes related to one another by an affine transformation are indistinguishable from one another Belhumeur, Kriegman & Yuille (1997)

Scenes related to one another by an affine transformation are indistinguishable from one another Bas-Relief Ambiguity Belhumeur, Kriegman & Yuille (1997)

Bas-Relief Ambiguity Belhumeur, Kriegman & Yuille (1997) showed that shape from shading information is fundamentally ambiguous. For direct illumination, scenes that are related to one another by an affine transformation (scaling + shearing) yield pixel-for-pixel identical images. Despite this we rarely experience any ambiguity in the perception of shaded objects. Everyday perception gives us the impression that we see objects in a correct and stable way. But do we? Koenderink and colleagues have shown that perceived shape varies considerably from day to day, with the percepts typically related to one another by an affine transformation.

Light from Above In the absence of other information to indicate shape or lighting direction, the brain assumes light comes from above “light” from below “light” from above

Light from Above In the absence of other information to indicate shape or lighting direction, the brain assumes light comes from above “light” from below “light” from above

Linear Perspective

Bounding Contours © Dejan Todorović, Adapted and used with permission

Bounding Contours © Dejan Todorović, Adapted and used with permission

Bounding Contours

Structure from Motion Individual frames carry a relatively weak sense of 3D shape. It is only through optic flow (motion) that the shape is revealed

Pattern of compressions and rarefactions across the image indicates something about the 3D shape. Shape from Texture

Isotropic compression of textures due to distance Shape from Texture

Anisotropic compression of textures due to slant Shape from Texture

Anisotropic compression of textures due to slant

Shape from Texture Anisotropic compression of textures due to slant

Anisotopic compression specifies surface orientation up to a 180° ambiguity on the surface tilt. This means we can experience perceptual flips (bistability) when there are no other cues to specify convexity vs. concavity Under orthographic projection, there is no isotropic compression and no convergence, so we can see the red line as lying either on a ridge or in a valley

Under perspective projection, isotropic compression (scale gradient) and convergence cues resolve the ambiguity. We experience the red line as lying on a ridge, and not on a valley.

Homogeneous: the statistics of the texture are uniform from location to location. This is necessary to ensure that changes in the statistics of the texture observed in the image are due solely to the process of projection into the image plane and are not intrinsic to the texture itself Isotropic: the texture does not have a dominant local orientation. This is necessary to ensure that anisotropic compressions are aligned with the depth gradient of the surface Assumptions in Shape from Texture

Illusory distortions of shape Inspired by Todd & Thaler VSS 05

Illusory distortions of shape

Inspired by Todd & Thaler VSS 05 Illusory distortions of shape

Interaction of light with surface

Matte Glossy Mirrored

Confounding Effects of Illumination Identical materials can lead to very different images Different materials can lead to very similar images Images © Ron O. Dror. All rights reserved.

Ambiguity between illumination and Shape

reflectance mapimage Classical Shape from Shading Visual system estimates surface orientation from image intensity

Classical Shape from Shading reflectance map Image intensity is a scalar but surface orientation is a vector Recovering orientation from intensity is under-constrained Large amount of computer vision research proposing ways to reduce this ambiguity Problem: image intensity is ambiguous:

Visual system estimates surface orientation from image intensity Classical Shape from Shading reflectance map Circular logic: estimating the reflectance map requires knowing the geometry. Under typical viewing conditions, it is unclear how well subjects can estimate the reflectance map. Problem: reflectance map is unknown:

Visual system estimates surface orientation from image intensity Classical Shape from Shading reflectance map There is no principled way of predicting when human shape perception should succeed or fail Successes attributed to correct estimation of reflectance map, errors to incorrect estimates of reflectance map. But why and when should this occur? Problem: predicting human perception

Use image measurements other than intensity Use the kinds of image measurements the visual system employs at the front end Alternative approach reflectance mapimage

Mirrors No stereopsis No diffuse shading No texture Nothing but a distorted reflection of the world surrounding the object! Yet we perceive the 3D shape. How? Fleming, Torralba & Adelson (2004). Journal of Vision.

highly curved Curvatures determine distortions

slightly curved Anisotropies in surface curvature lead to powerful distortions of the reflected world Curvatures determine distortions

Eigenvectors of Hessian matrix Intrinsic principal curvatures

image depths

Population codes

Orientation fields Ground truth

3D shape appears to be conveyed by the continuously varying patterns of orientation across the image of a surface

Beyond specularity Specular reflection Diffuse reflection

Orientations in shading

Orientation fields in shading

Reflectance as Illumination Mirrors in an increasingly blurry world

highly curved

slightly curved Anisotropies in surface curvature lead to anisotropies in the image.

Light Warps Vergne, Pacanowski, Barla, Granier & Schlick (2009). Light Warping for enhanced Surface Depiction in SIGGRAPH ’09: ACM SIGGRAPH 2009 Papers. © ACM SIGGRAPH 2009, All rights reserved.

Light Warps Vergne, Pacanowski, Barla, Granier & Schlick (2009). Light Warping for enhanced Surface Depiction in SIGGRAPH ’09: ACM SIGGRAPH 2009 Papers. © ACM SIGGRAPH 2009, All rights reserved.

Apparent Ridges Judd, Durand & Adelson (2007). Apparent Ridges for Line Drawing. ACM Transactions on Graphics: Proceedings of SIGGRAPH © ACM SIGGRAPH 2007, All rights reserved.

Apparent Ridges Judd, Durand & Adelson (2007). Apparent Ridges for Line Drawing. ACM Transactions on Graphics: Proceedings of SIGGRAPH © ACM SIGGRAPH 2007, All rights reserved.

Texture vs. Reflectance

“Shape from Smear”

Higher level shape properties Neither object is physically unstable (falling over) But: one “affords being toppled” more than the other

Perceived Shape is Multi-Scale Coarse Mid Fine

Perceived Shape is Multi-Scale Lee, C. H., Varshney, A. & Jacobs, D. W., Mesh saliency, in SIGGRAPH '05: ACM SIGGRAPH 2005 Papers, pp (New York, NY, USA: ACM, 2005). © ACM SIGGRAPH 2005, All rights reserved. Mesh Saliency

Perceived Shape is Multi-Scale Lee, C. H., Varshney, A. & Jacobs, D. W., Mesh saliency, in SIGGRAPH '05: ACM SIGGRAPH 2005 Papers, pp (New York, NY, USA: ACM, 2005). © ACM SIGGRAPH 2005, All rights reserved. Coarse spatial scaleFine spatial scale Applications : Level of Detail Hiding Watermarks Viewpoint selection

Conclusions There are many different cues to 3D shape, which the human visual system can draw on under typical viewing conditions. Most cues are ambiguous or unreliable if considered in isolation. The secret of conveying shape effectively is to provide multiple cues. Orientation fields may be an important common language in human shape processing. There are probably many other applications in CG that can exploit this. Many of the assumptions made by human vision can be exploited in a computer graphics applications. Richer, more perceptual representations of geometry are an exciting challenge for the future.