MSU CSE 803 Stockman1 CV: Perceiving 3D from 2D Many cues from 2D images enable interpretation of the structure of the 3D world producing them.

Slides:



Advertisements
Similar presentations
Saito, T. and Takahashi, T. Comprehensible Rendering of 3-D Shapes Proc. of SIGGRAPH '90 Genesis of Image Space NPR.
Advertisements

Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #12.
CS 376b Introduction to Computer Vision 04 / 21 / 2008 Instructor: Michael Eckmann.
Lecture 18 – Recognition 1. Visual Recognition 1)Contours 2)Objects 3)Faces 4)Scenes.
The Invisible surface that Glows chapter 3 *Jane * Patty * Rath*
Perception The process of organizing and interpreting information, enabling us to recognize meaningful objects and events.
Imaging Science FundamentalsChester F. Carlson Center for Imaging Science The Human Visual System Part 2: Perception.
Extra Credit for Participating in Experiments Go to to sign upwww.tatalab.ca We are recruiting for a variety of experiments including basic.
PSYC 1000 Lecture 21. Selective Attention: Stroop.
MECHANICAL DRAWING MULTIVIEW DRAWINGS.
Plain Mirror Style SNC2D
Stockman MSU/CSE Fall Computer Vision and Human Perception A brief intro from an AI perspective.
2002/02/12PSYC202, Term 2, Copyright Jason Harrison Recovery of World Structure: Art and Image.
EE663 Image Processing Edge Detection 2 Dr. Samir H. Abdul-Jauwad Electrical Engineering Department King Fahd University of Petroleum & Minerals.
Imaging Science FundamentalsChester F. Carlson Center for Imaging Science Binocular Vision and The Perception of Depth.
1 Introduction to 3D Imaging: Perceiving 3D from 2D Images How can we derive 3D information from one or more 2D images? There have been 2 approaches: 1.
PH 103 Dr. Cecilia Vogel Lecture 2. RECALL OUTLINE  Ray model & depth perception  Reflection  law of reflection  mirror images  Polarization REMINDER.
1 Lecture 11 Scene Modeling. 2 Multiple Orthographic Views The easiest way is to project the scene with parallel orthographic projections. Fast rendering.
Stockman MSU Fall Computing Motion from Images Chapter 9 of S&S plus otherwork.
CV: 3D sensing and calibration
© 2004 by Davi GeigerComputer Vision March 2004 L1.1 Binocular Stereo Left Image Right Image.
MSU CSE 803 Fall 2008 Stockman1 CV: 3D sensing and calibration Coordinate system changes; perspective transformation; Stereo and structured light.
PH 103 Dr. Cecilia Vogel Lecture 2. RECALL OUTLINE  Ray model & depth perception  Reflection  law of reflection  mirror images  Polarization REMINDER.
Inflating an Artist’s Sketch Chapter 2: Hoffman Nicole J., Victor W., and Nicole T.
May 2004Stereo1 Introduction to Computer Vision CS / ECE 181B Tuesday, May 11, 2004  Multiple view geometry and stereo  Handout #6 available (check with.
Three-Dimensional Concepts
Reading Gregory 24 th Pinker 26 th. Seeing Depth What’s the big problem with seeing depth ?
Transformations, Constructions and 3D Drawings
Introduction to 3D Graphics John E. Laird. Basic Issues u Given a internal model of a 3D world, with textures and light sources how do you project it.
1 Perceiving 3D from 2D Images How can we derive 3D information from one or more 2D images? There have been 2 approaches: 1. intrinsic images: a 2D representation.
Careers for Psychology and Neuroscience Majors Oct. 19th5-7pm in SU 300 Ballroom B.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #18.
November 29, 2004AI: Chapter 24: Perception1 Artificial Intelligence Chapter 24: Perception Michael Scherger Department of Computer Science Kent State.
Computer Vision Spring ,-685 Instructor: S. Narasimhan PH A18B T-R 10:30am – 11:50am Lecture #13.
Elements of Design. What are they? Line Colour Attributes Shape Categories Space Form.
Design Elements Form & Value To understand and apply the design elements Form & Value.
Recap Low Level Vision –Input: pixel values from the imaging device –Data structure: 2D array, homogeneous –Processing: 2D neighborhood operations Histogram.
CAP4730: Computational Structures in Computer Graphics 3D Concepts.
FINE ARTS LESSON 2 ELEMENTS OF ART
Photography Elements of Design. Line Uses for lines in design Contour line: A line that defines or bounds an edge, but not always the outside edge,
1 Reflection and Mirrors. 2 The Law of Reflection “ The angle of incidence equals the angle of reflection.”
LIGHT AND ITS USES. LIGHT RAY STRAIGHT LINE PATH OF LIGHT.
Shape from Contour Recover 3D information. Muller-Lyer illusion.
Characteristics & Ray Diagrams
Plain Mirror Style SNC2D. Angles – What’s It All Mean?
1 Perception, Illusion and VR HNRS 299, Spring 2008 Lecture 8 Seeing Depth.
3D Shape Inference Computer Vision No.2-1. Pinhole Camera Model the camera center Principal axis the image plane.
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.
(c) 2000, 2001 SNU CSE Biointelligence Lab Finding Region Another method for processing image  to find “regions” Finding regions  Finding outlines.
SANTANA DRAFTING MULTIVIEWS 1 An introduction Mr. Fitzgerald Rm
Shade & Shadow Figure 1-3, Page 11
1Ellen L. Walker 3D Vision Why? The world is 3D Not all useful information is readily available in 2D Why so hard? “Inverse problem”: one image = many.
Perception and VR MONT 104S, Fall 2008 Lecture 8 Seeing Depth
CIS Intro to AI 1 Interpreting Line Drawings (An Introduction to Constraint Satisfaction) Mitch Marcus CIS 391 Fall 2008.
Machine Vision Edge Detection Techniques ENT 273 Lecture 6 Hema C.R.
Digital Image Processing CSC331
Perception  How do we define it? How we recognize and interpret stimuli How we recognize and interpret stimuli Top down processing… Top down processing…
MAN-522 Computer Vision Spring
3D Vision Yang Wang National ICT Australia
Unit 4: Perceptual Organization and Interpretation
Three-Dimensional Concepts. Three Dimensional Graphics  It is the field of computer graphics that deals with generating and displaying three dimensional.
Fourier Transform: Real-World Images
The Elements of Art.
Depthmap Reconstruction Based on Monocular cues
3D Shape Inference Computer Vision No.2-1.
Vision: Scene Labelling
Perspective.
Vision: Scene Labelling
MIRRORS Chrome text with reflection (Advanced)
Presentation transcript:

MSU CSE 803 Stockman1 CV: Perceiving 3D from 2D Many cues from 2D images enable interpretation of the structure of the 3D world producing them

MSU CSE 803 Stockman2 Topic roadmap Labeling 3D structure in a 2D image Labeling constraints on edge graphs Huffman-Clowes-Waltz labeling Other cues motion parallax shape from texture or shading stereo from two 2d images

MSU CSE 803 Stockman3 Topic roadmap: mathematical models Shape from shading Depth from stereo Depth from focus Perspective transformation (review)

MSU CSE 803 Stockman4 Many 3D cues How can humans and other machines reconstruct the 3D nature of a scene from 2D images? What other world knowledge needs to be added in the process?

MSU CSE 803 Stockman5 Vocabulary for image labeling Interpret the local structure of the scene in the image space

MSU CSE 803 Stockman6 Some terms for local 3D structure Crease: (convex + or concave -) formed by an abrupt change to a surface or the joining of two surfaces. Surface on both sides of the crease can be sensed Blade: ( > ) as in the blade of a knife, where the normal to the occluding surface element. Occluding and occluded surfaces unrelated. (left) intensity image of 3 blocks (right) result of 5x5 Prewitt operator

MSU CSE 803 Stockman7 Limb: smooth object contour Limb: (>>) formed by viewing a smooth 3D object, such as an arm or a soup can: when approaching the contour, the surface normal becomes perpendicular to the line of sight. (Right side of arrow is the occluding surface.) egg Soup can

MSU CSE 803 Stockman8 Albedo and lighting mark: (M) surface mark or change of “albedo” (reflectance) and not the 3d surface, creating an intensity contour in the image shading: (S) illumination change due to a change in lighting or shadow on the surface, creating an intensity contour in the image

MSU CSE 803 Stockman9 Labeling image contours interprets the 3D scene structure + An egg and a thin cup on a table top lighted from the top right Logo on cup is a “mark” on the material “shadow” relates to illumination, not material

MSU CSE 803 Stockman10 “Intrinsic Image” stores 3D info in “pixels” and not intensity. For each point of the image, we want depth to the 3D surface point, surface normal at that point, albedo of the surface material, and illumination of that surface point.

MSU CSE 803 Stockman11 Practice labeling the contours (left) an unopened can of Brand X soda is a solid blue can with a bright orange block ‘X’. (right) an empty box with all four flaps open so even the bottom of the box is visible.

MSU CSE 803 Stockman12 3D scene versus 2D image Creases Corners Faces Occlusions (for some viewpoint) Edges Junctions Regions Blades, limbs, T’s

MSU CSE 803 Stockman13 Labeling of simple polyhedra Labeling of a block floating in space. BJ and KI are convex creases. Blades AB, BC, CD, etc model the occlusion of the background. Junction K is a convex trihedral corner. Junction D is a T- junction modeling the occlusion of blade CD by blade JE.

MSU CSE 803 Stockman14 Trihedral Blocks World Image Junctions: only 16 cases! Only 16 possible junctions in 2D formed by viewing 3D corners formed by 3 planes and viewed from a general viewpoint! From top to bottom: L-junctions, arrows, forks, and T-junctions.

MSU CSE 803 Stockman15 Challenge ! Create a scene shot of no more than 2 blocks that creates all 16 junctions in the image

MSU CSE 803 Stockman16 How do we obtain the catalog? think about solid/empty assignments to the 8 octants about the X-Y-Z-origin think about non-accidental viewpoints account for all possible topologies of junctions and edges then handle T-junction occlusions

MSU CSE 803 Stockman17 Blocks world labeling Left: block floating in space Right: block glued to a wall at the back

MSU CSE 803 Stockman18 Try labeling these: interpret the 3D structure, then label parts What does it mean if we can’t label them? If we can label them?

MSU CSE 803 Stockman19 Waltz filtering discards edge interpretations spanning junctions

MSU CSE 803 Stockman researchers very excited very strong constraints on interpretations several hundred in catalogue when cracks and shadows allowed (Waltz): algorithm works very well with them but, world is not made of blocks! later on, curved blocks world work done but not as interesting

MSU CSE 803 Stockman21 Backtracking or interpretation tree

MSU CSE 803 Stockman22 Interpretation tree search

MSU CSE 803 Stockman23 “Necker cube” has multiple interpretations A human staring at one of these cubes typically experiences changing interpretations. The interpretation of the two forks (G and H) flip-flops between “front corner” and “back corner”. What is the explanation? Label the different interpretations

MSU CSE 803 Stockman24 Depth cues in 2D images

MSU CSE 803 Stockman25 “Interposition” cue Def: Interposition occurs when one object occludes another object, thus indicating that the occluding object is closer to the viewer than the occluded object.

MSU CSE 803 Stockman26 interposition T-junctions indicate occlusion: top is occluding edge while bar is the occluded edge Bench occludes lamp post leg occludes bench lamp post occludes fence railing occludes trees trees occlude steeple

MSU CSE 803 Stockman27 Perspective scaling: railing looks smaller at the left; bench looks smaller at the right; 2 steeples are far away Forshortening: the bench is sharply angled relative to the viewpoint; image length is affected accordingly

MSU CSE 803 Stockman28 Texture gradient reveals surface orientation Texture Gradient: change of image texture along some direction, often corresponding to a change in distance or orientation in the 3D world containing the objects creating the texture. ( In East Lansing, we call it “corn” not “maize’. ) Note also that the rows appear to converge in 2D

MSU CSE 803 Stockman29 3D Cues from Perspective

MSU CSE 803 Stockman30 3D Cues from perspective

MSU CSE 803 Stockman31 More 3D cues Virtual linesFalsely perceived interposition

MSU CSE 803 Stockman32 More 3D cues 2D alignment usually means 3d alignment 2D image curves create perception of 3D surface

MSU CSE 803 Stockman33 “structured light” can enhance surfaces in industrial vision Sculpted object Potatoes with light stripes

MSU CSE 803 Stockman34 Shape (normals) from shading Cylinder with white paper and pen stripes Intensities plotted as a surface Clearly intensity encodes shape in this case

MSU CSE 803 Stockman35 Shape (normals) from shading Plot of intensity of one image row reveals the 3D shape of these diffusely reflecting objects.