CSE (c) S. Tanimoto, 2007 Image Understanding

Slides:



Advertisements
Similar presentations
A Graph based Geometric Approach to Contour Extraction from Noisy Binary Images Amal Dev Parakkat, Jiju Peethambaran, Philumon Joseph and Ramanathan Muthuganapathy.
Advertisements

5/13/2015CAM Talk G.Kamberova Computer Vision Introduction Gerda Kamberova Department of Computer Science Hofstra University.
Introduction to Computer and Human Vision Shimon Ullman, Ronen Basri, Michal Irani Assistants: Lena Gorelick Denis Simakov.
CS 561, Sessions 27 1 Towards intelligent machines Thanks to CSCI561, we now know how to… - Search (and play games) - Build a knowledge base using FOL.
Overview of Computer Vision CS491E/791E. What is Computer Vision? Deals with the development of the theoretical and algorithmic basis by which useful.
Introduction to Computer and Human Vision Shimon Ullman, Ronen Basri, Michal Irani Assistants: Tal Hassner Eli Shechtman.
Vision Computing An Introduction. Visual Perception Sight is our most impressive sense. It gives us, without conscious effort, detailed information about.
Introduction to Computer Vision 3D Vision Topic 9 Stereo Vision (I) CMPSCI 591A/691A CMPSCI 570/670.
CSE (c) S. Tanimoto, 2008 Image Understanding II 1 Image Understanding 2 Outline: Guzman Scene Analysis Local and Global Consistency Edge Detection.
Color a* b* Brightness L* Texture Original Image Features Feature combination E D 22 Boundary Processing Textons A B C A B C 22 Region Processing.
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 8: Stereoscopic Vision 1 Computational Architectures in Biological.
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Information that lets you recognise a region.
CSE (c) S. Tanimoto, 2008 Image Understanding I 1 Image Understanding 1 Outline: Saccade Art: What's going on? Vision and Intelligence Motivating.
CSE473/573 – Stereo Correspondence
Digital Images The nature and acquisition of a digital image.
November 29, 2004AI: Chapter 24: Perception1 Artificial Intelligence Chapter 24: Perception Michael Scherger Department of Computer Science Kent State.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
A Brief Overview of Computer Vision Jinxiang Chai.
SCCS 4761 Introduction What is Image Processing? Fundamental of Image Processing.
December 4, 2014Computer Vision Lecture 22: Depth 1 Stereo Vision Comparing the similar triangles PMC l and p l LC l, we get: Similarly, for PNC r and.
DIGITAL IMAGE PROCESSING
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
CS 4487/6587 Algorithms for Image Analysis
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.
The Seven Elements of Art Ms. Hanson/ART 1. Definition of The Elements of Art The elements of art are a commonly used group of aspects of a work of art.
Mind, Brain & Behavior Wednesday February 19, 2003.
Autonomous Robots Vision © Manfred Huber 2014.
IEEE Robot Team Vision System Project Michael Slutskiy & Paul Nguyen ECE 533 Presentation.
Visual Computing Computer Vision 2 INFO410 & INFO350 S2 2015
1 Machine Vision. 2 VISION the most powerful sense.
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
Course14 Dynamic Vision. Biological vision can cope with changing world Moving and changing objects Change illumination Change View-point.
Image Perception ‘Let there be light! ‘. “Let there be light”
Perception and VR MONT 104S, Fall 2008 Lecture 8 Seeing Depth
1 Computational Vision CSCI 363, Fall 2012 Lecture 16 Stereopsis.
Edge Segmentation in Computer Images CSE350/ Sep 03.
1 Review and Summary We have covered a LOT of material, spending more time and more detail on 2D image segmentation and analysis, but hopefully giving.
Computational Vision CSCI 363, Fall 2012 Lecture 17 Stereopsis II
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
Image Perception ‘Let there be light! ‘. “Let there be light”
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
A Plane-Based Approach to Mondrian Stereo Matching
Image Processing Objectives To understand pixel based image processing
- photometric aspects of image formation gray level images
Computational Vision CSCI 363, Fall 2016 Lecture 15 Stereopsis
Miguel Tavares Coimbra
Fitting: The Hough transform
Vision.
VIEWING THE WORLD IN COLOR
CIS 601 Image Fundamentals
Introduction to Computer and Human Vision
Common Classification Tasks
Fitting Curve Models to Edges
Perceptual Constancies
CIS 595 Image Fundamentals
Perception We have previously examined the sensory processes by which stimuli are encoded. Now we will examine the ultimate purpose of sensory information.
Binocular Stereo Vision
CSE (c) S. Tanimoto, 2002 Image Understanding
Image Perception and Color Space
Vision: Inferring Information from Clues
Introduction Computer vision is the analysis of digital images
CSE (c) S. Tanimoto, 2001 Image Understanding
Vision: Inferring Information from Clues
CSE (c) S. Tanimoto, 2004 Image Understanding
Occlusion and smoothness probabilities in 3D cluttered scenes
Introduction to Artificial Intelligence Lecture 22: Computer Vision II
DIGITAL IMAGE PROCESSING Elective 3 (5th Sem.)
Presentation transcript:

CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding Outline: Motivation Human vision and illusions Image representation: Sampling, Quantization, Thresholding Stereo vision as an AI problem Stereograms, Geometry of stereograms, Computing correspondences Letting cues vote for hypotheses: Polar representation of a line, Hough transform Gestalt grouping CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding Motivation Allow computer and robots to read books. Allow mobile robots to navigate using vision. Support applications in industrial inspection, medical image analysis, security and surveillance, and remote sensing of the environment. Permit computers to recognize users’ faces, fingerprints, and to track them in various environments. Provide prostheses for the blind. Develop artistic intelligence. CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding Human Vision 25% of brain volume is allocated to visual perception. Human vision is a parallel & distributed system, involving 2 eyes, retinal processing, and multiple layers of processing in the striate cortex. Most humans are trichromats and they perceive color in a 3-D color space (except for bichromats and monochromats). Vision provides a high-bandwidth input mechanism... “a picture is worth 1000 words.” CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding Visual Illusions They provide insights about the nature of the human visual system, helping us understand how it works. Mueller-Lyer illusion CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding Herman Grid Illusion CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Herman Grid Illusion (dark on light) CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Subjective Contour (Triangle) CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding Image Representation Sampling: Number and density of “pixel” measurements Quantization: Number of levels permitted in pixel values. CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Image Representation (cont.) Sampling: e.g., 4 by 4, square grid, 1 pixel/cm Quantization: e.g., binary, {0, 1}, 0 = black, 1 = white. 1 1 1 1 1 1 CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Aliasing due to Under-sampling Here the apparent frequency is about 1/5 the true frequency. CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding Quantization Capturing a wide dynamic range of brightness levels or colors requires fine quantization. Common is 256 levels of each of red, green and blue. Segmentation is simplified by having a small number of levels -- provided foreground and background pixels are reliably distinguished by their dark or light value. Grayscale thresholding is typically to used to reduce the number of quantization levels to 2. CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Vision as Inferring Information from Clues Deriving 3D structure from 2D info requires additional information: e.g., constraints. Deriving global descriptions from local data requires information fusion, i.e., inference. CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Stereo Vision as an AI Problem Projection from 3 dimension to 2 loses information. With 2 projections, we can gain back some of that information. Recovering the missing information is an inference problem. The missing information is constrained by knowledge about the real world and assumptions about the scene. The use of knowledge and assumptions to make inferences is a standard approach in artificial intelligence. CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding Stereograms Two-view stereograms: 1. spatially separated left-eye/right-eye pair (including virtual-reality goggles) 2. superimposed, with separation using color filters. 3. superimposed, with temporal shuttering. 4. superimposed, with separation using polarizing filters. Single-view stereograms: 1. Magic-eye pictures with depth-modulated carrier. 2. Wallpaper offering depth effects due to its periodicity. CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Geometry of Stereograms CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Computing Correspondence Approach 1: Extract features and find a consistent matching of features in each view. Approach 2: Directly compute a disparity map, performing local correlations of the views. CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Inferring Trends via Voting Methods The classical Hough Transform identifies prominent lines in a scene by letting each edge point vote for the line(s) it is on. Voting methods can do well under noisy conditions. Votes are tallied in an array of accumulators, indexed by theta and rho (polar parameters of a line). ρ = x cos θ + y sin θ. CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Letting a Point Vote for all the Lines that Pass Through It CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Hough Transform: Polar representation ρ = x cos θ + y sin θ. (x, y) ρ (0, 0) θ CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

Hough Transform (Cont.) nondirectional, unweighted Hough Transform: H(θ,ρ) = Σ Σ f(x,y) δ(x cos θ + y sin θ - ρ). δ(x) = 1 if | x | < 1 0 otherwise CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding Gestalt Grouping CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding

CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding Gestalt Grouping Texture element = “texel” Texel directionality Texel granularity Alignments of endpoints Spacing of texels Groups cue for surfaces, objects. CSE 415 -- (c) S. Tanimoto, 2007 Image Understanding