CSE 415 -- (c) S. Tanimoto, 2008 Image Understanding I 1 Image Understanding 1 Outline: Saccade Art: What's going on? Vision and Intelligence Motivating.

Slides:

Advertisements

Similar presentations

September 2, 2014Computer Vision Lecture 1: Human Vision 1 Welcome to CS 675 – Computer Vision Fall 2014 Instructor: Marc Pomplun Instructor: Marc Pomplun.

Advertisements

Read Land’s article about color vision for Tuesday next week.

5/13/2015CAM Talk G.Kamberova Computer Vision Introduction Gerda Kamberova Department of Computer Science Hofstra University.

Current Trends in Image Quality Perception Mason Macklem Simon Fraser University

Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 1 Vision in Humans and Machines September 10, 2009.

CS 561, Sessions 27 1 Towards intelligent machines Thanks to CSCI561, we now know how to… - Search (and play games) - Build a knowledge base using FOL.

Vision Computing An Introduction. Visual Perception Sight is our most impressive sense. It gives us, without conscious effort, detailed information about.

2007Theo Schouten1 Introduction. 2007Theo Schouten2 Human Eye Cones, Rods Reaction time: 0.1 sec (enough for transferring 100 nerve.

Fitting a Model to Data Reading: 15.1,

Ch 31 Sensation & Perception Ch. 3: Vision © Takashi Yamauchi (Dept. of Psychology, Texas A&M University) Main topics –convergence –Inhibition, lateral.

CS292 Computational Vision and Language Visual Features - Colour and Texture.

Digital Images The nature and acquisition of a digital image.

1B50 – Percepts and Concepts Daniel J Hulme. Outline Cognitive Vision –Why do we want computers to see? –Why can’t computers see? –Introducing percepts.

Sensation Interacting with our environment. What’s the difference? Sensation Interaction between the body-environment the reception of physical stimulation.

November 29, 2004AI: Chapter 24: Perception1 Artificial Intelligence Chapter 24: Perception Michael Scherger Department of Computer Science Kent State.

A Brief Overview of Computer Vision Jinxiang Chai.

SCCS 4761 Introduction What is Image Processing? Fundamental of Image Processing.

Sensation and Perception Part 1: Intro and Vision.

Perception Illusion A false representation of the environment

Active Vision Key points: Acting to obtain information Eye movements Depth from motion parallax Extracting motion information from a spatio-temporal pattern.

Perception Introduction Pattern Recognition Image Formation

Myers EXPLORING PSYCHOLOGY Module 14 Introduction to Sensation and Perception: Vision James A. McCubbin, PhD Clemson University Worth Publishers.

.  Sensation: process by which our sensory receptors and nervous system receive and represent stimulus energy  Perception: process of organizing and.

Lecture 2b Readings: Kandell Schwartz et al Ch 27 Wolfe et al Chs 3 and 4.

The Eye contains visual sensory receptors focuses light on the retina

1 Computational Vision CSCI 363, Fall 2012 Lecture 20 Stereo, Motion.

Ch 31 Sensation & Perception Ch. 3: Vision © Takashi Yamauchi (Dept. of Psychology, Texas A&M University) Main topics –convergence –Inhibition, lateral.

Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.

Sensation Vision The Eye Theories Hearing The Ear Theories Other Senses Smell Taste Pain Gestalt Principles Perceptual Constancies Perception Basic Principles.

Chapter 6 Section 2: Vision. What we See Stimulus is light –Visible light comes from sun, stars, light bulbs, & is reflected off objects –Travels in the.

CS 4487/6587 Algorithms for Image Analysis

CIS 601 Image Fundamentals Longin Jan Latecki Slides by Dr. Rolf Lakaemper.

Chapter 2: Digital Image Fundamentals Spring 2006, 劉震昌.

1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.

CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.

1 Computational Vision CSCI 363, Fall 2012 Lecture 5 The Retina.

September 3, 2013Computer Vision Lecture 1: Human Vision 1 Welcome to CS 675 – Computer Vision Fall 2013 Instructor: Marc Pomplun Instructor: Marc Pomplun.

Mind, Brain & Behavior Wednesday February 19, 2003.

1 Perception and VR MONT 104S, Fall 2008 Lecture 2 The Eye.

Autonomous Robots Vision © Manfred Huber 2014.

Vocab Theories & Laws Anatomical Structures Other Senses Perceptual Organization $100 $500 $400 $300 $200.

Visual Computing Computer Vision 2 INFO410 & INFO350 S2 2015

1 Machine Vision. 2 VISION the most powerful sense.

Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.

October 16, 2014Computer Vision Lecture 12: Image Segmentation II 1 Hough Transform The Hough transform is a very general technique for feature detection.

Image Perception ‘Let there be light! ‘. “Let there be light”

Perception and VR MONT 104S, Fall 2008 Lecture 8 Seeing Depth

Intelligent Robotics Today: Vision & Time & Space Complexity.

Sensation and Perception. Transformation of stimulus energy into a meaningful understanding –Each sense converts energy into awareness.

Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.

Image Perception ‘Let there be light! ‘. “Let there be light”

A Plane-Based Approach to Mondrian Stereo Matching

Image Processing Objectives To understand pixel based image processing

- photometric aspects of image formation gray level images

Rozi Xu & Daniil Kolesnikov

Introduction to Computer and Human Vision

Perceptual Constancies

Mind, Brain & Behavior Wednesday February 12, 2003.

CSE (c) S. Tanimoto, 2002 Image Understanding

CMSC 426: Image Processing (Computer Vision)

Vision: Inferring Information from Clues

Midterm Exam Closed book, notes, computer Similar to test 1 in format:

CSE (c) S. Tanimoto, 2007 Image Understanding

Filtering An image as a function Digital vs. continuous images

CSE (c) S. Tanimoto, 2001 Image Understanding

Vision: Inferring Information from Clues

CSE (c) S. Tanimoto, 2004 Image Understanding

Introduction to Artificial Intelligence Lecture 22: Computer Vision II

Presentation transcript:

CSE (c) S. Tanimoto, 2008 Image Understanding I 1 Image Understanding 1 Outline: Saccade Art: What's going on? Vision and Intelligence Motivating applications and ideas Human vision and illusions Image representation: Sampling, Quantization, Thresholding Stereo vision as an AI problem Stereograms, Geometry of stereograms, Computing correspondences Letting cues vote for hypotheses: Polar representation of a line, Hough transform

CSE (c) S. Tanimoto, 2008 Image Understanding I 2 Saccade Art A great example is at the San Francisco Exploratorium. An online example is at Block off all but one stripe. Can you still see the effect? How few stripes are enough for you? Is it an image? Is there any image?

CSE (c) S. Tanimoto, 2008 Image Understanding I 3 Is Vision Part of Intelligence?  25% of the brain by volume is concerned with vision. This is Brodmann area 17, part of the striate (visual) cortex. image from:

CSE (c) S. Tanimoto, 2008 Image Understanding I 4 Vision Requires Intelligence 1.The image is usually missing relevant information. 2.The gaps must be filled in by making inferences using knowledge and context. 3.How are these inferences made? At many levels: At the retinal level, relevant structure is extracted from "receptive fields". The brain sends expectations to the eyes. An intermediate level theory is the Marr "Primal Sketch". High level: object-recognition. 4.Active vision is the use of motion, including eye movements, to gather scene data – the eyes must be controlled by the brain. 5."Visual thinking" includes spatial reasoning, navigation, pattern recognition, associative memory based on visual aspects, and perception itself. 6.Specific phenomena that reflect on visual intelligence include: unawareness of blind spots, visual illusions, pareidolia, hallucination.

CSE (c) S. Tanimoto, 2008 Image Understanding I 5 Motivation Allow computer and robots to read books. Allow mobile robots to navigate using vision. Support applications in industrial inspection, medical image analysis, security and surveillance, and remote sensing of the environment. Permit computers to recognize users’ faces, fingerprints, and to track them in various environments. Provide prostheses for the blind. Develop artistic intelligence.

CSE (c) S. Tanimoto, 2008 Image Understanding I 6 Human Vision 25% of brain volume is allocated to visual perception. Human vision is a parallel & distributed system, involving 2 eyes, retinal processing, and multiple layers of processing in the striate cortex. Most humans are trichromats and they perceive color in a 3-D color space (except for bichromats and monochromats). Vision provides a high-bandwidth input mechanism... “a picture is worth 1000 words.”

CSE (c) S. Tanimoto, 2008 Image Understanding I 7 The Human Eye

CSE (c) S. Tanimoto, 2008 Image Understanding I 8 Retina: Cross section (a) schematic (b) photo

CSE (c) S. Tanimoto, 2008 Image Understanding I 9 Densities of Rods and Cones

CSE (c) S. Tanimoto, 2008 Image Understanding I 10 Visual Pathway

CSE (c) S. Tanimoto, 2008 Image Understanding I 11 Visual Illusions Help us understand the limits of human perception the processes of perception ways to produce effects in art and architecture possible approaches to artificial perception

CSE (c) S. Tanimoto, 2008 Image Understanding I 12 Visual Illusions They provide insights about the nature of the human visual system, helping us understand how it works. Mueller-Lyer illusion

CSE (c) S. Tanimoto, 2008 Image Understanding I 13 Herman Grid Illusion

CSE (c) S. Tanimoto, 2008 Image Understanding I 14 Herman Grid Illusion (dark on light)

CSE (c) S. Tanimoto, 2008 Image Understanding I 15 Subjective Contour (Triangle)

CSE (c) S. Tanimoto, 2008 Image Understanding I 16

CSE (c) S. Tanimoto, 2008 Image Understanding I 17 ?

CSE (c) S. Tanimoto, 2008 Image Understanding I 18 Dalmation Illusion Camouflage vs Acute Perception Hyperacute perception: Hallucination Pareidolia: Attributing significance to patterns perceived in random arrangements (UFO, visions in the twilight, etc) From

CSE (c) S. Tanimoto, 2008 Image Understanding I 19 Perception: Stimulus + Expectation Image understanding is the interpretation of visual stimuli using context and knowledge. IU by computer normally begins with digital images from a camera.

CSE (c) S. Tanimoto, 2008 Image Understanding I 20 Image Understanding Outline: Saccade Art: What's going on? Vision and Intelligence Motivating applications and ideas Human vision and illusions Image representation: Sampling, Quantization, Thresholding Stereo vision as an AI problem Stereograms, Geometry of stereograms, Computing correspondences Letting cues vote for hypotheses: Polar representation of a line, Hough transform

CSE (c) S. Tanimoto, 2008 Image Understanding I 21 Image Representation Sampling: Number and density of “pixel” measurements Quantization: Number of levels permitted in pixel values.

CSE (c) S. Tanimoto, 2008 Image Understanding I 22 Image Representation (cont.) Sampling: e.g., 4 by 4, square grid, 1 pixel/cm Quantization: e.g., binary, {0, 1}, 0 = black, 1 = white

CSE (c) S. Tanimoto, 2008 Image Understanding I 23 Aliasing due to Under-sampling Here the apparent frequency is about 1/5 the true frequency.

CSE (c) S. Tanimoto, 2008 Image Understanding I 24 Shannon/Nyquist Sampling A band is a range of frequency values. (But sometimes it's defined as a range of wavelengths.) A signal that is bandlimited to band B has no frequency components outside of B. Now, we'll assume B = [0, f max ] Theorem: If a continuous signal z(t) is bandlimited to B, then it is possible to sample it at a frequency of f s > 2 f max such that z(t) can be perfectly reconstructed from the samples. 2 f max is called the Nyquist rate. f s / 2 is called the Nyquist frequency, and depends on f s.

CSE (c) S. Tanimoto, 2008 Image Understanding I 25 Minimal Sampling Let P be an oscillating pattern in an image. To capture P in a sampled representation, you need (1)The rest of the image to be bandlimited to P's frequency, and you need either (2a) two samples per cycle and luck (the phase must be right), or (2b) more than two samples per cycle.

CSE (c) S. Tanimoto, 2008 Image Understanding I 26 Quantization Capturing a wide dynamic range of brightness levels or colors requires fine quantization. Common is 256 levels of each of red, green and blue. Segmentation is simplified by having a small number of levels -- provided foreground and background pixels are reliably distinguished by their dark or light value. Grayscale thresholding is typically to used to reduce the number of quantization levels to 2.

CSE (c) S. Tanimoto, 2008 Image Understanding I 27 Vision as Inferring Information from Clues Deriving 3D structure from 2D info requires additional information: e.g., constraints. Deriving global descriptions from local data requires information fusion, i.e., inference.

CSE (c) S. Tanimoto, 2008 Image Understanding I 28 Stereo Vision as an AI Problem Projection from 3 dimension to 2 loses information. With 2 projections, we can gain back some of that information. Recovering the missing information is an inference problem. The missing information is constrained by knowledge about the real world and assumptions about the scene. The use of knowledge and assumptions to make inferences is a standard approach in artificial intelligence.

CSE (c) S. Tanimoto, 2008 Image Understanding I 29 Stereo and Stereograms A stereogram can help us understand what information is required by a human to make convincing inferences about depth. This can provide a model for a stereo image understanding system.

CSE (c) S. Tanimoto, 2008 Image Understanding I 30 Stereograms Two-view stereograms: 1. spatially separated left-eye/right-eye pair (including virtual-reality goggles) 2. superimposed, with separation using color filters. 3. superimposed, with temporal shuttering. 4. superimposed, with separation using polarizing filters. Single-view stereograms: 1. Magic-eye pictures with depth-modulated carrier. 2. Wallpaper offering depth effects due to its periodicity.

CSE (c) S. Tanimoto, 2008 Image Understanding I 31 Geometry of Stereograms

CSE (c) S. Tanimoto, 2008 Image Understanding I 32 Computing Correspondence Approach 1: Extract features and find a consistent matching of features in each view. Approach 2: Directly compute a disparity map, performing local correlations of the views.

CSE (c) S. Tanimoto, 2008 Image Understanding I 33 Processing Incomplete and Uncertain Evidence: How it's sometimes handled in image understanding Case study: the Hough Transform (rhymes with "rough France dorm")

CSE (c) S. Tanimoto, 2008 Image Understanding I 34 Inferring Trends via Voting Methods The classical Hough Transform identifies prominent lines in a scene by letting each edge point vote for the line(s) it is on. Voting methods can do well under noisy conditions. Votes are tallied in an array of accumulators, indexed by theta and rho (polar parameters of a line). ρ = x cos θ + y sin θ.

CSE (c) S. Tanimoto, 2008 Image Understanding I 35 Letting a Point Vote for all the Lines that Pass Through It

CSE (c) S. Tanimoto, 2008 Image Understanding I 36 Hough Transform: Polar representation ρ = x cos θ + y sin θ. ρ θ (x, y) (0, 0)

CSE (c) S. Tanimoto, 2008 Image Understanding I 37 Hough Transform (Cont.) nondirectional, unweighted Hough Transform: H(θ,ρ) = Σ Σ f(x,y) δ(x cos θ + y sin θ - ρ). δ (z) = 1 if | z | < 1 0 otherwise

CSE (c) S. Tanimoto, 2008 Image Understanding I 38 H.T. Peak Detection After vote accumulation: Apply smoothing to suppress non-dominant peaks. Extract peaks. Trace lines in image space to determine endpoints.