CS 496: Computer Vision Thanks to Chris Bregler. CS 496: Computer Vision PersonnelPersonnel – Instructor: Szymon Rusinkiewicz – TA:

Slides:



Advertisements
Similar presentations
Chapter 2: Marr’s theory of vision. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between.
Advertisements

Visual Sensation & Perception How do we see?. Structure of the eye.
September 2, 2014Computer Vision Lecture 1: Human Vision 1 Welcome to CS 675 – Computer Vision Fall 2014 Instructor: Marc Pomplun Instructor: Marc Pomplun.
Analysis of Contour Motions Ce Liu William T. Freeman Edward H. Adelson Computer Science and Artificial Intelligence Laboratory Massachusetts Institute.
The eye – curved cornea – lens – retina – fovea – optic disk Using Light.
Oleh Tretiak © Computer Vision Lecture 1: Introduction.
Contours and Optical Flow: Cues for Capturing Human Motion in Videos Thomas Brox Computer Vision and Pattern Recognition Group University of Bonn Research.
Computer and Robot Vision I
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
COS 429: Computer Vision Thanks to Chris Bregler.
Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 1 Vision in Humans and Machines September 10, 2009.
Cognitive Processes PSY 334 Chapter 2 – Perception June 30, 2003.
Berkeley Vision GroupNIPS Vancouver Learning to Detect Natural Image Boundaries Using Local Brightness,
Human Sensing: The eye and visual processing Physiology and Function Martin Jagersand.
Introduction to Computer Vision CS223B, Winter 2005.
The Human Visual System Vonikakis Vasilios, Antonios Gasteratos Democritus University of Thrace
Computer Vision Marc Pollefeys COMP 256 Administrivia Classes: Mon & Wed, 11-12:15, SN115 Instructor: Marc Pollefeys (919) Room.
What should be done at the Low Level?
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page
Introduction to Object Tracking Presented by Youyou Wang CS643 Texas A&M University.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Jochen Triesch, UC San Diego, 1 COGS Visual Modeling Jochen Triesch & Martin Sereno Dept. of Cognitive Science UC.
The visual system Lecture 1: Structure of the eye
Cognitive Processes PSY 334 Chapter 2 – Perception.
1B50 – Percepts and Concepts Daniel J Hulme. Outline Cognitive Vision –Why do we want computers to see? –Why can’t computers see? –Introducing percepts.
CMSC 426: Image Processing (Computer Vision) David Jacobs.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 35 – Review for midterm.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
CIS 489/689: Computer Vision Instructor: Christopher Rasmussen Course web page: vision.cis.udel.edu/cv February 12, 2003  Lecture 1.
THE VISUAL SYSTEM: EYE TO CORTEX Outline 1. The Eyes a. Structure b. Accommodation c. Binocular Disparity 2. The Retina a. Structure b. Completion c. Cone.
Computer Vision Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications –building representations.
CS 8690: Computer Vision Ye Duan. CS8690 Computer Vision University of Missouri at Columbia Instructor Ye Duan (209 Engr West)
The primate visual systemHelmuth Radrich, The primate visual system 1.Structure of the eye 2.Neural responses to light 3.Brightness perception.
Stylization and Abstraction of Photographs Doug Decarlo and Anthony Santella.
FRS 123: Technology in Art and Cultural Heritage Perception.
2/14/00 Computer Vision. 2/14/00 Computer Vision Lecturer: Ir. Resmana Lim, M.Eng. Text: 1) Computer Vision -- A Modern Approach.
Machine Learning for Computer graphics Aaron Hertzmann University of Toronto Bayesian.
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
Human vision Jitendra Malik U.C. Berkeley. Visual Areas.
September 3, 2013Computer Vision Lecture 1: Human Vision 1 Welcome to CS 675 – Computer Vision Fall 2013 Instructor: Marc Pomplun Instructor: Marc Pomplun.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Face Image-Based Gender Recognition Using Complex-Valued Neural Network Instructor :Dr. Dong-Chul Kim Indrani Gorripati.
Final Review Course web page: vision.cis.udel.edu/~cv May 21, 2003  Lecture 37.
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
Intelligent Vision Systems Image Geometry and Acquisition ENT 496 Ms. HEMA C.R. Lecture 2.
The Human Visual System Background on Vision Human vision – the best system around Deep network models.
Announcements Final is Thursday, March 18, 10:30-12:20 –MGH 287 Sample final out today.
Representation in Vision Derek Hoiem CS 598, Spring 2009 Jan 22, 2009.
Texture Analysis and Synthesis. Texture Texture: pattern that “looks the same” at all locationsTexture: pattern that “looks the same” at all locations.
COS 429: Computer Vision Thanks to Chris Bregler.
8/16/99 Computer Vision: Vision and Modeling. 8/16/99 Lucas-Kanade Extensions Support Maps / Layers: Robust Norm, Layered Motion, Background Subtraction,
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Course : T Computer Vision
- photometric aspects of image formation gray level images
Computer Vision, Robotics, Machine Learning and Control Lab
Section 2: The Visual System
Visual Perception Chapter 3.
Optical Flow Estimation and Segmentation of Moving Dynamic Textures
Chapter 4 Section 2 The Visual System
Welcome to CS 675 – Computer Vision Spring 2018
Early Processing in Biological Vision
Perceiving and Recognizing Objects
Computer Vision (CSE 490CV, EE400B)
Cognitive Processes PSY 334
?. ? White Fuzzy Color Oblong Texture Shape Most problems in vision are underconstrained White Color Most problems in vision are underconstrained.
Filtering Things to take away from this lecture An image as a function
Cognitive Processes PSY 334
Filtering An image as a function Digital vs. continuous images
Learning complex visual concepts
Presentation transcript:

CS 496: Computer Vision Thanks to Chris Bregler

CS 496: Computer Vision PersonnelPersonnel – Instructor: Szymon Rusinkiewicz – TA: Wagner Corrêa – to both Course web page web page

What is Computer Vision? Input: images or videoInput: images or video Output: description of the worldOutput: description of the world

What is Computer Vision? Input: images or videoInput: images or video Output: description of the worldOutput: description of the world – Many levels of description

Low-Level or “Early” Vision Considers local properties of an image “There’s an edge!”

Mid-Level Vision Grouping and segmentation “There’s an object and a background!”

High-Level Vision Recognition “It’s a chair!”

Big Question #1: Who Cares? Applications of computer visionApplications of computer vision – In AI: vision serves as the “input stage” – In medicine: understanding human vision – In engineering: model extraction

Vision and Other Fields Computer Vision Artificial Intelligence Cognitive Psychology Signal Processing Computer Graphics Pattern Analysis Metrology

Big Question #2: Does It Work? Situation much the same as AI:Situation much the same as AI: – Some fundamental algorithms – Large collection of hacks / heuristics Vision is hard!Vision is hard! – Especially at high level, physiology unknown – Requires integrating many different methods – Requires reasoning and understanding: “AI completeness”

Computer and Human Vision Emulating effects of human visionEmulating effects of human vision Understanding physiology of human visionUnderstanding physiology of human vision

Image Formation Human: lens forms image on retina, sensors (rods and cones) respond to lightHuman: lens forms image on retina, sensors (rods and cones) respond to light Computer: lens system forms image, sensors (CCD, CMOS) respond to lightComputer: lens system forms image, sensors (CCD, CMOS) respond to light

Low-Level Vision Hubel

Retinal ganglion cellsRetinal ganglion cells Lateral Geniculate Nucleus – function unknown (visual adaptation?)Lateral Geniculate Nucleus – function unknown (visual adaptation?) Primary Visual CortexPrimary Visual Cortex – Simple cells: orientational sensitivity – Complex cells: directional sensitivity Further processingFurther processing – Temporal cortex: what is the object? – Parietal cortex: where is the object? How do I get it?

Low-Level Vision Net effect: low-level human vision can be (partially) modeled as a set of multiresolution, oriented filtersNet effect: low-level human vision can be (partially) modeled as a set of multiresolution, oriented filters

Low-Level Depth Cues FocusFocus VergenceVergence StereoStereo Not as important as popularly believedNot as important as popularly believed

Low-Level Computer Vision Filters and filter banksFilters and filter banks – Implemented via convolution – Detection of edges, corners, and other local features – Can include multiple orientations – Can include multiple scales: “filter pyramids” ApplicationsApplications – First stage of segmentation – Texture recognition / classification – Texture synthesis

Texture Analysis / Synthesis MultiresolutionOriented Filter Bank Original Image ImagePyramid

Texture Analysis / Synthesis OriginalTexture SynthesizedTexture Heeger and Bergen

Low-Level Computer Vision Optical flowOptical flow – Detecting frame-to-frame motion – Local operator: looking for gradients ApplicationsApplications – First stage of tracking

Optical Flow Image #1 Optical Flow Field Image #2

Low-Level Computer Vision Shape from XShape from X – Stereo – Motion – Shading – Texture foreshortening

3D Reconstruction Tomasi+Kanade Debevec,Taylor,Malik Phigin et al. Forsyth et al.

Mid-Level Vision Physiology unclearPhysiology unclear Observations by Gestalt psychologistsObservations by Gestalt psychologists – Proximity – Similarity – Common fate – Common region – Parallelism – Closure – Symmetry – Continuity – Familiar configuration Wertheimer

Grouping Cues

Mid-Level Computer Vision TechniquesTechniques – Clustering based on similarity – Limited work on other principles ApplicationsApplications – Segmentation / grouping – Tracking

Snakes: Active Contours Contour Evolution for Segmenting an Artery

Birchfeld Histograms

Expectation Maximization (EM) Color Segmentation

Bayesian Methods Prior probabilityPrior probability – Expected distribution of models Conditional probability P(A|B)Conditional probability P(A|B) – Probability of observation A given model B

Bayesian Methods Prior probabilityPrior probability – Expected distribution of models Conditional probability P(A|B)Conditional probability P(A|B) – Probability of observation A given model B Bayes’s Rule P(B|A) = P(A|B)  P(B) / P(A)Bayes’s Rule P(B|A) = P(A|B)  P(B) / P(A) – Probability of model B given observation A Thomas Bayes (c )

Bayesian Methods # black pixels

High-Level Vision Human mechanisms: ???Human mechanisms: ???

High-Level Vision Computational mechanismsComputational mechanisms – Bayesian networks – Templates – Linear subspace methods – Kinematic models

Cootes et al. Template-Based Methods

Linear Subspaces

Data PCA New Basis Vectors Kirby et al. Principal Components Analysis (PCA)

Kinematic Models Optical Flow/Feature tracking: no constraints Layered Motion: rigid constraints Articulated: kinematic chain constraints Nonrigid: implicit / learned constraints

Real-world Applications Osuna et al:

Real-world Applications Osuna et al:

Course Outline Image formation and captureImage formation and capture Filtering and feature detectionFiltering and feature detection Optical flow and trackingOptical flow and tracking Projective geometryProjective geometry Shape from XShape from X Segmentation and clusteringSegmentation and clustering RecognitionRecognition Applications: 3D scanning; image-based renderingApplications: 3D scanning; image-based rendering

3D Scanning

Image-Based Modeling and Rendering Debevec et al. Manex

Course Mechanics 60%: 4 written / programming assignments60%: 4 written / programming assignments 30%: Final group project30%: Final group project 10%: In-class participation (includes attendance, project presentation, etc.)10%: In-class participation (includes attendance, project presentation, etc.)

Course Mechanics Book: Computer Vision – A Modern Approach David Forsyth and Jean PonceBook: Computer Vision – A Modern Approach David Forsyth and Jean Ponce PapersPapers All online – available from class webpageAll online – available from class webpage

CS 496: Computer Vision PersonnelPersonnel – Instructor: Szymon Rusinkiewicz – TA: Wagner Corrêa – to both Course web page web page