COS 429: Computer Vision Thanks to Chris Bregler.

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

Chapter 2: Marr’s theory of vision. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between.
Visual Sensation & Perception How do we see?. Structure of the eye.
September 2, 2014Computer Vision Lecture 1: Human Vision 1 Welcome to CS 675 – Computer Vision Fall 2014 Instructor: Marc Pomplun Instructor: Marc Pomplun.
Analysis of Contour Motions Ce Liu William T. Freeman Edward H. Adelson Computer Science and Artificial Intelligence Laboratory Massachusetts Institute.
Object Recognition & Model Based Tracking © Danica Kragic Tracking system.
Oleh Tretiak © Computer Vision Lecture 1: Introduction.
Contours and Optical Flow: Cues for Capturing Human Motion in Videos Thomas Brox Computer Vision and Pattern Recognition Group University of Bonn Research.
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
Introduction to Cognitive Science Lecture 2: Vision in Humans and Machines 1 Vision in Humans and Machines September 10, 2009.
Cognitive Processes PSY 334 Chapter 2 – Perception June 30, 2003.
Berkeley Vision GroupNIPS Vancouver Learning to Detect Natural Image Boundaries Using Local Brightness,
Human Sensing: The eye and visual processing Physiology and Function Martin Jagersand.
Introduction to Computer Vision CS223B, Winter 2005.
The Human Visual System Vonikakis Vasilios, Antonios Gasteratos Democritus University of Thrace
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Jochen Triesch, UC San Diego, 1 COGS Visual Modeling Jochen Triesch & Martin Sereno Dept. of Cognitive Science UC.
The visual system Lecture 1: Structure of the eye
Cognitive Processes PSY 334 Chapter 2 – Perception.
1B50 – Percepts and Concepts Daniel J Hulme. Outline Cognitive Vision –Why do we want computers to see? –Why can’t computers see? –Introducing percepts.
CMSC 426: Image Processing (Computer Vision) David Jacobs.
Sensation and Perception Part 1: Intro and Vision.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 35 – Review for midterm.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Visual Perception PhD Program in Information Technologies Description: Obtention of 3D Information. Study of the problem of triangulation, camera calibration.
CS 496: Computer Vision Thanks to Chris Bregler. CS 496: Computer Vision PersonnelPersonnel – Instructor: Szymon Rusinkiewicz – TA:
Computer Vision Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications –building representations.
The primate visual systemHelmuth Radrich, The primate visual system 1.Structure of the eye 2.Neural responses to light 3.Brightness perception.
FRS 123: Technology in Art and Cultural Heritage Perception.
Computer Vision Michael Isard and Dimitris Metaxas.
2/14/00 Computer Vision. 2/14/00 Computer Vision Lecturer: Ir. Resmana Lim, M.Eng. Text: 1) Computer Vision -- A Modern Approach.
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
Human vision Jitendra Malik U.C. Berkeley. Visual Areas.
Corner Detection & Color Segmentation CSE350/ Sep 03.
September 3, 2013Computer Vision Lecture 1: Human Vision 1 Welcome to CS 675 – Computer Vision Fall 2013 Instructor: Marc Pomplun Instructor: Marc Pomplun.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Face Image-Based Gender Recognition Using Complex-Valued Neural Network Instructor :Dr. Dong-Chul Kim Indrani Gorripati.
Final Review Course web page: vision.cis.udel.edu/~cv May 21, 2003  Lecture 37.
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
Intelligent Vision Systems Image Geometry and Acquisition ENT 496 Ms. HEMA C.R. Lecture 2.
The Human Visual System Background on Vision Human vision – the best system around Deep network models.
Announcements Final is Thursday, March 18, 10:30-12:20 –MGH 287 Sample final out today.
Texture Analysis and Synthesis. Texture Texture: pattern that “looks the same” at all locationsTexture: pattern that “looks the same” at all locations.
COS 429: Computer Vision Thanks to Chris Bregler.
8/16/99 Computer Vision: Vision and Modeling. 8/16/99 Lucas-Kanade Extensions Support Maps / Layers: Robust Norm, Layered Motion, Background Subtraction,
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Course : T Computer Vision
- photometric aspects of image formation gray level images
Computer Vision, Robotics, Machine Learning and Control Lab
Dr.safeyya Adeeb Alchalabi
Section 2: The Visual System
Visual Perception Chapter 3.
Optical Flow Estimation and Segmentation of Moving Dynamic Textures
Chapter 4 Section 2 The Visual System
Welcome to CS 675 – Computer Vision Spring 2018
Early Processing in Biological Vision
Perceiving and Recognizing Objects
Computer Vision (CSE 490CV, EE400B)
Cognitive Processes PSY 334
?. ? White Fuzzy Color Oblong Texture Shape Most problems in vision are underconstrained White Color Most problems in vision are underconstrained.
Filtering Things to take away from this lecture An image as a function
Analysis of Contour Motions
Cognitive Processes PSY 334
CMSC 426: Image Processing (Computer Vision)
Filtering An image as a function Digital vs. continuous images
Learning complex visual concepts
Presentation transcript:

COS 429: Computer Vision Thanks to Chris Bregler

COS 429: Computer Vision Instructor: Szymon Rusinkiewicz Szymon Rusinkiewicz TA: Nathaniel Dirksen Nathaniel Dirksen Course web page web page

What is Computer Vision? Input: images or videoInput: images or video Output: description of the worldOutput: description of the world But also: measuring, classifying, interpreting visual informationBut also: measuring, classifying, interpreting visual information

Low-Level or “Early” Vision Considers local properties of an image “There’s an edge!”

Mid-Level Vision Grouping and segmentation “There’s an object and a background!”

High-Level Vision Recognition “It’s a chair!”

Big Question #1: Who Cares? Applications of computer visionApplications of computer vision – In AI: vision serves as the “input stage” – In medicine: understanding human vision – In engineering: model extraction

Vision and Other Fields Computer Vision Artificial Intelligence Cognitive Psychology Signal Processing Computer Graphics Pattern Analysis Metrology

Big Question #2: Does It Work? Situation much the same as AI:Situation much the same as AI: – Some fundamental algorithms – Large collection of hacks / heuristics Vision is hard!Vision is hard! – Especially at high level, physiology unknown – Requires integrating many different methods – Requires reasoning and understanding: “AI completeness”

Computer and Human Vision Emulating effects of human visionEmulating effects of human vision Understanding physiology of human visionUnderstanding physiology of human vision Analogues of human vision at low, mid, and high levelsAnalogues of human vision at low, mid, and high levels

Image Formation Human: lens forms image on retina, sensors (rods and cones) respond to lightHuman: lens forms image on retina, sensors (rods and cones) respond to light Computer: lens system forms image, sensors (CCD, CMOS) respond to lightComputer: lens system forms image, sensors (CCD, CMOS) respond to light

Low-Level Vision Hubel

Retinal ganglion cellsRetinal ganglion cells Lateral Geniculate Nucleus – function unknown (visual adaptation?)Lateral Geniculate Nucleus – function unknown (visual adaptation?) Primary Visual CortexPrimary Visual Cortex – Simple cells: orientational sensitivity – Complex cells: directional sensitivity Further processingFurther processing – Temporal cortex: what is the object? – Parietal cortex: where is the object? How do I get it?

Low-Level Vision Net effect: low-level human vision can be (partially) modeled as a set of multiresolution, oriented filtersNet effect: low-level human vision can be (partially) modeled as a set of multiresolution, oriented filters

Low-Level Depth Cues FocusFocus VergenceVergence StereoStereo Not as important as popularly believedNot as important as popularly believed

Low-Level Computer Vision Filters and filter banksFilters and filter banks – Implemented via convolution – Detection of edges, corners, and other local features – Can include multiple orientations – Can include multiple scales: “filter pyramids” ApplicationsApplications – First stage of segmentation – Texture recognition / classification – Texture synthesis

Texture Analysis / Synthesis MultiresolutionOriented Filter Bank Original Image ImagePyramid

Texture Analysis / Synthesis OriginalTexture SynthesizedTexture Heeger and Bergen

Low-Level Computer Vision Optical flowOptical flow – Detecting frame-to-frame motion – Local operator: looking for gradients ApplicationsApplications – First stage of tracking

Optical Flow Image #1 Optical Flow Field Image #2

Low-Level Computer Vision Shape from XShape from X – Stereo – Motion – Shading – Texture foreshortening

3D Reconstruction Tomasi+Kanade Debevec,Taylor,Malik Phigin et al. Forsyth et al.

Mid-Level Vision Physiology unclearPhysiology unclear Observations by Gestalt psychologistsObservations by Gestalt psychologists – Proximity – Similarity – Common fate – Common region – Parallelism – Closure – Symmetry – Continuity – Familiar configuration Wertheimer

Grouping Cues

Mid-Level Computer Vision TechniquesTechniques – Clustering based on similarity – Limited work on other principles ApplicationsApplications – Segmentation / grouping – Tracking

Snakes: Active Contours Contour Evolution for Segmenting an Artery

Birchfeld Histograms

Expectation Maximization (EM) Color Segmentation

Bayesian Methods Prior probabilityPrior probability – Expected distribution of models Conditional probability P(A|B)Conditional probability P(A|B) – Probability of observation A given model B

Bayesian Methods Prior probabilityPrior probability – Expected distribution of models Conditional probability P(A|B)Conditional probability P(A|B) – Probability of observation A given model B Bayes’s Rule P(B|A) = P(A|B)  P(B) / P(A)Bayes’s Rule P(B|A) = P(A|B)  P(B) / P(A) – Probability of model B given observation A Thomas Bayes (c )

Bayesian Methods # black pixels

High-Level Vision Human mechanisms: ???Human mechanisms: ???

High-Level Vision Computational mechanismsComputational mechanisms – Bayesian networks – Templates – Linear subspace methods – Kinematic models

Cootes et al. Template-Based Methods

Linear Subspaces

Data PCA New Basis Vectors Kirby et al. Principal Components Analysis (PCA)

Kinematic Models Optical Flow/Feature tracking: no constraints Layered Motion: rigid constraints Articulated: kinematic chain constraints Nonrigid: implicit / learned constraints

Real-world Applications Osuna et al:

Real-world Applications Osuna et al:

Course Outline Image formation and captureImage formation and capture Filtering and feature detectionFiltering and feature detection Motion estimationMotion estimation Segmentation and clusteringSegmentation and clustering PCAPCA 3D projective geometry3D projective geometry Shape acquisitionShape acquisition Shape analysisShape analysis

3D Scanning

Image-Based Modeling and Rendering Debevec et al. Manex

Course Mechanics 65%: 4 written / programming assignments65%: 4 written / programming assignments 35%: Final group project35%: Final group project

Course Mechanics Book: Introductory Techniques for 3-D Computer Vision Emanuele Trucco and Alessandro VerriBook: Introductory Techniques for 3-D Computer Vision Emanuele Trucco and Alessandro Verri PapersPapers

COS 429: Computer Vision Instructor: Szymon Rusinkiewicz Szymon Rusinkiewicz TA: Nathaniel Dirksen Nathaniel Dirksen Course web page web page