Download presentation
Presentation is loading. Please wait.
Published byOctavia Martin Modified over 9 years ago
1
CS 496: Computer Vision Thanks to Chris Bregler
2
CS 496: Computer Vision PersonnelPersonnel – Instructor: Szymon Rusinkiewicz smr@cs.princeton.edu – TA: Wagner Corrêa wtcorrea@cs.princeton.edu – Email to both cs496@princeton.edu Course web page http://www.cs.princeton.edu/courses/cs496/Course web page http://www.cs.princeton.edu/courses/cs496/
3
What is Computer Vision? Input: images or videoInput: images or video Output: description of the worldOutput: description of the world
4
What is Computer Vision? Input: images or videoInput: images or video Output: description of the worldOutput: description of the world – Many levels of description
5
Low-Level or “Early” Vision Considers local properties of an image “There’s an edge!”
6
Mid-Level Vision Grouping and segmentation “There’s an object and a background!”
7
High-Level Vision Recognition “It’s a chair!”
8
Big Question #1: Who Cares? Applications of computer visionApplications of computer vision – In AI: vision serves as the “input stage” – In medicine: understanding human vision – In engineering: model extraction
9
Vision and Other Fields Computer Vision Artificial Intelligence Cognitive Psychology Signal Processing Computer Graphics Pattern Analysis Metrology
10
Big Question #2: Does It Work? Situation much the same as AI:Situation much the same as AI: – Some fundamental algorithms – Large collection of hacks / heuristics Vision is hard!Vision is hard! – Especially at high level, physiology unknown – Requires integrating many different methods – Requires reasoning and understanding: “AI completeness”
11
Computer and Human Vision Emulating effects of human visionEmulating effects of human vision Understanding physiology of human visionUnderstanding physiology of human vision
12
Image Formation Human: lens forms image on retina, sensors (rods and cones) respond to lightHuman: lens forms image on retina, sensors (rods and cones) respond to light Computer: lens system forms image, sensors (CCD, CMOS) respond to lightComputer: lens system forms image, sensors (CCD, CMOS) respond to light
13
Low-Level Vision Hubel
14
Retinal ganglion cellsRetinal ganglion cells Lateral Geniculate Nucleus – function unknown (visual adaptation?)Lateral Geniculate Nucleus – function unknown (visual adaptation?) Primary Visual CortexPrimary Visual Cortex – Simple cells: orientational sensitivity – Complex cells: directional sensitivity Further processingFurther processing – Temporal cortex: what is the object? – Parietal cortex: where is the object? How do I get it?
15
Low-Level Vision Net effect: low-level human vision can be (partially) modeled as a set of multiresolution, oriented filtersNet effect: low-level human vision can be (partially) modeled as a set of multiresolution, oriented filters
16
Low-Level Depth Cues FocusFocus VergenceVergence StereoStereo Not as important as popularly believedNot as important as popularly believed
17
Low-Level Computer Vision Filters and filter banksFilters and filter banks – Implemented via convolution – Detection of edges, corners, and other local features – Can include multiple orientations – Can include multiple scales: “filter pyramids” ApplicationsApplications – First stage of segmentation – Texture recognition / classification – Texture synthesis
18
Texture Analysis / Synthesis MultiresolutionOriented Filter Bank Original Image ImagePyramid
19
Texture Analysis / Synthesis OriginalTexture SynthesizedTexture Heeger and Bergen
20
Low-Level Computer Vision Optical flowOptical flow – Detecting frame-to-frame motion – Local operator: looking for gradients ApplicationsApplications – First stage of tracking
21
Optical Flow Image #1 Optical Flow Field Image #2
22
Low-Level Computer Vision Shape from XShape from X – Stereo – Motion – Shading – Texture foreshortening
23
3D Reconstruction Tomasi+Kanade Debevec,Taylor,Malik Phigin et al. Forsyth et al.
24
Mid-Level Vision Physiology unclearPhysiology unclear Observations by Gestalt psychologistsObservations by Gestalt psychologists – Proximity – Similarity – Common fate – Common region – Parallelism – Closure – Symmetry – Continuity – Familiar configuration Wertheimer
25
Grouping Cues
29
Mid-Level Computer Vision TechniquesTechniques – Clustering based on similarity – Limited work on other principles ApplicationsApplications – Segmentation / grouping – Tracking
30
Snakes: Active Contours Contour Evolution for Segmenting an Artery
31
Birchfeld Histograms
32
Expectation Maximization (EM) Color Segmentation
33
Bayesian Methods Prior probabilityPrior probability – Expected distribution of models Conditional probability P(A|B)Conditional probability P(A|B) – Probability of observation A given model B
34
Bayesian Methods Prior probabilityPrior probability – Expected distribution of models Conditional probability P(A|B)Conditional probability P(A|B) – Probability of observation A given model B Bayes’s Rule P(B|A) = P(A|B) P(B) / P(A)Bayes’s Rule P(B|A) = P(A|B) P(B) / P(A) – Probability of model B given observation A Thomas Bayes (c. 1702-1761)
35
Bayesian Methods # black pixels
36
High-Level Vision Human mechanisms: ???Human mechanisms: ???
37
High-Level Vision Computational mechanismsComputational mechanisms – Bayesian networks – Templates – Linear subspace methods – Kinematic models
38
Cootes et al. Template-Based Methods
39
Linear Subspaces
40
Data PCA New Basis Vectors Kirby et al. Principal Components Analysis (PCA)
41
Kinematic Models Optical Flow/Feature tracking: no constraints Layered Motion: rigid constraints Articulated: kinematic chain constraints Nonrigid: implicit / learned constraints
42
Real-world Applications Osuna et al:
43
Real-world Applications Osuna et al:
44
Course Outline Image formation and captureImage formation and capture Filtering and feature detectionFiltering and feature detection Optical flow and trackingOptical flow and tracking Projective geometryProjective geometry Shape from XShape from X Segmentation and clusteringSegmentation and clustering RecognitionRecognition Applications: 3D scanning; image-based renderingApplications: 3D scanning; image-based rendering
45
3D Scanning
46
Image-Based Modeling and Rendering Debevec et al. Manex
47
Course Mechanics 60%: 4 written / programming assignments60%: 4 written / programming assignments 30%: Final group project30%: Final group project 10%: In-class participation (includes attendance, project presentation, etc.)10%: In-class participation (includes attendance, project presentation, etc.)
48
Course Mechanics Book: Computer Vision – A Modern Approach David Forsyth and Jean PonceBook: Computer Vision – A Modern Approach David Forsyth and Jean Ponce PapersPapers All online – available from class webpageAll online – available from class webpage
49
CS 496: Computer Vision PersonnelPersonnel – Instructor: Szymon Rusinkiewicz smr@cs.princeton.edu – TA: Wagner Corrêa wtcorrea@cs.princeton.edu – Email to both cs496@princeton.edu Course web page http://www.cs.princeton.edu/courses/cs496/Course web page http://www.cs.princeton.edu/courses/cs496/
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.