Download presentation
Presentation is loading. Please wait.
1
Instructor: Mircea Nicolescu
Computer Vision Instructor: Mircea Nicolescu Good afternoon and thank you everyone for coming. My talk today will describe the research I performed at the IRIS at USC, the object of this work being to build a computational framework that addresses the problem of motion analysis and interpretation.
2
Nomenclature Somewhat interchangeable names, with somewhat different implications: Computer Vision Most general term Computational Vision Includes modeling of biological vision Image Understanding Automated scene analysis (e.g., satellite images, robot navigation) Machine Vision Industrial, factory-floor systems for inspection, measurements, part placement, etc.
3
Another View Interesting Computer vision Actually works
Computational vision Machine vision Actually works
4
Related Fields Largely built upon Related areas Image Processing
Statistical Pattern Recognition Artificial Intelligence Related areas Robotics Biological vision Medical imaging Computer graphics Human-computer interaction
5
Vision Processing
6
Low Level Vision
7
Low Level Vision Feature extraction
8
Low Level Vision Region segmentation
9
Mid Level Vision
10
Mid Level Vision 3D Reconstruction
11
Mid Level Vision Motion
12
Mid Level Vision A moving camera can be used instead of stereo
13
High Level Vision
14
Progress in Computer Vision
First generation: Military/Early Research Few systems, each custom-built, cost $Ms “Users” have PhDs 1 hour per frame Second generation: Industrial/Medical Numerous systems, 1000s of each, cost $10Ks Users have college degree Special hardware Third generation: Consumer 100000(00) systems, cost $100s Users have little or no training Emphasis on software
15
Visual Inspection
16
Character Recognition
17
Biometrics
18
Indexing into Databases
Shape content
19
Indexing into Databases
Color, texture
20
Target Recognition Department of Defense (Army, Air Force, Navy)
21
Interpretation of Aerial Photography
22
Autonomous Vehicles Land, Underwater, Space
23
Traffic Monitoring
24
Inserting Artificial Objects into a Scene
25
Inserting Images onto Ground Plane
26
Face Detection
27
Face Recognition
28
Human Activity Recognition
29
Human Activity Recognition
Overview: The course examines fundamental concepts in computer vision, while highlighting application areas where vision techniques are providing solutions. It is suited for students who are interested in doing research in computer vision, or who are involved in computer graphics, robotics, artificial intelligence or image processing. Topics: Color Vision Multiple View Geometry Camera Calibration 3D Reconstruction Object Recognition Optical Flow and Motion Analysis Perceptual Grouping Background Modeling Detection and Tracking Robotic Vision Gesture and Facial Expression Recognition Activity Recognition
30
Cameras First photograph due to Niepce (1816)
First on record - shown below (1822)
31
Main Concepts Factors that define image formation: Optical Parameters
Focal length Angular aperture Photometric Parameters Type, intensity and direction of illumination Reflectance properties of viewed surfaces Sensor structure Geometric Parameters Type of projection Position and orientation of the camera in space Distortions
32
Evolution Photographic camera: Animal eye: a (really) long time ago.
Niepce, 1816. Animal eye: a (really) long time ago. Pinhole perspective projection: Brunelleschi, XVth Century. Camera obscura: XVIth Century.
33
Pinhole Cameras Abstract camera model - box with a small hole in it
Pinhole cameras work in practice The point to make here is that each point on the image plane sees light from only one direction, the one that passes through the pinhole.
34
Distant Objects Are Smaller
35
Parallel Lines Meet Common to draw film plane
in front of the focal point. Moving the film plane merely scales the image.
36
Vanishing Points Each set of parallel lines (direction) meets at a different point The vanishing point for this direction Sets of parallel lines on the same plane lead to collinear vanishing points. The line is called the horizon for that plane Good ways to spot fake images scale and perspective don’t work vanishing points behave badly supermarket tabloids are a great source
37
Is This Correct? Is this a perspective image of four identical buildings? why not?
38
The Perspective Projection Equation
x’ = f’ x/z y’ = f’ y/z
39
Homogenous Coordinates
Add an extra coordinate and use an equivalence relation for 2D equivalence relation k*(X,Y,Z) is the same as (X,Y,Z) for 3D equivalence relation k*(X,Y,Z,T) is the same as (X,Y,Z,T) Basic notion Possible to represent points “at infinity” Where parallel lines intersect Where parallel planes intersect Possible to write the action of a perspective camera as a matrix
40
The Camera Matrix Turn previous expression into HC’s
HC’s for 3D point are (X,Y,Z,T) HC’s for point in image are (U,V,W) Image coordinates are x’ and y’: x’ = U / W = f’ X / Z y’ = V / W = f’ Y / Z
41
Other Projection Models
Weak Perspective Assumes that the relative distance along the Z axis between any scene points is much smaller than the average distance z0 to the camera (objects far away) is the magnification.
42
Other Projection Models
Orthographic Projection Quite unrealistic
43
Pinhole Cameras - Problems
Why are pinhole cameras not really used? Pinhole too big - many directions are averaged, blurring the image Pinhole too small - diffraction effects blur the image Generally, pinhole cameras are dark, because a very small set of rays from a particular point hits the screen.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.