Download presentation
Presentation is loading. Please wait.
Published byDominick Todd Modified over 8 years ago
1
Lecture 01 Introduction to Computer Vision Course: T0283 - Computer Vision Year: 2013
2
Learning Outcomes After carefully listening this lecture, students will be able to do the following: Show what computer vision is all about and basic principles of various computer vision applications (LO1) Explain approaches in computer vision systems (three level processing) (LO1) Explain how a digital image is formed and other related matters such as light and visual perception (LO1) T0283 - Computer Vision2
3
Learning Outline What is Computer Vision? Image Processing Computer Vision Face Detection Text Detection & Recognition High Level Capability Approach in Computer Vision The Three Processing Level Image Formation Image Digitization Digital Image T0283 - Computer Vision3
4
What is Computer Vision? Ballard and Brown The construction of explicit, meaningful description of physical objects from images. Forsyth and Ponce Extracting descriptions of the world from pictures or sequences of pictures. T0283 - Computer Vision4
5
5 What is Computer Vision? A field that includes methods for acquiring, processing, analyzing, and understanding images and, in general, high-dimensional data from the real world in order to produce numerical or symbolic information, e.g., in the forms of decisions.
6
Application of Computer Vision Interaksi manusia dan robot (Human Robot Interaction) Pengontrolan proses industri Pendeteksi nomor plat kendaraan Surveillance (monitor penyusup, analisa trafik jalan tol dan lainnya) Robot Vision, Humanoid Robot and Soccer Robot. Modeling obyek atau lingkungan T0283 - Computer Vision6
7
Image Processing T0283 - Computer Vision7
8
Image Enhancement T0283 - Computer Vision8
9
Image Processing (cont’d) Image Restoration (e.g., correcting out-focus images) T0283 - Computer Vision9
10
Computer Graphics Geometric modeling T0283 - Computer Vision10
11
Computer Vision T0283 - Computer Vision11
12
An Industrial Computer Vision System T0283 - Computer Vision12
13
T0283 - Computer Vision13 Low-level image analysis: Identify edges, regions Mid-level: Distinguish “cap” from “no cap” Estimation: What are orientation of cap, height of liquid? Cap Inspection System
14
Face Detection How is this like the bottle problem on the previous slide? T0283 - Computer Vision14
15
Text Detection & Recognition Similar to face finding: Where is the text and what does it say? Viewing at an angle complicates things... T0283 - Computer Vision15 from J. Zhang et al.
16
Text Detection & Recognition (cont’d) Text Detection and Recognition Apps T0283 - Computer Vision16 Google Goggles
17
High Level Capability Computer Vision System (CVS) is expected to have high level capabilities like Human Visual System (HVS) does such as: Object detection – is an object present in the scene? If so, where is its boundaries Recognition – putting a label on an object Description – assigning properties to objects 3D inference – interpreting a 3D object from 2D views Interpreting motion T0283 - Computer Vision17
18
Approach in Computer Vision T0283 - Computer Vision18 3-D World Objects Images Objects Detection & Recognitions Edges/Region/Depth Models and Assumptions Features/Surfaces
19
The Three Processing Level Low-level processing Standard procedures are applied to improve image quality Procedures are required to have no intelligent capabilities T0283 - Computer Vision19
20
The Three Processing Level (cont’d) Intermediate-level processing Extract and characterize components in the image Some intelligent capabilities are required T0283 - Computer Vision20
21
The Three Processing Level (cont’d) High-level processing Recognition and interpretation Procedures require high intelligent capabilities T0283 - Computer Vision21
22
How are images represented in the computer? T0283 - Computer Vision22
23
Image Formation There are two parts to the image formation process: The geometry of image formation, which determines where in the image plane the projection of a point in the scene will be located. The physics of light, which determines the brightness of a point in the image plane as a function of illumination and surface properties. T0283 - Computer Vision23
24
Image formation (cont’d) Geometric parameters type of projections position and orientation of camera in space perspective distortions introduced by the imaging process Photometric parameters type, intensity, and direction of illumination reflectance properties of the viewed surfaces Optical parameters of the lens lens type focal length field of view T0283 - Computer Vision24
25
A Simple Model of Image Formation The scene is illuminated by a single source The scene reflects radiation towards the camera The camera senses it via chemicals on film T0283 - Computer Vision25
26
What is Light? The visible portion of the electromagnetic (EM) spectrum It occurs between wavelengths of approximately 400 and 700 nanometers T0283 - Computer Vision26
27
CCD (Charged-Coupled Device) Cameras Tiny solid state cells convert light energy into electrical charge. The image plane acts as a digital memory that can be read row by row by a computer. T0283 - Computer Vision27
28
Frame Grabber Usually, a CCD camera plugs into a computer board (frame grabber). The frame grabber digitizes the signal and stores it in its memory (frame buffer). T0283 - Computer Vision28
29
Image Digitization Sampling means measuring the value of an image at a finite number of points. Quantization is the representation of the measured value at the sampled point by an integer. T0283 - Computer Vision29
30
Image Digitization (cont’d) T0283 - Computer Vision30
31
Image Quantization T0283 - Computer Vision31 256 gray levels (8 bpp) 32 gray levels (5 bpp) 16 gray levels (4 bpp) 8 gray levels (3 bpp) 4 gray levels (2 bpp) 2 gray levels (1 bpp)
32
Image Sampling T0283 - Computer Vision32 Original Image Sampled by a Factor of 2 Sampled by a Factor of 4 Sampled by a Factor of 8
33
T0283 - Computer Vision33 Digital Image An image is represented by a rectangular array of integers. An integer represents the brightness or darkness of the image at that point. N: # of rows, M: # of columns, Q: # of gray levels N =, M =, Q = (q is the # of bits/pixel) Storage requirements: NxMxQ (e.g., N=M=1024, q=8, 1MB)
34
Image File Formats Many image formats adhere to the simple model shown below (line by line, no breaks between lines). The header contains at least the width and height of the image. Most headers begin with a signature or “magic number” - a short sequence of bytes for identifying the file format. T0283 - Computer Vision34
35
Common Image File Formats GIF (Graphic Interchange Format) PNG (Portable Network Graphics) JPEG (Joint Photographic Experts Group) TIFF (Tagged Image File Format) PGM (Portable Gray Map) FITS (Flexible Image Transport System) T0283 - Computer Vision35
36
OpenCV Computer vision is the most important technology in the future in the development of intelligent robot. Computer vision is In the simplest terms, computer vision is the discipline of "teaching machines how to see." OpenCV (Open Source Computer Vision Library) and released under a BSD license and hence it’s free for both academic and commercial use. It has C++, C, Python and Java interfaces and supports Windows, Linux, Mac OS, iOS and Android. T0283 - Computer Vision36
37
Color and grayscale grayscale image = ( (0.3 * R) + (0.59 * G) + (0.11 * B) ) T0283 - Computer Vision37
38
Optical Flow Optical flow or optic flow is the pattern of apparent motion of objects, surfaces, and edges in a visual scene caused by the relative motion between an observer (an eye or a camera) and the scene. T0283 - Computer Vision38
39
Homework Install Visual Studio 2012 Express Edition and OpenCV 2.4.9 Create a program to display an image and play a video. Create a program to display video from a Webcam T0283 - Computer Vision39
40
DisplayImage.cpp // Menampilkan Image menggunakan cvLoadImage #include "stdafx.h" #include int _tmain(int argc, _TCHAR* argv[]) { IplImage *img = cvLoadImage("f:\handsome.jpg"); cvNamedWindow("OpenCV",1); cvShowImage("OpenCV",img); cvWaitKey(0); cvDestroyWindow("OpenCV "); cvReleaseImage(&img); return 0; } T0283 - Computer Vision40
41
C++ 2.x Style //DisplayImage2.cpp #include "stdafx.h" #include using namespace std; int _tmain(int argc, _TCHAR* argv[]) { cv::Mat img=cv::imread("f:/fruits.jpg"); cv::imshow("Belajar OpenCV",img);//tampilkan cv::waitKey(); return EXIT_SUCCESS; } T0283 - Computer Vision41
42
Camera CvCapture* capture; capture = cvCaptureFromCAM(0); if( capture ) { while( true ) { frame = cvQueryFrame( capture ); T0283 - Computer Vision42
43
Face Library String face_cascade_name = "lbpcascade_frontalface.xml"; String eyes_cascade_name = "haarcascade_eye_tree_eyeglasses.xml"; T0283 - Computer Vision43
44
Final Project ( 5-6 person in group) Create a program using OpenCV Presentation 10 minutes in session 13. Example projects : 1.Face Recognition using AAM 2.Car Plate Recognition 3.Stereo Vision 4.Object Detection and Tracking 5.Image Processing for specific purposes. T0283 - Computer Vision44
45
References Textbook Forsyth, D., Ponce, J. (2012). Computer Vision: A Modern Approach. 2nd ed. Prentice Hall. ISBN: 978-0-27-376414-4 Web Lecture 01 Introduction to Computer Vision https://www.youtube.com/watch?v=715uLCHt4jE T0283 - Computer Vision45
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.