Presentation is loading. Please wait.

Presentation is loading. Please wait.

Lecture 01 Introduction to Computer Vision Course: T0283 - Computer Vision Year: 2013.

Similar presentations


Presentation on theme: "Lecture 01 Introduction to Computer Vision Course: T0283 - Computer Vision Year: 2013."— Presentation transcript:

1 Lecture 01 Introduction to Computer Vision Course: T0283 - Computer Vision Year: 2013

2 Learning Outcomes After carefully listening this lecture, students will be able to do the following:  Show what computer vision is all about and basic principles of various computer vision applications (LO1)  Explain approaches in computer vision systems (three level processing) (LO1)  Explain how a digital image is formed and other related matters such as light and visual perception (LO1) T0283 - Computer Vision2

3 Learning Outline What is Computer Vision? Image Processing Computer Vision Face Detection Text Detection & Recognition High Level Capability Approach in Computer Vision The Three Processing Level Image Formation Image Digitization Digital Image T0283 - Computer Vision3

4 What is Computer Vision? Ballard and Brown  The construction of explicit, meaningful description of physical objects from images. Forsyth and Ponce  Extracting descriptions of the world from pictures or sequences of pictures. T0283 - Computer Vision4

5 5 What is Computer Vision? A field that includes methods for acquiring, processing, analyzing, and understanding images and, in general, high-dimensional data from the real world in order to produce numerical or symbolic information, e.g., in the forms of decisions.

6 Application of Computer Vision Interaksi manusia dan robot (Human Robot Interaction) Pengontrolan proses industri Pendeteksi nomor plat kendaraan Surveillance (monitor penyusup, analisa trafik jalan tol dan lainnya) Robot Vision, Humanoid Robot and Soccer Robot. Modeling obyek atau lingkungan T0283 - Computer Vision6

7 Image Processing T0283 - Computer Vision7

8 Image Enhancement T0283 - Computer Vision8

9 Image Processing (cont’d) Image Restoration (e.g., correcting out-focus images) T0283 - Computer Vision9

10 Computer Graphics Geometric modeling T0283 - Computer Vision10

11 Computer Vision T0283 - Computer Vision11

12 An Industrial Computer Vision System T0283 - Computer Vision12

13 T0283 - Computer Vision13 Low-level image analysis: Identify edges, regions Mid-level: Distinguish “cap” from “no cap” Estimation: What are orientation of cap, height of liquid? Cap Inspection System

14 Face Detection How is this like the bottle problem on the previous slide? T0283 - Computer Vision14

15 Text Detection & Recognition Similar to face finding: Where is the text and what does it say? Viewing at an angle complicates things... T0283 - Computer Vision15 from J. Zhang et al.

16 Text Detection & Recognition (cont’d) Text Detection and Recognition Apps  T0283 - Computer Vision16 Google Goggles

17 High Level Capability Computer Vision System (CVS) is expected to have high level capabilities like Human Visual System (HVS) does such as: Object detection – is an object present in the scene? If so, where is its boundaries Recognition – putting a label on an object Description – assigning properties to objects 3D inference – interpreting a 3D object from 2D views Interpreting motion T0283 - Computer Vision17

18 Approach in Computer Vision T0283 - Computer Vision18 3-D World Objects Images Objects Detection & Recognitions Edges/Region/Depth Models and Assumptions Features/Surfaces

19 The Three Processing Level Low-level processing  Standard procedures are applied to improve image quality  Procedures are required to have no intelligent capabilities T0283 - Computer Vision19

20 The Three Processing Level (cont’d) Intermediate-level processing  Extract and characterize components in the image  Some intelligent capabilities are required T0283 - Computer Vision20

21 The Three Processing Level (cont’d) High-level processing  Recognition and interpretation  Procedures require high intelligent capabilities T0283 - Computer Vision21

22 How are images represented in the computer? T0283 - Computer Vision22

23 Image Formation There are two parts to the image formation process:  The geometry of image formation, which determines where in the image plane the projection of a point in the scene will be located.  The physics of light, which determines the brightness of a point in the image plane as a function of illumination and surface properties. T0283 - Computer Vision23

24 Image formation (cont’d) Geometric parameters  type of projections  position and orientation of camera in space  perspective distortions introduced by the imaging process Photometric parameters  type, intensity, and direction of illumination  reflectance properties of the viewed surfaces Optical parameters of the lens  lens type  focal length  field of view T0283 - Computer Vision24

25 A Simple Model of Image Formation The scene is illuminated by a single source The scene reflects radiation towards the camera The camera senses it via chemicals on film T0283 - Computer Vision25

26 What is Light? The visible portion of the electromagnetic (EM) spectrum It occurs between wavelengths of approximately 400 and 700 nanometers T0283 - Computer Vision26

27 CCD (Charged-Coupled Device) Cameras Tiny solid state cells convert light energy into electrical charge. The image plane acts as a digital memory that can be read row by row by a computer. T0283 - Computer Vision27

28 Frame Grabber Usually, a CCD camera plugs into a computer board (frame grabber). The frame grabber digitizes the signal and stores it in its memory (frame buffer). T0283 - Computer Vision28

29 Image Digitization Sampling means measuring the value of an image at a finite number of points. Quantization is the representation of the measured value at the sampled point by an integer. T0283 - Computer Vision29

30 Image Digitization (cont’d) T0283 - Computer Vision30

31 Image Quantization T0283 - Computer Vision31 256 gray levels (8 bpp) 32 gray levels (5 bpp) 16 gray levels (4 bpp) 8 gray levels (3 bpp) 4 gray levels (2 bpp) 2 gray levels (1 bpp)

32 Image Sampling T0283 - Computer Vision32 Original Image Sampled by a Factor of 2 Sampled by a Factor of 4 Sampled by a Factor of 8

33 T0283 - Computer Vision33 Digital Image An image is represented by a rectangular array of integers. An integer represents the brightness or darkness of the image at that point. N: # of rows, M: # of columns, Q: # of gray levels  N =, M =, Q = (q is the # of bits/pixel)  Storage requirements: NxMxQ (e.g., N=M=1024, q=8, 1MB)

34 Image File Formats Many image formats adhere to the simple model shown below (line by line, no breaks between lines). The header contains at least the width and height of the image. Most headers begin with a signature or “magic number” - a short sequence of bytes for identifying the file format. T0283 - Computer Vision34

35 Common Image File Formats GIF (Graphic Interchange Format) PNG (Portable Network Graphics) JPEG (Joint Photographic Experts Group) TIFF (Tagged Image File Format) PGM (Portable Gray Map) FITS (Flexible Image Transport System) T0283 - Computer Vision35

36 OpenCV Computer vision is the most important technology in the future in the development of intelligent robot. Computer vision is In the simplest terms, computer vision is the discipline of "teaching machines how to see." OpenCV (Open Source Computer Vision Library) and released under a BSD license and hence it’s free for both academic and commercial use. It has C++, C, Python and Java interfaces and supports Windows, Linux, Mac OS, iOS and Android. T0283 - Computer Vision36

37 Color and grayscale grayscale image = ( (0.3 * R) + (0.59 * G) + (0.11 * B) ) T0283 - Computer Vision37

38 Optical Flow Optical flow or optic flow is the pattern of apparent motion of objects, surfaces, and edges in a visual scene caused by the relative motion between an observer (an eye or a camera) and the scene. T0283 - Computer Vision38

39 Homework Install Visual Studio 2012 Express Edition and OpenCV 2.4.9 Create a program to display an image and play a video. Create a program to display video from a Webcam T0283 - Computer Vision39

40 DisplayImage.cpp // Menampilkan Image menggunakan cvLoadImage #include "stdafx.h" #include int _tmain(int argc, _TCHAR* argv[]) { IplImage *img = cvLoadImage("f:\handsome.jpg"); cvNamedWindow("OpenCV",1); cvShowImage("OpenCV",img); cvWaitKey(0); cvDestroyWindow("OpenCV "); cvReleaseImage(&img); return 0; } T0283 - Computer Vision40

41 C++ 2.x Style //DisplayImage2.cpp #include "stdafx.h" #include using namespace std; int _tmain(int argc, _TCHAR* argv[]) { cv::Mat img=cv::imread("f:/fruits.jpg"); cv::imshow("Belajar OpenCV",img);//tampilkan cv::waitKey(); return EXIT_SUCCESS; } T0283 - Computer Vision41

42 Camera CvCapture* capture; capture = cvCaptureFromCAM(0); if( capture ) { while( true ) { frame = cvQueryFrame( capture ); T0283 - Computer Vision42

43 Face Library String face_cascade_name = "lbpcascade_frontalface.xml"; String eyes_cascade_name = "haarcascade_eye_tree_eyeglasses.xml"; T0283 - Computer Vision43

44 Final Project ( 5-6 person in group) Create a program using OpenCV Presentation 10 minutes in session 13. Example projects : 1.Face Recognition using AAM 2.Car Plate Recognition 3.Stereo Vision 4.Object Detection and Tracking 5.Image Processing for specific purposes. T0283 - Computer Vision44

45 References Textbook Forsyth, D., Ponce, J. (2012). Computer Vision: A Modern Approach. 2nd ed. Prentice Hall. ISBN: 978-0-27-376414-4 Web Lecture 01 Introduction to Computer Vision https://www.youtube.com/watch?v=715uLCHt4jE T0283 - Computer Vision45


Download ppt "Lecture 01 Introduction to Computer Vision Course: T0283 - Computer Vision Year: 2013."

Similar presentations


Ads by Google