Student: Wanli Ouyang (歐陽萬里) Supervisor: Prof. W.K. Cham

Slides:



Advertisements
Similar presentations
Wavelets Fast Multiresolution Image Querying Jacobs et.al. SIGGRAPH95.
Advertisements

Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Detecting Faces in Images: A Survey
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Presenter: Hoang, Van Dung
M.S. Student, Hee-Jong Hong
Adviser : Ming-Yuan Shieh Student ID : M Student : Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
Sreya Chakraborty Under the guidance of Dr. K. R. Rao Multimedia Processing Lab (MPL) University of Texas at Arlington.
A Robust Pedestrian Detection Approach Based on Shapelet Feature and Haar Detector Ensembles Wentao Yao, Zhidong Deng TSINGHUA SCIENCE AND TECHNOLOGY ISSNl.
The Viola/Jones Face Detector Prepared with figures taken from “Robust real-time object detection” CRL 2001/01, February 2001.
The Viola/Jones Face Detector (2001)
Lecture 5 Template matching
HCI Final Project Robust Real Time Face Detection Paul Viola, Michael Jones, Robust Real-Time Face Detetion, International Journal of Computer Vision,
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
Pattern Recognition Topic 1: Principle Component Analysis Shapiro chap
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
Face Recognition with Harr Transforms and SVMs EE645 Final Project May 11, 2005 J Stautzenberger.
CS 223B Assignment 1 Help Session Dan Maynes-Aminzade.
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Viewpoint Tracking for 3D Display Systems A look at the system proposed by Yusuf Bediz, Gözde Bozdağı Akar.
Oral Defense by Sunny Tang 15 Aug 2003
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #22.
Manhattan-world Stereo Y. Furukawa, B. Curless, S. M. Seitz, and R. Szeliski 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.
Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.
FACE DETECTION AND RECOGNITION By: Paranjith Singh Lohiya Ravi Babu Lavu.
Face Detection using the Viola-Jones Method
Joint Histogram Based Cost Aggregation For Stereo Matching Dongbo Min, Member, IEEE, Jiangbo Lu, Member, IEEE, Minh N. Do, Senior Member, IEEE IEEE TRANSACTION.
A Tutorial on Object Detection Using OpenCV
EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.
CPSC 601 Lecture Week 5 Hand Geometry. Outline: 1.Hand Geometry as Biometrics 2.Methods Used for Recognition 3.Illustrations and Examples 4.Some Useful.
An Example of Course Project Face Identification.
Dense correspondences across scenes and scales Tal Hassner The Open University of Israel CVPR’14 Tutorial on Dense Image Correspondences for Computer Vision.
A Non-local Cost Aggregation Method for Stereo Matching
Face detection Slides adapted Grauman & Liebe’s tutorial
Object Detection with Discriminatively Trained Part Based Models
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
Fast Census Transform-based Stereo Algorithm using SSE2
The Viola/Jones Face Detector A “paradigmatic” method for real-time object detection Training is slow, but detection is very fast Key ideas Integral images.
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
Department of Computer Science,
Robust Nighttime Vehicle Detection by Tracking and Grouping Headlights Qi Zou, Haibin Ling, Siwei Luo, Yaping Huang, and Mei Tian.
Face Detection Using Neural Network By Kamaljeet Verma ( ) Akshay Ukey ( )
Frank Bergschneider February 21, 2014 Presented to National Instruments.
Hand Detection with a Cascade of Boosted Classifiers Using Haar-like Features Qing Chen Discover Lab, SITE, University of Ottawa May 2, 2006.
Recognizing specific objects Matching with SIFT Original suggestion Lowe, 1999,2004.
WLD: A Robust Local Image Descriptor Jie Chen, Shiguang Shan, Chu He, Guoying Zhao, Matti Pietikäinen, Xilin Chen, Wen Gao 报告人:蒲薇榄.
Face Detection 蔡宇軒.
Video object segmentation and its salient motion detection using adaptive background generation Kim, T.K.; Im, J.H.; Paik, J.K.;  Electronics Letters 
CS262: Computer Vision Lect 06: Face Detection
2. Skin - color filtering.
License Plate Detection
Deeply learned face representations are sparse, selective, and robust
An Image Database Retrieval Scheme Based Upon Multivariate Analysis and Data Mining Presented by C.C. Chang Dept. of Computer Science and Information.
University of Ioannina
Lit part of blue dress and shadowed part of white dress are the same color
Seunghui Cha1, Wookhyun Kim1
Efficient Deformable Template Matching for Face Tracking
Fast Algorithms for Walsh Hadamard Transform on Sliding Windows
Implementation on video object segmentation algorithm
A (Zernike) moment-based nonlocal-means algorithm for image denoising
A Tutorial on Object Detection Using OpenCV
AHED Automatic Human Emotion Detection
Lecture 29: Face Detection Revisited
ECE738 Final Project Face Detection Baseline
Presentation transcript:

Computing Sum of Pixels in a Rectangle and Haar-Like Features by Strip Sum Student: Wanli Ouyang (歐陽萬里) Supervisor: Prof. W.K. Cham The Chinese University of Hong Kong 2018/11/15

Outline Introduction The proposed algorithm Experimental result Conclusion 2018/11/15

Outline Introduction The proposed algorithm Experimental result Conclusion 2018/11/15

Introduction Haar-like features and rectangle sum Application 2018/11/15

Haar-like features and rectangle sum Simple. Any size, any position, non-orthogonal, over-complete. Fast algorithm. Sum of pixels in a rectangle: rectangle sum +1 -1 W=H=4 Extend 24×24: 160,000 features 2018/11/15 P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” CVPR pp. I: 511-I: 518, 2001.

Extended Haar-like features P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” CVPR pp. I: 511-I: 518, 2001. R. Lienhart, J. Maydt, “An extended set of Haar-like features for rapid object detection,” ICIP Vol. 1, pp.I-900 - I-903, Sept. 2002. Feng Tang, R. Crabb, Hai Tao, “Representing Images Using Nonorthogonal Haar-Like Bases,” IEEE PAMI. Intell. Vol. 29(12), pp. 2120 – 2134, Dec. 2007. 2018/11/15

Introduction Application Haar-like features and rectangle sum 2018/11/15

Haar-like features -- application Face recognition Template matching F Tang, H Tao, Fast linear discriminant analysis using binary bases Volume 28, Issue 16, 1 December 2007, Pages 2209-2218 2018/11/15 Feng Tang, R. Crabb, Hai Tao, “Representing Images Using Nonorthogonal Haar-Like Bases,” IEEE PAMI. Intell. Vol. 29(12), pp. 2120 – 2134, Dec. 2007.

Haar-like features -- application Face detection From Project of Prof. Ngan and Liu Qiang P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” CVPR pp. I: 511-I: 518, 2001. 2018/11/15

Haar-like features -- application Feature matching H. Bay, T. Tuytelaars, and L. Van Gool, “Surf: Speeded up robust features,” In The 9th ECCV, 2006 2018/11/15 From Cui Chunhui’s seminar

Pattern matching • • • • • • • • • • • • • • • • • • • • • • N N Image • • • • • • • • • • • • • • • • • • • • • • window N N pattern Pattern matching seeks a given pattern within an image. For each pixel, the distance between window and pattern is calculated: The smaller the distance is, the more similar the window is to the pattern. In practice: ==> match! 2018/11/15 Window matches the pattern or not Application

Pattern Matching application Template matching is useful in signal processing, computer vision, image and video processing For example, image based rendering, image compression, object detection, video compression, tracking, denoising, super resolution, texture synthesis, block matching in motion estimation, road/path tracking … 2018/11/15

pattern matching WHT has been successfully used for full search equivalent pattern matching. We apply orthogonal Haar transform for full search equivalent pattern matching. Y. Hel-Or and H. Hel-Or, “Real time pattern matching using projection kernels,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 9, pp. 1430-1445, Sept. 2005. 2018/11/15

Outline The proposed algorithm Introduction Experimental result Conclusion 2018/11/15

Outline The proposed algorithm Introduction The proposed algorithm Definition of rectangle sum and integral image Strip sum for computing rectangle sum Orthogonal Haar transform Experimental result Conclusion 2018/11/15

Rectangle sum - definition A rectangle in the image X(j1, j2) is specified by: rect = (j1, j2, N1, N2, θ), (j1, j2): upper left position (N1, N2): size of the rectangle. θ=0°: upright rectangle θ=45°: 45° rotated rectangle. Related to dc component N1N 2-1 additions 3 additions 2018/11/15

Integral image 1: A 2: A+B 3: A+C 4: A+B+C+D A-D: rectangle sum The integral image I(j1, j2) is the sum of pixels above and to the left of (j1, j2) 1: A 2: A+B 3: A+C 4: A+B+C+D A-D: rectangle sum 2018/11/15 F.C. Crow. Summed-Area Tables for Texture Mapping. in Proc.11th Ann. Conf. Computer Graphics and Interactive Techniques, pp. 207-212, 1984. P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” CVPR pp. I: 511-I: 518, 2001.

Integral image method 1: A 2: A+B 3: A+C 4: A+B+C+D D = 4 + 1 − 3 − 2 The integral image I(j1, j2) is the sum of pixels above and to the left of (j1, j2) 1: A 2: A+B 3: A+C 4: A+B+C+D (0, 0) D = 4 + 1 − 3 − 2 = 4 − 2 − (3 − 1) A-D: rectangle sum 3 additions 4 1 3 2 2018/11/15 F.C. Crow. Summed-Area Tables for Texture Mapping. in Proc.11th Ann. Conf. Computer Graphics and Interactive Techniques, pp. 207-212, 1984. P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” CVPR pp. I: 511-I: 518, 2001.

Outline The proposed algorithm Strip sum for computing rectangle sum Introduction The proposed algorithm Definition of rectangle sum and integral image Strip sum for computing rectangle sum Orthogonal Haar transform Experimental result Conclusion 2018/11/15

The proposed strip sum D = 4 + 1 − 2 − 3 = 4 − 2 − (3 − 1) (j1, j2) = 4 − 2 − (3 − 1) (j1, j2) HStrip= (j1, j2, N2) N2 RectSum(j1, j2, N1, N2, 0°) = [I(j1+ N1, j2+N2) – I(j1+ N1, j2)] – [I(j1, j2+N2) – I(j1, j2)] = HStripSum(j1+N1, j2, N2) – HStripSum(j1, j2, N2) 2018/11/15 P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” CVPR pp. I: 511-I: 518, 2001.

Constructing the strip sum HStripSum(j1, j2, N2) = I(j1, j2+N2) – I(j1, j2) 1 addition Compute 2 RectSum: 2 adds for strip sum; 6 adds for integral image. 2018/11/15

Other strips 2018/11/15

Computational analysis r=30 r rectangle sums of different sizes Box: 4r Integral image: 3r+2 Strip sum: 2 + Min{ Num(width), Num(height)} + r These r rectangles have Num(width) different widths and Num(height) different heights. Toy case Size 1×1 to N×N (1×1, 1×2, 2×1, 2×2, 1×3, 3×1, 2×3, … , N×N) r = N2, Num(width)=Num(height)=N. Box: 4N2 Integral image: 3N2 + 2 Strip sum: 2 + N + N2 D = 4 + 1 − 2 − 3 int. Strip Haar-like features 2018/11/15 P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” CVPR pp. I: 511-I: 518, 2001. M. J. McDonnell. Box-filtering techniques. Comput. Graph. Image Process., 17: 65–70, 1981.

Haar-like features P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” CVPR pp. I: 511-I: 518, 2001. R. Lienhart, J. Maydt, “An extended set of Haar-like features for rapid object detection,” ICIP Vol. 1, pp.I-900 - I-903, Sept. 2002. Feng Tang, R. Crabb, Hai Tao, “Representing Images Using Nonorthogonal Haar-Like Bases,” IEEE PAMI. Intell. Vol. 29(12), pp. 2120 – 2134, Dec. 2007. 2018/11/15

Computational analysis Feature(1a)=Rectsum1 – Rectsum2 2018/11/15

Experiment – Face detection OpenCV uses integral image method. In our implementation, we just replace the part that is used for computing the Haar like features. CMU frontal face test set. All of the 130 images are used for face detection. Methods Integral Image Strip Sum Time 23.6 Secs 20.2 Secs 86% execution time H. Rowley, S. Baluja, and T. Kanade. Neural network-based face detection. IEEE Patt. Anal. Mach. Intell., Vol. 20, pp. 22-38, 1998. 2018/11/15 R. Lienhart, J. Maydt, “An extended set of Haar-like features for rapid object detection,” ICIP Vol. 1, pp.I-900 - I-903, Sept. 2002. http://sourceforge.net/projects/opencvlibrary

Outline The proposed algorithm Orthogonal Haar transform Introduction The proposed algorithm Definition of rectangle sum and integral image Strip sum for computing rectangle sum Orthogonal Haar transform Experimental result Conclusion 2018/11/15

The proposed orthogonal Haar transform OHT on sliding windows. The same Haar feature has been considered as different basis for different window positions! Basis 2 and 3 are the same Haar feature on different window positions. 2 3 2 3 2018/11/15

The proposed orthogonal Haar transform Basis 2 and 3 are the same Haar feature on different window positions. Similarly for 4-7. Similarly for basis 8-15. Compute u basis: u=2: 2 Haar features. u=4: 3 Haar features. u=8: 4 Haar features. u=16: 5 Haar features. u: 1+log2u Haar features. 4 + 5log4 u additions for computing u OHT 2-D basis. 2018/11/15

OHT and WHT - example 4x4 OHT 4x4 WHT 2018/11/15

OHT and WHT - Energy compaction ability X: number of Basis Y: Energy extracted X: number of operations Y: Energy extracted Y. Hel-Or and H. Hel-Or. Real time pattern matching using projection kernels. TPAMI, 27(9):1430–1445, Sept. 2005. G. Ben-Artz, H. Hel-Or, and Y. Hel-Or. The Gray-code filter kernels. TPAMI, 29(3):382– 393, Mar. 2007. W. Ouyang and W. Cham. Fast algorithm for Walsh Hadamard transform on sliding windows. TPAMI, 32(1):165–171, Jan. 2010. 2018/11/15

Outline Experimental result Introduction The proposed algorithm Conclusion 2018/11/15

Datasets (1) 120 images selected from 3 different databases. MIT, medical and remote sensing. F. Tombari etc. Full search-equivalent pattern matching with incremental dissimilarity approximations. IEEE TPAMI, 31(1):129–141, Jan.2009. 1,8, 9 http://people.csail.mit.edu/torralba/images. http://zulu.ssc.nasa.gov/mrsid. www.data-compression.info/corpora/lukascorpus 2018/11/15

Datasets (2) Noises: N1, N2, N3 and N4, Variances: 100, 200, 400 and 800 PSNR: 28.1, 25.1, 22.1 and 19.2 N1 N2 N3 N4 2018/11/15

1. Evaluating algorithms on different sizes Speedup over full search (FS) in pattern matching OHTI: OHT using integral image TimeFS/ TimeWHT OHTs: OHT using strip sum TimeFS/ TimeGCK TimeFS/ TimeIDA 500s/1s = 500 Y. Hel-Or and H. Hel-Or, “Real time pattern matching using projection kernels,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 9, pp. 1430-1445, Sept. 2005. G. Ben-Artzi, H. Hel-Or, and Y. Hel-Or, “The gray-code filter kernels,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 3,  pp.382 – 393, Mar. 2007 2018/11/15 F. Tombari etc. Full search-equivalent pattern matching with incremental dissimilarity approximations. IEEE TPAMI, 31(1):129–141, Jan.2009. 1,8, 9

2. Evaluating algorithms on different pattern sizes and different noise levels Speedup over FS. Time??/ TimeOHT Y. Hel-Or and H. Hel-Or, “Real time pattern matching using projection kernels,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 9, pp. 1430-1445, Sept. 2005. G. Ben-Artzi, H. Hel-Or, and Y. Hel-Or, “The gray-code filter kernels,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 3,  pp.382 – 393, Mar. 2007 F. Tombari etc. Full search-equivalent pattern matching with incremental dissimilarity approximations. IEEE TPAMI, 31(1):129–141, Jan.2009. 1,8, 9 2018/11/15

2. Evaluating algorithms on different pattern sizes and different noise levels (2) At about 4-15 times the speed of IDA At about 8-10 times the speed of GCK G. Ben-Artzi, H. Hel-Or, and Y. Hel-Or, “The gray-code filter kernels,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 3,  pp.382 – 393, Mar. 2007 2018/11/15 F. Tombari etc. Full search-equivalent pattern matching with incremental dissimilarity approximations. IEEE TPAMI, 31(1):129–141, Jan.2009. 1,8, 9

Outline Conclusion Introduction The proposed algorithm Experimental result Conclusion 2018/11/15

Conclusion A data structure (strip sum) that computes sum of pixels in a rectangle by 1 addition. A transform (orthogonal Haar transform) that requires O(logu) additions per pixel to project N1xN2 input window onto u basis vectors. Pattern matching using OHT Find the same result as Full Search (FS). Experimental results show that it can achieve up to 10 times speed-up over GCK in pattern matching. 2018/11/15

Thanks ! 2018/11/15