Advanced Science and Technology Letters Vol.28 (AIA 2013), pp.66-71 Local Contour Features for Writer Identification.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

A Graph based Geometric Approach to Contour Extraction from Noisy Binary Images Amal Dev Parakkat, Jiju Peethambaran, Philumon Joseph and Ramanathan Muthuganapathy.
QR Code Recognition Based On Image Processing
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Detecting Faces in Images: A Survey
Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.
Lecture 07 Segmentation Lecture 07 Segmentation Mata kuliah: T Computer Vision Tahun: 2010.
Adviser:Ming-Yuan Shieh Student:shun-te chuang SN:M
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
Chih-Hsing Lin, Jia-Shiuan Tsai, and Ching-Te Chiu
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Segmentation Divide the image into segments. Each segment:
Clustering Color/Intensity
CS 223B Assignment 1 Help Session Dan Maynes-Aminzade.
Object Recognition Using Geometric Hashing
Feature Screening Concept: A greedy feature selection method. Rank features and discard those whose ranking criterions are below the threshold. Problem:
Iris localization algorithm based on geometrical features of cow eyes Menglu Zhang Institute of Systems Engineering
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Handwritten Thai Character Recognition Using Fourier Descriptors and Robust C-Prototype Olarik Surinta Supot Nitsuwat.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.
Introduction --Classification Shape ContourRegion Structural Syntactic Graph Tree Model-driven Data-driven Perimeter Compactness Eccentricity.
Joint Histogram Based Cost Aggregation For Stereo Matching Dongbo Min, Member, IEEE, Jiangbo Lu, Member, IEEE, Minh N. Do, Senior Member, IEEE IEEE TRANSACTION.
Presented by: Kamakhaya Argulewar Guided by: Prof. Shweta V. Jain
FEATURE EXTRACTION FOR JAVA CHARACTER RECOGNITION Rudy Adipranata, Liliana, Meiliana Indrawijaya, Gregorius Satia Budhi Informatics Department, Petra Christian.
Neighborhood Operations
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
1 Template-Based Classification Method for Chinese Character Recognition Presenter: Tienwei Tsai Department of Informaiton Management, Chihlee Institute.
New Segmentation Methods Advisor : 丁建均 Jian-Jiun Ding Presenter : 蔡佳豪 Chia-Hao Tsai Date: Digital Image and Signal Processing Lab Graduate Institute.
1 Recognition of Multi-Fonts Character in Early-Modern Printed Books Chisato Ishikawa(1), Naomi Ashida(1)*, Yurie Enomoto(1), Masami Takata(1), Tsukasa.
Ajay Kumar, Member, IEEE, and David Zhang, Senior Member, IEEE.
Chapter 9.  Mathematical morphology: ◦ A useful tool for extracting image components in the representation of region shape.  Boundaries, skeletons,
Table 3:Yale Result Table 2:ORL Result Introduction System Architecture The Approach and Experimental Results A Face Processing System Based on Committee.
Huijuan Yang, Alex C. Kot, IEEE Fellow IEEE Transactions on Multimedia, Vol. 9, No. 3, Apr Multimedia Security Final Project R 葉容瑜 R
ENT 273 Object Recognition and Feature Detection Hema C.R.
Texture. Texture is an innate property of all surfaces (clouds, trees, bricks, hair etc…). It refers to visual patterns of homogeneity and does not result.
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
COMPARISON OF IMAGE ANALYSIS FOR THAI HANDWRITTEN CHARACTER RECOGNITION Olarik Surinta, chatklaw Jareanpon Department of Management Information System.
1 Webcam Mouse Using Face and Eye Tracking in Various Illumination Environments Yuan-Pin Lin et al. Proceedings of the 2005 IEEE Y.S. Lee.
Digital Image Processing Lecture 18: Segmentation: Thresholding & Region-Based Prof. Charlene Tsai.
Online Kinect Handwritten Digit Recognition Based on Dynamic Time Warping and Support Vector Machine Journal of Information & Computational Science, 2015.
Handwritten Hindi Numerals Recognition Kritika Singh Akarshan Sarkar Mentor- Prof. Amitabha Mukerjee.
Comparison of Handwritings Miroslava Božeková Thesis supervisor: Doc. RNDr. Milan Ftáčnik, CSc.
Human Detection Method Combining HOG and Cumulative Sum based Binary Pattern Jong Gook Ko', Jin Woo Choi', So Hee Park', Jang Hee You', ' Electronics and.
Advanced Science and Technology Letters Vol.31 (ACN 2013), pp Application Research of Wavelet Fusion Algorithm.
Zhongyan Liang, Sanyuan Zhang Under review for Journal of Zhejiang University Science C (Computers & Electronics) Publisher: Springer A Credible Tilt License.
Face Image-Based Gender Recognition Using Complex-Valued Neural Network Instructor :Dr. Dong-Chul Kim Indrani Gorripati.
Timo Ahonen, Abdenour Hadid, and Matti Pietikainen
CSE 185 Introduction to Computer Vision Feature Matching.
By Pushpita Biswas Under the guidance of Prof. S.Mukhopadhyay and Prof. P.K.Biswas.
Advanced Science and Technology Letters Vol.29 (SIP 2013), pp Electro-optics and Infrared Image Registration.
Wonjun Kim and Changick Kim, Member, IEEE
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Sheng-Hsuan Wang Author : Sanghamitra.
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Chapter 6 Skeleton & Morphological Operation. Image Processing for Pattern Recognition Feature Extraction Acquisition Preprocessing Classification Post.
Arabic Handwriting Recognition Thomas Taylor. Roadmap  Introduction to Handwriting Recognition  Introduction to Arabic Language  Challenges of Recognition.
WLD: A Robust Local Image Descriptor Jie Chen, Shiguang Shan, Chu He, Guoying Zhao, Matti Pietikäinen, Xilin Chen, Wen Gao 报告人:蒲薇榄.
1 A Statistical Matching Method in Wavelet Domain for Handwritten Character Recognition Presented by Te-Wei Chiang July, 2005.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
Optical Character Recognition
October 3, 2013Computer Vision Lecture 10: Contour Fitting 1 Edge Relaxation Typically, this technique works on crack edges: pixelpixelpixel pixelpixelpixelebg.
Course : T Computer Vision
Improving the Performance of Fingerprint Classification
PRESENTED BY Yang Jiao Timo Ahonen, Matti Pietikainen
Seunghui Cha1, Wookhyun Kim1
Mean Shift Segmentation
CSSE463: Image Recognition Day 25
Image Segmentation Techniques
Random Neural Network Texture Model
Presentation transcript:

Advanced Science and Technology Letters Vol.28 (AIA 2013), pp Local Contour Features for Writer Identification Hong Ding ', Huiqun Wu 2, and Xiaofeng Zhang ' 1. School of Computer Science and Technology, Nantong University, Nantong, China 2. Medical School, Nantong University, Nantong, China Abstract. A method based on local contour features is proposed for writer identification in this paper. In preprocessing, an improved Bern-son algorithm is used to abstract contours form images. Then the distribution of local contour is extracted from the fragments which are parts of the contour in sliding windows. In order to reduce the impact of stroke weight, the fragments which do not directly connect the center point are ignored during feature abstraction. The edge point distributions of the fragments are counted and normalized into Local Contour Distribution Features (LCDF). At last, weighted Manhattan distance is used as similarity measurement. The experiments on our database show that the performances of the proposed method gets the state-of-art performance. Keywords: writer identification, stroke feature, local contour distribution feature, weighted Manhattan distance. 1 Introduction Writer identification is a hot area of computer vision and pattern recognition. According to the identification contents, writer identification methods fall into two major categories: text- dependent and text- independent [1]. In recent years, a variety of methods have been proposed for text-independent writer identifica-tion. These methods can be categorized into two kinds: model based and feature based. Model based methods build models and train the parameters from training images. Schlapbache et al.[2] builded a handwriting model by hidden Markov model and identified a writing image by its agreement with the model. He et al. [3] used hidden Markov tree model in wavelet domain to build the model of handwriting. Feature based methods focus on abstracting distinguish features for writer identification. Bulacu et al. [4] proposed a serial features with direction, angle for writer identification. Li et al. proposed a micro- structure feature [5], and improved it [6]. The micro-structure feature makes good performances on Chinese character identification. Ghiasi et al. [7] coded the local structures into a leng-angle form and used them to describe the direction of handwriting. Learning from the idea of local structure distribution and extending it to general case, a method base on Local Contour Distribution Feature (LCDF) is ISSN: ASTL Copyright © 2013 SERSC

Advanced Science and Technology Letters Vol.28 (AIA 2013) proposed in this paper. LCDF reflects the writing style by counting the distri- bution of stroke in sliding windows. In order to reduce the impacts of stroke weights and irrelevant structures, only the edge points directly connecting the center point are counted in the sliding window. At last, the weighted Manhattan distance is used to measure the similarity between two LCDFs. The experiments on our database show that the proposed method gets state-of-art performance of the methods. 2 Feature abstraction and similarity measurement The proposed method contains two main parts: feature abstraction and similarity measurement. For our feature is extracted from the stroke contour, an contour detection preprocessing is required. 2.1 Contour detection preprocessing Bernsen algorithm [8] is a local binarization method and better for uneven illuminative images. This algorithm should be operated in sliding windows. In a sliding window, the center point is ( x, y ), the max value is max f and the min value is min f, where f contains values of all pixels. The definition of Bernsen algorithm in sliding window is T ( x,y ) =. (1) 2 max f + min f Then, the binarization result can be obtained by {. (2) B(x, y) = 0 if f ( x, y ) < T ( x, y ) 255 else The shortcoming of Bersen algorithm is over-segmentation in uniform regions. So it is not a reasonable binarization method by equation (2) when the difference between the max and min is too small. Considering even regions are not inner regions of strokes in most conditions, the. (3) binarization method is modified to { B(x, y) = 0 if f ( x,y ) T 255 otherwise where T is a threshold. 2.2 Fragment extraction The rectangle in Fig. 1 is a sliding window. Its center is an edge point marked 67 Copyright © 2013 SERSC

with”+”.Thesizeofthewindowis(2r+ 1) × (2r + 1),where2r+ 1 isthelength of a side of the window. There are several fragments in the window, which are parts of the contour. In the conditions of any writing instruments

Advanced Science and Technology Letters Vol.28 (AIA 2013) allowed, handwritings with different weight will be obtained from a same writer. So the stroke weight has a negative influence in writer identification. In order to reduce the influence of stroke weight, the fragments not connecting the center point are ignored. Fig. 1 shows the local fragment extraction process. There are two fragments in the window and only the one connecting center point is used in next step. Fig. 1. The fragment extraction in a sliding window. 2.3 The LCDF extraction The stroke distribution can reveal hidden features of stroke. The probability distribution of local structure in sliding windows is used in literature [5,6]. The sliding window goes through the image with all edge points as its center. The feature counting window in literature [5, 6] is showed in Fig. 2. Its center is an edge point marked with black. The gray sites are edge points of the fragment connecting the center. The subscript of every site is its group number. For a 7×7 window, there are three groups. In literature [5,6], the numbers of some related site pair are counted, such as the same group pairs (123, 203), (82, 132) and (41, 71), the adjacent group pairs (41, 82), (71, 132), (82, 123) and (132, 203), the interval group pairs (41, 123) and (71, 203). These related group pairs can depict the local structure distribution. The existing local features only used a subset of related site pairs. A reason-able extension of these ideas is considering more pairs may gain a more powerful feature. The proposed feature uses the pairs whose first group number is no less than the second number. Then, the feature is abstracted by next steps: 1.Contour detection. It is an important preprocessing. Sobel detector is useful for simple background images, while an improved Bernsen algorithm is valuable for complex background images. 2.Local fragment extraction. The method is shown in section Counting the number of (I m 1, J m 2), where I and J are two related points in a sliding window, m1 and m2 are their group numbers, m1 m2. 4.Go through all edge points and repeat step (2) and (3). 5.Normalization. Different images have different numbers of edge points. So, the distribution should be normalized. In our experiments, it is normalized with Copyright © 2013 SERSC 68

Advanced Science and Technology Letters Vol.28 (AIA 2013) Fig. 2. Feature counting window in literature [5, 6]. P I m N ( I m ), where N ( I m ) is the number of edge points. Then, the probability density of coding becomes N ( I m 1, J m 2 ) (4) p ( I m 1,J m 2 ) = P I m N ( I m ), where N ( I m 1, Jm 2 ) is the number of pair ( I m 1, J m 2 ). The main part of feature extraction is repeat counting, which is an easy way of realization. As the size of sliding window increases, the feature dimension rapidly increases and most features far from center tends to be nearly useless for their close to zero values. So, the size of sliding window is limited in a small range. In our experiments, three kinds of window sizes are used: 11 x 11, 13 x 13 and 15 x Similarity measurement The methods of similarity measurement fall into two major categories: model based and distance based. Considering the model is more time consuming and difficult to describe the relations between stroke and its surroundings, the pro- posed method directly computes the distance between two features and measures the similarity by the nearest neighbor rule. Several distance measurements and their weighted measurements have been tested in our experiments. Among these methods, the weighted Manhattan dis- tance has obtained the best performance, whose definition is D = i | LCDF 1 i − LCDF 2 i | σ i, (5) where σ i is standard deviation of the i th component of LCDFs, LCDF 1 i and LCDF 2 i are the i th components of two LCDFs, respectively. 69 Copyright © 2013 SERSC

Advanced Science and Technology Letters Vol.28 (AIA 2013) 3 Experiments To evaluate the effectiveness of the proposed method, we test it on our writer database. Distances between any two document images of the database are cal- culated by the weighted Manhattan distance. These results are sorted from the most similar to the less similar image. Then, two different measurements soft TOP-N and hard TOP-N criterion are used to evaluate the performance of the proposed method. Soft TOP-N criterion is the accuracy of at least one of the same writer is included in the N most similar document images. While hard TOP-N criterion is the accuracy of all the N most similar document images are written by the same writer. Hard criterion is more strict. Handwritings of our database are from fifteen writers. Each writer has three document images and each image has about fifty Chinese characters. These images have inhomogeneous intensities and obvious noise because of the low perfor-mance of our scanner. So they are binarized by the improved Bernsen algorithm. The values of N used for the soft criterion are 1, 2, 5 and 10. For every writer only has three images, the value of N used for the hard criterion is 2. The proposed method is a local structure method. The method of [6] has rel-ative high performance among the existing methods. So we realized this method for comparison. Table 1,2 show the performance on our database. The high per- formance of our method show LCDF is more powerful than the existing local structure features. Table 1. Performance on our database (soft evaluation). method sliding window size Top -1 Top -2 Top -5 Top x % 96.5% 100% 100% Method of [6] 13 x % 97.8% 97.8% 100% 15 x % 97.8% 97.8% 100% 11 x % 97.8% 100% 100% The proposed method 13 x % 100% 100% 100% 15 x % 97.8% 100% 100% Table 2. Performance on our database (hard evaluation). method sliding window size Top-2 Method of [6] 11 x % The proposed method x % Copyright©2013 SERSC 70

Advanced Science and Technology Letters Vol.28 (AIA 2013) 4 Conclusion In this paper, a method based on LCDF is proposed. The contour is detected by an improved Bernsen algorithm. Then LCDF is extracted from the sliding win- dows by counting the edge point distribution of the fragments. In order to reduce the impact of the stroke weight, only the fragments connecting the centers of sliding windows are counted. Our feature is more powerful than the existing local structure features by counting more related pairs. At last, the weighted Manhat-tan distance effectively measures the similarities of the LCDFs. The experiments on our database show the good performance of our method. References 1.X. Li, X. Q. Ding, X. L. Wang.: Semi-text-independent writer verification of Chi- nese handwriting. In: International Conference on Fountiers of Handwriting Recognition, (2008). 2.A. Schlapbach, H. Bunke.: A writer identification and verification system using HMM based recognizers. Pattern Analysis and Applications, vol.10, no.1, pp.33– 43(2007). 3.Z. Y. He, X. G. You, Y. Y. Tang.: Writer identification of Chinese handwriting documents using hidden Markov tree model. Pattern Recognition, vol.41, no.4, pp.1295–1307(2008). 4.M. Bulacu, L. Schomaker.: Text-independent writer identification and verification using textural and allographic features. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, no.4, pp.701–717(2007). 5.X. Li, X. Q. Ding, L. R. Peng.: A microstructure feature based text-independent method of writer identification for multilingual handwritings. Acta Automatica Sinica, vol.35, no.9, pp.1199–1208(2009). 6.X. Li, X. Q. Ding.: Writer identification based on improved microstructure features. Journal of Tsinghua University (Science and technology), vol.50, no.4, pp.595– 600(2010). 7.G. Ghiasi, R. Safabakhsh.: Offline text-independent writer identification using codebook and efficient code extraction methods. Image and Vision Computing, vol.31, no.5, pp.379–391(2013). 8.J. Bernsen.: Dynamic thresholding of gray-level images. In: International Conference on Pattern Recognition, pp (1986). 71 Copyright © 2013 SERSC