1 Document Image Matching Based on Component Blocks Fuhui Long, Hanchuan Peng, Zheru Chi, and Wanchi Siu Center for Multimedia Signal Processing, Department.

Slides:

Advertisements

Similar presentations

Patient information extraction in digitized X-ray imagery Hsien-Huang P. Wu Department of Electrical Engineering, National Yunlin University of Science.

Advertisements

Applications of one-class classification

IMAGE RESIZING & SEAMCARVING CS16: Introduction to Algorithms & Data Structures Thursday, January 23, 14 1.

QR Code Recognition Based On Image Processing

Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.

Word Spotting DTW.

電腦視覺 Computer and Robot Vision I

An Infant Facial Expression Recognition System Based on Moment Feature Extraction C. Y. Fang, H. W. Lin, S. W. Chen Department of Computer Science and.

Automatically Annotating and Integrating Spatial Datasets Chieng-Chien Chen, Snehal Thakkar, Crail Knoblock, Cyrus Shahabi Department of Computer Science.

Face Alignment with Part-Based Modeling

3D Shape Histograms for Similarity Search and Classification in Spatial Databases. Mihael Ankerst,Gabi Kastenmuller, Hans-Peter-Kriegel,Thomas Seidl Univ.

Automatic Histogram Threshold Using Fuzzy Measures 呂惠琪.

September 10, 2013Computer Vision Lecture 3: Binary Image Processing 1Thresholding Here, the right image is created from the left image by thresholding,

Quadtrees, Octrees and their Applications in Digital Image Processing

DTM Generation From Analogue Maps By Varshosaz. 2 Using cartographic data sources Data digitised mainly from contour maps Digitising contours leads to.

Active Calibration of Cameras: Theory and Implementation Anup Basu Sung Huh CPSC 643 Individual Presentation II March 4 th,

Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.

Chamfer Matching & Hausdorff Distance Presented by Ankur Datta Slides Courtesy Mark Bouts Arasanathan Thayananthan.

Pores and Ridges: High- Resolution Fingerprint Matching Using Level 3 Features Anil K. Jain Yi Chen Meltem Demirkus.

ART: Augmented Reality Table for Interactive Trading Card Game Albert H.T. Lam, Kevin C. H. Chow, Edward H. H. Yau and Michael R. Lyu Department of Computer.

Highlights Lecture on the image part (10) Automatic Perception 16

Fitting a Model to Data Reading: 15.1,

Feature Screening Concept: A greedy feature selection method. Rank features and discard those whose ranking criterions are below the threshold. Problem:

1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.

Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University

Data Input How do I transfer the paper map data and attribute data to a format that is usable by the GIS software? Data input involves both locational.

Triangle-based approach to the detection of human face March 2001 PATTERN RECOGNITION Speaker Jing. AIP Lab.

Morphological Image Processing

Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.

Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques

EE392J Final Project, March 20, Multiple Camera Object Tracking Helmy Eltoukhy and Khaled Salama.

AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.

FEATURE EXTRACTION FOR JAVA CHARACTER RECOGNITION Rudy Adipranata, Liliana, Meiliana Indrawijaya, Gregorius Satia Budhi Informatics Department, Petra Christian.

ENDA MOLLOY, ELECTRONIC ENG. FINAL PRESENTATION, 31/03/09. Automated Image Analysis Techniques for Screening of Mammography Images.

CPSC 601 Lecture Week 5 Hand Geometry. Outline: 1.Hand Geometry as Biometrics 2.Methods Used for Recognition 3.Illustrations and Examples 4.Some Useful.

BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.

Course 12 Calibration. 1.Introduction In theoretic discussions, we have assumed: Camera is located at the origin of coordinate system of scene.

Hierarchical Distributed Genetic Algorithm for Image Segmentation Hanchuan Peng, Fuhui Long*, Zheru Chi, and Wanshi Siu {fhlong, phc,

S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.

Under Supervision of Dr. Kamel A. Arram Eng. Lamiaa Said Wed

COMPARISON OF IMAGE ANALYSIS FOR THAI HANDWRITTEN CHARACTER RECOGNITION Olarik Surinta, chatklaw Jareanpon Department of Management Information System.

September 23, 2014Computer Vision Lecture 5: Binary Image Processing 1 Binary Images Binary images are grayscale images with only two possible levels of.

HP-PURDUE-CONFIDENTIAL Final Exam May 16th 2008 Slide No.1 Outline Motivations Analytical Model of Skew Effect and its Compensation in Banding and MTF.

Digital Image Processing CCS331 Relationships of Pixel 1.

Quadtrees, Octrees and their Applications in Digital Image Processing.

© 2005 Martin Bujňák, Martin Bujňák Supervisor : RNDr.

Dengsheng Zhang and Melissa Chen Yi Lim

2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.

2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )

CSC508 Convolution Operators. CSC508 Convolution Arguably the most fundamental operation of computer vision It’s a neighborhood operator –Similar to the.

A Flexible New Technique for Camera Calibration Zhengyou Zhang Sung Huh CSPS 643 Individual Presentation 1 February 25,

A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER

Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.

CS654: Digital Image Analysis

Nottingham Image Analysis School, 23 – 25 June NITS Image Segmentation Guoping Qiu School of Computer Science, University of Nottingham

Wonjun Kim and Changick Kim, Member, IEEE

An Improved Approach For Image Matching Using Principle Component Analysis(PCA An Improved Approach For Image Matching Using Principle Component Analysis(PCA.

Machine Vision Edge Detection Techniques ENT 273 Lecture 6 Hema C.R.

Compressing Bi-Level Images by Block Matching on a Tree Architecture Sergio De Agostino Computer Science Department Sapienza University of Rome ITALY.

Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.

Chapter 6 Skeleton & Morphological Operation. Image Processing for Pattern Recognition Feature Extraction Acquisition Preprocessing Classification Post.

Morphological Image Processing

DETECTION OF COPY MOVE FORGERY IN DIGITAL IMAGES.

3D Perception and Environment Map Generation for Humanoid Robot Navigation A DISCUSSION OF: -BY ANGELA FILLEY.

S.Rajeswari Head , Scientific Information Resource Division

Web Data Extraction Based on Partial Tree Alignment

Computer Vision Lecture 5: Binary Image Processing

An Infant Facial Expression Recognition System Based on Moment Feature Extraction C. Y. Fang, H. W. Lin, S. W. Chen Department of Computer Science and.

Handwritten Characters Recognition Based on an HMM Model

Comparing Images Using Hausdorff Distance

Presentation transcript:

1 Document Image Matching Based on Component Blocks Fuhui Long, Hanchuan Peng, Zheru Chi, and Wanchi Siu Center for Multimedia Signal Processing, Department of Electronic & Information Eng., The Hong Kong Polytechnic Univ., {fhlong, phc,

2 Outline Introduction Component Block & Data Structure Matching Algorithm Experiments Discussion & Conclusion

3 Document Image Matching Key technique for document image registration & retrieval Can be applied widely for office automation, digital library, video- conferencing, etc.

4 Current Techniques Existing methods are mainly based on local features of document page image Cesarini ’ s form-reader system (attributed relational graphs) Shimotsuji ’ s cell struct based 2-dimensional hash table Watanabe ’ s blank form structure of repetitions and positions of cells Tseng and Chen ’ s line segments based method Fan and Chang ’ s line crossing relationship matrix Watanabe and Huang ’ s predefined logical structure for business cards Safari ’ s projective geometry method etc

5 Our Approach Decompose a document page image into local component blocks Propose measurements to combine local block information global page layout information It is closely related to our e-Doc technique, which is developed for document databases

6 e-Doc Documents on Papers Image Acquiring Optical Images Pre-Processing Component Block List e-Doc Page Block Organization Functions for Applications The Block-oriented e-Doc technique can be very useful for document databases related applications, including the document page image retrieval, etc.

7 Preprocessing Noise removing Region based binarization and foreground extraction Correlation based skew correction Image Blocking: Scan from bottom to top and from left to right Use the simplest region growing method (not pixel-by-pixel, but line-by-line, i.e. if there is a pixel on the out boundary of current block, then grow out one more line.)

8 Component Block List …… order=1 bound={(250,429),(45,86)} lang_index = type=English We only make use of the block location & size information for matching.

9 Matching Algorithm Procedure CBL-MA; {Input: a CBL for the input document image, a handle to a template image database of K TBLs } {Output: the TBL with the minimum distance D to CBL} {Preprocessing: for k=1,…,K, do begin sort the kth TBL by block size (from small to large); end.} {Note: the preprocessing is not a part of this CBL-MA and needs to be done only once beforehand} begin sort the CBL by block size (from small to large); for k=1,…,K, do begin compute D k, which is the distance between CBL and the kth TBL; end; select the TBL with the minimum D k as output; end. (Notation: TBL – Template Block List, CBL – Component Block List)

10 Distance Definitions Size Matching: Location Matching: Total Distance:

11 Illustration of the Algorithm BA BTBT …… Sequencing with size Component block list Sequenced block list Template block list 1. Matching with size 2. Matching with location

12 Experimental Data A large document template data set of 1350 templates. Define 5 subsets with sizes: 50 templates 100 templates 200 templates 500 templates 1350 templates Use computer to generate all test data (deformation images according to these templates)

13 Deformation Types Detection Error Block misdetection rate P m Block misaddition rate P a Block Size Variation Block size variation rate P s Block size variation scale S s Block Location Displacement Block displacement rate P d Block displacement scale S d Block Rotation Block rotation rate P r Block rotation angle D r

14 Data Examples Template imageDeformation image

15 Results for Detection Errors (1) CBL-MA can perform well (r c > 85%) even when 50% blocks in the block list are lost or wrongly added (see the column of P m = 0.5 and P a = 0.5). Even when 80% blocks are lost or added, this algorithm can still produce matching accuracy nearly 60% (see the column of P m = 0.8 and P a = 0.8). (2) CBL-MA is more insensitive to block misaddition than to block misdetection. This is reasonable because when additional blocks are wrongly put into CBL, the original blocks still play, although weaker, roles. On the contrary, the lost information due to block misdetection is non-recoverable.

16 Results for Detection Errors P m: Block misdetection rate P a : Block misaddition rate

17 Results for Block Size Variation For block size variation, the influence of parameters P s and S s (here we set the same scale factor for both block width and height) on r c is given in Table 2. When blocks expand or shrink greatly, CBL-MA can keep r c above 90% (the fourth row of Table 2). At the same time, even when all blocks have size variations (P s = 1.0), our algorithm can produce a very high matching accuracy of 95%. Note that the latter corresponds to many office automation applications, where S s is not very large, however, most blocks are subject to some degree of size variation, i.e. P s is close to 1.

18 Results for Block Size Variation P s : Block size variation rate S s : Block size variation scale

19 Results for Block Displacement For block location displacement, the influence of parameters P d and S d on r c is given in Table 3. Evidently CBL-MA is robust to block location variation (r c always larger than 95%).

20 Results for Block Displacement P d : Block displacement rate S d: Block displacement scale

21 Results for Block Rotation For block rotation, the influence of parameters P r and D r on r c is given in Table 4. For both cases {D r =15 , P r varies from 0.2 to 1.0} and { P r =0.5, D r varies from 5  to 45  }, CBL-MA produces satisfying classification, even when the test images contain strong deformation, e.g. 50% component blocks have at most 45  rotation, or all component blocks have at most 15  rotation. Notice that block rotation will directly lead to the significant change of block sizes.

22 Results for Block Rotation P r : Block rotation rate D r : Block rotation angle

23 Results for Template Set Size Here under a general setting of parameters {P a =0.2, P m =0.2, P s =0.2, S s =0.2, P d =0.5, S d =0.5, P r =0.5, D r =15  }, we examine the influence of the template image set size on the matching accuracy. All the five template sets are used. For each template set, we independently generate at least 2000 images for testing. The results are listed in Table 5. It is clear that even when the template set size grows to 500, the matching accuracy is satisfying (>80%). For the template set (Set-E), r c is still around 70%.

24 Results for Template Set Size

25 Comparison to Other Algorithms It is noticed that the failure in detecting local features (e.g. line-segments) usually immediately results in bad performance of several other algorithms. However, our experiments demonstrate that even when the block information is partially lost or inaccurate, there is no significant performance reduction of CBL-MA.

26 Computational Complexity Denote n as the number of blocks in a CBL, m as the number of blocks in a TBL, K as the number of template images in a document image database. When we use quicksort and binary search algorithms, the typical computational complexity is O(nlogn) for CBL sorting O(Kmlogn) for CBL/TBL size matching O(Km(2T C +1)) for CBL/TBL location matching O(K) for CBL/TBL distance calculation O(logK) for finding the minimum distance Totally O((Km+n)logn)

27 Application: Column Table Data Extraction This page matching algorithm is then refined and applied to automatic data extraction of column forms. ---All fields in input image differ from each other a lot and the local-feature based approach can not work well. ---Our algorithm is a powerful tool to find out the correct image template, which is used to annotate the image data fields to accomplish the data extraction successfully.

28 Conclusions Based on block list (and tree), our algorithm can effectively make use of the local information of each page block and the global information of page layout. We present a method for effective document image matching. The algorithm gives satisfying performance for various image deformations. The algorithm is robust to image distortion, filled-in text, and noises. We report an successful application of our algorithm in column table data auto-reading.