Automatic Script Identification. Why do we need Script Identification OCRs are generally language dependent. Document layout analysis is sometimes language.

Slides:



Advertisements
Similar presentations
EA C461 - Artificial Intelligence
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Face Recognition: A Convolutional Neural Network Approach
Neural networks Introduction Fitting neural networks
Data Mining Classification: Alternative Techniques
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
Face Recognition & Biometric Systems, 2005/2006 Face recognition process.
Neural Networks II CMPUT 466/551 Nilanjan Ray. Outline Radial basis function network Bayesian neural network.
Lecture 5 Template matching
Lecture 14 – Neural Networks
RBF Neural Networks x x1 Examples inside circles 1 and 2 are of class +, examples outside both circles are of class – What NN does.
A Study of Approaches for Object Recognition
Prénom Nom Document Analysis: TextRecognition Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Slide credits for this chapter: Frank Dellaert, Forsyth & Ponce, Paul Viola, Christopher Rasmussen.
Object Recognition Using Geometric Hashing
Scale Invariant Feature Transform (SIFT)
Optimal Adaptation for Statistical Classifiers Xiao Li.
1 Automated Feature Abstraction of the fMRI Signal using Neural Network Clustering Techniques Stefan Niculescu and Tom Mitchell Siemens Medical Solutions,
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
K-means Based Unsupervised Feature Learning for Image Recognition Ling Zheng.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Aula 4 Radial Basis Function Networks
Radial-Basis Function Networks
8/10/ RBF NetworksM.W. Mak Radial Basis Function Networks 1. Introduction 2. Finding RBF Parameters 3. Decision Surface of RBF Networks 4. Comparison.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Pattern Recognition Vidya Manian Dept. of Electrical and Computer Engineering University of Puerto Rico INEL 5046, Spring 2007
This week: overview on pattern recognition (related to machine learning)
Introduction to variable selection I Qi Yu. 2 Problems due to poor variable selection: Input dimension is too large; the curse of dimensionality problem.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Geodesic Minimal Paths Vida Movahedi Elder Lab, January 2010.
Kumar Srijan ( ) Syed Ahsan( ). Problem Statement To create a Neural Networks based multiclass object classifier which can do rotation,
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
Group 2 R 李庭閣 R 孔垂玖 R 許守傑 R 鄭力維.
Matlab Matlab Sigmoid Sigmoid Perceptron Perceptron Linear Linear Training Training Small, Round Blue-Cell Tumor Classification Example Small, Round Blue-Cell.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Texture analysis Team 5 Alexandra Bulgaru Justyna Jastrzebska Ulrich Leischner Vjekoslav Levacic Güray Tonguç.
CSSE463: Image Recognition Day 31 Today: Bayesian classifiers Today: Bayesian classifiers Tomorrow: project day. Tomorrow: project day. Questions? Questions?
Object Recognition in Images Slides originally created by Bernd Heisele.
ECE738 Advanced Image Processing Face Detection IEEE Trans. PAMI, July 1997.
Handwritten Hindi Numerals Recognition Kritika Singh Akarshan Sarkar Mentor- Prof. Amitabha Mukerjee.
Sparse Bayesian Learning for Efficient Visual Tracking O. Williams, A. Blake & R. Cipolloa PAMI, Aug Presented by Yuting Qi Machine Learning Reading.
Linear Models for Classification
AUTOMATIC TARGET RECOGNITION AND DATA FUSION March 9 th, 2004 Bala Lakshminarayanan.
CSSE463: Image Recognition Day 31 Today: Bayesian classifiers Today: Bayesian classifiers Tomorrow: Out-of-class project workday Tomorrow: Out-of-class.
Automated Fingertip Detection
A New Method for Crater Detection Heather Dunlop November 2, 2006.
CSC321 Lecture 5 Applying backpropagation to shape recognition Geoffrey Hinton.
Scanned Documents INST 734 Module 10 Doug Oard. Agenda Document image retrieval  Representation Retrieval Thanks for David Doermann for most of these.
Intro. ANN & Fuzzy Systems Lecture 13. MLP (V): Speed Up Learning.
Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
WLD: A Robust Local Image Descriptor Jie Chen, Shiguang Shan, Chu He, Guoying Zhao, Matti Pietikäinen, Xilin Chen, Wen Gao 报告人:蒲薇榄.
Neural networks (2) Reminder Avoiding overfitting Deep neural network Brief summary of supervised learning methods.
1 Convolutional neural networks Abin - Roozgard. 2  Introduction  Drawbacks of previous neural networks  Convolutional neural networks  LeNet 5 
Optical Character Recognition
Deep Learning Amin Sobhani.
Source: Procedia Computer Science(2015)70:
Self organizing networks
A critical review of RNN for sequence learning Zachary C
Outline Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” Proceedings of the IEEE, vol. 86, no.
Outline Single neuron case: Nonlinear error correcting learning
network of simple neuron-like computing elements
Convolutional neural networks Abin - Roozgard.
Concave Minimization for Support Vector Machine Classifiers
Mathematical Foundations of BME
Face Recognition: A Convolutional Neural Network Approach
Department of Computer Science Ben-Gurion University of the Negev
Review of Statistical Pattern Recognition
Using Association Rules as Texture features
Presentation transcript:

Automatic Script Identification

Why do we need Script Identification OCRs are generally language dependent. Document layout analysis is sometimes language dependent. For Indexing Documents.

Steps in Script Identification Pre-Processing Binerization, Skew detection and correction. Classification Feature Extraction Segmentation Line and word Seperation. Input DocumentScript type

Feature Extraction Projection Profiles. Texture features using entropy, energy etc. Hough transform based features. Gabor filter based directional features.

Nature of Classification Problem Classifiers like KNN, MLP, SVM have been used. Training samples for a class is limited to a few fonts only. However in reality there are many fonts for the same script. Hence while the entire data is distributed over much larger space, the training is packed in much smaller region.

Nature of Classification Problem Red for training data Blue for remaining data

Bayesian neural networks solve this by sampling from the distribution of weights and integrating over solutions But sampling is expensive and the method is time consuming. Nature of Classification Problem

Expectation Maximization Neural Model To get non-linear probability fields we need to integrate over all weights that classify well and yet are different from each other. Modify cost function or error function in the neural network we need to optimize. M(w i ) = E(w i ) + cos 2 (w i, w j ) where (j ~= i)

EM Neural Model Training using EM. E step is exact as we make prior equal to posterior M step through the modified Gradient descent Single layer of neurons shows non-linearity! ∫∫∫∫ Σ

Some Experiments

MLPEMNM

References 1. A. Spitz, “Determination of The Script and Language Content of Document Images” IEEE Transactions on PAMI, vol 19, pp , J Ding, L Lam and C Y Suen, “Classification of Oriental and European Scripts by Using Charecterestic Features”, Proc. 4 th ICDAR, pp , J. Hochberg, P Kelly, T Thomas and L Kerns, “Automatic Script Identification from Document Images using Cluster Based Templates.” IEEE Transactions on PAMI, vol 19, pp , T.N. Tan, “Rotation Invariant Texture Features and Their Use in Automatic Script Identoification” IEEE Transactions on PAMI, vol 20, pp , J Hochberg, K Bowers, M Cannon and P Kelly, “Script and Language Identification for Handwritten Document Images”, IJDAR, vol 2, pp 42-52, U Pal and B B Chaudhuri, “ Automatic Identification of English, Chinese, Arabic, Devanagiri and Bangla Script Line”, Proc. 6 th ICDAR, 2001