Segmentation of Touching Characters in Devnagari & Bangla Scripts Using Fuzzy MultiFactorial Analysis Presented By: Sanjeev Maharjan St. Xavier’s College.

Slides:



Advertisements
Similar presentations
TUTORIAL 1 Getting Started with Excel
Advertisements

Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Word Spotting DTW.
INTRODUCTION ABOUT OMR. INDEX  Concept/Definition  Form Design  Scanners & Software  Storage  Accuracy  OMR Advantages  Commercial Suppliers.
Document Processing Methods for Telugu and other SE Asian Scripts
DIGITAL IMAGE PROCESSING
CS 376b Introduction to Computer Vision 04 / 08 / 2008 Instructor: Michael Eckmann.
Text Detection in Video Min Cai Background  Video OCR: Text detection, extraction and recognition  Detection Target: Artificial text  Text.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Prénom Nom Document Analysis: Segmentation & Layout Analysis Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
An Approach to Korean License Plate Recognition Based on Vertical Edge Matching Mei Yu and Yong Deak Kim Ajou University Suwon, , Korea 指導教授 張元翔.
Chinese Character Recognition for Video Presented by: Vincent Cheung Date: 25 October 1999.
Quark QuarkXPress 4 Intermediate Level Course. Working with Master Pages The Document Layout Palette allows you to add, delete, and move document and.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Text-To-Speech System for Marathi Miss. Deepa V. Kadam Indian Institute of Technology, Bombay.
Facial Recognition CSE 391 Kris Lord.
Learning Table Extraction from Examples Ashwin Tengli, Yiming Yang and Nian Li Ma School of Computer Science Carnegie Mellon University Coling 04.
MetImage : Image Analysis Software. The MetImage LX workstation is a complete image analysis system developed specifically to increase the speed, accuracy.
FEATURE EXTRACTION FOR JAVA CHARACTER RECOGNITION Rudy Adipranata, Liliana, Meiliana Indrawijaya, Gregorius Satia Budhi Informatics Department, Petra Christian.
Chapter 4 Pattern Recognition Concepts continued.
1 Template-Based Classification Method for Chinese Character Recognition Presenter: Tienwei Tsai Department of Informaiton Management, Chihlee Institute.
: Chapter 10: Image Recognition 1 Montri Karnjanadecha ac.th/~montri Image Processing.
Word Processing Standard Grade Computing LA/LM. Word processor a computer program that allows you to manipulate text What is?
Graph-based Segmentation. Main Ideas Convert image into a graph Vertices for the pixels Vertices for the pixels Edges between the pixels Edges between.
1 Recognition of Multi-Fonts Character in Early-Modern Printed Books Chisato Ishikawa(1), Naomi Ashida(1)*, Yurie Enomoto(1), Masami Takata(1), Tsukasa.
IIIT Hyderabad Thesis Presentation By Raman Jain ( ) Towards Efficient Methods for Word Image Retrieval.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
CS 6825: Binary Image Processing – binary blob metrics
Avoiding Segmentation in Multi-digit Numeral String Recognition by Combining Single and Two-digit Classifiers Trained without Negative Examples Dan Ciresan.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
A survey of different shape analysis techniques 1 A Survey of Different Shape Analysis Techniques -- Huang Nan.
Digital Image Processing Lecture 16: Segmentation: Detection of Discontinuities Prof. Charlene Tsai.
Reporter: 資訊所 P Yung-Chih Cheng ( 鄭詠之 ).  Introduction  Data Collection  System Architecture  Feature Extraction  Recognition Methods  Results.
Mobile Image Processing
UC Berkeley CS294-9 Fall Document Image Analysis Lecture 11: Word Recognition and Segmentation Richard J. Fateman Henry S. Baird University of.
Creating Tables LESSON 6 - #1.06 ESSENTIAL STANDARD #1 - UNDERSTAND WORD PROCESSING INDICATOR #6 – CREATING TABLES.
Dr. István Marosi Scansoft-Recognita, Inc., Hungary SSIP 2005, Szeged Character Recognition Internals.
Scanned Documents INST 734 Module 10 Doug Oard. Agenda Document image retrieval  Representation Retrieval Thanks for David Doermann for most of these.
Wonjun Kim and Changick Kim, Member, IEEE
Digital Image Processing Lecture 16: Segmentation: Detection of Discontinuities May 2, 2005 Prof. Charlene Tsai.
Face Detection Using Color Thresholding and Eigenimage Template Matching Diederik Marius Sumita Pennathur Klint Rose.
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Course 3 Binary Image Binary Images have only two gray levels: “1” and “0”, i.e., black / white. —— save memory —— fast processing —— many features of.
Arabic Handwriting Recognition Thomas Taylor. Roadmap  Introduction to Handwriting Recognition  Introduction to Arabic Language  Challenges of Recognition.
Date of download: 5/28/2016 Copyright © 2016 SPIE. All rights reserved. Flowchart of the computer-aided diagnosis (CAD) tool. (a) Segmentation: The region.
Character legibility: which details are important, and how might it be measured? John Hayes, Jim Sheedy, Yu-Chi Tai, Vinsunt Donato, David Glabe.
1 A Statistical Matching Method in Wavelet Domain for Handwritten Character Recognition Presented by Te-Wei Chiang July, 2005.
Computer Fundamentals Muhammadamin Daneshwar And Masoud Aras Computer Engineering Department Soran University Lecture 6.
Optical Character Recognition
Over the recent years, computer vision has started to play a significant role in the Human Computer Interaction (HCI). With efficient object tracking.
Learning to use a ‘For Loop’ and a ‘Variable’. Learning Objective To use a ‘For’ loop to build shapes within your program Use a variable to detect input.
UC Berkeley CS294-9 Fall Document Image Analysis Lecture 12: Word Segmentation Richard J. Fateman Henry S. Baird University of California – Berkeley.
OCR Reading.
Recognition of biological cells – development
Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts
S.Rajeswari Head , Scientific Information Resource Division
IMAGE PROCESSING RECOGNITION AND CLASSIFICATION
Digital Image Processing Lecture 16: Segmentation: Detection of Discontinuities Prof. Charlene Tsai.
Pattern Recognition Sergios Theodoridis Konstantinos Koutroumbas
Zone Identification in the Printed Gujarati Text
Text Detection in Images and Video
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Dr. István Marosi Recosoft Ltd., Hungary
Loop Statements & Vectorizing Code
Handwritten Characters Recognition Based on an HMM Model
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Loop Statements & Vectorizing Code
Pattern Recognition and Training
Pattern Recognition and Training
Presentation transcript:

Segmentation of Touching Characters in Devnagari & Bangla Scripts Using Fuzzy MultiFactorial Analysis Presented By: Sanjeev Maharjan St. Xavier’s College

Contents  Background  Introducing Devnagari & Bangla Scripts  Segmentation in Devnagari & Bangla  Touching Characters in OCR  Fuzzy Mutifactorial Analysis (Solving Touching Character Problem)  Result  Conclusion

BACKGROUND  Document Analysis System  OCR for Document Analysis  How OCR works?

BACKGROUND (Document Analysis System)  Facilitates transfer of information on a paper document to the computer systems without intensive manual keying

BACKGROUND (OCR for Document Analysis System)  Recognizes the well-shaped and well-spaced characters from the scanned paper document  Converts those recognized characters to machine- editable format

BACKGROUND (How OCR Works???)  Take Text Image as Input  Segmentation of Text Segment the text into lines Segment the lines into words Segment the words into characters  Character Recognition Extract distinct features of character Map the character with predefined character set

INTRODUCING ‘Devnagari’ & ‘Bangla’ SCRIPTS  Writing style Left to right  50 basic characters in both scripts  Vowels take modified shape (allograph)  Consonants combine to form compound characters  More than 250 characters  Characters of the word are combined by ‘headline’ or ‘Shirorekha’ or ‘Dikka’

INTRODUCING ‘Devnagari’ & ‘Bangla’ SCRIPTS FIGURE: Three parts in Devnagari Script (a) & Bangla Script (b)

SEGMENTATION IN ‘Devnagari’ & ‘Bangla’ SCRIPTS  Headline is detected in text by row wise sum of black pixels  Position between two consecutive headlines where projection profile height is least, segments into lines  Vertical projection pixel profile segments words  Removing the headline segments words into individual characters

SEGMENTATION IN ‘Devnagari’ & ‘Bangla’ SCRIPTS

TOUCHING CHARACTERS IN OCR  Efficiency of OCR relies on segmentation error rate  Segmentation is based on connectivity analysis  Invalid Touching Characters degrade segmentation efficiency  Touching characters are more frequent in the Devnagari & Bangla Scripts

TOUCHING CHARACTERS IN OCR (Research Observation)  Mostly constitutes of two characters  Not valid characters  Have larger aspect ratio than isolated character  Vertical thickness of the black bob at touching position is small  At most of the touching positions single black run is encountered  Touch mostly at middle of the middle zone  Uncommon stroke patterns are generated at touching points

FUZZY MULTIFACTORIAL ANALYSIS  Wang defined the concept of factor spaces(1982)  He defined ‘factor’ as primary time with possible states & characteristics  Eg: If factor is length, then 1mtr, 10 mtr etc are its states and long, short are its characteristics

FUZZY MULTIFACTORIAL ANALYSIS  H.X. Li & V.C. Yen discussed 4 factors: Measurable Factors: (like time) Nominal Factors: (like religion) Degree/Fuzzy Factors (Degree of Similarity) Switch/Boolean Factors (0/1)  Multiple fuzzy factors are analyzed to identify & segment the touching characters

FUZZY MULTIFACTORIAL ANALYSIS (Identifying Touching Characters)  Factors Considered: Dissimilarity factor (F md ) Aspect Ratio (F ar )  F md =1-d off /d d off =minimum similarity distance for a target character against a set of stored prototypes d=the offset distance used by character classifiers  F ar= e a /1+e a a=w/h, w, h=width and height of minimum upright bounding box of character  Multifactorial Function (Mid)=1/2(F md + F ar )

FUZZY MULTIFACTORIAL ANALYSIS (Finding Cut Positions)  Factors Considered F ic (Inverse crossing count)= c-1 c = vertical crossing count for a pixel column. F mt (measure of blob thickness)=1-t/T t = no. of black pixel found in one column scan T = height of the characters middle zone F dm (degree of Middleness) F dm =min(l1,l2)/max(l1,l2)

FUZZY MULTIFACTORIAL ANALYSIS (Finding Cut Positions)  Factors Considered: F up (Up Stroke pattern) F low (Lower Stroke Pattern)

FUZZY MULTIFACTORIAL ANALYSIS (Finding Cut Positions)  For ‘m’ pixel columns, all five factors are evaluated, forming nx5 evaluation matrix  Multifactorial function

FUZZY MULTIFACTORIAL ANALYSIS (Confirming Cut Column) 1.List optimal cut positions identified 2.Take cut position with highest multifactor evaluation value 3.Segment the touching character resulting two characters say p1 & p2 4.Send p1 and p2 to character classifier 5.If p1 & p2 both are recognized then cut position confirmed else if p1 is recognized then take cut position with 2 nd highest multifactor evaluation repeat from step 3 to segment p2 else if p2 is recognized then take cut position with 2 nd highest multifactor evaluation repeat from step 3 to segment p1 else take cut position with 2 nd highest multifactor evaluation repeat from step 3

FUZZY MULTIFACTORIAL ANALYSIS (Confirming Cut Column) Touching Character After identification of cut positions Separation using Right-Most Cut Position Character Classifier identifies as ‘Ka’ Character Classifier identifies as ‘La’ Character Classifier identifies as ‘Ma’ Separating Again

RESULTS (By using Fuzzy Multifactorial Analysis)  Still problem in identifying the touching characters  Segmentation accuracy is 98.92% and 98.47% in Devnagari and Bangla Scripts repectively  System Throughput (T) is calculated as : T= C/t where C is total number of characters properly recognized by the OCR, and t is total time elapsed for the operation  System Efficiency (E) is calculated as E=(Nv*100)/Nt where Nv is the number of valid cut columns and Nt is total number of cut columns checked to find the valid cuts.

CONCLUSION  Touching characters being one of the major problem for Nepali OCR at the present  Use of Fuzzy Multifactorial Analysis would certainly contribute in minimising the joining errors in Nepali OCR

Resource Material  Paper on “Segmentation of Touching Characters Printed Devnagari and Bangla Scripts Using Fuzzy Multifactorial Analysis” By: and Utpal GarainBidyut B. Chaudhari