Optimization of Line Segmentation Techniques for Thai Handwritten Document Olarik Surinta Mahasarakham University Thailand.

Slides:



Advertisements
Similar presentations
(MICROSOFT EXCEL). Is a spreadsheet application designed to take advantage of the windows graphical interface MICROSOFT EXCEL.
Advertisements

Segmentation of Touching Characters in Devnagari & Bangla Scripts Using Fuzzy MultiFactorial Analysis Presented By: Sanjeev Maharjan St. Xavier’s College.
Word Spotting DTW.
Yasuhiro Fujiwara (NTT Cyber Space Labs)
Y A S O O B A L I b o r n o n 1 9 t h F e b r u a r y i n K a n p u r d i s t r i c t o f U t t a r P r a d e s h. H e s t a r t e d s i n g i.
Digital Image Processing Lecture11: Histogram Processing.
Facial feature localization Presented by: Harvest Jang Spring 2002.
Esmail Hadi Houssein ID/  „Motivation  „Problem Overview  „License plate segmentation  „Character segmentation  „Character Recognition.
CS771 Machine Learning : Tools, Techniques & Application Gaurav Krishna Y Harshit Maheshwari Pulkit Jain Sayantan Marik
ROBOT BOMB REMOVER CHALLENGE 7. BOMB REMOVAL CHALLENGE Using the color sensor, you will program distinguish bombs and to remove them from the room. When.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Improved Image Quilting Jeremy Long David Mould. Introduction   Goal: improve “ minimum error boundary cut ”
Binary Image Compression Using Efficient Partitioning into Rectangular Regions IEEE Transactions on Communications Sherif A.Mohamed and Moustafa M. Fahmy.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Text Detection in Video Min Cai Background  Video OCR: Text detection, extraction and recognition  Detection Target: Artificial text  Text.
Personality Assessment from Handwriting Aaron Dancygier Jayson Diaz Sunday Olatunbosun Stacy Bryan.
Automatic Camera Calibration for Image Sequences of a Football Match Flávio Szenberg (PUC-Rio) Paulo Cezar P. Carvalho (IMPA) Marcelo Gattass (PUC-Rio)
Fundamentals of Multimedia Chapter 7 Lossless Compression Algorithms Ze-Nian Li and Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈.
Handwritten Thai Character Recognition Using Fourier Descriptors and Robust C-Prototype Olarik Surinta Supot Nitsuwat.
Detecting Vehicles from Satellite Images Presented By: Dr. Fernando Rios Dr. Rocio Alba Flores Sumalatha Kuthadi Prashant Jain.
VEHICLE NUMBER PLATE RECOGNITION SYSTEM. Information and constraints Character recognition using moments. Character recognition using OCR. Signature.
Classification with Hyperplanes Defines a boundary between various points of data which represent examples plotted in multidimensional space according.
FEATURE EXTRACTION FOR JAVA CHARACTER RECOGNITION Rudy Adipranata, Liliana, Meiliana Indrawijaya, Gregorius Satia Budhi Informatics Department, Petra Christian.
1 Template-Based Classification Method for Chinese Character Recognition Presenter: Tienwei Tsai Department of Informaiton Management, Chihlee Institute.
CHAPTER 1 Basic Statistics Statistics in Engineering
Presented by Tienwei Tsai July, 2005
Huijuan Yang, Alex C. Kot, IEEE Fellow IEEE Transactions on Multimedia, Vol. 9, No. 3, Apr Multimedia Security Final Project R 葉容瑜 R
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Towards an Intelligent Multilingual Keyboard System Tanapong Potipiti, Virach Sornlertlamvanich, Kanokwut Thanadkran Information Research and Development.
COMPARISON OF IMAGE ANALYSIS FOR THAI HANDWRITTEN CHARACTER RECOGNITION Olarik Surinta, chatklaw Jareanpon Department of Management Information System.
1 Iterative Multimodel Subimage Binarization for Handwritten Character Segmentation Author: Amer Dawoud and Mohamed S. Kamel Source: IEEE TRANSACTIONS.
Wei Dang Kevin Ellsworth Cory Shirts.  Goal: have a user interface to allow user text input using sign language digits and letters ◦ User interface ◦
Detection of nerves in Ultrasound Images using edge detection techniques NIRANJAN TALLAPALLY.
A New Fingertip Detection and Tracking Algorithm and Its Application on Writing-in-the-air System The th International Congress on Image and Signal.
Copyright 2006 South-Western/Thomson Learning Chapter 12 Tables.
Oct 21, 2008IIP20081 Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts Olarik Surinta Mahasarakham University Thailand.
Handwritten Recognition with Neural Network Chatklaw Jareanpon, Olarik Surinta Mahasarakham University.
Presented By: Abirami Poonkundran Authors: Jeff Yan, Ahmad El Ahmad.
Yuya Akita , Tatsuya Kawahara
Duy & Piotr. How to reconstruct a high quality image with the least amount of samples per pixel the least amount of resources And preserving the image.
Reporter: 資訊所 P Yung-Chih Cheng ( 鄭詠之 ).  Introduction  Data Collection  System Architecture  Feature Extraction  Recognition Methods  Results.
Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology.
Lecture 4: Lossless Compression(1) Hongli Luo Fall 2011.
Intelligent Key Prediction by N-grams and Error-correction Rules Kanokwut Thanadkran, Virach Sornlertlamvanich and Tanapong Potipiti Information Research.
By Terry Treutel Ride Quality Consultant Former WisDOT Ride Spec Coordinator.
Using Word Based Features for Word Clustering The Thirteenth Conference on Language Engineering 11-12, December 2013 Department of Electronics and Communications,
INVITATION TO Computer Science 1 11 Chapter 2 The Algorithmic Foundations of Computer Science.
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Detection of nerves in Ultrasound Images using edge detection techniques NIRANJAN TALLAPALLY.
1 A Statistical Matching Method in Wavelet Domain for Handwritten Character Recognition Presented by Te-Wei Chiang July, 2005.
THREE COLUMN (Click to edit) Introduction or Abstract Type in or past your text (Click to edit) Materials and Methods Type in or past your text (Click.
An intelligent strategy for checking the annual inspection status of motorcycles based on license plate recognition Yo-Ping Huang a, Chien-Hung Chen b,
Optical Character Recognition
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Improving Chinese handwriting Recognition by Fusing speech recognition
an Introduction to English
Presenter: Ibrahim A. Zedan
Context-based Data Compression
Zone Identification in the Printed Gujarati Text
PowerPoint Template
PowerPoint Template
Motivation and Background
Motivation and Background
8-3 RRAM Based Convolutional Neural Networks for High Accuracy Pattern Recognition and Online Learning Tasks Z. Dong, Z. Zhou, Z.F. Li, C. Liu, Y.N. Jiang,
PowerPoint Template
Joshua Kahn, Scott Wiese ECE533 – Fall 2003 December 12, 2003
Research on the Modeling of Chinese Continuous Speech Recognition
Create engaging training with text pictures and video!
Visual Recognition of American Sign Language Using Hidden Markov Models 문현구 문현구.
PowerPoint Template.
Presentation transcript:

Optimization of Line Segmentation Techniques for Thai Handwritten Document Olarik Surinta Mahasarakham University Thailand

Introduction In handwritten recognition, the line segmentation is an essential scheme. The occurrence of an inaccurately line segmentation will cause errors in the character segmentation. Most of line segmentation techniques have been based on horizontal projection profile technique. 10/21/2009SNLP20092

Introduction (cont) The texts in most document images are aligned along horizontal lines. Projection profile based techniques may be one of the most successful top-down algorithms 10/21/2009SNLP20093

The characteristic of Thai character Character typesCharacter Consonants ก ข ฃ ค ฅ ฆ ง จ ฉ ช ซ ฌญ ฎ ฏ ฐ ฑ ฒ ณ ด ต ถ ท ธน บ ป ผ ฝ พ ฟ ภ ม ย ร ฤล ฦ ว ศ ษ ส ห ฬ อ ฮก ข ฃ ค ฅ ฆ ง จ ฉ ช ซ ฌญ ฎ ฏ ฐ ฑ ฒ ณ ด ต ถ ท ธน บ ป ผ ฝ พ ฟ ภ ม ย ร ฤล ฦ ว ศ ษ ส ห ฬ อ ฮ Vowels อั อะ อา อิ อี อึ อื อุ อู เอ โอ ใอ ไอ ๆ อ็ อ์ อํ Tones อ่ อ้ อ๊ อ๋ 10/21/2009SNLP20094

The characteristic of Thai character (cont) 10/21/2009SNLP20095 Thai sentence structure

Based on the horizontal projection profile The horizontal projection profile is used in dividing the text image into character line. 10/21/2009SNLP20096

The line segmentation techniques in this research 1. The horizontal projection technique 2. The stripe technique 3. The comparing Thai character technique 4. The sorting and distinguishing (all of techniques based on horizontal projection profile) 10/21/2009SNLP20097

1. The horizontal projection profile 10/21/2009SNLP20098 result Horizontal histogram Image document

2. The stripe technique Firstly, the stripe technique divides image into stripe (small column). After that, the horizontal projection profile is used to divided the text image into character lines. 10/21/2009SNLP20099

2. The stripe technique (cont) 10/21/2009SNLP The result of strip technique for horizontal projection profile

3. Technique for comparing Thai character This technique takes advantage of the differences in size of characters to differentiate Thai characters between consonants and a group of small vowels and tones. 10/21/2009SNLP Comparing between consonant and a group of small vowel and tone

3. Technique for comparing Thai character (cont) First step The groups are divided into two groups (upper and lower zone). The higher group is then used to define the line from the image document 10/21/2009SNLP200912

3. Technique for comparing Thai character (cont) 10/21/2009SNLP The result of first step of comparison Thai character technique

3. Technique for comparing Thai character (cont) Second step consider the high value of white pixel between the line markers and choose a new line marker 10/21/2009SNLP200914

3. Technique for comparing Thai character (cont) this technique is complex as there are many steps to be proved. 10/21/2009SNLP The result of comparing Thai character technique.

4. The new technique for sorting and distinguishing This technique is not complicated and suitable for Thai character. Firstly Use the histogram of horizontal projection profile to sort the group of black pixels by starting with the minimum to maximum of black pixel. 10/21/2009SNLP200916

4. The new technique for sorting and distinguishing (cont) 10/21/2009SNLP Sorting the group of black pixels.

4. The new technique for sorting and distinguishing (cont) Secondly Find the maximum difference between two groups of black pixels The line marker is marked on the middle of the group of black pixels when the maximum difference value is less than value of the group of black pixels 10/21/2009SNLP200918

4. The new technique for sorting and distinguishing (cont) 10/21/2009SNLP The result of second step of sorting and distinguishing technique.

4. The new technique for sorting and distinguishing (cont) Finally A new line marker is placed in the middle between every two conjunction line markers 10/21/2009SNLP200920

4. The new technique for sorting and distinguishing 10/21/2009SNLP Click me to play this video.

Experimental result Thai image documents were generated from different peoples. Data sets contained varieties of writing styles, and limited to only single-column Thai image documents. 10/21/2009SNLP200922

Experimental result (cont) 10/21/2009SNLP  Single-Column  Not Single-Column

Experimental result (cont) The line marker is used to define the character line. The line makers pass through the image document and do not cross the group of black pixels (line segment is completed). 10/21/2009SNLP200924

Experimental result (cont) 10/21/2009SNLP  Complete line segmentation

Experimental result (cont) 10/21/2009SNLP  incomplete line segmentation

Experimental result (cont) Number of lines on image documents percentage T1 T2 T3 T Average /21/2009SNLP T1 is Horizontal projection technique T2 is Stripe technique T3 is Comparing Thai character T4 is Sorting and distinguishing

Conclusion I have presented four techniques for the line segmentation of Thai language –Horizontal projection profile –Stripe –Comparing Thai character –Sorting and distinguishing 4 techniques based on horizontal projection profile 10/21/2009SNLP200928

Conclusion (cont) The accuracy of the techniques are –The horizontal projection technique24.33% –The stripe technique19.44% (suitable for English character and Oriya text) –The comparing Thai character technique65.25% –The sorting and distinguishing technique97.11% (complex, many steps to be proved) 10/21/2009SNLP200929

Thank you Question & Answer 10/21/2009SNLP200930