Oct 21, 2008IIP20081 Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts Olarik Surinta Mahasarakham University Thailand
Oct 21, 2008IIP20082 Introduction Palm leaf manuscripts have been a popular written media for over a thousand years in Southeast Asia Palm leaves were used for recording the history, knowledge and local wisdoms such as – Medical treatments – Buddhist doctrine – The story of dynasties
Oct 21, 2008IIP20083 Introduction (cont) Mahasarakham University is establishing Palm Leaf Manuscript Preservation Project for the discovery, preservation and protection of palm leaf manuscripts from Northeast Thailand palm leaf manuscript
Oct 21, 2008IIP20084 Proposed framework We use palm leaf manuscripts consisting of 227 pages to do research work The system processes consist of – Background elimination – Line segmentation, and – Character segmentation
Oct 21, 2008IIP20085 Proposed framework (cont) Framework of the BILAN (palm leaf manuscripts) system
Oct 21, 2008IIP20086 Convert Image from RGB color to Grey Image We use this equation to convert RGB color to Grey image Y = 0.3R G B RGB color Grey image
Oct 21, 2008IIP20087 Noise Reduction Noise is maybe appearing from the scanning process. This process is removing noise from Grey image using Gaussian Filtering grey image before and after noise reduction
Oct 21, 2008IIP20088 Background Elimination using Otsu’s Algorithm This method proposed by Otsu. It based on grey level histogram Otsu’s threshold value method
Oct 21, 2008IIP20089 Background Elimination using Otsu’s Algorithm (cont) Grey image binary image after background elimination
Oct 21, 2008IIP Image recovery We apply Mathematical Morphology in this research such as – Dilation – erosion binary imagebinary image after morphology
Oct 21, 2008IIP Line Segmentation Projection profile analysis is a popular technique for line segmentation. We use horizontal projection profile analysis because the texts in most document images are aligned along horizontal lines
Oct 21, 2008IIP Line Segmentation (cont) Line segmentation histogram Image after line segmentation
Oct 21, 2008IIP Line Segmentation (cont)
Oct 21, 2008IIP Character Segmentation In this step, use vertical projection profile analysis. we apply a threshold value on the length of the space in between the characters image after vertical projection profile
Oct 21, 2008IIP Experimental Results The method was tested using a set of 227 palm leaf manuscripts
Oct 21, 2008IIP Background Elimination Results Background EliminationAccuracy Number of documentsPercentage of background elimination segmented Complete13861 Incomplete8939 Total227100
Oct 21, 2008IIP Line Segmentation Results Number of Segmented Lines Percentage of lines correctly segmented 478% 587% Average82.5%
Oct 21, 2008IIP Complete background elimination
Oct 21, 2008IIP Incomplete background elimination
Oct 21, 2008IIP Incomplete background elimination
Oct 21, 2008IIP Future work Application this research for OCR system. Translation Palm leaf manuscripts into Thai language.
Oct 21, 2008IIP End of this presentation Thank you very much