Department of Computer Science Ben-Gurion University of the Negev

Slides:

Advertisements

Similar presentations

Hongliang Li, Senior Member, IEEE, Linfeng Xu, Member, IEEE, and Guanghui Liu Face Hallucination via Similarity Constraints.

Advertisements

Writer identification through information retrieval Ralph Niels, Franc Grootjen & Louis Vuurpijl.

1 Probabilistic Artificial Neural Network For Recognizing the Arabic Hand Written Characters Khalaf khatatneh, Ibrahiem El Emary,and Basem Al- Rifai Journal.

Handwritten Character Recognition Using Artificial Neural Networks Shimie Atkins & Daniel Marco Supervisor: Johanan Erez Technion - Israel Institute of.

Aula 5 Alguns Exemplos PMR5406 Redes Neurais e Lógica Fuzzy.

Example: ZIP Code Recognition Classification of handwritten numerals.

1 Automated Feature Abstraction of the fMRI Signal using Neural Network Clustering Techniques Stefan Niculescu and Tom Mitchell Siemens Medical Solutions,

Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.

Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.

Hurieh Khalajzadeh Mohammad Mansouri Mohammad Teshnehlab

Avoiding Segmentation in Multi-digit Numeral String Recognition by Combining Single and Two-digit Classifiers Trained without Negative Examples Dan Ciresan.

COMPARISON OF IMAGE ANALYSIS FOR THAI HANDWRITTEN CHARACTER RECOGNITION Olarik Surinta, chatklaw Jareanpon Department of Management Information System.

Handwritten Recognition with Neural Network Chatklaw Jareanpon, Olarik Surinta Mahasarakham University.

Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.

EE459 Neural Networks Examples of using Neural Networks Kasin Prakobwaitayakit Department of Electrical Engineering Chiangmai University.

Comparison of Handwritings Miroslava Božeková Thesis supervisor: Doc. RNDr. Milan Ftáčnik, CSc.

Presented By Lingzhou Lu & Ziliang Jiao. Domain ● Optical Character Recogntion (OCR) ● Upper-case letters only.

ImageNet Classification with Deep Convolutional Neural Networks Presenter: Weicong Chen.

1 A Statistical Matching Method in Wavelet Domain for Handwritten Character Recognition Presented by Te-Wei Chiang July, 2005.

By: Shane Serafin.  What is handwriting recognition  History  Different types  Uses  Advantages  Disadvantages  Conclusion  Questions  Sources.

Sparse Coding: A Deep Learning using Unlabeled Data for High - Level Representation Dr.G.M.Nasira R. Vidya R. P. Jaia Priyankka.

Learning to Compare Image Patches via Convolutional Neural Networks SERGEY ZAGORUYKO & NIKOS KOMODAKIS.

Tofik AliPartha Pratim Roy Department of Computer Science and Engineering Indian Institute of Technology Roorkee CVIP-WM 2017 Paper ID 172 Word Spotting.

Big data classification using neural network

Hybrid Deep Learning for Reflectance Confocal Microscopy Skin Images

Learning to Compare Image Patches via Convolutional Neural Networks

Deeply learned face representations are sparse, selective, and robust

The Relationship between Deep Learning and Brain Function

Sentence Modeling Representation of sentences is the heart of Natural Language Processing A sentence model is a representation and analysis of semantic.

Data Mining, Neural Network and Genetic Programming

Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek

Jure Zbontar, Yann LeCun

ROBUST FACE NAME GRAPH MATCHING FOR MOVIE CHARACTER IDENTIFICATION

Perceptual Loss Deep Feature Interpolation for Image Content Changes

Image Recognition. Contents: Motivation Objective Definition Introduction Preprocessing / Edge Detection Neural Networks in Image Recognition Practical.

Part-Based Room Categorization for Household Service Robots

Ajita Rattani and Reza Derakhshani,

Natural Language Processing of Knee MRI Reports

Guidelines for Green Computing projects

Final Year Project Presentation --- Magic Paint Face

Dynamic Routing Using Inter Capsule Routing Protocol Between Capsules

A Convolutional Neural Network Cascade For Face Detection

By: Kevin Yu Ph.D. in Computer Engineering

Layer-wise Performance Bottleneck Analysis of Deep Neural Networks

Bird-species Recognition Using Convolutional Neural Network

Convolutional Neural Networks for sentence classification

Image Inpainting Using Pre-Trained Classification CNN

Towards Understanding the Invertibility of Convolutional Neural Networks Anna C. Gilbert1, Yi Zhang1, Kibok Lee1, Yuting Zhang1, Honglak Lee1,2 1University.

RGB-D Image for Scene Recognition by Jiaqi Guo

Deep Learning Hierarchical Representations for Image Steganalysis

Domingo Mery Department of Computer Science

Introduction of MATRIX CAPSULES WITH EM ROUTING

network of simple neuron-like computing elements

CornerNet: Detecting Objects as Paired Keypoints

Creating Data Representations

Department of Electrical Engineering

Zip Codes and Neural Networks: Machine Learning for

Analysis of Trained CNN (Receptive Field & Weights of Network)

Introduction to Artificial Intelligence Lecture 24: Computer Vision IV

Zhedong Zheng, Liang Zheng and Yi Yang

Face Recognition: A Convolutional Neural Network Approach

Attention for translation

Department of Computer Science Ben-Gurion University of the Negev

Automatic Handwriting Generation

Human-object interaction

U-Net: Convolutional Network for Segmentation

Deep Object Co-Segmentation

End-to-End Facial Alignment and Recognition

Example of training and deployment of deep convolutional neural networks. Example of training and deployment of deep convolutional neural networks. During.

Presentation transcript:

Department of Computer Science Ben-Gurion University of the Negev Case Study: Fine Writing Style Classification Using Siamese Neural Network Alaa Abdalhaleem Berat Kurar Barakat Jihad El-Sana Hi, I am Berat Kurar, from Ben Gurion University of the Negev. I am going to talk on a work for writing style classification using Siamese network, in which I collaborated with my collegue Ala Abdalhaleem under supervision of prof. Jihad El-Sana. Department of Computer Science Ben-Gurion University of the Negev

Table of Contents Problem definition Contribution Related work Method Experiments Results Conclusion I will make a problem definition and describe our contribution for it Then I will discuss some related works on this problem. Then I will explain our method and details of experiments Finally I will present the results and conclude.

Problem definition Principle of handwriting style uniqueness BOOK 2 A well known fact in the literature is the principle of handwriting style uniqueness. That is, no two writers share the same handwriting style. For example here is given two pages from two manuscripts. Even at first glance, we can recognize that two writers do not share handwriting style. And if we have a close look, we can see that writer of the book1 writes the words wider and shorter in relative to the writer of the book2. Given a query document assigning one of the known writers to it, is called writer identification and is well studied in the literature.

Problem definition Principle of in-writer variation BOOK 1 BOOK 1 We studied another problem based on the principle of in-writer variation. That is no one person writes exactly the same letter formations. A person’s writing style may change due to use of different pens, physical conditions like noise and poor light or age. For example here is given two pages from a single manuscript. If we have a close look we can see that words in the left page well formed and bigger than the words in the right page.

Contribution Automatic system Divide manuscript into parts with similar writing styles Siamese Convolutional Neural Network (CNN) Input: Two document patches Output: Same or different So our work presents an automatic system for dividing a manuscript into parts, according to their similarity in handwriting style. This system is based on a Siamese convolutional network which inputs two document patches and outputs whether they contain similar writing style or not.

Related work Chu and Srihari, 2014: Fiel and Sablatnig, 2015: Single branch CNN Input: Concatenated two word images Output: Same or different Children’s writing style change by time Fiel and Sablatnig, 2015: Input: Single word images Output: Classes in the number of writers Writer identification and retrieval Previously Chi and Srihari used CNN for a similar problem. They train a single branch CNN using concatenated word images with labels of same or different. Trained network is used to determine the change in the writing style of children by time. Fiel and Sablatnig they also train a single branch CNN using single words with classes in the number of writers. Trained network trimmed from the classification layer is used to extract features for writer identification and retrieval.

Method L1 CNN Siamese CNN W Siamese CNN architecture is based on Koch et al, 2015 Siamese CNN CNN W L1 F(x1) F(x2) Same or Different Single neuron Patches x1 x2 Our method is based on Siamese CNN taken from the work of Koch et al. for one-shot character recognition. It consists of two identical CNN branches that share the same weights vector W. It inputs two document image patches, computes their feature vectors, measures the L1 distance between them and decides whether these patches contain similar writing style or not.

Datasets WAHD Dataset (Abdelhaleem et al, 2017) 3 manuscripts 139, 239 and 392 pages VML Dataset (Kassis et al, 2017) 1 manuscripts 140 pages We worked on two publicly available datasets, WAHD and VML We selected 3 manuscripts of pages 139, 239 and 392 from WAHD dataset. And one manuscript of pages 140 from VML dataset. Each manuscript belongs to one author and consistent in terms of font size and type.

Experiments Train set sizes: 8266, 4270, 2560 pairs Same (1) In experiments we changed some hyperparameters to improve the performance. We experimented with training set sizes of around 8, 4 and 2 k pairs. A pair of patches from the same page is labeled as same, and a pair of patches from different books is labeled as different. This ensures maximum similarity for same pairs and maximum dissimilarity for different patches. Different (0)

Experiments Patch sizes 𝟏𝟓𝟎 ×𝟏𝟓𝟎 𝟏𝟐𝟎 ×𝟏𝟐𝟎 𝟕𝟓 ×𝟕𝟓 𝟒𝟎 ×𝟒𝟎 We generated 4 different sizes of patches But each time we reduce the spatial resolution so that the patch includes around 3 lines of script.

Experiments 3 lines 4 lines Number of script lines 𝟏𝟓𝟎 ×𝟏𝟓𝟎 𝟏𝟓𝟎 ×𝟏𝟓𝟎 Getting the best result with the patch size of 150 pixels we generated another train set with patches of 150 pixels but with 4 lines of script. 3 lines 4 lines

Results Performance with different train set sizes

Results Performance with different patch sizes

Results Performance with different number of lines

Results Summary of results Train size Number of lines Patch size Test accuracy 8266 3 150x150 98.15 120x120 98.58 75x75 94.75 40x40 94.40 4 98.37 4270 96.40 2568 95.13

Conclusion An automatic system for dividing a book into similar parts by writing style Input: 2 paragraphs from a book Output: Similarity of input paragraphs Future work: Modern handwritten datasets