PROJECT PROPOSAL DIGITAL IMAGE PROCESSING TITLE:- Automatic Machine Written Document Reader Project Partners:- Manohar Kuse(Y08UC073) Sunil Prasad Jaiswal(Y08UC124)

Slides:



Advertisements
Similar presentations
Patient information extraction in digitized X-ray imagery Hsien-Huang P. Wu Department of Electrical Engineering, National Yunlin University of Science.
Advertisements

5th Intensive Course on Soil Micromorphology Naples th - 14th September Image Analysis Lecture 3 Image Processing/Analysis Basic Requirements.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Michele Merler Jacquilene Jacob.  Applications online are inherently insecure  Growing rate of hackers  Confidentiality of online systems should be.
Face Recognition and Biometric Systems Eigenfaces (2)
Automatic classification of weld cracks using artificial intelligence and statistical methods Ryszard SIKORA, Piotr BANIUKIEWICZ, Marcin CARYK Szczecin.
Prénom Nom Document Analysis: Document Image Processing Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
A Low-cost Attack on a Microsoft CAPTCHA Yan Qiang,
ELPUB 2006 June Bansko Bulgaria1 Automated Building of OAI Compliant Repository from Legacy Collection Kurt Maly Department of Computer.
A Comprehensive Study on Third Order Statistical Features for Image Splicing Detection Xudong Zhao, Shilin Wang, Shenghong Li and Jianhua Li Shanghai Jiao.
Document Analysis Systems Bedola Roberto, Bordoni Davide, Franc Vojtech Supervised by Luca Lombardi Overview: Introduction Our implementation of Wahl et.
LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition Supervised by Prof. LYU, Rung Tsong Michael Prepared by: Wong Chi Hang Tsang.
September 14, 2010Neural Networks Lecture 3: Models of Neurons and Neural Networks 1 Visual Illusions demonstrate how we perceive an “interpreted version”
Toward Automatic Processing and Indexing of Microfilm.
Implementing a reliable neuro-classifier
BRETT GIPSON PRESENTS CHAPTER 5. DESCRIBE INPUT Input devices translate words, sounds, images and actions that people understand into symbols that the.
Highlights Lecture on the image part (10) Automatic Perception 16
Group No. 4 Members- AKASH AGARWAL (Y08UC010) MAYANK INDORIA (Y08UC080)
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Feature extraction Feature extraction involves finding features of the segmented image. Usually performed on a binary image produced from.
Brief overview of ideas In this introductory lecture I will show short explanations of basic image processing methods In next lectures we will go into.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
 A data processing system is a combination of machines and people that for a set of inputs produces a defined set of outputs. The inputs and outputs.
Do Now: Take out your notebook and a pen. Good morning! Do Now: Take out your notebook and a pen. Good morning! Aim: What is a Font Family? Mr. Spaterella.
Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.
Classification with Hyperplanes Defines a boundary between various points of data which represent examples plotted in multidimensional space according.
ONLINE HANDWRITTEN GURMUKHI SCRIPT RECOGNITION AND ITS CHALLENGES R. K. SHARMA THAPAR UNIVERSITY, PATIALA.
Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)
Copyright © 2012 Pearson Education, Inc. Chapter 1: Introduction to Computers and Programming 1.
Using a Template to Create a Resume and Sharing a Finished Document
Participation in the NIPS 2003 Challenge Theodor Mader ETH Zurich, Five Datasets were provided for experiments: ARCENE: cancer diagnosis.
An Example of Course Project Face Identification.
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
1 Recognition of Multi-Fonts Character in Early-Modern Printed Books Chisato Ishikawa(1), Naomi Ashida(1)*, Yurie Enomoto(1), Masami Takata(1), Tsukasa.
IIIT Hyderabad Thesis Presentation By Raman Jain ( ) Towards Efficient Methods for Word Image Retrieval.
Machine Learning in Spoken Language Processing Lecture 21 Spoken Language Processing Prof. Andrew Rosenberg.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Helping your reader get the message.  Text  Headings and titles  Graphics  Simple separators (lines and boxes)  White space.
CAPTCHA solving Tianhui Cai Period 3. CAPTCHAs Completely Automated Public Turing tests to tell Computers and Humans Apart Determines whether a user is.
Line detection Assume there is a binary image, we use F(ά,X)=0 as the parametric equation of a curve with a vector of parameters ά=[α 1, …, α m ] and X=[x.
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
Machines that Make Decisions Instructor: Edmondo Trentin
Presented By Lingzhou Lu & Ziliang Jiao. Domain ● Optical Character Recogntion (OCR) ● Upper-case letters only.
Face Image-Based Gender Recognition Using Complex-Valued Neural Network Instructor :Dr. Dong-Chul Kim Indrani Gorripati.
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
Digital Image Processing
Computer Skills Dr. Omar Al Jadaan RAK-MHSU College of Medical Sciences
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Next, this study employed SVM to classify the emotion label for each EEG segment. The basic idea is to project input data onto a higher dimensional feature.
The Big Picture Things to think about What different ways are there to collect information automatically? What are the advantages and disadvantages of.
1 A Statistical Matching Method in Wavelet Domain for Handwritten Character Recognition Presented by Te-Wei Chiang July, 2005.
License Plate Recognition of A Vehicle using MATLAB
Group Number5 Group Members Harsh AroraY08UC059 Mayank AgarwalY08UC079 Vishal BharadwajY08UC141 Automatic Number Plate Recognition.
Optical Character Recognition
Mousavi,Seyed Muhammad – Lyashenko, Vyacheslav
Extracting Old Persian Cuneiform Font Out of
ABSTRACT FACE RECOGNITION RESULTS
Folders out, planners out…
Recognition using Nearest Neighbor (or kNN)
Ying He Wuhan University of Technology Twitter: #AMIA2017
Using Transductive SVMs for Object Classification in Images
RGB-D Image for Scene Recognition by Jiaqi Guo
Aniko T. Valko, Keymodule Ltd.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Using Uneven Margins SVM and Perceptron for IE
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
A Novel Smoke Detection Method Using Support Vector Machine
Automatic Handwriting Generation
SIDE: The Summarization IDE
Presentation transcript:

PROJECT PROPOSAL DIGITAL IMAGE PROCESSING TITLE:- Automatic Machine Written Document Reader Project Partners:- Manohar Kuse(Y08UC073) Sunil Prasad Jaiswal(Y08UC124)

AIM To build a system, which could read aloud the text in a book, by means of recognition of characters from the image of the pages of the book from a simple overhead web-cam. Our Approach  First we will acquire a page of text book by means of overhead web-cam. This will work as our input image.  Now we will convert the text in image to ASCII text.  This process will be carried by means of first separating individual characters from the image. Identifying them with some machine learning approach.  The ASCII test thus formed will be sent to program which can convert this ASCII text to sound.  SEGMENTATION OF CHARACTERS:  We plan to achieve this by First thresholding the acquired image, to obtain a binary mask of individual characters.  Then by means of CCA (connected component analysis) label individual characters and then normalize the size of each connected component.  CHARACTER-FEATURES:  One of the method suggested, was to use statistical zonal features. Which means, that an individual character was divided into a few zones. And statistical features like mean, variance, area, perimeter etc of each zones were the features.  These features are to be later learned with classifiers like – Neural networks, Bayer's classifier or Support Vector Machines (SVM).

 We are planning to divide the normalized character into 16 zones. For each zone we will evaluated mean, variance, area, perimeter. We are also planning to use global features like – compactness, area, perimeter and fraction of filled area.  Block diagram of our approach :- image Acquisition Segmentation of characters &size normalization Character feature extraction Identification Sound producing device

Challenges  CCA is not able to identify spaces. This will create problems in grouping characters as words. We plan to come-up with a new approach for segmentation of characters.  Since we are using zonal features, slight orientation of characters might cause inaccuracy for the classifier.  Requirement of large dataset, covering various font styles, and symbols. We have not yet found such a data-set on the internet and we plan to make one by ourselves, and get our friends for manual annotation.. Course of Progress We plan to have following deadlines:- 28 th Oct Finalization of character features 2 nd Nov Building the data-set with annotations 4 th Nov Preliminary results on the data-set 10 th Nov Testing with real time data 12 th Nov Testing results 15 th Nov Demonstration of the system 16 th Nov Final report