Optical Character Recognition

Slides:



Advertisements
Similar presentations
Don’t Type it! OCR it! How to use an online OCR..
Advertisements

Using Journal and Other Tablet PC Tools. Tools Bars in Journal To access all tool bars click on view and select each tool bar to activate each.
Review of AI from Chapter 3. Journal May 13  What advantages and disadvantages do you see with using Expert Systems in real world applications like business,
 A word processor is a computer that can be used for writing, editing and printing text.  We are going to use a word processing package called Microsoft.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Google Android as a mobile development platform T Internet Technologies for Mobile Computing Olli Mäkinen.
ANDROID PROGRAMMING MODULE 1 – GETTING STARTED
Advanced OCR with OmniPage and FineReader. Overview Optical character recognition Optical character recognition Structural recognition Structural recognition.
Android Development (Basics)
INTERNATIONAL SUMMER ACADEMIC COURSE UNIVESITY OF NIS ISAC – Android programming.
© Strategic Mapping & Data Services LLC – Dr. Michael Stachiw – February 2012© Strategic Mapping & Data Services LLC Android Programming Dr. Michael A.
Apps VS Mobile Websites Which is better?. Bizness Apps Survey Bizness Apps surveyed over 500 small business owners with both a mobile app and a mobile.
Programming Your Android App Gourav Khadge
Unit 30 P1 – Hardware & Software Required For Use In Digital Graphics
Presented by: Kushal Mehta University of Central Florida Michael Spreitzenbarth, Felix Freiling Friedrich-Alexander- University Erlangen, Germany michael.spreitzenbart,
Take a leap towards the most promising technology
Speech Recognition ECE5526 Wilson Burgos. Outline Introduction Objective Existing Solutions Implementation Test and Result Conclusion.
--Caesar Cat.  Write an optical character recognition application that identifies and recognizes printed text within an image.
--Caesar Cai TEXT RECOGNITION SENIOR CAPSTONE 2012.
Data Representation and Storage Lecture 5. Representations A number value can be represented in many ways: 5 Five V IIIII Cinq Hold up my hand.
The string data type String. String (in general) A string is a sequence of characters enclosed between the double quotes "..." Example: Each character.
By: Hadley Scholtz Supervisor: Mehrdad Ghaziasgar Co - supervisor: James Connan Mentor: Ibraheem Frieslaar.
Open Your Mind to Open Source MPDO’s & EOPR’s Centre for IT & eGovernance AMR-APARD Hyderabad Welcome!
Chapter 14 Working with Graphics. 2Practical PC 5 th Edition Chapter 14 Getting Started In this Chapter, you will learn: − About different graphics you.
Phone Reader Project Presenter: Marilyn Bihina Supervisor: James Connan.
My computer should look like this to display your scanner & digital camera drivers.
Spring 2007 COMP TUI 1 Computer Vision for Tangible User Interfaces.
Understanding Character Encodings Basics of Character Encodings that all Programmers should Know. Pritam Barhate, Cofounder and CTO Mobisoft Infotech.
CHAPTER 1 Introduction. Chapter objectives: Understand what Android is Learn the differences between Java and Android Java Examine the Android project.
TODAY Android Studio Installation Getting started Creating your 1 st App Beginning to understanding Intents.
Cosc 5/4735 YouTube API. YouTube The YouTube Android Player API enables you to incorporate video playback functionality into your Android applications.
How to Recover Deleted Photos from Android Cell Phone? Android is keeping on improving their products and make sure to provide the best software service.
 Handwritten character recognition is a frontier area for research for the past few decades  OCR-process of translation of images of handwritten shorthand.
Submitted by: DRPU Software Team Site:
CHAPTER 1 part 1 Introduction. Chapter objectives: Understand Android Learn the differences between Java and Android Java Examine the Android project.
The Basics of Android App Development Sankarshan Mridha Satadal Sengupta.
Automatic License Plate Recognition for Electronic Payment system Chiu Wing Cheung d.
How to Enable Account Key Sign Instead Of Password In Yahoo? For more details:
Chapter 14 Working with Graphics
DATA COLLECTION Data Collection Data Verification and Validation.
Android 01: Fundamentals
ultrasound digital pen
S.Rajeswari Head , Scientific Information Resource Division
Android Studio, Android System Basics and Git
Graduation Project Seminar wesome Scanner
Development-Introduction
What this activity will show you
Getting started with your Smartboard!
Software Engineering in Mobile Computing
Google translate app demo
Sinhala Language Support for Java Micro Edition
Sensors, maps and fragments:
How to Copyright free images download
Android SDK & App Development
Transact™ Mobile SDK Quickly bring capture-enabled mobile applications to market with open-ended backend integrations.
Does Toast Trouble you in this Digital Age?
Software and Multimedia
Software and Multimedia
Chapter 4 Application Software
Funny Face Application
Overview What is Multimedia? Characteristics of multimedia
CHAPTER 1 Introduction Chapter objectives: Understand what Android is
Android Developer Fundamentals V2
Timelapse with the Veho VMS-001 Microscope
Chapter 1: Digital Communication Tools
Android Platform, Android App Basic Components
Mobile Programming Dr. Mohsin Ali Memon.
PRODUCTION PHASES CHANGES
2016 CSUN Accessibility Competition
New type of devices for identification of users of “Raiffeisen ONLINE” – Hardware and Software Tokens.
Presentation transcript:

Optical Character Recognition Mobile App. Development Project Aakanksha Gupta

Optical Character Recognition A Technology that enables one to extract text out of printed documents, captured images Targets typewritten text, one character at a time An "offline" process, which analyses a static document Pre-Processes images for successful recognition Android currently doesn’t come pre bundled with libraries for OCR, hence need external libraries Current Apps on Google Play Store: Google Keep Text Fairy CamScanner Application in Real Life Data entry for business documents, e.g. check, passport Extracting business card information into a contact list Converting handwriting in real time to control a compute

Text Recognition Functionality: An app developed to allow user to capture an image and gives user an option to see the text what is available on the image. Images are not stored but gives an option to store in external media. Requirement: Android 5 and above Features: Camera Feature, Convert English Words to Text Issues: Sometimes gives garbage values and cannot recognise text Camera gives blur images sometimes Horizontal and Vertical camera orientation Autofocusing

Demo of Text Recognition

Implementation Preparing Tesseract Tesseract has unicode (UTF-8) support, and can recognize more than 100 languages It provides API for converting Image to Text. Used English Language for the application Install Native Development Kit (NDK) is a set of tools that allows you to use C and C++ code to run in Java and Android Methodology Adding tess-two library to dependency Creating a class to manage Tesseract calls Initialize the object and call methods on the object

Android Side Using OpenCV 3_1_0 Still have to take a photo from the camera, Loading data from image files Gradle

Initialize the TessBAseAPI with the path to traineddata file and proper page segmentation Pass the image as bitmap to tessBaseAPI variable Call getUTF8Text method, and this will return string value

CameraBridgeViewBase Implemented to control when camera can be enabled, process the frame, calls external listener to make adjustment to frame Bitmap Graphic image file used to store digital images, 32 bit color FloatingActionButton Circled icon floating above the UI

Enable the Tesseract Toast notifications, and all the other notifications

Layout and Resource

AndroidManifest.xml Uses External Memory Change the icon of the application Change the name of application Uses the UTF-8 character encoding which is capable of encoding all possible characters defined by Unicode

Challenges Improvement Choosing the dataset was the biggest challenge Tried with Matalab but results were not fine Working with Tesseract Installing NDK and making it work Improvement Tesseract dataset is not great, sometimes give garbage values instead of detection Camera orientation creates issue Image when stored for processing appears to be blurred Not 100% accurate

Learnings Learned a lot about OpenCV & Tesseract Why dataset are important Android layout can be fun Not always does internet helps And yes, Finally I made an App of my own!! Phew.

Questions??