Dr. István Marosi Recosoft Ltd., Hungary

Slides:



Advertisements
Similar presentations
Don’t Type it! OCR it! How to use an online OCR..
Advertisements

Media Center Essential Question How can I be an effective user of information?
QUICK TIPS (--THIS SECTION DOES NOT PRINT--) This PowerPoint template requires basic PowerPoint (version 2007 or newer) skills. Below is a list of commonly.
Standard Grade Notes General Purpose Packages. These are Software packages which allow the user to solve a range of problems.
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Segmentation of Touching Characters in Devnagari & Bangla Scripts Using Fuzzy MultiFactorial Analysis Presented By: Sanjeev Maharjan St. Xavier’s College.
Premier Director Document Imaging
Dori Baldwin Student Info Mgmt Coordinator Kent ISD.
Flowcharts Using Visio. Definitions  An Algorithm is just a detailed sequence of simple steps that are needed to solve a problem.  A Flowchart is a.
Real-Time Camera-Based Character Recognition Free from Layout Constraints M. Iwamura, T. Tsuji, A. Horimatsu, and K. Kise.
Extraction of text data and hyperlink structure from scanned images of mathematical journals Ann Arbor, March 19, 2002 Masakazu Suzuki (Kyushu University)
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
Highlights Lecture on the image part (10) Automatic Perception 16
Artificial Neural Networks (ANNs)
QUICK DESIGN GUIDE (--THIS SECTION DOES NOT PRINT--) This PowerPoint 2007 template produces a 36x48 inch professional poster. You can use it to create.
Processing PDF: How to Go from PDF to E-text to Audio Gaeir Dietrich Director High Tech Center Training Unit of the California Community Colleges Foothill.
Database Design IST 7-10 Presented by Miss Egan and Miss Richards.
With Alex Conger – President of Webmajik.com FrontPage 2002 Level I (Intro & Training) FrontPage 2002 Level I (Intro & Training)
Office 2010 Word Ribbons Slides 1 and 2 are a look at the 7 basic ribbons in Word Slides 3 – 9 give descriptions of some of the functions available.
QUICK DESIGN GUIDE (--THIS SECTION DOES NOT PRINT--) This PowerPoint 2007 template produces a 36”x60” professional poster. It will save you valuable time.
Traffic Sign Identification Team G Project 15. Team members Lajos Rodek-Szeged, Hungary Marcin Rogucki-Lodz, Poland Mircea Nanu -Timisoara, Romania Selman.
XP New Perspectives on Microsoft Word 2002 Tutorial 41 Microsoft Word 2002 Tutorial 4 – Desktop Publishing a Newsletter.
Track, Trace & Control Solutions © 2010 Microscan Systems, Inc. Machine Vision Tools for Solving Auto ID Applications Part 3 of a 3-part webinar series:
TH-OCR NK. content introduction go to next page background assumptions overall structure chart IPO for overall structure dataflow diagram of overall structure.
Aniko T. Valko, Keymodule Ltd.
© Cheltenham Computer Training 2002 Microsoft Publisher 2002 – Slide No 1 Microsoft Publisher 2002 Intermediate Level Course.
Microsoft Office 2003 Illustrated Introductory a Presentation Creating.
: Chapter 10: Image Recognition 1 Montri Karnjanadecha ac.th/~montri Image Processing.
Word Processing Standard Grade Computing LA/LM. Word processor a computer program that allows you to manipulate text What is?
CTS130 Spreadsheet Lesson 3 Using Editing and Formatting Tools.
RESEARCH POSTER PRESENTATION DESIGN © QUICK TIPS (--THIS SECTION DOES NOT PRINT--) This PowerPoint template requires basic.
Standard Grade Computing General Purpose Packages WORD-PROCESSING WORD-PROCESSING Chapter 2.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Problem description and pipeline
CS 6825: Binary Image Processing – binary blob metrics
QUICK DESIGN GUIDE (--THIS SECTION DOES NOT PRINT--) This PowerPoint 2007 template produces a 36x48 inch professional poster. You can use it to create.
0 Paper rocess Scanner Throughput P eople PP P Effective Scanner Throughput Consider KOFAX – VRS (Virtual Re-Scan) Increase Productivity.
QUICK TIPS (--THIS SECTION DOES NOT PRINT--) This PowerPoint template requires basic PowerPoint (version 2007 or newer) skills. Below is a list of commonly.
Ascending  Sort order arranging text or numbers from A to Z, from smallest to largest, or from earliest to latest.
Presented By: Abirami Poonkundran Authors: Jeff Yan, Ahmad El Ahmad.
RESEARCH POSTER PRESENTATION DESIGN © entations.com QUICK DESIGN GUIDE (--THIS SECTION DOES NOT PRINT--) This PowerPoint 2007 template.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
Dr. István Marosi Scansoft-Recognita, Inc., Hungary SSIP 2005, Szeged Character Recognition Internals.
Scanned Documents INST 734 Module 10 Doug Oard. Agenda Document image retrieval  Representation Retrieval Thanks for David Doermann for most of these.
Read through the following slides and fill out your study guide.
Additional Features in Microsoft Word Session Version 1.0 © 2011 Aptech Limited.
CMA Coastline Matching Algorithm SSIP’99 - Project 10 Team H.
WP3: Image Segmentation - OCR Stavros Perantonis, Vassilis Maragos Edinburgh, March 6-7, 2003 Institute of Informatics & Telecommunications NCSR “Demokritos”
Optical Character Recognition
OCR Reading.
Tutorial 4 – Desktop Publishing a Newsletter
Optical Character Recognition
S.Rajeswari Head , Scientific Information Resource Division
For Letters, Labels or s Mail Merge For Letters, Labels or s.
IMAGE PROCESSING RECOGNITION AND CLASSIFICATION
Word Processing.
Electronic Document Management Software
Microsoft Access 2003 Illustrated Complete
Microsoft Word 2003 Illustrated Complete
OPERATE A WORD PROCESSING APPLICATION (BASIC)
Conversion accuracy Users often need to convert PDF files into more familiar and capable editing tools such as Microsoft Word, Excel and PowerPoint PDF.
Aniko T. Valko, Keymodule Ltd.
Data Capture Process Stages
TC 310 The Computer in Technical Communication
TC 310 The Computer in Technical Communication
Ann Arbor, March 19, 2002 Masakazu Suzuki (Kyushu University)
Pattern Recognition and Training
Pattern Recognition and Training
Creating Accessible PDF’s
Presentation transcript:

Dr. István Marosi Recosoft Ltd., Hungary SSIP 2002, Budapest Internal Structure of the Character Recognition Engine used inside Omnipage Pro Dr. István Marosi Recosoft Ltd., Hungary

Some „Marketing talk” Main tasks of an OCR system: Image acquisition Layout recognition Text recognition User assisted correction Result exportation 1/13/2019 Recosoft Ltd

Some „Marketing talk” Main tasks of an OCR system: Image acquisition Get image B/W Scanning Gray Scanning Color Scanning Load from image file Preprocess image Layout recognition Text recognition User assisted correction Result exportation 1/13/2019 Recosoft Ltd

Some „Marketing talk” Main tasks of an OCR system: Image acquisition Get image Preprocess image Color separation Thresholding Despeckling Rotation Deskewing Layout recognition Text recognition User assisted correction Result exportation 1/13/2019 Recosoft Ltd

The Preprocessed Image Joined chars 1/13/2019 Recosoft Ltd

The Preprocessed Image Joined chars 1/13/2019 Recosoft Ltd

The Preprocessed Image Broken chars 1/13/2019 Recosoft Ltd

The Preprocessed Image Broken chars 1/13/2019 Recosoft Ltd

Some „Marketing talk” Main tasks of an OCR system: Layout recognition Image acquisition Layout recognition Text zones Columns of flowed text Tables Inverse text Graphic zones Text recognition User assisted correction Result exportation 1/13/2019 Recosoft Ltd

Some „Marketing talk” Main tasks of an OCR system: Layout recognition Image acquisition Layout recognition Text zones Graphic zones Line Art Photo Text recognition User assisted correction Result exportation 1/13/2019 Recosoft Ltd

Some „Marketing talk” Main tasks of an OCR system: Text recognition Image acquisition Layout recognition Text recognition ... Let’s do it when the marketing staff is over... User assisted correction Result exportation 1/13/2019 Recosoft Ltd

Some „Marketing talk” Main tasks of an OCR system: Image acquisition Layout recognition Text recognition User assisted correction By the user’s random editing... Pop-up verifier Manual Training By proofreading of doubtful words Result exportation 1/13/2019 Recosoft Ltd

Some „Marketing talk” Main tasks of an OCR system: Image acquisition Layout recognition Text recognition User assisted correction By the user’s random editing... By proofreading of doubtful words Correct: User dictionary Changed: IntelliTrain Remember trained characters Apply them on following pages Result exportation 1/13/2019 Recosoft Ltd

IntelliTrain Recognized word: sorneUüng 1/13/2019 Recosoft Ltd

IntelliTrain Recognized word: sorneUüng Fixed word: something 1/13/2019 Recosoft Ltd

IntelliTrain Recognized word: sorneUüng Fixed word: something 1/13/2019 Recosoft Ltd

IntelliTrain Recognized word: sorneUüng Fixed word: something Substitutions found: m  rn thi  Uü 1/13/2019 Recosoft Ltd

IntelliTrain Recognized word: sorneUüng Fixed word: something Substitutions found: m  rn thi  Uü Perform automatically: Learn image pattern and substitution info Find similar substituted (‘blue’) text on actual page Match against pattern of substitution and correct Find such errors on following pages, too 1/13/2019 Recosoft Ltd

Some „Marketing talk” Main tasks of an OCR system: Result exportation Image acquisition Layout recognition Text recognition User assisted correction Result exportation Combine pages into a Document Header / Footer recognition Page numbers Hyperlinks (e.g. „See Table 20”) Save results 1/13/2019 Recosoft Ltd

Some „Marketing talk” Main tasks of an OCR system: Result exportation Image acquisition Layout recognition Text recognition User assisted correction Result exportation Combine pages into a Document Save results doc file e-mail Speech synthesizer 1/13/2019 Recosoft Ltd

OP11 Internals Text recognition in ScanSoft’s OP11 OCR Engines available: Caere’s engine (codename: Salt & Pepper) Recognita’s engine (codename: Paprika) 1/13/2019 Recosoft Ltd

OP11 Internals Text recognition in ScanSoft’s OP11 OCR Engines available: Caere’s engine (Salt & Pepper) Uses a Matrix Matching based algorithm feature set: 40 cells of an 8x5 grid good overall description of a shape weaker at detailed structure Recognita’s engine (Paprika) Uses a Contour Tracing based algorithm feture set: convex and concave arcs on the contour good detailed description of a shape weaker at overall structure 1/13/2019 Recosoft Ltd

OP11 Internals Text recognition in ScanSoft’s OP11 OCR Engines available: Caere’s engine (Salt & Pepper) Recognita’s engine (Paprika) Segmentation algorithms: 1/13/2019 Recosoft Ltd

Segmentation What are those pixel groups belonging to a single letter?

Segmentation What are those pixel groups belonging to a single letter?

Segmentation What are those pixel groups belonging to a single letter?

Segmentation What are those pixel groups belonging to a single letter?

Segmentation What are those pixel groups belonging to a single letter?

Segmentation What are those pixel groups belonging to a single letter?

OP11 Internals Text recognition in ScanSoft’s OP11 OCR Engines available: Caere’s engine (Salt & Pepper) Recognita’s engine (Paprika) Segmentation algorithms: Developed by independent groups Have different strengths and weaknesses 1/13/2019 Recosoft Ltd

OP11 Internals Text recognition in ScanSoft’s OP11 Conclusion: OCR Engines available: Caere’s engine (Salt & Pepper) Recognita’s engine (Paprika) Segmentation algorithms Conclusion: They are complementary Let’s create a voting system 1/13/2019 Recosoft Ltd

OP11 Internals Voting strategies External „Black box” voting Image Paprika Salt & Pepper Txt 2 Txt 1 Vote? Final Txt 1/13/2019 Recosoft Ltd

OP11 Internals Voting strategies External „Black box” voting Image Paprika Salt & Pepper Txt 2 Txt 1 Dict Vote Final Txt 1/13/2019 Recosoft Ltd

OP11 Internals Voting strategies External „Black box” voting ~15% gain Image Voting strategies External „Black box” voting ~15% gain Paprika Salt & Pepper Txt 2 Txt 1 Dict Vote Final Txt 1/13/2019 Recosoft Ltd

OP11 Internals Voting strategies Internal „Shape” voting Image External „Black box” voting Internal „Shape” voting Salt & Pepper Paprika Txt 1 Txt 2 Dict Bronze Final Txt 1/13/2019 Recosoft Ltd

Recognize original segmentation Image OP11 Internals Recognize original segmentation Paprika Original segmentation: Every independent connected component is a character Good segmentation: recognize Bad segmentation: reject K.B. 1/13/2019 Recosoft Ltd

OP11 Internals Paprika Image Recognize original segmentation K.B. Train adaptive classifier from original shapes Txt 1 Adaptive K.B. 1/13/2019 Recosoft Ltd

OP11 Internals Paprika Image Recognize original segmentation K.B. Try several segmentations Loop if unrecognizable K.B. Train adaptive classifier from original shapes Txt 1 Adaptive K.B. Recognize broken and joined shapes 1/13/2019 Recosoft Ltd

OP11 Internals Paprika Image Recognize original segmentation K.B. Train adaptive classifier from original shapes Txt 1 Adaptive K.B. Recognize broken and joined shapes Train adaptive classifier from ‘ugly’ shapes 1/13/2019 Recosoft Ltd

OP11 Internals Paprika Image Recognize original segmentation K.B. Train adaptive classifier from original shapes Txt 1 Adaptive K.B. Recognize broken and joined shapes Train adaptive classifier from ‘ugly’ shapes Recognize more broken and joined shapes Try several segmentations Loop if unrecognizable Txt 2 1/13/2019 Recosoft Ltd

OP11 Internals Voting strategies ~45% gain Image Salt & Pepper Txt 1 Paprika Txt 1 Txt 2 Dict Bronze Final Txt 1/13/2019 Recosoft Ltd

OP12 Voting strategies +20% gain Image Fire- Salt & Pepper worx Txt 1A Paprika Txt 1A Txt 1B Txt 2 Dict Bronze Final Txt 1/13/2019 Recosoft Ltd