WP3: Image Segmentation - OCR Stavros Perantonis, Vassilis Maragos Edinburgh, March 6-7, 2003 Institute of Informatics & Telecommunications NCSR “Demokritos”

Slides:



Advertisements
Similar presentations
Microfilm CAR, Film and Paper Scanners OCR Image Indexing VersaVIEW VersaIMAGE GOLD Twain Paper Scanners Automated Microfilm Readers TWAIN & ISIS Microfilm.
Advertisements

Don’t Type it! OCR it! How to use an online OCR..
Word Processing and Desktop Publishing Software
Process Monitoring is only the first step in improving process efficiency.
Introduction to Microsoft Office 2007 with focus on MS Word
Lesson 15 Presentation Programs.
Sharpdesk Overview Desktop Composer Search Imaging      
RESEARCH POSTER PRESENTATION DESIGN © (—THIS SIDEBAR DOES NOT PRINT—) DESIGN GUIDE This PowerPoint 2007 template produces.
THE PROFESSIONAL APPROACH SERIES © 2008 The McGraw-Hill Companies, Inc. All rights reserved. 1 Lesson Objectives Lesson 5 objectives Use a template to.
CAPTURE SOFTWARE Please take a few moments to review the following slides. Please take a few moments to review the following slides. The filing of documents.
INSERT BOOK COVER 1Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall. Exploring Microsoft Access 2010 by Robert Grauer, Keith Mast,
Mid-Peninsula IBM PC Club Meeting November 21, 2005 SnagIt Screen Capture OmniPage Pro 14 OCR IconSaver Utility Jan Laskowski
Integrated Imaging and Document Management System Product Demonstration.
TRACK 2™ Version 5 The ultimate process management software.
Creating Accessible Presentations Training Guide.
This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation. All.
ExpressReader Pro adopted to retrodigitization of mathematical documents Kazuaki Yokota.
1 of 4 Note When you first use desktop faxing, you will be prompted to sign up for a fax service. Click OK to open your Web browser, and then follow the.
TRACK 3™ The ultimate process management software.
IRISDocument Server IRISPowerscan IRISCapture Pro/X4D Alone or together to meet your needs Alejandro Grüssi VAR / OEM Account Manager.
AN OVERVIEW OF MAC PDF TOOLS 1. PDF Tools for Mac PDF files can be used either in Windows, Unix or Apple’s Mac OS operating system commonly. It still.
Premier Accessibility Suite Software for Reading and Writing.
Advanced Workgroup System. RED Advanced Workgroup Systems: Scan Features Copy Print Scan DNSG Software Our Customers Documents Our Customers Documents.
Advanced OCR with OmniPage and FineReader. Overview Optical character recognition Optical character recognition Structural recognition Structural recognition.
With Alex Conger – President of Webmajik.com FrontPage 2002 Level I (Intro & Training) FrontPage 2002 Level I (Intro & Training)
PowerPoint Lesson 4 Expanding on PowerPoint Basics
Word Processing basics
Word Processing ADE100- Computer Literacy Lecture 12.
Millionaire Kristin Strickland & Johnny Fuller. What does the wavy, red line under a word in a presentation mean? a. Misspelling b. Grammar Error c. Synonym.
Choose a category. You will be given the answer. You must give the correct question. Click to begin.
The most powerful high-speed scanning, indexing and OCR solution on the market Supports many high speed scanners: Fujitsu, Canon, Kodak, Epson, Avision,
Confidential, I.R.I.S. © 2005, All rights reserved Discover… The most robust solution to structure, index, compress and convert all your documents into.
Basic Computer and Word Functions, part 1 Read the information and use to answer the questions in the Basic Computer and Word Functions Study Guide.
An-Najah National University Faculty Of Engineering Computer Engineering Department Abed Al-hadi kulib.
Word Processing Definitions Indent to move text horizontally away from the left or right margin, setting it apart from surrounding text.
Confidential, I.R.I.S. © 2005, All rights reserved I.R.I.S. new OCR Software suite: A full range for document conversion, for private and corporate users.
0 Paper rocess Scanner Throughput P eople PP P Effective Scanner Throughput Consider KOFAX – VRS (Virtual Re-Scan) Increase Productivity.
1 EndNote X2 Your Bibliographic Management Tool 29 September 2009 Humanities and Social Sciences Resource Teams.
1 Lesson 13 Organizing and Enhancing Worksheets Computer Literacy BASICS: A Comprehensive Guide to IC 3, 3 rd Edition Morrison / Wells.
Spreadsheets 101 What is Excel?. Objectives 1. Identify the parts of the Excel Screen 2. Identify the functions of a spreadsheet 3. Identify how spreadsheets.
Introduction to KE EMu
Microsoft ® Outlook 2000 Integrating Outlook with Office Applications.
Chapter 28. Copyright 2003, Paradigm Publishing Inc. CHAPTER 28 BACKNEXTEND 28-2 LINKS TO OBJECTIVES Table Calculations Table Properties Fields in a Table.
Active Matrix LCD Panel ProMax® Interactive LCD Writing Panel Active Matrix LCD Panel ProMax® Interactive LCD Writing Panel.
MS Word Full Tiger Menu Labels & Text Boxes Advanced Formatter Settings Tiger Designer Variable Dot Heights Graphic Tools Fill Patterns Braille Labeling.
XP New Perspectives on Creating Web Pages With Word Tutorial 1 1 Creating Web Pages With Word Tutorial 1.
VNew PDF Converter “A PDF Converter to convert PDF files to images Manually and Automatically“
QUICK START (cont.) Change template color theme DESIGN menu, click on COLORS, and choose the color theme of your choice. Create your own color theme. Use.
1 Word Processing Intermediate Using Microsoft Office 2000.
Submitted by: DRPU Software Team Site:
(—THIS SIDEBAR DOES NOT PRINT—) STARS Design Poster Guide This PowerPoint template produces a 24”x36”, i.e., 2 by 3 foot, presentation poster. Please visit.
NCSR “Demokritos” Institute of Informatics & Telecommunications CROSSMARC CROSS-lingual Multi Agent Retail Comparison WP3 Multilingual and Multimedia Fact.
Understand Charts and SmartArt Graphics
With Microsoft FrontPage 2000
Getting started with your Smartboard!
You will be given the answer. You must give the correct question.
Conversion accuracy Users often need to convert PDF files into more familiar and capable editing tools such as Microsoft Word, Excel and PowerPoint PDF.
Embedding Graphics in Web Pages
Creating Accessible Electronic Documents
Word Processing and Desktop Publishing Software
POSTER MAKING.
My Program Session Title
Exploring Microsoft® Office 2016 Series Editor Mary Anne Poatsy
Graphic Design Layout Options
Microsoft Word.
Graphic Design Layout Options
How To Repair PDF File After Disk Crash???. What is PDF file..??? PDF file is portable document file. This file format is used during the exchange .
Quick and Dirty: the art of OCR
Presentation transcript:

WP3: Image Segmentation - OCR Stavros Perantonis, Vassilis Maragos Edinburgh, March 6-7, 2003 Institute of Informatics & Telecommunications NCSR “Demokritos”

© NCSR, Edinburgh, March 6-7, 2003 Banner Recognition

© NCSR, Edinburgh, March 6-7, 2003 Banner characteristics -Low resolution -Graphics, noiseless -Anti-aliasing -Color contrast visible by the human eye -Text body of restricted thickness

© NCSR, Edinburgh, March 6-7, 2003 OCR Input Original image: B/W OCR input: B/W OCR input after Text Area Enhancement Pre-processing Tool:

© NCSR, Edinburgh, March 6-7, 2003 Text Area Enhancement Pre-processing Tool

© NCSR, Edinburgh, March 6-7, 2003 Text Area Enhancement Pre-processing Tool

© NCSR, Edinburgh, March 6-7, 2003 FineReader5ReadIris7 Text Area Enhancement + FineReader5 Text Area Enhancement + ReadIris7 x1W",hmo".,. ~..~.. u.:.-.é~W."hm.- x2 Watch movies. ~'~..'"...,., Watch movies... x4 Watch movies.. é... x8 Watch movies.. é...

© NCSR, Edinburgh, March 6-7, 2003 FineReader5ReadIris7 Text Area Enhancement + FineReader5 Text Area Enhancement + ReadIris7 x1.,....",..II'I;a Novità e offerte x2~(i~ Nevità e Gfferte x4~(3~ Novità € offerte Nevità e efferte x8-- Novità € offerte Nevità e efferte

© NCSR, Edinburgh, March 6-7, 2003 FineReader5ReadIris7 Text Area Enhancement + FineReader5 Text Area Enhancement + ReadIris7 x114" 64M' P.Cil33 "1411. SDRAM i x264M:14» 64M' RB113S "1488. SDR4M' ;,;, i OCLIC. "E"E x4 64M: SH3I 14" 'A CLICK MERE e4M. R&138 4~1411 U. IORolM. I ~ I:LII:K HERE x8 Jd CLICK MERE RAM' PC133 D4IVI* sdram CLICK MERE. RGJ133 tt~. SOR.IM Il I )111 E:LIE:K HEFi!E Result3

© NCSR, Edinburgh, March 6-7, 2003 Next Steps  Automatic Evaluation of the Text Area Enhancement Pre- processing Tool (create Ground Truth Annotations – record improved Recognition Rates for letters/words).  Parameters fine tuning for the Text Area Enhancement Pre- processing Tool (resolution, number of iterations)  Select the appropriate OCR engine.  Train the OCR engine for better results.  Add CROSSMARC lexicons - Post-processing technique to increase recognition accuracy  Integration with NERC, Delivery of an Ellogon-based application

 Unsurpassed Accuracy. Thanks to its use of IPA Technology, FineReader has an unprecedented recognition accuracy. FineReader has come out on top in comparative tests.  Impeccable Layout Retention. New recognition procedures retain the look and feel of your printed documents, be it wrap-around text, vertical text, columns, tables, non-rectangular pictures or varying fonts. Wide range of document saving formats is supported.  PDF Input and Output. Recognize, edit and save documents in PDF format. Dozens of multilanguage fonts included!  Full HTML Support.  FineReader is a Pleasure to Use.  Batch Document Support provides you with the tools you need to work with multipage documents.  The Spelling-check system  Multilingual Document Recognition. FineReader is the leading multi-national OCR software. It recognizes texts in 122 languages  Quick Export to Microsoft Word, Excel and Outlook. FineReader 6.0: Key Features

 Unmatched Combination of Accuracy and Speed. Less editing, increase in performance.  PDF Input. Open PDF documents (even read-only!), and convert them into editable files you can send directly to your favorite application.  Page Orientation and Image Deskew. Automatically detects the document orientation and the text skew.  Powerful Adjust Image Option. Restore degraded documents with manual or automatic image adjustments and despeckling options.  Color Document Recognition. Recognizes color documents and text on colored backgrounds. Retains any pictures in color on the output file.  Foreign Language Support. Recognizes up to 104 different languages:  New User Interface. The new user-friendly interface includes a redesigned thumbnail bar and guides you intuitively through the different recognition steps.  Flowing Text Mode. Thanks to the powerful Autoformat™ technology, pictures, graphics and tables are positioned correctly and the text nicely flows accross columns or pages.  New "Send To" Mode. The new “Send To” mode automatically sends the output result to the selected application such as Microsoft® Word, Microsoft® Excel, etc.  Multipage documents/batch OCR ReadIris 8: Key Features