Development of an OCR System

Slides:



Advertisements
Similar presentations
3.1 – Paired Data and The Rectangular Coordinate System
Advertisements

Contents Description of the big picture Theoretical background on this work The Algorithm Examples.
CS223B Assignment 1 Recap. Lots of Solutions! 37 Groups Many different approaches Let’s take a peek at all 37 results on one image from the test set.
Graphs Rectangular Coordinates Use the distance formula. Use the midpoint formula.
K. Zagoris, K. Ergina and N. Papamarkos Image Processing and Multimedia Laboratory Department of Electrical & Computer Engineering Democritus University.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
CS 6825: Binary Image Processing – binary blob metrics
Development of an OCR System Nathan Harmata TJHSST Computer Systems Lab
Development of an OCR System Third Quarter Nathan Harmata Period 5.
April 21 st - Your next hand-in day will be Wednesday, April 30 th - Draw 5 2D Shapes. Using a dotted line, draw a line of symmetry through each shape.
Chapter The rise of a line is the difference in y-values (up (+) /down (-)) The run of a line is the difference in x-values (right (+), left.
2 nd scenario. PURPOSE OF EXERCISE  To find out the existence of Jupiter’s moons (satellites)  To calculate the radius and period of their orbit around.
Slope of a Line Lesson 6.2.
EXPLORING Mountains have slopes…. …and so do lines.
Objective The student will be able to: find the slope of a line given 2 points and a graph.
Objective The student will be able to: Find the slope of a line given 2 points and a graph.
Writing Equations of Lines. Find the equation of a line that passes through (2, -1) and (-4, 5).
Quiz Week 8 Topical. Topical Quiz (Section 2) What is the difference between Computer Vision and Computer Graphics What is the difference between Computer.
THE EQUATION OF A LINE By: Mr. F. A. Ogrimen Jr..
Decimals Day 2 Multiply and Divide Decimals. DO NOT line up decimals Multiply with first number, then continue with next number Place decimal point in.
Lesson 5-1. The ___________ of a line is a number determined by any two points on the line. It is the ratio of the ___________ (vertical change) over.
Course 3 Binary Image Binary Images have only two gray levels: “1” and “0”, i.e., black / white. —— save memory —— fast processing —— many features of.
The Normal Approximation for Data. History The normal curve was discovered by Abraham de Moivre around Around 1870, the Belgian mathematician Adolph.
Optical Character Recognition
Finding Coordinate Plane Distance…
What is the meaning of this sign?
Environmental Remote Sensing GEOG 2021
Objective I can find the slope of a line given 2 points and a graph.
Do Now: If you have progress reports, put them in my basket
4.4 Slope Formula.
Objective The student will be able to:
Objective The student will be able to:
Section 4.4 The Slope of a Line.
Use Absolute Value Functions and Transformations
Objective The student will be able to:
Objective The student will be able to:
What is the meaning of this sign?
Objective The student will be able to:
Objective The student will be able to:
What is the meaning of this sign?
Objective The student will be able to:
Objective The student will be able to:
THE HYPERBOLA.
Section 1.2 Straight Lines.
4n + 2 1st term = 4 × = 6 2nd term = 4 × = 10 3rd term
Objective The student will be able to:
Objective The student will be able to:
EXPLORING SLOPE.
Objective The student will be able to:
Dilations.
Objective The student will be able to:
Unit 1 Basics of Geometry
Objective The student will be able to:
Chapter 2: Transformations
Objective The student will be able to:
7.4 Slope Objectives: To count slope To use slope formula.
Unit 5 Lesson 1 Objective:
Equations and Inequalities in 2 Variables; Functions
Objective The student will be able to:
Objective The student will be able to:
Objective The student will be able to:
Objective The student will be able to:
Objective The student will be able to:
Velocity-Time Graphs for Acceleration
Section 4.5 The Slope of a Line Goal: Find the slope of a line.
Objective The student will be able to:
Geometry Section 3.3 Reitz High School
Systems Practice Use the next slide in this powerpoint as a template to identify 5 different types of systems. 2 of your slides need to be natural systems.
Equations and Inequalities in 2 Variables; Functions
IMAGE DEBLURRING THE END IS NIGH
Presentation transcript:

Development of an OCR System Second Quarter Nathan Harmata Period 5

Recap of 1st Quarter Cache system based on quadrant counts Font dependent, since it is based on cache Completely from scratch Framework for the rest of the year is basically done

Goals of 2nd Quarter Generic letter recognition Transformation of letters Same letter of different font should have similar form Unique forms

SlopeField Idea I proposed at the end of 1st Quarter Transformation of a letter into a collection of line segments of different slopes

SlopeField Steps: - get rid of non black pixels - average horizontal clumps of pixels - starting with the lower left pixel, form a line segment with its adjacent pixel - continue adding more pixels to the line segment if the slope doesn't change too much - stop when a different slope is encountered - repeat with the offending pixel

SlopeField

SectorParsing Deals with the major flaw with SlopeField Parses the image into portions that pass the vertical line test. Each portion is then transformed into a SlopeField.

New Caching System 5 very different fonts Output was analyzed SectorParsing and SlopeField done to each letter of each font Output was analyzed Goal is to use these results to create a new way to compare letters

Cache Program and Results - SlopeField

Cache Program and Results - SectorParsing

SectorVector From the results, the following were deemed important: - number of sectors - approximate number of segments - sign of the slope of the first segment Using the data from testing, a “SectorVector” for each letter was formed

Results from SectorVector Analysis

OCRManager Parses text just it was done first quarter; it uses the same method Individual letters are parsed using SectorParsing into SlopeFields into a SectorVector This SectorVector is compared to the cache by computing the scaled distance between them

Goals for 3rd Quarter Make the matching letters for SectorVectors fewer in number and more spread out Develop other heuristics