Laserfiche Clinic 2006-2007 Liaison HMC, Sept. 12 th, 2006 Adam Field Stephen Smith Ben Tribelhorn, PM Aaron Wolin Advisor: Zach Dodds.

Slides:



Advertisements
Similar presentations
Distinctive Image Features from Scale-Invariant Keypoints
Advertisements

Patient information extraction in digitized X-ray imagery Hsien-Huang P. Wu Department of Electrical Engineering, National Yunlin University of Science.
Hydrology for all: Image-based water level monitoring.
Area and perimeter calculation using super resolution algorithms M. P. Cipolletti – C. A. Delrieux – M. C. Piccolo – G. M. E. Perillo IADO – UNS – CONICET.
QR Code Recognition Based On Image Processing
Word Spotting DTW.
Prénom Nom Document Analysis: Document Image Processing Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Document Image Processing
IntroductionIntroduction AbstractAbstract AUTOMATIC LICENSE PLATE LOCATION AND RECOGNITION ALGORITHM FOR COLOR IMAGES Kerem Ozkan, Mustafa C. Demir, Buket.
Database-Based Hand Pose Estimation CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.
With support from: NSF DUE in partnership with: George McLeod Prepared by: Geospatial Technician Education Through Virginia’s Community Colleges.
OCRdroid : A Framework to Digitize Text Using Mobile Phones  Authors  Mi Zhang, Anand Joshi, Ritesh Kadmawala, Karthik Dantu, Sameera Poduri, and Gaurav.
Esmail Hadi Houssein ID/  „Motivation  „Problem Overview  „License plate segmentation  „Character segmentation  „Character Recognition.
CS 128/ES Lecture 10a1 Raster Data Sources: Paper maps & Aerial photographs.
Computer Vision REU Week 2 Adam Kavanaugh. Video Canny Put canny into a loop in order to process multiple frames of a video sequence Put canny into a.
Automatic Parking Enforcer ENSC 440 Presentation and Demo December 13, 2010 Presented by: R.Maroufi, R.Johal, A.Moshgabadi, Y.Kuo, S.Rohani.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
Digital Imaging and Remote Sensing Laboratory Correction of Geometric Distortions in Line Scanner Imagery Peter Kopacz Dr. John Schott Bryce Nordgren Scott.
Text Detection in Video Min Cai Background  Video OCR: Text detection, extraction and recognition  Detection Target: Artificial text  Text.
Textual Information Access for the Visually Impaired Ramani Duraiswami.
Face Detection: a Survey Speaker: Mine-Quan Jing National Chiao Tung University.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.
Fig. 2 – Test results Personal Memory Assistant Facial Recognition System The facial identification system is divided into the following two components:
Extension of M-VOTE: Improving Feature Detection
Document Image Analysis CSE 717 An Introduction. Document Image Analysis  DIA is the theory and practice of recovering the symbol structures of digital.
Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.
California Car License Plate Recognition System ZhengHui Hu Advisor: Dr. Kang.
Digital Imaging and Remote Sensing Laboratory Automatic Tie-Point and Wire-frame Generation From Oblique Aerial Imagery Seth Weith-Glushko Advisor: Carl.
Brain segmentation and Phase unwrapping in MRI data ECE 738 Project JongHoon Lee.
Computer Vision Systems for the Blind and Visually Disabled. STATS 19 SEM Talk 3. Alan Yuille. UCLA. Dept. Statistics and Psychology.
1 REAL-TIME IMAGE PROCESSING APPROACH TO MEASURE TRAFFIC QUEUE PARAMETERS. M. Fathy and M.Y. Siyal Conference 1995: Image Processing And Its Applications.
IT Introduction to Information Technology CHAPTER 05 - INPUT.
PixelLaser: Range scans from image segmentation Nicole Lesperance ’11 Michael Leece ’11 Steve Matsumoto ’12 Max Korbel ’13 Kenny Lei ’15 Zach Dodds ‘62.
KNOWLEDGE DATABASE Topics inside  Document sharing  Event marketing  Web content.
Final Exam Review CS485/685 Computer Vision Prof. Bebis.
By Meidika Wardana Kristi, NRP  Digital cameras used to take picture of an object requires three sensors to store the red, blue and green color.
Stereoscopic Imaging for Slow-Moving Autonomous Vehicle By: Alexander Norton Advisor: Dr. Huggins April 26, 2012 Senior Capstone Project Final Presentation.
Course Syllabus 1.Color 2.Camera models, camera calibration 3.Advanced image pre-processing Line detection Corner detection Maximally stable extremal regions.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
CS 6825: Binary Image Processing – binary blob metrics
Sensing for Robotics & Control – Remote Sensors R. R. Lindeke, Ph.D.
September 23, 2014Computer Vision Lecture 5: Binary Image Processing 1 Binary Images Binary images are grayscale images with only two possible levels of.
Chapter 10, Part II Edge Linking and Boundary Detection The methods discussed in the previous section yield pixels lying only on edges. This section.
Update September 21, 2011 Adrian Fletcher, Jacob Schreiver, Justin Clark, & Nathan Armentrout.
CS-498 Computer Vision Week 8, Day 3 Thresholding and morphological operators My thesis? 1.
11/29/ Image Processing. 11/29/ Systems and Software Image file formats Image processing applications.
BVisionBVision Dedicated for the Morris water maze test study of effect of different compounds on spatial orientation and memory of experimental animals.
Barcodes, MMS, and the Internet’s Cheapest Prices Greg McGrath & Greg Maier Advisors: Professor Cotter, Professor Rudko ECE-499 March 01, 2008.
Robotics Chapter 6 – Machine Vision Dr. Amit Goradia.
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Course 3 Binary Image Binary Images have only two gray levels: “1” and “0”, i.e., black / white. —— save memory —— fast processing —— many features of.
Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons.
Ancona - Municipality Hall The principal point PP is the intersection of the three heights of the triangle with vertices the three V.P. The V.P. procedure.
CMSC5711 Image processing and computer vision
Signal and Image Processing Lab
Bitmap Image Vectorization using Potrace Algorithm
Lecture 26 Hand Pose Estimation Using a Database of Hand Images
A language assistant system for smart glasses
Fill and Stroke Stroke is the outline of a shape, text or image.
Computer Vision Lecture 5: Binary Image Processing
Text Detection in Images and Video
2.02 Understand Digital Vector Graphics
CMSC5711 Image processing and computer vision
Creating Vectors – Part Two
Graphic Editing Terms Cropping
CS654: Digital Image Analysis
Binary Image processing بهمن 92
SIFT.
Creating Vectors – Part Two
Presentation transcript:

Laserfiche Clinic Liaison HMC, Sept. 12 th, 2006 Adam Field Stephen Smith Ben Tribelhorn, PM Aaron Wolin Advisor: Zach Dodds

The Problem To convert pictures of documents taken with a digital camera into images that can be organized using Laserfiche's OCR and database technologies. Project goal: raw imageOCR-able image

The Problem To convert pictures of documents taken with a digital camera into images that can be organized using Laserfiche's OCR and database technologies. Project goal: Some important cases: presence of paperclips and/or staples varied/confusing backgrounds (including stacks of papers) one or more edges off the edge of the image knowing when the system has failed camera perspective issues - documents not images head-on (?) other important cases? raw imageOCR-able image

Approach taken by previous clinic Finding document corners Unwarping to 8.5 x 11" Possible approach taken by current clinic First analyzing text-line boundaries Then unwarping to straighten them Approaches Outside - InInside - Out ?

Lu and Tan. “Camera Document Restoration for OCR.” VSBs Camera Document Restoration for OCR Several algorithms use VSBs to detect and correct the image Able to detect the type of distortion or severity of the warping Uses “Vertical Stroke Boundaries” VSBs of characters

Lu, Chen, and Ko. “Perspective rectification of document images using fuzzy set and morphological operations.” Tip point tracing process. Finding Vertical Stroke Boundaries Connected components first Find the "top" and "base" lines for a line of text Scan between the top and base lines, searching for pixels that form relatively orthogonal and straight lines

Avila and Lins. “A Fast Orientation and Skew Detection Algorithm for Monochromatic Document Images.” A Fast Orientation and Skew Detection Algorithm Uses connected components and nearest neighbors to find document skew Places the text line angles into two histograms from ±90º Precisions are 1.0º and 0.1º The skew angle is the histogram peak

Hand- writing Geometric PerspectiveSkew Magazines/ Newspaper Forms Problem Taxonomy Mostly text documents warp severity document difficulty

Hand- writing Geometric PerspectiveSkew Magazines/ Newspaper Forms Problem Priorities ? Mostly text documents primary focus secondary focus warp severity document difficulty

Pair 1's plan Finding character strokes Estimating warp severity Thresholding picture from ben and stephen

Least-sq. line-fitting Visualizing the processing Finding skew estimates Two-tier assessment 1) reasonable? 2) OCR accuracy picture from aaron & adam Pair 2's plan

Tentative Schedule Weekly conference calls with Ed Heaney Accessible codebase and performance updates Other deliverables ? Th 9/21 (11:30 am) Call - progress update T 9/26 Initial Harvey Mudd Th 9/28 Prototype of each algorithm F 10/6 ? Site visit and Laserfiche

Comments?

Other Papers

Hand Writing Image Warping GeometricPerspectiveSkew Magazin es Forms Plain Text

Hand- writing Geometric PerspectiveSkew Magazines/ Newspaper Forms Taxonomy Mostly text documents