I MAGE SEGMENTATION AND 3 D MODELING TO BOOST TEXT RECOGNITION IN NATURAL SCENES Shounak Gore 04/26/11.

Slides:

Advertisements

Similar presentations

Feature extraction: Corners

Advertisements

CVPR2013 Poster Modeling Actions through State Changes.

Image Analysis Phases Image pre-processing –Noise suppression, linear and non-linear filters, deconvolution, etc. Image segmentation –Detection of objects.

IntroductionIntroduction AbstractAbstract AUTOMATIC LICENSE PLATE LOCATION AND RECOGNITION ALGORITHM FOR COLOR IMAGES Kerem Ozkan, Mustafa C. Demir, Buket.

Image Segmentation Image segmentation (segmentace obrazu) –division or separation of the image into segments (connected regions) of similar properties.

電腦視覺 Computer and Robot Vision I Chapter2: Binary Machine Vision: Thresholding and Segmentation Instructor: Shih-Shinh Huang 1.

Computer Vision Lecture 16: Region Representation

1 Building a Dictionary of Image Fragments Zicheng Liao Ali Farhadi Yang Wang Ian Endres David Forsyth Department of Computer Science, University of Illinois.

Quadtrees, Octrees and their Applications in Digital Image Processing

IS THERE A MORE COMPUTATIONALLY EFFICIENT TECHNIQUE FOR SEGMENTING THE OBJECTS IN THE IMAGE? Contour tracking/border following identify the pixels that.

Feature extraction: Corners 9300 Harris Corners Pkwy, Charlotte, NC.

Text Detection in Video Min Cai Background  Video OCR: Text detection, extraction and recognition  Detection Target: Artificial text  Text.

EE663 Image Processing Edge Detection 2 Dr. Samir H. Abdul-Jauwad Electrical Engineering Department King Fahd University of Petroleum & Minerals.

Processing Digital Images. Filtering Analysis –Recognition Transmission.

Segmentation Divide the image into segments. Each segment:

CS223B Assignment 1 Recap. Lots of Solutions! 37 Groups Many different approaches Let’s take a peek at all 37 results on one image from the test set.

Detecting Image Region Duplication Using SIFT Features March 16, ICASSP 2010 Dallas, TX Xunyu Pan and Siwei Lyu Computer Science Department University.

Quadtrees, Octrees and their Applications in Digital Image Processing

Highlights Lecture on the image part (10) Automatic Perception 16

Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.

Redaction: redaction: PANAKOS ANDREAS. An Interactive Tool for Color Segmentation. An Interactive Tool for Color Segmentation. What is color segmentation?

Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.

Image Segmentation Using Region Growing and Shrinking

Image Segmentation by Clustering using Moments by, Dhiraj Sakumalla.

October 8, 2013Computer Vision Lecture 11: The Hough Transform 1 Fitting Curve Models to Edges Most contours can be well described by combining several.

CS 376b Introduction to Computer Vision 04 / 29 / 2008 Instructor: Michael Eckmann.

October 14, 2014Computer Vision Lecture 11: Image Segmentation I 1Contours How should we represent contours? A good contour representation should meet.

BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.

HANOLISTIC: A HIERARCHICAL AUTOMATIC IMAGE ANNOTATION SYSTEM USING HOLISTIC APPROACH Özge Öztimur Karadağ & Fatoş T. Yarman Vural Department of Computer.

Computer Vision Lecture 5. Clustering: Why and How.

Chapter 14: SEGMENTATION BY CLUSTERING 1. 2 Outline Introduction Human Vision & Gestalt Properties Applications – Background Subtraction – Shot Boundary.

CS 6825: Binary Image Processing – binary blob metrics

ENT 273 Object Recognition and Feature Detection Hema C.R.

Digital Image Processing CCS331 Relationships of Pixel 1.

Quadtrees, Octrees and their Applications in Digital Image Processing.

Chapter 10 Image Segmentation.

November 13, 2014Computer Vision Lecture 17: Object Recognition I 1 Today we will move on to… Object Recognition.

G52IVG, School of Computer Science, University of Nottingham 1 Edge Detection and Image Segmentation.

Feature extraction: Corners 9300 Harris Corners Pkwy, Charlotte, NC.

Scene Completion Using Millions of Photographs James Hays, Alexei A. Efros Carnegie Mellon University ACM SIGGRAPH 2007.

Geodesic Saliency Using Background Priors

The Implementation of Markerless Image-based 3D Features Tracking System Lu Zhang Feb. 15, 2005.

Tracking Turbulent 3D Features Lu Zhang Nov. 10, 2005.

October 16, 2014Computer Vision Lecture 12: Image Segmentation II 1 Hough Transform The Hough transform is a very general technique for feature detection.

Machine Vision ENT 273 Regions and Segmentation in Images Hema C.R. Lecture 4.

Visual Tracking by Cluster Analysis Arthur Pece Department of Computer Science University of Copenhagen

Lukáš Neumann and Jiří Matas Centre for Machine Perception, Department of Cybernetics Czech Technical University, Prague 1.

Digital Image Processing Lecture 17: Segmentation: Canny Edge Detector & Hough Transform Prof. Charlene Tsai.

Robotics Chapter 6 – Machine Vision Dr. Amit Goradia.

Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.

Problem Set 2 Reconstructing a Simpler World COS429 Computer Vision Due October (one week from today)13 th.

Digital Image Processing CCS331 Relationships of Pixel 1.

Keypoint extraction: Corners 9300 Harris Corners Pkwy, Charlotte, NC.

Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons.

Hough Transform CS 691 E Spring Outline Hough transform Homography Reading: FP Chapter 15.1 (text) Some slides from Lazebnik.

1. 2 What is Digital Image Processing? The term image refers to a two-dimensional light intensity function f(x,y), where x and y denote spatial(plane)

Parallel Image Processing: Active Contour Algorithm

Recognition of biological cells – development

DIGITAL SIGNAL PROCESSING

Mean Shift Segmentation

Fitting Curve Models to Edges

3D Rendering Pipeline Hidden Surface Removal 3D Primitives

RGB-D Image for Scene Recognition by Jiaqi Guo

Brief Review of Recognition + Context

Object Recognition Today we will move on to… April 12, 2018

Image Segmentation.

Grape Detection in Vineyards Introduction To Computational and Biological Vision final project Kobi Ruham Eli Izhak.

Image Segmentation Using Region Growing and Shrinking

Presentation transcript:

I MAGE SEGMENTATION AND 3 D MODELING TO BOOST TEXT RECOGNITION IN NATURAL SCENES Shounak Gore 04/26/11

P ROBLEM S TATEMENT Text detection in natural scene images is an unsolved problem. The objective of the project is to segment images and understand the context to boost the detection of text in natural scenes.

M OTIVATION Text recognition in natural scenes is a question of open research. Robotic systems doing exact opposite thing are available.

A LGORITHM Segment the image. Select the area of interest, find the text location and apply text restoration if required. The text obtained in the above part is given to an OCR for text detection.

I MAGE S EGMENTATION “In computer vision, segmentation refers to the process of partitioning a digital image into multiple segments.” “Image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain visual characteristics.”

W AYS TO S EGMENT AN I MAGE Thresholding Clustering Compression-based Histogram-based Edge detection Region growing Split-and-merge Partial differential equation-based Level set methods Graph partitioning methods

F UZZY S HELL C LUSTERING This method combines both the edge detection and clustering approach. For an object like a building, simple edge detection is not sufficient. We need to cluster points together that form rectangles.

F UZZY S HELL C LUSTERING The algorithm in brief : Select the number of clusters that you need. For every point, using the selected distance measure (Euclidean or a close approximation is usually used), decide the extent to which it belongs to every cluster. Look out for parallel edges (rectangles or shape of a building will have parallel edges).

F UZZY S HELL C LUSTERING The fuzzy clustering has many advantages : It can easily detect shapes at various orientations Fuzzy Shell Clustering Algorithms in Image Processing, Frank Hoeppner

F UZZY S HELL C LUSTERING The fuzzy clustering has many advantages : It can easily detect various shapes Fuzzy Shell Clustering Algorithms in Image Processing, Frank Hoeppner

F UZZY S HELL C LUSTERING The fuzzy clustering has many advantages : It can compensate for missing edges Fuzzy Shell Clustering Algorithms in Image Processing, Frank Hoeppner

T EXT L OCATION D ETECTION The assumption is that either the text or its background is of a uniform color. The text is located by grouping text pixels based on their intensity and region layout analysis.

This and previous image from : Text detection and restoration in natural scene images, Qixiang Ye, Jianbin Jiao, Jun Huang, Hua Yu

T EXT R ESTORATION Once the location has been determined, we need to restore the orientation of the text. This usually needs the camera parameters are necessary. But as these parameters are not available, perspective projection matrix which relates the image coordinate to the world coordinates is used.

T EXT R ESTORATION We use the following equation for the purpose : Text detection and restoration in natural scene images, Qixiang Ye, Jianbin Jiao, Jun Huang, Hua Yu

T EXT R ESTORATION But as we have a 2D view after the transformation Z =1 and the equation transforms to : Text detection and restoration in natural scene images, Qixiang Ye, Jianbin Jiao, Jun Huang, Hua Yu

T EXT OCR The text after the preprocessing is then given to a text OCR. For this project an HMM based OCR will be used. The HTK toolkit was used for the generation of the OCR.

M INIMUM G OALS The text localization, restoration and text OCR are already completed. Get the Fuzzy cluster code working and make it compatible for all possible cases that can occur in natural scenes.

T ESTING The metric for testing is the result of the OCR. ICDAR 2003 dataset is used for training and testing the proposed algorithm. The segmented, localized and restored text will be tested for its accuracy as compared to a text obtained with just the localization segment of the algorithm.

A DDITIONAL G OALS Try to segment a varied variety of images. Find more time efficient algorithms. Train the OCR based on the contextual information and test the accuracy.

R EFERENCES g ion Fuzzy Shell Clustering Algorithms in Image Processing: Fuzzy C-Rectangular and 2-Rectangular Shells, Frank Hoeppner Text detection and restoration in natural scene images, Qixiang Ye, Jianbin Jiao, Jun Huang, Hua Yu

Thank You