1 Détection des textes dans les images issues d’un flux vidéo pour l´indexation sémantique Laboratoire d'Informatique en Images et Systèmes d'information.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
QR Code Recognition Based On Image Processing
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Presented by Xinyu Chang
Prénom Nom Document Analysis: Document Image Processing Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Image Analysis Phases Image pre-processing –Noise suppression, linear and non-linear filters, deconvolution, etc. Image segmentation –Detection of objects.
Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.
Detecting Cartoons a Case Study in Automatic Video-Genre Classification Tzvetanka Ianeva Arjen de Vries Hein Röhrig.
High-level Component Filtering for Robust Scene Text Detection
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
SEMANTIC FEATURE ANALYSIS IN RASTER MAPS Trevor Linton, University of Utah.
WRSTA, 13 August, 2006 Rough Sets in Hybrid Intelligent Systems For Breast Cancer Detection By Aboul Ella Hassanien Cairo University, Faculty of Computer.
ICIP 2000, Vancouver, Canada IVML, ECE, NTUA Face Detection: Is it only for Face Recognition?  A few years earlier  Face Detection Face Recognition 
1 Détection des textes dans les images issues d ’un flux vidéo pour l´indexation sémantique Laboratoire d'Informatique en Images et Systèmes d'information.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Text Detection in Video Min Cai Background  Video OCR: Text detection, extraction and recognition  Detection Target: Artificial text  Text.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
EE663 Image Processing Edge Detection 2 Dr. Samir H. Abdul-Jauwad Electrical Engineering Department King Fahd University of Petroleum & Minerals.
Distinguishing Photographic Images and Photorealistic Computer Graphics Using Visual Vocabulary on Local Image Edges Rong Zhang,Rand-Ding Wang, and Tian-Tsong.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
Edge Detection Phil Mlsna, Ph.D. Dept. of Electrical Engineering
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Presented by Zeehasham Rasheed
Iris localization algorithm based on geometrical features of cow eyes Menglu Zhang Institute of Systems Engineering
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Lecture 6: Feature matching and alignment CS4670: Computer Vision Noah Snavely.
EE392J Final Project, March 20, Multiple Camera Object Tracking Helmy Eltoukhy and Khaled Salama.
Information Retrieval in Practice
Thien Anh Dinh1, Tomi Silander1, Bolan Su1, Tianxia Gong
EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
Detection and Extraction of Artificial Text from Videos PROJECT France Télécom Research & Development 001B575 Laboratoire de Reconnaissance de Formes et.
K. Zagoris, K. Ergina and N. Papamarkos Image Processing and Multimedia Laboratory Department of Electrical & Computer Engineering Democritus University.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Automated Detection and Classification Models SAR Automatic Target Recognition Proposal J.Bell, Y. Petillot.
Supervised Learning of Edges and Object Boundaries Piotr Dollár Zhuowen Tu Serge Belongie.
Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.
Computer Vision Michael Isard and Dimitris Metaxas.
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
Motion Estimation using Markov Random Fields Hrvoje Bogunović Image Processing Group Faculty of Electrical Engineering and Computing University of Zagreb.
1/25 Detection and Extraction of Artificial Text for Semantic Indexing Laboratoire Reconnaissance de Formes et Vision Bât. Jules Verne, INSA de Lyon
October 1, 2013Computer Vision Lecture 9: From Edges to Contours 1 Canny Edge Detector However, usually there will still be noise in the array E[i, j],
Wonjun Kim and Changick Kim, Member, IEEE
Scene Text Extraction Using Focus of Mobile Camera Egyul Kim, SeongHun Lee, JinHyung Kim Artificial Intelligence & Pattern Recognition Lab, KAIST, Korea.
Lecture 04 Edge Detection Lecture 04 Edge Detection Mata kuliah: T Computer Vision Tahun: 2010.
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Instructor: Mircea Nicolescu Lecture 5 CS 485 / 685 Computer Vision.
Digital Image Processing CSC331
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
An intelligent strategy for checking the annual inspection status of motorcycles based on license plate recognition Yo-Ping Huang a, Chien-Hung Chen b,
Content Based Coding of Face Images
- photometric aspects of image formation gray level images
Edge Detection Phil Mlsna, Ph.D. Dept. of Electrical Engineering Northern Arizona University.
Nonparametric Semantic Segmentation
Recognition using Nearest Neighbor (or kNN)
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Binarization of Low Quality Text Using a Markov Random Field Model
Brief Review of Recognition + Context
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
Support vector machine-based text detection in digital video
A Novel Smoke Detection Method Using Support Vector Machine
Filtering An image as a function Digital vs. continuous images
Christian Wolf Jean-Michel Jolion Françoise Chassaing
Presentation transcript:

1 Détection des textes dans les images issues d’un flux vidéo pour l´indexation sémantique Laboratoire d'Informatique en Images et Systèmes d'information LIRIS, FRE 2672 CNRS Bât. Jules Verne, INSA de Lyon Villeurbanne cedex 1 juillet Christian Wolf

2 FeaturesIntroductionEvaluationConclusionText detectionResults Introduction Features Evaluation/ Choice of features Text detection Conclusion Experimental Results Plan

3 Image/video indexing Content based image retrieval (Master’s degree): Query by example: Indexing based on local texture (Gabor) features Video indexing using semantic descriptors (PhD) : Text detection, enhancement, segmentation and recognition. Result keyword-based Search Patrick Mayhew Min. chargé de l´irlande de Nord ISRAEL Jerusalem montage T.Nouel... Key word Indexing phase FeaturesIntroductionEvaluationConclusionText detectionResults

4 Text detection “Soukaina Oufkir” Detection Enhancement Segmentation FeaturesIntroductionEvaluationConclusionText detectionResults

5 Detection in an image Contrast and Edge features Geometrical features Texture features Color features Problems: Which features? How can the decision be taken (text - non-text)? Separate populations (discriminant analysis) Learning a model (SVM, etc.) Reinforcement learning Master’s thesis of Graham Taylor Heuristics Region/stroke segmentation Corner features FeaturesIntroductionEvaluationConclusionText detectionResults

6 FeaturesIntroductionEvaluationConclusionText detectionResults Introduction Features Evaluation/ Choice of features Text detection Conclusion Experimental Results Plan

7 Videos vs. scanned documents Temporal aspects Complex and moving background Artificial shadows FeaturesIntroductionEvaluationConclusionText detectionResults

8 Videos vs. scanned documents Low resolution Low quality Antialising artifacts Compression artifacts Color bleeding FeaturesIntroductionEvaluationConclusionText detectionResults

9 What is text? - character segmentation Artificial text Scene text FeaturesIntroductionEvaluationConclusionText detectionResults

10 What is text? - texture Example: Gabor energy features on a text image Original imageFilter tuned to the example text Gabor energyThresholded Gabor energy FeaturesIntroductionEvaluationConclusionText detectionResults

11 What is text? - texture Still imagesIntroductionVideosIndexingCharacter segmentationResults

12 What is text? - corners Unthresholded “Harris” corner response FeaturesIntroductionEvaluationConclusionText detectionResults Derivative2nd derivative smeared

13 What is text? - contrast & geometry Example image Accumulated horizontal Sobel edges FeaturesIntroductionEvaluationConclusionText detectionResults

14 What is text? - color Original image Sobel on grayscale image Modified Sobel on L*u*v* image Special cases of text: Small contrast in the lumination plane High(er) contrast in the color plane FeaturesIntroductionEvaluationConclusionText detectionResults

15 FeaturesIntroductionEvaluationConclusionText detectionResults Introduction Features Text detection Conclusion Experimental Results Plan Evaluation/ Choice of features

16 Evaluation A good evaluation algorithm permits: A simple and intuitive interpretation of the obtained performance An objective comparison between the different algorithms to evaluate A good correspondence between the performance measures and the real performance, taking into account the objective of the algorithm (goal oriented approach) Takes into account only the performance of the algorithm, without side effects of other processing steps FeaturesIntroductionEvaluationConclusionText detectionResults

17 Evaluation at different levels Statistical separation: Bhattacharyya distance Error rate, Recall/Precision on pixel level Recall/Precision on rectangle level Goal oriented: Recall/Precision on character level Higher relevance to the application Lower influence of later stages Lower computational complexity Patrick Mayhew Min. chargé de l´irlande de Nord ISRAEL Jerusalem montage T.Nouel... Patrick Mayhew Min. chargé de l´irlande de Nord ISRAEL Jerusalem montage T.Nouel... Detection resultGround truth FeaturesIntroductionEvaluationConclusionText detectionResults

18 Evaluation on rectangle level DetectionGround truth Pure overlap is ambiguous on multiple images: 50% of recall could mean: 50% of the text rectangles have been detected perfectly 100% of the rectangles have been detected with 50% surface Anything between the two... FeaturesIntroductionEvaluationConclusionText detectionResults

19 Evaluation on rectangle level Requirements of an evaluation measure: Tells intuitively how many rectangles have been detected, and how many false alarms Measures the detection quality Takes into account one-2-one, one-2-many and many-2-one matches Scales up to multiple images Counts number of correctly detected rectangles Measures the detection quality Problem: Contradiction FeaturesIntroductionEvaluationConclusionText detectionResults

20 Performance graphs Ground truth G i Detection D i “Surface” Recall and Precision: Thresholded by different thresholds on recall and precision For each rectangle, we will know whether it has been detected or not, depending on a quality threshold FeaturesIntroductionEvaluationConclusionText detectionResults

21 Performance graphs Threshold on surface recall Threshold on surface precision FeaturesIntroductionEvaluationConclusionText detectionResults

22 Comparison of different detection algorithms Method 1: Local contrast Method 2: SVM Learning FeaturesIntroductionEvaluationConclusionText detectionResults

23 The influence of the test database Local contrastSVM learning FeaturesIntroductionEvaluationConclusionText detectionResults

24 FeaturesIntroductionEvaluationConclusionText detectionResults Introduction Features Conclusion Experimental Results Plan Evaluation/ Choice of features Text detection

25 The local contrast method Calculate a text probability image according to a text model (1 value/ pixel) Separate the probability values into 2 classes. Post processing Fisher/Otsu Mathematical morphology Geometrical constraints Verification of special cases Combination of rectangles F. LeBourgeois Still imagesIntroductionVideosConclusionCharacter segmentationResults

26 The learning method Learning gray values and edge maps alone may not generalize enough. Texture alone is not reliable, especially if the text is short. Geometry is a valuable feature. State of the art: enforce geometrical constraints in the post-processing step (mathematical morphology) We propose the usage of geometrical features very early in the detection process, i.e. not during post-processing. FeaturesIntroductionEvaluationConclusionText detectionResults

27 Geometrical features: baseline Text consists of: A high density of strokes in direction of the text baseline. A consistent baseline (a rectangular region with an upper and lower border). Two detection philosophies: Detection of the baseline directly before detecting the text region. Detection of the baseline as the boundary area of the detected text region in order to refine the detection quality. FeaturesIntroductionEvaluationConclusionText detectionResults

28 Estimation of the text rectangle height Original image Accumulated gradients FeaturesIntroductionEvaluationConclusionText detectionResults

29 Mode width (=rectangle height)Mode height (=Contrast)Difference height left-right Mode meanMode standard deviationDifference in mode width Features IntroductionEvaluationConclusionText detectionResults

30 Learning with Support Vector Machines Training image database positive samplesnegative samples Classification step: a reduction of the computational complexity is necessary: Sub-sampling of the pixels to classify (4x4) Approximation of the SVM model by SVM-regression. Bootstrapping, cross-validation FeaturesIntroductionEvaluationConclusionText detectionResults

31 FeaturesIntroductionEvaluationConclusionText detectionResults Introduction Features Conclusion Plan Evaluation/ Choice of features Text detection Experimental Results

32 FeaturesIntroductionEvaluationConclusionText detectionResults AIM3 News AIM4 Cartoons, News AIM5 News AIM2 Commercials

33 Detection in still images Local contrast SVM learning FeaturesIntroductionEvaluationConclusionText detectionResults

34 FeaturesIntroductionEvaluationConclusionText detectionResults Local contrast SVM learning

35 FeaturesIntroductionEvaluationConclusionText detectionResults Local contrast SVM learning

36 Detection in video sequences FeaturesIntroductionEvaluationConclusionText detectionResults

37 Character segmentation: examples Original image Fisher/Otsu Fisher/Otsu (windowed) Yanowitz-B. Yanowitz-B. +post-proc. Niblack Sauvola et al. Contrast maximiz. FeaturesIntroductionEvaluationConclusionText detectionResults

38 OCR results Local contrast based binarization Recognition by Abby Finereader 5.0 Sauvola et al. MRF Bayesian estimation using a Markov random field prior FeaturesIntroductionEvaluationConclusionText detectionResults

39 TREC 2002 “Dance” “Energy Gas” “Music” “Oil” “Airline” “Air plane” FeaturesIntroductionEvaluationConclusionText detectionResults Collaboration with Laboratory LAMP, University of Maryland

40 Conclusion êThe choice of features is primordial in vision. êWe developed a new system for detection, tracking, enhancement and binarisation of text. êDetection performance is high due to the integration of several types of features in a very early stage. The learning method is less sensitive to textured noise in the image. êWe propose a new evaluation method which allows intuitive visualization of the detection quality by performance graphs. FeaturesIntroductionEvaluationConclusionText detectionResults

41 Outlook êPossible improvement of the features (e.g. contrast normalization, non-linear texture filters). êIntegration of different feature types (statistical, structural,...) êUsage of a priori knowledge on text in order to decrease the number of false alarms êIntegration of the detected text into a indexing/browsing/segmentation framework FeaturesIntroductionEvaluationConclusionText detectionResults

42 Optional slides

43 The Bhattacharyya distance