JY Ramel et al. Interactive and Incremental Analysis of Document Images Laboratoire d’Informatique de Tours - FRANCE.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Representation and Description
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
電腦視覺 Computer and Robot Vision I
Image Segmentation Image segmentation (segmentace obrazu) –division or separation of the image into segments (connected regions) of similar properties.
Lecture 07 Segmentation Lecture 07 Segmentation Mata kuliah: T Computer Vision Tahun: 2010.
電腦視覺 Computer and Robot Vision I Chapter2: Binary Machine Vision: Thresholding and Segmentation Instructor: Shih-Shinh Huang 1.
Chapter 5 Raster –based algorithms in CAC. 5.1 area filling algorithm 5.2 distance transformation graph and skeleton graph generation algorithm 5.3 convolution.
O PTICAL C HARACTER R ECOGNITION USING H IDDEN M ARKOV M ODELS Jan Rupnik.
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
International Workshop on Computer Vision - Institute for Studies in Theoretical Physics and Mathematics, April , Tehran 1 II SIZE FUNCTIONS:
1Ellen L. Walker Segmentation Separating “content” from background Separating image into parts corresponding to “real” objects Complete segmentation Each.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Prénom Nom Document Analysis: Segmentation & Layout Analysis Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Image Segmentation. Introduction The purpose of image segmentation is to partition an image into meaningful regions with respect to a particular application.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Chapter 4.
E.G.M. PetrakisBinary Image Processing1 Binary Image Analysis Segmentation produces homogenous regions –each region has uniform gray-level –each region.
Learning Table Extraction from Examples Ashwin Tengli, Yiming Yang and Nian Li Ma School of Computer Science Carnegie Mellon University Coling 04.
Programming by Example using Least General Generalizations Mohammad Raza, Sumit Gulwani & Natasa Milic-Frayling Microsoft Research.
Information Retrieval in Practice
Groundtruthing for Performance Evaluation of Document Image Analysis Systems: a primer Mathieu Delalandre Pattern Recognition.
Introduction --Classification Shape ContourRegion Structural Syntactic Graph Tree Model-driven Data-driven Perimeter Compactness Eccentricity.
Charts and Graphs V
FEATURE EXTRACTION FOR JAVA CHARACTER RECOGNITION Rudy Adipranata, Liliana, Meiliana Indrawijaya, Gregorius Satia Budhi Informatics Department, Petra Christian.
Hubert CARDOTJY- RAMELRashid-Jalal QURESHI Université François Rabelais de Tours, Laboratoire d'Informatique 64, Avenue Jean Portalis, TOURS – France.
Aniko T. Valko, Keymodule Ltd.
Ink and Gesture recognition techniques. Definitions Gesture – some type of body movement –a hand movement –Head movement, lips, eyes Depending on the.
October 14, 2014Computer Vision Lecture 11: Image Segmentation I 1Contours How should we represent contours? A good contour representation should meet.
CS654: Digital Image Analysis Lecture 3: Data Structure for Image Analysis.
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
EE 492 ENGINEERING PROJECT LIP TRACKING Yusuf Ziya Işık & Ashat Turlibayev Yusuf Ziya Işık & Ashat Turlibayev Advisor: Prof. Dr. Bülent Sankur Advisor:
Need for Semantics  Now models represent just the object appearance  We need to represent also its  Properties  Roles  Behaviour  Services  … 
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
CS 6825: Binary Image Processing – binary blob metrics
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
Image interpretation by using conceptual graph: introducing complex spatial relations Aline Deruyver, AFD LSIIT UMR7005 CNRS ULP.
HP-PURDUE-CONFIDENTIAL Final Exam May 16th 2008 Slide No.1 Outline Motivations Analytical Model of Skew Effect and its Compensation in Banding and MTF.
Chapter 10, Part II Edge Linking and Boundary Detection The methods discussed in the previous section yield pixels lying only on edges. This section.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
Semantic Wordfication of Document Collections Presenter: Yingyu Wu.
Non-Photorealistic Rendering and Content- Based Image Retrieval Yuan-Hao Lai Pacific Graphics (2003)
Competence Centre on Information Extraction and Image Understanding for Earth Observation 29th March 2007 Category - based Semantic Search Engine 1 Mihai.
Levels of Image Data Representation 4.2. Traditional Image Data Structures 4.3. Hierarchical Data Structures Chapter 4 – Data structures for.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-
Results Post Processing using MSC.Acumen
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
Digital Image Processing
Machine Vision ENT 273 Regions and Segmentation in Images Hema C.R. Lecture 4.
1 Presentation Methodology Summary B. Golden. 2 Introduction Why use visualizations?  To facilitate user comprehension  To convey complexity and intricacy.
Image Segmentation Image segmentation (segmentace obrazu)
Wonjun Kim and Changick Kim, Member, IEEE
Refined Online Citation Matching and Adaptive Canonical Metadata Construction CSE 598B Course Project Report Huajing Li.
XP New Perspectives on Macromedia Dreamweaver MX 2004 Tutorial 5 1 Adding Shared Site Elements.
NLP&CC 2012 报告人:许灿辉 单 位:北京大学计算机科学技术研究所 Integration of Text Information and Graphic Composite for PDF Document Analysis 基于复合图文整合的 PDF 文档分析 Integration of.
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
Machine Vision ENT 273 Lecture 4 Hema C.R.
Automatic Video Shot Detection from MPEG Bit Stream
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Graph Based Shapes Representation and Recognition
Dingding Liu* Yingen Xiong† Linda Shapiro* Kari Pulli†
Presented by: Prof. Ali Jaoua
Aniko T. Valko, Keymodule Ltd.
EE 492 ENGINEERING PROJECT
Presentation transcript:

JY Ramel et al. Interactive and Incremental Analysis of Document Images Laboratoire d’Informatique de Tours - FRANCE

Document Image Analysis Systems: strategies and tools 2 LABRI Seminar Introduction Context of the presented work Let’s dive into the semantical gap… Characterization and representation of document images  Selection of low level primitives  A graph representation for the layout or structure Analysis and recognition of the contents  Contextual and incremental analysis  Operators and scenarios  AGORA and RETRO Prototypes Conclusion Work realized over a series of collaborations … thanks to all contributors!

Document Image Analysis Systems: strategies and tools 3 Introduction Context of the work Preservation of the cultural heritage  The CESR Tours : a training and research centre  Working on various domains of the Renaissance (historians)  A rich library of rare books (Loire Valley) An initial project: The Humanistic Virtual Library (BVH in French) Collaboration with RFAI research team  A pluridisciplinary collaboration : Experts in DIA + Experts in rare books + End-users  Fill up the semantical gap ?  A new idea : Introduce more interaction into DIA systems

Document Image Analysis Systems: strategies and tools 4 Introduction Let’s dive into the semantical gap Which segmentation methods are able to extract such EoC (Element of Content)? Data driven methods? No  Too much noise and parameters Model driven methods? No  Not much variability in the model of document

Document Image Analysis Systems: strategies and tools 5 Introduction Let’s dive into the semantical gap Low level information (data driven)  Pixels, regions, contours, primitives  Low level processing according to images specificities (data) High level entities (model driven)  Indexation = reading = a priori knowledge = Model  Domain specific processing  genericity Most of the time: no user / Everything is encoded ?

Document Image Analysis Systems: strategies and tools 6 Fill up the gap with the help of the user ? Learning of the model or of the shapes to recognize Introduction Let’s dive into the semantical gap Before

Document Image Analysis Systems: strategies and tools 7 Fill up the gap with the help of the user ? Interactive construction of the processing sequence A posteriori intervention  Error Correction by relevance feedback Ariane / Pandore GraphEdit / Directshow Introduction Let’s dive into the semantical gap Before After

Document Image Analysis Systems: strategies and tools 8 Introduction Let’s dive into the semantical gap Fill up the gap with the help of the user  Our Proposition Incremental analysis  Segmentation for recognition, recognition for segmentation  From the simplest to the more difficult Interactive analysis  User-driven method  Adaptation according to images  Adaptation according to user objectives It requires  An initial representation of the image content  A set of processing operators for segmentation and recognition with interoperability and compatibility capabilities DURING

Document Image Analysis Systems: strategies and tools 9 Part I Characterization and representation of document images  Information about the shapes using contour vectorization  Information about the structure (layout) using a graph representation  Towards a generic structural representation  VectoGraph

Document Image Analysis Systems: strategies and tools 10 Characterization and representation of document images Information about the shapes using contour vectorization Which primitives for describing shapes in a document?  Binarization to extract contours (Vectors, Quad) and CC

Document Image Analysis Systems: strategies and tools 11 Characterization and representation of document images Information about structure using graphs Idea: An evolutive graph of EoC Two types of EoC: Primitives / Elementary EoC Connected components Vectors Quadrilaterals User defined EoC Characters Words Ornamental letters Triangles Diodes …

Document Image Analysis Systems: strategies and tools 12 Node = Primitive or EoC - Type of EoC – Centre (X,Y) of the Bounding Box – Bounding Box – Bounding Rectangle BR :(P1,P2,P3,P4) – Orientation = inertia axis – Density of B&W inside the BB – Color : Average color – List of the elementary EoC – Number of elementary EoC – Confidence rate Edge = Relation between EoC – Minimal distance between 2 EoC – Angle between EoC – Relation : Inside, Overlap, L, T, P, X, S, undefined EoC Characterization and representation of document images Information about structure using graphs Towards a generic representation with graphs Axis

Document Image Analysis Systems: strategies and tools 13 Initial Representation for old document - Graph of EoCs + Background map - Background map Graph of the connected components  Tagging of the nodes according to the size  Noise  Text  Images  Edges between closed shapes (CC) Horizontal/Vertical neighborhood H H V V

Document Image Analysis Systems: strategies and tools 14 Characterization and representation of document images Information about structure using graphs Towards a generic structural representation Domain independent Documents : Pixels, Points, … Layout & relations Shapes & EoC Angle Edge : Relation between primitives or EoCs Quadrilateral Vector Connected component Distance Topology Node : Primitive, EoC Representation = structural graph Analysis = Domain dependent

Document Image Analysis Systems: strategies and tools 15 Part II Analysis and recognition of image contents  Contextual and incremental analysis of old document images  Three operators with simple parameters  Interactive construction of scenario  Examples  AGORA prototype

Document Image Analysis Systems: strategies and tools 16 Analysis and recognition of image contents Strategy of analysis Proposition : user driven analysis (scenario)  incremental and interactive approach No predefined EoCs (model of document)  Users can themselves define the required EoCs  Interactive definition of the model of the document  Incremental analysis (simple  difficult) Easiness  Easy to use interfaces – user assistant  No complex image processing algorithms  But just: Tagging (extraction-recognition) Merging Deletion

Document Image Analysis Systems: strategies and tools 17 Analysis and recognition of image contents Three operators Tagging (extraction) of EoC (nodes) according to  rules about spatial position in the pages  rules about neighborhood relationship (using edges)  rules about internal properties (node attributes) Merging of EoC according to  rules using the distance computed from the background map (edge attributes)  on a specific type of EoCs Deletion of EoC according to  label and user decision

Document Image Analysis Systems: strategies and tools Analysis and recognition of image contents Scenarios User-defined processing sequence  Graph analysis and modification  Defined by users on a typical image  Depending on the user objectives and on the images  Can be saved, edited and applied in batch mode …

Document Image Analysis Systems: strategies and tools 19 Analysis and recognition of image contents Examples Initial representation = primal EoC

Document Image Analysis Systems: strategies and tools 20 Analysis and recognition of image contents Examples Tagging the primal EoC  Text – Graphic - Noise Graphic

Document Image Analysis Systems: strategies and tools 21 Analysis and recognition of image contents Examples Merging of EoC = Text  Word – Line - Paragraph

Document Image Analysis Systems: strategies and tools 22 Analysis and recognition of image contents Examples Position verticale Position horizontale avg = 0,46 std = 0,41 avg = 0,51std = 0,07 Automatic Tagging  Lettrine With the collaboration of Nicholas…

Document Image Analysis Systems: strategies and tools 23 Analysis and recognition of image contents Examples ERREUR Automatic Tagging with manual validation/modification of the rule With the collaboration of Nicholas…

Document Image Analysis Systems: strategies and tools 24 Analysis and recognition of image contents Examples - Primal sketch construction - Img Type = Connected Component of size > Noise Type = Connected Component of size < 10 - Text Type = Connected Component of size between 10 and Horizontal and vertical Fusion of Text with d < Border Type = Img with Width/Height Ratio between 3 and 10 - Ornamental Letter Type = Img close to Nothing at the Left and close to Text at the Right - Img Type = Ornamental Letter with Width/Height Ratio 1,2 - Img Type = Ornamental Letter on the Right < 75 % - Left Margin Type = Text on the left with 25% - Right Margin Type = Text on the right with 25% - Vertical Fusion of the Left and Right Margins with d < Horizontal Fusion of the Text with d < Pagination Type = Text in top with 10% - Text Type = Pagination with a number of Connected Components > 3 - Signature Type = Text in bottom with 25% - Text Type = Signature with a number of Connected components > 5 - Text type = Signature with Text below, on the left or on the right - Suppression of the EoC labelled Text - Suppression of the EoC labelled Noise Example of an obtained scenario applicable on a set of images

Document Image Analysis Systems: strategies and tools Analysis and recognition of image contents Examples Marge Title Text Legend Lettrine Noise Results

Document Image Analysis Systems: strategies and tools 26 Analysis and recognition of image contents AGORA A User-driven Approach |see IJDAR] Graph representation Simple operators Scenarios Some interfaces Used since 2004 at the CESR Always to be improved… Download :

Document Image Analysis Systems: strategies and tools 27 Analysis and recognition of image contents From AGORA to RETRO

Document Image Analysis Systems: strategies and tools 28 Analysis and recognition of image contents From AGORA to RETRO … La l[21]ngueu[7] du chevalet depuis s[21]n pied jusques au c[7][21]chet d’en haut, p[21]rte d[21]uze t[7][21]us … Accuracy Frequence of « tri-gram » Dictionnary Contextual, manual and automatic Transcription

Document Image Analysis Systems: strategies and tools 29 Analysis and recognition of image contents From AGORA to RETRO Experiment on one complete book (Vésale) Book of 150 pages connected components (pseudo characters) classes (clusters) have been built. 90% of these classes are composed of less than 10 occurrences Ignoring these classes during transcription means to miss one character for 14  more than one on each text line !!!  57% of the classes are composed of a single shape  Why ?  Noise, spots  Touching characters  Splitted characters  Same for words The 200 largest classes correspond to 85% of the text

Document Image Analysis Systems: strategies and tools 30 Conclusion Proposition of a global approach: from images to their interpretation Modelisation of the data  representation of image content  Genericity : thin and filled shapes, line and curves, shapes and structure or layout  Contour vectorization + relationship analysis  Utilization of attributed graphs Modelisation of processing operators  recognition  Utilization of contextual information during the EoC extraction and recognition  Involvement of the user (early) in the processing sequence : user- driven analysis Proposition of new structural PR techniques

Document Image Analysis Systems: strategies and tools 31 Thanks Questions ?

Document Image Analysis Systems: strategies and tools Analysis and recognition of image contents Scenarios Document : Image, pixels Symbole … Titre Lettrine EdC Representation = Graph of EdC Q1 Q2 … User defined scenarios = succession of operators + thresholds Q2 Q1 … P2Q3 … Scénario 1 Scénario 2 Scénario 3 P1 P3 P2P4