Employing structural representation for symbol detection, symbol spotting and indexation in line drawing document images Muhammad Muzzamil Luqman

Presentation transcript:

Employing structural representation for symbol detection, symbol spotting and indexation in line drawing document images
Muhammad Muzzamil Luqman, PhD student (II year)
Thesis supervisors:
Jean-Yves Ramel, Université François Rabelais de Tours, France
Thierry Brouard, Université François Rabelais de Tours, France
Josep Lladós, Universitat Autònoma de Barcelona, Spain
Wednesday, 02 June 2010

Plan
Part 1: Representation and recognition of graphics content in line drawing document images
Part 2: Unsupervised indexation and content based (focused) retrieval for line drawing document image repositories
Outline: Part 1: Recognition of graphics content, Experimentation, Some remarks. Part 2: Content based (focused) retrieval, Experimentation, Conclusion.

Part 1: Representation and recognition of graphics content in line drawing document images

Representation phase
The structure of the graphics content is represented by an Attributed Relational Graph (ARG).
The method has three phases: representation, description, and learning and classification.

Description phase
A signature is extracted from the ARG.

Description phase

Description phase
Fuzzy intervals (figure annotations):
- A value lying here contributes fully (membership weight 1) to the interval “Small”.
- A value lying here contributes in part to the interval “Medium” and in part to the interval “Full”.

Description phase
Two iterations over the set of ARGs.
First iteration:
1. Compute ‘connection density counts’ for all ARGs.
2. Distribute these ‘connection density counts’ in an optimal number of bins.
3. Arrange the bins in a fuzzy fashion to form overlapping intervals for ‘Low’, ‘Medium’ and ‘High’ connection densities.
Second iteration:
Compute the signature for the graphic symbols (ARGs).
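The overlapping intervals of step 3 can be sketched in Python. This is a minimal illustration and not the authors' implementation: the trapezoidal membership shape, the breakpoints in `INTERVALS` and the names `trapezoid` and `fuzzy_counts` are all assumptions made for the example. The slides do not say how the optimal number of bins is chosen, so the breakpoints are simply hard-coded here.

```python
from math import inf

# Hypothetical breakpoints (a, b, c, d) of each trapezoidal fuzzy interval.
# Membership rises from a to b, is 1 between b and c, and falls from c to d.
INTERVALS = {
    "Low": (0, 0, 2, 4),
    "Medium": (2, 4, 6, 8),
    "High": (6, 8, inf, inf),
}


def trapezoid(x, a, b, c, d):
    """Membership weight of x in the trapezoidal interval (a, b, c, d)."""
    if b <= x <= c:
        return 1.0
    if a < x < b:
        return (x - a) / (b - a)
    if c < x < d:
        return (d - x) / (d - c)
    return 0.0


def fuzzy_counts(values):
    """Accumulate the membership weights of all values per interval."""
    totals = {name: 0.0 for name in INTERVALS}
    for value in values:
        for name, bounds in INTERVALS.items():
            totals[name] += trapezoid(value, *bounds)
    return totals


if __name__ == "__main__":
    # A density of 3 lies in the overlap of 'Low' and 'Medium'
    # and contributes in part to each of them.
    print(fuzzy_counts([1, 3, 7, 10]))
```

With these breakpoints the memberships of any non-negative value sum to 1, so a value in an overlap zone is shared between two neighbouring intervals instead of being forced into one bin.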

Learning phase (structure and parameters of the BN)
The joint probability distribution of the signatures is encoded by a Bayesian Network (BN).
Figure labels: P(Nodes), P(Class|Nodes), P(DenH|DenM).

Classification phase (graphics recognition)
The Bayesian Network encodes the joint probability distribution of the signatures.
Bayesian probabilistic inference is used for recognition: Bayes rule gives the posterior probability of each class for the query signature, and the query is recognized as the class with the highest posterior probability.
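The formula itself was not carried over into this transcript. For reference, the standard form of Bayes rule that matches the description is given below. The symbols are assumptions chosen for this note: $s$ is the signature of the query, $c_i$ is the $i$-th symbol class and $\hat{c}$ is the recognized class.

```latex
P(c_i \mid s) = \frac{P(s \mid c_i)\, P(c_i)}{P(s)},
\qquad \text{where} \qquad
P(s) = \sum_{k} P(s \mid c_k)\, P(c_k),
\qquad
\hat{c} = \arg\max_{i} P(c_i \mid s).
```

Since $P(s)$ is the same for every class, the ranking of the classes depends only on the numerator $P(s \mid c_i)\, P(c_i)$.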

Example images

Noise and deformations
2D linear model symbols from GREC databases.
Learning on clean symbols and testing against noisy and deformed symbols.
Results presented in CIFED2010: with fuzzy intervals.
Results presented in ICDAR2009: without fuzzy intervals.

Noise and deformations
2D linear model symbols from GREC databases.
Learning on clean symbols and testing against noisy and deformed symbols.
Comparing results with (Qureshi et al., 2007) and (Luqman et al., 2009).

Context noise
2D linear model symbols from GREC databases (SESYD dataset).
Learning on clean symbols and testing against context noise.
Results presented in CIFED2010: with fuzzy intervals.

Some remarks
- The method is based on vectorization and is therefore sensitive to noise and deformation, which produce irregularities in the signature.
- The proposed signature is more vulnerable for symbols that are composed of circles and arcs.
- However, the lightweight signature and the efficient classifier make the method suitable as a pre-processing step to reduce the search space, or as a quick discrimination method for a sufficiently large number of graphic symbols: an application to symbol spotting.

Generalizing fuzzy signature: Explicit Graph Embedding
Vector for explicit embedding of attributed graphs.
Fuzzy zones for “features for node degrees” (for example): a value lying here contributes in part to the interval “F_i2” and in part to the interval “F_i3”.
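As a rough illustration of the idea, the Python sketch below maps a graph to a fixed-length vector whose last entries form a fuzzy histogram of node degrees. The layout of the vector (graph order, graph size, then the histogram), the zone breakpoints in `ZONES` and the function names are assumptions made for this example. The slide does not list the actual features of the embedding.

```python
from collections import Counter
from math import inf

# Hypothetical trapezoidal fuzzy zones (a, b, c, d) for node degrees.
ZONES = [(0, 0, 1, 3), (1, 3, 4, 6), (4, 6, inf, inf)]


def membership(x, a, b, c, d):
    """Trapezoidal membership: 1 on [b, c], linear on (a, b) and (c, d)."""
    if b <= x <= c:
        return 1.0
    if a < x < b:
        return (x - a) / (b - a)
    if c < x < d:
        return (d - x) / (d - c)
    return 0.0


def embed(num_nodes, edges):
    """Embed an undirected graph with nodes 0..num_nodes-1 into a vector.

    Returns [order, size, h_0, h_1, h_2] where h_i is the summed
    membership of all node degrees in fuzzy zone i.
    """
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    histogram = [0.0] * len(ZONES)
    for node in range(num_nodes):
        for i, zone in enumerate(ZONES):
            histogram[i] += membership(degree[node], *zone)
    return [float(num_nodes), float(len(edges))] + histogram


if __name__ == "__main__":
    # Triangle: every node has degree 2, which lies between zones 0 and 1.
    print(embed(3, [(0, 1), (1, 2), (0, 2)]))
```

Graphs of different sizes all map to vectors of the same length, which is what lets ordinary vector-based classifiers and clustering tools be applied to them.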

ICPR2010 contest on Graph Embedding for Pattern Recognition (GEPR)
Results on sample contest data:
- ALOI (Amsterdam Library of Object Images): Performance Index 0.379
- COIL (Columbia Object Image Library): Performance Index 0.376
- ODBK (Object Databank): Performance Index 0.353
The Performance Index measures the quality of the clustering that could be obtained for the embedded vectors. The closer it is to zero, the better the embedding results.

Part 2: Unsupervised indexation and content based (focused) retrieval for line drawing document image repositories

A Symbol Spotting & Focused Retrieval System

A Symbol Spotting & Focused Retrieval System
Unsupervised indexation of line drawing document images:
- Represent document images by attributed relational graphs.
- Spot Regions Of Interest (ROIs) in the ARG of the document image.
- Learn parameters for the fuzzy structural signature from the set of ROIs.
- Describe each ROI by a fuzzy structural signature.
- Cluster the signatures of the ROIs.
- Prepare an index (clusterID vs ROIs vs documentImage).
- Learn a BN.
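The index of the sixth step can be pictured as a nested mapping from cluster ID to document image to ROIs. The Python sketch below assumes that clustering has already assigned a cluster ID to every ROI. The function name `build_index` and the identifiers in the usage example are hypothetical, and the slides do not describe how the actual index is stored.

```python
from collections import defaultdict


def build_index(rois, cluster_ids):
    """Build a clusterID -> documentImage -> [ROI ids] mapping.

    rois        : list of (document_image, roi_id) pairs
    cluster_ids : cluster ID assigned to each ROI, in the same order
    """
    index = defaultdict(lambda: defaultdict(list))
    for (document, roi_id), cluster_id in zip(rois, cluster_ids):
        index[cluster_id][document].append(roi_id)
    # Convert to plain dicts so that later lookups cannot add empty entries.
    return {cid: dict(docs) for cid, docs in index.items()}


if __name__ == "__main__":
    rois = [("doc1.png", "roi1"), ("doc1.png", "roi2"), ("doc2.png", "roi3")]
    print(build_index(rois, [0, 1, 0]))
```

Keying the index by cluster ID means that a query only has to be assigned to a cluster in order to reach every document that contains a similar ROI.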

A Symbol Spotting & Focused Retrieval System
Content based focused retrieval for line drawing document images:
- Represent the query ROI by an attributed relational graph.
- Spot Regions Of Interest (ROIs).
- Describe each query ROI by a fuzzy structural signature.
- Classify the query ROIs using the BN.
- Retrieve documents using the repository index.
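The last retrieval step can be sketched as a lookup in a cluster-keyed index. The Python example below assumes that the classifier has already returned one cluster ID per query ROI and that the index maps clusterID to documentImage to a list of ROI ids. Ranking documents by their number of matching ROIs is an assumption made for this illustration and is not described on the slides. The name `retrieve` is hypothetical.

```python
from collections import Counter


def retrieve(index, query_cluster_ids):
    """Return document images ranked by their number of matching ROIs.

    index             : dict clusterID -> dict documentImage -> [ROI ids]
    query_cluster_ids : cluster IDs predicted for the query ROIs
    """
    scores = Counter()
    for cluster_id in query_cluster_ids:
        for document, roi_ids in index.get(cluster_id, {}).items():
            scores[document] += len(roi_ids)
    # Highest score first; ties are broken by document name for stability.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [document for document, _ in ranked]


if __name__ == "__main__":
    index = {
        0: {"doc1.png": ["roi1"], "doc2.png": ["roi3", "roi4"]},
        1: {"doc1.png": ["roi2"]},
    }
    print(retrieve(index, [0]))
```

Because the matching ROIs are stored in the index, the same lookup can also report where in each document the query content was spotted.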


Experimentation
Dataset: SESYD (Systems Evaluation SYnthetic Documents).
During the learning phase our system detected a total of [count not preserved] ROIs in electronic diagrams and 4586 ROIs in floorplans, which corresponds to approximately 108% of the symbols in each of the datasets.

Experimentation: document retrieval results
Results presented in ICPR2010.
Each point in the graph represents the precision and recall values for one query image.
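For readers unfamiliar with the two measures, the Python sketch below computes precision and recall for a single query using the standard set-based definitions. The function name and the document identifiers are hypothetical, and the slides do not say how relevance was judged in the experiments.

```python
def precision_recall(retrieved, relevant):
    """Precision and recall of one query.

    retrieved : documents returned by the system
    relevant  : documents that truly contain the queried content
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


if __name__ == "__main__":
    # Two of the four retrieved documents are relevant,
    # and two of the three relevant documents were found.
    print(precision_recall(["d1", "d2", "d3", "d4"], ["d1", "d2", "d5"]))
```

Computing one such pair per query image gives one point of a precision and recall plot.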

Conclusion and future work
- The overall framework makes it possible to prepare an index for the document repository in an unsupervised fashion, which is a very important contribution.
- However, the underlying method for ROI localization is based on a set of heuristics. In most cases it does not return a single symbol, and it needs to be improved.
- Future lines of work include designing a method that replaces the manually selected heuristics with automatically learned heuristics for spotting an ROI.

References
- Delalandre et al., “Building synthetic graphical documents for performance evaluation,” in GREC, LNCS, pp. 288–298, Springer.
- Delaplace et al., “Two evolutionary methods for learning Bayesian network structures,” in LNAI.
- Luqman et al., “A Content Spotting System For Line Drawing Graphic Document Images,” International Conference on Pattern Recognition, 2010, to appear.
- Luqman et al., “Vers une approche floue d’encapsulation de graphes: application à la reconnaissance de symboles,” Colloque International Francophone sur l'Ecrit et le Document, 2010.
- Luqman et al., “Graphic Symbol Recognition using Graph Based Signature and Bayesian Network Classifier,” Tenth International Conference on Document Analysis and Recognition (ICDAR), IEEE Computer Society, 2009, volume 10.
- Luqman et al., “Employing fuzzy intervals and loop-based methodology for designing structural signature: an application to symbol recognition,” Eighth IAPR International Workshop on Graphics RECognition (GREC), 2009, volume 8.
- Qureshi et al., “Combination of symbolic and statistical features for symbols recognition,” in IEEE ICSCN’2007.
- Qureshi et al., “Spotting symbols in line drawing images using graph representations,” in GREC, pp. 91–103, 2007.