Groundtruthing for Performance Evaluation of Document Image Analysis Systems: a primer Mathieu Delalandre Pattern Recognition.

Slides:



Advertisements
Similar presentations
Indexing DNA Sequences Using q-Grams
Advertisements

Face Recognition Face Recognition Using Eigenfaces K.RAMNATH BITS - PILANI.
Shape From Light Field meets Robust PCA
1 Building a Dictionary of Image Fragments Zicheng Liao Ali Farhadi Yang Wang Ian Endres David Forsyth Department of Computer Science, University of Illinois.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Chapter 1: Introduction to Pattern Recognition
Object Localization using affine invariant substructure constraints Ishani Chakraborty and Ahmed Elgammal, Dept. of Computer Science, Rutgers University.
Locally Constraint Support Vector Clustering
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
A Study of Approaches for Object Recognition
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views
Object Recognition Using Distinctive Image Feature From Scale-Invariant Key point D. Lowe, IJCV 2004 Presenting – Anat Kaspi.
Evaluating the Quality of Image Synthesis and Analysis Techniques Matthew O. Ward Computer Science Department Worcester Polytechnic Institute.
Highlights Lecture on the image part (10) Automatic Perception 16
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Robust Linear Registration of CT images using Random Regression Forests Ender Konukoglu 1, Antonio Criminisi 1, Sayan Pathak 2, Duncan Robertson 1, Steve.
1 Street Generation for City Modeling Xavier Décoret, François Sillion iMAGIS GRAVIR/IMAG - INRIA.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Introduction to Software Testing
Exploration of Ground Truth from Raw GPS Data National University of Defense Technology & Hong Kong University of Science and Technology Exploration of.
Facial Recognition CSE 391 Kris Lord.
IIIT HyderabadUMASS AMHERST Robust Recognition of Documents by Fusing Results of Word Clusters Venkat Rasagna 1, Anand Kumar 1, C. V. Jawahar 1, R. Manmatha.
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
 A set of objectives or student learning outcomes for a course or a set of courses.  Specifies the set of concepts and skills that the student must.
Hubert CARDOTJY- RAMELRashid-Jalal QURESHI Université François Rabelais de Tours, Laboratoire d'Informatique 64, Avenue Jean Portalis, TOURS – France.
Vectorial Distortion For Performance Evaluation Current investigations …. Mathieu Delalandre and Ernest Valveny Meeting of Document Analysis Group Computer.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Regulatory Affairs Domain
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
FlowString: Partial Streamline Matching using Shape Invariant Similarity Measure for Exploratory Flow Visualization Jun Tao, Chaoli Wang, Ching-Kuang Shene.
PageRank for Product Image Search Kevin Jing (Googlc IncGVU, College of Computing, Georgia Institute of Technology) Shumeet Baluja (Google Inc.) WWW 2008.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Adaptive Registration of Very Large Images Brian Jackson & Ardy Goshtasby Wright State University.
Avoiding Segmentation in Multi-digit Numeral String Recognition by Combining Single and Two-digit Classifiers Trained without Negative Examples Dan Ciresan.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
Symbol Recognition Contest 2009 current status Philippe Dosch 1, Ernest Valveny 2 and Mathieu Delalandre 2 1 LORIA, QGAR team, Nancy, France 2 CVC, DAG.
Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval Sheraz Ahmed, Koichi Kise, Masakazu Iwamura, Marcus Liwicki,
Towards real-time camera based logos detection Mathieu Delalandre Laboratory of Computer Science, RFAI group, Tours city, France Osaka Prefecture Partnership.
* Challenge the future Graduation project 2014 Exploring Regularities for Improving Façade Reconstruction from Point Cloud Supervisors Dr. Ben Gorte Dr.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Ground Truth Free Evaluation of Segment Based Maps Rolf Lakaemper Temple University, Philadelphia,PA,USA.
Images Similarity by Relative Dynamic Programming M. Sc. thesis by Ady Ecker Supervisor: prof. Shimon Ullman.
Towards Performance Evaluation of Symbol Recognition & Spotting Systems in a Localization Context Mathieu Delalandre CVC, Barcelona, Spain EuroMed Meeting.
Marwan Al-Namari 1 Digital Representations. Bits and Bytes Devices can only be in one of two states 0 or 1, yes or no, on or off, … Bit: a unit of data.
Introduction to Pattern Recognition (การรู้จํารูปแบบเบื้องต้น)
INTRODUCTION TO BIOMATRICS ACCESS CONTROL SYSTEM Prepared by: Jagruti Shrimali Guided by : Prof. Chirag Patel.
Combining the strengths of UMIST and The Victoria University of Manchester A Generic Method for Evaluating Appearance Models and Assessing the Accuracy.
A Performance Characterization Algorithm for Symbol Localization Mathieu Delalandre 1,2, Jean-Yves Ramel 2, Ernest Valveny 1 and Muhammad Muzzamil Luqman.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
An Improved Approach For Image Matching Using Principle Component Analysis(PCA An Improved Approach For Image Matching Using Principle Component Analysis(PCA.
By Sridhar Godavarthy. Co-Author: Joshua Candamo Ph.D Advisors: Dr. Kasturi Rangachar Dr. Dmitry Goldgof.
Improvements on Automated Registration CSc83020 Project Presentation Cecilia Chao Chen.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
CMA Coastline Matching Algorithm SSIP’99 - Project 10 Team H.
Oisin Mac Aodha (UCL) Gabriel Brostow (UCL) Marc Pollefeys (ETH)
Building Synthetic Graphical Documents for Performance Evaluation M. Delalandre 1, T. Pridmore 2, E. Valveny 1, H. Locteau 3, E. Trupin 3 1 CVC, Barcelona,
Automated Geo-referencing of Images Dr. Ronald Briggs Yan Li GeoSpatial Information Sciences The University.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
A 2D/3D correspondence building method for reconstruction of a 3D bone surface model Longwei Fang
Work in progress in graphics recognition Mathieu Delalandre DAGMinar, 12th of May 2009, CVC, Barcelone, Spain.
Thomas Karle Kurt Teichenthaler
Pattern Recognition Sergios Theodoridis Konstantinos Koutroumbas
Fast Preprocessing for Robust Face Sketch Synthesis
A Generic Method for Evaluating Appearance Models and Assessing the Accuracy of NRR Roy Schestowitz, Carole Twining, Tim Cootes, Vlad Petrovic, Chris.
Reconstructing Shredded Documents
Shape matching and object recognition using shape contexts
A Graph-Based Approach to Learn Semantic Descriptions of Data Sources
Efficient Subgraph Similarity All-Matching
Presentation transcript:

Groundtruthing for Performance Evaluation of Document Image Analysis Systems: a primer Mathieu Delalandre Pattern Recognition and Image Analysis Group Laboratory of Computer Science François Rabelais University Tours city, France Digidoc meeting, 6 th of January

Groundtruthing for Performance Evaluation of Document Image Analysis Systems: a primer “Introduction” Performance evaluation is a particular cross-disciplinary research field in a variety of domains. Its purpose is the development of frameworks to evaluate and compare a set of methods in order to select the best-suited for a given application. Groundtruth must be reliable (i.e. 100% recognition rate) and exhaustive (label, localization, geometric transforms, noise estimation, metadata, etc.) Considering the document image analysis field (apart of the graphics), five main approaches exist. Characterisation Groundtruth Groundtruthing Results Performance evaluation System Training data dATA Data Degradation SpeedReliabilityTypeConstraint GUI based groundtruthing --Real (any)None Semi automatic transcription ++Real (any)None Electronic document mapping +++ Real (modern) Electronic document Transcript mapping ++Real (old)Transcription Generation of synthetic document ++ Synthetic (any) None

Groundtruthing for Performance Evaluation of Document Image Analysis Systems: a primer “GUI based groundtruthing” Principles: GUI plugged to a DIA systems, based on user correction. e.g. TrueViz [Kan’01], Xmillum [Hitz’00], PinkPanther [Yanikoglu’01], PerfectDoc [Yacoub’05], etc. Pros: Discussion about groundtruth formalism Cons: Time consuming considering the user correction, specific DIA chains must be designed for every application, groundtruth is not still reliable. 3 SpeedReliabilityTypeConstraint GUI based groundtruthing --Real (any)None Semi automatic transcription ++Real (any)None Electronic document mapping +++ Real (modern) Electronic document Transcript mapping ++Real (old)Transcription Generation of synthetic document ++ Synthetic (any) None

Groundtruthing for Performance Evaluation of Document Image Analysis Systems: a primer “Semi-automatic transcription” Principles: To exploit the context and user interaction to make more robust the recognition process. Transcription is achieved at metadata level, without considering the images. e.g. [Bal’ 08], [Lebourgeois’ 01] Algorithms: binarization and connected component labeling, shape context, image distance, clustering, etc. Pros: Interesting idea, 5% of labeling could result in 95% of correct transcription. Cons: What about the robustness, are we sure of a complete transcription, what about the impact of the segmentation, robustness of the approach is not proved yet. 4 SpeedReliabilityTypeConstraint GUI based groundtruthing --Real (any)None Semi automatic transcription ++Real (any)None Electronic document mapping +++ Real (modern) Electronic document Transcript mapping ++Real (old)Transcription Generation of synthetic document ++ Synthetic (any) None

Groundtruthing for Performance Evaluation of Document Image Analysis Systems: a primer “Electronic document mapping Principles: A registration algorithm estimates the global geometric transformation and then performs a robust local bitmap match to register an ideal document image to its corresponding scanned version. e.g. [Kan’96], [Hobby’98], [Beusekom’08], [Kim’02] Algorithms: Registration for transformation estimation, RAST (Recognition using Adaptive Subdivision of Transformation space), branch- and-bound algorithm Pros: The strongest approach of the literature. Cons: Can’t be applied with “old” documents, as an electronic version is mandatory. 5 SpeedReliabilityTypeConstraint GUI based groundtruthing --Real (any)None Semi automatic transcription ++Real (any)None Electronic document mapping +++ Real (modern) Electronic document Transcript mapping ++Real (old)Transcription Generation of synthetic document ++ Synthetic (any) None

Groundtruthing for Performance Evaluation of Document Image Analysis Systems: a primer “Transcript mapping” Principles: Transcript mapping eases the construction of document image segmentation ground truth that includes text-image alignment. e.g. [Stamatopoulos’10], [Zinger’09], [Jawahar’07], etc. Algorithms: HHM, DTW Pros: When no electronic documents exist, certainly the only valid way to obtain a groundtruth at the graphical level. Cons: Depends of the quality of transcriptions, producing transcriptions is time consuming, the approach is more sensitive to segmentation errors. 6

7 Groundtruthing for Performance Evaluation of Document Image Analysis Systems: a primer “Generation of synthetic document” SpeedReliabilityTypeConstraint GUI based groundtruthing --Real (any)None Semi automatic transcription ++Real (any)None Electronic document mapping +++ Real (modern) Electronic document Transcript mapping ++Real (old)Transcription Generation of synthetic document ++ Synthetic (any) None Principles: In such a system, the test documents are generated by an automatic system which combines pre-defined models of document components in a pseudo-random way. As documents are synthetically generated, the groundtruth becomes automatically available. e.g. [Heroux’07], [Zi’05], etc. Pros: No previous data is mandatory, efficient and exhaustive groundtruth is generated automatically. Cons: Synthetic is not real, to prove similarity between synthetic and real data is not so simple.