Madonne Talk (Tours University) 7 th November 2006 A Fast System for Dropcap Image Retrieval Mathieu Delalandre and Jean-Marc Ogier L3i, La Rochelle University, France
Madonne Talk (Tours University) 7 th November 2006 Short CV
Madonne Talk (Tours University) 7th November 2006 Short CV mais aussi des bandeaux, portraits, armoiries, fleurons, marques … Personal Information Mathieu Delalandre, 32 years old, Married Academic Degrees Lic.Sc In Industrial Computing Rouen University, France M.Sc in Computer Science Rouen University, France Research Experiences (5 years, Graphics Recognition) 04/01-09/01Master PSI Laboratory (Rouen, France) 10/01-04/05PhDPSI Laboratory (Rouen, France) 05/05-09/05Post-docSCSIT (Nottingham, England) 10/05-10/06Post-docL3i Laboratory (La Rochelle, France) 11/06-12/06Post-docPSI Laboratory (Rouen, France) 01/07-12/09Post-docCVC (Barcelone, Spain)
Madonne Talk (Tours University) 7 th November 2006 Introduction - Old books - Old graphics retrieval - Our problem
Madonne Talk (Tours University) 7th November 2006 Introduction Old books Old books of XV° and XVI° centuries Samples Bartolomeo (1534) Alciati (1511) Laurens (1621) figure dropcap headline Example of digitized database (BVH, CESR Tours) Book46 Page1385 Graphics4755 (3.4/page) Foreground pixel 63% textual 37% graphical Graphics type41% dropcap 59% others Old Graphics - Old books - Old graphics retrieval - Our problem
Madonne Talk (Tours University) 7th November 2006 Introduction Old graphics retrieval - Old books - Old graphics retrieval - Our problem Image Database Query ExtractionComparison Index Indexing Retrieval Manual Index System overview General architecture Samples Pareti’05 Graphics style Zip law Uttama’05 Document layout MST Baudrier’05 Sub image Hausdorff distance Bigun’96 Stroke image Radiogram orientation letter (c)topic (vegetal) pattern (cross) Retrieval criterion
Madonne Talk (Tours University) 7th November 2006 Introduction Our problem (1/2) Context MAsse de DOnnées issues de la Numérisation du patrimoiNE (MADONNE) Project Bibliothèques Virtuelles Humanistes (BVH) du Centre d’Etudes Supérieures de la Renaissance (CESR) Class 1 Class 2 Class 3 printing Wood plug (bottom view) Vascosan 1555Marnef 1576 Wood Plug Tracking Printing house tampon exchange copy Old books - Old graphics retrieval - Our problem
Madonne Talk (Tours University) 7th November 2006 Introduction Our problem (2/2) Problem features No scaled, no oriented Noise Offset Complexity Accuracy Scalability descriptors fast local complex global Descriptor choice To scalar [Loncaric’98] Hough, Radon, Zernike, Hu, Fourrier Scaled and orientation invariant fast local To image [Gesu’99] Template matching, Hausdorff distance no scaled and orientation invariant global (scene) Query Compression Centering and Comparison R1 R2 R3 Formatting Image Database - Old books - Old graphics retrieval - Our problem
Madonne Talk (Tours University) 7 th November 2006 Our system Compression Centering and Comparison Formatting
Madonne Talk (Tours University) 7th November 2006 Our system Formatting Digitalization problems [Lawrence’00] Problem sources Several image providers Several digitalization tools Length of process Human supervised … QUEID « QUery Engine on Image Database » Diagnostic Base Expertis e QUEID query charts analysis Format Compression Centering and Comparison Formatting OLDB (Ornamental Letters Database) Before (oldb.jpg)oldb.jpg After Packbits and JpegCompression ?; from 72 to 450 dpiResolutions Jpeg and TiffFormats gray and colourModel MpSize 2803Files 250 to 350Resolutions UncompressCompression TiffFormats grayModel MpSize 2038Files
Madonne Talk (Tours University) 7th November 2006 Our system Compression Run based compression Run Length Encoding (RLE) Compression rate RLE Types image foreground background both OLDB results Fixed threshold binarisation Both RLE Compression Centering and Comparison Formatting
Madonne Talk (Tours University) 7th November 2006 Our system Centering and comparison Centering x2x2 x2x2 x2x2 x1x1 x1x1 x1x1 x2x2 x2x2 x1x1 line (y) image 1 line (y+d y ) image 2 x stack pointeur while x 2 x 1 handle image 2 while x 1 x 2 handle image 1 OLDB results Max Mean Min Time s Size k.pixel Max Mean Min Time s Size k.run Comparison Compression Centering and Comparison Formatting image database query image
Madonne Talk (Tours University) 7 th November 2006 In progress
Madonne Talk (Tours University) 7th November 2006 In progress query 1 st Level 2 sd Level Our problem Current time : 40 s Wished time : < 4 s To use a lossless compressio n To use a system approach Key idea First system Level 1 : image sizes Level 2 : black, white pixels Level 3 : RLE comparison Depth Speed Selection algorithm 11 22 if 1 - 2 < 0 push x, cluster while 1 - 2 < 0 next
Madonne Talk (Tours University) 7th November 2006 In progress OLDB results 59%Max 24%Mean 4%Min Depth % To decrease variability To work on selection To add a level Run based signature
Madonne Talk (Tours University) 7th November 2006 In progress Query example Same plug Next plug Query Performance evaluation Base IHM Retrieve engine control display retrieve Labels driven labelling Bench1Bench2 To produce Criterion ? - Scalability - Accuracy - Time processing Benchmar k system
Madonne Talk (Tours University) 7 th November 2006 Conclusions and perspectives
Madonne Talk (Tours University) 7th November 2006 Conclusions et perspectives Conclusions Dropcap image retrieval « wood tracking » Formatting image database (QUEID) Fast approach, two features RLE comparison ( 7 to 9) Top-down strategy ( 2 to 20) Results 10 s for 2000 images (300 Mo) Perspectives Working on RLE signature Benchmark system for performance evaluation
Madonne Talk (Tours University) 7 th November 2006 Bibliography
Madonne Talk (Tours University) 7th November 2006 Bibliography 1. J. Bigun, S. Bhattacharjee, and S. Michel. Orientation radiograms for image retrieval: An alternative to segmentation. In International Conference on Pattern Recognition (ICPR), volume 3, pages , V. D. Gesu and V. Starovoitov. Distance based function for image comparison. Pattern Recognition Letters (PRL), 20(2): , S. Loncaric. A survey of shape analysis techniques. Pattern Recognition (PR), 31(8): , R. Pareti and N. Vincent. Global discrimination of graphics styles. In Workshop on Graphics Recognition (GREC), pages , S. Uttama, M. Hammoud, C. Garrido, P. Franco, and J. Ogier. Ancient graphic documents characterization. In Workshop on Graphics Recognition (GREC), pages , E. Baudrier, G. Millon, F. Nicolier, and S. Ruan. A fast binary-image comparison method with local-dissimilarity quantification. In International Conference on Pattern Recognition (ICPR), volume 3, pages , 2006.
Madonne Talk (Tours University) 7 th November 2006 Thanks …