Retrieval of the Ornaments from the Hand-Press Period: an Overview Etienne BaudrierLSIIT (Illkirch, France) Sébastien BussonCESR (Tours, France) Silvio.

Slides:



Advertisements
Similar presentations
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Advertisements

Aggregating local image descriptors into compact codes
Digital Color 24-bit Color Indexed Color Image file compression
Three things everyone should know to improve object retrieval
Presented by Xinyu Chang
Content-Based Image Retrieval
FRE 2645 GREC 2003 : 31 July 2003 Local Structural Analysis: a Primer Mathieu Delalandre¹, Eric Trupin¹, Jean-Marc Ogier² ¹PSI Laboratory, Rouen University,
Internet 2. Written & presented by: Martina Blackwood.
Relevance Feedback Retrieval of Time Series Data Eamonn J. Keogh & Michael J. Pazzani Prepared By/ Fahad Al-jutaily Supervisor/ Dr. Mourad Ykhlef IS531.
Foundation Level Course
Image Information Retrieval Shaw-Ming Yang IST 497E 12/05/02.
How the edges of a line, paragraph, object, or table are positioned horizontally and vertically between the margins or on a page.
ELPUB 2006 June Bansko Bulgaria1 Automated Building of OAI Compliant Repository from Legacy Collection Kurt Maly Department of Computer.
A Robust Approach for Local Interest Point Detection in Line-Drawing Images 1 The Anh Pham, Mathieu Delalandre, Sabine Barrat and Jean-Yves Ramel RFAI.
Tuesday, 13th November 2007 Work Group CALYPOD graphiCs imAge anaLYsis from Printed Old Document Presented by.
ACM Multimedia th Annual Conference, October , 2004
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Content-Based Image Retrieval (CBIR) Student: Mihaela David Professor: Michael Eckmann Most of the database images in this presentation are from the Annotated.
Detecting Image Region Duplication Using SIFT Features March 16, ICASSP 2010 Dallas, TX Xunyu Pan and Siwei Lyu Computer Science Department University.
Highlights Lecture on the image part (10) Automatic Perception 16
1/20 Document Segmentation for Image Compression 27/10/2005 Emma Jonasson Supervisor: Dr. Peter Tischer.
Document Image Analysis CSE 717 An Introduction. Document Image Analysis  DIA is the theory and practice of recovering the symbol structures of digital.
Introduction to Database Systems 1.  Assignments – 3 – 9%  Marked Lab – 5 – 10% + 2% (Bonus)  Marked Quiz – 3 – 6%  Mid term exams – 2 – (30%) 15%
Presenting by, Prashanth B R 1AR08CS035 Dept.Of CSE. AIeMS-Bidadi. Sketch4Match – Content-based Image Retrieval System Using Sketches Under the Guidance.
Groundtruthing for Performance Evaluation of Document Image Analysis Systems: a primer Mathieu Delalandre Pattern Recognition.
Introduction --Classification Shape ContourRegion Structural Syntactic Graph Tree Model-driven Data-driven Perimeter Compactness Eccentricity.
Chapter 12 Word Processing and Desktop Publishing: Printing It.
Bag-of-Words based Image Classification Joost van de Weijer.
Madonne Talk (Tours University) 7 th November 2006 A Fast System for Dropcap Image Retrieval Mathieu Delalandre and Jean-Marc Ogier L3i, La Rochelle University,
Hubert CARDOTJY- RAMELRashid-Jalal QURESHI Université François Rabelais de Tours, Laboratoire d'Informatique 64, Avenue Jean Portalis, TOURS – France.
Fast System for the Retrieval of Ornamental Letter Image M. Delalandre 1, J.M. Ogier 2, J. Lladós 1 1 CVC, Barcelona, Spain 2 L3i, La Rochelle, France.
1 CP586 © Peter Lo 2003 Multimedia Communication Font and Text.
Content-based Retrieval of 3D Medical Images Y. Qian, X. Gao, M. Loomes, R. Comley, B. Barn School of Engineering and Information Sciences Middlesex University,
Representing Information Digitally. Digitization Initially transforming data for computer use Assigning people social security numbers The creation of.
PowerPoint 2003 – Level 1 Computer Concepts Cathy Horwitz April 25, 2011.
Fast Retrieval of Old Dropcap Images Mathieu Delalandre, Jean-Marc Ogier and Josep Llados IDoc Meeting Valencia, Spain 22th February 2007.
1 Faculty of Information Technology Generic Fourier Descriptor for Shape-based Image Retrieval Dengsheng Zhang, Guojun Lu Gippsland School of Comp. & Info.
Tuesday, 6th November 2007 Work Group CALYPOD graphiCs imAge anaLYsis from Printed Old Document Thierry Brouard,
SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Like.com vs. Ugmode Non-infringement arguments *** CONFIDENTIAL *** Prepared by Ugmode, Inc.
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Towards Performance Evaluation of Symbol Recognition & Spotting Systems in a Localization Context Mathieu Delalandre CVC, Barcelona, Spain EuroMed Meeting.
Introduction --Classification Shape ContourRegion Structural Syntactic Graph Tree Model-driven Data-driven Perimeter Compactness Eccentricity.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
Competence Centre on Information Extraction and Image Understanding for Earth Observation 29th March 2007 Category - based Semantic Search Engine 1 Mihai.
2006 Mouse AHM Mapping 2D slices to 3D atlases - Application of the Digital Atlas Erh-Fang Lee Laboratory of NeuroImage UCLA.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Yixin Chen and James Z. Wang The Pennsylvania State University
A Performance Characterization Algorithm for Symbol Localization Mathieu Delalandre 1,2, Jean-Yves Ramel 2, Ernest Valveny 1 and Muhammad Muzzamil Luqman.
Chapter Three Presentation: User interface How to Build a Digital Library Ian H. Witten and David Bainbridge.
Presenting Documents How to Build a Digital Library Ian H. Witten and David Bainbridge.
Components of Computer. Output The data that has been processed into useful information is called output. Types –Screen – soft copy –Printer – hard copy.
A Performance Characterization Algorithm for Symbol Localization Mathieu Delalandre 1, Jean-Yves Ramel 2, Ernest Valveny 1 and Muhammad Muzzamil Luqman.
Work in progress in graphics recognition Mathieu Delalandre DAGMinar, 12th of May 2009, CVC, Barcelone, Spain.
1 Midterm Examination. 2 General Observations Examination was too long! Most people submitted by .
Using the Gamera framework for the recognition of cultural heritage materials Levy Project II Digital Knowledge Center, Sheridan Libraries, Michael Droettboom,
Presented by Mathieu Delalandre CESR Meeting CESR, Tours, France
Document Analysis Group
Chapter III, Desktop Imaging Systems and Issues: Lesson IV Working With Images
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Aim of the project Take your image Submit it to the search engine
Brief Review of Recognition + Context
Multimedia Information Retrieval
CHAPTER 7: Information Visualization
Search and Retrieval in a Virtual World
Mathieu Delalandre, Pierre Héroux, Sébastien Adam, Eric Trupin,
Presentation transcript:

Retrieval of the Ornaments from the Hand-Press Period: an Overview Etienne BaudrierLSIIT (Illkirch, France) Sébastien BussonCESR (Tours, France) Silvio CorsiniBCU (Lausanne, Switzerland) Mathieu DelalandreCVC (Barcelona, Spain) Jérôme LandréCReSTIC (Troyes, France) Frédéric Morain-NicolierCReSTIC (Troyes, France)

Plan About this work … Hand Press Period About Ornaments Digital Collection of Ornaments How DIA can help ? Content Based Image Retrieval Visual Comparison Conclusions and Perspectives

About this work … Computer Science People 1.Etienne Baudrier 2.Mickael Coustaty 3.Mathieu Delalandre 4.Nathalie Girard 5.Nicholas Journet 6.Dimosthenis Karatzas 7.Jerome Landré 8.Kamel Ait-Mohand 9.Jean-Marc Ogier 10.Nicolas Ragot 11.Jean-Yves Ramel Human Science People 1.Pierre Aquilon 2.Sébastien Busson 3.Silvio Corsini 4.Marie-Luce Demonet 5.Stephen Rawles 6.Toshinori Uetani One-day Workshop 13 th November 2007 CESR, Tours city, France CESR Labs of Human Science Labs of Computer Science

Hand Press Period (1/2) The Hand-Press period runs from around 1454 (approximate date of Gutenberg’s invention) to through the first half of the nineteenth century (when mechanized presses started to appear). a hand-press book 1454 Gutenberg half 18 th mechanized presses Hand Press hand press character matrix

Hand Press Period (2/2) HPB Database 22 European libraries half 19 th 3 Millions books Trinity old library (Dublin, Ireland) 16 th - today Mathematics, medicine, history, music, religion, literature, etc.

About Ornaments (1/2) Ornaments in pages “lettrine” “fleuron” to start a paragraph trademark of a printing house “cul de lampe” to close a part or a chapter to epitomize a concept, or to represent a person, such as a king or saint. “emblème” Categories of ornaments ornamentstext

About Ornaments (2/2) Page 3,4 ornaments/page Book 103,4 ornaments/book Foreground pixels [Journet’05] Text63% Graphics37% Part of ornaments in books (BVH dataset, 46 books) sciences, medical, religion … Hand Press books are composed for a large part of ornaments. Pictures were a powerful mean of communication at this period due to the low education level of people.

Digital Collections of Ornaments (1/2) 25H112rocks 44G312prisoner; in fetters 91E461punishment of Prometheus; he is chained to a rock, usually by Vulcan and/or Mercury 91E4611an eagle tears at Prometheus' liver Digitalization Pre-processing (deskew, lighting correction, filtering, cropping…) Layout analysis and segmentation [Ramel’07] Expert Classification using thesaurus icon class encoding of an emblem image

Digital Collections of Ornaments (2/2) DLsSizePeriodsWeb links BVH th Fleuron th Impact th -18 th Mouriau th Moriane th Collections of ornaments are small in regard to mass digitalization collections (e.g. Million Book Project), two main reasons: (1)Mass digitalization projects are thought in terms of OCR only (layout analysis aims to perform text/graphics separation, final electronic documents are “ASCII code”, no use of high-level document model)  Digitalization programs should consider better the graphics aspects. (2)Classification using thesaurus by human experts is time consuming (15-20 mn per image)  Collaborative platforms, integrating DIA components, can help in. Other smallest datasets are ArtDico, Canadian heraldry, Printers' Devices, etc.

How DIA can help ? (1/2) A duplicated block Redundancy of ornaments in books A same block used in 2 books Vascosan 1555Marnef 1576 Printing house tampon exchange copy Tracking of plugs noise offset precision skewing scaling scalability, mass of data weak resolution, lossy compression

How DIA can help ? (2/2) DB 1 DB 2 CBIR DB n --- Query image Visual Comparison R1 R2 R3 Context information Publication dates Publication places Practices of printers … submit a query retrieval results comparison visualization assign previous classification Meta Digital Collections Of ornaments

Content Based Image Retrieval Ideal method High precision (weak difference) Robust (noise, skew, offset) Invariant to scale Fast comparison (online, mass of data) Scalable Precision Scale invariant Speed Images used forthe experiments Image adequacy Bigun’96-no++500medium Chen’03-no+50large Baudrier’08++no--68none Delalandre’07+no-2048none Bigun’96 Chen’03 w h hw Radiogram 0°Radiogram 90° Detection of key points (Haris) Zernike moments (local template) Nearest points compared with a likelihood estimation Baudrier’08 Expert set resolution analysis Hausdorff distance between images SVM classification Delalandre’07 Run Length Encoding Histogram centering RLE Comparison Orientation Radiograms Fourier Descriptors Euclidean Distance Comparison

Visual Comparison Ideal method Highlight pertinent differences Make an hypothesis of relative dating Invariant to scale Robust (noise, skew, offset) Beusekom’07 Detection of points of interest (connected components) Pixel to Pixel Difference Map (PPDMap) PPDMapBlockA#1 LDMap BlockA#2 Baudrier’07 Equivalent ellipse computation (first image moments) Local Dissimilarity Map (LDMap) Image Registration Visualization Method

Conclusions and Perspectives Large ornament material is available, but there is few digital collections Digitalization programs should consider better the graphics aspects. Collaborative platforms, integrating DIA components, can help in. Two database levels (with, without thesaurus classification) DIA components CBIR systems (orientation signature, points of interest, image distance, compressed representation) Lack of evaluation of the methods make difficult the comparison To define benchmark datasets (time, precision/recall) Methods propose a tradeoff between complexity/precision, possible combination Visual Comparison (registration, PPDMap, LDMap) Hard point is the registration, user interaction could help in