Download presentation
Presentation is loading. Please wait.
Published byLucas Cogswell Modified over 10 years ago
1
MADONNE MAsses de DONnées issues de la Numérisation du patrimoiNE Project Leader : Jean-Marc OGIER L3i Laboratory, la Rochelle University Tel : 0033 5 46 45 82 15 – jean-marc.ogier@univ-lr.frjean-marc.ogier@univ-lr.fr NAVIDOMASS NAVigation In DOcument MASSes Two French Projects on Analysis of Cultural Heritage Documents Mathieu Delalandre (CVC) IDoc Meeting, Valencia (Spain) 22th February 2007
2
MADONNE MAsses de DONnées issues de la Numérisation du patrimoiNE French ANR program “Masse de données” Length36 months Funding110 000 € NAVIDOMASS NAVigation In DOcument MASSes French ANR program “Masse de données et connaissances” Length36 months Funding550 000 € Introduction Strategy Model ProcessingGUI High-Level Meta- Data of images Structured and Indexed Information Cultural Document Images System Scope of projects … The cultural heritage documents correspond to a very large mass of data. The Madonne/NaviDoMass projects develop document analysis systems allowing to index and to browse inside this mass of data. 2003 2004 2005 2006 2007 2008 2009 Years Calendar …
3
Consortium Centre de Recherche en Informatique de Paris 5 (Paris) Institut de Recherche en Informatique et Systèmes Aléatoires (Rennes) Laboratoire Informatique (Tours) Laboratoire d'InfoRmatique en Image et Systèmes d'information (Lyon) Laboratoire Lorrain de Recherche en Informatique et ses Applications (Nancy) Laboratoire d'Informatique de Traitement de l'Information (Rouen) Laboratoire d’informatique image et interaction (La Rochelle) Centre d’Etude Supérieures de la Renaissance (Tours) Professor8 Lecturer14 Post-Doctoral3 PhD Student9 Master Student 15 Engineer6 55 Project Members Permanent On the last 3 years Companies5HP, APROGEIDE … Libraries5CHAN, British library … Research Centers10CVC, Indian SI … 20 Project Partners
4
Overview Document Layout [Ramel’05] Bloc segmentation into footnote, text zone, dropcap, figure,.. Background analysisForeground analysis Merging 10 000 pages of old printed books Text density Graphic density Collection Modelling [Journet’06] Directional rose Old printed books
5
Overview Graphem based signature for handwritten patronymic retrieval Document Layout and Retrieval [Couasnon’05] Segmented Cells (1) Line extraction based on Kalman Filter (2) Positioning Grammar to correct and build cells from extracted lines 60 000 Forms of XIX° Century Form viewer Retrieved patronymic “access to form” Query Text Field
6
Overview text erasure interline Document Layout [Nicolas’06] Handwritten pages of XIX° century Segmentation based on Markov Random Field Dropcap Retrieval [Parreti’05] [Uttama’05] [Delalandre’06] [Salmon’05] 10 000 dropcap images Pattern rank Frequency Style retrieval texturesMST image Structure retrieval Printing retrieval query compacityRLE Accuracy Letter retrieval imagecapital letter combination of shape descriptors
7
PhD Thesis4 Journal Paper8 Conference Paper43 Master Thesis15 Technical Report6 76 Publications http://l3iexp.univ-lr.fr/madonne/publications.html 33 Softwares Licence2 Free4 Prototype27 http://l3iexp.univ-lr.fr/madonne/ressources.html Conclusion Results Consortium 8 laboratories, 55 members Renew of project NaviDoMass WP related to MADONNE Perspectives NaviDoMass started since November 2007 … 5 Work Package (WP) 1.Document Layout analysis and structure based indexing 2.Information spotting 3.Structuring the feature space 4.User needs, participative design and groundtruthing 5.Interactive extraction and relevance feedback New topics
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.