Using the Gamera framework for the recognition of cultural heritage materials Levy Project II Digital Knowledge Center, Sheridan Libraries, Michael Droettboom,

Slides:



Advertisements
Similar presentations
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Advertisements

1/41 OCVE 2004 Fujinaga Levy Sheet Music Project and Optical Music Recognition introducing Gamut Ichiro Fujinaga McGill University OCVE Workshop (May 2004)
Input to the Computer * Input * Keyboard * Pointing Devices
1 Institutional Repository (IR) Models Rutgers University Community Repository (RUcore) A digital library perspective (objects and collections) Flexible.
Extraction of text data and hyperlink structure from scanned images of mathematical journals Ann Arbor, March 19, 2002 Masakazu Suzuki (Kyushu University)
Text Operations: Preprocessing. Introduction Document preprocessing –to improve the precision of documents retrieved –lexical analysis, stopwords elimination,
Document Analysis Systems Bedola Roberto, Bordoni Davide, Franc Vojtech Supervised by Luca Lombardi Overview: Introduction Our implementation of Wahl et.
WMES3103 : INFORMATION RETRIEVAL
Aletheia Apostolos Antonacopoulos PRImA Lab, The University of Salford, United Kingdom
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
ExpressReader Pro adopted to retrodigitization of mathematical documents Kazuaki Yokota.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Why choose Drupal?
Document Image Analysis CSE 717 An Introduction. Document Image Analysis  DIA is the theory and practice of recovering the symbol structures of digital.
'DIGITAL LIBRARY MANAGER' Profile Sector to which the profile is associated: Culture Fabrizio Melorio
Business Communication, 15e
IRISDocument Server IRISPowerscan IRISCapture Pro/X4D Alone or together to meet your needs Alejandro Grüssi VAR / OEM Account Manager.
DigiMuse Digitalizing and Vocalizing Sheet Music for Mobile Devices running on Android OS by GOBİT.
Software Engineering Project Fruit Recognition Zheng Liu.
Optical Music Recognition Ichiro Fujinaga McGill University 2003.
Document classification and document recognition for partners based on highly optimized free text and layout analysis Manfred Traeger, Head of Research.
DIVA. What Is Diva ? Diva is a software infrastructure for visualizing and interacting with dynamic information spaces. Visualizations are built by hooking.
Chapter 7 Structuring System Process Requirements
Knowledge Systems Lab JN 9/10/2002 Computer Vision: Gesture Recognition from Images Joshua R. New Knowledge Systems Laboratory Jacksonville State University.
Lakeland Click arrow to advance show. Click on the “A” under “Listed By Name.” (“A” for Academic Search Database)
The most powerful high-speed scanning, indexing and OCR solution on the market Supports many high speed scanners: Fujitsu, Canon, Kodak, Epson, Avision,
Organizing Data and Information AD660 – Databases, Security, and Web Technologies Marcus Goncalves Spring 2013.
Machine Learning in Spoken Language Processing Lecture 21 Spoken Language Processing Prof. Andrew Rosenberg.
K. Zagoris, K. Ergina and N. Papamarkos Image Processing and Multimedia Laboratory Department of Electrical & Computer Engineering Democritus University.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Problem description and pipeline
Digital Image Processing & Analysis Spring Definitions Image Processing Image Analysis (Image Understanding) Computer Vision Low Level Processes:
Confidential, I.R.I.S. © 2005, All rights reserved I.R.I.S. new OCR Software suite: A full range for document conversion, for private and corporate users.
1 Digital Image Processing Dr. Saad M. Saad Darwish Associate Prof. of computer science.
Understanding The Semantics of Media Chapter 8 Camilo A. Celis.
1 (21) EZinfo Introduction. 2 (21) EZinfo  A Software that makes data analysis easy  Reveals patterns, trends, groups, outliers and complex relationships.
Handwritten Hindi Numerals Recognition Kritika Singh Akarshan Sarkar Mentor- Prof. Amitabha Mukerjee.
Module Overview. Aims apply your programming skills to an applied study of Digital Image Processing, Digital Signal Processing and Neural Networks investigate.
Image Processing Part II. 2 Classes of Digital Filters global filters transform each pixel uniformly according to the function regardless of its location.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Gamera Optical Music Recognition in a New Shell Michael Droettboom, Karl MacMillan Sheridan Libraries Johns Hopkins University Ichiro Fujinaga McGill University.
INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
Scanned Documents INST 734 Module 10 Doug Oard. Agenda Document image retrieval  Representation Retrieval Thanks for David Doermann for most of these.
Digitization of the Lester S. Levy Collection of Sheet Music Ichiro Fujinaga McGill University with Michael Droettboom, Karl MacMillan, G. Sayeed Choudhury,
Web Application Overview RISK REDUCTION OVERVIEW.
“Moh’d Sami” AshhabSummer 2008University of Jordan MATLAB By (Mohammed Sami) Ashhab University of Jordan Summer 2008.
Evolution of C and C++ n C was developed by Dennis Ritchie at Bell Labs (early 1970s) as a systems programming language n C later evolved into a general-purpose.
An Augmented Reality iOS Application for Music Education Presenter: Kristen Brown.
The world’s libraries. Connected. CONTENTdm ® Digital Collection Management Solutions Learn what to consider when outsourcing your library’s digitization.
Optical Character Recognition
The world’s libraries. Connected. The Benefits of CONTENTdm Hosting Services OCLC’s Digital Lifecycle Webinar Series April 9, 2013.
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
DATA MINING © Prentice Hall.
S.Rajeswari Head , Scientific Information Resource Division
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Natural Language Processing (NLP)
Extracting Old Persian Cuneiform Font Out of
(Mohammed Sami) Ashhab
Gamera A Toolkit for Structured Document Recognition including Music
DIGITAL LIBRARY.
Optical Music Recognition
Introduction to Neural Networks And Their Applications - Basics
Presented By : Sriram Subramanian CSE 6331
Natural Language Processing (NLP)
Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution - Non-Commercial - Share Alike 3.0 License..
III. Introduction to Neural Networks And Their Applications - Basics
AI Discovery Template IBM Cloud Architecture Center
Natural Language Processing (NLP)
Presentation transcript:

Using the Gamera framework for the recognition of cultural heritage materials Levy Project II Digital Knowledge Center, Sheridan Libraries, Michael Droettboom, Ichiro Fujinaga, Karl MacMillan, G. Sayeed Choudhury, Tim DiLauro, Mark Patton, Teal Anderson

Overview Application area: cultural heritage materials The Gamera system Demo

Ingestion of cultural heritage materials Digitizing images is a good first step......however, new kinds of inquiry and retrieval require converting the images into a more useful format

The problem Commercial OCR systems are often inadequate for non-standard documents The market for specialized recognition of historical documents is very small Researchers performing document recognition often “re-invent” the basic image processing wheel

The solution Provide easy to use tools to allow domain experts (people with specialized knowledge of a collection) to create custom recognition applications

Gamera Allows rapid development of domain-specific document recognition applications Domain experts can customize and control all aspects of the recognition process Includes an easy-to-use interactive environment for experimentation

How Gamera is like Matlab Very high-level programming language Close to the problem at hand Supports incremental (test-and-refine) approach to software development User can see the results of the program interactively Provides a large set of pre-designed building blocks

How Gamera is unlike Matlab Designed specifically for structured documents Narrower scope makes it simpler to use Open-source and open standards-compliant Gamera scripts are production applications Can be batched, to process a number of images consecutively

Tools provided by Gamera Pre-processing Document segmentation and analysis Symbol segmentation and analysis Syntactical or structural analysis Output

Pre-processing Photoshop-like filtering of the image Noise removal Blurring De-skewing Binarization

Document segmentation and analysis Dividing the document into high-level units

Symbol segmentation and classification Dividing the image into individual symbols Using a classifier to label them Classifiers can include k-nearest neighbor, neural-nets, hidden Markov models, etc.

Symbol segmentation and classification Original image

Symbol segmentation and classification Staff removed image

Symbol segmentation and classification Connected component segmentation

Symbol segmentation and classification Labeling by a classifier

Syntactical or structural analysis Grouping the symbols into meaningful units and performing basic analysis on their relationships In text: lines, paragraphs, columns, etc.

Demonstration A whirlwind tour of the Gamera system