DML-CZ: Scanning and adjusting the images Martin Lhoták Academy of Sciences Library Launching the DML-CZ 11. 5. 2008 Prague.

Presentation transcript:

DML-CZ: Scanning and adjusting the images Martin Lhoták Academy of Sciences Library Launching the DML-CZ Prague

DML-CZ Workflow: 1. Preparation; 2. Scanning and adjusting the images; 3. OCR; 4. Metadata harvesting (MR, ZBL); 5. Integration; 6. Digital Library.

Content: 1. Digitization Centre of the AS Library; 2. Scanning; 3. Adjusting the images; 4. Basic metadata; 5. OCR; 6. Backup and movement of the data; 7. Production to date.

Digitization Centre of the AS Library: In operation since [...]. Built with support from the EU Solidarity Fund after the 2002 floods in Czechia. Main aim: to build a digital library of scientific publications published by the Academy of Sciences of the Czech Republic (Digital Library of ASCR). Partner of the DML-CZ project since 2005.

The Academy of Sciences of the Czech Republic: > 50 scientific institutes; 7500 employees (4000 in R&D); > [...] articles, reports, etc. a year; publishes > 90 journals (circa 3000 articles); > 100 years of history.

Digitization Centre of the AS Library: 2 x A2 bw scanners Zeutschel OS [...]; [...] x A1 color scanner Digibook [...]; [...] x A4 fast production scanner Panasonic. Staff: 8 to 10 people. Monthly production: [...] pages. Overall production: > [...] pages.

DML-CZ: Scanning. 2 x A2 bw scanners Zeutschel OS [...]; [...] DPI, 4-bit greyscale; 1 page = 1 file, usually A5; TIFF with lossless LZW compression, circa 10 MB.
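The relation between page size, resolution, bit depth and file size on this slide can be sketched with a little arithmetic. The following is a rough illustration only: the scanning resolution is not legible in the transcript, so the DPI values tried below are assumptions, and the function name is hypothetical. It computes the raw (uncompressed) raster size; the size after LZW compression depends on the page content.

```python
# Illustrative only: the DPI values tried below are assumptions,
# not figures taken from the slides.
MM_PER_INCH = 25.4
A5_MM = (148, 210)  # ISO 216 A5 page, width x height in millimetres


def uncompressed_size_bytes(width_mm, height_mm, dpi, bits_per_pixel):
    """Raw raster size of one scanned page, before any compression."""
    width_px = round(width_mm / MM_PER_INCH * dpi)
    height_px = round(height_mm / MM_PER_INCH * dpi)
    total_bits = width_px * height_px * bits_per_pixel
    return (total_bits + 7) // 8  # round up to whole bytes


# Compare a few candidate resolutions for a 4-bit greyscale A5 page.
for dpi in (300, 400, 600):
    size = uncompressed_size_bytes(*A5_MM, dpi, bits_per_pixel=4)
    print(f"{dpi} DPI: {size / 1_000_000:.1f} MB uncompressed")
```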

Image Adjusting: Software Book Restorer from i2S, designed to process scanned books. Operations: geometrical correction; crop; blur; binarization; despeckle.
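Two of the operations named on this slide, binarization and despeckling, can be illustrated on a tiny greyscale raster. This is a minimal sketch and not Book Restorer's algorithm, which the slides do not describe: it assumes a fixed global threshold and treats a speckle as a black pixel with no black neighbour. The function names and the sample data are hypothetical.

```python
def binarize(gray, threshold):
    """Map each greyscale value to 0 (black) or 1 (white) with a fixed threshold."""
    return [[1 if value >= threshold else 0 for value in row] for row in gray]


def despeckle(binary):
    """Turn isolated black pixels (no black 8-neighbour) into white."""
    height = len(binary)
    width = len(binary[0]) if height else 0
    cleaned = [row[:] for row in binary]  # work on a copy, read from the original
    for y in range(height):
        for x in range(width):
            if binary[y][x] != 0:
                continue
            has_black_neighbour = any(
                binary[ny][nx] == 0
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
                if (ny, nx) != (y, x)
            )
            if not has_black_neighbour:
                cleaned[y][x] = 1
    return cleaned


# 4-bit greyscale sample (values 0..15): one stray dark pixel and a short stroke.
page = [
    [15, 15, 15, 15],
    [15, 2, 15, 15],
    [15, 15, 15, 3],
    [15, 15, 15, 4],
]
clean = despeckle(binarize(page, threshold=8))
```

In this sample the stray pixel is removed while the two-pixel stroke in the last column survives, because its pixels are neighbours of each other.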

Basic Metadata: XML (DTD of the Czech National Library); title and basic bibliographic data; physical size of the journal; numbers of pages; software Sirius (CZ).

OCR: FineReader runs: 1. to recognize the language of each paragraph; 2. to do OCR with the right language. OCR workflow developed by the team of Dr. P. Sojka. Output: double-layer PDF: layer 1, the scanned picture; layer 2, the "OCRed" text.
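The two-run idea (first identify the language of each paragraph, then recognize the text with that language selected) can be sketched as plain control flow. This is an assumption-laden outline, not the workflow of Dr. Sojka's team and not the FineReader API: `detect_language` and `recognize` are hypothetical callables that the caller supplies, standing in for whatever OCR engine is used.

```python
from typing import Callable, Dict, List


def two_pass_ocr(
    paragraphs: List[bytes],
    detect_language: Callable[[bytes], str],   # hypothetical: image -> language code
    recognize: Callable[[bytes, str], str],    # hypothetical: (image, language) -> text
) -> List[Dict[str, str]]:
    """Run language detection on every paragraph, then OCR each with its language."""
    # Pass 1: decide the language of each paragraph image.
    languages = [detect_language(image) for image in paragraphs]

    # Pass 2: recognize the text using the language found in pass 1.
    results = []
    for image, language in zip(paragraphs, languages):
        results.append({"language": language, "text": recognize(image, language)})
    return results
```

Passing the engine in as callables keeps the sketch independent of any particular OCR product; the recognized text would then be written as the hidden text layer of the PDF.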

Backup and movement of the data: Main steps and outputs: 1. scanning: TIFF; 2. image adjustment and basic metadata: TIFF, XML; 3. OCR: PDF. After each step above: one copy to the server in Brno; two copies on LTO tapes.
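Copying each step's output to several destinations can be sketched as follows. The slides do not say how the copies are made or verified, so this is only an illustration under assumptions: destinations are modelled as ordinary directories (for example a mounted server share and tape staging areas), a SHA-256 checksum is used to confirm each copy, and the function names are hypothetical.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copy_verified(source, destinations):
    """Copy one file into every destination directory and verify each copy."""
    source = Path(source)
    expected = sha256_of(source)
    written = []
    for directory in destinations:
        directory = Path(directory)
        directory.mkdir(parents=True, exist_ok=True)
        target = directory / source.name
        shutil.copy2(source, target)  # copies content and file metadata
        if sha256_of(target) != expected:
            raise IOError(f"checksum mismatch for {target}")
        written.append(target)
    return written
```

A call such as `copy_verified("page_0001.tif", ["/mnt/server", "/mnt/tape_a", "/mnt/tape_b"])` would mirror the one-server, two-tape pattern; the paths here are placeholders.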

Production for DML-CZ to date: Scanning: [...] pages; image adjustment: [...] pages; basic metadata: [...] pages; OCR: [...] pages. Disproportion: some data was obtained from GDZ Goettingen.

Alternative output of the Acad. of Sci. mathematic

Thank you! Questions? Martin Lhoták