Digitalization I ain't happy, I'm feeling glad I got sunshine, in a bag I'm useless, but not for long My future is coming on By Alexander Popov
1.Why digitalization?
It makes books more accessible It makes books more accessible It provides more opportunities for books collections It provides more opportunities for books collections It makes process of reading easier It makes process of reading easier
LBook eReader, Sony LIBRIe
2. What format for digitalization? Graphics (jpeg, tiff, png) Graphics (jpeg, tiff, png) + True to original copy + Easy to do - Heavy - Problems with sorting
2. What format for digitalization? Text formats (rtf, html, doc) Text formats (rtf, html, doc) + Light + Easy for copying - Many mistakes and variances - Troubles with fonts and graphics
2. What format for digitalization? Adobe Acrobat Adobe Acrobat + Binds pages - Either heavy or inaccurate
2. What format for digitalization? DJVu (AT&T and LizardTech) DJVu (AT&T and LizardTech) + Graphic format + Really much liter
Viewing DJVu Download and install plug-in for IE from Or install WinDjView from
3. Steps in digitalization Scanning Editing Enhancing Encoding Finalization
3.1 Scanning
1. Scanner + good quality - slow, deforms books
3.1 Scanning 1. Scanner 2. Professional bookscanner - expensive APT BookScan 1200
3.1 Scanning 1. Scanner 2. Professional bookscanner 3. Digital camera with holder - less quality - limitations with memory card
3.1 Scanning Parameters 1. Resolution: > 300 dpi 2. Choosing a color scheme
3.1 Scanning 1 Bit (b/w) 4-8 bit Grayscale 4-24 bit Colored text b/w graphics color graphics Color schemes
3.2 Editing 1. Splitting doubled pages 2. Cropping 3. Deskewing 4. Converting format to tiff (optional) 5. Converting color scheme
3.2 Editing
1. Splitting doubled pages 2. Cropping 3. Deskewing 4. Converting format to tiff (optional) 5. Converting color scheme Automatic batch processing is possible using Adobe Finereader, ScanKromsator, etc.
3.3 Enhancing
Combination of b\w text scans with grey graphics 1 Bit graphic file
3.4 Encoding 1 bit tiff-files sorted by name DJVu file
3.4 Encoding (software) Express Enterprise with DjVu v5.x DjVu Solo Configuration Manager Workflow Manager
3.5 Finalization a. Adding OCR (optical character recognition) Полицейские машины с включенными сиренами пробивались сквозь эту толчею. Комиссар Иенсен сидел в первой. Это была обычная полицейская машина, темно-синяя, с полоской. Следом ехал серый автобус с зарешеченными стеклами в задней двери и вращающимся прожектором на крыше. Начальник полиции вызвал Иенсена по радиотелефону. Graphic layer Text layer
3.5 Finalization a. Adding OCR (optical character recognition) 1. DjvuOCR 2.0 pre + ABBYY FineReader v7.0 (+ free, better quality – complicate process) 2. Document Express Editor 5.0 (+ simple – less quality)
3.5 Finalization a. Adding OCR (optical character recognition) b. Adding the colored cover Document Express Editor 5.0 Document Express Editor 5.0
3.5 Finalization a. Adding OCR (optical character recognition) b. Adding the colored cover Document Express Editor 5.0 Document Express Editor 5.0 c. Subscribe your document Document Express Editor 5.0 (or djvused for Russians)
4 Creation of collections of DJVu books The most natural way to create a library of electronic books is arranging them as a web-site HTML Site manager
4 Creation of collections of DJVu books The most natural way to create a library of electronic books is arranging them as a web-site HTML Site manager