Download presentation
Presentation is loading. Please wait.
Published bySylvia Mills Modified over 9 years ago
1
Mark Sullivan Digital Library of the Caribbean
2
Imaging Imaging Theory & Specifications Recommended Equipment and Software 2 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
3
3 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
4
Imaging Theory & Best Practices Bit Depth & Color Space Resolution File Types Image Compression OCR Sample Directories Questions 4 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
5
Bit Depth & Color Space Bi-tonal, “black and white”, 1 bit Greyscales 8-bit ( 256 shades of gray ) 16-bit (65536 shades of gray ) RGB ( usually 24-bit ) CMYK ( usually 32-bit ) 5 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
6
Bit Depth & Color Space Image: © Nevit Dilmen found at Wikimedia commons 6 RGB “built” from 3 color channels dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
7
Bit Depth & Color Space Color Fidelity “Full Informational Capture” Meaningful color should be retained 7 Bi-tonal8-bit Greyscale24-bit Color dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
8
Bit Depth… : Recommended (Almost) never scan 1-bit Completely grey items should (usually) be scanned 8-bit greyscale. Items with meaningful color should be scanned 24-bit RGB Trade-offs between quality and file size 8 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
9
Bit Depth… : Rationale Text – Optical Character Recognition 9 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
10
Resolution Resolution of an image expressed in pixels PPI – pixels per inch DPI – dots per inch 10 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
11
Resolution : Recommended R ESOLUTION U SE F OR 300 pixels per inch (ppi) Printed text with normal sized fonts Oversized documents and maps Manuscripts with legible script 600 pixels per inch (ppi) Photographs and select graphic arts Printed text with very small fonts Manuscripts with difficult scripts 11 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
12
Resolution : Rationale 1 Newspaper graphics printed at 80 dpi Magazine graphics printed at 120 dpi High-end graphics printed at 300 dpi Scanning at 300 dpi is sufficient 12 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
13
Resolution : Rationale 2 Text – Optical Character Recognition 13 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
14
Resolution : Rationale 3 Photographs Use 600 dpi Continuous-tone images Unexpected use – capture all details 14 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
15
File Types Save archival masters as TIFF Internet delivery as JPEGs or JPEG2000s 15 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
16
Image Compression Save archival TIFFs as non-compressed “Lossy” vs. Lossless compression 16 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
17
OCR Optical Character Recognition Creation of plain text from an image file Just as important is the positional information! Text highlighting Text analysis 17 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
18
OCR : ALTO XML LOC XML schema / standard “Analyzed Layout and Text Object” Contains position (and style) of each word, with possible variants Can be embedded within a METS file Used by NDNP 18 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
19
OCR : ALTO XML 19 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
20
File Types (sample directory 1) 00001.tif (archival master TIFFs) 00001.jpg (standard page view) 00001.jp2 (zoomable page view) 00001thm.jpg (thumbnail) 00001.txt (OCR’d text) GOOD! 20 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
21
File Types (sample directory 2) 00001_archive.tif (archival master TIFFs) 00001_processed.tif (processed TIFF) 00001.jpg (standard page view) 00001.jp2 (zoomable page view) 00001thm.jpg (thumbnail) GOOD! 21 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
22
File Types (sample directory 3) 00001.tif (archival master TIFFs) 00002.tif (archival master TIFFs) 00003.tif (archival master TIFFs) 00004.tif (archival master TIFFs) Book.pdf (presentation PDF) FINE! 22 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
23
File Types (sample directory 4) Book.pdf (presentation PDF) BAD! Do not scan directly to PDF, or any other presentation file type 23 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
24
Review of Topics Bit Depth & Color Space Resolution File Types Image Compression OCR Sample Directories 24 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
25
dLOC Training (7/29/2013) Gainesville, FLMark Sullivan25
26
26 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
27
Scanning Equipment Flatbed scanners Sheet-feed scanners Book scanners Map scanners Microfilm 27 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
28
Flatbed Scanners Microtek ScanMaker 9800XL Epson Expression 10000XL 28 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
29
Sheet-feed Scanners Panasonic KV-S2046C 29 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
30
Book Scanners i2S CopiBook ( 24-bit color ) Konica Minolta PS7000 with grayscale up-grade 30 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
31
Oversized Document Scanners Camera back, vacuum table, etc.. Betterlight Super 8K-HS 31 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
32
Microfilm Scanners 32 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan
33
dLOC Training (7/29/2013) Gainesville, FLMark Sullivan33
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.