Presentation is loading. Please wait.

Presentation is loading. Please wait.

Mark Sullivan Digital Library of the Caribbean. Imaging  Imaging Theory & Specifications  Recommended Equipment and Software 2 dLOC Training (7/29/2013)

Similar presentations


Presentation on theme: "Mark Sullivan Digital Library of the Caribbean. Imaging  Imaging Theory & Specifications  Recommended Equipment and Software 2 dLOC Training (7/29/2013)"— Presentation transcript:

1 Mark Sullivan Digital Library of the Caribbean

2 Imaging  Imaging Theory & Specifications  Recommended Equipment and Software 2 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

3 3 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

4 Imaging Theory & Best Practices  Bit Depth & Color Space  Resolution  File Types  Image Compression  OCR  Sample Directories  Questions 4 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

5 Bit Depth & Color Space  Bi-tonal, “black and white”, 1 bit  Greyscales 8-bit ( 256 shades of gray ) 16-bit (65536 shades of gray )  RGB ( usually 24-bit )  CMYK ( usually 32-bit ) 5 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

6 Bit Depth & Color Space Image: © Nevit Dilmen found at Wikimedia commons 6 RGB “built” from 3 color channels dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

7 Bit Depth & Color Space  Color Fidelity  “Full Informational Capture”  Meaningful color should be retained 7 Bi-tonal8-bit Greyscale24-bit Color dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

8 Bit Depth… : Recommended  (Almost) never scan 1-bit  Completely grey items should (usually) be scanned 8-bit greyscale.  Items with meaningful color should be scanned 24-bit RGB  Trade-offs between quality and file size 8 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

9 Bit Depth… : Rationale  Text – Optical Character Recognition 9 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

10 Resolution  Resolution of an image expressed in pixels  PPI – pixels per inch  DPI – dots per inch 10 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

11 Resolution : Recommended R ESOLUTION U SE F OR 300 pixels per inch (ppi) Printed text with normal sized fonts Oversized documents and maps Manuscripts with legible script 600 pixels per inch (ppi) Photographs and select graphic arts Printed text with very small fonts Manuscripts with difficult scripts 11 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

12 Resolution : Rationale 1  Newspaper graphics printed at 80 dpi  Magazine graphics printed at 120 dpi  High-end graphics printed at 300 dpi  Scanning at 300 dpi is sufficient 12 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

13 Resolution : Rationale 2  Text – Optical Character Recognition 13 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

14 Resolution : Rationale 3  Photographs Use 600 dpi Continuous-tone images Unexpected use – capture all details 14 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

15 File Types  Save archival masters as TIFF  Internet delivery as JPEGs or JPEG2000s 15 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

16 Image Compression  Save archival TIFFs as non-compressed  “Lossy” vs. Lossless compression 16 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

17 OCR  Optical Character Recognition  Creation of plain text from an image file  Just as important is the positional information! Text highlighting Text analysis 17 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

18 OCR : ALTO XML  LOC XML schema / standard  “Analyzed Layout and Text Object”  Contains position (and style) of each word, with possible variants  Can be embedded within a METS file  Used by NDNP 18 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

19 OCR : ALTO XML 19 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

20 File Types (sample directory 1)  00001.tif (archival master TIFFs)  00001.jpg (standard page view)  00001.jp2 (zoomable page view)  00001thm.jpg (thumbnail)  00001.txt (OCR’d text) GOOD! 20 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

21 File Types (sample directory 2)  00001_archive.tif (archival master TIFFs)  00001_processed.tif (processed TIFF)  00001.jpg (standard page view)  00001.jp2 (zoomable page view)  00001thm.jpg (thumbnail) GOOD! 21 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

22 File Types (sample directory 3)  00001.tif (archival master TIFFs)  00002.tif (archival master TIFFs)  00003.tif (archival master TIFFs)  00004.tif (archival master TIFFs)  Book.pdf (presentation PDF) FINE! 22 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

23 File Types (sample directory 4)  Book.pdf (presentation PDF) BAD! Do not scan directly to PDF, or any other presentation file type 23 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

24 Review of Topics  Bit Depth & Color Space  Resolution  File Types  Image Compression  OCR  Sample Directories 24 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

25 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan25

26 26 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

27 Scanning Equipment  Flatbed scanners  Sheet-feed scanners  Book scanners  Map scanners  Microfilm 27 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

28 Flatbed Scanners Microtek ScanMaker 9800XL Epson Expression 10000XL 28 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

29 Sheet-feed Scanners Panasonic KV-S2046C 29 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

30 Book Scanners i2S CopiBook ( 24-bit color ) Konica Minolta PS7000 with grayscale up-grade 30 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

31 Oversized Document Scanners Camera back, vacuum table, etc.. Betterlight Super 8K-HS 31 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

32 Microfilm Scanners 32 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan

33 dLOC Training (7/29/2013) Gainesville, FLMark Sullivan33


Download ppt "Mark Sullivan Digital Library of the Caribbean. Imaging  Imaging Theory & Specifications  Recommended Equipment and Software 2 dLOC Training (7/29/2013)"

Similar presentations


Ads by Google