2002 September -- ejk/UF
RESEARCH TOPICS
Web-Interface Performance; DTD Extensibility; Imaging; Distillation; Other topics?

CONTEXT
Image Only Pilots: Australian Periodical Publications; National Library of New Zealand, Papers Past
Image & Indexing/Tagging Pilot: University of Florida, Caribbean Newspaper Imaging Project; University of Florida, Florida Newspaper Project
Image & OCR Pilots: Lambrakis Press Archives; ProQuest, Historical Newspapers™; TIDEN Project: a Nordic Digital Newspaper Library
Olive Software Pilot: The British Library

WEB-INTERFACE PERFORMANCE
Primary Purpose: Characterize the bias of individuals conducting the study
Products: How to use ActivePaper™ to Your Advantage; Integration with CONTENTdm, XPAT 5.0, other; Alternate deliverable images; Centralized service – Distributed content – Variable platforms

DTD EXTENSIBILITY
Primary Purpose: Assess the XML against established newspaper uses
Products: How to use ActivePaper™ to Your Advantage; Document the XML as a public DTD; Establish a maintenance authority; Provide for extension of the DTD; Automation for extended tagging; How to construct a style sheet; Integration with CONTENTdm, XPAT 5.0, other; Define issues per the Economic Model

IMAGING
Directory Structure and File Naming; Archival Formats; Optimized Imaging

IMAGING: Directory Structure and File Naming
Primary Purpose: Recommended practices
Products: Methods for dealing with anomalies; Automated name capture during imaging
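
The slide names directory structure, file naming, and automated name capture without giving a scheme. As a purely illustrative sketch, one possible convention derives the directory and file name from a title code, issue date, and page number. The layout, the `page_image_path` helper, and the `examplegazette` title code are all hypothetical and are not taken from the presentation.

```python
from datetime import date
from pathlib import PurePosixPath


def page_image_path(title_code: str, issue_date: date, page: int,
                    ext: str = "tif") -> PurePosixPath:
    """Build a path for one page image (hypothetical convention).

    Assumed layout: <title>/<YYYY>/<MM>/<DD>/<title>_<YYYYMMDD>_<page>.<ext>
    """
    if page < 1:
        raise ValueError("page numbers start at 1")
    directory = PurePosixPath(
        title_code,
        f"{issue_date:%Y}",
        f"{issue_date:%m}",
        f"{issue_date:%d}",
    )
    # Zero-padded page numbers keep files in page order when sorted by name.
    filename = f"{title_code}_{issue_date:%Y%m%d}_{page:04d}.{ext}"
    return directory / filename


print(page_image_path("examplegazette", date(1902, 9, 5), 3))
```

Anomalies such as unnumbered supplements or misdated issues would need their own rules, which is presumably what the "methods for dealing with anomalies" product would document.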

IMAGING: Archival Formats
Primary Purpose: Description of file formats & their characteristics for archive, distillation, and distribution
Products: Preservation metadata; Anticipate migration; Schedule & fee structure for inspection & migration; Strategy for format migrations & emulation

IMAGING: Optimized Imaging
Primary Purpose: Best practices for microfilming and digitizing (quantitative assessments)
Film reduction ratio; Evenness of illumination on film; Film background density; Quality Index & DPI/PPI; Skew; Color-space & Bit-depth; Image density/black & white points; Despeckling and Sharpening; Image restoration methods
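
The relationship between Quality Index and DPI can be made concrete with the digital benchmarking formulas commonly cited in the preservation imaging literature: QI = (dpi × 0.039 × h) / 3 for bitonal scans and QI = (dpi × 0.039 × h) / 2 for grayscale or color, where h is the height in millimetres of the smallest significant character. The slide does not state which formula the study would use, so the constants and function names below are assumptions to be checked against the guidelines actually adopted.

```python
MM_TO_INCH = 0.039  # approximate millimetres-to-inches factor used in the formula


def quality_index(dpi: float, char_height_mm: float, bitonal: bool = True) -> float:
    """Estimate the digital Quality Index for a given scanning resolution.

    Assumes the commonly cited divisors: 3 for bitonal, 2 for grayscale/color.
    """
    divisor = 3 if bitonal else 2
    return dpi * MM_TO_INCH * char_height_mm / divisor


def required_dpi(target_qi: float, char_height_mm: float, bitonal: bool = True) -> float:
    """Invert the formula: resolution needed to reach a target Quality Index."""
    divisor = 3 if bitonal else 2
    return divisor * target_qi / (MM_TO_INCH * char_height_mm)


# Example: 1 mm characters scanned bitonally at 600 dpi.
print(round(quality_index(600, 1.0), 2))
print(round(required_dpi(8, 1.0)))
```

Small newspaper type drives the required resolution up quickly, which is one reason the smallest significant character height is worth measuring per title rather than assumed.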

IMAGING: Optimized Imaging
Environments: Operating System; Scanning Hardware; Lighting and Light Filtration; Post-processing; Other?
Other Products: Control target for OCR assessment; Revision: RLG Preservation Microfilming Guidelines

DISTILLATION
Document Zoning; Optical Character Recognition

DISTILLATION: Document Zoning
Primary Purpose: Confirm assumptions re: document zoning (OCR has difficulty processing large letters; smaller zones yield more accurate text)
Products: Establish reference to the... PDF (fully scaled); TIFF; Other derivative file formats (fully scaled)

DISTILLATION: OCR
Primary Purpose: Provide quantitative OCR accuracy information
Areas of Investigation: Distillation Source Images; Language and Fonts; Column & Line Density; Relative Density/Contrast; Text Curvature and Other Defects
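
One widely used way to quantify OCR accuracy is character accuracy: one minus the edit distance between the OCR output and a hand-keyed ground truth, divided by the ground-truth length. The slides do not say which metric the study would adopt, so this is a minimal sketch under that assumption, with hypothetical function names.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,           # delete ca
                            curr[j - 1] + 1,       # insert cb
                            prev[j - 1] + cost))   # substitute or match
        prev = curr
    return prev[-1]


def character_accuracy(ground_truth: str, ocr_text: str) -> float:
    """Fraction of ground-truth characters recovered, floored at 0.0."""
    if not ground_truth:
        raise ValueError("ground truth must not be empty")
    errors = edit_distance(ground_truth, ocr_text)
    return max(0.0, 1.0 - errors / len(ground_truth))


print(character_accuracy("Florida Newspaper", "F1orida Newspaper"))
```

Computing the score per zone or per page, rather than per issue, would let the figures be broken down by the areas of investigation listed above.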

DISTILLATION: OCR Distillation Source Images
Primary Purpose: Predict accuracy contingent upon source document (printing technologies & filming standards)
Test-Set Characterization: Source type (newspaper or microfilm); Production date (technologies & standards used)
Additional Products: Best practices; Accuracy : Cost – Matrix

DISTILLATION: OCR Language and Fonts
Primary Purpose: Demonstrate ability to distill languages, character sets & fonts
Test-Set Characterization: Language & character set groups; Font face & font size groups; Regional variant spellings
Additional Products: Olive Software Speaks Your Language; How Olive Software Learns Your Lingo; Stylized text recognition & distillation guide

DISTILLATION: OCR Column & Line Density
Primary Purpose: Demonstrate ability to distill compact text
Test-Set Characterization: Pre-1900 newspapers; Advertisement pages; Pages predominantly 8 pt. type or less; Pages with less than 1 mm space between lines; Pages with characters spaced at or below ⅓ mm
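
The test-set criteria above are numeric enough to check mechanically once a page has been measured. The sketch below reports which criteria a page meets; the `PageMetrics` fields are hypothetical measurements, and the slide does not say whether a page must meet one criterion or several to enter the test set, so the function only lists matches.

```python
from dataclasses import dataclass
from typing import List

MM_PER_INCH = 25.4
POINTS_PER_INCH = 72  # desktop-publishing point; historical printer's points differ slightly


def points_to_mm(points: float) -> float:
    """Convert a type size in points to millimetres."""
    return points * MM_PER_INCH / POINTS_PER_INCH


@dataclass
class PageMetrics:
    """Hypothetical per-page measurements; the field names are assumptions."""
    year: int
    is_advertisement_page: bool
    predominant_type_pt: float
    line_gap_mm: float
    char_gap_mm: float


def dense_text_criteria(page: PageMetrics) -> List[str]:
    """Return the names of the compact-text criteria that the page meets."""
    matched = []
    if page.year < 1900:
        matched.append("pre-1900 newspaper")
    if page.is_advertisement_page:
        matched.append("advertisement page")
    if page.predominant_type_pt <= 8:
        matched.append("8 pt type or less")
    if page.line_gap_mm < 1.0:
        matched.append("less than 1 mm between lines")
    if page.char_gap_mm <= 1 / 3:
        matched.append("characters spaced at or below 1/3 mm")
    return matched


page = PageMetrics(year=1895, is_advertisement_page=False,
                   predominant_type_pt=7.0, line_gap_mm=1.2, char_gap_mm=0.5)
print(dense_text_criteria(page))
print(round(points_to_mm(8), 2))
```

For scale, 8 pt type is a little under 3 mm from ascender to descender, so the character heights that matter for OCR are smaller still.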

DISTILLATION: OCR Relative Density/Contrast
Primary Purpose: Investigate low and uneven contrast materials
Test-Set Characterization: Low contrast pages; Pages with low contrast zones; Printing, Filming, & Age/Storage Defects
Additional Products: Best practices; Accuracy : Cost – Matrix; Don’t forget to buy the Life Insurance
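
Identifying "pages with low contrast zones" requires some per-zone contrast measure. The slides do not name one; as an assumption, the sketch below uses Michelson contrast, (max − min) / (max + min), over the grayscale pixel values of each zone. The 0.5 threshold and the zone names are placeholders, not values from the study.

```python
from typing import Dict, List, Sequence


def michelson_contrast(pixels: Sequence[int]) -> float:
    """Michelson contrast of grayscale values (0 = black, 255 = white)."""
    if not pixels:
        raise ValueError("zone has no pixels")
    darkest, lightest = min(pixels), max(pixels)
    if darkest + lightest == 0:
        return 0.0  # an all-black zone has no measurable contrast
    return (lightest - darkest) / (lightest + darkest)


def low_contrast_zones(zones: Dict[str, Sequence[int]],
                       threshold: float = 0.5) -> List[str]:
    """Names of zones whose contrast falls below the (placeholder) threshold."""
    return [name for name, pixels in zones.items()
            if michelson_contrast(pixels) < threshold]


zones = {
    "masthead": [10, 35, 200, 240],
    "faded column": [120, 140, 165, 180],
}
print(low_contrast_zones(zones))
```

Because the measure depends only on the extreme values, a few specks or stains can inflate it; a percentile-based variant would be more robust on damaged film.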

DISTILLATION: OCR Text Curvature and Other Defects
Primary Purpose: Benchmark current capability to distill curved text & other defects of printing or filming
Test-Set Characterization: Curved text zones; Broken character zones; Broken line zones; Garbage elements (stains, etc.)
Additional Products: (Additional automatic image correction processes)