ARCHIVE IMAGING SEARCHABLE VIA THE WEBPAC Marthie de Kock The Hong Kong Institute of Education 9 December 2002
Education Imaging System (EdIS) Hong Kong Institute of Education Library
3 Points for discussion Scope and functions EdIS Phase I EdIS Phase II Background Different document classes Data retrieval & searching INNOPAC and the Z server
4 Scope Provide a sophisticated system to manage the growing electronic media including text, black & white scanned images, colour photos, audio, video and multimedia presentations available to and in HKIEd library. Provide an effective web interface to retrieve on-line digitised materials.
5 System Functions Capture of content, storage & management Scanning & OCR Supports both English and Chinese indexing and full text searching
6 Background First Digital Library initiatives of HKIed Library Joint project between IBM & Library with technical support by ITS July signed contract with IBM and it ’ s Digital Library June the system was launched
7 Search Interface of EdIS > The Main Screen
8 Contents of EdIS Phase I Four Document Types Document types Digitised items Newspaper clippings Image scanning & OCR Examination papers Image scanning & OCR Curriculum materials Multimedia objects Student Projects Multimedia objects
9 Document Types: News Clippings & Exam Papers News clippings: Past newspaper clippings scanning, OCR, indexing Wiser News indexing & CMC operations Exam Papers: Departments scanning, OCR, indexing
10 Document Types: Curriculums & Student Projects Digitising procedures included: Content Analysis Categorise multimedia objects Write a summary Digitise materials, saving files with logical file names, web page design & preparing scripts for uploading Upload documents & testing
11 Basic Search Screen of Curriculum Materials
12 Search results screen of [Title = dance]
13 Selected the target page from the hit-list.
14 EdIS Phase II Include Archive materials Improve multimedia searching Search Archive materials via INNOPAC No response – IBM’s DL and CMC June 2001 new Tender specifications Vitova
15 EdIS Phase II Development Customise system Project development – July 2001 Z server System delivered – April 2002 Interface – uploading of Wiser news
16 System Architecture Three subsystems : Client subsystem The front-end PC workstations with Netscape or Microsoft web browser are available for record retrieval and viewing. Capturing Subsystems Used for content preparation (scanning OCR and indexing) Server Subsystem The production server - stores records and manages the systems operations
17 Configuration Hardware: SUN Enterprise 250 server 36 GB data storage space Configured as RAID 0 (disk mirror) Operating Software: ORACLE Database 8i for SUN Sparc Solaris Unix 2.7 Z39.50 server for document searching
18 Hardware and software Application software VitalDoc Document Imaging system - 40 user license Two VitalScan licenses for desktop Scanning and OCR Chinese OCR - TsingHau Wintone ver. 8.0
19
20
21 Other hardware Two scanning/OCR workstations Minolta PS7000 Scanner Ricoh IS330DC DF and Flatbed scanner
22
23
24
25
26
27 Typical Searching Procedure Enter Searching Criteria Browsing Hit List View Result/Content Review History New Search Select Class/Database
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42 Future? End