Indexing Innovations 14.2 Seminar 14.1 Seminar - Filing Procedures.


Session Agenda
- Filing and word breaking procedures
- Indexing procedures, new features:
  - Parallel processing
  - Updating a group of indexes
  - New indexing routines

Parallel Processing

Parallel Processing
Problems with indexing batch routines in the early versions of ALEPH:
- Long run time
- Computer resources not fully utilized: a single process per stage
- No recoverability: if indexing failed, the whole building process had to be rerun

Parallel Processing
In 14.2, all the index creation jobs (with the exception of p_manage_27) enable parallel processing.

Parallel Processing
- Optimal utilization of computer resources (large databases, multiple processors)
- Certain stages of index creation can be split into several cycles, which allows you to divide the workload among different processors
- Indexing is much quicker
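The idea of splitting a stage into cycles can be sketched in a few lines of Python. This is an illustration only, not ALEPH code: the names `split_into_cycles`, `index_cycle` and `run_stage`, the cycle size, and the use of a thread pool in place of separate operating-system processes are all assumptions made for the example.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_cycles(first_doc, last_doc, cycle_size):
    """Divide an inclusive document-number range into consecutive cycles."""
    cycles = []
    start = first_doc
    while start <= last_doc:
        end = min(start + cycle_size - 1, last_doc)
        cycles.append((start, end))
        start = end + 1
    return cycles


def index_cycle(cycle):
    """Hypothetical stand-in for the indexing work done on one cycle."""
    start, end = cycle
    return end - start + 1  # number of documents covered by this cycle


def run_stage(cycles, workers):
    """Hand the cycles of one stage to several workers at once."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(index_cycle, cycles))


if __name__ == "__main__":
    cycles = split_into_cycles(1, 90000, 10000)
    print(len(cycles), cycles[0], cycles[-1])
    print(sum(run_stage(cycles, workers=4)))
```

Because each cycle covers its own document range, a failed cycle can in principle be repeated without touching the others, which is what makes the recovery described later possible.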

Parallel Processing - Tracking
Assignment progress table: good control of indexing stages
0001 + + ? - 000000001 000010000
0002 + ? - - 000010001 000020000
0003 + - - - 000020001 000030000
0004 ? - - - 000030001 000040000
0005 ? - - - 000040001 000050000
0006 ? - - - 000050001 000060000
0007 - - - - 000060001 000070000
0008 - - - - 000070001 000080000
0009 - - - - 000080001 000090000
Legend: + success, ? in process, - not processed
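To make the table easier to read, the following Python sketch parses the rows shown on the slide and counts the statuses per stage. The column interpretation (cycle number, four stage flags, first and last document number) is an assumption drawn from the layout of the slide, and `parse_progress` and `summarize` are hypothetical helper names.

```python
from collections import Counter

# Rows copied from the slide.
TABLE = """\
0001 + + ? - 000000001 000010000
0002 + ? - - 000010001 000020000
0003 + - - - 000020001 000030000
0004 ? - - - 000030001 000040000
0005 ? - - - 000040001 000050000
0006 ? - - - 000050001 000060000
0007 - - - - 000060001 000070000
0008 - - - - 000070001 000080000
0009 - - - - 000080001 000090000
"""

STATUS = {"+": "success", "?": "in process", "-": "not processed"}


def parse_progress(text):
    """Turn each 7-field line into a dict; other lines are skipped."""
    rows = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 7:
            continue
        rows.append({
            "cycle": parts[0],
            "stages": parts[1:5],      # assumed: one flag per stage
            "from_doc": int(parts[5]),  # assumed: first doc number of the cycle
            "to_doc": int(parts[6]),    # assumed: last doc number of the cycle
        })
    return rows


def summarize(rows):
    """Return one Counter of status names per stage column."""
    n_stages = len(rows[0]["stages"]) if rows else 0
    return [Counter(STATUS[r["stages"][i]] for r in rows) for i in range(n_stages)]


if __name__ == "__main__":
    for i, counts in enumerate(summarize(parse_progress(TABLE)), start=1):
        print(f"stage {i}: {dict(counts)}")
```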

Parallel Processing - Recovery
If indexing stops for one of these reasons:
- database tables need to be enlarged
- not enough disk space for intermediate files
- not enough disk space for the sort
- general disaster
you do not have to rerun the whole process!

Parallel Processing - Recovery
Recovery stages:
- Identify the last successful section
- Change the "in process" signs (?) to "not processed" signs (-)
- Rerun the discrete stage scripts, for example:
  p_manage_01_a
  p_manage_01_c
  p_manage_01_d
  p_manage_01_d1
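The recovery logic can be modelled as below. The slides do not say how the progress table is stored or edited, so this Python sketch works on a plain list of flag rows. The function names are hypothetical, and treating each column as one stage is an assumption.

```python
def reset_in_process(rows):
    """Return a copy of the table with every '?' flag changed to '-'."""
    return [["-" if flag == "?" else flag for flag in row] for row in rows]


def last_successful_cycle(rows, stage):
    """Index of the last row in the leading run of '+' flags for `stage`.

    Returns -1 when not even the first cycle completed that stage.
    """
    last = -1
    for i, row in enumerate(rows):
        if row[stage] != "+":
            break
        last = i
    return last


def cycles_to_rerun(rows, stage):
    """Indexes of the cycles that still need `stage` to be run."""
    return [i for i, row in enumerate(rows) if row[stage] != "+"]


if __name__ == "__main__":
    # Stage flags of the first four cycles from the tracking slide.
    table = [
        ["+", "+", "?", "-"],
        ["+", "?", "-", "-"],
        ["+", "-", "-", "-"],
        ["?", "-", "-", "-"],
    ]
    print(last_successful_cycle(table, stage=0))
    cleaned = reset_in_process(table)
    print(cycles_to_rerun(cleaned, stage=0))
```

Only the cycles returned by `cycles_to_rerun` would need the discrete stage script; the completed cycles are left alone.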

Parallel Processing - Main Features
- Indexing is quicker
- Tracking is easier
- Recoverability is possible

Updating a Group of Indexes

Updating a Group of Indexes
p_manage_01 and p_manage_02 have a new feature that allows you to update a specific group of indexes. Col. 8 defines a group of headings/word indexes for updating:
11 W 008 F07-04 01 A WRD WYR
11 W 008 F35-03 01 A WRD WLN
11 W LOC## -o 03 WRD WCL
11 W 041## abdefg 41 A WRD WLN

Updating a Group of Indexes
This option is available only when the program is run from the command line prompt. It is not available from the Web Services. The following example shows how the programs should be run for fields that belong to group B:
csh -f p_manage_02 USM01,1,000000000,999999999,B,1,0,00,
csh -f p_manage_01 USM01,1,000000000,999999999,B,1,0,00,
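If the same commands have to be produced for several groups, the argument string can be assembled programmatically. The Python sketch below only reproduces the shape of the string shown on the slide. Reading the third and fourth fields as a document-number range is an assumption, the remaining fixed fields are copied verbatim because the slide does not explain them, and both function names are hypothetical.

```python
def group_update_args(library, group, first_doc=0, last_doc=999999999):
    """Build the comma-separated argument string in the shape shown on the slide."""
    fields = [
        library,
        "1",                  # copied from the slide; meaning not stated there
        f"{first_doc:09d}",   # assumed: first document number, zero-padded
        f"{last_doc:09d}",    # assumed: last document number, zero-padded
        group,                # the index group (col. 8) to update
        "1", "0", "00",       # copied from the slide; meaning not stated there
    ]
    return ",".join(fields) + ","  # the slide shows a trailing comma


def group_update_commands(library, group):
    """Return the two command lines, in the order given on the slide."""
    args = group_update_args(library, group)
    return [f"csh -f {job} {args}" for job in ("p_manage_02", "p_manage_01")]


if __name__ == "__main__":
    for command in group_update_commands("USM01", "B"):
        print(command)
```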

Z0102 - COUNTERS FOR LOGICAL BASES

z0102
Pre-14.2 problem: Scanning logical bases that contain less than 50% of the total database is very inefficient (slow, and cluttered with irrelevant unlinked headings).
Solution: A new index, z0102, "divides" z01 into sections in accordance with the existing logical bases.

z0102
Example of z0102 record: [figure not reproduced in the transcript]

z0102
When a logical base is being browsed, the system uses the Z0102 table to "decide" whether to display the heading (Z01), without having to retrieve the documents attached to the heading, read them, and then "decide".

z0102
Structure:
- A record is built for each Z01 and each logical base, giving the filing text and sequence (in order to make the SCAN more efficient) and a counter of the number of relevant docs.
- Records are built for "see" reference headings, as well as for preferred headings.
- The record does not include pointers to the doc records; this is still done by Z02.
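The effect of a per-base counter can be illustrated with a small in-memory model. This Python sketch is not the real Z0102 layout: the field names, the `browse` function, and the rule that a heading is shown when its counter for the base is above zero are all assumptions. The handling of "see" reference headings is left out.

```python
from typing import NamedTuple


class Z0102(NamedTuple):
    base: str         # logical base code (hypothetical field name)
    filing_text: str  # normalized heading text used for ordering
    sequence: int     # tie-breaker for identical filing text
    z01_id: int       # link back to the heading (Z01)
    doc_count: int    # number of relevant documents in this base


def browse(records, base, start_text, limit):
    """List headings for one base without reading any document records.

    Returns up to `limit` (z01_id, doc_count) pairs in filing order,
    starting at `start_text`.
    """
    hits = [r for r in records
            if r.base == base and r.filing_text >= start_text and r.doc_count > 0]
    hits.sort(key=lambda r: (r.filing_text, r.sequence))
    return [(r.z01_id, r.doc_count) for r in hits[:limit]]


if __name__ == "__main__":
    records = [
        Z0102("LAW", "contracts", 1, 101, 5),
        Z0102("LAW", "torts", 1, 102, 0),
        Z0102("MED", "anatomy", 1, 103, 7),
        Z0102("LAW", "civil procedure", 1, 104, 2),
    ]
    print(browse(records, "LAW", "", limit=10))
```

The decision to show a heading depends only on the counter, so no document has to be fetched during the scan; the documents themselves would still be reached through Z02.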

z0102
Building the table:
Run p_manage_32 to create z0102
Run p_manage_34 to update z0102
- p_manage_32 runs on all Z01 records and builds Z0102. When p_manage_02 is run, p_manage_32 should be run directly afterwards.
- p_manage_34 runs on Z01 records that have been "touched" since the last time p_manage_32 or p_manage_34 was run. It should be run on a regular basis, e.g. nightly, listed in the job_list (UTIL E/15/1).

z0102
Z01 records that have been "touched":
- Z01 has a new field, Z01-UPDATE-Z0102.
- p_manage_02 sets this flag to "Y".
- p_manage_32 and p_manage_34 set this flag to "N".
- An update of z01 sets Z01-UPDATE-Z0102 to "Y".
- p_manage_34 re-indexes Z01 records that have Z01-UPDATE-Z0102 = "Y".
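The flag mechanism is a standard "dirty flag" pattern, and the following Python sketch shows how a full build and an incremental update could interact. The data structures and function names are hypothetical. Only the "Y"/"N" flag behaviour is taken from the slide, with `full_build` standing in for the role of p_manage_32 and `incremental_update` for p_manage_34.

```python
def mark_touched(z01, heading_id):
    """An update of a heading sets its flag to 'Y'."""
    z01[heading_id]["update_z0102"] = "Y"


def count_docs(heading, base_docs):
    """Number of the heading's documents that belong to one base."""
    return len(heading["docs"] & base_docs)


def full_build(z01, bases):
    """Rebuild the counters for every heading and clear every flag."""
    z0102 = {}
    for hid, heading in z01.items():
        for base, docs in bases.items():
            z0102[(hid, base)] = count_docs(heading, docs)
        heading["update_z0102"] = "N"
    return z0102


def incremental_update(z01, bases, z0102):
    """Recount only the headings flagged 'Y'; return the ids processed."""
    processed = []
    for hid, heading in z01.items():
        if heading["update_z0102"] != "Y":
            continue
        for base, docs in bases.items():
            z0102[(hid, base)] = count_docs(heading, docs)
        heading["update_z0102"] = "N"
        processed.append(hid)
    return processed


if __name__ == "__main__":
    z01 = {
        1: {"docs": {10, 11}, "update_z0102": "Y"},
        2: {"docs": {12}, "update_z0102": "Y"},
    }
    bases = {"LAW": {10, 12}, "MED": {11}}
    counters = full_build(z01, bases)
    z01[2]["docs"].add(11)
    mark_touched(z01, 2)
    print(incremental_update(z01, bases, counters), counters[(2, "MED")])
```

Running the incremental step regularly keeps its workload proportional to the number of changed headings rather than to the size of Z01.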

z0102
Restrictions:
- Z0102 is used only for the WEB OPAC browse.
- A new switch in the WEB OPAC defines which tables are involved in BROWSE:
  - If TAB10-Z0102-IN-USE = 'Y', browse is performed by z0102.
  - If TAB10-Z0102-IN-USE = 'N', z0102 does not participate in BROWSE.
- Presently, there is no online update of z0102.

New Batch Jobs for AUT Enrichment
Pre-14.2: AUT enrichment and correction of BIB after initial conversion or re-indexing is very time-consuming (it takes up to several days).
Solution: New batch jobs for AUT enrichment and correction of BIB libraries. These batch jobs replace the background running of ue_08 after a re-indexing of the z01 indexes.
- p_manage_102: enrich the BIB z01 index from the entire AUT library
- p_manage_104: reset the Z01 created from regular indexing to "-CHK-" status
- p_manage_103: send Z07 records to all potential "corrected" BIB docs
