Presenter: Svetlana Smirnov


1 Presenter: Svetlana Smirnov
Structure and Logic of WORD Index
ALEPH Operations meeting, July 16, 2007
DRAFT
Presenter: Svetlana Smirnov

2 Copyright Statement All of the information and material inclusive of text, images, logos, product names is either the property of, or used with permission by Ex Libris Ltd. The information may not be distributed, modified, displayed, reproduced – in whole or in part – without the prior written permission of Ex Libris Ltd. TRADEMARKS Ex Libris, the Ex Libris logo, ALEPH 500, SFX, SFXIT, MetaLib, DigiTool, Verde, Primo, Voyager, Journals Onsite, MetaSearch, MetaIndex and other Ex Libris products and services referenced herein are trademarks of Ex Libris, and may be registered in certain jurisdictions. All other product names, company names, marks and logos referenced may be trademarks of their respective owners. DISCLAIMER The information contained in this document is compiled from various sources and provided on an "AS IS" basis for general information purposes only without any representations, conditions or warranties whether express or implied, including any implied warranties of satisfactory quality, completeness, accuracy or fitness for a particular purpose. Ex Libris, its subsidiaries and related corporations (the "Ex Libris Group") disclaim any and all liability for all use of this information, including losses, damages, claims or expenses any person may incur as a result of the use of this information, even if advised of the possibility of such loss or damage. © Ex Libris Ltd., 2007

3 Word Index Structure – ORACLE Tables
The word indexing process involves a number of ORACLE tables:
* Z95 – Words per Document
* Z97 – Word Dictionary
* Z970 – Synonyms
* Z98 – Bitmap
* Z980 – Bitmap for updates

4 Word Index Structure – ORACLE Tables
Z95 – Words per Document – Structure
The Z95 ORACLE table contains the following information for each document:
* Z95_DOC_NUMBER
* Z95_DATA: pointer to Z97, location in the document, word index code number from tab00.lng (W-001)
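As a rough illustration of the Z95 layout described above, the following Python sketch models one Z95 row as a document number plus a list of entries. The class names (Z95Entry, Z95Row), the helper pointers_for_index, and all sample values are hypothetical; the slide does not describe how Z95_DATA is actually encoded, so the list-of-entries form is an assumption made for readability.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Z95Entry:
    """One word occurrence inside Z95_DATA (hypothetical unpacked form)."""
    z97_rec_number: int  # pointer to the word's row in Z97
    location: int        # location of the word in the document
    index_code: str      # word index code number from tab00.lng


@dataclass
class Z95Row:
    doc_number: int        # Z95_DOC_NUMBER
    data: List[Z95Entry]   # Z95_DATA, unpacked for illustration only


def pointers_for_index(row: Z95Row, index_code: str) -> List[int]:
    """Return the Z97 pointers of the words this document has in one word index."""
    return [e.z97_rec_number for e in row.data if e.index_code == index_code]


# Made-up document number, pointers, and index codes.
row = Z95Row(
    doc_number=1234,
    data=[
        Z95Entry(z97_rec_number=501, location=1, index_code="001"),
        Z95Entry(z97_rec_number=502, location=2, index_code="001"),
        Z95Entry(z97_rec_number=501, location=1, index_code="002"),
    ],
)
print(pointers_for_index(row, "001"))  # [501, 502]
```

The same word can appear under several word index codes, which is why the index code is stored with each entry rather than once per document.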

5 Word Index Structure – ORACLE Tables
Z95 – Words per Document – Structure
View in Util F/4 (key = doc number):
* Record number
* Pointer to Z97 (Z97_REC_NUMBER)
* Location in a document
* Word index code number from tab00.lng

6 Word Index Structure – ORACLE Tables
Z95 – Words per Document – Structure
View in Util F/4 (key = doc number)
Let's look at the first line of Z95, which has word number 0001 from word file code 011 (col. 3 of tab00.lng). We see that it goes to the WFM word index.

7 Word Index Structure – ORACLE Tables
Z97 – Dictionary – Structure
The Z97 ORACLE table contains the following information:
* Word text (dictionary of words)
* Z97_rec_number (pointer from the dictionary to other ORACLE tables)

8 Word Index Structure – ORACLE Tables
Z97 – Dictionary – Structure
View in Util F/4
In Util F/4, Z97 has two keys:
* 1 – word text
* 3 – pointer number (Z97_rec_number)
Searching for the word ‘OPS’ (by key ‘1’), taken from the Author “OPS Meeting”, displays its Z97_rec_number.

9 Word Index Structure – ORACLE Tables
Z97 – Dictionary – Structure
View in Util F/4
Z95
Search by key ‘3’ – Z97_rec_number
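The two Util F/4 keys for Z97 (word text and Z97_rec_number) amount to a dictionary that can be searched in both directions. A minimal Python sketch of that idea follows. The class WordDictionary and its method names are hypothetical, and the sequential numbering of new words is an assumption, not something the slides state.

```python
from typing import Dict, Optional


class WordDictionary:
    """Toy stand-in for Z97: each word text is paired with one record number."""

    def __init__(self) -> None:
        self._by_text: Dict[str, int] = {}    # analogous to key '1' (word text)
        self._by_number: Dict[int, str] = {}  # analogous to key '3' (pointer number)
        self._next_number = 1                 # assumed sequential allocation

    def add(self, word: str) -> int:
        """Register a word and return its record number (existing words keep theirs)."""
        if word in self._by_text:
            return self._by_text[word]
        number = self._next_number
        self._next_number += 1
        self._by_text[word] = number
        self._by_number[number] = word
        return number

    def by_text(self, word: str) -> Optional[int]:
        """Look up the record number for a word, like a search by key '1'."""
        return self._by_text.get(word)

    def by_number(self, number: int) -> Optional[str]:
        """Look up the word for a record number, like a search by key '3'."""
        return self._by_number.get(number)


dictionary = WordDictionary()
ops_number = dictionary.add("OPS")
print(dictionary.by_text("OPS") == ops_number)  # True
print(dictionary.by_number(ops_number))         # OPS
```

Searching by number is what makes it possible to start from a pointer found in Z95 and recover the word text.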

10 Word Index Structure – ORACLE Tables
Z98 – Bitmap (per word) – Structure
The Z98 ORACLE table contains the following information:
* Word index code number from tab00.lng
* Z97_rec_number (pointer)
* Map (the documents that include the word)
Util F/4 key = “Word index code number from tab00.lng” + Z97_rec_number
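To make the idea of a per-word bitmap concrete, here is a small Python sketch in which each key (word index code, Z97_rec_number) maps to an integer used as a bitmap. It assumes that bit N stands for document number N. The real layout of the Z98 map is not described in the slides, and the function names set_doc and docs_in are hypothetical.

```python
from typing import Dict, List, Tuple

# Key mirrors the Util F/4 key: (word index code, Z97_rec_number).
BitmapKey = Tuple[str, int]


def set_doc(bitmaps: Dict[BitmapKey, int], key: BitmapKey, doc_number: int) -> None:
    """Mark doc_number as containing the word identified by key."""
    bitmaps[key] = bitmaps.get(key, 0) | (1 << doc_number)


def docs_in(bitmap: int) -> List[int]:
    """List the document numbers whose bits are set, in ascending order."""
    result = []
    position = 0
    while bitmap:
        if bitmap & 1:
            result.append(position)
        bitmap >>= 1
        position += 1
    return result


# Made-up index code, pointers, and document numbers.
example: Dict[BitmapKey, int] = {}
set_doc(example, ("001", 501), 3)
set_doc(example, ("001", 501), 10)
set_doc(example, ("001", 502), 10)

print(docs_in(example[("001", 501)]))                          # [3, 10]
print(docs_in(example[("001", 501)] & example[("001", 502)]))  # [10]
```

One appeal of a bitmap in general is that combining two words with AND becomes a single bitwise operation, as the last line shows.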

11 Word Index Structure – ORACLE Tables
Z980 – Bitmap (per word) for updates – Structure
The Z980 ORACLE table contains the following information:
* Word index code number from tab00.lng
* Z97_rec_number (pointer)
* Z980_NO_ENTRIES – counter for the number of updates
* Record number updated
Util F/4 key = “Word index code number from tab00.lng” + Z97_rec_number
Doc number

12 Word Index Structure – ORACLE Tables
Z970 – Synonyms – Structure
The Z970 table is used for defining synonym groups. Synonyms are words that share the same bitmap (Z98) value; in other words, they are considered equal. A ‘Find’ command for any word defined as a synonym in a group retrieves the documents for all other words in that group.
Z970 contains two fields:
* Z970 – common word
* Z970 – word
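The effect of a synonym group can be sketched in Python as follows: every word is first mapped to the common word of its group, and documents are recorded under that common word only, so all members of the group retrieve the same result. This is an illustration under assumptions. The class SynonymIndex is hypothetical, a set of document numbers stands in for the Z98 bitmap, and plain word strings are used where the real tables would use Z97 record numbers.

```python
from typing import Dict, Set


class SynonymIndex:
    """Toy index in which all words of a synonym group share one posting set."""

    def __init__(self, groups: Dict[str, str]) -> None:
        # groups maps each word to the common word of its group (like a Z970 row).
        self._common = dict(groups)
        self._docs: Dict[str, Set[int]] = {}

    def _key(self, word: str) -> str:
        # Words outside any group act as their own common word.
        return self._common.get(word, word)

    def add(self, word: str, doc_number: int) -> None:
        """Record that doc_number contains word, under the group's common word."""
        self._docs.setdefault(self._key(word), set()).add(doc_number)

    def find(self, word: str) -> Set[int]:
        """Return the documents for word and for every synonym in its group."""
        return set(self._docs.get(self._key(word), set()))


# Made-up synonym group and document numbers.
index = SynonymIndex({"colour": "color", "color": "color"})
index.add("colour", 1)
index.add("color", 2)
print(sorted(index.find("colour")))  # [1, 2]
print(sorted(index.find("color")))   # [1, 2]
```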

13 Word Index Structure – ORACLE Tables
Z970 – Synonyms – Structure
For each synonym word there is a record in Z970.
Z97

14 Word Index Structure – ORACLE Tables
Util F/4 – Word3 – data file
The Word3 display data file combines the Z98 and Z980 Oracle tables.
Util F/4 key = “Word index code number from tab00.lng” + Z97_rec_number

15 Word Index Structure – ORACLE Tables
Util F/4 – Search for a specific word
For example, we want to check whether a document's words were indexed correctly. To do so, we look for its record number in Z98 or Z980:
* Search Z97 (the word dictionary) for the word
* Take the Z97_rec_number
* Find the word index code number in tab00.lng
* Go to the Word3 file and search for the records that include the word
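The manual steps above can be sketched as a single lookup in Python. Everything here is a simplified stand-in: z97, tab00, and word3 are plain dictionaries rather than the real tables or files, the function name is_indexed is hypothetical, and the index names, codes, pointer, and document number in the sample data are illustrative values only.

```python
from typing import Dict, Set, Tuple


def is_indexed(
    word: str,
    index_name: str,
    doc_number: int,
    z97: Dict[str, int],
    tab00: Dict[str, str],
    word3: Dict[Tuple[str, int], Set[int]],
) -> bool:
    """Follow the manual check: Z97 -> tab00.lng -> Word3 -> record list."""
    # Steps 1 and 2: search the word dictionary and take the Z97_rec_number.
    rec_number = z97.get(word)
    if rec_number is None:
        return False
    # Step 3: find the word index code number for the chosen word index.
    index_code = tab00.get(index_name)
    if index_code is None:
        return False
    # Step 4: use index code + Z97_rec_number as the Word3 key and check the records.
    return doc_number in word3.get((index_code, rec_number), set())


# Illustrative data only.
z97 = {"structure": 501}
tab00 = {"WRD": "001", "WTI": "002"}
word3 = {("001", 501): {1234, 5678}, ("002", 501): {1234}}

print(is_indexed("structure", "WTI", 1234, z97, tab00, word3))  # True
print(is_indexed("structure", "WTI", 5678, z97, tab00, word3))  # False
```

A False result in this sketch can mean any of three things (unknown word, unknown word index, or document missing from the record list), which mirrors the places where the manual check can stop.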

16 Word Index Structure – ORACLE Tables
Util F/4 – Search for a specific word
Example
We will check whether the word ‘structure’ from tag 245 of record number was indexed correctly.
According to tab11_word, words from field 245 are indexed in the Word Indexes WRD and WTI.
According to tab00.eng, the Word Index numbers are 001 for WRD and 002 for WTI.

17 Word Index Structure – ORACLE Tables
Util F/4 – Search for a specific word
* Number of records where the word ‘structure’ appears
* Record number has been found in the result list

18 Word Index Structure – ORACLE Tables
Web OPAC – Search for a specific word

19 Word Index Structure – ORACLE Tables
Links between Word ORACLE tables
* Z95: Z97_rec_number
* Z970: Z97_rec_number, Z97_rec_number_2
* Z97: Z97_rec_number
* Z98: Z97_rec_number
* Z980: Z97_rec_number

20 Word Index Structure – ORACLE Tables
P-MANAGE-01 Word Building Service
p-manage-01 has two parts:
* The p-manage-01-a script creates Z95 and Z97
* The p-manage-01-e script creates Z98
Z980 is not created by p-manage-01. It is created by ue-01 and other doc update processes.
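The split into two parts can be pictured as a two-pass build: a first pass that produces the dictionary and the words-per-document data, and a second pass that derives the per-word document map from the first pass's output. The Python sketch below illustrates that general shape only. It is not the logic of the p-manage-01 scripts; the function names, the whitespace tokenization, the sequential record numbers, and the use of sets instead of bitmaps are all assumptions.

```python
from typing import Dict, List, Set, Tuple

# One indexed field of a document: (word index code, field text).
Field = Tuple[str, str]
# One words-per-document entry: (Z97 pointer, location, word index code).
Entry = Tuple[int, int, str]


def build_z95_z97(docs: Dict[int, List[Field]]) -> Tuple[Dict[int, List[Entry]], Dict[str, int]]:
    """First pass: build a dictionary (like Z97) and per-document word lists (like Z95)."""
    z97: Dict[str, int] = {}
    z95: Dict[int, List[Entry]] = {}
    for doc_number, fields in docs.items():
        entries: List[Entry] = []
        for index_code, text in fields:
            for location, word in enumerate(text.split(), start=1):
                # New words get the next record number; known words keep theirs.
                rec_number = z97.setdefault(word, len(z97) + 1)
                entries.append((rec_number, location, index_code))
        z95[doc_number] = entries
    return z95, z97


def build_z98(z95: Dict[int, List[Entry]]) -> Dict[Tuple[str, int], Set[int]]:
    """Second pass: derive the per-word document map (like Z98) from the first pass."""
    z98: Dict[Tuple[str, int], Set[int]] = {}
    for doc_number, entries in z95.items():
        for rec_number, _location, index_code in entries:
            z98.setdefault((index_code, rec_number), set()).add(doc_number)
    return z98


# Made-up documents: document number -> list of (index code, text).
docs = {1: [("001", "word index")], 2: [("001", "index structure")]}
z95, z97 = build_z95_z97(docs)
z98 = build_z98(z95)
print(z97)                     # {'word': 1, 'index': 2, 'structure': 3}
print(sorted(z98[("001", 2)])) # [1, 2]
```

In this sketch the second pass reads only the output of the first, which is one way to see why the two parts can be run as separate steps.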

