Indexing and filing the “thorn” character as “th” Yoel Kortick Jan. 2011.

Slides:



Advertisements
Similar presentations
Introduction to Databases CIS 5.2. Where would you find info about yourself stored in a computer? College Physician’s office Library Grocery Store Dentist’s.
Advertisements

Filing and Word Breaking Procedures. 2 Session Agenda Pre-14.x tab_word_breaking table Structure Procedures Special remarks tab_filing table Structure.
Chapter 2 Section 3 -Substances can be classified as either elements or compounds element- the simplest form of matter that can exist under normal conditions.
Understanding InfoHawk Indexes Technical Background for Libraries Staff Patricia Baird Sue Julich.
How to publish to PRIMO with a URL to the ADAM digital object Yoel Kortick.
How to sort the “Order Information report” from the service “Print Acquisitions Records acq-03” Yoel Kortick.
V. 21. Controlling and limiting the creation of photocopy and hold requests according to “service hours”. Rep_Ver Yoel Kortick.
Display of combined characters in Aleph GUI Yoel Kortick May 2010 Version 20.2 Minor Release Development A rep_change #
How to add additional privileges to user_function.lng (originally SI ) Presenter: Yoel Kortick.
How to control bracket and parentheses appearance in right to left display of web Presenter: Yoel Kortick.
1 Controlling directionality with Unicode Characters Yoel Kortick Aleph Support manager.
Full text indexing of multi character PDF documents as ADAM digital objects. V18 RC 2089 This presentation applies to Version 18 and up Presenter: Yoel.
© 2016 Ex Libris | Confidential & Proprietary Publishing non-preferred terms to Primo Harvard University April Yoel Kortick Senior Librarian
1 Yoel Kortick Senior Librarian Adding a local Electronic Collection.
Yoel Kortick Circulation Request Refusal Version 21 rep_ver #
Setting up and using Acquisitions-related indexes and logical bases. Version 18 and up Yoel Kortick.
How to use and understand the Category Mechanism Setup Version 18 and up Yoel Kortick.
Using home made fix procedures for non Latin characters Yoel Kortick.
Expansion of available character set for configuration in ALEPH tables Idit Shachar-Abutbul May 2011 Aleph operations meeting.
Validation Check Version 21 rep_ver # Yoel Kortick.
SMS Messaging in Aleph 500 Version 20 and up
How to separate semicolon delimited values to separate columns
Understanding Indexes: Headings
Yoel Kortick Senior Librarian
How to change default item status after binding
Rep change 1590 (ver 18) Access to Google books
Using character conversion to display non standard Unicode values
How to make a bibliographic base of records with attached ADAM digital objects Yoel Kortick October, 2007.
How to “expand” holding library fields to bibliographic record
Version 20 Feature Developments
How to add a fix procedure for copy cataloging from a Z39.50 base
How to configure and use title hold requests version 18
Creating links from an Alma Analytics report to an external web page
How to Define Separate Order Counters for Separate Sub-Libraries
Yoel Kortick Senior Librarian
How to change default item status after binding version 16 and up
Title of Your Research Heading 3 Heading 6 Heading 4 Heading 1
Yoel Kortick Senior Librarian
Publishing Course Readings to PRIMO Version 20
Alma Analytics Usage Yoel Kortick | Senior Librarian.
Introduction This presentation will show two ways of making a report of newly acquired records: Via services Retrieve Catalog Records (ret-01) and Print.
Metadata Editor Introduction
Special Issues in Hebrew Cataloging in Alma
Importing Serial Prediction Patterns Via the Service Import 85X records (Serial-52) Yoel Kortick.
Yoel Kortick Senior Librarian Alma Product Management
Presenter Name 32-Point Presenter Title 20-Point CSB_SJU_template
Yoel Kortick Senior Librarian
Databases Software This icon indicates the slide contains activities created in Flash. These activities are not editable. For more detailed instructions,
Benchmark Series Microsoft Word 2016 Level 2
Yoel Kortick Senior Librarian
Click ‘browse’ to search your device for
Search V.16.
SEARCHING ERIC THESAURUS & LIMIT RESULTS
Yoel Kortick Senior Librarian
Put the dots on the shift keys.
Please send any images as a separate file
Indication rules and set filtering
YOUR TITLE HERE SUBTITLE HERE.
Library Search Procedure
The key to success, Yesform
VIETNAM ACADEMY OF SCIENCE AND TECHNOLOGY
POWER YOUR POINT Add your text in here.
Insert Presentation Title Here Insert Presentation Summary Here
The key to success, Yesform
POWER YOUR POINT Add your text in here.
PLEASE ADD YOUR TITLE HERE.
Configuring and using the Accession Number and Inventory Number
Alternate graphic representation 880 field
New “replace string” parameter for fix_doc_do_file_08
Yoel Kortick Senior Librarian
Presentation transcript:

Indexing and filing the “thorn” character as “th” Yoel Kortick Jan. 2011

2 Question What settings do I need in tables unicode_to_filing_source and unicode_to_word_gen for the latin character thorn to be filed as th and retrieved by a word search as th? Currently this is not happening (example record )

3 Background 00DE is the Capital letter thorn 00FE is the Lowercase thorn You can read more about the thorn at

4 Sample records System number has 245 field with thorn: The library book about the þorn in ancient England. System number has 245 field with th: The library book about the thorn in ancient England.

5 Here is the record with the thorn

6 Here is the record with the th

7 What do we need? We need the Unicode value 00FE to be treated like a combination of 0074 and This way the Thorn (00FE ) will be like th ( ) Our example will not only make the words file together but also the headings (for browse)

8 What currently happens? Each title heading is filed separately

9 What currently happens? Each word is filed separately It finds “thorn” but not “þorn”

10 What currently happens? Each word is filed separately It finds “þorn” but not “thorn”

11 Making the word search change The first table we need to define for words is unicode_to_word_gen in directory $alephe_unicode. We will add the following two lines: USM50> egrep '00DE|00FE' unicode_to_word_gen 00DE #LATIN CAPITAL LETTER THORN 00FE #LATIN SMALL LETTER THORN The first line changes the capital thorn to th ( ) The second line changes the lower case thorn to th ( )

12 Making the word search change We also need to make sure that unicode_to_word_gen is being used for the word building procedures. In order for unicode_to_word_gen to be used for the word building procedures the following line should exist in tab_character_conversion_line in directory $alephe_unicode. USM01> grep unicode_to_word_gen tab_character_conversion_line WORD-FIX ##### # line_utf2line_utf unicode_to_word_gen USM01> After the change util e 1 must be restarted and records reindexed

13 Seeing the word search change Now if we search for þorn we get both records

14 Seeing the word search change Now if we search for thorn we get both records

15 Making the browse change Now we will make a browse for the heading have them both go to the same heading. We will make the change for TIT (Title) index. The title index uses filing procedure 11: USM01> grep -w TIT $data_tab/tab00.eng H TIT ACC Titles USM01>

16 Making the browse change The filing procedure 11 uses FILING-KEY-10 for normalization and FILING-KEY-01 for filing: USM01> grep ^11 $data_tab/tab_filing | grep char_conv 11 N char_conv FILING-KEY F char_conv FILING-KEY-01 USM01>

17 Making the browse change In table tab_character_conversion_line in directory $alephe_unicode we see what tables these FILING- KEYs refer to USM01> grep FILING-KEY-01 tab_character_conversion_line FILING-KEY-01 ##### # line_utf2line_utf unicode_to_filing_01 USM01> grep FILING-KEY-10 tab_character_conversion_line FILING-KEY-10 ##### # line_utf2line_utf naco_diacritics So now we will make the change in unicode_to_filing_01 and in naco_diacritics (both in $alephe_unicode)

18 Making the browse change USM01> egrep '00DE|00FE' naco_diacritics 00DE #LATIN CAPITAL LETTER THORN 00FE #LATIN SMALL LETTER THORN USM01> egrep '00DE|00FE' unicode_to_filing_01 00DE #LATIN CAPITAL LETTER THORN 00FE #LATIN SMALL LETTER THORN USM01> Above in both files we change the capital and lowercase thorn to uppercase T and H. The reason we change to uppercase is because this is part of the filing procedure. After the change util e 1 must be restarted and the records reindexed

19 Making the browse change Now both records are included in one heading The display text here has th instead of þ because that was the first indexed heading.

20 Making the browse change Now both records are included in one heading

Thank You! Yoel Kortick