Union Catalog Architecture Tsach Moshkovits, Development Team Leader Olybris, Ex Libris Seminar 2005 Kos, April 2005.

Slides:



Advertisements
Similar presentations
EBSCO Discovery Service
Advertisements

Auto-Graphics Update Mary E. Jackson Product Manager, Resource Sharing October 20, 2010.
Quick Review of Apr 10 material B+-Tree File Organization –similar to B+-tree index –leaf nodes store records, not pointers to records stored in an original.
ALEPH version 19.01/20.01 Cataloging & Acquisitions/Serials Updates South Dakota Library Network 1200 University, Unit 9672 Spearfish, SD
Comp 335 File Structures Indexes. The Search for Information When searching for information, the information desired is usually associated with a key.
South Dakota Library Network ALEPH Acquisitions Overview South Dakota Library Network 1200 University, Unit 9672 Spearfish, SD © South.
Concepts of Database Management Sixth Edition
Concepts of Database Management Seventh Edition
Managing Multi-Level Patrons and Staff Users Judy Levi, Senior Product Analyst Olybris, Ex Libris Seminar 2005 Kos, April 2005.
ALEPH 500 Union Catalogue Overview Judy Levi Senior Product Analyst Ex Libris Ltd. November 2004.
Patron Initiated Resource Sharing Carmit Marcus. Patron Initiated Resource Sharing 2 The cliches … in this era of reduced funds for acquisitions… … imperative.
U of R eXtensible Catalog Team MetaCat. Problem Domain.
Catalog: Batch delete old Patron Records How to conduct global/batch updates to records – patron Adding Faculty and Patron/Student Records Manually Standardizing.
A Guide to SQL, Seventh Edition. Objectives Understand, create, and drop views Recognize the benefits of using views Grant and revoke user’s database.
Data Structures Introduction Phil Tayco Slide version 1.0 Jan 26, 2015.
ALEPH version Course Reading & Reserves Course Documents South Dakota Library Network 1200 University, Unit 9672 Spearfish, SD
MyiLibrary® ‘Search & View’ Website Training June 8, 2010.
South Dakota Library Network ALEPH v21 Staff User Upgrade Information Cataloging and Systems South Dakota Library Network 1200 University, Unit 9672 Spearfish,
Version 18 Upgrade: Web OPAC. Version 18 Upgrade: Web OPAC Customization 2 All of the information in this document is the property of Ex Libris Ltd. It.
South Dakota Library Network ALEPH Basic Circulation Training Patrons South Dakota Library Network 1200 University, Unit 9672 Spearfish, SD
Cataloging v.16 eSeminar April 2004 Judith Fraenkel.
General Systems Information ALEPH v20.01 Library Staff Training © South Dakota Library Network, 2013 ©Ex Libris (USA), 2011 Modified for SDLN Version
Items 14.2 Seminar 5 March Seminar Items 2 Session Agenda Item record - structural changes Call No. Filing Item sorting routines Item Form.
Running a Report.  List Bibliography Report  Found under: All Titles Purpose : Creates customized bibliographies by catalog, call number, or item characteristics.
Cataloging 12.3 to 14.2 Seminar. Cataloging 2 -New check routines -Cataloging authorizations -Other innovations -Fix and expand routines -Floating keyboard.
ECPIC Scoring Quick Guide: Scorecards (1) Enter a name and description that defines the purpose of your scorecard. For example: Agency CIO Rating Evaluation.
File Processing - Indexing MVNC1 Indexing Jim Skon.
March 16 & 21, Csci 2111: Data and File Structures Week 9, Lectures 1 & 2 Indexed Sequential File Access and Prefix B+ Trees.
Web OPAC & GUI (Staff) Search v.16 eSeminar Doron Greenshpan.
1 Index Structures. 2 Chapter : Objectives Types of Single-level Ordered Indexes Primary Indexes Clustering Indexes Secondary Indexes Multilevel Indexes.
DATA STRUCTURE & ALGORITHMS (BCS 1223) CHAPTER 8 : SEARCHING.
The Functions of Operating Systems Interrupts. Learning Objectives Explain how interrupts are used to obtain processor time. Explain how processing of.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Concepts of Database Management Seventh Edition
Introduction to Databases Trisha Cummings. What is a database? A database is a tool for collecting and organizing information. Databases can store information.
Views Lesson 7.
Now, please open your book to page 60, and let’s talk about chapter 9: How Data is Stored.
South Dakota Library Network Aleph Tables Training ~ Aleph Administration © South Dakota Library Network, 2008 ©Ex Libris (USA), 2004 Modified for SDLN.
Microsoft Access. Microsoft access is a database programs that allows you to store retrieve, analyze and print information. Companies use databases for.
O FFICE M ANAGEMENT T OOL - II B BA -V I TH. Abdus Salam2 Week-7 Introduction to Query Introduction to Query Querying from Multiple Tables Querying from.
UB & UC An Update on Resource Sharing Developments.
Tutorial for Circulation Staff FIU Library
1 Copy and paste your photo into your opening and closing slide Aleph Parallel Indexing Jerry Specht Senior Support Analyst
How to transfer 85x prediction patterns from ADM library to HOL library Presenter: Yoel Kortick.
ALEPH version Course Reading & Reserves Staff Permissions South Dakota Library Network 1200 University, Unit 9672 Spearfish, SD
1 Certification Training Resource Management 03 Normalization Rules Merge Methods Match Methods.
Memory Management OS Fazal Rehman Shamil. swapping Swapping concept comes in terms of process scheduling. Swapping is basically implemented by Medium.
ILL Inter-Library Loan. Inter-Library Loan Overview The ILL module is for the management of Inter-library loans received and sent by Your library.
Integrated ISO ILL for staff users Borrowing requests – part one Yoel Kortick 2007.
Cataloging v.16 eSeminar September 2003 Judith Fraenkel.
How to create and use authority records Version 16 and up Yoel Kortick.
How to add additional privileges to user_function.lng (originally SI ) Presenter: Yoel Kortick.
Bibliographic Record Description of a book or other library material.
Integrated ISO ILL for staff users Setup Yoel Kortick 2007.
SILO File Upload & Feedback System By Marie Harms State Library of Iowa August 18 & 19, 2010.
© 2015 Ex Libris | Confidential & Proprietary Yoel Kortick Senior Librarian Cataloging introductory flow.
An Introduction to the Bibliographic Metadata Profile in Alma
Subject Name: File Structures
How to configure and use title hold requests version 18
Update SFX from Verde - Availability
Receiving New Lending requests
Metadata Editor Introduction
Importing Serial Prediction Patterns Via the Service Import 85X records (Serial-52) Yoel Kortick.
Integrated ILL GUI desktop
Resource Sharing Locate
ALEPH Version 22 Beginning Cataloging
Processor Fundamentals
Cataloging 14.2 Seminar.
Indexing 4/11/2019.
Prediction Patterns and Summary Holdings
Presentation transcript:

Union Catalog Architecture Tsach Moshkovits, Development Team Leader Olybris, Ex Libris Seminar 2005 Kos, April 2005

Union Catalog Architecture 2 Overview The Union Catalog is a sophisticated mechanism that supports the integration of disparate libraries into a single environment. By environment, we mean a unified User view, rather than a single database or a merged index.

Union Catalog Architecture 3 Overview The following will be discussed in this session: Union catalog structure Union catalog vs. Unified catalog Equivalency Merge

Union Catalog Architecture 4 A Unified Catalog Usually, a Union catalog involves a catalog where all Equivalent records are merged into one new record. In this scenario, the original records are not saved, and the index is built on the merged version of the records. Obviously, the merged record must include information about its different parts to allow navigation from the record to remote resources.

Union Catalog Architecture 5 Unified Catalog Drawbacks Match and Merge is preformed on load time, record by record. This is a slow process when additional resources are added. A new resource may not be available until the slow load process is completely finished. Updating a record is complex, since it may require more than just updating its merged record. This is true because the equivalence relation is not necessarily transitive.

Union Catalog Architecture 6 Unified Catalog Drawbacks Merging becomes even more problematic if the merge algorithm suggests that not all data is preserved for every source record. In such a case, any match and merge process must re- access all remote resources to retrieve all original records. It is also impossible to update the unified catalog with a standard Cataloging GUI.

Union Catalog Architecture 7 ALEPH Union Catalog Contributors A Import Load / Catalog New/Update/Delete B Create Equivalence C Merge“Just in Time” Equivalence Table (Z120) IndicesOriginal Records Unified Catalog Structure – Virtual Approach

Union Catalog Architecture 8 Union Structure – Level A Records are stored as distinct entities in the database. Records can be loaded from an external resource or cataloged with the ALEPH Cataloging module. Records from an external resource can hold an identifier to the external resource to allow simple updating or navigation to an external resource. Indices are created using the standard ALEPH indexing scheme.

Union Catalog Architecture 9 Union Structure – Level B An Equivalence table is created by mapping each record to its equivalent records. The equivalence relation is not necessarily transitive. This table can be recreated any time, leaving the records intact.

Union Catalog Architecture 10 Union Structure – Level C Result sets will be de-duplicated to contain only one record per group of equivalents. Browse lists will de-duplicate their counters to count only one record per group of equivalents. User View uses on-the-fly Merge to present a single record that is built from a group of equivalents. The Merge algorithm can vary from user to user.

Union Catalog Architecture 11 It is simple to update a record by unlinking it from the Equivalence table and marking it as “New.” This action breaks all existing connections in the group. A new record is simply inserted as equivalent only to itself. In all cases, the data of each record stays intact in the database. Virtual Approach Advantages

Union Catalog Architecture 12 A separate job runs on all equivalency tables marked as “New.” The job assures that records in a group are evaluated for their real equivalency. It takes no longer to load external resources here than it does to load and index in ALEPH. Virtual Approach Advantages

Union Catalog Architecture 13 The worst-case effect of update, insert, or delete is that between the time a record is updated, until the time that equivalency entries are (re)created, the group of equivalent records appears as non-equivalent. There is 100% uptime. Virtual Approach Advantages

Union Catalog Architecture 14 The same uptime considerations apply if the match algorithm is to be changed. Changing the merge algorithm has absolutely no effect, since it is executed “just in time.” Virtual Approach Advantages

Union Catalog Architecture 15 An equivalency table is created for each record in the database, and points to itself. Pool selection: The equivalency search is minimized to a certain number of candidates. This is usually done on a direct index, such as ISBN, ISSN, or LCCN, and is therefore relatively fast. If the number of candidates exceeds a certain limit, the record itself will be considered as the only candidate. Equivalency Table Creation

Union Catalog Architecture 16 Final match: The equivalent records from the pool are found. Matching and conflicting fields are searched. Matching adds a positive weight, while conflicts add a negative weight. The total weight is checked against a threshold. Equivalency Table Creation

Union Catalog Architecture 17 When both stages are complete, each record has a Z120 record, holding the numbers of all equivalent records. Z120 is never empty. It holds the record’s own number if no equivalencies are found. Both the pool selection program and the match program are table-defined, not hard-coded Equivalency Table Creation

Union Catalog Architecture 18 Merge When a user wants to view a record, a merge is done on all the records in its equivalency table, combining them into a single display. No merged record actually exists in the database. This is a virtual display created on request.

Union Catalog Architecture 19 Merge A merged record display is built by taking the “basic” fields from the preferred record and adding other fields from each of its equivalent records. The preferred record is selected by assigning weights to all the equivalent records based on table-defined criteria, and the top weight wins. The merge program is also table-defined.

Union Catalog Architecture 20 Implementation The union_global_param tables defines the programs (algorithms) used for different Union Catalog tasks. ! !!!!!-!-!!!!!!!!!!!!!!!!!!!!-!!!!!!!!!!!!!!!!!!!! USM90 B candidate_prog union_candidate_cdl USM90 B match_prog union_match_cdl USM90 B preferred_prog union_preferred_cdl USM90 B merge_prog union_merge_aleph USM90 B normalize_prog union_normalize_cdl

Union Catalog Architecture 21 !!!!!-!!!!!!-!!!!!!!!!!-!!!!!!!!!!!!!!!!!!!!!!!!!-!!! LDR F05-01 EQUAL d -10 LDR F17-01 NOT-EQUAL 1,2,3,4,5,7,8,u,z ## PRESENT ## PRESENT ## PRESENT ## PRESENT 001 The table defines a value for each field. All values are added according to the specifications in the middle columns. The record with the highest value is selected as the preferred record. Preferred Table – An Example

Union Catalog Architecture 22 Match Table – An Example !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!-!-!!!> date exact match date within date mismatch short title match full title match full title occur within full title mismatch full title keywords full title keywords order b exact match b occur within b mismatch The accumulative sum will be compare against a defined threshold

Union Catalog Architecture 23 Match Table – An Example Different fields are compared to determine whether two records match. For each field, if a match is found, the plus value is added to the total match weight. Otherwise, the minus value is subtracted from the total matched weight. The threshold in the first line defines the weight above which two records are considered a match.

Union Catalog Architecture 24 Workflow Illustration Single BIB record BIB ’ s pool of candidates BIB ’ s pool of matched records (= equiv table) queue of new/updated records Resources Contributors

Union Catalog Architecture 25 “Union Catalog” - On top of Bibliographic + Holdings database “Union View” - On top of ALEPH 500 administrative database Two Types of Union Catalogs

Union Catalog Architecture 26 SOURCE 1SOURCE 2SOURCE 3 UNION CATALOG JUMP Normalize records Bibliographic and Holdings Database

Union Catalog Architecture 27 When records are loaded from various resources, fixes are done to normalize their structure and data. Checks could be performed prior to the load so that incompatible records are rejected. Bibliographic and Holdings Database

Union Catalog Architecture 28 Jump to original View in union holdings Bibliographic and Holdings Database

Union Catalog Architecture 29 ADM 1ADM 2ADM 3 BIB 2 BIB 3 Librarian View Union Catalog - User View BIB 1 ALEPH 500 Database

Union Catalog Architecture 30 Records are managed in standard ALEPH 500 in a single BIB and ADM library, but separately per sub-library or administrative unit. The Staff User view does not change from an administrative GUI prospective. A user (patron) has a unified view on the PAC. ALEPH 500 Database

Union Catalog Architecture 31 ALEPH 500 Database

Union Catalog Architecture 32 ALEPH 500 Database

Union Catalog Architecture 33 ALEPH 500 Database