1
Extensible Library Catalog: Name Access Control Module
  Matthew Horoszowski, Rob Busack, Anthony Lyo, Ben Greenwood, Dean Rzonca
  Sponsored by University of Rochester River Campus Library
2
Overview
  Project overview
  Features
  Future development
  Demo
  Questions
3
Project
  Name matching
    Names are entered differently.
    Multiple pen names may be used by the same person.
  Finding matching records
    Easy when an authority record for the author already exists.
    A new authority record is created when none exists for the author.
  Importing different record formats
4
Technologies Used
  Java
  XML
  MySQL
  Hibernate
  Ant
  MARC4J
5
Supported Record Types
  MARC Authority records
  MARC Bibliographic records
  Dublin Core records
6
Features
  Persistent data storage
  Record import
  Record matching
  A functional API
  A prototype GUI
7
Importing
  Identifies the record format of each input
  Imports MARC XML and Dublin Core XML
  Uses MARC4J to convert raw MARC data to MARC XML
  Detects duplicates
  Updates records with new information
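One way the format identification step could work is to look at XML namespaces. The sketch below is an assumption about the approach, not the module's actual code: the class and method names (FormatDetector, detect) are hypothetical, and it relies only on the standard MARCXML and Dublin Core element namespaces plus the JDK's StAX parser.

```java
import java.io.StringReader;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

class FormatDetector {
    enum Format { MARC_XML, DUBLIN_CORE, UNKNOWN }

    // Standard namespaces for MARCXML and the Dublin Core element set.
    static final String MARC_NS = "http://www.loc.gov/MARC21/slim";
    static final String DC_NS = "http://purl.org/dc/elements/1.1/";

    /** Returns the format of the first element found in a known namespace. */
    static Format detect(String xml) {
        try {
            XMLInputFactory factory = XMLInputFactory.newInstance();
            // DTDs are not needed for detection, so they are switched off.
            factory.setProperty(XMLInputFactory.SUPPORT_DTD, false);
            XMLStreamReader reader = factory.createXMLStreamReader(new StringReader(xml));
            while (reader.hasNext()) {
                if (reader.next() == XMLStreamConstants.START_ELEMENT) {
                    String ns = reader.getNamespaceURI();
                    if (MARC_NS.equals(ns)) return Format.MARC_XML;
                    if (DC_NS.equals(ns)) return Format.DUBLIN_CORE;
                }
            }
        } catch (XMLStreamException e) {
            // Assumption: input that is not well-formed XML is reported as UNKNOWN.
        }
        return Format.UNKNOWN;
    }

    public static void main(String[] args) {
        String marc = "<collection xmlns=\"" + MARC_NS + "\"><record/></collection>";
        String dc = "<metadata xmlns:dc=\"" + DC_NS + "\">"
                  + "<dc:creator>Simpson, Homer</dc:creator></metadata>";
        System.out.println(detect(marc));
        System.out.println(detect(dc));
        System.out.println(detect("not xml at all"));
    }
}
```

Raw (binary) MARC files are not XML, so in this sketch they would come back as UNKNOWN; the slide indicates those are converted to MARC XML with MARC4J before import.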
8
Matching Phases
9
Matching
  Loops through all unmatched records.
  Tries various strategies and string transformations in order of confidence.
  If a match is found, a link is created with evidence.
  If no match is found, a new Authority record is created based on the Bibliographic record information.
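The matching loop described on this slide might look roughly like the following. This is a simplified sketch under stated assumptions: records are reduced to plain name strings, the Link type and the strategy names are hypothetical, and the strategies are supplied in a map whose insertion order stands in for "order of confidence".

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.UnaryOperator;

class MatchingSketch {
    /** Hypothetical link between a bibliographic name and an authority heading. */
    static final class Link {
        final String bibName;
        final String authorityHeading;
        final String evidence; // which strategy produced the match
        Link(String bibName, String authorityHeading, String evidence) {
            this.bibName = bibName;
            this.authorityHeading = authorityHeading;
            this.evidence = evidence;
        }
    }

    static List<Link> matchAll(List<String> unmatchedBibNames,
                               List<String> authorityHeadings,
                               Map<String, UnaryOperator<String>> strategiesByConfidence) {
        List<Link> links = new ArrayList<>();
        for (String bibName : unmatchedBibNames) {
            Link link = null;
            // Try each strategy in order; the first hit wins.
            for (Map.Entry<String, UnaryOperator<String>> s : strategiesByConfidence.entrySet()) {
                String candidate = s.getValue().apply(bibName);
                if (authorityHeadings.contains(candidate)) {
                    link = new Link(bibName, candidate, s.getKey());
                    break;
                }
            }
            if (link == null) {
                // No match: create a new authority heading from the bibliographic name.
                authorityHeadings.add(bibName);
                link = new Link(bibName, bibName, "new authority record");
            }
            links.add(link);
        }
        return links;
    }

    public static void main(String[] args) {
        Map<String, UnaryOperator<String>> strategies = new LinkedHashMap<>();
        strategies.put("exact", s -> s);
        strategies.put("inverted", s -> {
            int i = s.lastIndexOf(' ');
            return i < 0 ? s : s.substring(i + 1) + ", " + s.substring(0, i);
        });
        List<String> authorities = new ArrayList<>(Arrays.asList("Simpson, Homer"));
        for (Link l : matchAll(Arrays.asList("Homer Simpson"), authorities, strategies)) {
            System.out.println(l.bibName + " -> " + l.authorityHeading + " (" + l.evidence + ")");
        }
    }
}
```

Keeping the strategy name on each link is one simple way to record the "evidence" the slide mentions, so a reviewer can later see why two records were linked.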
10
Name Transformations
  Names are transformed to get better matches. For example:
    Homer Simpson -> Simpson, Homer
    Smith, Elizabeth ($q Ann Elizabeth) -> Smith, Ann Elizabeth
    De la Mare, Walter -> Mare, Walter De la
    Vaughan Williams, Ralph -> Williams, Ralph Vaughan
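The example transformations on this slide can be approximated with a few string operations. The code below is an illustrative sketch only: the class and method names are hypothetical, and the rules are deliberately naive (for instance, the last word of a multi-word surname is always treated as the entry element), which the real module may well handle differently.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class NameTransforms {
    // Matches "Surname, Forenames ($q Fuller form)" and captures surname and fuller form.
    private static final Pattern FULLER_FORM =
        Pattern.compile("^([^,]+),\\s*.*?\\(\\$q\\s+([^)]+)\\)\\s*$");

    /** "Homer Simpson" becomes "Simpson, Homer"; already inverted names are returned as is. */
    static String invertDirectOrder(String name) {
        String trimmed = name.trim();
        if (trimmed.contains(",")) return trimmed;
        int lastSpace = trimmed.lastIndexOf(' ');
        if (lastSpace < 0) return trimmed; // single-word name
        return trimmed.substring(lastSpace + 1) + ", " + trimmed.substring(0, lastSpace);
    }

    /** "Smith, Elizabeth ($q Ann Elizabeth)" becomes "Smith, Ann Elizabeth". */
    static String useFullerForm(String name) {
        Matcher m = FULLER_FORM.matcher(name.trim());
        return m.matches() ? m.group(1).trim() + ", " + m.group(2).trim() : name.trim();
    }

    /** "De la Mare, Walter" becomes "Mare, Walter De la". */
    static String moveSurnamePrefix(String name) {
        String trimmed = name.trim();
        int comma = trimmed.indexOf(',');
        if (comma < 0) return trimmed;
        String surname = trimmed.substring(0, comma).trim();
        String forenames = trimmed.substring(comma + 1).trim();
        int lastSpace = surname.lastIndexOf(' ');
        if (lastSpace < 0) return trimmed; // single-word surname, nothing to move
        return surname.substring(lastSpace + 1) + ", " + forenames
             + " " + surname.substring(0, lastSpace);
    }

    public static void main(String[] args) {
        System.out.println(invertDirectOrder("Homer Simpson"));
        System.out.println(useFullerForm("Smith, Elizabeth ($q Ann Elizabeth)"));
        System.out.println(moveSurnamePrefix("De la Mare, Walter"));
    }
}
```

Each transformation returns its input unchanged when it does not apply, so the functions can be tried one after another on the same name without special-casing.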
11
Discriminators
  Adjust the confidence in a match based on a discrimination criterion. For example:
    Common names
    Publication dates
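A discriminator can be thought of as a small function that takes the current confidence and a candidate match and returns an adjusted confidence. The sketch below illustrates that idea with the two criteria named on the slide. Everything specific in it is an assumption: the 0 to 100 confidence scale, the penalty values, the Candidate fields, and the rule that a work published before the author's birth year makes a match less plausible.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

class DiscriminatorSketch {
    /** Hypothetical view of a proposed match; the field names are assumptions. */
    static final class Candidate {
        final String surname;
        final int authorBirthYear;
        final int publicationYear;
        Candidate(String surname, int authorBirthYear, int publicationYear) {
            this.surname = surname;
            this.authorBirthYear = authorBirthYear;
            this.publicationYear = publicationYear;
        }
    }

    interface Discriminator {
        int adjust(int confidence, Candidate candidate);
    }

    // Common surnames are weaker evidence that two records refer to the same person.
    static Discriminator commonName(Set<String> commonSurnames, int penalty) {
        return (confidence, c) ->
            commonSurnames.contains(c.surname) ? confidence - penalty : confidence;
    }

    // A work published before the author's birth year is an unlikely match.
    static Discriminator publicationDate(int penalty) {
        return (confidence, c) ->
            c.publicationYear < c.authorBirthYear ? confidence - penalty : confidence;
    }

    /** Applies every discriminator in turn and clamps the result to 0..100. */
    static int score(int baseConfidence, Candidate c, List<Discriminator> discriminators) {
        int confidence = baseConfidence;
        for (Discriminator d : discriminators) {
            confidence = d.adjust(confidence, c);
        }
        return Math.max(0, Math.min(100, confidence));
    }

    public static void main(String[] args) {
        Set<String> common = new HashSet<>(Arrays.asList("Smith", "Williams"));
        List<Discriminator> ds = Arrays.asList(commonName(common, 20), publicationDate(50));
        System.out.println(score(80, new Candidate("Smith", 1950, 1990), ds));
        System.out.println(score(80, new Candidate("Mare", 1873, 1850), ds));
    }
}
```

Because each discriminator is independent, new criteria could be added to the list without touching the matching strategies themselves.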
12
Graphical User Interface
  Schedules jobs
  Filters and sorts results
  Views records and matches
  Supports manual matching of records
13
Future Possibilities
  Support for new metadata formats
  A web-based interface
  Searching (as a backend to an OPAC)
  GUI improvements
14
Demo
15
Questions and Comments?