Presentation is loading. Please wait.

Presentation is loading. Please wait.

Extensible Library Catalog Name Access Control Module Matthew Horoszowski Rob Busack Anthony Lyo Ben Greenwood Dean Rzonca Faculty advisor - Robert Bubacz.

Similar presentations


Presentation on theme: "Extensible Library Catalog Name Access Control Module Matthew Horoszowski Rob Busack Anthony Lyo Ben Greenwood Dean Rzonca Faculty advisor - Robert Bubacz."— Presentation transcript:

1 Extensible Library Catalog Name Access Control Module Matthew Horoszowski Rob Busack Anthony Lyo Ben Greenwood Dean Rzonca Faculty advisor - Robert Bubacz Sponsored by University of Rochester River Campus Library

2 Overview Project overview Features Future development Demo Questions

3 Project Two types of records A bibliographic record represents a book, and is linked to multiple authority records. An authority record represents a single author or subject.

4 Goals Bibliographic Record Author field Name Date of Birth, Death Authority Record Authorized Form Alternate Forms: Alternate form 1 Alternate form 2 … See Also References to other authority records

5 Project Name matching Names are entered differently. Multiple pens names by the same person. Finding matching records Easy when authority record of an author already exists. A new authority record is created when an author does not exist. Importing different record formats

6 Technologies Used Java XML MySQL Hibernate Ant Marc4J

7 Supported Record Types MARC Authority records MARC Bibliography records Dublin Core records

8 Features A persistent data storage Import records Match records A functional API A prototype GUI

9 Importing Identifies the correct record formats Imports Marc and DublinCore XMLs Uses Marc4j to parse raw data to Marc XMLs Detects duplications Updates records with new information

10 Matching Phases

11 Matching Loops through all unmatched records. Tries various strategies and string transformations in order of confidence. If a match is found, a link is created with evidence. If no match is found, a new Authority record is created based on the Bibliographic record information

12 Name Transformations Names are transformed to get better matches. For example, Homer Simpson  Simpson, Homer Smith, Elizabeth ($q Ann Elizabeth)  Smith, Ann Elizabeth De la Mare, Walter  Mare, Walter De la Vanughan Williams, Ralph  Williams, Ralph Vanughan

13 Discriminators Adjusts the confidence in a match based on a discrimination criterion. For example, Common names Publication dates

14 Graphical User Interface Schedules jobs Filters and sorts results Views records and matches Manually matches of records

15 Metrics Effort by type of activity Test metrics (JUnit) - 45 tests Defects by types

16 Defects by Type

17 Effort by type of activity

18 Status R1 - 2/28/07 R2 - 3/11/07 R3 - 4/3/07 R4 - 5/8/07

19 Future Possibilities Support for new metadata formats A web-based interface Searching (backend to a OPAC) GUI improvements

20 Demo

21 Questions and Comments?


Download ppt "Extensible Library Catalog Name Access Control Module Matthew Horoszowski Rob Busack Anthony Lyo Ben Greenwood Dean Rzonca Faculty advisor - Robert Bubacz."

Similar presentations


Ads by Google