Presentation is loading. Please wait.

Presentation is loading. Please wait.

R utgers C ommunity R epository RU CORE Fedora Repository Object Datastreams.

Similar presentations


Presentation on theme: "R utgers C ommunity R epository RU CORE Fedora Repository Object Datastreams."— Presentation transcript:

1 R utgers C ommunity R epository RU CORE Fedora Repository Object Datastreams

2 R utgers C ommunity R epository RU CORE Pre-indexing Program The pre-indexing program populates the Search database with sorting and path information from the Fedora database as well as collection information from the Objects. Fedora Database Search Database Fedora Objects

3 R utgers C ommunity R epository RU CORE Create Indexes The indexing program uses the search database to find Objects for particular collections and then combines Descriptive metadata with XML full text datastreams to Create “search objects” for indexing with Amberfish. Search Database Filter Search Objects Atomic Amberfish Indexes

4 R utgers C ommunity R epository RU CORE Amberfish ® Amberfish is text retrieval software distributed as open source software under the terms of version 2 of the GNU General Public License (GPL). - Automatic searching across multiple databases (allowing modular indexing). We refer to this as atomic indexes. - Indexing/search of semi-structured text (i.e. both free text and multiply nested fields) - Boolean queries, right truncation, phrase searching, relevance ranking, support for multiple documents per file, incremental indexing. Read more - http://www.etymon.com/tr.html

5 R utgers C ommunity R epository RU CORE Searching… User Search Interface [Coll1][Coll2][Coll3] Ambersearch.php Sort Results Search Database ++ Coll1 IndexColl2 IndexColl3 Index Search Results

6 R utgers C ommunity R epository RU CORE Collection Hierarchy A MySQL database is used to create and display parent/child collection relationships. The database is a compact relational model. A class was written to build collection hierarchies in the search interface and create structure maps of the collection when needed. Collection Hierarchy Search Interface A start point (collection id) along with the max depth are defined in a function call. The collection tree is then built in the search interface. Structure Map (SMAP) Generation When a collection objects structure/hierarchy is changed a structure map(XML) can be generated and stored in the collection object in the repository. In the event a collections hierarchy needs to be rebuilt we have preserved the collections lineage in the repository. To create the SMAP a start point (collection id) need only be defined. A function then probes the database to determine the collections maximum depth. Once this is discovered an appropriate SMAP is generated and appended to the object.

7 R utgers C ommunity R epository RU CORE Collection Hierarchy Methods A PHP class of methods is used by the WMS, dlr/EDIT and search interfaces to add, update, delete and display collection hierarchies. List of methods AddChildAddNewCollectionAddSearchTerm AreRelatedBuildCollTreeBuildQueryCollHierarchy ChangeChildChangeParentDeleteChild DeleteCollectionDeleteDeadChildrenDeleteOrphans DeleteSearchTermDisplayCollectionSearchTermsGetCollectionInfo GetCollectionSearchTermsGetCollectionStructureInfoGetCollidMySQL GetCollidMySQLByFedoraIDGetCollidWMSGetCollidWMSByFedoraID GetDeadChildrenGetDirectChildrenGetDirectParents GetOrphansGetSearchTermFieldsUpdateSearchTerm

8 R utgers C ommunity R epository RU CORE Partner Portals Background -Provide the capability to allow partners, other institutions and individuals to attach the repository search engine with selected collections to their website -Built off existing search code used on NJDH and RUcore sites. -An extension offered to NJDH partners and RUcore participants. -Minimal systems requirements. -Simple setup and maintenance for partners, assumed they are not technically orientated. -Ability to customize their collection list, subscribe. How it works… -Username/password and a unique key are generated and assigned. -Partner has access to subscribing to collections of their choice. -Partner embeds a URI, IFrame, on their web site that allows for access.

9 R utgers C ommunity R epository RU CORE Contact Chad Mills: cmmills@rci.rutgers.edu Jeffery Triggs: triggs@rutgers.edu


Download ppt "R utgers C ommunity R epository RU CORE Fedora Repository Object Datastreams."

Similar presentations


Ads by Google