Breeda Herlihy, IR Manager, UCC Library
UCC selected DSpace in 2008
- Software selection group: staff from Library IT, Computer Centre, Special Collections, Archives & Repository Services
- Shortlist of 3 platforms
  - DSpace: demo site provided by Enovation Solutions
  - EPrints: DemoPrints, demo site provided by EPrints
  - Digital Commons: sample sites, web conference call, questionnaire
- Evaluation matrix
- Recommendation for DSpace to Library Strategy Group
Evaluating IR platforms
- Evaluation matrix based on:
  - Open Society Institute: A Guide to Institutional Repository Software
  - Technical Evaluation of selected Open Source Repository Solutions, on behalf of CPIT, Version 1.3 approved, commissioned by OARINZ
  - System documentation available from various websites
  - Literature review
- Repositories Support Project, UK: *New* evaluation March
Reasons for recommendation
- Open source repository solution
  - No annual licence fees
  - Initial investment in set up and configuration required
  - Development of staff skill set in longer term
- Integration with new Research Support System
- DSpace Foundation and Fedora Commons announced plans to combine strengths in July 2008, and have since combined their organizations to create DuraSpace
- Active and open development community
- Wide adoption nationally and internationally
DSpace technical specification
- Operating system: Linux / UNIX / Mac OS X / Windows / Solaris (UCC use Red Hat Linux)
- Programming language: Java
- Database: Oracle / PostgreSQL (UCC use PostgreSQL)
- Web server: any, ships with Apache Tomcat (UCC use Tomcat)
- Web user interface: JSP or XML (UCC use JSP)
- Internal search engine: Lucene
DSpace set up in UCC
- Enovation Solutions
  - set up and configured DSpace v for UCC
  - provided advice on hardware specification
  - annual maintenance contract
  - maintain ‘master copy’ of CORA source code in SVN
- Library IT
  - System administration
  - Hardware maintenance
  - …and general hand holding!
- CORA (Cork Open Research Archive) live 31 March 2009
CORA server architecture
- Web server (Dell PowerEdge R300): JSP user interface, Apache Tomcat web application server, asset store (deposited items)
- Database server (Dell PowerEdge 1950): PostgreSQL database holding metadata, organization of content, information about e-people, authorization, workflows
CORA user interface… very close to default installation
…but customization is possible: web UI
Localization – multilingual support
More customization: statistics
- Default DSpace statistics are pretty basic but can be public or private
Google Analytics
Google Analytics allows a richer and more detailed suite of statistics, such as:
- Time visitors spent on the site
- Where they came from
- Terms they used in search engines to find items
- The geographic location of visitors
- How many pages they looked at
- Which pages they started and ended their visit on
It requires JavaScript to be inserted in the footer of all your DSpace pages.
Statistics Add on – University of Minho
Used by Research UCD
So what does DSpace do?
An open source solution to capture, store, index, distribute and preserve scholarly works in any digital format:
- Capture: mediated and self archiving
- Store: bitstream, licences, descriptive & technical metadata
- Index: metadata and full text indexing
- Distribute: OAI-PMH
- Preserve: various file formats
CORA capture of content Mediated archiving by IR Manager
Self archiving by researchers: UCC Research Support System, integrated with CORA via SWORD
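As a rough illustration of what a SWORD deposit from a research system into a repository can look like, the sketch below builds (but does not send) an HTTP POST carrying a zipped package. The endpoint URL, the filename and the choice of METS packaging are assumptions made for the example, not details of the UCC integration; authentication is left out.

```python
from urllib.request import Request

# Hypothetical deposit endpoint for one collection; not a real CORA URL.
DEPOSIT_URL = "https://repository.example.org/sword/deposit/123456789/1"

# Packaging identifier for a zipped METS package (assumed to be accepted
# by the target repository).
METS_PACKAGING = "http://purl.org/net/sword-types/METSDSpaceSIP"


def build_deposit_request(url, package_bytes, filename, dry_run=False):
    """Build a SWORD-style deposit request without sending it."""
    headers = {
        "Content-Type": "application/zip",
        "Content-Disposition": f"filename={filename}",
        "X-Packaging": METS_PACKAGING,
        # X-No-Op asks the server to validate the deposit without storing it.
        "X-No-Op": "true" if dry_run else "false",
    }
    return Request(url, data=package_bytes, headers=headers, method="POST")


# Placeholder bytes stand in for a real zip file.
request = build_deposit_request(DEPOSIT_URL, b"placeholder", "article.zip", dry_run=True)
print(request.get_method(), request.full_url)
```

Passing the request to `urllib.request.urlopen` would perform the deposit; a dry run is a cautious first test against a new endpoint.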
Workflows can be customised
- No workflow: one person performs all steps
- Accept / Reject
- Accept / Reject / Edit Metadata
- Edit Metadata
- Different workflows per collection
- Change steps in workflow, e.g. in ULIR licence is step 2
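The idea of per-collection workflows can be pictured with a small model. This is only an illustrative sketch, not how DSpace stores its configuration: the collection names and step labels are invented, and the only thing taken from the slide is the set of actions available at each kind of step.

```python
from enum import Enum


class Action(Enum):
    ACCEPT = "accept"
    REJECT = "reject"
    EDIT_METADATA = "edit metadata"


# Actions a reviewer may take at each kind of step (labels are hypothetical).
STEP_ACTIONS = {
    "accept_reject": frozenset({Action.ACCEPT, Action.REJECT}),
    "accept_reject_edit": frozenset(
        {Action.ACCEPT, Action.REJECT, Action.EDIT_METADATA}
    ),
    "edit_only": frozenset({Action.EDIT_METADATA}),
}

# Hypothetical collections, each with its own ordered list of steps.
COLLECTION_WORKFLOWS = {
    "theses": ["accept_reject", "accept_reject_edit", "edit_only"],
    "journal_articles": ["accept_reject_edit"],
    "working_papers": [],  # no workflow: the depositor performs all steps
}


def needs_review(collection):
    """Return True if deposits to this collection pass through any step."""
    return bool(COLLECTION_WORKFLOWS[collection])


def allowed_actions(collection, step_number):
    """Return the actions permitted at a 1-based step of a collection's workflow."""
    steps = COLLECTION_WORKFLOWS[collection]
    if not 1 <= step_number <= len(steps):
        raise ValueError(f"{collection} has no step {step_number}")
    return STEP_ACTIONS[steps[step_number - 1]]


print(needs_review("working_papers"))
print(sorted(a.value for a in allowed_actions("theses", 2)))
```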
Store - Hierarchy
Index
1. Descriptive metadata
2. Full text indexing
Distribute
Records are exposed through OAI-PMH.
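To give a feel for what exposing records through OAI-PMH means for a harvester, here is a minimal sketch that builds a `ListRecords` request URL and reads titles out of a response. The base URL, identifier and sample response are made up for the example; a real harvester would fetch the URL and also handle resumption tokens and errors.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Hypothetical OAI-PMH base URL; substitute the repository's real endpoint.
BASE_URL = "https://repository.example.org/oai/request"

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build a ListRecords request URL, optionally limited to one set."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


# Hand-written sample of the kind of XML a ListRecords request returns.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:repository.example.org:123456789/45</identifier>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>An example thesis</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""


def titles_by_identifier(xml_text):
    """Map each record's OAI identifier to its first Dublin Core title."""
    root = ET.fromstring(xml_text)
    result = {}
    for record in root.iterfind("oai:ListRecords/oai:record", NS):
        identifier = record.findtext("oai:header/oai:identifier", namespaces=NS)
        title = record.findtext(".//dc:title", namespaces=NS)
        result[identifier] = title
    return result


print(list_records_url(BASE_URL))
print(titles_by_identifier(SAMPLE_RESPONSE))
```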
Preserve
Data files, also called bitstreams, are organized together into related sets. Each bitstream has a technical format and other technical information, which is kept with the bitstream to assist with preservation over time. DSpace is committed to going beyond reliable file preservation to offer functional preservation, where files are kept accessible as technology formats, media, and paradigms evolve over time, for as many types of files as possible.
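The kind of technical information kept alongside a bitstream can be sketched as a small record holding its size, format and a checksum, which later allows a fixity check. The field names and the choice of SHA-256 are assumptions for this example and are not taken from DSpace's own data model.

```python
import hashlib
import mimetypes
from dataclasses import dataclass


@dataclass(frozen=True)
class BitstreamRecord:
    """Technical information stored with a file (hypothetical field names)."""
    name: str
    size_bytes: int
    mime_type: str
    checksum: str
    checksum_algorithm: str


def describe_bitstream(name, data, algorithm="sha256"):
    """Record size, guessed format and checksum for a file's bytes."""
    digest = hashlib.new(algorithm, data).hexdigest()
    mime_type, _ = mimetypes.guess_type(name)
    return BitstreamRecord(
        name=name,
        size_bytes=len(data),
        mime_type=mime_type or "application/octet-stream",
        checksum=digest,
        checksum_algorithm=algorithm,
    )


def is_intact(record, data):
    """Fixity check: recompute the checksum and compare with the stored one."""
    return hashlib.new(record.checksum_algorithm, data).hexdigest() == record.checksum


# Placeholder bytes stand in for a deposited file.
original = b"example file content"
record = describe_bitstream("thesis.pdf", original)
print(record.mime_type, record.size_bytes)
print(is_intact(record, original))
```

Running the fixity check periodically is one way to detect silent corruption, which is the reliable file preservation the slide contrasts with functional preservation.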
DSpace Roadmap
Version 1.6 expected early
New features to include:
- Statistics
- Embargo facility
- Batch metadata editing
  - Tidying up of metadata (e.g. spell check)
  - Restructuring of metadata (move elements from one field to another)
  - Global find and replace
  - Add new items (metadata only) without having to create SIPs that conform to the DSpace batch import format
  - Bulk move items between collections
  - Bulk ‘map’ items into new collections
- New documentation improvements
DSpace training and resources
- Online course in CADAIR, Aberystwyth University IR: The DSpace course
- DSpace Live CD
- DSpace wiki
- DSpace mailing lists: General / Technical / Developer
- IRC (internet relay chat) channel
- Alternative service providers
- DSpace.org