DSpace - Digital Library Software Kaushal Giri Dy. Manager – Knowledge Centre Export-Import Bank of India kaushal.giri@eximbankindia.in Nilesh Shewale Librarian Don Bosco Institute of Technology shewale.nilesh2008@gmail.com
Outline of Talk Introduction Features of Digital Library Why DSpace Digital Library Architecture and System Requirement What is DSpace? H/W and S/W requirements What DSpace can do?
Can I Retrieve Relevant Pages ? Electronic Books Read and make notes Other Digital Media Search for hours
Introduction Digital libraries encompass a whole range of information services related work such as Organization of digital information Information retrieval User interface Archiving and preservation Services and social issues Evaluation and applications to particular areas
Features of Digital Library Low cost, including all hardware and software components Technically simple to install and manage Robust Scalable Open and inter-operable Modular User Friendly Multi-user (including both searching and maintenance) Multimedia digital object enabled Platform independent (including both client and server components) interoperable
Why DSpace Digital Library DSpace is An open source technology platform which can be customized and its capabilities can be extended A service model for open access and/or digital archiving for perpetual access A platform to build an Institutional Repository and the collections are searchable and retrievable by/on the Web To make available institution-based scholarly material in digital formats. The collection will be open and interoperable.
Architecture and System Requirement The DSpace system is organized into three layers The Storage Layer: responsible for physical storage of metadata and content The Business Layer: deals with managing the content of the archive, users of the archive (e-people), authorization, and workflow The Application Layer: containing components that communicate with the networked world outside of the individual DSpace installation
DSpace is a joint project of MIT Libraries and Hewlett-Packard Labs
What is DSpace? Digital Object management system Create, search and retrieve digital objects Facilitate preservation of digital objects An open source software Allows open access and digital archiving Allows building Institutional Repositories
H/W and S/W requirements UNIX recommended (Java-based program should run on anything) Open source, built on Apache web server and Tomcat Servlet engine Uses postgreSQL or Oracle relational database
What DSpace can do? Captures Digital content in any formats directly from creators (e.g. researcher, authors) Describes Descriptive, technical, rights metadata Persistent identifiers OAI-PMH version 2.0 compliant Allow metadata creation
Possible types of Content Preprints, articles Postprints Technical Reports Conference Papers Theses/Dissertations Datasets e.g. statistical, geospatial, scientific
Formats of Content Images visual, scientific, etc. Audio files Video files Digitized library collections
Information Model Communities Departments, Labs, Research Centers, Schools… Collections Items Files (bitstreams) Multiple formats - same content Complex objects – multiple files
Intellectual Property Click-through license during submission Grants DSpace non-exclusive right to acquire, manage, preserve, distribute the item Does not grant DSpace copyright Copy of license stored with item
Goodies Modular architecture, well-defined APIs 100% open source Programmed in java RDBMS and SQL for metadata CNRI “handles” for persistent identifiers OpenURL linking OAI-PMH for exposing metadata
Backend Technology Apache, Tomcat, OpenSSL/mod_ssl Java PostgreSQL/Oracle CNRI Handle System 5 (persistent ids) Lucene Search Engine
Standards Dublin Core only Descriptive metadata only OAI-PMH v 2.0 (Open Archive’s Initiative Protocol for metadata harvesting) UNICODE Compliant
Capabilities Exports in XML format Supports crosswalks through OAI-PMH DC (Dublin Core) Qualified DC METS (Metadata Encoding and Transmission Standard MODS (Metadata Object Description Schema – sibling of MARCXML) Can be extended to any Metadata Schema
Customization Screens (Manakin) E-mails Any language interface Metadata Input-forms Display of results Fields to be Indexed Access restrictions
Advanced Feature Grid Compliant (Storage) LDAP authentication Usage statistics generation SFX Server integration RSS (Really Simple Syndication) Item Recommendation to a friend Use of Thesaurus (though not OWL/SKOS/RDF) Full-text indexing of PDF, MS-WORD files
& Questions nswers A
Thanks for Your Patience & Attention! kaushal.giri@eximbankindia.in shewale.nilesh@gmail.com