Hussein Suleman University of Cape Town Department of Computer Science

Slides:



Advertisements
Similar presentations
Richard Jones, Systems Developer Technical Issues for Repository Software Theses Alive! Edinburgh University Library SHERPA Nottingham.
Advertisements

Version Policies and the OpenDOAR Policies Tool Peter Millington, University of Nottingham Version Identification Workshop, London, 22-Apr-2008.
IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
EPrints: A Biodiversity The Recent ECS publications feed on the plasma display in the foyer comes from EPrints.
IST Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,
Repositories, Learned Societies and Research Funders Stephen Pinfield University of Nottingham.
Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew.
EPrints 2.0 / March 4 th 2002 / Glasgow / Chris Gutteridge Introduction to EPrints 2.0 March 4 th 2002 Glasgow Christopher Gutteridge from the Department.
Role of librarians in the development of Institutional Repositories Susan Ashworth University of Glasgow.
Institutional Repository for CDU What’s in your bottom drawer? Ruth Quinn, Director Library and Information Access Charles Darwin University.
Lawrence Webley, Hussein Suleman, Tatenda Chipeperekwa University of Cape Town Department of Computer.
Sally Rumsey ORA Service & Development Manager Why ORA? Why Fedora?
DSpace, ETDs, Automatic Metadata Extraction Bradley Hemminger Jackson Fox Mao Ni School of Information and Library Science University of North Carolina.
Digital Asset Management for All? Visualising a Flexible DAMS Solution for Small and Medium Scale Institutions Paul Bevan Llyfrgell Genedlaethol Cymru.
What is Wrong with Digital Repository Software? Or why to Archive Now ! Hussein Suleman University of Cape Town Department of Computer.
Digital Library Architecture and Technology
How to participate in the Union Catalogue Project Hussein Suleman Sivulile – Open Access South Africa Advanced Information Management.
Hussein Suleman University of Cape Town Department of Computer Science Advanced Information Management Laboratory High Performance.
JISC CETIS Conference, Oxford, November 2004 Repositories: State of ELF “volunteer”: Martin Morrey Intrallect Ltd.
5-7 November 2014 DR Workflow Practical Digital Content Management from Digital Libraries & Archives Perspective.
Electronic Theses at Rhodes University presented by Irene Vermaak Rhodes University Library National ETD Project CHELSA Stakeholder Workshop 5 November.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
Metadata Lessons Learned Katy Ginger Digital Learning Sciences University Corporation for Atmospheric Research (UCAR)
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
The DPubS Development Project: Building an Open Source Electronic Publishing System David Ruddy Cornell University Library.
EPrints 10 Years of Digital Preservation. What is EPrints For?  EPrints offers a safe, open and useful place to store, share and manage material in the.
Database What is a database? A database is a collection of information that is typically organized so that it can easily be storing, managing and retrieving.
Digital Commons & Open Access Repositories Johanna Bristow, Strategic Marketing Manager APBSLG Libraries: September 2006.
Basudeb Adhikary Librarian, Netaji Mahavidyalaya, Hooghly, WB & Sarmistha Adhikary Librarian AKPC Mahavidyalaya, Hooghly, WB.
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
ETD Software Options Hussein Suleman University of Cape Town October 2003.
DSpace vs Fedora Ralph LeVan OCLC Research. What Do You Want From a Repository? How do you create your metadata? How do you assemble your objects? How.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Hussein Suleman University of Cape Town Department of Computer Science Digital Libraries Laboratory February 2008 Data Curation Repositories:
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
ETDs and NDLTD Hussein Suleman University of Cape Town May 2004.
Open Access and Institutional Repositories. Accra, June 2007 Institutional repositories in SA research institutions: the DISA experience Dr D Peters.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Not to Wait is the Answer: Institutional Repositories from the Bottom-up Hussein Suleman University of Cape Town July 2004.
Institutional Repositories and Licensing of Research Output advanced information management laboratory university of cape town department of computer science.
Hussein Suleman University of Cape Town Department of Computer Science Advanced Information Management Laboratory High Performance.
William J Nixon Setting up a Repository. Introduction Key Features to consider (and review) Wide Range of Technology Available –Best fit for purpose –Clear.
GNU EPrints 2 Overview Christopher Gutteridge 19 th October 2002 CERN. Geneva, Switzerland.
Portlet Development Konrad Rokicki (SAIC) Manav Kher (SemanticBits) Joshua Phillips (SemanticBits) Arch/VCDE F2F November 28, 2008.
Beyond the Repository: Research Systems, REF & New Opportunities William J Nixon Digital Library Development Manager.
Joseph JaJa, Mike Smorul, and Sangchul Song
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
VI-SEEM Data Repository
Jay Bhatt Drexel University Libraries
Institutional Repository at NIO: Inspiration to Implementation
Sophia Lafferty-hess | research data manager
SCALABLE OPEN ACCESS Hussein Suleman
Grey Literature Repositories and CRIS in a SOA Environment
Digital Repositories The management of learning objects
Implementing an Institutional Repository: Part II
NSDL Data Repository (NDR)
IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.
Jörgen Eriksson Setting up an institutional archive: some technical and organizational considerations Jörgen Eriksson
Malte Dreyer – Matthias Razum
Institutional Repositories
This presentation will probably involve audience discussion, which will create action items. Use PowerPoint to keep track of these action items during.
Managing Private and Public Views of DDI Metadata Repositories
Dataverse for citing and sharing research data
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
RCSI institutional repository rcsi
Presentation transcript:

What is Wrong with Digital Repository Software? Or why to Archive Now ! Hussein Suleman hussein@cs.uct.ac.za University of Cape Town Department of Computer Science Advanced Information Management Laboratory Sivulile November 2006

Outline 1 What are Digital Object Repositories From Closed to Open Access From Heritage to Education Using Existing Software Building Software

What is a Digital Object Repository? source: DISA, Univ. of KZN http://disa.ukzn.ac.za

Heritage Repository Object source: Mayibuye, DISA, Univ. of KZN http://disa.ukzn.ac.za

DORs in Education source: Worldwide Greenhouse Education, Univ. of Vermont http://www.uvm.edu/wge/education.htm

Closed Repositories source: IEEE Explore http://ieeexplore.ieee.org/Xplore/home.jsp

Open Access Repositories - Research source: arXiv.org, Cornell University http://arxiv.org/

Open Access Repositories - Teaching Consortium for the Advancement of Undergraduate Statistics Education http://www.causeweb.org/resources/

Why Open Access? Lawrence (2001): clear correlation between open access and impact in CS Eysenbach (2006): double citation impact for open access as compared to closed journal and closer to home… Suleman (2006): open access repository part of normal operations of department – formal record of research open access archive used extensively by external parties indexed aggressively by search engines resources accessed almost immediately after deposit and consistently thereafter

OA Repositories in SA source: UCT Lawspace, Univ. of Cape Town http://lawspace.law.uct.ac.za

OpenDOAR’s view on South Africa

How to Build a Digital Repository source: DSpace: An Open Source Dynamic Digital Repository, DLib Magazine http://www.dlib.org/dlib/january03/smith/01smith.html

(Open Source) Repository Packages ?

Outline 2 Using Existing Software What Repository Software does RIGHT? Arguments against Repositories Future (Im)perfect

What does repository software do RIGHT? Infrastructure Digital Objects are stored/archived Users can access items from the Web Services Search full-text and metadata Browse by author/title/etc. Interoperability OAI-PMH interoperability Ability to ingest and export items Security User roles and authentication

Arguments against OA Repositories Digital Repositories are a lot of hype – the technology is actually not yet mature enough to be practical or usable! It is so difficult for us as poor institutions in South Africa, with few staff and inadequate facilities and training.

Training, Policies, Staff, Tools, … Have you been to a Sivulile workshop? Policies Listen to the other speakers! Staff What proportion of your library staff are IT vs. how many customers use IT rather than physical resources? Tools Are they any good? or are the tools pure EVIL?

Repository Evils ? No integration with Windows/Linux/BSD Modern operating systems have packaging systems – many IR systems are still distributed in “source code”. Why not?

Repository Evils ? Low-level Components No clean external API to communicate with services within most systems. None of DSpace, EPrints, Greenstone …

Repository Evils ? Customisation requires programmer! Date: Mon, 23 Oct 2006 20:58:20 +0100 From: Christopher Gutteridge <cjg@ecs.soton.ac.uk> To: "EPrints.org Technical List" <eprints-tech@ecs.soton.ac.uk> Subject: Re: [EP-tech] Chicago citation style and e-prints To show the family name first last you can do @creators;order= fg@ or gf gf= given family fg= family, given To change the seperators to what you wand add the following to your local phrase file (not system-phrases) <epp:phrase id= "lib/metafield:join_name">, </epp:phrase> <epp:phrase id= "lib/metafield:join_name.last"> and </epp:phrase> But that can't be set on a per-citation type level.

Repository Evils ? Customisation requires programmer! Date: Wed, 18 Oct 2006 10:24:45 +1300 (NZDT) Subject: Re: [greenstone-users] Another question: multivalued fields From: sw64@cs.waikato.ac.nz Hello Ed, You can use AZCompactList classifier, for which the "allvalues" parameter is the default. Also, you can use [sibling:Subject] or [sibling:Author] format statement to display all multiple values of Subject or Author in VList, Regards Shaoqun > Thank you for your help a few weeks ago. I did get a sample collection > working, and have just heard that the organisation is keen to continue – I > now need tog et a couple of things working that I didn't finish before. > One of those is allowing for multiple values for Author and Subject. I have > created additional columns in the .cfg file and put some test data in. > Where is the "allvalues" parameter put and can it be done through the GLI > or do I have to edit the config file?

Repository Evils ? Poor Scalability Most systems do not scale well beyond small collections. EPrints DSpace source: Technical Evaluation of selected Open Source Repository Systems, Catalyst IT

Repository Evils ? Identity not easily removable DSpace is the name of the software, not the archive!

Repository Evils ?

Repository Evils ? Buy-In / Lock-In How easy is it to switch from DSpace to EPrints to Fedora to … ?

Repository Evils Are these issues significant roadblocks? Are they being addressed at all?

Current Development Directions Problem Solution Low-level interfaces Greenstone 3 using Web Services Scalability Key concern in DSpace 3 architecture Configurability Tools are becoming more automated and flexible over time OS integration Will improve as tools become mature and development stabilises Lock-in and Identity Will improve when tools don’t need to be “sold”

Current Research Directions Fedora Generic interface to scalable repository Pathways / OAI-ORE (Submit) Interface to repository components OCKHAM Service registry for composition of systems from components DILIGENT Scalability of repositories in a grid AJAX Interactive Web-based interfaces

Future Im(perfect) Repository software gets a lot RIGHT, Repository software still has issues, Software use should not require a programmer Software is a means to an end, not an end Software should be appropriately engineered for reuse on a large scale like other OSS tools Software should easily integrated into other systems … but most of these issues have been noted and/or are being actively addressed

Bottom Line We NEED Digital Object Repositories Why? To store and disseminate knowledge When? NOW ! How? Use popular software packages DONT PANIC! You’re not alone The software has issues but is constantly improving The easiest possible way So… But that doesn’t do everything I want ! DONT WAIT! The need is greater than the (perceived) technical problems

Open Access and Institutional Repositories NOW!

the end.