Building an Institutional Research Repository from the Ground Up: The ARROW Experience Dr Andrew Treloar Project Manager, Strategic Information Initiatives.

Slides:



Advertisements
Similar presentations
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
Advertisements

October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
DSpace Devika P. Madalli DRTC, ISI Bangalore.
ARROW Progress Report to CAUL September 2004 Geoff Payne, ARROW Project Manager.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
ARROW Progress Report to CAUL, April 2005 Cathrine Harboe-Ree ARROW Project Leader.
ARROW Institutional Repositories Presentation to the APSR / University of Tasmania Repositories Seminar 4 May 2006 Geoff Payne Director Library Corporate.
Fedora Commons: Introduction and Update Swedish National Library June 24, 2008.
MacKenzie Smith Associate Director for Technology MIT Libraries.
The ARROW Project: A consortial institutional repository solution, combining Open Source and proprietary software David Groenewegen ARROW Project Manager.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
Teula Morgan The Adaptable Repository: Swinburne Online Journals.
DSpace Rea Devakos and Gabriela Mircea University of Toronto Libraries.
The KnowledgeBank: Powered by DSpace Laura Tull Systems Librarian Ohio State University Libraries WiLSWorld July 27, 2004.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Digital Repository Service ___________________________ Yale University Library Audrey Novak, Head IS&P 7 March 2007.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
California Digital Library eScholarship Repository Int’l Conference on Digital Institutional Repositories 9-10 December 2004, Hong Kong Catherine H.Candee.
Planning for a University of Guelph Institutional Repository: DSpace Implementation Helen Salmon & Ron MacKinnon Presentation to Information Services Committee.
The National Library’s role in the Australian Research Information Infrastructure projects Warwick Cathro National Library of Australia Coalition for Networked.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Publishing Solutions for Contemporary Scholars: The Library as Innovator and Partner Sarah E. Thomas University Librarian Cornell University Ithaca, NY.
Australian Partnership for Sustainable Repositories AUSTRALIAN PARTNERSHIP FOR SUSTAINABLE REPOSITORIES Caul Meeting 2005/2 Brisbane 15.
ETD Repositories Using DSpace Software Andrew Penman The Robert Gordon University 27 th September 2004.
Digital Library Architecture and Technology
Dspace 1 Introduction to DSpace Mukesh Pund Scientist NISCAIR, New Delhi.
Geoff Payne ARROW Project Manager 1 April Genesis Monash University information management perspective Desire to integrate initiatives such as electronic.
Alternative Models of Scholarly Communication: The "Toddler Years" for Open Access Journals and Institutional Repositories Greg Tananbaum President The.
The DSpace Course Module – An introduction to DSpace.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
DSpace. TM 2 Agenda  Introduction to DSpace  DSpace community  Institutional Repository  Easy to add/find content in DSpace  Building Online Communities.
Building an Institutional Research Repository from the Ground Up: The ARROW Experience Dr Andrew Treloar Project Mgr, Strategic Information Initiatives.
1 CS 502: Computing Methods for Digital Libraries Lecture 28 Current work in preservation.
A disaggregated model for preservation of E-Prints Gareth Knight SHERPA DP Project Arts and Humanities Data Service.
Digital/Open Access repositories Paul Sheehan Director of Library Services DCU HEAnet National Networking Conference Athlone 11 th November 2005.
Group-based Repositories in Oz Diane Costello Council of Australian University Librarians ICOLC Montreal 2007.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
PNC 2005 Hawaii Toward an Institutional Repository at the Data Service of NDAP Ya-ning Chen, Shu-jiun Chen Computing Centre, Academia Sinica Taiwan.
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
Digital Commons & Open Access Repositories Johanna Bristow, Strategic Marketing Manager APBSLG Libraries: September 2006.
May 2, 2013 An introduction to DSpace. Module 1 – An Introduction By the end of this module, you will … Understand what DSpace is, and what it can be.
Getting Your Publications to the Masses: Using W&L’s Institutional Repository to Enhance Scholarly Communication Elizabeth Anne Teaff, MLIS August 31,
Uganda Scholarly Digital Library (USDL) Makerere University’s Institutional Repository By Margaret Nakiganda URL:
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
Institutional Repositories: One Road to Open Access William J Nixon, Service Development DAEDALUS, University of Glasgow JISC CNI Roads to Open Access.
ARROW Institutional Repositories for Managing e-Theses Presentation to ETD September 2005 Geoff Payne, ARROW Project Manager.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
From ePrints to eSPIDA: Digital Preservation at the University of Glasgow William J Nixon, Service Development DAEDALUS, University of Glasgow DPC: Digital.
DSpace - Digital Library Software
Digital Repositories: Concepts and Issues By Devendra. S. Gobbur (Sr) Assistant Librarian, Gulbarga University, Gulbarga. 10 NOV, NOV, 2009.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
California Digital Library eScholarship: a UC Publishing Initiative Catherine H.Candee Director, Publishing and Strategic Initiatives Office of Scholarly.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Hussein Suleman University of Cape Town Department of Computer Science Advanced Information Management Laboratory High Performance.
Institutional Repository “A university-based institutional repository is a set of services that a university offers to the members of its community for.
Scholarly works, research, reports, publications What is an Institutional Repository? Focus on Research Groups Promoting Physics Faculty, Students and.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Ktisis: Building an Open Access Institutional and Cultural Repository Alexia Kounoudes, Petros Artemi, Marios Zervas Library and Information Services,
VITAL and the ARROW solution
OceanDocs Digital Repository of Marine Science Research Outputs
? What is Institutional Repository for Rutgers University
IRUS-UK and ORCIDs Paul Needham Cultivating ORCID: Encouraging growth
Introduction, Features & Technology
VI-SEEM Data Repository
Introduction to DSpace
DPubS: An Open Source Electronic Publishing System
Publishing Solutions for Contemporary Scholars: The Library as Innovator and Partner Sarah E. Thomas University Librarian Cornell University Ithaca, NY.
Presentation transcript:

Building an Institutional Research Repository from the Ground Up: The ARROW Experience Dr Andrew Treloar Project Manager, Strategic Information Initiatives & ARROW Technical Architect Status Snapshot as of September 2004 (pre-Bandicoot)

Vacant Lot

Context – Global  Increasing focus on content as institutional asset  Increasing proportion of this content is now born- digital or re-born digital  Wide uptake of software such as Dspace and eprints.org  Open Access scholarship movement gathering strength worldwide  Recent UK House of Commons STC report calling for establishment of institutional research repositories and mandated deposit

Context – Australian  Higher Education Information Infrastructure Advisory Committee (HEIIAC) report in Nov 2002 identified need for Research Information Infrastructure  DEST arranged Digital Object Repository Management meeting in Sydney in May 2003  DEST called for RII bids in June 2003  Four successful:  Australian Digital Theses (ADT)  Australian Partnership for Sustainable Repositories (APSR)  Meta Access Management System (MAMS)  Australian Research Repositories Online to the World (ARROW)

Design Brief

Requirements – Content Streams  E-Prints  Pre-prints, postprints, working papers, etc  Digital theses  Masters and Ph. D.  Electronic Publishing  Open-access ejournals  DEST Returns  Actually, database behind the returns  Non-University Research  ‘Scholar in the Garden Shed’

Requirements – Content Types  Based on Dspace philosophy:  Lots of digital material is already lost  Most digital material is at risk  Preserving bits is better than nothing  It is important to capture as much information as possible  It will be necessary to evaluate cost/benefit trade-offs over time  Decided to divide content into three types:  Supported  Known  Unsupported  Long list of actual types in referenced paper (URL at end)

Architectural Drawings

Architecture Considerations  Common Repository  because boundaries between Research and Teaching/Learning are very fluid  Series of Content Workflow and Management layers  to handle ingest/management of content  Exposure of content in variety of ways  to maximise access

ARROW OLAD

Building Materials - Foundation

Repository  Repository decision determines a number of other aspects of project  Functionality  Type of application development  Lots of options available (refer  Version 3 of this report due out soon  Careful examination of alternatives narrowed quickly to focus on DSpace & FEDORA

Repository – Dspace  Joint activity between MIT Libraries and Hewlett-Packard to develop a software system to enables institutions to:  Capture and describe digital works using customized workflow processes  Provide access to an institution's digital works so users can search and retrieve items in the collection  Preserve digital works over the long term  Being made available under the BSD open source license to other groups to run as-is, or to modify and extend as needed.  Can best be thought of as a general-purpose repository application, with a series of both hard-wired and preferred behaviours  Designed to provide stable long-term storage needed to house the digital products of MIT faculty and researchers

Repository – FEDORA  Not the RedHat FEDORA...  Flexible Extensible Digital Object and Repository Architecture  Joint venture between UVA Library and Cornell CS  Both a software platform and an architecture  Open source, digital object repository system using public APIs exposed as web services  Best thought of as services-mediation infrastructure, rather than an off-the-shelf application  Underlying object-based model

Repository – Decision  After lots of due diligence, decided to go with FEDORA:  better/cleaner underlying architecture (flexible not hierarchical)  easier to build on top of (APIs exposed as web services)  designed from ground up as services provider and mediator (not packaged application)  powerful idea of objects and disseminators (content behaviours)

Construction Strategy: Sub-Contract or DIY?  Original bid assumed that project would hire and manage development team  ARROW Project Manager (Geoff Payne) realised we could do much better by sub-contracting development work to a company already familiar with FEDORA:  outsource risk  save time by avoiding initial learning curve  partner in way that met ARROW and company needs  increase attractiveness of FEDORA  build a sustainable support and enhancement model

VTLS the Builder  ARROW entered into contract with VTLS (Blacksburg, VA) to  acquire VITAL 1.0 (and successor versions)  extend the functionality of FEDORA either by contributing back to the core FEDORA code or by writing a series of ARROW-commissioned modules  ARROW-commissioned modules to be open-sourced using the same license as the FEDORA code  VTLS will be able to build products on top of these new ARROW-commissioned modules, but so will anyone else

Open-Access Publishing  VTLS won’t be writing all the modules  Need module to provide simple OA ejournal publishing  Have decided to use the Open Journal System ( from the Public Knowledge Project at UBChttp://  Provides high-level of devolved functionality  Still deciding how best to integrate this with rest of ARROW

Building Materials - Frame

Application Framework  ARROW-commissioned modules will  call FEDORA API-A (Access) and API-M (Management) web services  expose themselves as Web Services  Possible that combination of ARROW-modules and FEDORA will lead to refactoring of existing APIs into:  API-A (Access)  API-S (Search)  API-M (Management)  API-W (Workflow)

FEDORA Development Consortium  Announced at same time as ARROW-VTLS deal  Joint activity of FEDORA, VTLS, ARROW, and others  partners selected on ability to contribute and resources to make it happen  Rest of 2004 will be spent working out how this might function  Work towards API-W will be used as process testbed

Building Materials - Doors and Windows

Search and Exposure  Exposure of metadata for OAI-PMH harvesting  Open Archives Initiative - Protocol for Metadata Harvesting  Each repository will be an OAI Data Provider  Support for direct searching via SRU/SRW  Simpler version of Z39.50  Exposure of full text (including derived full text) for spidering by Google and other search engines)  Local search gateways at each ARROW site   National Resource Discovery Service offered by NLA   NLA acting as OAI Service Provider (as well as Data Provider with their non-uni research repository)  Possible RSS feeds later

ARROW Web Site Project Information National Library of Australia Swinburne UNSW Monash ARROW Repository Digital Object Storage using Fedora & VITAL Members only area Meeting Minutes etc National Library of Australia ARROW Resource Discovery Service Using TeraText to index metadata harvested by OAI PMH ARROW Open Access Journal Publishing System Using OJS from Public Knowledge Project Internet Search Engines Capture text exposed by ARROW Repositories ARROW Branded Services Profile Internet

Building Site

State of Development  Funding commenced in February  A$ 3.66*10 6 over 3 years  Project Manager appointed in February  Contract with VTLS signed in June  FEDORA Phase 2 funding secured in June  US$ 1.4*10 6 over 3 years  Anticipated delivery of ARROW Phase 1 (Bandicoot) functionality in September  Anticipated delivery of ARROW Phase 2 (Bilby) functionality in February 2005

Phased Deliverables  DEST Metadata  Collections  Copyright support  Object validation  Search engine support  Still Images  PDF  RTF  XHTML  SRU/SRW  Web-based XML Editor  SMIL  Audio  Video  DEST Reporting  Multiple Object Viewing and Editing

Open House?

What we’ve learned already  All IT projects involve People, Processes and Technology. In addition, this one has a heavy focus on Content.  These proportions are going to change over time Component People5%20%35% Processes10%20%10% Technology75%20%5% Content10%40%50%

ARROW Availability  ARROW partners (NLA, Monash, UNSW, Swinburne) will be testing and refining beta software this year and early next year  Hope to be able to offer ARROW more broadly around mid-2005  will be regularly updated with news and more information

Questions?   Project Manager   Technical Architect   Project web site   Link to updated version of AusWeb04 paper about development of ARROW architecture