Foundations of Excellence DSpace vs Fedora: Or what I do on my summer vacation.

Slides:



Advertisements
Similar presentations
Permanent Hosting, Archiving and Indexing of Digital Resources and Assets Raman Ganguly Computer Center University of Vienna.
Advertisements

IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew.
Daedalus Service Development Stephen Gallacher Lesley Drysdale.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
The Documentum Team Lance Callaway, Brooke Durbin, Perry Koob, Lorie McMillin, Jennifer Song Missouri University of Science and Technology Rolla, Missouri.
A. Grigorov, A. Georgiev, M. Petrov, S. Varbanov, K. Stefanov Building a Knowledge Repository for Life-long Competence Development.
SOFTWARE PRESENTATION ODMS (OPEN SOURCE DOCUMENT MANAGEMENT SYSTEM)
DSpace Devika P. Madalli DRTC, ISI Bangalore.
Providing Online Access to the HKUST University Archives: EAD to INNOPAC Sintra Tsang and K.T. Lam The Hong Kong University of Science and Technology 7th.
ARCHIMÈDE Presented by Guy Teasdale Directeur, Services soutien et développement Bibliothèque de l’Université Laval CARL Workshop on Institutional Repositories.
1 The IIPC Web Curator Tool: Steve Knight The National Library of New Zealand Philip Beresford and Arun Persad The British Library An Open Source Solution.
Fedora Commons: Introduction and Update Swedish National Library June 24, 2008.
Technical Tips and Tricks for User Support Mike Gardner
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
Digital Repository Service ___________________________ Yale University Library Audrey Novak, Head IS&P 7 March 2007.
Academic Services Interactive Media Managing the Web with Java JA-SIG Winter 2002 Robert Sherratt Academic Services, Interactive Media.
Demonstration of repositories Fedora (Flexible Extensible Digital Object Repository Architecture) Marie Lagerwall MIDESS Partners Meeting February 9, 2007.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
The DSpace Course Module – DSpace Installation. Module objectives  By the end of this module you will:  Understand the platforms DSpace can be hosted.
ETD Repositories Using DSpace Software Andrew Penman The Robert Gordon University 27 th September 2004.
Module - Technical Basics
Developing Interfaces and Interactivity for DSpace with Manakin Part 2: Technical and Conceptual Overview of Dspace and Manakin Eric Luhrs Digital Initiatives.
Developing an Ingest Service for Fedora Ryan Scherle Muzaffer Ozakca.
Dspace 1 Introduction to DSpace Mukesh Pund Scientist NISCAIR, New Delhi.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
PROJECT HYDRA SNEAK PEAK – ADVANCE SHOWING Brought to you by the Digital Repository Task Force Steve Marine (chair), Ted Baldwin, Dan Gottlieb, Kevin Grace,
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
University of Illinois at Urbana-Champaign A Unified Platform for Archival Description and Access Christopher J. Prom, Christopher A. Rishel, Scott W.
Overview of IU Digital Collections Search Hui Zhang Jon Dunn Indiana University Digital Library Program IU Digital Library Brown Bag October 19, 2011.
Computer Emergency Notification System (CENS)
One Platform, Two Stories. Willamette University Oregon State University.
The DPubS Development Project: Building an Open Source Electronic Publishing System David Ruddy Cornell University Library.
UVa's Digital Library CSG - September 2005 Slides courtesy of: Leslie Johnston Director, Digital Access Services, UVA Library Tim Sigmon University of.
Archivists’ Toolkit: Introduction March 12, 2007 Jody Lloyd Thompson.
ISpheresImage iSpheresImage Feature Overview and Progress Summary.
Digital Library Syllabus Uploader Will Cameron CSC 8530 October 19, 2006 Project Presentation 2.
Use & Access 26 March Use “Proof of Concept” Model for General Libraries & IS faculty Model for General Libraries & IS faculty Test bed for DSpace.
Digital Commons & Open Access Repositories Johanna Bristow, Strategic Marketing Manager APBSLG Libraries: September 2006.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science NOTE: CSG
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
Digital Repository Service Update ___________________________ Yale University Library Roy Lechich, ILTS Audrey Novak 15 Aug 2007.
The Mint Mapping tool The MoRe aggregator Vassilis Tzouvaras, Dimitris Gavrilis National Technical University of Athens Digital Curation Unit - IMIS, Athena.
DSpace - Digital Library Software
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Visionary Technology in Library Solutions VITAL Access Portal.
Permanent Hosting, Archiving and Indexing of Digital Resources and Assets Markus Höckner Computer Center University of Vienna.
ALA Annual Meeting Claire Cocco Global Product Manager CONTENTdm Users Group June 30th, 2008.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Digital Library Syllabus Uploader Will Cameron CSC 8530 Fall 2006 Presentation 1.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
CONTENTdm A proven solution September A complete digital collection management software solution Stores, manages and provides access for all digital.
Institutional Repository for Milligan College. Introduction.
Building Foundations: Fedora, Fez, and the ADR prepared by Jessica Branco Colati ADR Project Director, Colorado Alliance of Research Libraries
The world’s libraries. Connected. The Benefits of CONTENTdm Hosting Services OCLC’s Digital Lifecycle Webinar Series April 9, 2013.
7th Annual Hong Kong Innovative Users Group Meeting
VI-SEEM Data Discovery Service
Introduction, Features & Technology
UNC Digital Library Project
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
Archiving and Delivery of Student Portfolios
Introduction to DSpace
Islandora Learning Objectives
The Fedora Project April 28-29, 2003 CNI, Washington DC
Is a Content Management System in Your Future?
Presentation transcript:

Foundations of Excellence DSpace vs Fedora: Or what I do on my summer vacation

TRLN: Staff Enrichment Series: 8 Nov, 2007 Objectives Background: Why we even considered a digital repository Background: Why we even considered a digital repository FOE – version 1 FOE – version 1 DSpace & Fedora: 50,000 foot view DSpace & Fedora: 50,000 foot view FOE – version 2 FOE – version 2 FOE – version 3 FOE – version 3 Where to from here? Where to from here?

TRLN: Staff Enrichment Series: 8 Nov, 2007 Background

75 th Anniversary Duke University School of Medicine established in 1930 Duke University School of Medicine established in – year-long celebration 2005 – year-long celebration New published history New published history Articles, videos, speeches Articles, videos, speeches Alumni weekend gala event Alumni weekend gala event Josiah C. Trent Foundation Grant Josiah C. Trent Foundation Grant

TRLN: Staff Enrichment Series: 8 Nov, 2007 Digitization Project 500 images documenting the first 3 decades of the School of Medicine and Hospital 500 images documenting the first 3 decades of the School of Medicine and Hospital Image groups: Image groups: Buildings Buildings Education Education Events Events Clinical Clinical People People Technology Technology

TRLN: Staff Enrichment Series: 8 Nov, 2007 Digitization Project (cont.) Selection – Whole staff Selection – Whole staff Digitization – Outsourced to University Photography Digitization – Outsourced to University Photography Description – Technical services and Reference coordinators Description – Technical services and Reference coordinators Subject terms – Technical services coordinator, Head, Cataloging services. Subject terms – Technical services coordinator, Head, Cataloging services. Controlled vocabulary – Notetab templates and libraries Controlled vocabulary – Notetab templates and libraries

FOE1.0 XML, XSLT, and Postgresql

TRLN: Staff Enrichment Series: 8 Nov, 2007 FOE images = 600 xml files = 2 xslt stylesheet 600 images = 600 xml files = 2 xslt stylesheet Xml = EAD2002 Xml = EAD2002EAD2002 XSLT = 1) convert xml to html; 2) convert xml to SQL statements XSLT = 1) convert xml to html; 2) convert xml to SQL statements Postgresql database used only for search Postgresql database used only for search Result html Result html html html

TRLN: Staff Enrichment Series: 8 Nov, 2007 Issues SQL search statements worked…not SQL search statements worked…not No indexing by search engines No indexing by search engines JDBC JDBC I am not a programmer I am not a programmer Definite need for improvements Definite need for improvements

TRLN: Staff Enrichment Series: 8 Nov, 2007 DSpace & Fedora: A Birds-eye View

TRLN: Staff Enrichment Series: 8 Nov, 2007 Need for a Digital Repository DSpace DSpace First released in Developed by MIT Libraries and Hewlett-Packard (USA Today) First released in Developed by MIT Libraries and Hewlett-Packard (USA Today)USA TodayUSA Today Current version (download) Current version (download)download Optimal performance in a *nix environment, but should operate in any environment Optimal performance in a *nix environment, but should operate in any environment Written in Java Written in Java VERY active listservs VERY active listservs Manakin – TAMU created “front-end” which makes for easier UI localization Manakin – TAMU created “front-end” which makes for easier UI localization

TRLN: Staff Enrichment Series: 8 Nov, 2007 Need for a Digital Repository (cont.) FEDORA (Flexible Extensible Digital Object and Repository Architecture) FEDORA (Flexible Extensible Digital Object and Repository Architecture) Began as a DARPA and NSF-funded research project at Cornell in 1997 Began as a DARPA and NSF-funded research project at Cornell in , UVA and Cornell: $1M Mellon grant 2001, UVA and Cornell: $1M Mellon grant 1.0 released released 2003 Current version (download) Current version (download)download Optimal performance in a *nix env, but will run on Windows based systems Optimal performance in a *nix env, but will run on Windows based systems Written in Java Written in Java Several front-end tools developed. (more in a moment) Several front-end tools developed. (more in a moment)

TRLN: Staff Enrichment Series: 8 Nov, 2007 Side by side testing Testing environment: Testing environment: Lenovo T60, 120 G hard drive, 2 G memory, Fedora 7, kernel, java 1.5 Lenovo T60, 120 G hard drive, 2 G memory, Fedora 7, kernel, java 1.5

TRLN: Staff Enrichment Series: 8 Nov, 2007 Requirements DSpace DSpace Java1.4 + Java1.4 + Apache Ant Apache Ant Postgresql (or Oracle 9 +) Postgresql (or Oracle 9 +) Jakarta Tomcat 4.x/5.x (I used 6.x) Jakarta Tomcat 4.x/5.x (I used 6.x) Can also run on Jetty or Caucho Resin Can also run on Jetty or Caucho Resin Fedora Fedora JDK JDK Optional Optional MySQL MySQL Postgresql Postgresql Oracle 9 Oracle 9 Jakarta Tomcat Jakarta Tomcat Ant if building from source code Ant if building from source code

TRLN: Staff Enrichment Series: 8 Nov, 2007 File Size & Download times DSpace DSpace 16 mb 16 mb 1:43 over a T1 line 1:43 over a T1 line 1:13 on a T line 1:13 on a T line Fedora Fedora 72 mb 72 mb 7:49 over a T1 line 7:49 over a T1 line 1:53 over a T line 1:53 over a T line

TRLN: Staff Enrichment Series: 8 Nov, 2007 Installation time DSpace DSpace Postgresql installation and set up: 8 minutes Postgresql installation and set up: 8 minutes Ant build and configuration: 8 minutes Ant build and configuration: 8 minutes DSpace/Tomcat configuration and deployment: 8 minutes DSpace/Tomcat configuration and deployment: 8 minutes Total time to live: 24 minutes Total time to live: 24 minutes Fedora Fedora Postgresql installation and set up: 8 minutes Postgresql installation and set up: 8 minutes Fedora install: 5 minutes Fedora install: 5 minutes Total time to live: 13 minutes Total time to live: 13 minutes

TRLN: Staff Enrichment Series: 8 Nov, 2007 Initial Live View DSpace DSpace Front Page Front Page Front Page Front Page Fedora Fedora Front Page Front Page Front Page Front Page

FOE2.0 Choosing our Digital Repository

TRLN: Staff Enrichment Series: 8 Nov, 2007 Deciding Factors DSpace DSpace Off-the-shelf view Off-the-shelf view Workflow process Workflow process Individual submitters, one project admin Individual submitters, one project admin Item submission form (link here) Item submission form (link here) Bulk load script (dc, item, mapfile) Bulk load script (dc, item, mapfile) Searchbot harvestable Searchbot harvestable OAI harvestable OAI harvestable Fedora Fedora Off-the-shelf view Off-the-shelf view One submitter One submitter Item submission not intuitive (link) Item submission not intuitive (link) Bulk load script (foxml) Bulk load script (foxml) Content Models (will return) Content Models (will return) Dissemenators Dissemenators Behavior Definitions Behavior Definitions Would require extensive programming Would require extensive programming

TRLN: Staff Enrichment Series: 8 Nov, 2007 FOE2.0 = DSpace Cup is Half Full March 2006 March 2006 Foundations new home Foundations new home Data submission form Data submission form Item View bld00012 Item View bld00012bld00012 Item Update Item Update Access Restrictions Access Restrictions Handle server Handle server

TRLN: Staff Enrichment Series: 8 Nov, 2007 FOE2.0 = DSpace Cup is Half Empty Object is entered as one item Object is entered as one item DSpace is self-contained DSpace is self-contained No real way to show complex relationships No real way to show complex relationships All or nothing metadata All or nothing metadata Access Restrictions Access Restrictions Handle server Handle server Searchbot indexing: Searchbot indexing: Item 2193/77 Title:, A. Jack Tannenbaum. Issue Date:, 10-Nov Abstract:, A. Jack Tannenbaum received his medical degree from Duke University in Item 2193/77 Title:, A. Jack Tannenbaum. Issue Date:, 10-Nov Abstract:, A. Jack Tannenbaum received his medical degree from Duke University in Item 2193/77 Item 2193/77

FOE3.0 “Our goal is to never be satisfied”

Content Models Reusing datastreams (next 2 slides borrowed from EDUCASE 2004 presentation by Grizzle, Wayland, and Wilper)

TRLN: Staff Enrichment Series: 8 Nov, 2007 Atomistic Model

TRLN: Staff Enrichment Series: 8 Nov, 2007 Compound Model

TRLN: Staff Enrichment Series: 8 Nov, 2007 An old favorite blanket Fedora minimally utilized Fedora minimally utilized Primarily used for archiving Library Administrative documents (Council and Management Team minutes, and Policies and procedures) Primarily used for archiving Library Administrative documents (Council and Management Team minutes, and Policies and procedures) Use of XACML policies to restrict access (156\.16\.\d{1,3}\.\d{1,3} lock down) Use of XACML policies to restrict access (156\.16\.\d{1,3}\.\d{1,3} lock down) Began looking at front-end GUIs Began looking at front-end GUIs

TRLN: Staff Enrichment Series: 8 Nov, 2007 Front End tools Fez – A web front-end management system for Fedora that is developed in PHP. Fez functionality includes: Web-based browsing and searching; Semi- advanced searching; Complex security; Basic image handling; Dublin Core. espace.library.uq.edu.au/documentation/ Fez – A web front-end management system for Fedora that is developed in PHP. Fez functionality includes: Web-based browsing and searching; Semi- advanced searching; Complex security; Basic image handling; Dublin Core. espace.library.uq.edu.au/documentation/ Elated - ELATED is a lightweight, general-purpose application for managing digital files. ELATED is built on top of the Fedora Repository system, and can be used as a digital assets management system, an institutional repository, or to meet other collection archiving, publishing and searching needs. Dublin Core metadata entry and search; Custom metadata by collection; Automatic previews for images; Collections with simple editorial workflow; Indexing and searching of content; User feedback, enabled by collection; Select and import existing Fedora objects Elated - ELATED is a lightweight, general-purpose application for managing digital files. ELATED is built on top of the Fedora Repository system, and can be used as a digital assets management system, an institutional repository, or to meet other collection archiving, publishing and searching needs. Dublin Core metadata entry and search; Custom metadata by collection; Automatic previews for images; Collections with simple editorial workflow; Indexing and searching of content; User feedback, enabled by collection; Select and import existing Fedora objects Both require extensive programming for localization Both require extensive programming for localization

TRLN: Staff Enrichment Series: 8 Nov, 2007 External Forces at play Fall 2006 we began a project to digitize 10,000+ cytopathology slides. Fall 2006 we began a project to digitize 10,000+ cytopathology slides. Images converted to JPEG2000 to increase user experience (example) Images converted to JPEG2000 to increase user experience (example)example Archives purchased Aware JPEG2000 Image Server Archives purchased Aware JPEG2000 Image Server History of Medicine image database, Historical Images in Medicine (HIM) needed new platform History of Medicine image database, Historical Images in Medicine (HIM) needed new platform

TRLN: Staff Enrichment Series: 8 Nov, 2007 Call out of the blue VTLS – Vital VTLS – Vital Open Repositories Open Repositories

TRLN: Staff Enrichment Series: 8 Nov, 2007 FOE3.0 = Fedora/Vital Cup is Half Full June 2007 June 2007 Foundations new home (link) Foundations new home (link) Data submission (3 ways to enter items) Data submission (3 ways to enter items) Item View bld00012 Item View bld00012bld00012 Object is entered as many datastreams (fedora view) Object is entered as many datastreams (fedora view)fedora viewfedora view Vita/Fedora/Aware…interoperability Vita/Fedora/Aware…interoperability Complex relationships Complex relationships Multiple metadata streams Multiple metadata streams Handle server Handle server Searchbot indexing: Searchbot indexing: A. Jack Tannenbaum. | MeDSpace Description: A. Jack Tannenbaum received his medical degree from Duke University in per00165, A. Jack Tannenbaum kB, JPEG 2000 Image... A. Jack Tannenbaum. | MeDSpace Description: A. Jack Tannenbaum received his medical degree from Duke University in per00165, A. Jack Tannenbaum kB, JPEG 2000 Image... A. Jack Tannenbaum. | MeDSpace A. Jack Tannenbaum. | MeDSpace

TRLN: Staff Enrichment Series: 8 Nov, 2007 FOE3.0 = Fedora/Vital Cup is Half Empty Fedora is open source, Vital is not Fedora is open source, Vital is not Customization possible with programming knowledge Customization possible with programming knowledge No way at this time to implement xacml policies (work arounds exist) No way at this time to implement xacml policies (work arounds exist) Vital upgrades require full software installation Vital upgrades require full software installation Local customization can cause breaks in certain functions Local customization can cause breaks in certain functions

Conclusions and obligatory links

TRLN: Staff Enrichment Series: 8 Nov, 2007 Selected Links DSpace – Manakin Fedora – Elated Fez Vital – – MeDSpace –