A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon, Harry Sidhunata

Presentation transcript:

A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon, Harry Sidhunata

UNSW Australia The University of New South Wales at a Glance:

UNSW Library Repository Service
– UNSW Library has an increasingly important role in the management and curation of UNSW research materials
– Library Repository Service (LRS) supports this by providing Web-based repositories to the UNSW academic community
– Diagram: Research Centre, School and Faculty repositories, each built from Fedora, Primo and Deposit/Edit Web-forms

Outline
– Fedora 3 repositories at UNSW Library
– UNSW Library Fedora 3-to-4 migration pilot
– UNSW Library use cases and Fedora 4 data models
– Lessons learned
– Future plans

Fedora 3 repositories at UNSW Library
UNSWorks – the online institutional repository for PhD and Masters by research thesis material
– records, stores and disseminates digital preservation information
– integrated with UNSW Research Output System (Symplectic Elements)
ResData – research data management planning and publishing service
– integrated with UNSW Long-term Research Data Store (LTRDS) service and other enterprise systems

Fedora 3 repositories at UNSW Library
Faculty-based repository services
– based on a standard, extensible framework
– customised to support specific requirements of individual disciplines
– enable discovery, accessibility and citation of resources
– Example: Faculty of Arts and Social Science repository

UNSW Library Fedora 3-to-4 Migration Pilot
Goal:
– formulate a strategy for upgrading the Library’s existing Fedora 3-based repositories
Criteria:
– compatibility with existing institutional data models
– interoperability with related repository applications and workflows
Use Cases/Test beds: ResData and UNSWorks
Timeline: Jan-May 2015

Migration Process
– Use cases: defined migration use cases based on ResData and UNSWorks
– Fedora 4 test repository: deployed a test Fedora 4 instance
– Fedora 4 features evaluation: REST APIs, versioning of records, integration with external triple stores; comparison with Fedora 3 functions

Migration Process
– Fedora 4 data model design: analysed the default Fedora 4 data model and PCDM; mapped Fedora 3 object and datastream properties to Fedora 4
– Fedora 4 plug-ins evaluation: OAI-PMH module; Audit service
– Implementation strategy formulation: formulated a strategy for implementing the Fedora 4 REST API based on the Fedora 4 data model design and the results of the Fedora 4 features evaluation
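To make the datastream mapping step concrete, here is a minimal sketch of how a migration plan might classify Fedora 3 datastreams. The rule used (RDF datastreams and RELS-EXT/RELS-INT become RDF properties, everything else becomes a binary) is an assumption for illustration, not the mapping UNSW adopted, and `plan_datastream` and `plan_object` are hypothetical names.

```python
# Hypothetical classification rule; the actual UNSW mapping is not given in the slides.
RDF_MIME_TYPES = {"application/rdf+xml", "text/turtle"}
RELATIONSHIP_DATASTREAMS = {"RELS-EXT", "RELS-INT"}


def plan_datastream(ds_id, mime_type):
    """Return the assumed Fedora 4 target for one Fedora 3 datastream."""
    if ds_id in RELATIONSHIP_DATASTREAMS or mime_type in RDF_MIME_TYPES:
        return "rdf-properties"  # triples attached to the Fedora 4 resource
    return "binary"  # stored as a non-RDF binary resource


def plan_object(datastreams):
    """Map each (datastream id, MIME type) pair of an object to its target."""
    return {ds_id: plan_datastream(ds_id, mime) for ds_id, mime in datastreams}


if __name__ == "__main__":
    thesis = [
        ("MODS", "text/xml"),
        ("THESIS", "application/pdf"),
        ("PREMIS", "application/rdf+xml"),
        ("RELS-EXT", "application/rdf+xml"),
    ]
    for ds_id, target in plan_object(thesis).items():
        print(ds_id, "->", target)
```

A table-driven plan like this can be reviewed against the institutional data model before any content is moved.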

Use Case 1: UNSWorks System Architecture

Use Case 1: UNSWorks Fedora Object Model - Datastreams
– Metadata (MODS – XML)
– Thesis file (PDF, DOC), one or more per object
– Preservation Metadata (PREMIS – RDF), one per file
– Supporting docs/Rights/licence (TXT, DOC)
– RELS-EXT (Handle)
– RELS-INT (Resource type, Preservation software)
– EVENTS (PREMIS – RDF)

Use Case 2: ResData System Architecture
Diagram components: Deposit/Edit, Fedora, UNSW HR Database, Harvesting Service (JOAI), MySQL 5.5, Storage Provisioning Service, UNSW IT LTRDS

Use Case 2: ResData Fedora Object Model - Datastreams
– Dataset (RDF): RELS-INT (DOI, Handle, versioning); RELS-EXT (Resource type)
– Activity/project (RDF): RELS-INT (DOI, Handle, versioning); RELS-EXT (DOI, Resource type)
– Person (RDF): RELS-INT (DOI, Handle, versioning); RELS-EXT (Resource type)
– RDMP (RDF): RELS-EXT (Resource type, storage info)
Relationship cardinalities shown in the diagram: 1, *, *, 1

Fedora 4 Data Model – PCDM adaptation Source:

Fedora 4 Data Model for UNSWorks

Fedora 4 Data Model for ResData

Fedora 4 Data Model Design – key considerations
Adaptation of PCDM
– PCDM hierarchical model is similar to the UNSWorks model
– Additional granularity needed to
o record preservation and migration events
o manage access-related information at both object and collection levels
o ensure interoperability with ResData, which does not conform to a hierarchical organisation

Fedora 4 Data Model Design – key considerations
Identifiers and URL structures
– Built-in Pairtree algorithm for generating unique identifiers and limiting the number of children under a single resource
– Legacy Fedora 3 PIDs as “data properties” of migrated resource
– Cool URIs with embedded semantic information
– Example: /rest/[container name]/[container Pairtree id]/[resource id]
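The URL pattern above can be sketched as follows. This is an illustration only: it assumes four two-character Pairtree segments taken from the start of a UUID-style identifier, and `pairtree_segments` and `resource_path` are hypothetical helper names, not part of Fedora 4.

```python
from typing import List


def pairtree_segments(identifier: str, levels: int = 4, width: int = 2) -> List[str]:
    """Split the leading characters of an identifier into fixed-width path segments.

    The depth (4) and width (2) are assumptions chosen for this sketch.
    """
    compact = identifier.replace("-", "")
    if len(compact) < levels * width:
        raise ValueError("identifier too short for the requested Pairtree depth")
    return [compact[i * width:(i + 1) * width] for i in range(levels)]


def resource_path(container: str, resource_id: str) -> str:
    """Build /rest/[container name]/[container Pairtree id]/[resource id]."""
    parts = [container] + pairtree_segments(resource_id) + [resource_id]
    return "/rest/" + "/".join(parts)


if __name__ == "__main__":
    print(resource_path("unsworks", "7f3a9c10-1b2c-4d5e-8f90-abcdef012345"))
```

Because each intermediate segment is only two characters wide, any single node in the hierarchy has a bounded number of children, which is the property the slide refers to.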

Fedora 4 Data Model Design – key considerations
Audit history and versioning
– Legacy Fedora 3 FOXML will be stored as a binary resource in Fedora 4
– Fedora 4 Audit Service to be used to record post-migration audit information
– Legacy creation dates for Fedora 3 objects cannot be migrated; custom properties to be used
– Legacy Fedora 3 PIDs as “data properties” of migrated resource
– Fedora 4 versioning to be used to record Fedora 3 versions
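One way to carry the legacy PID and creation date as custom properties is a SPARQL Update body sent to the resource with an HTTP PATCH, which the Fedora 4 REST API accepts. The sketch below only builds that body. The `legacy:` namespace and the property names `fedora3Pid` and `fedora3Created` are invented for illustration and are not UNSW's ontology.

```python
def legacy_properties_update(pid: str, created: str) -> str:
    """Build a SPARQL Update that attaches legacy Fedora 3 values to a resource.

    The namespace and property names are hypothetical placeholders.
    """
    for value in (pid, created):
        if '"' in value or "\\" in value or "\n" in value:
            raise ValueError("value contains characters that need escaping")
    return (
        "PREFIX legacy: <http://example.org/legacy#>\n"
        "PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>\n"
        "INSERT {\n"
        f'  <> legacy:fedora3Pid "{pid}" ;\n'
        f'     legacy:fedora3Created "{created}"^^xsd:dateTime .\n'
        "} WHERE { }\n"
    )


if __name__ == "__main__":
    print(legacy_properties_update("unsworks:1234", "2010-05-17T03:21:00Z"))
```

The empty relative IRI `<>` refers to the resource the request is sent to, so the same body can be reused for every migrated object.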

Fedora 3-to-4 Migration – Implementation Strategy
– Fedora 4 to be used as “headless” repository instances
– Fedora 4 REST API to be used by custom UIs and clients to manage CRUD of digital objects
– Fedora 4 integrated with external triplestore to enable access control via custom UIs and clients
– Update/re-factor existing Java-based Fedora 3 clients to support Fedora 4
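As a rough illustration of the CRUD pattern a custom client would follow against a headless Fedora 4 instance, the sketch below builds (but does not send) the four kinds of HTTP request using Python's standard library. The base URL and the function names are assumptions. UNSW's own clients are Java-based, so this is not their implementation.

```python
from urllib.request import Request

BASE_URL = "http://localhost:8080/rest"  # hypothetical test instance


def create_binary(path: str, payload: bytes, mime_type: str) -> Request:
    """PUT a binary (for example a thesis PDF) at a chosen path."""
    return Request(f"{BASE_URL}/{path}", data=payload,
                   headers={"Content-Type": mime_type}, method="PUT")


def read_resource(path: str) -> Request:
    """GET the RDF description of a resource as Turtle."""
    return Request(f"{BASE_URL}/{path}",
                   headers={"Accept": "text/turtle"}, method="GET")


def update_properties(path: str, sparql_update: str) -> Request:
    """PATCH a resource's properties with a SPARQL Update body."""
    return Request(f"{BASE_URL}/{path}", data=sparql_update.encode("utf-8"),
                   headers={"Content-Type": "application/sparql-update"},
                   method="PATCH")


def delete_resource(path: str) -> Request:
    """DELETE a resource."""
    return Request(f"{BASE_URL}/{path}", method="DELETE")


if __name__ == "__main__":
    for req in (create_binary("unsworks/demo/thesis", b"%PDF-", "application/pdf"),
                read_resource("unsworks/demo"),
                update_properties("unsworks/demo", "INSERT { } WHERE { }"),
                delete_resource("unsworks/demo")):
        print(req.get_method(), req.full_url)
```

Each request could then be sent with `urllib.request.urlopen(req)` once a repository is actually running at the assumed address.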

Lessons learned
Review of the existing institutional information models has identified a need for
– better standardisation of existing RDF ontologies
– migration of existing XML schemas to RDF ontologies to ensure more efficient interoperability between repositories

Future plans
– Investigate access control-related ontologies, such as WebACL, to enable standards-based access control of Fedora 4 objects
– Evaluate existing Open Source tools for Fedora 3-to-4 migrations
– Enhance/standardise UNSW ontologies according to the Fedora 4 model developed
– Continue to be a platinum member of the Fedora community

Useful links
– Upgration Pilot – UNSW: +-+UNSW
– UNSWorks: eb/action/search.do?vid=UNSWORKS&reset_config=true
– ResData: s