A Fedora 3 to 4 Migration Case Study for UNSW Australia Library. Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane. UNSW Library: Arif Shaon, Harry Sidhunata

Presentation transcript:

A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon, Harry Sidhunata

UNSW Australia The University of New South Wales at a Glance:

UNSW Library Repository Services
– UNSW Library has an increasingly important role in the management and curation of UNSW research materials
– Library Repository Services (LRS) supports this role by providing Web-based repositories to the UNSW academic community
[Diagram: Research Centre, School and Faculty repositories, each comprising Fedora, Primo and Deposit/Edit Web-forms]

Outline
– Fedora 3 repositories at UNSW Library
– UNSW Library Fedora 3-to-4 migration pilot
– UNSW Library use cases and Fedora 4 data models
– Lessons learned
– Future plans

Fedora 3 repositories at UNSW Library
UNSWorks: the online institutional repository for PhD and Masters by research thesis material
  – records, stores and disseminates digital preservation information
  – integrated with the UNSW Research Output System (ROS, based on Symplectic Elements)
ResData: research data management, planning and publishing service
  – integrated with the UNSW Data Archive and other enterprise systems

Fedora 3 repositories at UNSW Library
Faculty-based repository services
  – based on a standard, extensible framework
  – customised to support specific requirements of individual disciplines
  – enable discovery, accessibility and citation of resources
  – Example: Faculty of Arts and Social Science repository

UNSW Library Fedora 3-to-4 Migration Pilot
Goal:
  – formulate a strategy for upgrading the Library’s existing Fedora 3-based repositories
Criteria:
  – compatibility with existing institutional data models
  – interoperability with related repository applications and workflows
Use Cases/Test beds: ResData and UNSWorks
Timeline: Jan-May 2015

Migration Pilot Approach
Use cases
  – Defined migration use cases based on ResData and UNSWorks
Fedora 4 test repository
  – Deployed a test Fedora 4 instance
Fedora 4 features evaluation
  – REST APIs, versioning of records, integration with external triple stores and plug-ins, including OAI-PMH and Audit service
  – Comparison with Fedora 3 functions

Migration Pilot Approach
Fedora 4 data model design
  – Analysed default Fedora 4 data model and PCDM
  – Mapped Fedora 3 object and datastream properties to Fedora 4
Implementation strategy formulation
  – Formulated a strategy for implementing a client to the Fedora 4 REST API, based on the Fedora 4 data model design and the results of the Fedora 4 features evaluation
Manual migration of test records
  – Manually migrated a subset of ResData records to the test Fedora 4 instance as a proof-of-concept
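The property-mapping step can be illustrated with a minimal Python sketch. It reads the object properties and datastream IDs from a Fedora 3 FOXML export and renames them for use in Fedora 4. The target names in `PROPERTY_MAP` and the sample record are hypothetical; the slides do not give the actual UNSW mapping.

```python
import xml.etree.ElementTree as ET

FOXML_NS = {"foxml": "info:fedora/fedora-system:def/foxml#"}

# Hypothetical target property names, chosen for illustration only.
PROPERTY_MAP = {
    "info:fedora/fedora-system:def/model#createdDate": "legacyCreatedDate",
    "info:fedora/fedora-system:def/model#label": "title",
    "info:fedora/fedora-system:def/model#state": "legacyState",
}


def map_object_properties(foxml_text):
    """Return (mapped properties, datastream IDs) for one FOXML document."""
    root = ET.fromstring(foxml_text)
    # The Fedora 3 PID is an attribute of the root digitalObject element.
    mapped = {"legacyPid": root.get("PID")}
    for prop in root.findall("foxml:objectProperties/foxml:property", FOXML_NS):
        target = PROPERTY_MAP.get(prop.get("NAME"))
        if target:
            mapped[target] = prop.get("VALUE")
    datastreams = [ds.get("ID") for ds in root.findall("foxml:datastream", FOXML_NS)]
    return mapped, datastreams


# Made-up sample record, not taken from a UNSW repository.
SAMPLE_FOXML = """<foxml:digitalObject xmlns:foxml="info:fedora/fedora-system:def/foxml#" VERSION="1.1" PID="demo:1">
  <foxml:objectProperties>
    <foxml:property NAME="info:fedora/fedora-system:def/model#state" VALUE="Active"/>
    <foxml:property NAME="info:fedora/fedora-system:def/model#label" VALUE="Sample thesis"/>
    <foxml:property NAME="info:fedora/fedora-system:def/model#createdDate" VALUE="2012-03-01T10:15:00.000Z"/>
  </foxml:objectProperties>
  <foxml:datastream ID="MODS" STATE="A" CONTROL_GROUP="X" VERSIONABLE="true"/>
  <foxml:datastream ID="RELS-EXT" STATE="A" CONTROL_GROUP="X" VERSIONABLE="true"/>
</foxml:digitalObject>"""

if __name__ == "__main__":
    print(map_object_properties(SAMPLE_FOXML))
```

Keeping the mapping in a plain dictionary makes it easy to review against the institutional data model before any record is migrated.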

Use Case 1: UNSWorks System Architecture

Use Case 1: UNSWorks Fedora Object Model - Datastreams
[Diagram labels, in source order:]
– Metadata (MODS – XML)
– Thesis file (PDF, DOC)
– Preservation Metadata (PREMIS – RDF)
– Supporting docs/Rights/licence (TXT, DOC)
– RELS-EXT (Handle)
– Preservation Metadata (PREMIS – RDF)
– Preservation Metadata (PREMIS – RDF)
– RELS-INT (Resource type, Preservation software)
– EVENTS (PREMIS – RDF)
– Thesis file (PDF, DOC)

Use Case 2: ResData System Architecture
[Diagram components: Deposit/Edit, Fedora, UNSW HR/Grant Database, Harvesting Service (JOAI), MySQL 5.5, Storage Provisioning Service, UNSW Data Archive]

Use Case 2: ResData Fedora Object Model - Datastreams
– Dataset (RDF): RELS-INT (DOI, Handle, versioning), RELS-EXT (Resource type)
– Activity/project (RDF): RELS-INT (DOI, Handle, versioning), RELS-EXT (Resource type)
– Person (RDF): RELS-INT (DOI, Handle, versioning), RELS-EXT (Resource type)
– RDMP (RDF): RELS-EXT (Resource type, storage info)
[Relationship cardinalities shown in the diagram: 1, *, *, 1]

Fedora 4 Data Model – the default LDP model
– Default Fedora 4 data/content model is aligned with the Linked Data Platform 1.0
Source:

Fedora 4 Data Model – PCDM adaptation
Source:

Fedora 4 Data Model for UNSWorks

Fedora 4 Data Model for ResData

Fedora 4 Data Model Design – key considerations
Adaptation of PCDM
  – PCDM hierarchical model is similar to the UNSWorks model
  – Additional granularity needed to
      o record preservation and migration events
      o manage access-related information at both object and collection levels
      o ensure interoperability with ResData, which does not conform to a hierarchical organisation
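As a rough illustration of the PCDM hierarchy that the UNSWorks model resembles, the sketch below emits N-Triples for a collection, one thesis object and its files. The PCDM terms (`Collection`, `Object`, `File`, `hasMember`, `hasFile`) come from the published PCDM vocabulary; the base URL, container names and the `files/` path segment are assumptions made for this example.

```python
PCDM = "http://pcdm.org/models#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def triple(subject, predicate, obj):
    """Format one N-Triples statement whose three terms are all IRIs."""
    return "<%s> <%s> <%s> ." % (subject, predicate, obj)


def thesis_triples(base, collection, thesis, files):
    """Describe collection -> thesis object -> files using PCDM terms."""
    coll_uri = "%s/%s" % (base, collection)
    obj_uri = "%s/%s" % (coll_uri, thesis)
    lines = [
        triple(coll_uri, RDF_TYPE, PCDM + "Collection"),
        triple(obj_uri, RDF_TYPE, PCDM + "Object"),
        triple(coll_uri, PCDM + "hasMember", obj_uri),
    ]
    for name in files:
        # The "files" path segment is an assumption, not part of PCDM itself.
        file_uri = "%s/files/%s" % (obj_uri, name)
        lines.append(triple(file_uri, RDF_TYPE, PCDM + "File"))
        lines.append(triple(obj_uri, PCDM + "hasFile", file_uri))
    return lines


if __name__ == "__main__":
    for line in thesis_triples(
        "http://localhost:8080/rest", "unsworks", "thesis-1", ["thesis.pdf"]
    ):
        print(line)
```

The additional granularity listed above (preservation events, access information) would need further properties beyond these core PCDM relations.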

Fedora 4 Data Model Design – key considerations
Identifiers and URL structures
  – Built-in PairTree algorithm for generating unique identifiers and limiting the number of children under a single resource
  – Legacy Fedora 3 PIDs as “data properties” of migrated resources
  – Cool URIs with embedded semantic information
  – Example: /rest/[container name]/[container PairTree id]/[resource id]
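The URL pattern in the example can be sketched as follows. This is an illustration of the PairTree idea only, not Fedora's own minter: the number of segments (4), their width (2 characters) and the container name are assumptions made for this example.

```python
def pairtree_segments(identifier, count=4, width=2):
    """Split the leading characters of an identifier into fixed-width segments."""
    needed = count * width
    if len(identifier) < needed:
        raise ValueError("identifier must have at least %d characters" % needed)
    return [identifier[i * width:(i + 1) * width] for i in range(count)]


def resource_path(container, identifier, base="/rest"):
    """Build /rest/[container]/[PairTree segments]/[resource id]."""
    parts = [base, container] + pairtree_segments(identifier) + [identifier]
    return "/".join(parts)


if __name__ == "__main__":
    print(resource_path("unsworks", "7f3a9c21-5b6e-4d0a-9c1f-2e8b7a6d5c4f"))
```

Because each intermediate segment can take only a bounded number of values, no single container accumulates an unbounded number of direct children.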

Fedora 4 Data Model Design – key considerations
Audit history and versioning
  – Legacy Fedora 3 FOXML to be stored as a binary resource in Fedora 4
  – Fedora 4 Audit Service to be used to record post-migration audit information
  – Legacy creation dates for Fedora 3 objects cannot be migrated; custom properties to be used instead
  – Fedora 4 versioning to be used to record Fedora 3 versions
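One way to carry the legacy PID and creation date as custom properties is a SPARQL Update body sent to the migrated resource. The sketch below only builds that body as a string. The `mig:` namespace and both property names are hypothetical; the slides do not name the properties UNSW used.

```python
# Hypothetical namespace for migration-specific properties.
MIGRATION_NS = "http://example.org/ns/migration#"


def escape_literal(value):
    """Escape backslashes and double quotes for a SPARQL string literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"')


def legacy_properties_update(pid, created):
    """Build a SPARQL Update that records the Fedora 3 PID and creation date.

    The empty IRI <> refers to the resource the update is sent to.
    """
    return (
        "PREFIX mig: <%s>\n" % MIGRATION_NS
        + "PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>\n"
        + "INSERT {\n"
        + '  <> mig:legacyPid "%s" ;\n' % escape_literal(pid)
        + '     mig:legacyCreatedDate "%s"^^xsd:dateTime .\n' % escape_literal(created)
        + "}\n"
        + "WHERE { }"
    )


if __name__ == "__main__":
    print(legacy_properties_update("unsworks:1234", "2012-03-01T10:15:00.000Z"))
```

Storing the date as a typed `xsd:dateTime` literal keeps it queryable in an external triplestore, while the repository-managed creation date continues to reflect the time of migration.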

Fedora 3-to-4 Migration – Implementation Strategy
– Fedora 4 to be used as “headless” repository instances
– Fedora 4 REST API to be used by custom UIs and clients to manage CRUD of digital objects
– Fedora 4 to be integrated with an external triplestore to enable access control via custom UIs and clients
– Update/re-factor existing Java-based Fedora 3 clients to support Fedora 4
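A client for a "headless" repository mostly reduces to a handful of HTTP requests. The Python sketch below builds (but does not send) one request per CRUD operation using only the standard library. The base URL is an assumed local test instance, and the function names are illustrative; they do not reflect the Library's Java clients.

```python
from urllib.request import Request

BASE_URL = "http://localhost:8080/rest"  # assumed test instance, not from the slides


def create_resource(path, turtle):
    """PUT an RDF description to create a container at a chosen path."""
    return Request(
        "%s/%s" % (BASE_URL, path),
        data=turtle.encode("utf-8"),
        headers={"Content-Type": "text/turtle"},
        method="PUT",
    )


def read_resource(path):
    """GET the RDF description of a resource as Turtle."""
    return Request(
        "%s/%s" % (BASE_URL, path), headers={"Accept": "text/turtle"}, method="GET"
    )


def update_resource(path, sparql_update):
    """PATCH a resource's properties with a SPARQL Update body."""
    return Request(
        "%s/%s" % (BASE_URL, path),
        data=sparql_update.encode("utf-8"),
        headers={"Content-Type": "application/sparql-update"},
        method="PATCH",
    )


def delete_resource(path):
    """DELETE a resource."""
    return Request("%s/%s" % (BASE_URL, path), method="DELETE")


if __name__ == "__main__":
    req = create_resource("resdata/dataset-1", "<> <http://purl.org/dc/terms/title> 'Demo' .")
    print(req.get_method(), req.full_url)
```

Each returned `Request` can be passed to `urllib.request.urlopen` against a running instance; keeping construction separate from sending makes the client easy to unit-test without a repository.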

Lessons learned
Review of the existing institutional information models has identified a need for
  – better standardisation of existing RDF ontologies
  – migration of existing XML schemas to RDF ontologies to ensure more efficient interoperability between repositories

Future plans
– Investigate access control-related ontologies, such as WebACL, to enable standards-based access control of Fedora 4 objects
– Evaluate existing Open Source tools for Fedora 3-to-4 migrations
– Enhance/standardise UNSW Library ontologies
– Continue to be a platinum member of the Fedora community

Useful links
– Fedora 4 Upgration Pilot – UNSW: +-+UNSW
– UNSWorks: eb/action/search.do?vid=UNSWORKS&reset_config=true
– ResData: s