SWORD where we are and how we got here CRIG Unconference Birkbeck College, London Thurs 6 th Dec 2007 Julie Allinson UKOLN, University of Bath Image: www.flickr.com/photos/venegas/8519843/

Slides:



Advertisements
Similar presentations
Creating Institutional Repositories Stephen Pinfield.
Advertisements

A centre of expertise in digital information management UKOLN is supported by: SWORD: An Overview 2 nd June 2009 Web Service.
UKOLN is supported by: Digital Repositories Roadmap: looking forward The JISC/CNI Meeting, July 2006 Rachel Heery Assistant Director R&D, UKOLN
A centre of expertise in digital information management Developing a Quality Culture For Digital Library Programmes Author & Presenter Brian Kelly UKOLN.
UKOLN is supported by: Put functionality Augmenting interoperability across scholarly repositories 20/21 April 2006 Rachel Heery, UKOLN, University of.
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
A centre of expertise in digital information managementwww.ukoln.ac.uk Shared Infrastructure Services Review Rosemary Russell UKOLN University of Bath.
Linking Repositories Scoping Study Key Perspectives Ltd University of Hull SHERPA University of Southampton.
A centre of expertise in digital information management UKOLN is supported by: An introduction to … Repository reference models CETIS Metadata.
Collection-level description & the Information Landscape: users evaluate strategies for resource discovery Collection Description Focus Workshop 5 Cambridge,
SWORD Simple Web-service Offering Repository Deposit JISC CETIS SIG meeting University of Strathclyde, Glasgow Friday 29 th June 2007 Julie Allinson UKOLN,
Information Professionals and Learning Object Repositories … more than just metadata quality … Sarah Currier Stòr Cùram Project Librarian JISC X4L Repository.
Update on the SWORD Protocol & Future Directions.
Interoperability and Preservation with the Hub and Spoke (HandS) Matt Cordial, Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign.
The DSpace Course Module – SWORD basics. Module objectives  By the end of this module you will:  Understand what SWORD is  Know what SWORD could be.
UKOLN is supported by: The Cutting Edge of SWORD 18 th May 2009 OR09, Atlanta, GA Adrian Stevenson and Julie Allinson SWORD Project Managers.
Simple Web service Offering Repository Deposit (SWORD)‏ Project kick-off meeting Birkbeck College, London, 30 th April 2007 Julie Allinson, UKOLN, University.
Julie Allinson Open Repositories 2008 Southampton 1st April 2008 Simple Web-service Offering Repository Deposit SWORD.
Interoperability and Preservation with the Hub and Spoke (HandS) Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign
Your Name AutoArchive?:The ADS and the SWORDARM project Catherine Hardman - Archaeology Data Service University of York White Rose/RoaDMap 24 th May 2012.
Jisc initiatives to support REF 13/11/2014 Sarah Fahmy, OA Good Practice Manager.
Administration & Workflow
Institutional repositories a bluffer’s guide. Academic libraries and archives  Cataloguing –Computerised catalogue databases (e.g. OPACS) –Networked.
UKOLN is supported by: OAI-ORE a perspective on compound information objects ( Defining Image Access.
SOAPI: a flexible toolkit for implementing ingest and preservation workflows Mark Hedges Centre for e-Research, King’s College London Arts and Humanities.
Repository Deposit Service Description OR 2007 : the 2nd International Conference on Open Repositories San Antonio, Texas, USA, Jan 2007 Presenter:
A centre of expertise in digital information management UKOLN is supported by: Eprints Application Profile UK Repositories Search Project.
UKOLN is supported by: A non-technical introduction to: OAI-ORE ( Defining Image Access project meeting.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Dspace – Digital Repository Dawn Petherick, University Web Services Team Manager Information Services, University of Birmingham MIDESS Dissemination.
1 UKOLN is supported by: SWORD Simple Web-service Offering Repository Deposit Defining Image Access final.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Supporting Federally Funded Research Requirements with DSpace and SWORD 10 th International Conference on Open Repositories Hui Zhang, Michael Boock Oregon.
UKOLN is supported by: Current JISC initiatives for Repositories Exchange of Experience on Institutional Repositories 17 th May 2007, Liverpool Julie Allinson.
Long-term Archive Service Requirements draft-ietf-ltans-reqs-00.txt.
Digital Asset Management for All? Visualising a Flexible DAMS Solution for Small and Medium Scale Institutions Paul Bevan Llyfrgell Genedlaethol Cymru.
Towards smart storage for repository preservation services Steve Hitchcock, David Tarrant, Adrian Brown 1, Ben O’Steen 2, Neil Jefferies 2 and Leslie Carr.
SWORD Stories - Easy Deposit Cutting Through Repositories’ Red Tape Sarah Currier Consultancy | E-Learning * Resource Sharing * Web 2.0 * Metadata * Repositories.
“Filling the digital preservation gap” an update from the Jisc Research Data Spring project at York and Hull Jenny Mitcham Digital Archivist Borthwick.
Update on the VERSIONS Project for SHERPA-LEAP SHERPA Liaison Meeting UCL, 29 March 2006.
STANDARDS AND INTEROPERABILITY; RIGHTS ISSUES Status and summary 1.
Tablet PC Capstone CSE 481b Richard Anderson Craig Prince.
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager
Supporting further and higher education The UK FAIR Programme: OAI in context Chris Awre OAI3, CERN, February 2004.
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
A disaggregated model for preservation of E-Prints Gareth Knight SHERPA DP Project Arts and Humanities Data Service.
JISC / CETIS Conference, Repositories Strand, Edinburgh, November 2005 Repositories…. …and the known unknowns.
Hydra Europe Symposium | April 2015 | 1 Hydra and open access Chris Awre Hydra Europe Symposium London School of Economics, 24 th April 2015.
EBank UK: linking scientific data, scholarly communication and learning Michael Day and Rachel Heery UKOLN, University of Bath
The Agora hybrid library project Rosemary Russell, UKOLN (UK Office for Library and Information Networking) Agora Communications Coordinator.
A centre of expertise in digital information management RDN, e-Prints UK and NOF- Digitise: a (very) small sample of UK OAI activity Andy.
Easy Desktop Deposit for intraLibrary: An Implementation of SWORD Sarah Currier Product Manager Intrallect Ltd Presentation to.
Supporting Further and Higher Education Collection description as Middleware The Information Environment Service Registry (IESR) Rachel Bruce, Information.
Joint Information Systems Committee Supporting Higher and Further Education Catherine Grout Assistant Director for Development, JISC/DNER
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Interoperability and Collection of Preservation Metadata for Digital Repository Content Matt Cordial, Tom Habing, Bill Ingram, Robert Manaster University.
A centre of expertise in digital information management UKOLN is supported by: Functional Requirements Eprints Application Profile Working.
EXPLORER project Elizabeth Lunt Project Manager De Montfort University.
Sharing OERs via Jorum Siobhán Burke and Sarah Currier 12 th December 2012.
SWORD Simple Web-service Offering Repository Deposit Julie Allinson 25th March 2009, British Library.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
SWORD Simple Web-service Offering Repository Deposit By Aparna R. Belhe Archana Galipalli.
Functional Object Re-use and Exchange:
Hydra, research data and Archivematica
Jisc Research Data Shared Service (RDSS)
Policy Frameworks: building a firm foundation for your IR
QoS Metadata Status 106th OGC Technical Committee Orléans, France
Presentation transcript:

SWORD where we are and how we got here CRIG Unconference Birkbeck College, London Thurs 6 th Dec 2007 Julie Allinson UKOLN, University of Bath Image:

SWORD : 2 SWORD – beginning at the end Why? In the spirit of reverse engineering (all will become clear!) Order: –Now and next –Technical outputs and proof-of-concept –Protocol and profile –Candidate specifications –Defining the service –Requirements and parameters –Why? SWORD scenarios –The motivation for a project –Before SWORD - background and context

… - December 2007 now and next

SWORD : 4 Now and next : testing, community acceptance and dissemination Case studies documenting implementation experience Implementations of the SWORD interface forthcoming in arXiv, White Rose Research Online (EPrints) and Jorum SPECTRa deposit client for chemical data Article in Ariadne Presentation at Open Repositories 2008 (hopefully) Service expression for the eFramework – in development Open workshop at CRIG

March - October 2007 SWORD

SWORD : 6 SWORD – what have we produced, technically? A Simple Web-service offering repository deposit, which comprises a standard deposit interface implemented in Fedora, DSpace, EPrints and IntraLibrary – this takes deposits –code released on sourceforge and via EPrints a desktop deposit client – this does the deposit –code released on sourceforge a web-client as an alternative deposit service – in the end, what we’ve produced isn’t very complicated at all … which has been the spirit of SWORD all along, simple, lightweight and fit-for-purpose

SWORD : 7

SWORD : 8 SWORD : a standard for deposit So, how do we do this? What makes this a standard interface? –a standard!

SWORD : 9 Reviewing existing standards WebDAV ( JSR 170 ( JSR 283 ( SRW Update ( Flickr Deposit API ( Fedora Deposit API ( OKI OSID ( ECL ( ATOM Publishing Protocol ( charter.html)‏ charter.html ATOM Publishing Protocol ( charter.html)‏ charter.html

SWORD : 10 Atom Publishing Protocol “the Atom Publishing Protocol is an application-level protocol for publishing and editing Web resources” benefits of using the Atom Publishing Protocol –supports many of our parameters and requirements, in particular file deposit –it already exists and has an active development community –support is growing –it is well-used in popular applications –it has an extension mechanism –Google have created their own profile (gdata) drawbacks / risks –this isn’t what it was designed for – are we attempting to fit our square requirements into round holes? –without significant ‘interpretation’, it is only possible to deposit a single package/file OR an atom document – this means that we need to package up metadata and files

SWORD : 11 APP and SWORD parameters Mandatory (level 0 compliance) deposit any type of content repository or collection id Identifier deposit status (accepted, rejected, error), error codes, error description compliance level Optional (mandatory for level 1 compliance) mediated deposit support on-behalf-of target user repository / collection name collection policy, description accepted formats format namespace source repository checksum additional identifiers treatment description

SWORD : 12 APP and SWORD parameters Mandatory (level 0 compliance) deposit any type of content – APP yes repository or collection id – APP yes identifier – APP yes deposit status (accepted, rejected, error), error codes, error description – APP yes (and extension - optional) compliance level - extension deposit id target collection Optional (mandatory for level 1 compliance) extension - mediated deposit support extension - on-behalf-of target user APP yes - repository / collection name extension - collection policy, description APP yes - accepted formats extension - format namespace APP yes - source repository extension - checksum APP yes - additional identifiers extension - treatment description

SWORD : 13 A bit more detail about APP and the SWORD profile APP works by issuing HTTP requests (GET, POST) HTTP response and ATOM document is returned HTTP header extensions –Content-MD5 –Content-Disposition –X-On-Behalf-Of –X-Deposit-ID –X-Error-Code –X-Format-Namespace APP / ATOM extensions – – and - for developers

SWORD : 14 Example : service document

SWORD : 15 Example – deposit with HTTP POST

SWORD : 16 Example : response

March January 2007 the funding and project start

SWORD : 18 SWORD – aims and objectives aims to: –improve efficiency of the repository ‘Ingest’ function –improve options for populating (multiple) repositories with content –support common deposit interfaces –achieve greater repository interoperability through: –a standard specification for depositing content in repositories –implemented and tested (and refined) in EPrints, DSpace, Fedora and IntraLibrary, –and a prototype ‘smart deposit’ tool at all times being cognisant of UK requirements (as defined by the JISC Common Repository Interfaces Group – CRIG) and International work in this area (including the OAI-ORE activity)‏

SWORD : 19 Scope and assumptions authorisation and authentication –too big an issue for SWORD –APP mandates minimum HTTP authentication –other authentication mechanisms are left to the implementer deposit and ingest –we assume that metadata already exists –identifiers are an issue – to what extent should pre-existing identifiers and those created during the deposit be maintained –how far does the deposit service need to validate the deposit data integrity and provenance –no requirement to get back the exact package deposited? data types, metadata formats and content packages –how far should the deposit service check its ability to accept what is being deposited? –is there any value in being able to deposit a package format that the accepting repository doesn’t support? SWORD is testing a minimal set of packaging formats, mappings and translations – out of scope? Extracting metadata from packages – a post deposit issue? –demonstrating ‘true’ interoperability

SWORD : 20 The acronym and the team SWORD: –Simple –Web-service –Offering –Repository –Deposit SWORD partners –UKOLN, University of Bath –University of Southampton (EPrints) –University of Aberystwyth (DSpace, Fedora, reference client) –Intrallect (IntraLibrary) 6 months, started March 2007 building on the Deposit API work

January March 2006 the foundations - deposit api

SWORD : 22 Broadly motivated by … in general, developers are not creating repository systems and software from scratch repositories must interface with … –each other –users –other applications within the institution –services in the wider information landscape –(such as VLEs, authoring tools, packaging tools, name authority services, classification services and research systems) there is no common deposit API or protocol (OAI-PMH has made metadata interoperability an imperfect reality)

SWORD : 23 Pain points - what can’t we do? no standard interface for tagging, packaging or authoring tools to upload catalogued objects into a repository no standard interface for transferring digital objects between repositories no way of initiating a contribution workflow from outside a repository system no way of including deposit into a repository a part of service orientated architecture

SWORD : 24 From pain to possibility - some scenarios 1.Author deposits using a desktop authoring system to a mediated multiple deposit service 2.A user submits an IMS-compliant learning object to a National Repository using a client application 3.Deposit into multiple repositories 4.Transfer between intermediate hosts 5.Repositories share improved metadata 6.Experimental data output from spectrometer is 'saved as' a file and a file containing metadata on operational parameters is also generated. A data capture service is invoked and the files pertaining to the experiment are deposited, along with the necessary metadata, in the laboratory repository. See more at

SWORD : 25 Scenario 1 : Author deposits using a desktop authoring system to a mediated multiple deposit service Librarian L completes the deposit through the repository interface id Librarian L invokes deposit of a surrogate into arxiv.org Deposit id Author A deposits via an easy-deposit desktop application into the institutional repository's mediated deposit queue A lightweight deposit web service can facilitate this transfer of object(s)

SWORD : 26 Scenario 3 : Deposit in multiple repositories Deposit The depositor can choose one or more repositories to deposit into A lightweight deposit web service can facilitate this transfer of object(s) A depositor is required to submit to a Research Council repository, but they also wish to deposit into their institutional repository and a relevant subject repository

SWORD : 27 Scenario 4 : transfer between intermediate hosts A lightweight deposit web service can facilitate this transfer of object(s) id Deposit A repository may transfer objects to other repositories, or services, e.g. a preservation service id Deposit Subsequent repositories may also transfer objects id

SWORD : 28 Scenario 6 : laboratory auto-deposit Deposit Experimental data output from laboratory machines is deposited, along with the necessary metadata, in the laboratory repository in an automated process A lightweight deposit web service can facilitate this transfer of object(s) A metadata record is also deposited into the Institutional Repository

SWORD : 29 Functional requirements (a select few) support wide range of heterogeneous repositories –scholarly publications, data, learning objects, images, etc. accept submission of different digital object types in consistent way: –data and/or metadata in the form of complex objects or content packages support different workflows for deposit, e.g. –user to multiple repositories via intermediate client –user to repository, repository to additional repositories –user-triggered and machine-triggered deposit accept large-scale (scientific datasets) support collections and changes in policy and permissions support non-instantaneous processes, e.g. deposit pending mediation support more complex, authenticated deposit

SWORD : 30 Defining the deposit service(s) Explain: service offered by a repository, allowing remote users (machines or people) to inspect the repository for policy and/or other data –data in: introspection request (“explain”)‏ –data out: introspection response (“repository policy info”)‏ Deposit: service offered by a repository, allowing remote users (machines or people) to upload data –data in: deposit request with optional parameters (e.g.digital object ‘semantics’, metadata formats..) –data out: status (success, failure, pending), receipt confirmation and identifier

SWORD : 31 Future proofing with layers layered approach two levels of compliance –Level 0 compliance requires a set of mandatory elements and a set of optional elements –Level 1 offers a set of additional elements for richer functionality mandatory at level 1

SWORD : 32 Deposit – turning requirements into parameters Mandatory (level 0 compliance) –deposit any type of content –repository or collection id –identifier –deposit status (accepted, rejected, error), error codes, error description –compliance level Optional (mandatory for level 1 compliance) –mediated deposit –repository / collection name –collection policy, description –accepted formats –format namespace –source repository –checksum –additional identifiers –treatment description

SWORD : 33 Steps towards offering a deposit service explore use cases – Deposit API identify requirements – Deposit API parameters – outlined by Deposit API - refined by SWORD define services: deposit and explain – outlined by Deposit API – refined by SWORD agree layered approach – outlined by Deposit API - refined by SWORD draft XML serialisation to better understand requirements – Deposit API reviewed existing standards – SWORD agree standard – SWORD implement and test - SWORD

November 2005 ending at the beginning

SWORD : 35 Where did it all start partly at the JISC CETIS Conference 2005 and in Andy Powell’s ‘A service-oriented view of the Information Environment’ Taken forward by the Repositories Research Team at UKOLN/CETIS (mainly by Rachel Heery) Under the guise of the Deposit API working group - a small working group of international repository developers brought together to address the requirement for a common deposit standard for populating the growing network of repositories with content within the timescales of JISC repositories programmes before SWORD came into being it’s taken 2 years (more or less) to demonstrate that there can be a standard mechanism to deposit

SWORD : 36 Lessons learnt Ideas spark from small things But without someone to keep them on the rails they can easily be lost Keeping up momentum can be a challenge, but it doesn’t require daily s Money helps to get things done The foundations take time –evaluation and discussion, defining scenarios, reading long specification documents Having a focussed scope and a very clear idea of what the point is, whilst staying aware of the limitations is a good way of getting things done in a short time … as are talented developers and engaged ‘supporters’ Re-use rather than re-invention engages the community Going so far down one road and changing tack isn’t stupid or failure, it’s intelligent and successful … and, of course, a good acronym