Update on the Fedora Project Common Solutions Group September 2005 Tim Sigmon University of Virginia Special thanks to the Fedora Team for these slides!

Slides:



Advertisements
Similar presentations
Richard Jones, Systems Developer Technical Issues for Repository Software Theses Alive! Edinburgh University Library SHERPA Nottingham.
Advertisements

Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew.
Business Development Suit Presented by Thomas Mathews.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
A. Grigorov, A. Georgiev, M. Petrov, S. Varbanov, K. Stefanov Building a Knowledge Repository for Life-long Competence Development.
The Fedora Project JA-SIG Winter Conference December 9, 2003 Tim Sigmon University of Virginia.
Depositing e-material to The National Library of Sweden.
ARROW Progress Report to CAUL September 2004 Geoff Payne, ARROW Project Manager.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
MIT’s DSpace A good fit for ETDs Margret Branschofsky Keith Glavash MIT LIBRARIES.
Fedora Commons: Introduction and Update Swedish National Library June 24, 2008.
Fedora New Features, New Collaborations, Bright Future Fedora Users Conference Copenhagen, Denmark September 28, 2005 Sandy Payette Co-Director Fedora.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science.
Update on the Fedora Project Where we’ve been and where we’re going Fedora Users Conference Rutgers University May Sandy Payette Co-Director.
Representing and Storing Complex Digital Objects Fedora CS 431 – April 11, 2005 Carl Lagoze – Cornell University Acknowledgements: Sandy Payette (Cornell)
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil Sandy Payette Cornell Information Science.
The KnowledgeBank: Powered by DSpace Laura Tull Systems Librarian Ohio State University Libraries WiLSWorld July 27, 2004.
The Fedora Project Where we’ve been and where we’re going Mellon OS Retreat March 2005 Sandy Payette Cornell University.
Introducing Symposia : “ The digital repository that thinks like a librarian”
The Fedora Project Update as of January 2004 Ithaca, NY January 29, 2004 Sandy Payette Cornell Information Science.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Demonstration of repositories Fedora (Flexible Extensible Digital Object Repository Architecture) Marie Lagerwall MIDESS Partners Meeting February 9, 2007.
Digital Object: A Virtual Online Storage Solution 598C Course Project Huajing Li.
Digital Asset Management for All? Visualising a Flexible DAMS Solution for Small and Medium Scale Institutions Paul Bevan Llyfrgell Genedlaethol Cymru.
Digital Library Architecture and Technology
Geoff Payne ARROW Project Manager 1 April Genesis Monash University information management perspective Desire to integrate initiatives such as electronic.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
DSpace. TM 2 Agenda  Introduction to DSpace  DSpace community  Institutional Repository  Easy to add/find content in DSpace  Building Online Communities.
Fedora Commons Overview and Future Plans Sandy Payette, Executive Director Cornell University Library Metadata Working Group June 13, 2008.
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
Fedora Content Models for the National Science Digital Library Data Repository Fedora User’s Group Meeting Copenhagen, September 28, 2005 Carl Lagoze Cornell.
Information Network Overlay Architecture Adding Value to Digital Content Carl Lagoze CS 431 – May 4, 2005 Cornell University.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
DAMS Implementation at NLW DAMS Implementation at NLW 20 th February 2007 Paul Bevan
Use & Access 26 March Use “Proof of Concept” Model for General Libraries & IS faculty Model for General Libraries & IS faculty Test bed for DSpace.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science NOTE: CSG
Web Services and Fedora EDUCAUSE Mid-Atlantic Regional Conference January 14, 2003 Tim Sigmon University of Virginia.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
Digital initiatives Digital Initiatives at the National Library of Wales 19 th April 2007 Paul Bevan
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
ARROW Institutional Repositories for Managing e-Theses Presentation to ETD September 2005 Geoff Payne, ARROW Project Manager.
A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon,
A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon,
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Fedora Content Modeling for Improved Services for Research Databases Open Repositories 2009 Mikael Karstensen Elbæk Alfred Heller Gert Schmeltz Pedersen.
DSpace - Digital Library Software
DSpace System Architecture 11 July 2002 DSpace System Architecture.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
The Mellon-Funded Fedora Project A Presentation to the European Digital Library Conference September 17, 2002 Sandy Payette and Thornton Staples.
Fedora An Architecture for Complex Objects and their Relationships Old Dominion University, VA April 7, 2005 Sandy Payette Cornell University.
Fedora Service Framework Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
CONTENTdm A proven solution September A complete digital collection management software solution Stores, manages and provides access for all digital.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
Fedora Digital Object in a Nutshell Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Building Foundations: Fedora, Fez, and the ADR prepared by Jessica Branco Colati ADR Project Director, Colorado Alliance of Research Libraries
Fedora, Fez, and the ADR an ePoster presented at Institutional Repositories: Disseminating, Promoting, and Preserving Scholarship Utah State University.
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil
Introduction, Features & Technology
Overview: Fedora Architecture and Software Features
VI-SEEM Data Repository
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
An Architecture for Complex Objects and their Relationships
VI-SEEM Data Repository
NSDL Data Repository (NDR)
The Fedora Project April 28-29, 2003 CNI, Washington DC
Presentation transcript:

Update on the Fedora Project Common Solutions Group September 2005 Tim Sigmon University of Virginia Special thanks to the Fedora Team for these slides!

Fedora Development Team Cornell University Sandy Payette (co-director) Chris Wilper Carl Lagoze Eddie Shin University of Virginia Thorny Staples (co-director) Ross Wayland Ronda Grizzle Bill Niebel Bob Haschart Tim Sigmon

Fedora Brief History Cornell Research (1997-present) –DARPA and NSF-funded research –First reference implementation developed –Interoperable Repositories (experiments with CNRI) –Policy Enforcement First Application ( ) –University of Virginia digital library prototype –Technical implementation: adapted to web; RDBMS storage –Scale/stress testing for 10,000,000 objects Open Source Software (2002-present) –Andrew W. Mellon Foundation grants –Technical implementation: XML and web services –Fedora 1.0 (May 2003) –Fedora 2.0 (Jan 2005) –Fedora 2.1 (coming soon!)

Known Use Cases for “Fedora Inside” Digital Library Collections Institutional Repository Educational Software Information Network Overlay Digital Archives and Records Management Digital Asset Management File Cabinet / Document Management Scholarly publishing

Fedora Repository 2.x Web Services Exposure

The Fedora Digital Object Model Persistent ID (PID ) Default Disseminator SystemMetadata Datastream (item) Digital object identifier Service Perspective: methods for disseminating “views” of content Internal: key metadata necessary to manage the object Item Perspective: Set of content or metadata items Datastream (item) Your Extension

Fedora Repository 2.x Web Services Exposure

Fedora – what’s new (version 2.0) FOXML (Fedora Object XML) –Simple XML format directly expresses Fedora object model –Easily adapts to Fedora new and planned features –Easily translated to other well-known formats Enhanced Ingest/Export of objects –FOXML, METS (Fedora extension) –Extensible to accommodate new XML formats –Planned: METS 1.4, MPEG21 DIDL

Fedora 2.0 (continued) Object-to-object Relationships –Ontology of common relationships (RDF schema) –Relationships stored in special datastream (RELS-EXT) Resource Index (RI) –RDF-based index of repository (Kowari triple-store) –Graph-based index includes: –Object properties and Dublin Core –Object Relationships –Object Disseminations RI Search (Search the repository as a graph) –Powerful querying of graph of inter-related objects –REST-based query interface (using RDQL or ITQL) –Results in different formats (triples, tuples, sparql)

Fedora 2.0 (continued) New Utilities Batch Modify Utility Repository Administrator Reporting Performance Tuning (~1 million objects) –Ingest testing (800K objects; 40 millisec/object) –Concurrency testing (access requests) Communications and Outreach New Fedora Web Site Improved Documentation Tutorials

Preview Fedora 2.1 (Sept. 2005) ECL license Support for SSL Authentication plug-ins –Tomcat realms and login modules Plug-in #1 : Tomcat user/password file or database Plug-in #2 : LDAP Plug-in #3 : Radius Authentication Authorization module –XML-based policies using XACML –Repository-wide policies –Object-specific policies –Fine-grained policy enforcement API actions X subject attributes X object attributes

Authorization: Example Policies Repository Policy –Allow access to all API-M methods to “administrator” –Allow access to the deleteDatastream method to “author” Specific Object Policy –Allow access to object “uva:100” if user is Thorny. Group Object Policy –Allow access to the “getFullArticle” dissemination of objects whose content model is “journal-article” if “faculty” –Allow access to the “secret” datastream if user is not “guest”

Authorization: Example Policies Time-oriented Policy –Permit students access to “answers” datastream of learning object cs:125 after May 15, 2005 –Deny all access to learning object cs:125 after June 15, 2005 Backend Service Security Policy –Deny callback by external service represented by Bmech:10

Preview of Fedora 2.1 (Sept. 2005) Enhanced OAI Provider Service (prOAI) –Harvest multiple metadata formats –Harvest datastreams and disseminations –Support for incremental harvest by modified date –Support for OAI sets –Highly configurable via queries against Resource Index Directory Ingest Service (and client tool) –Facilitate ingest of hierarchical directories of files –Submit files as.zip or.jar (with a METS manifest) –Automatically asserts parent-child relationships in RELS-EXT –Stages content and ingests as FOXML objects into repository Policy Builder Client –Simple user interface to create access policies –Automatically generate XACML Handle Generation Plug-in (PIDs as Handles)

Fedora Service Framework (beginning Fedora 2.1)

Fedora Service Framework ( )

Fedora Service Framework ( )

Fedora Web-based IR Client Web-based client for “institutional repository” Configurable End-user submission Object creation template for “content models” Basic Workflow Search/Browse Easy configuration of access policy Development to begin this fall.

More Dev-Team Priorities Federated Repositories Federation with other repositories (DSpace, aDORE) –note the Cornell/LANL NSF Pathways project. Fedora Showcase and News (on new website) “Content Model” Specification Language Advanced Object Creation Workbenches Tools for RDF browse and graph traversal Performance Tune – millions of objects Web services security and Shibboleth Code Refactoring Fedora as web app (.war) MVC2 pattern for REST-based web exposures Other misc.

VTLS Commercial support and value-add development –Similar to RedHat and Linux –Installation, training, support, hosting, etc. VITAL product (based on Fedora) contains VTLS- developed work flow extensions, management utilities, and enhanced searching capabilities In partnership with the ARROW project, VTLS is developing and contributing back to open source, e.g., –Handles integration –SRU/SRW interface to expose Fedora content –Metadata extraction and content validation via JHOVE –Automatic capture of technical metadata from images –Facilitate content exposure to web crawlers –Creating custom content models “Fedora inside”

ARROW Australian Research Repositories Online to the World Intention of the project is to achieve wider access to Australian research by making it available on-line with appropriate discovery facilities “ identify and test software or solutions to support best practice institutional digital repositories comprising e-prints, digital theses and electronic publishing” Selected Fedora and VITAL as its core repository solutions “Fedora inside”

NSDL National Science Digital Library –Mission: improve Science, Math, and Engineering education through digital libraries First implementation was a metadata repository using Oracle dbms to hold information about collections and items. Next generation NSDL will use Fedora to add value to digital content... don't just provide access. Create an information network overlay that supports: –Rich and dynamic information objects –Information reuse/refactoring –Graph-based information model (ontology-based relationships) –Fine-grained access management “Fedora inside”

AGU Digital Archive American Geophysical Union (a publisher) is developing a system for long-term preservation (20, 50, even 100 yrs) –Read the files –Understand the structure of the files –Ensure authentic copy of the work Selected Fedora based on an extensive list of requirements Emphasis on having good metadata –Descriptive (author, title, volume,...) –Technical (formats, versions,...) –Administrative (rights, events, audits,...) “Fedora inside”

Preservation of University Records Tufts and Yale received a grant from National Historical Publications and Records Commission (NHPRC) –To synthesize electronic records preservation research with digital library repository research in an effort to develop systems capable of preserving university electronic records at both institutions –To test the potential of Fedora to serve as the architecture for an electronic records preservation system. “Fedora inside”