Diving into the Technology of Hydramata: A Pluggable, Extensible Hydra Solution for Research Repositories June 13, 2014Open Repositories 20141 Rick Johnson,

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
The Hydra Framework as a Series of Diagrams Naomi Dushay Stanford University Libraries April,
DuraSpace: Digital Information All Ways, Always Pretoria, South Africa May 14 th, 2009.
Collaborating to Manage Research Data Zheng (John) Wang Rick Johnson Hesburgh Libraries University of Notre Dame 12/10/13.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
Ch:8 Design Concepts S.W Design should have following quality attribute: Functionality Usability Reliability Performance Supportability (extensibility,
Advanced Metadata Usage Daan Broeder TLA - MPI for Psycholinguistics / CLARIN Metadata in Context, APA/CLARIN Workshop, September 2010 Nijmegen.
H YDRA : V ENDOR E COSYSTEM Mark Bussey Chief Information Leafblower.
An Introduction June 17, 2013 Open Archival Information System (OAIS)
G ET A HEAD ON Y OUR R EPOSITORY Worldwide Hydra Connect #2 September 30 – October 2, 2014 Cleveland, Ohio #hydraConnect.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College.
One Body, Many More Heads, One Year Later Open Repositories 2012, Edinburgh.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
SUNYConnect Strategic Directions The next five years Carey Hatch SUNY Office of Library and Information Services.
DSpace Rea Devakos and Gabriela Mircea University of Toronto Libraries.
Durable Digital Repositories: The DSpace Project Bill Jordan University Libraries.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
One Body, Many Heads for Repository-Powered Library Applications Chris Awre Head of Information Management Library and Learning Innovation University of.
Incompatible or Interoperable? A METS bridge for a small gap between two digital preservation software packages Lucas Mak Metadata & CatalogLibrarian
Shared IR Project Overview Rick Johnson Lead Project Director Shared IR University of Notre Dame Hydra Connect 2014 January 22, 2014Hydra Connect
US Hydra use overview Hydra Europe Symposium, Trinity College, Dublin, 7 th April 2014 Chris Awre Head of Information Management Library and Learning Innovation.
Architecting an Extensible Digital Repository Anoop Kumar, Ranjani Saigal,Rob Chavez, Nikolai Schwertner Tufts University, Medford, MA.
Ingest and Dissemination with DAITSS Presented by Randy Fischer, Programmer, Florida Center for Library Automation, University of Florida DigCCurr2007.
1Hydra Connect 2: Working Group Framework Empowering the Community through a Framework for Interest Groups and Working Groups Robin Ruggaber University.
Statewide Digitization and the FCLA Digital Archive Priscilla Caplan, Florida Center for Library Automation Statewide Digitization Planners Meeting OCLC,
Adopting Hydra Making the case and getting going Chris Awre Hydra Europe Symposium London School of Economics, 23 rd April 2015.
G ET A HEAD ON Y OUR R EPOSITORY Tom Cramer Chief Technology Strategist Stanford University Libraries.
Hydramata Project Refresh March 10, 2014Hydramata Project Refresh1.
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager
Shared IR Project Overview December 5, 2013Shared IR Kickoff Meeting.
PROJECT HYDRA SNEAK PEAK – ADVANCE SHOWING Brought to you by the Digital Repository Task Force Steve Marine (chair), Ted Baldwin, Dan Gottlieb, Kevin Grace,
Hydra Europe Symposium | April 2015 | 1 Hydra and open access Chris Awre Hydra Europe Symposium London School of Economics, 24 th April 2015.
One Body, Many Heads for Repository-Powered Digital Content Applications Hydra Europe Symposium, Trinity College, Dublin, 7 th April 2014 Chris Awre Head.
Challenges of Digital Media Preservation Karen Cariani, Director Media Library and Archives Dave MacCarn, Chief Technologist.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
One Body, Many Heads for Repository-Powered Library Applications Tom Cramer Chief Technology Strategist Stanford University Libraries CNI * 13 December.
Dermot Frost Digital Repository of Ireland Trinity College Dublin.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Implementing a Data Publishing Service via DSpace Jon W. Dunn, Randall Floyd, Garett Montanez, Kurt Seiffert May 20, 2009.
Hypatia Hydra Platform for Access to Information in Archives DLF Forum * Baltimore * October 31, 2011 Stanford University Bradley Daigle Julie Meloni Tom.
The Seaside Research Portal: A Best of Breed Approach to Digital Exhibits and Collection Management Rick Johnson, Head of Digital Library Services University.
VIVO and Scholarly Repositories: Synergistic Opportunities.
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
ETD2006 Preserving ETDs With D.A.I.T.S.S. FLORIDA CENTER FOR LIBRARY AUTOMATION FC LA PAPER AUTHORS: Chuck Thomas Priscilla.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Digital Repository Service Update ___________________________ Yale University Library Roy Lechich, ILTS Audrey Novak 15 Aug 2007.
G ET A HEAD ON Y OUR R EPOSITORY Tom Cramer Chief Technology Strategist Stanford University Libraries.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Hydra at The Royal Libary Hydra Europe Symposium, Dublin 7. April, 2014 Anders Conrad,
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Durable Digital Repositories: The DSpace Project Bill Jordan University Libraries.
Building Foundations: Fedora, Fez, and the ADR prepared by Jessica Branco Colati ADR Project Director, Colorado Alliance of Research Libraries
Ingest and Dissemination with DAITSS
FLORIDA CENTER FOR LIBRARY AUTOMATION
Steering Group Member, Link Digital
Statewide Digitization and the FCLA Digital Archive
Introduction, Features & Technology
Hydra: a case study Chris Awre
Implementing an Institutional Repository: Part II
Open Archival Information System
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Presentation transcript:

Diving into the Technology of Hydramata: A Pluggable, Extensible Hydra Solution for Research Repositories June 13, 2014Open Repositories Rick Johnson, University of Notre Dame Linda Newman, University of Cincinnati Open Repositories, 2014

Why are We Here Again? June 13, 2014Open Repositories 20142

Where We Started June 13, 2014Open Repositories 20143

New World of Scholarship Multimedia Student Projects Non-text Thesis Big Data Any Format as Data Everything is connected June 13, 2014Open Repositories 20144

Shared Needs … DATA CURATION EXPERTS June 13, 2014Open Repositories Hydramata IndianaNorthwesternDCENotre DameCincinnatiVirginia

Curate Gem Mixed Architecture June 13, 2014Open Repositories 20146

Emerging Goals / Refreshed Vision Each institution’s implementation is part of a larger ecosystem not merely a standalone application, and thus the code should support institutional customization. The core system should be separate from discrete components and provide a reference implementation for each component. Each component can then be extended or exchanged according to the needs of an individual institution. Component customization may also be shared among like-minded institutions because the interface between the components and the core will remain consistent. June 13, 2014Open Repositories 20147

Hydramata A set of separate hydra-based automata, “hydramata”, that can be combined to form a single Hydra Head. Each hydramaton can be mixed and matched with other hydramata to provide wide format support tailored to fit your institutional or digital repository needs. June 13, 2014Open Repositories 20148

Transformation June 13, 2014Open Repositories 20149

June 13, 2014Open Repositories

Problems To Be Addressed ➢ Hard to understand ○ Unclear data flow ○ Hard to assign story points ➢ Hard to maintain ○ Engine design hard to test ○ Slow tests ➢ Hard to configure ○ Configurable only at install time ○ Monolithic; lacking modularity ➢ Ingest unreliable and opaque ○ Fedora/Solr sync problems ○ Orphaned Fedora objects ➢ Not set up for advanced work ○ Role-based Access Control ○ Batch ingest ○ Multi-stage workflow Decreasing velocity Increasing errors Diminishing ROI Preservation failure

Design Approach ➢ Separation of concerns into functional units: ○ Determine their responsibilities ○ Define their interfaces ○ Minimize coupling between functional units ➢ Tie units into a conceptual framework ○ Build out from ingest processing at the core ○ Treat works as arbitrary sets of metadata/content ○ Support interactive and batch processing equally ○ Implement an Asynchronous transaction model

Design Approach – Core IR Users can login to the system using their university credentials We are confident about the accuracy, persistence, security, and recoverability of what users upload Users can upload works via UI or batch Users can assign a license agreement to their works Works can be described, organized, discovered, viewed Works can be shared with (or restricted from) the right people Users can collaborate with others with individual works Users will get metrics about how their works are viewed There will be a variety of roles for people to interact with the IR Contributors names will link to standard identifiers (e.g. ORCID) DOIs can be assigned by users to works

Beyond Core IR Components considered outside of the core: ➢ IIIF ➢ TEI support ➢ Specialized viewers ➢ ORCID plugin (minting ORCIDs from Hydramata) ➢ Minting DOIs from Hydramata ➢ Syndication to external systems (using OAI, ResourceSync or similar): SHARE, ORCID, Google Scholar, DPN, APTrust, DPLA ➢ Exhibits (Spotlight) ➢ Contributor profiles, links to campus research information systems (VIVO)

Single Asynchronous(?) Transaction API UI Fedora Solr Content SIP

Single Asynchronous(?) Transaction API UI Fedora Solr Content ? ?

Ingest Processing Build on OAIS Reference Model concepts: ○ SIP - Submission Information Package ■ Gather user input into a SIP until validated ■ Maintain SIP until validated ○ AIP - Archival Information Package ■ Package work into Fedora Object(s) ■ Manage storage into Fedora ○ DIP - Dissemination Information Package ■ Aggregate work content and metadata in a common form ■ Provide in-progress and submitted works consistently

Single Asynchronous(?) Transaction API UI Fedora Solr Content Staging DB SIP

Rebuild the Core Yet to determine where some elements fit in: Role-Based Access Control Multi-stage workflow support

Problems Addressed ➢ Easier to understand ○ Design model clarifies overall function ○ Modularity makes tasking less difficult ➢ Easier to maintain ○ Created as an app not an engine ○ Decoupling helps test creation/execution ➢ Easier to configure ○ More intelligence in the core ○ Variation in works encapsulated in data ➢ Ingest process built to handle errors ○ Fedora/Solr updated only with valid data ○ No orphaned Fedora objects ➢ Accommodates advanced work ○ Reference implementations of modules allow programming to the interface while deferring full implementations Steady velocity Decreased errors Increasing ROI Preservation

What About Fedora 4? API UI Fedora Solr Content Fedora Virtual Workspace SIP

External Dissemination Layer: ORCID, DPLA, Spotlight External Dissemination Layer: ORCID, DPLA, Spotlight June 13, 2014Open Repositories Persistence Fedora, DB, Solr Persistence Fedora, DB, Solr External (e.g. ORCID, SHARE, Spotlight) Work Viewer/HTML Workflow/Permissions Data Mapper SIP/DIP Discovery

Swappable Hydramata June 13, 2014Open Repositories

Hydramata Application and Discovery Layer Plugging In and Out June 13, 2014Open Repositories VIVO ArchivesSpace Avalon Spotlight SHARE ORCID Hydramata Persistence Layer Other Hydra … Hydramata Work Data Resolver DIP Mapper Hydramata DIP Mapper DIP Mapper?

Hydramata Application and Discovery Layer Plugging In and Out June 13, 2014Open Repositories Hydramata Persistence Layer Hydramata Work Data Resolver Hydramata DIP Mapper Other Hydra Head

Hydramata Application and Discovery Layer Plugging In and Out June 13, 2014Open Repositories Hydramata Persistence Layer Hydramata Work Data Resolver Hydramata DIP Mapper ETD Profiles Images Video

June 13, 2014 Open Repositories Curate Development Team Lead Project Director Tech Lead Scrum Master Lead QA Developer Project Director Developer Product Owner Lead Product Owner Notre Dame Northwestern Cincinnati Indiana Virginia

June 13, 2014 Open Repositories Lead Project Director Tech Lead Lead QA Developer Project Director Developer Product Owner Notre Dame Northwestern Cincinnati Indiana Virginia Scrum Master Core IR Development

June 13, 2014 Open Repositories Lead Project Director Tech Lead Lead QA Developer Project Director Developer Product Owner Notre Dame Northwestern Cincinnati Indiana Virginia Scrum Master Swarm!

Specialized Viewers? Specialized Viewers? CORE IR Syndication (Share, DPLA)? Syndication (Share, DPLA)? ORCID? June 13, 2014 Open Repositories Hydramata Development Teams Lead Project Director Tech Lead Scrum Master Lead QA Developer Project Director Developer Product Owner Lead Product Owner Notre Dame Northwestern Cincinnati Indiana Virginia Lead QA Scrum Master

Acknowledgements Jeremy Friesen, Technical Lead Dan Brubaker Horst, Product Owner Paul Clough, Developer Ray Lubinsky. Product Owner Sue Richeson, Developer Julie Rudder, Lead Product Owner Mike Stroming, Developer Entire Hydramata Project Team (15+ individuals) June 13, 2014Open Repositories

Contact Info Rick Johnson, Linda Newman, Project Hydramata wiki: – Project Project Curate code base: – Hydra Community: – June 13, 2014Open Repositories

Q & A June 13, 2014Open Repositories