Toward a Digital Asset Management Ecosystem at Texas A&M University Libraries: An update on developments in document workflows, data modeling, media service, and exhibitions

Presentation transcript:

Toward a Digital Asset Management Ecosystem at Texas A&M University Libraries
An update on developments in document workflows, data modeling, media service, and exhibitions

William Welling, James Creel, Jeremy Huff, Jason Savell, Douglas Hahn, Sarah Potvin, Sean Buckner, Michael Bolton

Welcome to Managing Digital Assets from Curation to Exhibition.

Background: Magpie was presented last year at TCDL. The software is built on the Weaver framework, an in-house framework of the Texas A&M University Libraries. The original use case for Magpie, then called the Metadata Assignment Tool, was to facilitate annotation of legacy dissertations from their scanned and OCR'd form. There was also a requirement to prepopulate specific metadata fields from Voyager's MARC records for the legacy dissertations. Since then, that use case has evolved, and further use cases have emerged as we recognized the software's potential.

Talk Outline
  Introducing the DAME
  Components and Technologies
  DAMS Task Force
  Needs Assessment
  Piloting and Evaluation
  Components and Technologies
  Document Workflows
  IR/Storage/Data Modeling
  Media Servers
  Discovery
  Exhibition
  Demonstration of Current Developments

accuracy: ensure metadata is correct
efficiency: annotate large collections in a reasonable amount of time
normalization: avoid repeating metadata, avoid multiple ambiguous terms, use controlled vocabularies and name authorities
synchronization: ensure metadata is consistent between repositories, exhibits, preservation, etc.

Digital assets can come from many collections, some homogeneous and some not. Metadata can come from many sources, such as catalogs, repositories, and exports. The annotated digital asset can have many destinations, such as multiple IRs, multiple exhibits, and/or preservation.

A good exhibition solution needs to exhibit multiple collections with digital assets from varying sources. It should allow exhibit-level annotations. It should be discoverable and aesthetically pleasing. Ideally, the exhibit layer would not duplicate the digital asset but reference it and make use of linked data.

Introducing the DAME
  Needs assessment revealed a diverse set of requirements not met by any single system
  Different exhibitions and collection types require different workflows
    Legacy dissertations
    Agricultural Bulletin
    Image Collections
    Newspapers and articles
    Existing Collections
    Etc.

Legacy dissertations: This is an ongoing project to digitize, annotate, and preserve legacy dissertations. As already mentioned, this was the original use case. The requirements have since changed. The annotation will be conducted using existing commercial tools, but that still left the need to preserve the digital assets. The process required that the MARC record from Voyager be packaged with the documents and preserved in Archivematica. No user interaction was required for this, so we introduced a headless mode to Magpie. When the scanned documents are finished, the headless process begins: the MARC record is gathered from Voyager, and the documents are automatically packaged and pushed to Archivematica for preservation.

Agricultural Bulletin: Dr. Robert McGeachin, Agriculture and Digital Services Librarian, came to us with a project to semi-automatically index the Agricultural Bulletin, which he has so diligently been annotating while marshaling the assets into DSpace. The project called for a UI that provides a list of suggestions, discovered by finding all occurrences of any term from the National Agricultural Library Thesaurus and applying a set of use-for rules.

Image Collections: We have had requests for digital exhibits of photos taken from physical exhibits.
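The headless flow described for the legacy dissertations can be sketched roughly as follows. This is a minimal illustration and not Magpie's actual code: the function names, the objects/metadata package layout, and the marc.xml file name are all assumptions, and the Voyager lookup and the Archivematica transfer are passed in as plain callables so that no real API is implied.

```python
import shutil
from pathlib import Path
from typing import Callable, List


def package_document(scan_dir: Path, marc_record: bytes, out_root: Path) -> Path:
    """Copy a finished scan folder into a new package and add its MARC record."""
    package = out_root / scan_dir.name
    # Illustrative layout: scanned files under objects/, the record under metadata/.
    shutil.copytree(scan_dir, package / "objects")
    metadata_dir = package / "metadata"
    metadata_dir.mkdir()
    (metadata_dir / "marc.xml").write_bytes(marc_record)
    return package


def run_headless(
    scan_root: Path,
    out_root: Path,
    fetch_marc: Callable[[str], bytes],
    push: Callable[[Path], None],
) -> List[str]:
    """Process every finished document folder without user interaction."""
    processed = []
    for scan_dir in sorted(p for p in scan_root.iterdir() if p.is_dir()):
        record = fetch_marc(scan_dir.name)  # hypothetical Voyager lookup
        package = package_document(scan_dir, record, out_root)
        push(package)  # hypothetical Archivematica transfer
        processed.append(scan_dir.name)
    return processed
```

Injecting fetch_marc and push keeps the sketch runnable with stand-ins; a real deployment would supply functions that talk to the catalog and to the preservation system.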

Ecosystem
  MAGPIE Service and UI
  Pelican Service
  Authorization Service
  Scanning workstations
  Abbyy OCR
  DSpace
  Fedora
  IIIF Manifest Generator
  Archivematica
  Cantaloupe
  Spotlight
  Mirador

Magpie: the application being presented
Pelican: a web service for name disambiguation, modified to provide subject suggestions by counting the frequency of thesaurus words within a document and applying use-for rules
Authorization Service: used to authenticate and authorize applications, either by basic login or through a service provider such as Shibboleth
Scanning workstations: scanners for scanning physical documents
Abbyy OCR: software run as a service to perform optical character recognition on scanned documents
DSpace: institutional repository
Fedora: institutional repository
Archivematica: preservation software
Spotlight: exhibition software
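The subject suggestion approach described for Pelican (count thesaurus term occurrences, then apply use-for rules) might look something like the sketch below. The terms and rules are made-up stand-ins rather than entries from the National Agricultural Library Thesaurus, and suggest_subjects is a hypothetical name, not Pelican's API.

```python
import re
from collections import Counter

# Hypothetical thesaurus fragment: preferred terms, plus "use-for" rules that
# map a non-preferred term to the preferred term to use in its place.
PREFERRED_TERMS = {"maize", "soil fertility", "cotton"}
USE_FOR = {"corn": "maize", "soil productivity": "soil fertility"}


def suggest_subjects(text, limit=5):
    """Return (preferred term, count) pairs ranked by frequency in the text."""
    normalized = re.sub(r"\s+", " ", text.lower())
    counts = Counter()
    for term in PREFERRED_TERMS | set(USE_FOR):
        # Whole-word match so that "corn" does not match "cornerstone".
        pattern = r"\b" + re.escape(term) + r"\b"
        hits = len(re.findall(pattern, normalized))
        if hits:
            # Credit occurrences of a non-preferred term to its preferred term.
            counts[USE_FOR.get(term, term)] += hits
    return counts.most_common(limit)
```

Folding variant counts into the preferred term means the annotator sees one suggestion per concept, which fits the normalization goal of avoiding multiple ambiguous terms.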

Document Workflow Components
  Projects
  Observers
  Authorities
  Automatic Suggestions
  Exporters
  Repositories
  Exhibits

IIIF Manifest Generator
[Workflow diagram. Labels: Filesystem; DSpace; CSV Metadata Spreadsheet; Legacy Dissertations; Agricultural Bulletins; Primeros Libros (SAF); WW I Postcards (SAF); MAGPIE; Observer (x4); Authority (x2); Voyager (OPAC); Suggestor; Pelican (NLP); Repository (DSpace REST); Repository (Archivematica REST); Repository (Fedora REST w/ PCDM); Exporter (x2); DSpace SAF Import Archive; Spotlight Import CSV Spreadsheet; Archivematica; Fedora; IIIF Manifest Generator; Mirador; Spotlight.]

Project Configuration
  Defined by JSON
  Configurable: Metadata Fields, Ingest mode, Authorities, Suggestors, Repositories, Exporters
  https://github.com/TAMULib/MetadataAssignmentToolService/blob/pre-production-debugging/src/main/resources/config/projects.json
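To make the idea of a JSON-defined project concrete, here is a small sketch of what loading such a definition could involve. The key names (ingestType, metadataFields, and so on) and the values are invented for illustration and are not taken from the projects.json file linked above; only the list of configurable aspects comes from the slide.

```python
import json
from dataclasses import dataclass, field
from typing import List

# Hypothetical project definition; the key names are illustrative only.
EXAMPLE_CONFIG = """
[
  {
    "name": "dissertation",
    "ingestType": "headless",
    "metadataFields": ["dc.title", "dc.creator", "dc.subject"],
    "authorities": ["voyager"],
    "suggestors": [],
    "repositories": ["archivematica"],
    "exporters": []
  }
]
"""


@dataclass
class Project:
    name: str
    ingest_type: str
    metadata_fields: List[str] = field(default_factory=list)
    authorities: List[str] = field(default_factory=list)
    suggestors: List[str] = field(default_factory=list)
    repositories: List[str] = field(default_factory=list)
    exporters: List[str] = field(default_factory=list)


def load_projects(raw: str) -> List[Project]:
    """Parse a JSON array of project definitions into Project objects."""
    projects = []
    for entry in json.loads(raw):
        projects.append(
            Project(
                name=entry["name"],  # required
                ingest_type=entry["ingestType"],  # required
                metadata_fields=entry.get("metadataFields", []),
                authorities=entry.get("authorities", []),
                suggestors=entry.get("suggestors", []),
                repositories=entry.get("repositories", []),
                exporters=entry.get("exporters", []),
            )
        )
    return projects
```

Keeping each workflow in configuration rather than code is what would let one application serve collections as different as headless dissertation preservation and interactive bulletin indexing.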

Data Modeling
  DSpace: flat metadata
  Fedora: PCDM
  IIIF: Collections, Presentations, Images
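As an illustration of how flat, DSpace-style metadata could be carried into a IIIF presentation, the sketch below builds a minimal manifest as a Python dict. It assumes the IIIF Presentation API 2.x document shape; the URL patterns, the build_manifest name, and the dc.title key are assumptions, and this is not the code of the IIIF Manifest Generator mentioned in the talk.

```python
import json

PRESENTATION_CONTEXT = "http://iiif.io/api/presentation/2/context.json"
IMAGE_CONTEXT = "http://iiif.io/api/image/2/context.json"
IMAGE_PROFILE = "http://iiif.io/api/image/2/level2.json"


def build_manifest(base_url, item_id, flat_metadata, pages):
    """Build a minimal manifest; pages is a list of (service_url, width, height)."""
    canvases = []
    for number, (service_url, width, height) in enumerate(pages, start=1):
        canvas_id = f"{base_url}/{item_id}/canvas/{number}"
        canvases.append({
            "@id": canvas_id,
            "@type": "sc:Canvas",
            "label": f"Page {number}",
            "width": width,
            "height": height,
            "images": [{
                "@type": "oa:Annotation",
                "motivation": "sc:painting",
                "on": canvas_id,
                "resource": {
                    # The image is referenced on the image server, not copied.
                    "@id": f"{service_url}/full/full/0/default.jpg",
                    "@type": "dctypes:Image",
                    "format": "image/jpeg",
                    "width": width,
                    "height": height,
                    "service": {
                        "@context": IMAGE_CONTEXT,
                        "@id": service_url,
                        "profile": IMAGE_PROFILE,
                    },
                },
            }],
        })
    return {
        "@context": PRESENTATION_CONTEXT,
        "@id": f"{base_url}/{item_id}/manifest",
        "@type": "sc:Manifest",
        "label": flat_metadata.get("dc.title", item_id),
        # Flat key/value metadata maps directly onto label/value pairs.
        "metadata": [{"label": k, "value": v} for k, v in sorted(flat_metadata.items())],
        "sequences": [{"@type": "sc:Sequence", "canvases": canvases}],
    }
```

Calling json.dumps on the result gives a document that a viewer can load by URL, so the exhibit layer points at the asset instead of duplicating it.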

Discovery and Exhibition
  Discovery: Solr, Fuseki
  Exhibition: IIIF Manifests, Cantaloupe Image Server, Spotlight, Mirador
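An image server in this role is addressed through IIIF Image API URLs, which encode the requested region, size, rotation, quality, and format in the path. The helper below is a small sketch assuming Image API 2.x URL syntax; the base URL (including the /iiif/2 prefix) and the identifier are made-up examples, not the deployment described in the talk.

```python
from urllib.parse import quote


def image_url(base, identifier, region="full", size="full",
              rotation="0", quality="default", fmt="jpg"):
    """Build an Image API request URL for one rendition of an image."""
    # The identifier is percent-encoded so that slashes in it do not read as path segments.
    encoded = quote(identifier, safe="")
    return f"{base.rstrip('/')}/{encoded}/{region}/{size}/{rotation}/{quality}.{fmt}"


def info_url(base, identifier):
    """Build the URL of the image information document (info.json)."""
    return f"{base.rstrip('/')}/{quote(identifier, safe='')}/info.json"
```

For example, image_url("https://images.example.org/iiif/2", "postcards/card 01.tif", size="600,") asks for the whole image scaled to a width of 600 pixels, which is the kind of request a viewer issues when rendering thumbnails.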

Legacy Scanned Dissertations
[Workflow diagram: MAGPIE with its sources, observers, authorities, suggestor, repositories, exporters, and exhibition components.]

Texas Agricultural Experiment Station Publications
[Workflow diagram: MAGPIE with its sources, observers, authorities, suggestor, repositories, exporters, and exhibition components.]

IIIF Manifest Generator
[Workflow diagram: MAGPIE with its sources, observers, authorities, suggestor, repositories, exporters, and exhibition components.]

Future Direction
  Full Production Deployment
  UI managed projects
  UI annotation to support PCDM
  A/V media support
  Heuristics to guess metadata
  Open source code of MAGPIE

Thank You. Any Questions?