Home-Grown Digital Library System Built Upon Open Source XML Technologies and Metadata Standards David Lacy Villanova University

Why Did We Do This?

Seriously, Why Did We Do This?

System Components
A METS Metadata Editor
A series of batch-process service image generation tools
An XML Database repository
A file server
An OAI server
A series of VuFind Record Drivers

Architecture Components
METS XML
eXist-db
Orbeon Forms (XForms Processor)
Tesseract (OCR)
ImageMagick

METS (Metadata Encoding and Transmission Standard)

Orbeon Forms (XML & XForms Processor)
Browser independent, plugin free, XForms Processor
AJAX driven interface controls
XML Database (eXist) integration
XML pipeline (XPL) engine for processing XML

XPL Pipelines
Vocabulary for describing a processing model for XML
– File System Controls
– XQuery Submissions
– Session Management

<xforms:submission id="batch-attach-submission" method="post" replace="none" ref="instance('rename-file-instance')" action="/rename-file.xpl" >

XPL File Processor …. Filename Directory New Filename New Directory

Collection Development Special Collections Material Strategic Partnerships Catholica United States Irish History Regional History Faculty and Alumni Scholarly Material > 9000 items

(Rapid) Work-flow
Select item
Scan TIFFs
Process service images
Instantiate Digital Item
Batch-Attach TIFFs and Service Images
Add Metadata
Index into VuFind

Service Images
Process Scanned Images (Cron)
OCR (Tesseract)
Produce Service Images (ImageMagick)
– Large
– Medium
– Thumbnail
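The service-image step could be sketched as below. This is only an illustration, not the presenter's scripts: the pixel dimensions, the output naming scheme, and both function names are assumptions, since the slides name the three tiers and the tools but nothing else. The functions only build command lines, so nothing runs until a caller executes them.

```python
from pathlib import Path

# Hypothetical size presets: the slides list Large, Medium and Thumbnail
# but do not give their dimensions.
SERVICE_SIZES = {"large": "1600x1600", "medium": "800x800", "thumbnail": "150x150"}


def service_image_commands(tiff: Path, out_dir: Path) -> list[list[str]]:
    """Build one ImageMagick `convert` command per service-image tier."""
    commands = []
    for tier, geometry in SERVICE_SIZES.items():
        # Assumed naming scheme: <scan name>_<tier>.jpg
        target = out_dir / f"{tiff.stem}_{tier}.jpg"
        # -resize WxH fits the image inside the box, preserving aspect ratio.
        commands.append(["convert", str(tiff), "-resize", geometry, str(target)])
    return commands


def ocr_command(tiff: Path, out_dir: Path) -> list[str]:
    """Build a Tesseract command; Tesseract appends .txt to the output base."""
    return ["tesseract", str(tiff), str(out_dir / tiff.stem)]
```

A cron-driven script could loop over newly scanned TIFFs and pass each command to `subprocess.run(cmd, check=True)`.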

Collection View
Add Collections
Add Resources / Items
Edit Metadata
Batch-Attach Files
View Raw METS XML
Relocate Item
Delete Item

Resources and Collections View

Batch Attach
Read Processed Images (via oxf:directory-scanner)
Add nodes to (via xforms:insert)
Move Files to File Server (via oxf:file pipeline)
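The node-insertion step happens inside Orbeon via xforms:insert in the system described. As a rough stand-in outside that stack, the sketch below appends METS file entries with Python's standard library. It assumes the nodes being added are `file` elements inside a METS `fileGrp`, which the slide does not state; the ID scheme and the function name `attach_files` are likewise hypothetical.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)


def attach_files(file_grp: ET.Element, urls: list[str], mimetype: str) -> None:
    """Append one mets:file (with a mets:FLocat child) per URL to a fileGrp."""
    existing = len(file_grp.findall(f"{{{METS_NS}}}file"))
    prefix = file_grp.get("USE", "file")
    for number, url in enumerate(urls, start=existing + 1):
        # Hypothetical ID scheme: <USE value>-<running number>
        file_el = ET.SubElement(
            file_grp,
            f"{{{METS_NS}}}file",
            {"ID": f"{prefix}-{number}", "MIMETYPE": mimetype},
        )
        # FLocat points at the copy of the file held on the file server.
        ET.SubElement(
            file_el,
            f"{{{METS_NS}}}FLocat",
            {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": url},
        )
```

Calling `attach_files` once per file group (for example one group per service-image size) keeps each batch of scanned pages in a single pass over the document.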

Batch Attach

Metadata - Completion Status Agent Information – Editors – IP Owners – Disseminators – Etc.

Metadata - Descriptive Metadata Dublin Core (DC) Looking to expand this area to other descriptive standards

Metadata - and Physical description Control Order Add / Delete files Edit Labels

Metadata - and 2 levels of file association – Page Level – Document Level

Problems
XML file size / Large Volumes
– Orbeon document serialization and XML processing occur during several events
  Could disable this at cost of AJAX functionality
– Solved
  Paginate the table displaying page/line items
  Retrieve relevant rows/items from repository
  Save document using XQuery Update
Infinite METS Flexibility
– Not solved
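The pagination fix amounts to loading only the rows for the page currently on screen instead of the whole document. A minimal sketch of that windowing arithmetic follows; the page size of 25 and both function names are assumptions. In an XQuery-backed repository the same window could plausibly be expressed with `subsequence($rows, $start, $length)`, though the slides do not show the actual query.

```python
import math
from typing import Sequence, TypeVar

T = TypeVar("T")


def page_slice(items: Sequence[T], page: int, page_size: int = 25) -> Sequence[T]:
    """Return only the rows that belong to a 1-based page number."""
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be positive")
    start = (page - 1) * page_size
    # Slicing past the end simply yields a shorter (or empty) final page.
    return items[start:start + page_size]


def page_count(total_items: int, page_size: int = 25) -> int:
    """Number of pages needed to show every item."""
    return math.ceil(total_items / page_size)
```

With 60 page-level items and 25 rows per page, the editor would need three pages, the last holding 10 rows.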

Front End
Expose Content via OAI-PMH
Index into VuFind
Search Metadata and OCR/Full Text
Digital Object Viewer and Page Turner
– Page items
– Document items

OAI-PMH Server
Written in XQuery
METS or DC
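From a harvester's side, choosing between METS and DC comes down to the `metadataPrefix` argument of an OAI-PMH request. The sketch below only builds request URLs. The base URL is a placeholder, the helper name is hypothetical, and while `oai_dc` is the prefix the protocol requires every repository to support, the prefix this server uses for METS is not given in the slides, so `mets` is an assumption.

```python
from urllib.parse import urlencode

# The six verbs defined by OAI-PMH 2.0.
OAI_VERBS = {
    "Identify", "ListMetadataFormats", "ListSets",
    "ListIdentifiers", "ListRecords", "GetRecord",
}


def oai_request_url(base_url: str, verb: str, **arguments: str) -> str:
    """Build an OAI-PMH GET request URL for the given verb and arguments."""
    if verb not in OAI_VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    query = {"verb": verb, **arguments}
    return f"{base_url}?{urlencode(query)}"


# Placeholder endpoint, not the real server address.
dc_url = oai_request_url("http://example.org/oai", "ListRecords", metadataPrefix="oai_dc")
# Assumed prefix name for the METS format.
mets_url = oai_request_url("http://example.org/oai", "ListRecords", metadataPrefix="mets")
```

A harvester such as VuFind's would fetch these URLs and follow any resumption token the server returns.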

Roadmap
Incorporate Other Metadata
– MODS, TEI, PREMIS
Breakout METS Metadata Editor
Alternative Repository Integration
JPEG2000 Support
Document Delivery (PDF wrappers, ePub)
Logical

Roadmap
CONTENTdm Migration

Coming April 2011 David Lacy Villanova University