Experience Talks: Post-Digitization Quality Control Strategies and Tools Mark Phillips – August 5, 2010.

Repurposing NDNP content.

Or...

What we learned when we started to eat our own dog food.

We had two years of happy content delivery

Then we got to the point where we had to add it to our system.

We had a slightly different model in our system

So we had to write our own ingest tools

Which wasn't hard because we had a nice “robust” specification

But we ran into some interesting 'stuff'

Our process for reusing NDNP content.

LC accepts batch, and ships it back

We move it to the local network

And start to add additional metadata

What do we add?

Our model for the Portal to Texas History (and all of our digital collections)

Has us adding a bit more metadata for each issue.

Really it is most of the data in the title record pushed down to the issue level

We add the following fields

title.serial
subject.lcsh
subject.untl-bs
description.content
description.physical
coverage.placeName
coverage.era
publisher
creator.editor
contributor.(various)
language
identifiers.(various)

We create a metadata template for each group of “like” issues.

Creating a new template anytime something significant changes.
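
As a rough illustration of the template idea, here is a minimal Python sketch. It assumes a template is just a mapping of field names to values that gets copied and combined with issue-level data. The field values, the `TEMPLATE` name, and the `build_issue_record` helper are all hypothetical placeholders, not our actual tooling.

```python
import copy

# Hypothetical template for one group of "like" issues.
# All values are made-up placeholders.
TEMPLATE = {
    "title.serial": "Example Gazette",
    "publisher": "Example Publishing Co.",
    "coverage.placeName": "Example County, Texas",
    "language": "eng",
    "subject.lcsh": ["Newspapers"],
}


def build_issue_record(template, issue_fields):
    """Return a new record: the template fields plus issue-level fields."""
    record = copy.deepcopy(template)  # never mutate the shared template
    record.update(issue_fields)       # issue-level values win on a clash
    return record


record = build_issue_record(
    TEMPLATE, {"date": "1900-01-05", "volume": "12", "issue": "3"}
)
```

The deep copy matters because list-valued fields such as subjects would otherwise be shared between every issue built from the same template.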

Then we automate the addition of date, volume, issue, edition, and pagination from the METS/MODS
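
A sketch of what that extraction could look like in Python, using only the standard library. The sample XML is a simplified, made-up MODS fragment; a real issue-level METS file wraps the MODS in more structure, and the element layout assumed here should be checked against the actual batch. Pagination is left out to keep the example short, and `extract_issue_fields` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

MODS_NS = {"mods": "http://www.loc.gov/mods/v3"}

# Simplified, made-up MODS fragment used only for illustration.
SAMPLE = """
<mods:mods xmlns:mods="http://www.loc.gov/mods/v3">
  <mods:relatedItem type="host">
    <mods:part>
      <mods:detail type="volume"><mods:number>12</mods:number></mods:detail>
      <mods:detail type="issue"><mods:number>3</mods:number></mods:detail>
      <mods:detail type="edition"><mods:number>1</mods:number></mods:detail>
    </mods:part>
  </mods:relatedItem>
  <mods:originInfo>
    <mods:dateIssued encoding="iso8601">1900-01-05</mods:dateIssued>
  </mods:originInfo>
</mods:mods>
"""


def extract_issue_fields(xml_text):
    """Pull date, volume, issue, and edition out of a MODS fragment."""
    root = ET.fromstring(xml_text)
    fields = {}

    date = root.find(".//mods:originInfo/mods:dateIssued", MODS_NS)
    if date is not None and date.text:
        fields["date"] = date.text.strip()

    # Each <mods:detail> carries its kind in the "type" attribute.
    for detail in root.iterfind(".//mods:part/mods:detail", MODS_NS):
        kind = detail.get("type")
        number = detail.find("mods:number", MODS_NS)
        if kind in ("volume", "issue", "edition") and number is not None and number.text:
            fields[kind] = number.text.strip()

    return fields


issue_fields = extract_issue_fields(SAMPLE)
```

Missing elements are simply skipped rather than raising, so an absent field shows up later as a gap that a consistency check can flag.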

I run a few scripts to check groupings for consistency.
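
One possible shape for such a check, sketched in Python. This is an assumption about what "consistency" might mean, not the actual scripts: it groups records by one field and reports any group where another field takes more than one value. The record layout, the `lccn` key, and the sample values are hypothetical.

```python
from collections import defaultdict


def find_inconsistent_groups(records, group_key="lccn", check_field="title.serial"):
    """Return {group: set of values} for groups with more than one distinct value."""
    values = defaultdict(set)
    for rec in records:
        values[rec[group_key]].add(rec.get(check_field))
    return {group: vals for group, vals in values.items() if len(vals) > 1}


# Made-up records; the second one has a typo in the title.
records = [
    {"lccn": "sn00000001", "title.serial": "Example Gazette"},
    {"lccn": "sn00000001", "title.serial": "Example Gazete"},
    {"lccn": "sn00000002", "title.serial": "Example Herald"},
]

problems = find_inconsistent_groups(records)
```

Here `problems` contains only the first group, with both spellings of the title listed side by side for review.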

Then I create submission packages for each issue.

Issues get added to our repository, where they are treated like any other content.

We also shove the full batch into our repository as a digital object.

We sometimes catch things that were missed in previous QC

Mainly because it is a different view of the data

Looking at the same data in different ways is a very good QC tool

lists, simple graphs or timelines

Reveal subtle patterns that “could” be problems
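
To make the "different views" point concrete, here is a small Python sketch that turns a list of issue dates into two views: a text bar chart of issues per month, and a list of gaps longer than the expected publishing interval. The dates, the seven-day threshold, and the function names are made up for illustration.

```python
from collections import Counter
from datetime import date


def issues_per_month(dates):
    """Count issues per YYYY-MM."""
    return Counter(f"{d.year:04d}-{d.month:02d}" for d in dates)


def text_chart(counts):
    """Render the counts as one bar of '#' characters per month."""
    return "\n".join(f"{month} {'#' * n} ({n})" for month, n in sorted(counts.items()))


def find_gaps(dates, max_days=7):
    """Return pairs of consecutive issue dates that are more than max_days apart."""
    ordered = sorted(dates)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if (b - a).days > max_days]


# Made-up dates for a weekly title with a hole in February.
dates = [
    date(1900, 1, 5), date(1900, 1, 12), date(1900, 1, 19), date(1900, 1, 26),
    date(1900, 2, 2), date(1900, 3, 2), date(1900, 3, 9),
]

print(text_chart(issues_per_month(dates)))
print(find_gaps(dates))
```

A short bar or a reported gap is not necessarily an error, since the paper may simply not have published or the issues may not have survived, but it tells a reviewer where to look.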

Bring it back around Mark...

Oh yeah QC of metadata...

So one of the things we've wanted to experiment with was looking at a batch with different views of the data.

And up until recently that has required more effort than we were willing to put toward the project.

We are very interested in using the Chronicling America application as a framework for understanding these NDNP batches more easily.

Creating views of batches that we could share with other awardees.

Hopefully improving quality along the way.