Published by Basil Ward. Modified over 8 years ago.
1
Experience Talks: Post-Digitization Quality Control Strategies and Tools
Mark Phillips – August 5, 2010
2
Repurposing NDNP content.
3
Or...
4
What we learned when we started to eat our own dog food.
5
We had two years of happy content delivery
6
Then we got to the point where we had to add it to our system.
7
We had a slightly different model in our system
8
So we had to write our own ingest tools
9
Which wasn't hard because we had a nice “robust” specification
10
But we ran into some interesting 'stuff'
11
Our process for reusing NDNP content.
12
LC accepts batch, and ships it back
13
We move it to the local network
14
And start to add additional metadata
15
What do we add?
16
Our model for the Portal to Texas History (and all of our digital collections)
17
Has us adding a bit more metadata for each issue.
18
Really it is most of the data in the title record pushed down to the issue level
19
We add the following fields
20
title.serial
subject.lcsh
subject.untl-bs
description.content
description.physical
coverage.placeName
coverage.era
publisher
creator.editor
contributor.(various)
language
identifiers.(various)
21
We create a metadata template for each group of “like” issues.
22
Creating a new template any time something significant changes.
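A minimal sketch of how a per-group template might be combined with issue-level values. The field names come from the list of added fields; the values, the `TEMPLATE` dictionary, and the `build_issue_record` helper are hypothetical, and the real template format is not described in these slides.

```python
import copy

# Hypothetical template shared by one group of "like" issues.
# All values here are placeholders for illustration only.
TEMPLATE = {
    "title.serial": ["Example Weekly Gazette"],
    "language": ["eng"],
    "coverage.placeName": ["United States - Texas"],
    "publisher": ["Example Publishing Co."],
}


def build_issue_record(template, issue_fields):
    """Return a new record: the template plus the issue-specific values."""
    # Deep copy so that editing one issue never alters the shared template.
    record = copy.deepcopy(template)
    for name, value in issue_fields.items():
        record.setdefault(name, []).append(value)
    return record
```

When something significant changes (a new publisher, say), a new template dictionary would be started rather than patching records one by one.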
23
Then we automate the addition of date, volume, issue, edition, pagination from the METS/MODS
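A rough sketch of what that extraction step could look like, using only the Python standard library. The sample fragment is a simplified, assumed layout for issue-level MODS, and `extract_issue_fields` is a hypothetical helper, not the script used for the Portal. Pagination is left out because it lives in page-level records.

```python
import xml.etree.ElementTree as ET

MODS_NS = {"mods": "http://www.loc.gov/mods/v3"}

# Simplified, assumed issue-level MODS fragment (values are made up).
SAMPLE_MODS = """\
<mods:mods xmlns:mods="http://www.loc.gov/mods/v3">
  <mods:relatedItem type="host">
    <mods:part>
      <mods:detail type="volume"><mods:number>12</mods:number></mods:detail>
      <mods:detail type="issue"><mods:number>34</mods:number></mods:detail>
      <mods:detail type="edition"><mods:number>1</mods:number></mods:detail>
    </mods:part>
  </mods:relatedItem>
  <mods:originInfo>
    <mods:dateIssued encoding="iso8601">1905-03-17</mods:dateIssued>
  </mods:originInfo>
</mods:mods>
"""


def extract_issue_fields(mods_xml):
    """Pull date, volume, issue, and edition out of a MODS string."""
    root = ET.fromstring(mods_xml)
    fields = {
        "date": root.findtext(".//mods:dateIssued", default=None,
                              namespaces=MODS_NS)
    }
    # Volume, issue, and edition are all <mods:detail> elements,
    # distinguished by their "type" attribute.
    for detail in root.iterfind(".//mods:detail", MODS_NS):
        kind = detail.get("type")
        if kind in ("volume", "issue", "edition"):
            fields[kind] = detail.findtext("mods:number", default=None,
                                           namespaces=MODS_NS)
    return fields
```

Calling `extract_issue_fields(SAMPLE_MODS)` returns a plain dictionary that can be merged into the issue's metadata record.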
24
I run a few scripts to check groupings for consistency.
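The slides do not say what those scripts check, so the following is only an illustration of the kind of test that might be useful. It assumes each issue is a dictionary with hypothetical `group`, `lccn`, `date`, and `edition` keys, and flags groups that mix titles or repeat a date and edition.

```python
from collections import Counter, defaultdict


def check_groups(issues):
    """Return a list of human-readable problems found in the groupings."""
    by_group = defaultdict(list)
    for issue in issues:
        by_group[issue["group"]].append(issue)

    problems = []
    for group, members in sorted(by_group.items()):
        # A group of "like" issues should belong to a single title.
        lccns = sorted({m["lccn"] for m in members})
        if len(lccns) > 1:
            problems.append(f"{group}: mixed LCCNs {lccns}")

        # The same date and edition appearing twice is worth a second look.
        seen = Counter((m["date"], m["edition"]) for m in members)
        for (date, edition), count in sorted(seen.items()):
            if count > 1:
                problems.append(
                    f"{group}: {count} issues dated {date} edition {edition}"
                )
    return problems
```

An empty list means nothing was flagged; anything else is a prompt for a person to look, not proof of an error.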
25
Then I create submission packages for each issue.
26
Issues get added to our repository where they are treated like any other content in our repository.
27
We also shove the full batch into our repository as a digital object.
28
We sometimes catch things that were missed in previous QC
29
Mainly because it is a different view of the data
30
Looking at the same data in different ways is a very good QC tool
31
Lists, simple graphs, or timelines
32
Reveal subtle patterns that “could” be problems
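As one example of such a view, here is a small sketch of a text timeline that counts issues per month. The function names are hypothetical and the input is assumed to be a list of `datetime.date` objects, one per issue. Months with no issues are printed too, since a gap is often the pattern worth noticing.

```python
from collections import Counter
from datetime import date


def month_range(first, last):
    """Yield every (year, month) pair from first to last, inclusive."""
    year, month = first
    while (year, month) <= last:
        yield (year, month)
        month += 1
        if month > 12:
            year, month = year + 1, 1


def text_timeline(issue_dates):
    """Return one line per month: the month, the count, and a bar of '#'."""
    counts = Counter((d.year, d.month) for d in issue_dates)
    if not counts:
        return ""
    lines = []
    for year, month in month_range(min(counts), max(counts)):
        n = counts[(year, month)]
        lines.append(f"{year}-{month:02d} {n:3d} {'#' * n}".rstrip())
    return "\n".join(lines)
```

For a weekly paper, a month showing one issue or none stands out at once in this view, even though each individual issue record may look fine on its own.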
33
Bring it back around, Mark...
34
Oh yeah, QC of metadata...
35
So one of the things we've wanted to experiment with was looking at a batch with different views of the data.
36
And up until recently that has required more effort than we were willing to put toward the project.
37
We are very interested in using the Chronicling America application as a framework for understanding these NDNP batches more easily.
38
Creating views of batches that we could share with other awardees.
39
Hopefully improving quality along the way.
40
http://texashistory.unt.edu