Experience Talks: Post-Digitization Quality Control Strategies and Tools Mark Phillips – August 5, 2010.

1 Experience Talks: Post-Digitization Quality Control Strategies and Tools Mark Phillips – August 5, 2010

2 Repurposing NDNP content.

3 Or...

4 What we learned when we started to eat our own dog food.

5 We had two years of happy content delivery.

6 Then we got to the point where we had to add it to our system.

7 We had a slightly different model in our system

8 So we had to write our own ingest tools

9 Which wasn't hard because we had a nice “robust” specification

10 But we ran into some interesting 'stuff'

11 Our process for reusing NDNP content.

12 LC accepts batch, and ships it back

13 We move it to the local network

14 And start to add additional metadata

15 What do we add?

16 Our model for the Portal to Texas History (and all of our digital collections)

17 Has us adding a bit more metadata for each issue.

18 Really it is most of the data in the title record pushed down to the issue level

19 We add the following fields

20 title.serial, subject.lcsh, subject.untl-bs, description.content, description.physical, coverage.placeName, coverage.era, publisher, creator.editor, contributor.(various), language, identifiers.(various)

21 We create a metadata template for each group of “like” issues.

22 Creating a new template anytime something significant changes.
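
The template idea above can be sketched as a shared dictionary that is copied into every issue record in a group. This is a minimal illustration, not the Portal's actual tooling: the `build_issue_record` helper, the template values, and the per-issue field names are all hypothetical, and only the template field names come from the list on slide 20.

```python
import copy

# Hypothetical template for one group of "like" issues.
# Field names follow slide 20; the values are placeholders.
TEMPLATE = {
    "title.serial": "Example Gazette",
    "language": "eng",
    "coverage.placeName": "Example County, Texas",
}


def build_issue_record(template, issue_fields):
    """Return a new record: template fields plus per-issue fields.

    Raises ValueError if an issue-level field would silently
    overwrite a template field.
    """
    record = copy.deepcopy(template)
    overlap = set(record) & set(issue_fields)
    if overlap:
        raise ValueError(f"fields defined twice: {sorted(overlap)}")
    record.update(issue_fields)
    return record


record = build_issue_record(TEMPLATE, {"date": "1900-01-05", "volume": "12"})
```

Copying the template rather than mutating it keeps one group's values from leaking into another when something significant changes and a new template is started.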

23 Then we automate the addition of date, volume, issue, edition, pagination from the METS/MODS
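
As a rough sketch of that automation, the issue-level values can be read out of the MODS section with Python's standard XML parser. The sample XML below is a simplified assumption about how an issue's MODS might look, not a copy of a real batch file, and `extract_issue_fields` is a hypothetical helper; pagination is left out to keep the example short.

```python
import xml.etree.ElementTree as ET

# MODS version 3 namespace.
MODS_NS = {"mods": "http://www.loc.gov/mods/v3"}

# Simplified, assumed shape of an issue-level MODS record.
SAMPLE_MODS = """
<mods:mods xmlns:mods="http://www.loc.gov/mods/v3">
  <mods:relatedItem type="host">
    <mods:part>
      <mods:detail type="volume"><mods:number>12</mods:number></mods:detail>
      <mods:detail type="issue"><mods:number>3</mods:number></mods:detail>
      <mods:detail type="edition"><mods:number>1</mods:number></mods:detail>
    </mods:part>
  </mods:relatedItem>
  <mods:originInfo>
    <mods:dateIssued encoding="iso8601">1900-01-05</mods:dateIssued>
  </mods:originInfo>
</mods:mods>
"""


def extract_issue_fields(mods_xml):
    """Pull date, volume, issue and edition out of a MODS string.

    Missing elements come back as None so they can be flagged later.
    """
    root = ET.fromstring(mods_xml)
    fields = {
        "date": root.findtext(
            ".//mods:originInfo/mods:dateIssued", default=None, namespaces=MODS_NS
        )
    }
    for kind in ("volume", "issue", "edition"):
        path = f".//mods:part/mods:detail[@type='{kind}']/mods:number"
        fields[kind] = root.findtext(path, default=None, namespaces=MODS_NS)
    return fields


fields = extract_issue_fields(SAMPLE_MODS)
```

Returning `None` for anything missing, instead of raising, makes it easy to list every incomplete issue in one pass.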

24 I run a few scripts to check groupings for consistency.
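
The talk does not say what those scripts check, so the following is only one plausible example. It assumes each issue is a dictionary with hypothetical `template`, `lccn`, `date` and `edition` keys, and flags groups that mix titles or repeat a date and edition.

```python
from collections import Counter, defaultdict


def check_groupings(issues):
    """Return a list of (template, problem, details) tuples."""
    by_template = defaultdict(list)
    for issue in issues:
        by_template[issue["template"]].append(issue)

    problems = []
    for template, group in sorted(by_template.items()):
        # A group of "like" issues should belong to a single title.
        lccns = sorted({i["lccn"] for i in group})
        if len(lccns) > 1:
            problems.append((template, "mixed lccn", lccns))

        # The same date and edition should not appear twice in a group.
        counts = Counter((i["date"], i["edition"]) for i in group)
        dupes = sorted(key for key, n in counts.items() if n > 1)
        if dupes:
            problems.append((template, "duplicate date/edition", dupes))
    return problems


sample = [
    {"template": "a", "lccn": "sn00000001", "date": "1900-01-05", "edition": "1"},
    {"template": "a", "lccn": "sn00000001", "date": "1900-01-05", "edition": "1"},
    {"template": "b", "lccn": "sn00000001", "date": "1901-01-05", "edition": "1"},
    {"template": "b", "lccn": "sn00000002", "date": "1901-01-12", "edition": "1"},
]
problems = check_groupings(sample)
```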

25 Then I create submission packages for each issue.

26 Issues get added to our repository where they are treated like any other content in our repository.

27 We also shove the full batch into our repository as a digital object.

28 We sometimes catch things that were missed in previous QC

29 Mainly because it is a different view of the data

30 Looking at the same data in different ways is a very good QC tool

31 lists, simple graphs or timelines

32 Reveal subtle patterns that “could” be problems
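
One simple view of this kind is a text timeline of issues per month, where empty months stand out. The sketch below assumes issue dates are ISO `YYYY-MM-DD` strings; the function names are made up for the example and are not part of any NDNP tool.

```python
from collections import Counter


def month_range(first, last):
    """Yield every 'YYYY-MM' from first to last, inclusive."""
    year, month = map(int, first.split("-"))
    end = tuple(map(int, last.split("-")))
    while (year, month) <= end:
        yield f"{year:04d}-{month:02d}"
        month += 1
        if month > 12:
            year, month = year + 1, 1


def timeline(dates):
    """Return (month, issue_count) pairs covering the whole date span."""
    counts = Counter(d[:7] for d in dates)
    if not counts:
        return []
    return [(m, counts.get(m, 0)) for m in month_range(min(counts), max(counts))]


def render(rows):
    """Draw one line per month, marking months with no issues."""
    lines = []
    for month, n in rows:
        marker = "  <- no issues" if n == 0 else ""
        lines.append(f"{month} {'#' * n}{marker}")
    return "\n".join(lines)


rows = timeline(["1900-01-05", "1900-01-12", "1900-03-02"])
print(render(rows))
```

A gap in the output is not necessarily an error (a paper may have paused publication), which is why these views surface things that "could" be problems instead of deciding for you.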

33 Bring it back around Mark...

34 Oh yeah, QC of metadata...

35 So one of the things we've wanted to experiment with was looking at a batch with different views of the data.

36 And up until recently that has required more effort than we were willing to put toward the project.

37 We are very interested in using the Chronicling America application as a framework for understanding these NDNP batches more easily.

38 Creating views of batches that we could share with other awardees.

39 Hopefully improving quality along the way.

40 http://texashistory.unt.edu

