Download presentation
Presentation is loading. Please wait.
Published byNeal Woods Modified over 9 years ago
1
Can Bilateral Digitization Tear Down the Wall Between Institutions and the Public? Ben Brumfield Digital Frontiers 2012
6
“You know Ben that it really stinks that I can't get access to the original. My grandfather Jeremiah wrote the diary so that I could read about his daily life happenings. My grandfather Edward used to own it and if he had known that I would be so interested in it I'm sure he would have kept it and given it to me instead of the university.” Alan Williams, 2009 email
7
Walls Professionally conserved Publicly accessible Catalogued 1000 miles away Reading room restrictions “Permission-to-publish” agreements Costly scanning fees
8
Penetrating the Walls Digitization Collaboration
9
Shallow Digitization (Institutional Version) “Scan-and-dump” facsimiles –Limited metadata –No transcripts –Not crawlable
10
Shallow Digitization (Amateur Version) Full transcripts –No facsimiles –No provenance –No metadata on sources –Invisible editorial decisions Cut-and-paste replication –No attribution
11
Deep Digitization Institutional Challenges –Funding –Manpower Non-institutional Challenges –Standards –Access to sources
12
Crowdsourcing Who are the volunteers? What can they do? OldWeather.org Zenas Matthews Harry Ransom Center Fragments
14
Accuracy Individual transcriptions are about 97% accurate Of 1000 transcribed logbook entries: –3 will be lost because of transcription errors –10 will be illegible –At least 3 will be errors in the logs
15
OldWeather Participation More than 1.6 million weather observations. 16,000 volunteers. 1 million log pages transcribed. Mean contribution of 100 transcriptions per user.
16
OldWeather Participation More than 1.6 million weather observations. 16,000 volunteers. 1 million log pages transcribed. Mean contribution of 100 transcriptions per user – but this statistic is worthless!
18
Power-law Distribution Most contributions are made by a core of well-informed enthusiasts. True regardless of project size. What are the implications?
20
One “Well-Informed Enthusiast” In 14 days, –Entire diary transcribed –250 revisions to 43 pages –Two dozen footnotes
24
Crowdsourcing’s Virtuous Circle Volunteers Deep digitization Findability More Volunteers!
25
One Volunteer’s Story Nat Wooding –Retired data analyst –100 pages of Julia Brumfield’s diaries transcribed and indexed in six months –No relation to diarist
26
One Volunteer’s Story Nat Wooding –Retired data analyst –100 pages of Julia Brumfield’s diaries transcribed and indexed in six months –No relation to diarist –Great-uncle was diarist’s letter carrier, also named Nat Wooding
29
Non-institutional Digitization
30
The Invisible Archive Private collections Family archivists (filing cabinets) –or their heirs (boxes in the attic) Non-notable subjects Flickr
31
The Standards Problem “We can't overemphasize the potential futility of citing websites, any websites,but especially non-institutional websites.” –Diggitt McLaughlin (H-SHEAR 2011-04-27)
32
The Standards Problem “Needless to say, amateurs will continue to put out poorly edited versions of documents in print which we, as professionals, will continue to eschew using.” –Christopher L. Miller (H-OIEAHC list, 1996-05-07)
33
Solutions Collaboration Participation by professionals in amateur projects FreeREG/FreeCEN
35
Solutions Community Flickr RootsTech
36
Solutions Software Platforms Suggested rigor Graceful degradation
37
Thanks! Ben Brumfield benwbrum@gmail.com http://fromthepage.com/ Slides and transcript to be posted at http://manuscripttranscription.blogspot.com/
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.