Download presentation
Presentation is loading. Please wait.
Published byBaldric Holt Modified over 8 years ago
1
The Challenge of Collecting and Providing Access to Social Media Content Vakil Smallen Rachel Trent Brian Dietz Peter Broadwell Camille Tyndall Watson
2
2012 - SFM initiated 2013 - 2014 - IMLS Sparks! Ignition Grant (LG-46-13-0257) 2014 - 2017 - NHPRC Grant (DI-50017-14) GWU + Social Feed Manager @SocialFeedMgr | sfm@gwu.edu Main website & blog: go.gwu.edu/sfm Documentation: social-feed-manager.readthedocs.io/en/m5_004 Code & tickets: github.com/gwu-libraries/sfm-ui Working paper (give us feedback!): bit.ly/prov-tweet-doc
3
My #HuntLibrary https://d.lib.ncsu.edu/myhuntlibrary Social Media Archives Toolkit https://go.ncsu.edu/smalt Social Media Combine https://go.ncsu.edu/smcombine The Toolkit and Combine were made possible through funding from the federal Institute of Museum and Library Services (IMLS) under the provisions of the Library Services and Technology Act (LSTA) as administered by the State Library of North Carolina, a division of the North Carolina Department of Cultural Resources.
4
Peter Broadwell (@PeterBroadwell) Collecting and Providing Access to Social Media Content Society of American Archivists, Atlanta, GA, August 5, 2016 The UCLA Broadcast NewsScape >294,000 hours of TV news archived digitally Recorded 2005-present, ca. 100 shows/day 14 countries, 11 languages 46 networks (20 non-US), 379,000 shows Searchable by ~3.61 billion words of captions, on-screen text, official transcripts. Twitter collections at the UCLA Library Collected by Digital Library with Social Feed Manager – or – Collected by faculty with various tools and given to us later Examples: 785,000 tweets from Arab Spring movements (collected by faculty) Millions of tweets about various global disasters (Library)
5
Peter Broadwell (@PeterBroadwell) Collecting and Providing Access to Social Media Content Society of American Archivists, Atlanta, GA, August 5, 2016 Twitter/TV news linking experiment Apply DBpedia Spotlight Named Entity Recognition (NER) software to TV and Twitter collections on second GOP presidential primary debate on 9/16/2015 Twitter: 800,000 tweets TV: CNN coverage of debate Minute granularity Persons, Organizations, Places Results: Linked entities with URIs to DBpedia resources Visualization of correlations between entities in collaboration with Martin Klein, UCLA Library
6
Peter Broadwell (@PeterBroadwell) Collecting and Providing Access to Social Media Content Society of American Archivists, Atlanta, GA, August 5, 2016 Twitter/TV News debate coverage: Persons http://sologlo.library.ucla.edu/NER/twitter/gop_persons.html
7
Peter Broadwell (@PeterBroadwell) Collecting and Providing Access to Social Media Content Society of American Archivists, Atlanta, GA, August 5, 2016 Matching Twitter term profiles to news programs http://sologlo.library.ucla.edu/NER/tv/
8
Authority to collect G.S. 132—Public Records Law Defines a public record Defines State Archives as custodian of public records G.S. 121-5 Archives and History Assigns responsibilities for providing guidance and as the keeper of the permanent records of the state Assigns records management responsibilities Defines how records are managed through a records retention program General Schedule for State Agency Records Best Practices for Social Media Usage for State Agencies
9
CHALLENGES ► How do you capture complete, authentic records and metadata? ► Once you capture them, how do you manage these records? ► What do citizens expect? ► What/how should access look?
10
Questions? Camille Tyndall Watson Digital Archivist, State Archives of NC Camille.tyndallwatson@ncdcr.gov
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.