Latin American and Human Rights Web Archiving as part of Research Library Special Collections Kent Norsworthy LLILAS Benson Digital Curation Coordinator,

Slides:



Advertisements
Similar presentations
Web Archives and Large-Scale Data: Preliminary Techniques for Facilitating Research Nicholas Woodward Latin American Network Information Center
Advertisements

Metadata for Digital Content at the Library of Congress Jane Mandelbaum Information Technology Services Library of Congress May 2009.
LIBRARY & ARCHIVES CANADA Canadas Knowledge Institution for the 21 st Century Presentation to the Conference of Directors of National Libraries August.
Moving Forward With Digital Preservation at the Library of Congress Laura Campbell Associate Librarian for Strategic Initiatives Library of Congress.
Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
Providing collections, tools and services for digital humanities A national library perspective Clément Oury Head of Digital Legal Deposit Bibliothèque.
1 What is the Internet Archive We are a Digital Library Mission Statement: Universal access to human knowledge Founded in 1996 by Brewster Kahle in San.
Social Media.
Bibliothèque nationale de France Tallinn, BnF update: production and development priorities in 2015.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Workforce Demand and Career Opportunities in University and Research Libraries NAS Symposium on Digital Curation Anne R. Kenney July 19, 2012.
Ball State University Libraries A destination for research, learning, and friends Using Google Analytics Data to Expand Discovery and Use of Digital Archival.
Newspaper Preservation through Collaboration and Communication The Texas Digital Newspaper Program By Ana Krahmer & Mark Phillips University of North Texas.
Reference 2.0: Using New Web Technologies to Enhance Public Service Texas Library Association Conference April 17, 2008 Stephen F. Austin State University’s.
New organisational perspectives in 'library business' in the future – case study Finland Kristiina Hormia-Poutanen National Library of Finland.
HARVARD UNIVERSITY iCOMMONS March 28, Integrated Academic Infrastructure The first six months LiMIT Meeting March 28, 2007 Susan A. Rogers.
OU Digital Library development project Liz Mallett – Project Manager James Alexander – Project Developer 25 January 2012.
1 Archiving and Preserving the Web Kristine Hanna Internet Archive April 2006.
Archive-It collection on “Occupy Movement 2011/2012” Archiving Web Content.
World Bank, Africa Region, Africa Household Survey Databank - The World Bank - Africa.
Columbia University Libraries and Global Resources Network Conference Human Rights Archives and Documentation: Meeting the Needs of Research, Teaching,
1 Archiving and Preserving the Web Dan Avery Kristine Hanna Merrilee Proffitt Internet Archive RLG April 2006.
How to Face the Challenges of Web Archiving? The experiences of a small library on the edge. Chloe Martin, Internet Memory Catherine Ryan, National Library.
Web The Internet Archive. Agenda Brief Introduction to IA Web Archiving Collection Policies and Strategies Key Challenges (opportunities for.
The Web is a Mess: or How I Learned to Stop Worrying and Love Web Archiving Lori Donovan, Internet Archive.
Web Capture team Office of strategic initiatives February 27, 2006 Selecting Content from the Web: Challenges and Experiences of the Library of Congress.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
The Web Archiving Service Tracy Seneca California Digital Library California Digital LibraryNew York UniversityUniversity of North Texas National Digital.
WHS joined Archive-It in the fall of 2010 Began capturing state information with the capture of Governor Jim Doyle’s websites at the end of the administration.
MADGIC is… MAPS and ATLASES DATA: NUMERIC and GEOSPATIAL (for use with special software) GOVERNMENT INFORMATION (parliamentary and other official reports,
Plans for 2015 Tallinn, Jan 29 th, 2015 Ditte Laursen, Sabine Schostag,
Kent Norsworthy, LLILAS and Benson Latin American Collection Jade Diaz, University of Texas Libraries2012 Texas Conference for Digital Libraries Kent Norsworthy,
MADGIC is… MAPS and ATLASES DATA (NUMERIC and GEOSPATIAL) for use with special software GOVERNMENT INFORMATION (parliamentary and other official reports,
Caught in the Web: Web Archiving at U of A Libraries Geoff Harder and Kenton Good Digital Preservation Seminar | March 5, 2010 | University of Alberta.
EUscreen: Examining An Aggregator ’ s Role in Digital Preservation Samantha Losben Digital Preservation - Final Project December 15, 2010.
1 Archive-It: Archiving and Preserving Born Digital Content NDIIPP June 2009 Molly Bragg Partner Specialist Internet Archive.
Digital Special Collections Users Council Annual Meeting May 9, 2008.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
Web Archiving at the National Library of Australia Russell Latham Senior Web Archivist, National Library of Australia.
In, Out, and Beyond: Integrating Special Collections at UCLA Library Tom Hyry UCLA Library Special Collections Living the Future Conference April 23, 2012.

AILLA:The Archive of the Indigenous Languages of Latin America Heidi Johnson The University of Texas at Austin Latin American Digital Library Initiative,
Francis X. Blouin Jr. Bentley Library, University of Michgian June 5, 2012 Libraries in the Service of their Universities.
Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.
Researching Your Project: Picking a Topic. Books If you have no idea for at topic one can “browse”... Check the Bibliographies Library has many and new.
How do I find works in the Repository?. University of Texas Libraries UT DR Digital Repository Search in the Repository Keyword search from the Repository.
ALA Institutional Repository Update ALA Archives at the University of Illinois Urbana-Champaign Chris Prom Cara Bertram Denise Rayman.
A National Library for Australian Educational Research Sue Clarke Manager, Cunningham Library Australian Council for Educational Research 27 th IATUL Annual.
The University of Texas at Austin If We Build it, Will They Come? Providing Enhanced Access to an Archive-It Collection LAGDA - Latin American Government.
The Boston TV News Digital Library: Partners WGBH Media Library and Archives (WGBH) Northeast Historic Film (NHF) Boston Public Library (BPL)
Government Documents Made Easy? Or Just Easier?. 90% since % since 2000 Retrospective conversion projects Retrospective conversion projects Search.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Increasing Student Engagement and Motivation with Web 2.0 Tools Presenters: Karla V. Kingsley, Ph.D. John A. Unger, Ph.D. UNM Success in the Classroom.
Margret Plank 17th International Conference on Grey Literature 1st and 2nd December 2015, Amsterdam (Netherlands) Move beyond text – How TIB manages the.
Leveraging the Expertise of our Staff and the Information Resources We Manage MIT Libraries Visiting Committee April 13, 2005.
The New Now: Institutional Repositories and Academia Institutional Repository USM April 17, 2015 Marilyn Billings Scholarly Communication Librarian.
Digital Library of the Caribbean (dLOC) & Digital Humanities LEAH R. ROSENBERG LAURIE N.
Use cases for BnF broad crawls Annick Lorthios. 2 Step by step, the first in-house broad crawl The 2010 broad crawl has been performed in-house at the.
Latin American Digital Initiatives: Building a Post-Custodial Digital Repository in Islandora Melanie Cofield, Brandon Cornell, Jon Gibson, Jose Gonzalez.
Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.
Richard Marciano Professor, University of Maryland iSchool Affiliate Professor, Computer Science Director, Digital Curation Innovation Center (DCIC) University.
Data mining in web applications
Archiving & Preserving Digital Content
Emphasize “scholarly” and “universities” to distinguish TDL from other efforts. A digital infrastructure for the scholarly activities of Texas universities.
John Cox, University Librarian, National University of Ireland Galway
Latin American Government Documents Archive, LAGDA
Metadata to fit your needs... How much is too much?
MSC photo:  It was taken some time in the late 1930s, but we don’t have an exact date.  The college was known as MSC from 1925 until 1955 when we became.
Panel on Web Archiving Government Information: LAC’s Program Update
Web archives as a research subject
Presentation transcript:

Latin American and Human Rights Web Archiving as part of Research Library Special Collections Kent Norsworthy LLILAS Benson Digital Curation Coordinator, LLILAS & University of Texas Libraries Archive-It Partner Meeting December 3, 2012

Lagda & Hrdi

presidencia.gob.hn Before the Coup... Honduras before the coup

presidencia.gob.hn... After the Coup Honduras after the coup

Paraguay before the coup presidencia.gov.py... Before the Coup

Paraguay after the coup presidencia.gov.py... After the Coup

Sierra Leone Truth Commission

Human Rights Alert, Manipur

LAGDA Snapshot LAGDA = 280 seeds, about 15 gov ministries per each of 18 countries crawled quarterly since 2005 Files crawled and archived to date in LAGDA75 million Data archived6.3TB HTML pages archived per crawl1.6 million PDF documents archived per crawl 260,000 Monthly average pageviews on LAGDA 2,918 LAGDA snapshot

LAGDA local collaboration Area Studies: LLILAS = 140 UT Latinamericanist faculty Benson Latin American Collection UT Libraries: Preservation, Metadata, Systems Texas Advanced Computing Center, TACC: Data Mining LAGDA local collaboration

HRDI & LAGDA external collaboration Internet Archive Latin American Studies Association, LASA Society of American Archivists, SAA Center for Research Libraries, CRL Seminar on the Acquisition of Latin American Library Materials, SALALM LAGDA external collaboration

HRDI external partners Genocide Archive of Rwanda Free Burma Rangers Museo de la Palabra y la Imagen, El Salvador WITNESS Texas After Violence Project HRDI external partners

Speeches Full text Audio Video Official Statistics Census Surveys Economic data Reports Regular Annual Reports State of the Union Sector Reports What we capture Born Digital (Website as a digital object) (Social Media) Latin American government documents

Ephemeral nature of HRDI & LAGDA target sites “Coups and earthquakes” Ordinary governmental succession Lack of preservation mandate or infrastructure Ongoing changes in policy and ideology HRDI: at-risk nature of the organizations themselves HRDI: Government censorship and repression Ephemeral nature of target sites

LAGDA users and how they use it Professors: research in political science Students: classroom assignments & research projects Comparative and cross-national research Original owners/publishers of the content Safeguarding national patrimony LAGDA users and how they use it

HRDI users and how they use it Broadly: Memorialization, scholarship, & justice Human rights activists & advocacy organizations Legal community University scholars and students Survivors of human rights violations & family members Safeguarding national patrimony HRDI users and how they use it

Topic modeling for improved search and discovery When traditional search and browse isn’t enough Enables users to interrogate the archive with new types of research questions Can be applied to any type of large-scale collection of digital objects Topic model construction combines machine learning and algorithms with validation by subject matter experts Using tag clouds for now, but could move up to more advanced data visualization techniques Topic modeling

Topic modeling user interface step one

Topic modeling user interface step two

Topic modeling user interface step three

Web archiving at UT Austin: Mainstreaming Special Collections Transformation of special collections Redefining the role and identity of the research library More integrally connect the library to teaching, scholarship, and to 21 st century learners Explicit recognition of Human Rights and Latin American Studies as priorities Mainstreaming special collections 1

Web archiving at UT Austin: Mainstreaming Special Collections Focus on born digital and non-custodial archiving New forms of digital scholarship emerge as technology and content meet in special collections Big data increasingly present in humanities/social sciences Web archiving, non-custodial archiving, large text corpora Web archives as primary source material Mainstreaming special collections 2

Contact Google: LAGDAGoogle: HRDI Kent Norsworthy Digital Curation Coordinator T-Kay Sangwand Human Rights Archivist Contact