University of Alberta Libraries: Bringing in the Harvest. Archive-It Partners Meeting, October 2011. Geoff Harder.


University of Alberta Libraries:
- 2nd largest academic and research collection in Canada
- #12 in ARL rankings
- Strong digital initiatives and digital collections program

Why web archiving?
- Recognized the gap
- Too much (western) Canadian online content disappearing. Not enough attention being paid by libraries and other memory institutions to the born digital content originating from new publishing streams.
- Historically important events happening all around us
- Social media/citizen journalism in particular shines a spotlight on the temporary nature of today’s “news”
- Strong digital collections and digital preservation program
- Digitization, repositories, ejournals, research data, DP leader in Canada – why would we ignore web content?
- Needed a catalyst for action…

Collection Rescue
- Non-profit foundation ceases to be
- Created 80+ websites with research and informational value
- Rescue operation required!
- Multitude of technologies and file formats used over the course of a decade of construction

Rescue 911: The Alberta Encyclopedia
- Projected to be an expensive, resource (i.e. time) intensive undertaking to stabilize websites, patch security holes, deal with hardware, etc.
- Issues around having 80+ new puppy dogs that would need ongoing care and attention
- Goal was to *preserve the material in the spirit it was created*
- Decision to take a web archiving approach to the project as opposed to a web hosting approach made the most sense

Rescue 911: The Alberta Encyclopedia
- Hired recently graduated SLIS student on short term contract to optimize the sites
- Control over the content meant we could test the crawling and modify the content as required – learned what Archive-It can and can’t do
- Addressed content hidden behind database searching by creating sitemaps and browsable indexes
- Converted multimedia files for improved playability
- Archive-It’s searching made up for loss of native search (in many cases resulting in improved searchability)
- **Recognized the value of Archive-It as a lightweight, economical tool for saving orphaned and at risk web content where we know a time bomb is ticking, e.g. International Polar Year; grant funded research project home pages
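
To illustrate the sitemap approach mentioned on this slide, here is a minimal sketch, not the project's actual tooling. It assumes the records hidden behind a search form can be enumerated by ID; the `example.org` URL pattern, the `record_ids` list, and the `build_sitemap` helper are all hypothetical.

```python
from xml.etree import ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemap XML string listing each URL once, in input order."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    seen = set()
    for url in urls:
        if url in seen:
            continue  # skip duplicates so the crawler sees each page once
        seen.add(url)
        loc = ET.SubElement(ET.SubElement(urlset, "url"), "loc")
        loc.text = url
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical record IDs that would normally be reachable only via a search form.
record_ids = [101, 102, 103]
sitemap_xml = build_sitemap(
    f"http://example.org/record?id={rid}" for rid in record_ids
)
print(sitemap_xml)
```

A crawler given this file as a seed can follow every `<loc>` link directly, so it never has to submit a search query to reach the records.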

Interesting sidebars:
- Discoverability through Google and other search engines – how best to redirect traffic and rankings to popular materials?
  - working with IA to expose HCF archived content to search engines
- Managing expectations – materials can’t be edited
  - This is not a “hosting” solution
  - It is archiving
    - extended archiving service to HCF partners looking to take, edit and grow *their* web material with another host

What else could we use Archive-It for?
- End the printing of hard copies of born digital materials for our collection
- Education curriculum collection: born digital material produced yearly by AB Learning, Provincial Gov’t – limited to no archiving
- Required someone to monitor sites; print out materials which would then be catalogued for in-person use
- Recognized opportunity to point and scope crawler to do regular harvesting – materials could still be catalogued (if warranted)
- Captures curriculum, supplements, related information, and context of the materials and their creation, i.e. can navigate the materials within the historical context of their publication

What more can we do?
- Preserve University of Alberta’s web presence
- No brainer – important to the University; many of the websites hadn’t been crawled by IA’s Wayback Machine in many years due to robots.txt restrictions and other issues.
- University Admin immediately interested in the project
- University’s web presence rounds out a view of the University as documented in the University Archives (for many units and departments, digital record keeping is still a work in progress)
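
robots.txt restrictions of the kind mentioned on this slide can be checked before scoping a crawl. The following is a minimal sketch using Python's standard `urllib.robotparser`; the robots.txt content, the `example-archive-bot` user agent, and the `example.edu` URLs are invented for illustration and do not reflect the University's actual configuration or Archive-It's crawler settings.

```python
from urllib.robotparser import RobotFileParser


def allowed_by_robots(robots_txt, user_agent, url):
    """Report whether the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt: one archiving crawler is shut out entirely,
# every other crawler is only kept out of /private/.
sample_robots = """User-agent: example-archive-bot
Disallow: /

User-agent: *
Disallow: /private/
"""

archiver_about = allowed_by_robots(
    sample_robots, "example-archive-bot", "http://example.edu/about"
)
other_about = allowed_by_robots(
    sample_robots, "other-bot", "http://example.edu/about"
)
other_private = allowed_by_robots(
    sample_robots, "other-bot", "http://example.edu/private/report"
)
print(archiver_about, other_about, other_private)
```

A site-wide `Disallow: /` aimed at an archiving crawler is enough to leave an entire site uncaptured for years, even while other crawlers continue to index it.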

Growing the collection
Collections and Liaison/Subject Librarians
- Begins with a collection policy and an expanded view of what constitutes a research collection
- Collections of strength
- Collections of interest
- Library of Record
- Format agnostic – base decisions around research value, not just around containers
- Identify the materials a collection ought to have. Step two is to find the tools and the resources to make it happen. Step three is about priorities.

The UAlberta Experience
- Growing interest from units and subject liaison librarians
- Enthusiasm tempered only by lack of time to do this work
- Emerging models:
  - Liaison(s) selector/curator of a collection
  - Other staff can assist in identifying materials, setting up the crawls, monitoring changes
  - Metadata and cataloguing support where appropriate
  - Interdisciplinary approach to current events – all hands on deck to help build time-sensitive collections
  - Possibility of partnering with the library school, or creating internship opportunities and so forth

Rights and permissions
Permissions and copyright
- IP issues remain a fuzzy area
- Public record – e.g. political party websites, vs. clearly copyrighted material
- Federal and provincial government materials
- Preserving context – not rebranding and repurposing content – finding that most people/organizations understand why we’re trying to do this work.
- Approaching content creators to be partners – Recognize the value of their material as important to the historical record. “We’re doing something that helps you.”

Metrics and discovery
Areas to explore
- Need a discovery strategy that works for web archives
- Archive gateway? Catalogue? Primo or other discovery layers?
- Are there other ways to integrate/indicate the archived versions of web content?
- Better metrics would help us understand how the archived materials are being used and discovered

Title image credit: Geoff Harder Digital Initiatives Coordinator, U of A Libraries