Free for all : opening collections & supporting multi-institutional efforts w/ Internet Archive Patrick R. Wallace, digital projects & archives librarian,

Presentation transcript:

Free for all : opening collections & supporting multi-institutional efforts w/ Internet Archive Patrick R. Wallace, digital projects & archives librarian, Middlebury College Special Collections.

the basics. Archive.org Founded 1996. 501(c)(3) non-profit. San Francisco. 150 billion+ Web pages. Millions of objects. Cool objects. We’re mainly talking about objects today.

the (very) good It’s free. Really free. The stuff in it is free. Long history, no signs of going away. Dedication to providing public access to knowledge. Dedicated to preserving a historical record, especially re: everyday life and digital culture. Transcoding, streaming, OCR, storage -- for free!

the (maybe) bad Tendency to act in an un-librarylike fashion. 1,300 Public Domain dictionaries, still don’t know the word “deaccession”. Bucket system. No quality assurance. No access control. Once free, always free (kind of).

the (really pretty) ugly Messy collections. Arbitrary metadata. Lots and lots of junk. Limited UI, hard to find materials. Serious collection management means Linux, command lines, and scripting. Key management tasks require IA staff/admin intervention.

a serious tool for serious libraries We’re in this together. Shared professional ethic and ethical praxis. “Universal access to all knowledge”. Lots of users. API. Weaknesses are also strengths. Migration is not really that bad.

extra special collections Use case #1: Midd Special Collections & Archives Sharing everything we may, because we can. 5,000+ original items added since Jan 2016. Collaboration to automate DLA uploads. Dramatic increase in item views over CONTENTdm. Loss of some metadata. “Much kludge, very wow.”

no budget? no problem. Use case #2: one big union. Green Mountain Digital Archive (DPLA @ VT). Bringing the smallest institutions on board. Scheduled metadata scraping. Dependent on training, style guide compliance, normalization. Community involvement. When all you have is a hammer, at least you have a hammer.
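As a rough illustration of what "scheduled metadata scraping" plus normalization might look like, here is a minimal sketch. It is not the Green Mountain Digital Archive's actual workflow: the collection name, the choice of fields, and the normalization rules (semicolon-separated subjects, case-insensitive de-duplication, dates trimmed to YYYY-MM-DD) are all assumptions made for the example. It builds a query URL for archive.org's advancedsearch endpoint and cleans up one returned record; fetching the URL and scheduling the job (cron, for instance) are left out.

```python
from urllib.parse import urlencode

SEARCH_URL = "https://archive.org/advancedsearch.php"


def build_search_url(collection, fields=("identifier", "title", "date", "subject"),
                     rows=100, page=1):
    """Build an advancedsearch URL for one collection (no request is made here)."""
    params = [("q", f"collection:{collection}")]
    params += [("fl[]", field) for field in fields]
    params += [("rows", rows), ("page", page), ("output", "json")]
    return f"{SEARCH_URL}?{urlencode(params)}"


def as_list(value):
    """Coerce a scraped value (missing, a string, or a list) into a list of clean strings."""
    if value is None:
        return []
    if isinstance(value, str):
        # Assumption: contributors separate multiple subjects with semicolons.
        value = value.split(";")
    return [v.strip() for v in value if v and v.strip()]


def normalize_record(doc):
    """Apply a few hypothetical style-guide rules to one scraped record."""
    title = doc.get("title") or ""
    if isinstance(title, list):
        title = title[0] if title else ""
    subjects, seen = [], set()
    for subject in as_list(doc.get("subject")):
        key = subject.lower()
        if key not in seen:  # drop case-insensitive duplicates, keep first spelling
            seen.add(key)
            subjects.append(subject)
    return {
        "identifier": doc["identifier"],
        "title": " ".join(title.split()),          # collapse stray whitespace
        "date": str(doc.get("date") or "")[:10],   # keep only YYYY-MM-DD
        "subjects": subjects,
    }
```

The point of the sketch is the dependency the slide names: the less consistent the contributed metadata, the more of these rules someone has to write and maintain.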

hack it, work it Web interface. internetarchive Python library. Amazon S3 API. Standalone CLI tool. Lots of custom scripts. Don’t be afraid. Backlogs are good practice.
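For a sense of how small a "custom script" can be, here is a hedged sketch of a single-file upload using the internetarchive Python library's `upload` function. The identifier prefix, the collection name, and the helper names (`make_identifier`, `build_metadata`, `upload_file`) are hypothetical and not taken from the talk. It assumes the library is installed and that credentials have already been set up with `ia configure`.

```python
import re
from pathlib import Path


def make_identifier(prefix, filename):
    """Derive an item identifier from a filename using only letters, digits, '.', '_' and '-'."""
    stem = Path(filename).stem
    cleaned = re.sub(r"[^A-Za-z0-9._-]+", "_", stem).strip("_")
    return f"{prefix}_{cleaned}"


def build_metadata(title, collection, mediatype="texts", **optional):
    """Assemble an item metadata dict, leaving out any optional field that is empty."""
    metadata = {"title": title, "collection": collection, "mediatype": mediatype}
    metadata.update({key: value for key, value in optional.items() if value})
    return metadata


def upload_file(path, prefix, metadata):
    """Upload one file as a new item; returns the identifier and the HTTP status codes."""
    # Imported here so the helpers above can be used without the package installed.
    from internetarchive import upload

    identifier = make_identifier(prefix, path)
    responses = upload(identifier, files=[str(path)], metadata=metadata)
    return identifier, [response.status_code for response in responses]
```

Calling something like `upload_file("scans/Town Report 1938.pdf", "midd", build_metadata("Town Report 1938", "example_collection", date="1938"))` would create an item named `midd_Town_Report_1938`. Looping that over a folder is most of what working through a backlog amounts to.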

overloaded? happy to help. [me] pwallace@middlebury.edu sites.middlebury.edu/archivistslab [not_me] internetarchive.readthedocs.io archive.org/help/abouts3.txt

fin.