Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.

Slides:



Advertisements
Similar presentations
This Library Never Forgets Preservation, Cooperation, and the Making of HathiTrust Digital Library Jeremy York Project Librarian HathiTrust Digital Library.
Advertisements

Building the Universal Library: The Promise and Challenges of HathiTrust John Wilkin 2 April 2009.
Can We Talk? MICHAEL Conference London May 23, 2008Joyce Ray.
HathiTrust and the Ecology of Shared Collections Paul N. Courant 21 May 2009.
HATHITRUST A Shared Digital Repository We’re Preserving the Past, What About the Present? NISO Webinar: Ensuring the Preservation of E-Books May 23, 2012.
Opening Up Worldwide Access to Key BC Historical Documents: BC Historical Newspapers Mike Conroy, Community Digital Projects Analyst UBC Library.
HATHITRUST A Shared Digital Repository HathiTrust current work, challenges, and opportunities for public libraries Creating a Blueprint for a National.
1 What is the Internet Archive We are a Digital Library Mission Statement: Universal access to human knowledge Founded in 1996 by Brewster Kahle in San.
Digital & Preservation Resources Managing the digital collection life cycle.
Digital Preservation and the Open Web: A Curatorial Perspective Terence K. Huwe Institute of Industrial Relations University of California, Berkeley Computers.
UC Shared Images Delivering essential image collections UC-wide Berkeley Davis Irvine Los Angeles Merced Riverside San Diego Santa Barbara Santa Cruz CDL.
HATHITRUST A Shared Digital Repository HathiTrust: A Second Life for Library Collections Jeremy York Exploring Humanities Cyberinfrastructure April 30,
Latin American and Human Rights Web Archiving as part of Research Library Special Collections Kent Norsworthy LLILAS Benson Digital Curation Coordinator,
HATHITRUST A Shared Digital Repository A Preservation Infrastructure Built to Last: Preservation, Community, and HathiTrust UNESCO Memory of the World.
University Archives University Archives & Archive-It WebCom
Newspapers as Historical Sources. Historians value Provide current information Show how events in the past were reported at the time the events occurred.
Southern California Students! This day is dedicated to you!
Web archiving at the NLA ‘ Archiving the music web’ Music Council of Australia Annual Assembly 28 September 2009 Paul Koerbin Manager Digital Archiving.
NOBLE Digital Library. How does it work? The NOBLE Digital Library uses the DSpace platform. Image files and metadata are imported into DSpace using.
UC’s Systemwide Library Planning Some background & current information.
Araba Dawson-Andoh 122 A Alden Library
CRL Global Resources Network News Preservation & Access July 13, 2011 James Simon Director, Global Resources Network.
University of California Applications and Thought-Starters C L A S S R O O M IN-BETWEEN SPACES HOUSING & DINING LIBRARY OFFICES MEDICAL CENTERS CLICK TO.
Affordable Care Act (ACA) Individual Mandate: IRS Filing Requirements July 2015.
Joanne Archer University of Maryland Kate Odell Archive-It Abbie Grotke Library of Congress Tessa Fallon Columbia University Creating and Maintaining Web.
Web Archives, IDEAL, and PBL Overview Edward A. Fox Digital Library Research Laboratory Dept. of Computer Science Virginia Tech Blacksburg, VA, USA 21.
HATHITRUST A Shared Digital Repository HathiTrust: Putting Research in Context HTRC UnCamp September 10, 2012 John Wilkin, Executive Director, HathiTrust.
Web Capture team Office of strategic initiatives February 27, 2006 Selecting Content from the Web: Challenges and Experiences of the Library of Congress.
The Web Archiving Service Tracy Seneca California Digital Library California Digital LibraryNew York UniversityUniversity of North Texas National Digital.
Research Data Management Services Katherine McNeill Social Sciences Librarians Boot Camp June 1, 2012.
IIPC GA Curator Tools Fair May 2014 WEB CURATOR TOOL Nicola Bingham Web Archivist.
The web has revolutionized our access to information. Documents and publications that were once difficult to fin are now readily available to anyone. Government.
Web Archiving Challenges: Collaborative Collection Building.
1 Archive-It: Archiving and Preserving Born Digital Content NDIIPP June 2009 Molly Bragg Partner Specialist Internet Archive.
Was.cdlib.org California Digital Library University of California Rosalie Lack
Next Generation Technical Services Rethinking Library Technical Services for the University of California R Bruce Miller.
Phases of Policy Development Joshua Adams, Cornell University Nancy Capell, University of California Patrice DeCoster, SUNY Empire State College ACUPA.
Digital Special Collections Users Council Annual Meeting May 9, 2008.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
University of California Mass Digitization Projects Update Users Council Annual Meeting May 8, 2008 Heather Christenson, Mass Digitization Project Mgr,
The University of California Role of the Controller Don Larson, Controller UC San Diego November, 2011.
HATHITRUST A Shared Digital Repository HathiTrust and TRAC DigitalPreservation 2012 July 25, 2012 Jeremy York, Project Librarian, HathiTrust.
The Library of Congress Martha Anderson Program Officer, NDIIPP Office of Strategic Initiatives Library of Congress April 2005 LC Perspective : Preservation.
Michael Witt Interdisciplinary Research Librarian & Assistant Professor Purdue Libraries & Distributed Data Curation Center (D2C2) Eliciting.
Imaging Pittsburgh: Creating a Shared Gateway to Digital Image Collections of the Pittsburgh Region IMLS 2002 National Leadership Grant Library & Museum.
HATHITRUST A Shared Digital Repository HathiTrust and the Future of Research Libraries American Antiquarian Society March 31, 2012 Jeremy York, Project.
Suggested Placement of WCL Boxes/Links The following screens are meant to illustrate where WCL search boxes currently reside on operational library pages,
Laura Alagna University of Chicago Library SAA Metadata and Digital Object Roundtable Annual Meeting Washington, DC 13 August 2014.
The Web-at-Risk NDIIPP Sponsored Project Partners include: California Digital Library – project lead University of North Texas New York University California.
Supporting Scholarship in the Digital Age Carl G. Stahmer Director of Digital Scholarship cstahmer UC Davis Library Town Hall Meeting.
Web Archiving Service Public Access Release Date: July
Encouraging An Informed Citizenry: Locating and Using Congressional Research Service Reports Starr Hoffman Librarian for Digital Collections University.
Preservation Program Digital Preservation Program Digital Preservation Services: Extending tools to meet campus needs Patricia Cruse, Director, Digital.
HATHITRUST A Shared Digital Repository Institution Uses of HathiTrust Jeremy York University of Maine May 24, 2013.
Mr. P’s Class Term Paper All the Steps on the Path to an “A” Term Paper in World History.
The Web Archiving Service Spring 2009 Update User’s Council Annual Meeting Tracy Seneca California Digital Library Capture Today’s Web;
George E. Brown, Jr. Network for Earthquake Engineering Simulation 4 th regular meeting of the NEES preservation advisory committee Stanislav Pejša
HathiTrust: Collaboration in Building the Universal Collection John Wilkin 1 October 2009.
HATHITRUST A Shared Digital Repository HathiTrust Large Digital Libraries: Beyond Google Books Modern Language Association January 5, 2012 Jeremy York,
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
An Introduction to EZID University of California Curation Center Team California Digital Library August, 2011 UC3 Summer Webinar Series.
HathiTrust: A valuable and visionary Partnership.
Libraries Digital Program Updates, 12/8/2006 Stephen P. Davis Director, Libraries Digital Program Division.
OPEN ACCESS INITIATIVE at U.S. Department of Transportation LIBRARY PERSPECTIVE 2014 TRB Annual Meeting January 16, 2014 Lisa D. Zilinski Data Services.
Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.
Archiving & Preserving Digital Content
Digital Collections Update
Research Tools: Primary and Secondary Sources
Wisconsin County and Municipal Government Collections in Archive-It
MSC photo:  It was taken some time in the late 1930s, but we don’t have an exact date.  The college was known as MSC from 1925 until 1955 when we became.
Presentation transcript:

Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop

Imagine a world …

This is our world …

WAS … is A service of the UC Curation Center to collect, manage, preserve and publish websites and documents.

WAS Snapshot 53 public archives 120+ archives total 7,500+ sites 50+ TB 23 institutions

WAS Institutions Institute of Governmental Studies Library, UCB UC Berkeley Office of Public Affairs UC Berkeley Libraries UC Davis Libraries UC Irvine Libraries UC Los Angeles Libraries UC Riverside Libraries UC San Diego Libraries UC San Francisco Libraries UC Santa Barbara UC Santa Cruz McHenry Library Emory University Library Institute for Research on Labor and Employment New York University Northwestern University Library Purdue University Stanford University Libraries Temple University University of Arkansas Libraries University of Illinois at Urbana Champaign Libraries University of Michigan, Bentley Historical Library USDA Economic Research Service Water Resources Collections and Archives

WAS Overview A) Curator Tools

Curator Workflow

1. Create Site Enter site name, URL and description Scope Capture frequency Robots.txt

2. Capture Sites

3. View Captures View captures QA Compare

4. Public Access Customize the archive Write description Create custom banner and icon

WAS Overview B) Public Archives

Web Archive ‘home page’

Browse: Site List + Tags

Search: All Sites in an Archive

Integration with your Systems

How are people using WAS?

Institution’s website Preserve intuitional history Capture university news and events

Geographically focused

Topical Support special research collections

Event Sudden action required May need many selectors Start date / end date

Researcher’s Perspective Building collections for research – Study the topic / event – Study site change or web-based communication – Websites are datasets for analysis and data mining Preservation of research – Archive grant-funded websites – Selected sites Create stable citations for publications

Get started! Each library has WAS administrator(s) Unlimited number of curators per account What’s the cost? – UC does not pay a service fee – Storage only: $1040/per TB (average site is $1.46/annually); storage costs to go down

Challenges Shared collection development Metadata issues Workflow and cost models for faculty projects Time! Limitations of web crawlers Websites are messy

Contact me! Rosalie Lack WAS Service Manager

Imagine a world … “Imagine a world in which libraries and archives had never existed. No institutions had ever systematically collected or preserved our collective cultural past: every book, letter, or document was created, read and then immediately thrown away. What would we know about our past?’’

This is our world … “Yet, that is precisely what is happening with the web: more and more of our daily lives occur within the digital world, yet more than two decades after the birth of the modern web, the “libraries” and “archives” of this world are still just being formed.” A Vision Of The Role And Future Of Web Archives Kalev H. Leetaru, Graduate School of Library and Information Science, University of Illinois. Presented as the keynote address at the 2012 IIPC General Assembly in Washington, DC.