The ECHO DEPository Project A project of the University of Illinois at Urbana-Champaign and OCLC in partnership with the Library of Congress ALA Annual.

Presentation transcript:

The ECHO DEPository Project
A project of the University of Illinois at Urbana-Champaign and OCLC in partnership with the Library of Congress
ALA Annual, Chicago, June 2005
Taylor Surface, OCLC

ECHO DEPository UIUC / OCLC
The digital preservation problem
Information is being produced in greater quantities and with greater frequency than at any time in history.
–How will society preserve this information and make it available to future generations?
–How will libraries and other repositories classify this information so that their patrons can find it with the same ease that they can locate a book on a shelf?
The ease with which electronic information can be created and "published" makes much of what is available today, gone tomorrow. Thus there is an urgent need to preserve this information before it is forever lost.
[Library of Congress (

About NDIIPP
The National Digital Information Infrastructure and Preservation Program (NDIIPP) is a $99.8M national digital strategy effort led by the Library of Congress.
Its mission: Develop a national strategy to collect, archive and preserve the burgeoning amounts of digital content, especially materials that are created only in digital formats, for current and future generations.

Library of Congress NDIIPP Program
Building Digital Preservation Infrastructure
Partnerships
Policy
Standards
Technical components

NDIIPP key areas of interest
Digital Preservation …
Practical applications and models
National technical architecture
Basic research

ECHO DEPository – Overview
Design selection methodology
Develop software implementing theory
–Machine-assisted
–Open source
Evaluate various repositories
–Using content gathered from tools
–Other content providers
Study semantic preservation techniques

Three objectives
Comparative test of repositories with various digital collections
Development of Web Archives Workbench
Investigations of semantic digital preservation and alternate applications of workbench tools

Project Partners
University of Illinois, Urbana-Champaign
–Libraries, GSLIS, NCSA, WILL, DMI
OCLC
State Libraries of Arizona, Connecticut, Illinois, North Carolina and Wisconsin
Tufts – Perseus Project
Michigan State – Sounds Archives
Library of Congress, NDIIPP Program
–$3 million funding over 3 years

ECHO DEPository Project: Comparative repository testing
[Diagram. Labels: Universe of Content (digitized texts, G.I.S., photos, video, audio, admin data); Tools from this project; Service Provider; Repository (SRB, Greenstone, Fedora, DSpace, Digital Archive); NCSA, UIUC, OCLC]

ECHO DEPository Project: W.A.W. development
[Diagram. Labels: Universe of Content (state pubs, digitized texts, G.I.S., photos, video, audio, admin data); Tools from this project (Web Archives Workbench, "Arizona model"); Service Provider; Repository (SRB, Greenstone, Fedora, DSpace, Digital Archive); NCSA, UIUC, OCLC]

The Arizona Model
Web domains as “archival collections”
Creates efficiencies for …
–Selection of “documents”
–Name authority & other metadata
–Browseable access

Arizona Model: a new approach
Assumptions
–Content creators won’t help
–Item by item selection is unsatisfactory
–Bulk harvesting is unsatisfactory
An archival approach
–Identifying groups of similar material (series)
–Automatic identification of new series items
–Series description (item level description is possible if warranted)
–Ingest of documents into an archive
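The "series" idea above can be illustrated with a minimal sketch. It assumes, purely for illustration, that documents sharing a host and directory path form one series; the slides do not say how the project actually identifies series. The function names and the `state.example` URLs are hypothetical, and URLs are assumed to be absolute.

```python
import posixpath
from collections import defaultdict
from urllib.parse import urlparse


def series_key(url):
    """Hypothetical series identifier: host name plus directory path."""
    parts = urlparse(url)
    return (parts.hostname or "") + posixpath.dirname(parts.path)


def group_into_series(urls):
    """Group document URLs into candidate series by shared directory."""
    series = defaultdict(list)
    for url in urls:
        series[series_key(url)].append(url)
    return dict(series)


def new_series_items(known_urls, harvested_urls):
    """Return harvested URLs that are new but belong to an already known series."""
    known = set(known_urls)
    known_series = {series_key(u) for u in known}
    fresh = set(harvested_urls) - known
    return sorted(u for u in fresh if series_key(u) in known_series)


known = [
    "http://www.state.example/reports/annual/2003.pdf",
    "http://www.state.example/reports/annual/2004.pdf",
]
harvested = known + [
    "http://www.state.example/reports/annual/2005.pdf",
    "http://www.state.example/press/2005-06-01.html",
]
groups = group_into_series(harvested)
additions = new_series_items(known, harvested)
```

Here the 2005 annual report is flagged as a new item in an existing series, while the press release starts a directory that a curator has not yet described.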

Web Archives Workbench
[Architecture diagram. Labels: Linux, Apache, Tomcat, Cloudscape DB, Heritrix Harvester, Discovery Tool, Properties Tool, Analysis Tool, Packager Tool]

Web Archives Workbench (WAW)
Tools for curators …
Discovery – identify & manage domains
Properties – associate metadata, content, and providers
Analysis – select content from structure
Packager – package content & metadata

WAW - Discovery Tool
Currently available (May 2005)
Helps curators identify domains that are within their collecting scope
Crawls web sites and extracts domains of possible interest from content
Maintains lists of domains
Monitors selected domains for changes
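As a rough illustration of "extracts domains of possible interest from content", the sketch below pulls host names out of the links on one page and filters them against a curator-supplied scope list. It is not the Discovery Tool's code; the suffix-matching scope rule, the function names, and the example hosts are all assumptions made for this example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_domains(html, base_url):
    """Return the sorted host names linked from one page."""
    collector = LinkCollector()
    collector.feed(html)
    domains = set()
    for href in collector.hrefs:
        # Resolve relative links; non-web links such as mailto: have no host.
        host = urlparse(urljoin(base_url, href)).hostname
        if host:
            domains.add(host.lower())
    return sorted(domains)


def in_scope(domain, suffixes):
    """Hypothetical scope rule: the domain equals or ends with a listed suffix."""
    return any(domain == s or domain.endswith("." + s) for s in suffixes)


page = (
    '<a href="/about">About</a> '
    '<a href="http://leg.state.example/bills">Bills</a> '
    '<a href="mailto:info@state.example">Mail</a> '
    '<a href="http://other.example.com/">Other</a>'
)
found = extract_domains(page, "http://www.state.example/index.html")
candidates = [d for d in found if in_scope(d, ["state.example"])]
```

A real crawler would also fetch pages, respect robots.txt, and revisit domains to monitor changes; none of that is shown here.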

WAW - Properties Tool
Currently available (May 2005)
Relates content providers to web sites
Organizes a ‘group’ of web sites hierarchically
Associates metadata to content providers and, later, to selected content
Metadata can be subject headings, preferred names, aliases, etc.

WAW - Analysis Tool
Available January 2006
Content selection at varying levels of granularity
–Harvests an entire site or one document
Scheduled harvesting of content
Shows site structure
Understands serials
Content is automatically associated to content provider’s metadata

WAW - Packager Tool
Available January 2006
Combines descriptive metadata about content creator, series, and object
Creates administrative and preservation metadata
Packages web content and metadata into an XML standard package (METS)
Neutral format for ingest into OCLC archive and other repositories
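To make the METS packaging step concrete, here is a minimal sketch that wraps one harvested document and a little descriptive metadata in a METS envelope. It is an illustration under stated assumptions, not the Packager Tool's actual output or METS profile: the choice of Dublin Core for description, the SHA-256 checksum standing in for preservation metadata, the `build_package` function, and the sample values are all hypothetical.

```python
import hashlib
import xml.etree.ElementTree as ET

METS = "http://www.loc.gov/METS/"
XLINK = "http://www.w3.org/1999/xlink"
DC = "http://purl.org/dc/elements/1.1/"
for prefix, uri in (("mets", METS), ("xlink", XLINK), ("dc", DC)):
    ET.register_namespace(prefix, uri)


def build_package(title, creator, url, content, mime_type):
    """Return a small METS document describing one harvested file."""
    root = ET.Element(f"{{{METS}}}mets")

    # Descriptive metadata section, here as embedded Dublin Core.
    dmd = ET.SubElement(root, f"{{{METS}}}dmdSec", ID="DMD1")
    wrap = ET.SubElement(dmd, f"{{{METS}}}mdWrap", MDTYPE="DC")
    data = ET.SubElement(wrap, f"{{{METS}}}xmlData")
    ET.SubElement(data, f"{{{DC}}}title").text = title
    ET.SubElement(data, f"{{{DC}}}creator").text = creator

    # File section: a fixity checksum plus a pointer to the source URL.
    file_sec = ET.SubElement(root, f"{{{METS}}}fileSec")
    group = ET.SubElement(file_sec, f"{{{METS}}}fileGrp", USE="harvested")
    file_el = ET.SubElement(
        group,
        f"{{{METS}}}file",
        ID="FILE1",
        MIMETYPE=mime_type,
        CHECKSUM=hashlib.sha256(content).hexdigest(),
        CHECKSUMTYPE="SHA-256",
    )
    ET.SubElement(
        file_el,
        f"{{{METS}}}FLocat",
        {"LOCTYPE": "URL", f"{{{XLINK}}}href": url},
    )

    # Structural map tying the file to its descriptive metadata.
    struct = ET.SubElement(root, f"{{{METS}}}structMap")
    div = ET.SubElement(struct, f"{{{METS}}}div", DMDID="DMD1")
    ET.SubElement(div, f"{{{METS}}}fptr", FILEID="FILE1")
    return ET.tostring(root, encoding="unicode")


xml_text = build_package(
    "Annual Report",
    "Example State Agency",
    "http://www.state.example/report.pdf",
    b"%PDF-1.4 sample",
    "application/pdf",
)
```

A fuller package would carry administrative metadata in an amdSec and could embed the content itself rather than pointing to it; both are omitted to keep the sketch short.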

ECHO DEPository Project: Digital preservation investigation
[Diagram. Labels: Universe of Content (state pubs, digitized texts, G.I.S., photos, video, audio, admin data); Tools from this project (Web Archives Workbench); Service Provider; Repository (SRB, DSpace, Fedora, Greenstone, Digital Archive); NCSA, UIUC, OCLC]

The ECHO DEPository Project
A project of the University of Illinois at Urbana-Champaign and OCLC in partnership with the Library of Congress
ECHO DEPository project web site:
NDIIPP web site:
Me: Taylor Surface, OCLC