NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to.

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

LIFE 3 LIFE 3 : Predicting Long Term Preservation Costs Brian Hole LIFE 3 Project Manager The British Library IFLA conference 27/02/10.
Setting Up Information Portal Irwan Sampurna C-CONTENT 23 May 2006.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
Digital Collections: Storage and Access Jon Dunn Assistant Director for Technology IU Digital Library Program
The future’s so bright…. DAITSS DIGITAL PRESERVATION SYSTEM: RE-ARCHITECTED, RE- WRITTEN, AND OPEN SOURCE Priscilla Caplan Florida Center for Library Automation.
HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise.
Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec 2014.
Institutional Repositories It’s not Just the Technology New England Archivists Boston College March 11, 2006 Eliot Wilczek University Records Manager Tufts.
National Geospatial Digital Archive Greg Janée. Greg Janée May 31, Outline Two preservation misadventures Digital preservation problems Genesis.
Long-term Preservation as a Relay Greg Janée University of California at Santa Barbara.
PREMIS in Thought: Data Center for LC Digital Holdings Ardys Kozbial, Arwen Hutt, David Minor February 11, 2008.
Fedora 3.0 and METS: A Partnership for the Organization, Presentation and Preservation of Digital Objects Open Repositories Georgia Tech, Atlanta,
COMPANY CONFIDENTIAL COPYRIGHT 2004 MASSTECH GROUP INC. KCTS Case Study,
Common Use Cases for Preservation Metadata Deborah Woodyard-Robinson Digital Preservation Consultant Long-term Repositories:
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
NDIIPP and NGDA National Preservation Network For Digital Content.
Promoting Digital Preservation Partnerships at the U.S. Library of Congress April 2004.
Digital Library Architecture and Technology
Electronic Mail List Preservation Takes Off: The H-Net Archive Lisa M. Schmidt MATRIX: The Center.
LIFE 3 LIFE3: Predicting Long Term Preservation Costs Paul Wheatley Digital Preservation Manager The British Library.
LIFE 3 LIFE 3 : Predicting Long Term Preservation Costs Brian Hole LIFE 3 Project Manager The British Library KeepIt training course 05/02/10.
Catherine Masi, National Geospatial Digital Archive May 16, 2005 NGDA Format Registry  Why do we need a FR? We are designing with long-term storage in.
National Partnership for Advanced Computational Infrastructure Digital Library Architecture Reagan Moore Chaitan Baru Amarnath Gupta George Kremenek Bertram.
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Cloud Task Replica Repository Preservation Tools Open Repositories Atlanta Richard Rodgers MIT Libraries.
The ECHO DEPository Project A project of the University of Illinois at Urbana-Champaign and OCLC in partnership with the Library of Congress ALA Annual.
UPDATE ON PARTNER VISITS January 18, /18/2013APTrust Update1.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Digital Preservation: Lessons learned through national action Digital Preservation Interoperability Framework Workshop April 2010.
ESRI User Conference, August 8, 2006 Long-term archiving of geospatial data: the NGDA project Julie Sweetkind-Singer John Banning Stanford University.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
National Digital Information Infrastructure and Preservation Program (NDIIPP) CNI Project Briefing December 5, 2005.
SDMX Web Services the JSON version Sami Airo & Gerard Salou.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
AIP Backup & Restore Sunita Barve NCRA, Pune. AIP The latest version of DSpace 1.7.0, supports backup and restore of all its contents as a set of AIP.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Use & Access 26 March Use “Proof of Concept” Model for General Libraries & IS faculty Model for General Libraries & IS faculty Test bed for DSpace.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
Digital Preservation MetaArchive Cooperative.  9:00-9:45 - Session 1: Digital Preservation Overview  9:45-11:00 - Session 2: Policy & Planning Overview.
National Geospatial Digital Archive Greg Janée University of California at Santa Barbara.
National Geospatial Digital Archive Greg Janée University of California at Santa Barbara.
Greg Janée topics Fedora NGDA project activities Two study ideas MODIS Preservation as series-of-handoffs.
ETD2006 Preserving ETDs With D.A.I.T.S.S. FLORIDA CENTER FOR LIBRARY AUTOMATION FC LA PAPER AUTHORS: Chuck Thomas Priscilla.
Institute Repositories and Digital Preservation : Assessing Current Practices at Research Library Rathachai Chawuthai Information.
What is NDIIPP doing?. July 7 th, Web-At-Risk is opening its archives for public access, having captured nearly 6 TB of data—the entire CA State Government.
DRS 2 Project (2008 – Present!) Andrea Goethals, Harvard Library Digital Preservation Management Workshop, MIT June 13, 2013.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
National Geospatial Digital Archive Greg Janée UC Santa Barbara.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
Preserving Electronic Mailing Lists as Scholarly Resources: The H-Net Archives Lisa M. Schmidt
Al Cornish, Systems Librarian Washington State University Libraries Preserving Access to Multimedia Collections.
DAITSS and the Florida Digital Archive Priscilla Caplan Florida Center for Library Automation iPRES 2006.
OAIS (archive) Producer Management Consumer. Representation Information Data Object Information Object Interpreted using its Yields.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
OAIS (archive) OAIS (archive) Producer Management Consumer.
Joint Meeting of CSUL Committees,
Metadata Issues in Long-term Management of Data and Metadata
OAIS Producer (archive) Consumer Management
DAITSS and the Florida Digital Archive
An Overview of Data-PASS Shared Catalog
Digital Project Lifecycle Curating Across the Curriculum
Implementing an Institutional Repository: Part II
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Presentation transcript:

NGDA Architecture Update Greg Janée

Greg Janée May 16, Three motivations Archival has to be cheap & easy –little incentive –no funding Need to archive data semantics –key differentiator from text, audio, video Focus on long-term preservation –need to migrate whole systems

Greg Janée May 16, system databasestorage handle resolver database Typical repository architecture database handle resolver database fragile

Greg Janée May 16, NGDA architecture storage subsystem standard, public data model archival system ADLOAI bulk loader databases, caches, etc. Web access ingest

Greg Janée May 16, Post-NGDA architecture storage subsystem standard, public data model Web

Greg Janée May 16, Storage system requirements Req’s: –associate UUIDs/RIDs with bitstreams –retrieve global/local bitstream by UUID/RID –determine (parent) UUID of any bitstream –list all UUIDs Satisfied by: –any filesystem –tag URIs for UUIDs tag:library.ucsb.edu,2005:identifier

Greg Janée May 16, Archival objects directory UUID component RID UUID

Greg Janée May 16, Archival objects Directory info per component –named relationship/position –format & semantics by UUID references to definitions –fixity: checksum –provenance: isDerivative –policy: mutability –rights Components may be provided by archive itself

Greg Janée May 16, Example USGS DOQQ GeoTIFFFGDC Object x x.tiffx.fgdcx.gif metadata data derived TIFF subtypeOf

Greg Janée May 16, Archives Archive = set of archival objects –no structure –no free-floating bitstreams In anticipation of federation: –associations may cross archive boundaries –archival objects may not

Greg Janée May 16, Object types Content Format definition Semantic definition Provider Organizational structures –collection –series –ingest session

Greg Janée May 16, Archive-provider agreement Defines –common structure of objects to be ingested –necessary validations –associations to other objects –policies, rights, etc. Represents choke point –requires human evaluation

Greg Janée May 16, Deferred functionality Incremental ingest Object revisions Rights 3rd-party access Federation

Greg Janée May 16, Status Starting development now Approach: iterative refinement