Preservation by Migration to XML Dirk Roorda. work on a preservation strategy positioning of the XML preservation strategy implementing the strategy in.

Slides:



Advertisements
Similar presentations
The REPOX system Nuno Freire -
Advertisements

Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
ECMA Open XML File Formats and the Evolution of Open File Formats Mark Lange Senior Policy Counsel Microsoft EMEA.
Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Developing a Generic Toolkit: Architecture and technology issues ALLC/ACH Conference.
New Services for Data Creators and Providers Louise Corti, Head ESDS Qualidata/ Outreach & Training Alasdair Crockett, ESDS Data Services Manager.
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
Metadata Management at GESIS-ZA Reiner Mauer GESIS – Data Archive and Data Analysis CESSDA-Expert Seminar Odense, September 11th 2008.
Oracle Rally Applications Modernization. 4 June About the Company Founded in 2002 Unites high-level information technology and organization architecture.
METS at UC Berkeley Part I: Generating METS Objects.
Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College.
Looking into the future… DDI workshop IASSIST 2006 Jim Jacobs.
1 Institutional Repository (IR) Models Rutgers University Community Repository (RUcore) A digital library perspective (objects and collections) Flexible.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
Future Access to the Scientific and Cultural Heritage – A shared Responsibility Birte Christensen-Dalsgaard State and University Library.
1 Using Scalable and Secure Web Technologies to Design Global Format Registry Muluwork Geremew, Sangchul Song and Joseph JaJa Institute for Advanced Computer.
WMS: Democratizing Data
© 2010 Microsoft Corporation. All rights reserved. Quality Assurance: Towards Tools for Characterizing and Comparing Digital Documents Natasa Milic-Frayling.
John OckerbloomDec. 6, 2002 Supporting learning at the library Towards integrating LMS and digital library technology at Penn John Mark Ockerbloom CNI.
The eXtensible Past XML As a Means for Easy Access to Historical Research Data and a Strategy for Digital Preservation.
MTEI Methods & Tools for Enterprise Integration
Professional Informatics & Quality Assurance Software Lifecycle Manager „Tools that are more a help than a hindrance”
Different approaches to digital preservation Hilde van Wijngaarden Digital Preservation Officer Koninklijke Bibliotheek/ National Library of the Netherlands.
Digital preservation Hydra Europe, LSE 24 April 2015 Anders Conrad.
World Bank, Africa Region, Africa Household Survey Databank - The World Bank - Africa.
2005 Adobe Systems Incorporated. All Rights Reserved. 1 Ontolog Forum Gunar Penikis Sr. Product Manager Adobe Systems.
9 Feb 2004Mikko Mäkinen & Saija Ylönen Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Geneva, 9-11 February 2004, Topic (ii): Metadata.
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
Dr. Kurt Fendt, Comparative Media Studies, MIT MetaMedia An Open Platform for Media Annotation and Sharing Workshop "Online Archives:
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan, Florida Center for Library Automation DCC Workshop on Long-term Curation within Digital Repositories.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
Libraries, Archives, and Digital Preservation: The Reality of What We Must Do Leslie Johnston Acting Director, National Digital Information Infrastructure.
METS at UC Berkeley Generating METS Objects. Background Kinds of materials: –primarily imaged content & tei encoded content archival materials: manuscripts.
Colectica: A Platform for DDI 3 based Metadata Management Design. Collect. Share.
Building a Topic Map Repository Xia Lin Drexel University Philadelphia, PA Jian Qin Syracuse University Syracuse, NY * Presented at Knowledge Technologies.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Enterprise Solutions Chapter 10 – Enterprise Content Management.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan Florida Center for Library Automation (FCLA)
Carl Lagoze Digital Library Service Registry Workshop Services in a Scholarly Communication Framework.
Open Access & Institutional Repositories, Accra June 2007 Metadata and e-preservation Dr D Peters DISA: Digital Innovation South Africa.
Package! Publish! Print! Brian Adelberg Digital Document Solutions Software Development Lead Microsoft Corporation.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
By Jeremy Burdette & Daniel Gottlieb. It is an architecture It is not a technology May not fit all businesses “Service” doesn’t mean Web Service It is.
Digital Preservation What, Why, and How? Dan Albertson’s Digital Libraries Class April 13, 2016 Jody DeRidder Head, Metadata & Digital Services University.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Learn more about office users -- Feature usage study by document element statistics Rui SuYing IBM Lotus Symphony.
Preservation Planning Bojana Tasić FORS SEEDS Workshop I Belgrade, October.
Lotus Symphony Extension Model ● Jin Hua, Chen ● IBM.
DP Knowhow: Open Archival Information Systems (OAIS) in ISO APA/C-DAC International Conference on Digital Preservation and the Development of Trusted.
Java Web Services Orca Knowledge Center – Web Service key concepts.
7th Annual Hong Kong Innovative Users Group Meeting
Building A Repository for Digital Objects
DAITSS: Dark Archive in the Sunshine State
? What is Institutional Repository for Rutgers University
An Overview of Data-PASS Shared Catalog
Unit 5 Systems Integration and Interoperability
Building Search Systems for Digital Library Collections
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
Lisa Ruff Business Productivity/Accessibility TS Microsoft Federal
DataForge: A DDI-Enabled Toolkit for Researchers and Data Managers
Malte Dreyer – Matthias Razum
RODA.
Digitization Standards: Issues & Updates
The Next Generation of the Microdata Information System MISSY: An Integrated Solution for the Documentation of European Microdata European DDI User Conference,
Presentation transcript:

Preservation by Migration to XML Dirk Roorda

work on a preservation strategy positioning of the XML preservation strategy implementing the strategy in software pursuing a standard with international partners welcome to MIXED

Dynamics of preservation when the digital context changes emulate: re-implement data and tools migrate: re-represent data, use new tools whatever you do, do it smart

Data and tools

Smart strategies multiple related tasks? seek normalization

Diachronic and synchronic migration across time (diachronic) original is nearly obsolete original intention might be unclear migration within time (synchronic) many different formats vendor specific peculiarities

Smart migration

MIXED explained Migration to Intermediate XML for Electronic Data

MIXED scenario

Kinds of data documents spreadsheets databases statistical data images word, open(red)office excel, openoffice access, mysql spss, sas photoshop, irfanview

Umbrella format

Making it work software = framework + modules standard = wrapper + metadata + for each kind of data: selected XML standard for that kind

Making software make a good product as initial effort generic framework substantial number of conversion plugins for spreadsheets and databases for statistical data connect to repositories, Fedora ready integrate efforts of all interested parties by using an open architecture webservices for framework and plugins use third party plugins for SPSS and DDI using an open source paradigm

SPSS reader as plugin

Helping a standard emerge finding a name: Preferred Data Formats for Archives (PDFA) suggestions welcome using existing auxiliary standards connecting to open source software seeking a user base in the archiving world

Using MIXED repositories: preservation planning interested in file conversion: web services individual users: stand alone

Using standards XML (Schema, UNICODE) OAIS (interface to repositories) ODF (spreadsheets) SOAP, ESB Java, SPRING, OSGI

Co-operation DExT (Data Exchange Tools) (UKDA) ODaF (Open Data Foundation) PLANETS (Preservation and Long-term Access through Networked Services) you?

Trends end of vendor-specific binary formats in sight interchange formats more concerned with preservation

The future... the major data kinds have a preferred preservation format the set of preservation formats is standardized easy-to-use software converts between preservation formats and custom formats

Discussion One preservation standard per data kind? ODF OOXML ! The role of DDI in metadata and data is there an interchange format for statistical data between SPSS, SAS, STATA etc. ? Is formatting and action relevant for preservation? fonts, colors, formulas Usage of MIXED it is a collection of quality convertors it will belong to a collection of preservation tools Plugin interoperability how can we reuse other conversion plugins