1 Digital Preservation Testbed Database Preservation Issues Remco Verdegem Bern, 9 April 2003.

Slides:



Advertisements
Similar presentations
Lecture-7/ T. Nouf Almujally
Advertisements

Managing Data Resources
Oct 31, 2000Database Management -- Fall R. Larson Database Management: Introduction to Terms and Concepts University of California, Berkeley School.
1 Basic DB Terms Data: Meaningful facts, text, graphics, images, sound, video segments –A collection of individual responses from a marketing research.
3-1 Chapter 3 Data and Knowledge Management
Introduction to Databases Transparencies
“DOK 322 DBMS” Y.T. Database Design Hacettepe University Department of Information Management DOK 322: Database Management Systems.
8/28/97Information Organization and Retrieval Files and Databases University of California, Berkeley School of Information Management and Systems SIMS.
Dr. Mohamed Osman Hegazi
BUSINESS DRIVEN TECHNOLOGY
Attribute databases. GIS Definition Diagram Output Query Results.
Different approaches to digital preservation Hilde van Wijngaarden Digital Preservation Officer Koninklijke Bibliotheek/ National Library of the Netherlands.
LECTURE 2 DATABASE SYSTEM CONCEPTS AND ARCHITECTURE.
Database Management System Lecture 2 Introduction to Database management.
Class 6 Data and Business MIS 2000 Updated: September 2012.
Information storage: Introduction of database 10/7/2004 Xiangming Mu.
1 Introduction to databases concepts CCIS – IS department Level 4.
Systems analysis and design, 6th edition Dennis, wixom, and roth
6-1 DATABASE FUNDAMENTALS Information is everywhere in an organization Information is stored in databases –Database – maintains information about various.
By: M.Nadeem Akhtar1 The Data Base Management System (DBMS) Ch # 09.
Web-Enabled Decision Support Systems
Introduction to SQL Steve Perry
CST203-2 Database Management Systems Lecture 2. One Tier Architecture Eg: In this scenario, a workgroup database is stored in a shared location on a single.
Database Design - Lecture 2
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
CS 474 Database Design and Application Terminology Jan 11, 2000.
6 Chapter Databases and Information Management. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits.
Data Management David Nathan & Peter Austin & Robert Munro.
Dr. Mohamed Osman Hegazi 1 Database Systems Concepts Database Systems Concepts Course Outlines: Introduction to Databases and DBMS. Database System Concepts.
Storing Organizational Information - Databases
2005 Epocrates, Inc. All rights reserved. Integrating XML with legacy relational data for publishing on handheld devices David A. Lee Senior member of.
MIS 327 Database Management system 1 MIS 327: DBMS Dr. Monther Tarawneh Dr. Monther Tarawneh Week 2: Basic Concepts.
Lecture2: Database Environment Prepared by L. Nouf Almujally 1 Ref. Chapter2 Lecture2.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
CHAPTER 3 DATABASES AND DATA WAREHOUSES. 2 OPENING CASE STUDY Chrysler Spins a Competitive Advantage with Supply Chain Management Software Chapter 2 –
DATABASE MANAGEMENT SYSTEMS CMAM301. Introduction to database management systems  What is Database?  What is Database Systems?  Types of Database.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
ITGS Databases.
1 Database Management Systems (DBMS). 2 Database Management Systems (DBMS) n Overview of: ä Database Management Components ä Database Systems Architecture.
Dr. Mohamed Osman Hegazi
Database Concepts Track 3: Managing Information using Database.
Assoc. Prof. Dr. Ahmet Turan ÖZCERİT.  The concept of Data, Information and Knowledge  The fundamental terms:  Database and database system  Database.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
ISC321 Database Systems I Chapter 2: Overview of Database Languages and Architectures Fall 2015 Dr. Abdullah Almutairi.
Managing Data Resources File Organization and databases for business information systems.
© 2017 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner.
Database Principles: Fundamentals of Design, Implementation, and Management Chapter 1 The Database Approach.
Databases and Database User ch1 Define Database? A database is a collection of related data.1 By data, we mean known facts that can be recorded and that.
An Introduction to database system
Database System Concepts and Architecture
Building A Repository for Digital Objects
XML and Databases.
Databases.
Database Management System (DBMS)
Chapter 2 Database Environment.
MANAGING DATA RESOURCES
Chapter 1 Database Systems
Chapter 1: The Database Environment
Data Model.
Database Systems Instructor Name: Lecture-3.
Database Design Hacettepe University
Medusa at the University of Illinois
Chapter 1 Database Systems
DATABASES WHAT IS A DATABASE?
Chapter 2 Database Environment Pearson Education © 2014.
The Database Environment
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
Presentation transcript:

1 Digital Preservation Testbed Database Preservation Issues Remco Verdegem Bern, 9 April 2003

2 Testbed Background Established October 2000 by –Ministry of the Interior –Ministry of Education, Culture and Sciences Dutch National Archives Will finish 1 st October 2003 Objective: –To secure sustained accessibility to reliable government information in the digital era

3 Research Questions Advantages of different preservation approaches? Factors and effectiveness of each approach? Basic Requirements for preservation? Which metadata are essential for preservation? Options for Attribute preservation?

4 Scope 4 Archival Record Types: –Text documents –Spreadsheets – s –Databases 3 Preservation Approaches: –Migration –Emulation –XML

5 Database Three components: –Contents –Database Management System (DBMS) –Application The database system comprises all three components. The term database includes at least the contents of a database.

6 Different types of databases Relational - Oracle, Microsoft Access Hierarchical Native XML - Tamino Object oriented Network

7 Compared to other recordtypes Each database system is unique The native environment (application) is not widely available and is generally database specific The technical challenge for converting databases into a preservation friendly format is high Operational database contents are subject to frequent changes The relationship between a record and a database is unclear and is also context dependent.

8 Relationship database & record Records are contained, as whole objects, in the database. The contents of the database contain records. Each record is spread over tables. The contents of the database is the record. Database data (as whole objects or spread across tables) accessed or presented in a precise manner in the application form a record. The whole database system is the record A database is not a record at all

9 What are we trying to preserve? The record/s as discrete, easy to access entities. The ability to reconstruct the contents of the database. The data and input/output screens to form/reproduce records. The whole database system.

10 Dutch archival regulation article 6e “for databases: the original storage format or ASCII (flat file, with separator tokens), accompanied by documentation, preferably as an XML-DTD, about the structure of the database (at least encompassing the complete logical data model with a description of the entities); queries should be stored in the query language SQL (SQL2)”.

11 Our work so far Focus on relational databases. Conversion of databases to XML, concentrating on content and structure. Review of commercial tools for converting databases to XML. Design and development of conversion tool.

12 XML; pros Open standard, widely accepted and applied, well structured. Platform and program independent. Practical approach to the concept of separation of content, structure and apperance. Extensible and controllable; readable by both humans and machines. Free - i.e no royalties payable. Widely used, so lots of software tools already available.

13 XML; cons Verbose; indeed human readable, but too much to read! XML will be superseded in 5 or 10 years. Complex material -> much pioneering work is still to be done. What to do with the XML? Alternatives:  Original file format  ASCII

14 XML vs migration Migration needs to have both systems running. Migration needs to happen every few years. Lot of work involved in migration; requires specialist knowledge. Conversion to XML can be seen as an intermediate step in a migration between a present day and a future database. By converting to XML, you remove the dependency on the present day database system.

15 Tool; how it works Concentrates on the data in the database. Designed to closely follow the structure of the relational database. One XML file for each table. Separate overview XML file. Constraint information in the overview file describes the structure of the database. (Optionally) store database views.

16 Problems encountered JDBC or a ODBC/JDBC bridge Details are different for different databases Not all of the methods in java.sql work in all cases Oracle data tables for one application are typically associated with a particular table owner Resources Images

17 Demonstration ………

18 Conclusions Preservation of databases is still “uncharted territory”. Lot of questions, just a few answers. XML is able to preserve content and structure of databases. More information: