An Approach to Software Preservation

Slides:



Advertisements
Similar presentations
Publishing Data Catherine Jones Library Systems Development Manager, STFC Rutherford Appleton Laboratory CLADDIER workshop, Chilworth, Southampton, UK.
Advertisements

A centre of expertise in digital information managementwww.ukoln.ac.uk QA For Web Sites: QA Focus Resources Brian Kelly UKOLN University of Bath Bath .
Configuration management
Database System Concepts and Architecture
Interoperability Scenarios All Working Groups Meeting May, Rome, Italy.
ITIL: Service Transition
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Human Computer Interaction G52HCI
July 11 th, 2005 Software Engineering with Reusable Components RiSE’s Seminars Sametinger’s book :: Chapters 16, 17 and 18 Fred Durão.
Future Access to the Scientific and Cultural Heritage – A shared Responsibility Birte Christensen-Dalsgaard State and University Library.
Introduction to z/OS Basics © 2006 IBM Corporation Chapter 8: Designing and developing applications for z/OS.
© , Michael Aivazis DANSE Software Issues Michael Aivazis California Institute of Technology DANSE Software Workshop September 3-8, 2003.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Overview of Database Languages and Architectures.
Course Instructor: Aisha Azeem
Catherine Jones Science and Technology Facilities Council SCAPE Training Statsbiblioteket, Aarhus, November 2013 Control Policy formulation The why.
UFCFX5-15-3Mobile Device Development UFCFX Mobile Device Development An Introduction to the Module.
PRESENTATION START.
UML - Development Process 1 Software Development Process Using UML (2)
Metadata Creation with the Earth System Modeling Framework Ryan O’Kuinghttons – NESII/CIRES/NOAA Kathy Saint – NESII/CSG July 22, 2014.
ITEC224 Database Programming
An Introduction to Software Architecture
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN Why persistent identifiers are crucial in digital preservation.
SENG521 (Fall SENG 521 Software Reliability & Testing Software Product & process Improvement using ISO (Part 3d) Department.
Access Across Time: How the NAA Preserves Digital Records Andrew Wilson Assistant Director, Preservation.
Relationships July 9, Producers and Consumers SERI - Relationships Session 1.
29/11/20071 Brian Matthews Brian McIlwrath A Framework for the Significant Properties of Software.
BE-SECBS FISA 2003 November 13th 2003 page 1 DSR/SAMS/BASP IRSN BE SECBS – IRSN assessment Context application of IRSN methodology to the reference case.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Enhancing social tagging with a knowledge organization system Brian Matthews STFC.
FDT Foil no 1 On Methodology from Domain to System Descriptions by Rolv Bræk NTNU Workshop on Philosophy and Applicablitiy of Formal Languages Geneve 15.
A centre of expertise in data curation and preservation Funded by: This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Integration integration of all the information flowing through a company – financial and accounting, human resource information, supply chain information,
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
OOAD UNIT V B RAVINDER REDDY PROFESSOR DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING.
Example projects using metadata and thesauri: the Biodiversity World Project Richard White Cardiff University, UK
PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA …………………………………………………………………………………………………… LOUISE CORTI …………………….…………………………….… UK DATA ARCHIVE.
JRA1 Meeting – 09/02/ Software Configuration Management and Integration EGEE is proposed as a project funded by the European Union under contract.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
BNSC Agency Report David Giaretta Colorado Springs 16 Jan 2007.
LECTURE 5 Nangwonvuma M/ Byansi D. Components, interfaces and integration Infrastructure, Middleware and Platforms Techniques – Data warehouses, extending.
Preserving containers EUAN COCHRANE DIGITAL PRESERVATION MANAGER YALE UNIVERSITY LIBRARY.
 System Requirement Specification and System Planning.
1 CASE Computer Aided Software Engineering. 2 What is CASE ? A good workshop for any craftsperson has three primary characteristics 1.A collection of.
Advanced Software Engineering Dr. Cheng
Advanced Higher Computing Science
CESSDA SaW Training on Trust, Identifying Demand & Networking
ITIL: Service Transition
Software Project Configuration Management
Active Data Management in Space 20m DG
Design and Implementation
Registry Repository of Representation Information
VI-SEEM Data Repository
Chapter 16 – Software Reuse
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Introduction to Software Testing
Chapter 2: The Linux System Part 1
Patterns.
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
Course: Module: Lesson # & Name Instructional Material 1 of 32 Lesson Delivery Mode: Lesson Duration: Document Name: 1. Professional Diploma in ERP Systems.
Chapter 11: Software Configuration Management
An Introduction to Software Architecture
Chapter 7 –Implementation Issues
Open Archival Information System
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Executable Specifications
Presentation transcript:

An Approach to Software Preservation PV 2009, Madrid Arif Shaon, Brian Matthews, Juan Bicarregui, Catherine Jones (STFC), Jim Woodcock (Univ of York) 1 December, 2009 1

Science and Technology Facilities Council Provide large-scale scientific facilities for UK Science particularly in physics and astronomy E-Science Centre – at RAL and DL Provides advanced IT development and services to the STFC Science Programme Strong interest in Digital Curation of our science data Keep the results alive and available R&D Programme: DCC, CASPAR

Long-term Preservation of Software JISC funded work: Tools & Guidelines for the preservation of software as a research output Used the JISC funded: Significant Properties of Software Report Software very large topic Diversity in: application of software and software architecture and scale of software and provenance and user interaction Project needed to limit scope Scientific and mathematical software Limited commercial consideration Limit consideration of user interaction Finding information Literature, Standards (e.g. the OAIS Reference Model) Case Studies: Talking to developers of products and software repositories Developing a framework for software preservation.

Software Preservation What is software preservation? Storing a copy of a software product Enabling its retrieval in the future Enabling its reconstruction in the future Enabling its execution in the future Not what most software developers and maintainers do.

Why Preserve Software ? Preserving the Data Preserving the work Preserving the software is necessary to preserve other data Keep the data live and reusable Prime motivation for STFC Preserving the work E.g. research work in Computing Science Reproducible Handling Legacy Specialised code from the past which still needs to be used Usually seen as a problem! Museums and archives: Either supporting Hardware E.g. Bletchley Park, Science Museum, Or in its own right Chilton Computing, Multics History Project

A Conceptual Framework for Long-term Software Preservation Three aspects to the framework: A Performance Model for software Determine what it means to preserve s/w Adequacy of performance of s/w Based on the NAA performance model for digital preservation Model for describing s/w artefacts As complex digital objects. Versions and variants Properties for preservation For retrieval, reconstruction, replay

Performance Model for Software Compiler Compiler Processing Compiler Performance / SW Processing Software Source Data Source Software Performance / Data Processing User Data Performance Testing data performance to judge adequacy of the software performance. Important to maintain software test suite to assess preservation of significant properties of the software.

Adequacy of Software Preservation A software package can be said to perform adequately relative to a particular set of “significant properties”, if in a particular performance it preserves those significant properties to an acceptable tolerance. Significant properties are evaluable features of the performance After the recall and reconstruction phase Assesses the value of the replay Can be generalised to any digital object

A Conceptual Model for Software Product The whole software object under consideration Could be single library module, or very large system (e.g. Linux) Comes under one “authority” (legal control) Defines “gross functionality” Version Releases of the system Characterised by changes in detailed functionality Variant Versions for a particular platform Characterised by operating system and environment Instance A particular instance of a particular variant at a particular location Ownership An individual licence Fixed to particular MAC or IP address, URLs etc. *

Preservation Properties of Software What attributes of software do we need to take into account for long-term preservation? Software is a Composite digital object Collection of modules Specifications, Configuration scripts, test suites, documentation Architecture Client/server, storage system, input / output User interaction Command line, User Interface User model Functionality what it does and what data it depends on Environment platform, operating system, programming language versions Dependencies Compilation dependency graph Standard libraries Other software products Specialised hardware

Relationship to the OAIS model Software Compiler Compiler Processing Rep. Descriptive Compiler Performance / SW Processing PDI & SP Software Source Info Info Software Performance / Data Processing User Data Source Data Performance Retrieval Reconstruct Replay Open Archival Information System (OAIS) – ISO standard for the preservation of digital object. Software preservation properties are related to concepts in OAIS.

The BADC Case Study (1) The British Atmospheric Data Centre (BADC) A NERC Designated Data Centre Hosts over 250TB of atmospheric data for UK scientists and researchers Also develops, supports, and provides access to a variety of software to facilitate accessibility and usability of data Examples of BADC software: Trajectory Service, Weather Generator The BADC approach to Software Preservation Long-term preservation is out of the current operational remit Considers the high recurring costs of preservation as a prohibitive factor Requires preservation solutions that could be integrated into the existing software management infrastructure

The BADC Case Study (2) Evaluating the preservation framework against some BADC software artefacts Involved recording values for different preservation properties defined in the framework The final result validates the relevance and adequacy of the framework However, highlights that clear understanding of both the framework and different aspects of the software is also needed Underlines the need for suitable tools with sufficient guidelines

Significant Properties Editing and Querying for Software (SPEQS) Java-based Eclipse plug-in; enables capturing software preservation properties during its development Demonstrates the concept of preservation tools that could be integrated within existing software development systems Used in CASPAR project

Summary Exploration of the s/w preservation space Defined reasons, audience, some basic concepts Defined a framework which enables s/w to be included in OAIS preservation framework Fits in a OAIS compatible preservation methodology Validated in some practical scenarios

Questions? http://sigsoft.dcc.rl.ac.uk/twiki/bin/view http://www.e-science.stfc.ac.uk/projects/software-preservation/softpres8985.html