HAN Conference 2000 - © History Data Service The History Data Service : Promoting Good Practice and Standards of Scholarship Cressida Chappell Head of.

Slides:



Advertisements
Similar presentations
1 of 18 Information Dissemination New Digital Opportunities IMARK Investing in Information for Development Information Dissemination New Digital Opportunities.
Advertisements

Research Data Access and Preservation Summit Panel 2 - Promoting Re-Use of Scientific Collections Some responses to the questions posed... John Harrison.
28 March 2003e-MapScholar: content management system The e-MapScholar Content Management System (CMS) David Medyckyj-Scott Project Director.
Part of the UK Data Archive and the Arts and Humanities Data Service. Funded by the Joint Information Systems Committee and the Arts and Humanities Research.
History Data Service1 Good Design for Historical source based Databases History Data Service Hamish James.
Strategic issues for digital projects... …or, what are we doing here?
Strategic issues for digital projects... …or, what are we doing here?
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
QA Focus Digital Preservation End of Programme Meeting: 5/99, 7/99, DiVLE and JISC/NSF International Digital Libraries.
Data Management: Metadata, Repositories and Curation Tony Mathys, Anne Robertson Eddie Boyle, Guy McGarva GeoForum, 4 th November, York.
Music Encoding Initiative (MEI) DTD and the OCVE
ISO 9001:2000 Documentation Requirements
P4 – Features and Functions of Information Systems
History of English Language Assessment Archives in context and as context Database structure ISAAR (CPF) Online Archival Sustainability.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
A centre of expertise in data curation and preservation MIS Seminar :: University of Edinburgh :: 2 October 2006 Funded by: This work is licensed under.
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
EAD in A2A Bill Stockting, Senior Editor A2A and EAD Working Group: Central Archives of Historical Records, Warsaw, 26 April 2003.
Common Use Cases for Preservation Metadata Deborah Woodyard-Robinson Digital Preservation Consultant Long-term Repositories:
Medieval Sources, Digital Resources Mark Merry History Data Service
Depositing and Disseminating Digital Resources Alan Morrison Collections Manager AHDS Subject Centre for Literature, Linguistics and Languages.
Part of the Arts and Humanities Data Service and the UK Data Archive. Funded by the Joint Information Systems Committee and the Arts and Humanities Research.
AHDS History Zoe Bliss Acquisitions and Information Officer.
Prepared by Long Island Quality Associates, Inc. ISO 9001:2000 Documentation Requirements Based on ISO/TC 176/SC 2 March 2001.
Object Orientated Data Topic 5: Multimedia Technology.
Digitization at the National Archives and Records Administration Doris Hamburg Director, Preservation Programs James Hastings Director, Access Programs.
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
Digitisation Mick Eadie Visual Arts Data Service.
Software Development Unit 2 Databases What is a database? A collection of data organised in a manner that allows access, retrieval and use of that data.
Introduction to Database Systems 1.  Assignments – 3 – 9%  Marked Lab – 5 – 10% + 2% (Bonus)  Marked Quiz – 3 – 6%  Mid term exams – 2 – (30%) 15%
HTML Comprehensive Concepts and Techniques Intro Project Introduction to HTML.
ORGANIZING AND STRUCTURING DATA FOR DIGITAL PROJECTS Suzanne Huffman Digital Resources Librarian Simpson Library.
Profile and a quick introduction Software Engineering: ) هندسة البرمجيات (in Arabic: is the branch of computer science Designed to develop a set rules.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
SEMINAR ON :. ORGANISATION Organizations are formal social units devoted to attainment of specific goals. Organizations use certain resources to produce.
Recordkeeping for Good Governance Toolkit Digital Recordkeeping Guidance Funafuti, Tuvalu – June 2013.
Archival information system ARHiNET Croatian national archival information system Vlatka Lemić Croatian State Archives, Croatia.
Relationships July 9, Producers and Consumers SERI - Relationships Session 1.
Choosing Delivery Software for a Digital Library Jody DeRidder Digital Library Center University of Tennessee.
A CIDOC CRM – compatible metadata model for digital preservation
Object Orientated Data Topic 5: Multimedia Technology.
1 The Technical Standards and Your Bid Sarah Ormes UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by Resource: The Council for Museums, Archives.
Section 8.1 Create a custom theme Design a color scheme Use shared borders Section 8.2 Identify types of graphics Identify and compare graphic formats.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
GRASP: Designing Objects with Responsibilities
Introduction ESDS Qualidata John Southall ESDS Creating and delivering re-usable qualitative data 24 June 2004.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
Metadata and Documentation Iain Wallace Performing Arts Data Service.
Introduction to metadata
Storage of digital objects Adolf Knoll National Library of the Czech Republic
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
Topic 4 - Database Design Unit 1 – Database Analysis and Design Advanced Higher Information Systems St Kentigern’s Academy.
XML CSC1310 Fall HTML (TIM BERNERS-LEE) HyperText Markup Language  HTML (HyperText Markup Language): December  Markup  Markup is a symbol.
1/16/2016I. Revels Digital Imaging Workshop 1 Selection Considerations For Digital Imaging Projects.
HDF and HDF-EOS: Implications for Long-Term Archiving and Data Access.
Calum Dow Thurs 12 th November Our Partners…
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
@ulccwww.ulcc.ac.uk IRMS Cymru October 2015 From EDRMS to digital archive: a wish-list for ways to preserve digital records.
CLASS Metadata and Remote Sensing Extensions CLASS Data Provider’s Conference September 2005 Anna Milan, Ted.Habermann,
DOCUMENTATION REF: Essentials of IT (Hamilton et al) Chapter 1.
CENTRAL/WESTERN MASSACHUSETTS AUTOMATED RESOURCE SHARING Digitization GOALS & THEIR LOGISTICS Michael J. Bennett Digital Initiatives Librarian C/WMARS,
Practical Aspects of Preservation Peter Simpson Development Officer Arts and Humanities Data Service.
Chapter 1 Introduction to HTML
LO2 - Be Able to Design IT Systems to Meet Business Needs
Intro Project Introduction to HTML.
Palestinian Central Bureau of Statistics
Presentation transcript:

HAN Conference © History Data Service The History Data Service : Promoting Good Practice and Standards of Scholarship Cressida Chappell Head of Service Hamish James Collections Manager

HAN Conference © History Data Service Who Are We? Founded in 1993 National service funded by –Joint Information Systems Committee –Arts and Humanities Research Board Based in the UK Data Archive at the University of Essex Part of the Arts and Humanities Data Service Team of historians and IT specialists

HAN Conference © History Data Service What Do We Do? (1) Mission Statement The History Data Service collects, preserves, and promotes the use of digital resources, which result from or support historical research, learning and teaching.

HAN Conference © History Data Service What Do We Do? (2) Provide advice and training about creating, describing, using and preserving historical digital resources Collect and preserve historical digital resources Provide access to a wide-ranging collection of historical digital resources including resources held by other organisations Develop online data and metadata delivery systems to enhance access to this collection Establish thematic special collections, and enrich and enhance selected data collections Promote standards and best practice in the creation, description, use and preservation of historical digital resources

HAN Conference © History Data Service Digitisation: Good Practice A ‘good’ digital resource is one that is flexible and that can support many different uses –Preservation - an accurate representation of the source material. –Research - codes, indexes, categorisation. –Access - many different users in different settings. Who will use the resource? How will they use it? –Are there discipline specific methods of describing, categorising, and coding information that should be used? –Can the resource be searched at appropriate levels of detail? The digital resource should be well documented The digital resource should adhere to standards and avoid reliance on unusual features of software or hardware

HAN Conference © History Data Service Ensuring Preservation The time and resources invested in the creation of digital resources can easily be placed in jeopardy because hardware and software become obsolete, and magnetic media degrade Long-term preservation is essential if this investment is to be safeguarded Digital resources need to be preserved and migrated through changing technologies in order that they will continue to be accessible in the future However, the extent to which a digital resource can be preserved without significant information loss is largely dependent on decisions taken during the data creation process

HAN Conference © History Data Service Supporting Re-use Many historical digital resources potentially have significant and long-term value to the research and teaching community The time and resources invested in their creation can only be fully realised if they are suitable for re-use both by the data creator and by others Such suitability, however, is again largely dependent on decisions taken during the data creation process

HAN Conference © History Data Service Ensuring Long Term Viability To remain usable digital resources must have suitable hardware and software and be well documented Digital technology develops so quickly that resources can become unusable within a few years if they are not actively preserved The digital resource should be hardware and software independent to ensure that it remains usable. Use neutral data formats. These are formats that are widely accepted, are not controlled by a single organisation, and have a publicly available definition Use commonly accepted formats in preference to specialist formats. Avoid relying on special features of particular software or hardware that cannot be adequately replicated in other settings

HAN Conference © History Data Service Why Is Good Documentation Important? The maintenance of comprehensive documentation detailing the data creation process and the steps taken involves a significant but profitable investment of time and resources It is more effective if documentation is generated during rather than after a data creation project Such an approach will result in a better-quality data collection, as well as better-quality documentation, because the maintenance of proper documentation demands consistency and attention to detail The process of documenting a data creation project can also have the benefit of helping to refine research questions and it can be a vital aid to communication in larger projects Good documentation is crucial to a data collection’s long-term vitality Without it the resource will not be suitable for future use and its provenance will be lost Proper documentation contributes substantially to a data collection's scholarly value

HAN Conference © History Data Service What Is Good Documentation? At a minimum, documentation should provide information about a digital resource’s –Contents –Provenance Who created the digital resource and why? How was the digital resource created? Which sources were used to create the data collection? –Structure It needs to be sufficiently detailed to allow the data creator to use the resource in the future, when the data creation process has started to fade from memory It also needs to be comprehensive enough to enable others to explore the resource fully, and detailed enough to allow someone who has not been involved in the data creation process to understand the data collection and the process by which it was created.

HAN Conference © History Data Service Elements Of A Digital Resource The environment of a digital resource often receives the most attention, but it is the users and digital objects that are most important –Hardware and software selection should be based on the needs of the users and the types of digital objects to be use Users Knowledge Experience Culture Environment Hardware Software (OS) (Network) Digital Objects Binary Data Relationships

HAN Conference © History Data Service Digitisation Process Digitisation: Any means of capturing the information content of a non-digital source in binary coded form The digitisation process involves separating the information content of the source from the medium which carries that information The process of digitisation creates a representation of the original source, it does not create a duplicate of the original source Information may be enhanced or damaged, discarded or added during the digitisation process Digitisation forces choices about which aspects of the source will be captured in the digital representation of the source Information content can be anything about a source. Consider a page in a book; the information content includes: the text on the page the size and shape of the characters on the page the layout of text on the page the chemical composition of the paper the number of the page

HAN Conference © History Data Service Source - Digitisation - Resource The ‘input channels’ of digitisation (keyboard, scanner etc.) are narrow and can only capture a small proportion of the source’s information content identify aspects of source to digitise chose digitisation method chose digital format

HAN Conference © History Data Service Source Analysis Simplify the source –ignore unwanted information –exclude certain types of information –Define a sub-set of the remaining information content –select information directly from the source or define a set of summarised information based on the source Model the information content sub-set –break information content into discrete elements of information –describe the characteristics of each information element –describe how information elements relate to each other Successful source analysis requires a good understanding of the source and of the purpose of the digital resource

HAN Conference © History Data Service Data Models Data models are abstract ways of structuring information File formats are specific ways of implementing a data format –A Word97 document and an WordPerfect 8.0 document are different file formats that both implement similar data models of ‘a document’ The information content of a source can usually be represented by a number of different data models –the source and the intended purpose of the digital resource should determine the most appropriate data model to use –once a data model has been selected, it should be possible to store data in a number of file formats as required To be useful, digital data must be: –organised according to an appropriate data model –stored in a file format that can represent the data model –used in an application that understands the data model and file format in the desired way (try opening an HTML file in a web browser and an ordinary text editor, notice the difference)

HAN Conference © History Data Service Overlapping Formats Often one data model can be represented using another data model –the elements of a mark-up document can be stored as fields in a database –a database can be stored using a mark-up DTD –SVG (Scalable Vector Graphics) is an XML DTD for storing vector based images There are usually many file formats that can be used to represent a single data model, selecting the right data model is much more important then selecting a particular file format Choice of file format will follow from choice of software that suits your requirements

HAN Conference © History Data Service Digitisation: A Balancing Act Successful digitisation involves several trade-offs: –Amount and detail vs. time and cost of digitisation –Complexity of the digital resource vs. ease of use and understanding –Flexibility of the digital resource vs. suitability for a specific use –Feasibility of digitisation with current technology vs. future possibilities for digitisation Choices of what to digitise and how to digitise a source should be guided by a firm understanding of the source and the intended purpose of the digital resource –Do not exceed the limits of available support (financial, technical, equipment, labour) –Always try to preserve the information content of the source –Keep information that tracks the origin and history of the digital resource with the digital resource