Oral history as research data CLARIN workshop: Exploring Spoken Word Data in Oral History Archives Oxford 18-19 April 2016 Louise Corti Director, Collections.

Presentation transcript:

Oral history as research data CLARIN workshop: Exploring Spoken Word Data in Oral History Archives Oxford April 2016 Louise Corti Director, Collections Development and Producer Relations

Covering
‘Research’ data use and by whom
Data discovery and access
Representing content and context when publishing data

My organisation - UK Data Archive
Department of the University of Essex
Established in 1967 as a ‘Data Bank’
48 years of curating and providing access to social science data
Data and support services for research, teaching and learning
Runs the UK Data Service: national service providing access to social science research data
Speciality in social survey data, qualitative data and now ‘big data’
Registered to ISO (information security standard)

UK Data Service ukdataservice.ac.uk

Sister data archives Source:

Some statistics about our Service
Data for research and teaching purposes, used in all sectors and by many different disciplines
6,000 datasets in the collection
400 new datasets/new editions added within last 12 months
25,000 registered users
c.75,000 “downloads” from Core service
c.40,000 page views on UKDS.stat
c.82,000 census downloads
2000,000 downloads worldwide per annum
user support queries per annum
c.3,000,000 web page views

Qualitative data services
1994: Qualidata funded by ESRC for 6 years, piloting a national approach to qualitative data sharing and archiving
Fully incorporated into UK Data Service in 2000
Archiving, data sharing, secondary analysis training and (inter)national advice
Fully integrated, with 4 specialists in house (plus other portfolios)
70 staff

Qualitative data

Key data

Examples: oral history interviews
957 qualitative collections, mostly text-based
Family Life and Work Experience before 1918, Middle and Upper Class Families in the Early 20th Century (SN 5404)
British Oral Archive of Political and Administrative History (SN 5252)
Oral History of Cultural Consumption in Italy (SN 6479)

Re-use purposes of qualitative data downloaded from UK Data Service. Source: Bishop & Kuula-Luumi, Sage Open 2016

What do users do with the data?
Comparative research, restudy or follow-up study
Re-analysis/secondary analysis
Research design and methodological advancement
Replication of published statistics
Teaching and learning

Publications reusing qualitative data, Web of Knowledge. Source: Bishop & Kuula-Luumi, Sage Open 2016

Citations of publications reusing qualitative data, Web of Knowledge. Source: Bishop & Kuula-Luumi, Sage Open 2016

The national data (survey) archives – qualitative data volume
UK Data Service
Finnish Data Service
GESIS
Qualiservice, Bremen
Slovenia Data Archive
Swiss Data Service

Access conditions
Open: available for download/online access under open licence without any registration
Safeguarded: available for download/online access to logged-in users who have registered and agreed to an End User Licence
Controlled: available for remote or safe room access to registered users whose research proposal has been approved by an access committee and who have received specialist training
Depositor selects, with guidance, the access category most appropriate for the data collection
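
The three access categories can be read as a simple decision rule. The sketch below is illustrative only: the names `Access` and `can_access` and the boolean flags are assumptions made for this example, not part of any UK Data Service system.

```python
from enum import Enum


class Access(Enum):
    """Access categories named on the slide."""
    OPEN = "open"
    SAFEGUARDED = "safeguarded"
    CONTROLLED = "controlled"


def can_access(level, registered=False, licence_agreed=False,
               proposal_approved=False, trained=False):
    """Return True if a user meeting these conditions may access the data.

    Hypothetical helper: the flags mirror the conditions listed on the slide.
    """
    if level is Access.OPEN:
        # Open: no registration needed.
        return True
    if level is Access.SAFEGUARDED:
        # Safeguarded: registered user who agreed to the End User Licence.
        return registered and licence_agreed
    # Controlled: registered, approved research proposal, specialist training.
    return registered and proposal_approved and trained
```

For example, `can_access(Access.SAFEGUARDED, registered=True)` is false until the licence has also been agreed.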

Common user scenario
Role: Active Research Professor
Discipline: Sociology of health
Need data: Interviews/testimonies on health behavior
What data: In-depth interviews, with socio-demographic attributes
Expectation: Search and browse catalogue collection-level records; search and browse text; live links to available data
Retrieve: Relevant hits; go to extract. View attributes, metadata, study context. Link to other related items and collection-level data: read – listen – look. Download full textual data in CAQDAS-friendly format
Use for: Content analysis/coding in NVivo software
Publish: Journal article citing data extracts (with PI)

Discovering content and context
Search across collections to find:
collection characteristics, e.g. date, investigator, substantive topics, method (autobiography, life story, ethnography etc.)
item characteristics, e.g. socio-economic attributes of speakers (critical for social science), spoken or written words
Challenge when there is a large number of collections

Study level metadata

Top level catalogue record / keyword index
DDI-2 XML catalogue record; international archival standards (based on ISAD(G))
Citation: standard format with DOI
National Centre for Social Research and University College London. Department of Epidemiology and Public Health (2001) Health Survey for England, 2009 [computer file]. 2nd Edition. Colchester, Essex: UK Data Archive [distributor], SN: 6732,
Keyword index using social science and humanities HASSET thesaurus (ELSST European language)

Documentation: Being a Doctor
Standard documentation for qualitative collection (46 in-depth interviews)
User guide - research report, interview schedule, information for participants and consent form
Data list (Excel and PDF)
Citation file
Read file (information on data preparation)

Data listing

Depositor stories

Self deposit system for smaller datasets

Typical RTF transcript template
Header information from data list
Speaker tags

Audio
Very little in collection
Converted to mp3 for dissemination (download, stream)
Archived in original lossless formats, and converted to open storage formats, e.g. .flac
File names follow a clear logic, to relate to text etc.: 2000Int01.mp Int01.rtf – 2000Essay01.pdf Image01.jpg

Challenge
Discovery challenge when breadth of data collections
Hard to search both across and within content
Challenge of what and how much context?
UKDA has 2 pathways:
Discover collections and download
Search and browse content

Beyond the catalogue: user journey
1. Enter a search term or browse
2. Display the text of an interview transcript on a web page
3. Link to related data such as audio or video
4. Examine the metadata about the interview, e.g. the speaker and various attributes
5. Visit related information, such as external websites holding contextual documentation about the study or topic, e.g. maps
6. Cite an extract for referencing in publication
7. User or new user can annotate an extract

Faceted browsing – common facets
Refine search or browse:
Collection title
Access (showing access conditions) – open or closed
Resource type (type of object, such as interview transcript or image)
Date (of coverage of the materials, not fieldwork dates)
Sex
Age group
Socioeconomic status
Region (to which the data refer)
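
Faceted browsing of this kind comes down to two operations: counting the values of a facet across the current hits, and narrowing the hits by a chosen value. A minimal in-memory sketch follows; the records, field names and helper names (`ITEMS`, `facet_counts`, `refine`) are invented for illustration and are not QualiBank data or code.

```python
from collections import Counter

# Hypothetical object-level records; field names are assumptions.
ITEMS = [
    {"collection": "Collection A", "resource_type": "interview transcript",
     "sex": "female", "region": "North"},
    {"collection": "Collection A", "resource_type": "audio",
     "sex": "male", "region": "North"},
    {"collection": "Collection B", "resource_type": "interview transcript",
     "sex": "female", "region": "South"},
]


def facet_counts(records, facet):
    """Count how many records carry each value of the given facet."""
    return Counter(r[facet] for r in records if facet in r)


def refine(records, **selected):
    """Keep only records matching every selected facet value."""
    return [r for r in records
            if all(r.get(k) == v for k, v in selected.items())]
```

Calling `refine(ITEMS, sex="female")` and then `facet_counts` on the result gives the counts a user would see after ticking one facet.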

Search interface - hits

Target page for an interview

Target page for an audio file

Target references

Fine-grained citation Paragraph level citation in QualiBank (APA citation style). Citation URI resolves back to the paragraph in context
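
A paragraph-level citation needs a stable identifier per paragraph and a URI that carries it. The sketch below assumes an identifier of the form `q-` plus a random UUID (the shape seen in the link fragment on a later slide) and uses a placeholder base address; `BASE`, `new_paragraph_id`, `citation_uri` and `resolve` are hypothetical names, not the QualiBank API.

```python
import uuid
from urllib.parse import parse_qs, urlencode, urlparse

# Placeholder address, NOT the real QualiBank URL.
BASE = "https://example.org/document/"


def new_paragraph_id():
    """Generate a random identifier for one paragraph (assumed 'q-' + UUID)."""
    return f"q-{uuid.uuid4()}"


def citation_uri(paragraph_id):
    """Build a citable URI that carries the paragraph identifier."""
    return f"{BASE}?{urlencode({'id': paragraph_id})}"


def resolve(uri):
    """Recover the paragraph identifier from a citation URI."""
    return parse_qs(urlparse(uri).query)["id"][0]
```

A real service would then look the identifier up and display the paragraph in its surrounding transcript; that lookup is outside this sketch.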

Which and how much metadata?
Primary metadata (core search facets) – available for majority of collections – collection & object level
Additional secondary metadata – depends on the collection type and topics
Metadata population - a lot of manual work!

Metadata demands for UK QualiBank
Discover, retrieve, examine in context, cite
Use known XML schemas to capture context and relationships
Rich descriptive metadata for files
Ability to add new metadata descriptors and be extensible
Pre-defined GUIDs used for citation

Description below the collection
DDI Codebook 2.5 for basic study-level catalogue metadata
Limited use of TEI schema (Text Encoding Initiative) for mark-up of textual data items (e.g. layout and edits)
QuDEx schema (Qualitative Data Exchange) for rich file-level description, document coding and annotation and intra-collection relationships
Schema at: projects/qudex /

Use of Text Encoding Initiative (TEI)
Limited use of TEI elements to denote structural mark-up
TEI header: 3 mandatory elements
Body elements: turn takers, paragraphs, headers
Inline tags: corrections, errors
Use of randomly generated GUIDs to uniquely identify TEI and QuDEx objects: collection, files, paragraphs (any other part of data)
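
As a rough illustration of this limited mark-up, the sketch below builds a small TEI document with a header and one identified element per speaker turn. It assumes the three mandatory header elements are the `titleStmt`, `publicationStmt` and `sourceDesc` children of `fileDesc`, and that turns are encoded with the TEI `u` (utterance) element; the function names and the `q-` identifier prefix are illustrative choices, not the archive's actual templates.

```python
import uuid
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def tei(tag):
    """Qualify a tag name with the TEI namespace."""
    return f"{{{TEI_NS}}}{tag}"


def guid():
    """Random identifier; the letter prefix keeps it a valid xml:id."""
    return f"q-{uuid.uuid4()}"


def build_transcript(title, turns):
    """Build a minimal TEI tree from (speaker, words) pairs."""
    ET.register_namespace("", TEI_NS)
    root = ET.Element(tei("TEI"))
    header = ET.SubElement(root, tei("teiHeader"))
    file_desc = ET.SubElement(header, tei("fileDesc"))
    # The three required children of fileDesc.
    title_stmt = ET.SubElement(file_desc, tei("titleStmt"))
    ET.SubElement(title_stmt, tei("title")).text = title
    pub = ET.SubElement(file_desc, tei("publicationStmt"))
    ET.SubElement(pub, tei("p")).text = "Unpublished example"
    src = ET.SubElement(file_desc, tei("sourceDesc"))
    ET.SubElement(src, tei("p")).text = "Transcribed interview"
    body = ET.SubElement(ET.SubElement(root, tei("text")), tei("body"))
    for speaker, words in turns:
        # One utterance per turn, each with its own random identifier.
        u = ET.SubElement(body, tei("u"),
                          {"who": speaker, f"{{{XML_NS}}}id": guid()})
        u.text = words
    return root
```

`ET.tostring(build_transcript("Interview 1", [("INT", "Hello")]), encoding="unicode")` serialises the result for inspection.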

Essay with School Leavers on the Isle of Sheppey Retain typos

Home Intelligence & Morale Reports nt/?id=q-631d115b-79c7-45ce-9f34-09eda6c2f848

User case study

Events and gatherings
Users meet depositors
Teaching with data – qualitative methods teachers
How to prepare data for sharing
Lots of tried and tested fun exercises
Consent and ethics always popular
Evangelise - everywhere!

QuDEx Overview
XML schema for documenting metadata for qualitative data collections (DDI committee). W3C compliant
Standard way of encoding metadata:
for exchange between CAQDAS packages
for use within data archives and libraries
for dissemination systems
Enables description of complex collections:
detailed description at the object level, e.g. interview characteristics, interview setting, type of object etc.
capture relationships between resources (files)
preserve references to annotations performed on data

QuDEx Collection level metadata

Within-collection object metadata

QuDEx Category schema

XML: School Leavers on the Isle of Sheppey

QualiBank system tools
BaseX for metadata and textual data storage and retrieval
DDI 2.5 / DDI Codebook: collection level
QuDEx: limited collection and object level
TEI: object level - text documents
File server for non-XML docs
Simple QuDEx metadata data entry tool: SharePoint
C# scripts process and validate against XML schemas; Oxygen for manual mark-up
Solr indexes used for faceted browsing and TEI text highlighting
XQuery on BaseX for object metadata, text utterances and related materials
GUID generator
SQL database for QuDEx and TEI elements
UI in-house technologies: .NET and RESTful web services/APIs
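
To show how a Solr index can serve both faceted browsing and text highlighting, here is a sketch that assembles standard Solr request parameters (`q`, `fq`, `facet`, `facet.field`, `hl`, `hl.fl`). The field names and the helper `solr_params` are assumptions for illustration; the actual QualiBank index schema is not described in the slides.

```python
from urllib.parse import urlencode


def solr_params(term, facets, filters=None, highlight_field="text"):
    """Assemble Solr select parameters for a faceted, highlighted search.

    Field names passed in are hypothetical; only the parameter names
    themselves are standard Solr.
    """
    params = [("q", term), ("wt", "json"), ("facet", "true")]
    # One facet.field entry per facet whose value counts should be returned.
    params += [("facet.field", name) for name in facets]
    # Each chosen facet value becomes a filter query narrowing the hits.
    params += [("fq", f'{field}:"{value}"')
               for field, value in (filters or {}).items()]
    # Ask Solr to return highlighted snippets from the transcript text.
    params += [("hl", "true"), ("hl.fl", highlight_field)]
    return params


query_string = urlencode(
    solr_params("rationing", ["sex", "region"], {"sex": "female"})
)
```

The resulting `query_string` would be appended to a Solr core's `/select` address; no request is sent here.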

Improved streamlined workflow!

QuDEx principles

QualiBank guides and references
QuDEx schema dex13.pdf
QualiBank User Guide uide.pdf
Showcasing the QualiBank

For ESRC award holders
Upload data to our ReShare data repository, following guidance
Harvest project information from ESRC Gateway to Research
DataCite DOI assigned
Discover service harvests catalogue information

Idea of volume in ReShare
850 data collections published so far in ReShare
500 were migrated from previous Fedora system
100+ pending in review in the pipeline – being deposited or being sent back after review for actioning

Research Data Discovery System