Powerful access to qualitative data: What’s behind the UK QualiBank

Slides:



Advertisements
Similar presentations
UK DATA ARCHIVE Louise Corti, ODAF April UK Data Archive an internationally-renowned centre of expertise in data acquisition, preservation, dissemination.
Advertisements

Metadata for Digital Content at the Library of Congress Jane Mandelbaum Information Technology Services Library of Congress May 2009.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
ESDS Qualidata: Qualitative Data Preparation and Use John Southall ESDS 26 November 2003.
Setting the scene: the ESRC and JISC vision for access to qualitative data Louise Corti, ESDS Qualidata Economic and Social Data Service, UK Data Archive.
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up SQUAD Libby Bishop Online Qualitative Data Resources: Best Practice in Metadata Creation.
Using secondary qualitative data in interdisciplinary contexts Libby Bishop ESDS Qualidata, University of Essex Working Across Boundaries: 2 nd NCRM Summer.
ESDS Qualidata and QUADS Coordination Louise Corti Online Resources Day 15 November 2005, London.
A Common Standard for Data and Metadata: The ESDS Qualidata XML Schema Libby Bishop ESDS Qualidata – UK Data Archive E-Research Workshop Melbourne 27 April.
ESDS Qualidata Libby Bishop, ESDS Qualidata Economic and Social Data Service UK Data Archive ESDS Awareness Day Friday 5 December 2003Royal Statistical.
Nesstar, ESDS International and ESDS Qualidata online demonstrations ASLIB visit to the UK Data Archive Wednesday 24 November 2004 Louise Corti, Associate.
Metadata workshop, June The Workshop Workshop Timetable introduction to the Go-Geo! project metadata overview Go-Geo! portal hands on session.
UKOLN is supported by: Using the RSLP schema Ann Chapman Collection Description Focus A centre of expertise in digital information management
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
BCAD Architecture 2009 British Cartoon Archive. Projects A project to digitise and catalogue the Carl Giles Archive to current international standards.
Qualitative Data Preparation and Use Jack Kneeshaw ESDS Psychology Department-U of Essex 4 December 2003.
Discove r Humanities and Social Science Electronic Thesaurus - HASSET Faceted search HASSET is the subject thesaurus that the UK Data Service uses to index.
Metadata for Digital Content Jane Mandelbaum, Ann Della Porta, Rebecca Guenther.
NESSTAR - the data archive perspective by Margaret Ward UK Data Archive.
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up SQUAD Louise Corti IASSIST, Edinburgh May 2005.
Digitisation and Access to Archival Collections: A Case Study of the Sofia Municipal Government (1878 – 1879) Maria Nisheva-Pavlova, Pavel Pavlov Faculty.
DATA IN Qualitative Data Acquisitions Process Louise Corti ESDS Qualidata, UKDA IASSIST WORKSHOP 27 May 2003.
Introducing Symposia : “ The digital repository that thinks like a librarian”
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Interoperable Digitised Content “Discover, search, extract, link, associate, and view digitised content” Les Carr.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009.
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up SQUAD Libby Bishop Language and Computation Day University of Essex 4 October 2005.
Open access & visibility Management Digital Preservation ORA: Purposes.
Search Engines. Search Strategies Define the search topic(s) and break it down into its component parts What terms, words or phrases do you use to describe.
International Seminary on Digitisation: Experience and Technology 11 th May 2004 | National Library | Lisbon – Portugal DIGITAL ARCHIVE OF PORTUGUESE ART.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
REPORT BACK FROM THE DDI QUALITATIVE WORKING GROUP ……………………………………………………….………………………………
The New DRS Introduction. What is DRS? Digital repository for preservation and access – Maintains integrity of deposited content – Preserves content for.
Quads.esds.ac.uk/squad THE PROJECT SMART QUALITATIVE DATA: METHODS AND COMMUNITY TOOLS FOR DATA MARK-UP SQUAD aims to explore methodological and technical.
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD Stanford University August 21, 1999.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA …………………………………………………………………………………………………… LOUISE CORTI …………………….…………………………….… UK DATA ARCHIVE.
Oral history as research data CLARIN workshop: Exploring Spoken Word Data in Oral History Archives Oxford April 2016 Louise Corti Director, Collections.
Metadata & Repositories Jackie Knowles RSP Support Officer.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
University of Colorado at Denver and Health Sciences Center Department of Preventive Medicine and Biometrics Contact:
Louise Corti UK Data Archive IASSIST 2007
AEM Digital Asset Management - DAM Author : Nagavardhan
7th Annual Hong Kong Innovative Users Group Meeting
Using JSTOR May 2016.
Science Reference Center
The IPT user interface and data quality tools
Content-level intellectual control for digital archives
Summon discovers contents from one search box!
Markup of Educational Content
Science Reference Center
Building Search Systems for Digital Library Collections
Alison Valk Georgia Tech
Data Management: Documentation & Metadata

DIGITAL LIBRARY.
Enhancing ICPSR metadata with DDI-Lifecycle
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
Introduction into Knowledge and information
Updates on the XSLT stylesheets for DDI
Database Design Hacettepe University
Márton Németh – László Drótos How to catalogue a web archive?
Metadata supported full-text search in a web archive
Presentation transcript:

Powerful access to qualitative data: What’s behind the UK QualiBank Darren Bell Data & Services Developer UK Data Archive IASSIST, Toronto June 2014

QualiBank project: rationale & aims Provide enhanced access to key qualitative data via online data browsing and exploration: UK QualiBank Based on existing metadata schemas and known technologies Offer a mechanism for reliably citing data located in the system Project includes large-scale digitisation of precious and undigitized materials Maximise the impact from existing research and resource investments – demonstrate re-use

UK Data Service and its own needs We have one of the largest qualitative data collections– over 350 data collections A proportion of these have been digitised from older paper sources Currently users find and download these from our website Not so easy to find, but study documentation good No searching within collections No file manifest shown until download It can be a bit of guess work! Have Datacite DOIs; cannot reliably cite parts of data

Finding & accessing qualitative data Search for “health” in our data catalogue, Discover Retrieve catalogue record, e.g. SN 6124: Being a Doctor: a Sociological Analysis, 2005-2006 DDI 2.5 very limited for describing file content View limited user guide Web download as RTF bundle (46 transcripts)

Data listing

Download Zip of data and doc

Complex data collections SN 5801: Concepts of Healthy Eating Food Research: Phases I and II, 1992-1996 293 interview transcripts; 73 diaries; 6 observation field notes Not represented well at all in a DDI 2.X catalogue

Metadata demands for UK QualiBank Explore data through a data journey Find relevant extract, examine in context, cite Link data to still and moving images, and other related research outputs Some collections completely open Demands highly structured and consistently marked-up data Qualitative data requires object (file-level) descriptive metadata, e.g. interviews, audio-visual files, images Use of common metadata elements enable federated catalogues across providers and borders

Description below the collection DDI 2.5 for catalogue metadata QuDEx schema for file level description: allows detailed identification of data objects: Interview transcript or audio recording etc. Descriptive categories at the object level, e.g. mime type, interview characteristics, interview setting Relationship to another data object or part of data Capacity to capture rich annotation of parts of data (e.g an extract) Based on published QuDEx model in use (Schema at: www.data-archive.ac.uk/create-manage/projects/qudex/) Object-level description = a lot of manual work! Limited use of TEI schema for mark-up of textual data items

User expectations Search/browse for data Browse Search: Search /faceted browse of data - text; image/PDF, audio Browse Faceted browse by categories: Collection level, title, date and openess Collection object: data type, interview characteristics, location Search: Display no. hits and minimal item metadata Word in paragraph; thumbnail image/pdf; AV link Context: other related objects,within system or external Access full object View data, key metadata and all related files and links Get citation for part of data

System assumptions BaseX for metadata storage; Java loading; Solr search Data must be fully prepared on loading/publishing to the system. Data not ‘managed’ within the system Mark-up, metadata, relationships all pre-defined Pre-defined GUIDs to be used for citation (DOI + drilldown) Cannot search audio-visual data content Simple QuDEx metadata data entry tool created using SharePoint Technologies for user interface use existing in-house systems, .NET No download of data collection/subset - route to the UK Data Service Citation of selected extract of text; user-annotation possible

UK QualiBank Dataflow

Digitisation of key data sources Selectively digitize paper-based materials: Original survey questionnaires Open ended questions Transcribed interviews Handwritten field notes, essays Diagrams Photographs Destination formats: All text files treated as XML Image files (photos and text) as PDF Audio as mp3

QuDEx collection level metadata

Objects in collection metadata

Object relationships Rich set of verbs available to define relationships between all objects Converse verbs generated automatically:

QuDEx Category Schemes

Use of Text Encoding Initiative (TEI) Minimal use of TEI tags, of massive profile To denote structural mark-up Headers, turn takers, paragraphs Corrections, errors Use of unique GUIDs to identify all QuDEx IDs: Collection, Files, Paragraphs

School Leavers on the Isle of Sheppey

TEI XML: School Leavers on the Isle of Sheppey

Search interface - hits

Target page for an interview

Target references

Audio file target page

Citation mechanism System allows extract/quotation level citation; 1 or more consecutive paragraphs Citation object and citation format created on the fly – using GUIDS and system URI URI resolves directly to the data extract Some more sensitive collections are closed, so cannot resolve to data without login Is related to our collection-level DOIs e.g. 10.5255/UKDA-SN-6124-1

Contact details Darren Bell dbell@essex.ac.uk Louise Corti corti@essex.ac.uk Agustina Martinez a.martinez-garcia@ljmu.ac.uk