Quads.esds.ac.uk/squad THE PROJECT SMART QUALITATIVE DATA: METHODS AND COMMUNITY TOOLS FOR DATA MARK-UP SQUAD aims to explore methodological and technical.

Slides:



Advertisements
Similar presentations
THE DONOR PROJECT Titia van der Werf-Davelaar. Project Financed by: Innovation of Scientific Information Provision (IWI) Duration: –phase 1: 1 may 1998.
Advertisements

ESDS Qualidata: Qualitative Data Preparation and Use John Southall ESDS 26 November 2003.
New Services for Users Enhanced User Support and Enhanced Access to Data Angela Dale, Head ESDS Government Melanie Wright, Head ESDS Access & Preservation.
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up (SQUAD) Louise Corti UK Data Archive, University of Essex QUADS Demonstrator Workshop.
Using Atlas-ti to explore qualitative data Libby Bishop and Louise Corti, UK Data Archive, ESDS, University of Essex IASSIST 2004 workshop.
Depositing Data for Archiving Libby Bishop ESDS Qualidata, University of Essex Changing Families, Changing Food Meeting University of Sheffield 15 March.
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up SQUAD Libby Bishop Online Qualitative Data Resources: Best Practice in Metadata Creation.
Using secondary qualitative data in interdisciplinary contexts Libby Bishop ESDS Qualidata, University of Essex Working Across Boundaries: 2 nd NCRM Summer.
Issues in methods and reuse for hypermedia ethnography Presented at QUADS Showcase day September 28, 2006 Louise Corti.
ESDS Qualidata and QUADS Coordination Louise Corti Online Resources Day 15 November 2005, London.
QUALITATIVE ARCHIVING AND DATA SHARING SCHEME WHO WE ARE QUADS is the ESRC Qualitative Archiving and Data Sharing Scheme, running from April 2005 until.
A Common Standard for Data and Metadata: The ESDS Qualidata XML Schema Libby Bishop ESDS Qualidata – UK Data Archive E-Research Workshop Melbourne 27 April.
HAND OUTS DExT Project UK Data Archive September 2007.
A DTD for Qualitative Data: Extending the DDI to Mark-up the Content of Non-numeric Data Libby Bishop and Louise Corti, UK Data Archive, ESDS, University.
ESDS Qualidata Libby Bishop, ESDS Qualidata Economic and Social Data Service UK Data Archive ESDS Awareness Day Friday 5 December 2003Royal Statistical.
New features for ESDS Qualidata Online Libby Bishop UK Data Archive, University of Essex QUADS Demonstrator Workshop 28 September 2006.
Nesstar, ESDS International and ESDS Qualidata online demonstrations ASLIB visit to the UK Data Archive Wednesday 24 November 2004 Louise Corti, Associate.
Data Exchange and Conversion Utilities and Tools (DExT) Louise Corti, Angad Bhat, Herve LHours UK Data Archive CAQDAS Conference, April 2007.
QUADS Co-ordination Louise Corti QUADS Director, UKDA 28 September 2006.
Delivering textual resources. Overview Getting the text ready – decisions & costs Structures for delivery Full text Marked-up Image and text Indexed How.
New Directions for ESDS Qualidata: 2003 and beyond Louise Corti, Head ESDS Qualidata Economic and Social Data Service UK Data Archive IASSIST 2003.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Documenting the Resource Malcolm Polfreman
A Common Standard for Data and Metadata: The ESDS Qualidata Document Type Definition (DTD) Libby Bishop Online Qualitative Data Resources: Best Practice.
Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up SQUAD Louise Corti IASSIST, Edinburgh May 2005.
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up (SQUAD) Louise Corti and Libby Bishop UK Data Archive, University of Essex IASSIST.
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
M.Sc. of Advanced Software Engineering CO7206 System Reengineering XML & AST Many Slides are by Georgios Koutsoukos.
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
Dr. Kurt Fendt, Comparative Media Studies, MIT MetaMedia An Open Platform for Media Annotation and Sharing Workshop "Online Archives:
DExT PROJECT Louise Corti UK Data Archive University of Essex Colchester, Essex CO4 3SQ Tel: +44 (0) URL:
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Interoperability: Where the irresistible force of flexibility meets the immovable.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up (SQUAD) Louise Corti UK Data Archive, University of Essex ASC Conference 29 September.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009.
Extensible Markup Language (XML) Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879).ISO 8879 XML is a.
UK DATA ARCHIVE-NLP COLLABORATION Louise Corti and Claire Grover UK Data Archive University of Essex Colchester, Essex CO4 3SQ
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up SQUAD Libby Bishop Language and Computation Day University of Essex 4 October 2005.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Introduction ESDS Qualidata John Southall ESDS Creating and delivering re-usable qualitative data 24 June 2004.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
REPORT BACK FROM THE DDI QUALITATIVE WORKING GROUP ……………………………………………………….………………………………
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
An exercise in preservation and applied technology Making an Electronic Text.
Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.
WP 3: Standardisation of shared metadata Mode of operation –All partners are involved –Building on practice outside the project Achievements of Year 1.
REPRESENTING CONTEXT IN AN ARCHIVE OF EDUCATIONAL EVALUATIONS PROJECT ACTIVITIES The project team canvassed opinion across the.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
REPRESENTING CONTEXT IN AN ARCHIVE OF EDUCATIONAL EVALUATIONS The project has constructed a permanent archive of significant.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Managing Semi-Structured Data. Is the web a database?
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Chapter 29. Copyright 2003, Paradigm Publishing Inc. CHAPTER 29 BACKNEXTEND 29-2 LINKS TO OBJECTIVES Attach an XML Schema Attach an XML Schema Load XML.
Delivering textual and visual resources. Overview Case studies Methods for providing access Structures for delivery Full text Marked-up Image and text.
METHODOLOGICAL ISSUES IN QUALITATIVE DATA SHARING AND ARCHIVING THE PROJECT TEAM CONTACT Dr Bella Dicks Cardiff School.
METADATA ORGANISATION ESDS APPROACHES AND RESOURCES …………………………………………
1 Annotation Framework March Terminology CV - abbreviation for controlled vocabulary CRS - Community Review System (a collection within DLESE)
Oral history as research data CLARIN workshop: Exploring Spoken Word Data in Oral History Archives Oxford April 2016 Louise Corti Director, Collections.
Louise Corti UK Data Archive IASSIST 2007
Powerful access to qualitative data: What’s behind the UK QualiBank
Metadata for research outputs management
Metadata in Digital Preservation: Setting the Scene
Business Intelligence
Presentation transcript:

quads.esds.ac.uk/squad THE PROJECT SMART QUALITATIVE DATA: METHODS AND COMMUNITY TOOLS FOR DATA MARK-UP SQUAD aims to explore methodological and technical solutions for ‘exposing’ digital qualitative data to make them fully shareable and exploitable. The main objectives are to: specify, test and propose an eXtended Markup Language (XML) schema for storing and marking up qualitative data investigate requirements for contextualising qualitative data and developing standards for data documentation develop semi-automated using natural language processing (NLP) tools for preparing marked up qualitative data for sharing research tools for publishing and interrogating data via the web – Qualitative Data Mark-Up Tools (QDMT) WHAT FEATURES OF TEXT CAN BE MARKED UP? Spoken interview texts provide the clearest and most common example of the types of encoding features that can be marked up. There are three basic groups of structural features: utterance, specific turn taker, defining idiosyncrasies in transcription links to analytic annotation and other data types (e.g. thematic codes,concepts,audio or video links, researcher annotations) identifying information such as real names, company names, place names, occupations, temporal information USING NLP TOOLS Information Extraction (IE) is a sub-field of NLP which aims to identify key pieces of information in texts using 'shallow' analysis techniques. A typical IE system will perform Named Entity Recognition where particular kinds of proper names and terms are identified, classified and marked up. This is a means of annotating documents with semantic metadata – enabling resource discovery and data exploration. The Edinburgh LT-XML and CME tools have been used to process the data. Example: Italy's business world was rocked by the announcement last Thursday that Mr. Verdi would leave his job as vice- president of Music Masters of Milan, Inc to become operations director of Arthur Anderson. DEFINING CONTEXT Rich context enables informed re-use of data. But defining how to provide context for raw data to make it more ‘usable’ is complex. ESDS Qualidata has done much to establish informal ways of documenting raw data. Micro and macro level features should be considered including: Fieldwork observations are useful as are timelines and political chronologies. Equally when undertaking a replication or restudy, detailed information on sampling procedures, field work approaches and question guides will be essential. SQUAD has identified a minimal generic set of elements that represent a baseline for contextualising data. how the research question was framed the research application process project progress fieldwork situations analyses processes

THE PROJECT TEAMCONTACT quads.esds.ac.uk/squad METADATA STANDARDS SMART QUALITATIVE DATA: METHODS AND COMMUNITY TOOLS FOR DATA MARK-UP Louise Corti and Claire Grover UK Data Archive University of Essex Colchester, Essex CO4 3SQ Tel: +44 (0) URL: quads.esds.ac.uk/squad Libby Bishop Louise Corti Claire Grover Maria Milosavljevic core tag set for transcription names, numbers, dates links and cross references notes and annotations text structure unique to spoken texts linking, segmentation and alignment advanced pointing - XPointer framework text and AV synchronisation contextual information (participants, setting, text) The XML schema will specify a ‘reduced’ set of Text Encoding Initiative (TEI) elements: ANONYMISING DATA TOOL This tool imports marked up data from from the Edinburgh pipeline system. Named entities are highlighted and co-reference chains – e.g numerous references to a single person - are identified. Annotations are explored in an XML format in the NITE NXT model. NXT uses ‘stand off’ annotation – where annotation is linked to or referenced by words. Names can be anonymised with chosen pseudonyms. The references of names to pseudonyms is saved. DATA EXCHANGE STANDARDS A uniform format for richly encoding qualitative research is necessary as it: enables preservation and re- use of metadata, data and annotation; ensures consistency of presentation and description of data; supports the development of common web-based publishing and search tools; and facilitates data interchange and comparison among datasets. SQUAD has produced a limited formal definition of a common XML vocabulary and DTD based on the TEI and tested a new Qualitative Data Interchange Format (QDIF). defined header metadata for a standardised transcript defined and tested generic XML models for qualitative data tested and refined NLP tools for qualitative data built front end to NLP named entity tools chosen software to enable annotation of data explored export formats for longer-term archiving investigated powerful XML based indexing tools for searching and retrieving data investigated web display of multimedia data and pointers to other resources using XML – extending the functionality of ESDS Qualidata TOOLS PROGRESS Mijail Alexandrov Kabadjov interview text with XML tags embedded There's just one or two factual things first of all do you mind my asking how old you are? 49. And what schools did you go to? - King Street