DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

Slides:



Advertisements
Similar presentations
1 Integrating user environments and data liquidity to improve the research experience.
Advertisements

EPortfolio research ePortfolio research: Purpose and audience What is the purpose? Who must the research inform (audience)? What specific research questions.
An Introduction to the UK Data Archive and the Economic and Social Data Service November 2007 Jack Kneeshaw, UKDA.
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
ICT in Arts and Humanities Research e-Science in the Arts and Humanities 7 July 2006.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Data Management: Metadata, Repositories and Curation Tony Mathys, Anne Robertson Eddie Boyle, Guy McGarva GeoForum, 4 th November, York.
Information Types and Registries Giridhar Manepalli Corporation for National Research Initiatives Strategies for Discovering Online Data BRDI Symposium.
SHERPA: institutional repositories Bill Hubbard SHERPA Project Manager University of Nottingham.
The Digital Workplace: Organisational Metadata and Recordkeeping 6 June 2007 Noni Oldfield.
Multilingual Information Access in a Digital Library Vamshi Ambati, Rohini U, Pramod, N Balakrishnan and Raj Reddy International Institute of Information.
School of Environmental Sciences University of East Anglia
Old Bailey Proceedings Online Mi Michael Pidd, Humanities Research Institute.
Costs of Digital Archiving the case of DANS Anna Palaiologk | Heiko Tjalsma | Laurents Sesink |
 Manmatha MetaSearch R. Manmatha, Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts, Amherst.
The Text Analysis Portal for Research (TAPoR) and Research Needs in the Digital Humanities Ray Siemens CRC Humanities Computing 2005 UVic.
Tags, Networks, Narrative Explorations in Folksonomy Sue Thomas and Bruce Mason IOCT, De Montfort University 30 th January 2007.
Three Years Later: Lessons Learned from Establishing a Metadata Service Marty Kurth PCC Policy Committee Meeting November 5, 2004.
Research data spring Enabling Complex Analysis of Large Scale Digital Collections 14/7/2015 Lots of money has been spent digitising heritage collections.
Science as an Open Enterprise: Open Data for Open Science Professor Brian Collins CB, FREng UCL, June 2012 Emerging conclusions from a Royal Society Policy.
Digital Media Technology Week 8: XSLT 3. Seminar 11 November □ One long seminar (four hours) □ Exports from UBL catalogue □ Records contain data about.
Representation of Data in Computer Systems
DIGITIZATION OF RARE LIBRARY MATERIALS Metadata Format Access to Digital Documents © Adolf Knoll, National Library of the Czech Republic.
LIBER Digitisation Conference, Copenhagen The cost of digitisation and preservation: The LIFE Project October 2007 Richard Davies LIFE 2 Project.
LIFE 3 LIFE3: Predicting Long Term Preservation Costs Paul Wheatley Digital Preservation Manager The British Library.
LIFE 3 LIFE 3 : Predicting Long Term Preservation Costs Brian Hole LIFE 3 Project Manager The British Library KeepIt training course 05/02/10.
Dr. Kurt Fendt, Comparative Media Studies, MIT MetaMedia An Open Platform for Media Annotation and Sharing Workshop "Online Archives:
Final Search Terms: Archiving (digital or data) Authentication (data) Conservation (digital or data) Curation (digital or data) Cyberinfrastructure Data.
Interoperable Digitised Content “Discover, search, extract, link, associate, and view digitised content” Les Carr.
Challenges & opportunities in the preservation of (digital) information: the case of European research libraries Museo de las Ciencias Teatro de UNIVERSUM.
Seminar on New Frontiers for Statistical Data Collection WP 30 Moving to common survey tools and processes – the ABS experience Jenine Borowik, Adrian.
ERPANET pre-conference workshop, Glasgow 30 August 2004 Hans Hofman Nationaal Archief Netherlands Co-Director ERPANET ERPANET seminar Glasgow, 30 August.
ResearchData.arts.ac.uk The Rococo Project – A case study.
Planning Digitisation Projects Aly Conteh The British Library 30/11/2012 CERL Annual Seminar.
Amos Kujenga ADLSN Training Coordinator Addis Ababa, Ethiopia 5 – 7 November 2014 Introduction To Digital Libraries and Repositories.
DIGITAL FORENSICS Forensic Toolkit: a tool to process born digital records Emma Jolley Curator of Digital Archives.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
Introduction to the sessions & structure of the Hackathon Paul Wheatley British Library / OPF / DPC.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models.
MODELLING THE DIGITAL PRESERVATION COSTS Paul Wheatley Digital Preservation Manager British Library.
VIEWNG SYSTEMS There are two ‘viewing systems’ that we use to visually interpret the world around us. What do you think the two ‘viewing systems’ are?
A Semantic Knowledge Base for the UK Government Web Archive Tom Storrar & Claire Newing Applying records management processes principles to the open government.
Open Data for Open Science: implications for European universities Geoffrey Boulton EUA, Brussels 2012 Some emerging conclusions from a Royal Society Policy.
Information Design Trends Unit 4 : Sources and Standards Lecture 1: Content Management Part 1.
Sharing Research Data with: OC Data Portal: ocdp.lib.uci.edu UC Irvine Dash: dash.lib.uci.edu Dan Tsang, Data Librarian Julia Gelfand, Applied Sciences.
Digital Preservation: The State of the Art November 1999 Update Marc Fresko CONSOLIDATING THE EUROPEAN LIBRARY SPACE - 18 NOVEMBER 1999 © Copyright Applerace.
Visualising the Old Bailey Proceedings as a Digital Panopticon Dataset RICHARD WARD RESEARCH ASSOCIATE, DIGITAL PANOPTICON PROJECT 14 APRIL 2014
UNEP Live. What is UNEP Live? - An on-line knowledge management platform - Focuses on open access to global, regional and national data and knowledge.
Archiving CAD in Archaeology: Ingest to Dissemination (or The ADS experience to date) Kieron Niven Archaeology Data Service, University of York, UK.
Kathleen Shearer Data management: The new frontier for libraries.
Science Fair Examples. Sketching Research: Where do I begin? Before you begin your research, you should ask yourself some questions. These will help.
Image and Sound Representation
The most commonly used Boolean Search terms are AND, OR, and NOT.
Lesson Objectives Aims From the spec:
HEADING OF PRESENTATION. HEADING OF PRESENTATION.
Evaluation of Information Literacy Education
Campus Cyberinfrastructure
Linking persistent identifiers at the British Library
MINED Approach Current Practice Applications Material specimen library
Generic Statistical Business Process Model (GSBPM)
Multilingual Information Access in a Digital Library
Introduction to Digital Libraries Assignment #3
ICT Programming Lesson 1:
Research Data Management
Introduction to Digital Libraries Assignment #3
Introduction to Digital Libraries Assignment #3
Introduction to Digital Libraries Assignment #4
Jez Cope, Data Services Lead, The British Library
Presentation transcript:

DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

The data lifecycle in a typical digital humanities project: 1.Acquisition (e.g. digitisation) 2.Processing (adding value) 3.Analysis (and dissemination) Data in the humanities is usually: 1.Small (discrete sources created by individuals). 2.Broad (many different types of sources have to be assembled). 3.Complex (because humans are not spreadsheets). Rarely ‘Big’.

Data acquisition: 1.Most of the evidence base is pre- digital. Very little is ‘born digital’. 2.Data acquisition is a question of translation, representation and interpretation. 3.The methods we use either enable or inhibit research. 4.But, the process also develops intimate knowledge of the evidence.

British Library Newspapers Keyword search for “pidd” gives 2,730 results…

Data processing: 1.Metadata can be complex, reflecting the complexity of the data. 2.Metadata can be very specialised, limiting re-use. 3.When processed at scale, computational methods are a trade-off between through-put and accuracy.

Nominal record linkage using computational means to trace the lives of 90,000 people. Record linkage across 45 separate datasets (some public, some commercial, all in different formats and with different data models). And most people have common names.

Analysing data: Do data visualisations tell us anything that we do not already know? Data visualisation is only as good as the data. Data visualisation should reveal trends and anomalies, directing us to deeper readings of the evidence.