Published by Alyson Marsh. Modified over 9 years ago.
1
Digital Media Technology Week 8: XSLT 3
2
Seminar 11 November
□ One long seminar (four hours)
□ Exports from the UBL catalogue
□ Records contain data about dates of publication, languages and subjects of books
□ Groups first work separately on assignments; results are discussed during presentations
3
□ Natural language text is rife with ambiguities and irregularities
□ XML bridges the gap between linear texts and discrete data
4
Big data
Examples:
□ Production of 20,000 exabytes on a yearly basis
□ 50 million tweets sent daily
□ 84,000 hours of video uploaded daily on YouTube
□ David Leahy: Three V’s of big data: Volume, Velocity and Variety
5
The End of Theory?
6
E-science and e-research
□ E-science: a confluence of three developments: big data, collaboration and grid computing (Wouters and Beaulieu, 2009)
□ E-research: a more inclusive term, covering the various ways in which computer-based methodologies can transform scholarly and scientific practices
8
Digital Humanities
□ Focus on the various ways in which the computer can be used to investigate traditional questions in the humanities
□ Investigation of the phenomenon of computation from a humanities perspective
9
Terminology
□ Digital Humanities, Humanities Computing, e-Humanities, Humanistic Informatics
□ The term does not cover pure digitisation or the use of word processors, weblogs or e-mail applications
10
Blackwell Companion to Digital Humanities
11
□ Alliance of DH Organisations, EADH, CenterNet
□ DH conferences, THATCamp
□ Digital Humanities Quarterly, Journal of Digital Humanities, Literary and Linguistic Computing
12
Father Busa’s Index Thomisticus
□ PhD research on the notion of ‘presence’ in the work of Thomas Aquinas
□ Restructured and transformed version of the text
□ Rationale: dealing with large quantities of text
13
Text Collections
□ Mass digitisation projects:
  □ Million Book Project at Carnegie Mellon: ca. 1,500,000 titles
  □ Project Gutenberg: ca. 70,000 titles
  □ Delpher: 90,000 books; 1 million newspapers
  □ Google Books: ca. 15 million
14
Digital Scholarly Editions
□ William Blake Archive
□ Rossetti Archive
15
26,250 days: “If we could read a book on each of those days, it would take almost forty lifetimes to work through every volume in a single million book library”
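The arithmetic behind the quotation can be checked with a short sketch. It assumes, as the quotation does, one book per day for 26,250 days; the conversion to years uses an average year length of 365.25 days, which is an assumption added here and not part of the slide.

```python
# Figures taken from the slide.
DAYS_PER_LIFETIME = 26_250   # reading days in one lifetime
LIBRARY_SIZE = 1_000_000     # "a single million book library"

# One book per day: how many lifetimes does the whole library take?
lifetimes = LIBRARY_SIZE / DAYS_PER_LIFETIME

# Assumption (not on the slide): 365.25 days per year on average.
years_per_lifetime = DAYS_PER_LIFETIME / 365.25

print(f"Lifetimes needed: {lifetimes:.1f}")
print(f"Years per lifetime: {years_per_lifetime:.1f}")
```

The result is a little over 38 lifetimes, which matches the “almost forty” in the quotation.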
16
Distant reading vs. Close reading
17
Google n-gram viewer & culturomics
18
Optical character recognition
20
Repetitions in individual works
21
Questions
□ What sort of knowledge is produced?
□ Is this objective knowledge? A positivist approach within the humanities?
□ Can the application of an algorithm be considered a form of reading?
22
XML and XSLT
□ XML divides a linear text into discrete units
□ XSLT and XPath can be used to analyse these units in a quantitative way: e.g. counts of elements, string lengths, number of words
□ Example during seminar: play by Oscar Wilde, encoded in TEI
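The kind of quantitative analysis described on this slide can be sketched as follows. This is a minimal illustration in Python using the standard library's `xml.etree.ElementTree`, which supports only a limited XPath subset; it is not the XSLT used in the seminar. The TEI-style fragment, the speaker identifiers `#A` and `#B` and the variable names are all invented for the example and do not come from the seminar file. In XSLT, the corresponding XPath functions would be `count()` and `string-length()`.

```python
import xml.etree.ElementTree as ET

# Invented TEI-style fragment, for illustration only (not the seminar file).
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <text><body>
    <sp who="#A"><speaker>A</speaker><p>Good morning to you.</p></sp>
    <sp who="#B"><speaker>B</speaker><p>Is it morning already?</p></sp>
    <sp who="#A"><speaker>A</speaker><p>It is.</p></sp>
  </body></text>
</TEI>"""

# TEI elements live in this namespace, so every path needs the prefix.
NS = {"tei": "http://www.tei-c.org/ns/1.0"}

root = ET.fromstring(SAMPLE)

# Count of elements: every <sp> (speech) in the document.
speeches = root.findall(".//tei:sp", NS)
speech_count = len(speeches)

# Per speaker: number of speeches, string length and number of words.
stats = {}
for sp in speeches:
    speaker = sp.get("who")
    # Join the text of all <p> children, leaving out the <speaker> label.
    text = " ".join(
        "".join(p.itertext()).strip() for p in sp.findall("tei:p", NS)
    )
    entry = stats.setdefault(speaker, {"speeches": 0, "chars": 0, "words": 0})
    entry["speeches"] += 1
    entry["chars"] += len(text)
    entry["words"] += len(text.split())

print("Speeches in total:", speech_count)
for speaker, entry in sorted(stats.items()):
    print(speaker, entry)
```

Words are counted here by splitting on whitespace, which is a simplification: punctuation stays attached to the neighbouring word.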