Presentation is loading. Please wait.

Presentation is loading. Please wait.

Digital Media Technology Week 8: XSLT 3. Seminar 11 November □ One long seminar (four hours) □ Exports from UBL catalogue □ Records contain data about.

Similar presentations


Presentation on theme: "Digital Media Technology Week 8: XSLT 3. Seminar 11 November □ One long seminar (four hours) □ Exports from UBL catalogue □ Records contain data about."— Presentation transcript:

1 Digital Media Technology Week 8: XSLT 3

2 Seminar 11 November □ One long seminar (four hours) □ Exports from UBL catalogue □ Records contain data about dates of publication, languages, subjects of books □ Groups firstly work separately on assignments; results discussed during presentations

3 □ Natural language text is rife with ambiguities and irregularities □ XML bridges the gap in between linear texts and discrete data

4 Examples: □ Production of 20,000 exabytes on a yearly basis20,000 exabytes □ 50 million tweets sent daily 50 million tweets □ 84,000 hours of video uploaded daily on YouTube 84,000 hours of video □ David Leahy: Three V’s of big data: Volume, Velocity and VarietyThree V’s of big data Big data

5 The End of Theory?

6 □ E-science: a confluence of three developments: big data, collaboration and grid-computing (Wouters and Beaulieu, 2009)Wouters and Beaulieu □ E-research: a more inclusive term: various ways in which computer-based methodologies can transform scholarly and scientific practices Escience and e-research

7

8 Digital Humanities □ Focus on the various ways in which the computer can be used to investigate traditional questions in the humanities. □ Investigation of the phenomenon of computation from a humanities perspective

9 Terminology □ Digital Humanities, Humanities Computing, e-Humanities, Humanistic Informatics □ The term does not cover pure digitisation, use of word processors, weblogs, e-mail applications

10 Blackwell Companion to Digital Humanities

11 □ Alliance of DH Organisations, EADH, CenterNet Alliance of DH Organisations EADH CenterNet □ DH conferences, THATCamp □ Digital Humanities Quarterly, Journal of Digital Humanities, Literary and Linguistic Computing

12 Father Busa’s Index Thomasticus □ PhD research on the notion of ‘presence’ in the work of Thomas Aquinas □ Restructured and transformed version of the text □ Rationale: dealing with large quantities of text

13 Text Collections □ Mass digitisation projects: □ Million Book Project at Carnegie Mellon: ca. 1,500,000 titles □ Project Gutenberg: ca. 70,000 titles; □ Delpher: 90.000 books; 1 million newspapers □ Google Books: ca. 15 million

14 Digital Scholarly Editions William Blake Archive Rosetti Archive

15 26.250 days: “If we could read a book on each of those days, it would take almost forty lifetimes to work through every volume in a single million book library”

16 Distant reading vs. Close reading

17 Google n-gram viewer & culturonomics

18 Optical Character recognition

19

20 Repetitions in individual works

21 Questions □ What sort of knowledge is produced? □ Is this objective knowledge? A positivist approach within the humanities? □ Can the application of an algorithm be considered a form of reading?

22 XML and XSLT □ XML divides a linear text into discrete units □ XSLT and Xpath can be used to analyse these units in quantitative way: e.g. counts of elements, string lengths, number of words □ Example during seminar: play by Oscar Wilde, encoded in TEI


Download ppt "Digital Media Technology Week 8: XSLT 3. Seminar 11 November □ One long seminar (four hours) □ Exports from UBL catalogue □ Records contain data about."

Similar presentations


Ads by Google