Presentation is loading. Please wait.

Presentation is loading. Please wait.

Peter Grzybek & Ernst Stadlober  Austrian Research Fund  Project #15485 Quantitative Text.

Similar presentations


Presentation on theme: "Peter Grzybek & Ernst Stadlober  Austrian Research Fund  Project #15485 Quantitative Text."— Presentation transcript:

1 Peter Grzybek & Ernst Stadlober http://www-gewi.uni-graz.at/quanta  http://quanta.uni-graz.at Austrian Research Fund  Project #15485 Quantitative Text Typology

2 … A Universe of Texts Let‘s suppose there is …

3 Is the Universe Structured ? Or Can We Structure it ? How Can the Text Universe Be Structured?

4 Corpus Analysis vs. Text Analysis „Text Mixture“ (Re-)Construction of a norm of a standard of „language“Text As a Homogeneous Entity Complete Text („Quasi Text“) Self-regulating System

5 What is a Text ? Complete novel, composed of books ? Complete book of a novel, consisting of several chapters ? Individual chapters ? Dialogical vs. narrative sequences within a text ? Two Major Problems: 1.Data Homogeneity 2.Definition of Basic Analytical Units

6 Both problems relevant for quantitative approaches WHY QUANTITATIVE APPROACHES ? ASSUMPTION: If a ‚text‘ is governed by synergetic processes, these processes can and must be quantitatively described. The descriptive models obtained for each ‚text‘, can be compaired to each other, possibly resulting in one or more general model(s). Thus, a quantitative typology of texts can be obtained.

7 Synergetics In a Nutshell – Frequencies and Dependencies WHY WORD LENGTH ? Word Length: Graphemes, Phonemes, Syllables, Morphemes,…

8 TYPES OF TEXT TYPOLOGIES I. Qualitative II. Quantitative-Qualitative a.Tabula Rasa Principle (Clustering Methods) b.A-priori  A-posteriori Principle (Discrimination Methods)

9 Structuring the Text Universe (Ia): Text Sorts

10 Structuring the Text Universe (Ib): Functional Styles

11 In a qualitative approach, the text universe is structured with regard to external (pragmatic) factors („with reference to the world“) general communicative functions of language (functional styles) specific situational functions (text sorts)

12 Top-Down Bottom-Up

13 Top-Down Bottom-Up First and Second Order Cross Comparisons

14 Intended Emphasis on Letters ‚Letter‘ as a Prototype of Language 1.Located between Oral and Written Communication 2.Result of One Homogeneous Process of Text Generation

15

16 FUNCTIONAL STYLE AUTHOR(S)TEXT TYPE(S)NUMBER EVERYDAY LANGUAGE Cankar, JurčičPrivate Letters61 PUBLIC STYLEdiv. anon.Open Letters29 JOURNALISMdiv. anon.Readers‘ Letters, Comments 65 ARTISTIC STYLE Prose Cankar Švigelj-Mérat / Kolšek Individual Chapters from Short Novels („povest“) Letters from an Epistolary Novel 68 93 PoetryGregorčičVersified Poems40 DramaJančarIndividual Acts from Dramas42 Textbasis (398 Slovenian Texts)

17 A Small World of Texts Word Length Frequencies (in %) of Four Texts Literary Prose Text (#256)Versified Poetic Text (#359) Journalistic Comment (#324)Private Letter (#1)

18 Post-Hoc-Tests (Text Sorts) Groups without significant differences form „homogeneous subgroups“ a.Homogeneous subgroups do exist b.All four letter types in different subgroups !

19 Post-Hoc-Analyses  Homogeneous Subgroups Discriminant analyses  Cases are attributed to groups, on the basis of specific predictor variables The variables are submitted to linear transformations in order to arrive at an optimal discrimination of the individual cases

20 Discriminant Analysis: Eight Text Sorts Discrimination variables: m 1, m 2, v, p 1 (56.30%)

21 Discriminant Analysis: Four Letter Types (n=213) {Private L.} {Ep. Novel} {Readers‘ L.} {Open L.} Discrimination variables: m 1, v 70.40 %

22 Discriminant Analysis: Three Letters Types (n=213) {Private L., Ep. Novel} {Readers‘ L.} {Open L.} Discrimination variables: m 1, p 2 86.90 %  Distinction of Literary Letters Irrelevant ?

23 Discriminant Analysis: Private vs. Public Letters (n=213) {Private L., Ep. Novel}, {Readers‘ & Open L.} Discrimination variables: m 1, p 2 92.00 %  Distinction of Private vs. Public Styles ?

24 Discriminant Analysis: Private vs. Public Texts (n=248) {Private L., Ep. Novel}, {Readers‘ & Open L., Comments} Discrimination variables: m 1, p 2 91.10 %  Public vs. Private Styles ?

25 Discriminant Analysis: Private/Oral vs. Public/Written Texts (n=290) {Private L., Ep. Novel, Drama}, {Readers‘ & Open L., Comments} Discrimination variables: m 1, p 2 92.40 %  Oral vs. Written Styles ?

26  Towards a New Typology ? Discriminant Analysis: Three Text Types (n=330) {Private / Oral} {Public / Written} {Verse} Discrimination variables: m 1, p 2, v 91.20 %

27 Discriminant Analysis: Four Text Types (n=398) {Private / Oral} {Public / Written} {Prose} {Verse} Discrimination variables: m 1, p 2, v 79.90 %

28 Discriminant Analysis: Three Text Types (n=398) {Private / Oral} {Public / Written / Prose} {Verse} Discrimination variables: m 1, p 2, v 92.70 %

29 This is the End …


Download ppt "Peter Grzybek & Ernst Stadlober  Austrian Research Fund  Project #15485 Quantitative Text."

Similar presentations


Ads by Google