Presentation is loading. Please wait.

Presentation is loading. Please wait.

Andrej A. Kibrik (Iling RAS and Lomonosov MSU)

Similar presentations


Presentation on theme: "Andrej A. Kibrik (Iling RAS and Lomonosov MSU)"— Presentation transcript:

1 Andrej A. Kibrik (Iling RAS and Lomonosov MSU)
ИНТЕГРИРОВАННОЕ ИССЛЕДОВАНИЕ МУЛЬТИКАНАЛЬНОГО ДИСКУРСА AN INTEGRATED STUDY OF MULTICHANNEL DISCOURSE Andrej A. Kibrik (Iling RAS and Lomonosov MSU) Vera I. Podlesskaya (RSUH) Olga V. Fedorova (Lomonosov MSU) Integrated Approach to the Problem of Knowledge Representation RSUH, Проект # 1

2 The dominant practice The actual practice of language studies is largely oriented to writing Linell 1982, “The written language bias in linguistics” The same is true of language studies in psychology Some important exceptions E.A.Zemskaja, O.A.Lapteva, O.B.Sirotinina Discourse psychology Writing is a late, derived, and “unusual” way of language use The dominant practice could be compared to describing trees on the basis of furniture Indeed, some properties of wood are preserved even after the carpenter’s work But if we want real botany, we have to go to the forest and see how trees grow from the ground

3 Not just speech When we communicate naturally, we not only produce chains of words, but also intonate gesticulate interact with eye gaze etc. These processes are traditionally studied by different academic disciplines But the actual communication process is whole and undivided Hence multimodal approach Gibbon et al. eds. 2000, Kress 2002, Granström et al. eds. 2002, Scollon 2006, Kibrik 2010, Knight 2011, Adolphs & Carter 2013, Müller et al. eds …

4 Spoken multichannel discourse
Language? Discourse Vocal / auditory modality Kinetic / visual modality Verbal channel Prosodic channel Gaze channel Facial expressions channel Gesture channel Other modalities Proxemics channel Other channels Intonation Numerous other components Manual gestures Cephalic gestures Corporal gestures Other gestures по-моему движения головы важнее. Во всяком случае мы их обязались размечать на втором этапе, когда еще без мимики и положения тела

5 LANGUAGE AS IS: A multichannel initiative
Goals: Create a resource approaching the actual richness of human discourse Explore natural communication in an integrated way Registered phenomena verbal structure prosody gesture (and other aspects of “body language”) eye gaze

6 Outline 1. Resource 2. Some preliminary findings
Character of interaction: structured vs. unstructured Character of environment: prepared vs. unprepared Pear chats and stories corpus Design Technical solutions Annotation Underlying idea: sharpen tools for the subsequent stage of free conversation 2. Some preliminary findings

7 Pear chats and stories The Pear Film (Chafe 1980)

8 Design Listener Narrator Reteller Commentator
The Narrator and the Commentator watch the film The Narrator tells the Reteller about the film The Commentator adds details, all three discuss the film The Reteller tells about the film to the Listener, who has just joined the group The Listener writes down the contents of the film telling chat retelling 8 2nd retelling (written) 8

9 Audio recording Six channels ZOOM H6 Handy Recorder 96 kHz / 24 bit
Each participant recorded with a lapel SONY ECM-88B mic, mono Inbuilt mic records all vocal events, stereo Automatic synchronization of all audio files

10 Video recording: Cover shot
GoPro Hero 4 (wide angle) Frame rate: 50 FPS Resolution 2700х1500

11 Video recording: Individual frontal cameras
Industrial high-speed cameras JAI GO-5000M-USB Frame rate: 100 FPS Crucial for analysis of kinetic behavior File format: mjpeg No interframe compression Resolution 1392х1000 No audio

12 Eye trackers Tobii Glasses II Eye Tracker Sampling rate: 50 Hz
Resolution: 1920х1080 Video recording of the scene 25 FPS overimposed eye movements Software that produces temporal coordinates of fixations

13 The scene Tobii glasses Listener Narrator Reteller Commentator 13 13

14

15 Pear chats and stories: Quantitative parameters
24 sessions recorded in the summer of 2015 96 participants in all 18 to 36 yrs old Gender 36 men and 60 women Education 42 persons with higher education 54 students 9 hours About 100 K words

16 http://multidiscourse.ru http://multidiscourse.ru/annotation/
Examples of mediafiles Annotation Vocal Gestural Manual Cephalic Eye gaze

17 Vocal transcription Verbal structure
Division into elementary discourse units (EDUs) Quanta of talk (Ščerba 1955, Cruttenden 1986, Chafe 1994) Elementary behavioral acts of discourse production Identified on the basis of prosodic criteria: tempo, pausing, etc. Temporal dynamics Pauses Accents Tone in accents Illocutionary characteristics Phase Tempo Emphasis Reduction Tonal register Disfluencies Comments on specific EDUs General characterization Etc., etc.

18 Scores vocal transcript

19 Manual gesture transcript (ELAN)
Annotation components Gesture chains Gesture boundaries Handedness Gesture phases Stroke boundaries ………

20 Eye tracking annotation
Annotation components Fixations on: Interlocutor face hands body other Environment Durations

21 Multilayer annotation

22 Problems we are struggling with
Synchronization of all recordings including the problem of various frame rates Gesture annotation: Degree of detail? Gesture vs. posture Gesture vs. adaptor Automatic detection of motion

23 Some research findings
Traditional notions of communication theory must be restated pausing turn-taking distinction between production and comprehension In the kinetic modality postures are not a separate channel but a part in each particular gestural component High degree of coordination between units belonging to different channels: manual gestures and EDUs (Fedorova et al. 2016)

24 Repairs and gesture (Vera Podlesskaya)
In highly interactional context, additional types of repairs compared to strict monologue In particular, other-initiated Long-range repairs: the speaker realizes that an above stated information is inaccurate Marker of discovery Accompanied by an emphatic gesture slap on the knee

25 Eyetracking in natural communication (Olga Fedorova)
Most eyetracking studies accomplished in experimental settings Eye gaze strategies in natural communication fall into several types: General Longer fixations on face (1 to 2 s) compared to fixations on hands (100 to 250 ms) Context-dependent In interactional context, speaker’s fixations on the environment are rarer than in monologue Individual: total duration of fixations on interlocutors’ hands Recording 23: 1.9% Recording 04: 5.2% Recording 06: 11.1% This may be related to individual differences in peripheral vision Or other individual properties

26 Conclusion Clearly, the way we gesticulate and operate our vision affects how we talk vocally In a better world, the reasonable sequence in the scientific study of language should have been: (1) basic, original use of language: multichannel spoken face-to-face communication (2) derived, secondary uses of language: monomodal, written, grammar But if we cannot reverse the history of linguistics, we attempt to explore the fundamental form of language now – better late than never


Download ppt "Andrej A. Kibrik (Iling RAS and Lomonosov MSU)"

Similar presentations


Ads by Google