Prosody-driven Sentence Processing: An Event-related Brain Potential Study Ann Pannekamp, Ulrike Toepel, Kai Alter, Anja Hahne and Angela D. Friederici.

Slides:



Advertisements
Similar presentations
Information structuring in English dialogue class 4
Advertisements

Accessing spoken words: the importance of word onsets
Background In Swedish, there is a difference between main clauses and subordinate clauses regarding the linear order of sentence adverbs and inflected.
Punctuation Generation Inspired Linguistic Features For Mandarin Prosodic Boundary Prediction CHEN-YU CHIANG, YIH-RU WANG AND SIN-HORNG CHEN 2012 ICASSP.
Timing of the brain events underlying access to consciousness during the attentional blink Claire Sergent, Sylvain Baillet, & Stanislas Dehaene.
The Role of F0 in the Perceived Accentedness of L2 Speech Mary Grantham O’Brien Stephen Winters GLAC-15, Banff, Alberta May 1, 2009.
Eye Movements and Spoken Language Comprehension: effects of visual context on syntactic ambiguity resolution Spivey et al. (2002) Psych 526 Eun-Kyung Lee.
Language Use and Understanding BCS 261 LIN 241 PSY 261 CLASS 12: BRANIGAN ET AL.: PRIMING.
Are the anterior negativities to grammatical violations indexing working memory? Manuel Martin-Loeches, Francisco munoz, Pilar Casado, A. Melcon, C. Fernandez-frias,
Foundations of psycholinguistics Week 3 The beginnings of language acquisition Vasiliki (Celia) Antoniou.
Prosodic facilitation and interference in the resolution of temporary syntactic closure ambiguity Kjelgaard & Speer 1999 Kent Lee Ψ 526b 16 March 2006.
SPEECH PERCEPTION 2 DAY 17 – OCT 4, 2013 Brain & Language LING NSCI Harry Howard Tulane University.
The Neuroscience of Language. What is language? What is it for? Rapid efficient communication – (as such, other kinds of communication might be called.
Using prosody to avoid ambiguity: Effects of speaker awareness and referential context Snedeker and Trueswell (2003) Psych 526 Eun-Kyung Lee.
ERP Investigation of Prosodic and Semantic Focus Shawn Johnson Charles Clifton, Jr. Mara Breen Andrea Eileen Martin Joanna Morris Florack.
All slides © S. J. Luck, except as indicated in the notes sections of individual slides Slides may be used for nonprofit educational purposes if this copyright.
Nuclear Accent Shape and the Perception of Prominence Rachael-Anne Knight Prosody and Pragmatics 15 th November 2003.
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan IEEE 2007 Min-Hsuan.
Niebuhr, D‘Imperio, Gili Fivela, Cangemi 1 Are there “Shapers” and “Aligners” ? Individual differences in signalling pitch accent category.
Connecting Acoustics to Linguistics in Chinese Intonation Greg Kochanski (Oxford Phonetics) Chilin Shih (University of Illinois) Tan Lee (CUHK) with Hongyan.
Vocal Emotion Recognition with Cochlear Implants Xin Luo, Qian-Jie Fu, John J. Galvin III Presentation By Archie Archibong.
Semantic and syntactic processing in Chinese sentence comprehension: Evidence from event-related potentials Zheng Ye, Yue-kia Luo, Angela D. Friederici,
An fMRI investigation of covertly and overtly produced mono- and multisyllabic words. Shuster LI, Lemieux SK. Brain and Language 93 (2005):20-31.
Agustín Gravano 1 · Stefan Benus 2 · Julia Hirschberg 1 Elisa Sneed German 3 · Gregory Ward 3 1 Columbia University 2 Univerzity Konštantína Filozofa.
SYNTAX 1 DAY 30 – NOV 6, 2013 Brain & Language LING NSCI Harry Howard Tulane University.
Multiple constraints in action
Electroencephalography Electrical potential is usually measured at many sites on the head surface.
Detecting Prosody Improvement in Oral Rereading Minh Duong and Jack Mostow Project LISTEN Carnegie Mellon University The research.
Kai Alter Newcastle Auditory Group Segmentation in speech: On the processing of boundaries and accents.
Language, Mind, and Brain by Ewa Dabrowska Chapter 2: Language processing: speed and flexibility.
Sound and Speech. The vocal tract Figures from Graddol et al.
Word category and verb-argument structure information in the dynamics of parsing Frisch, Hahne, and Friedericie (2004) Cognition.
Perceived prominence and nuclear accent shape Rachael-Anne Knight LAGB 5 th September 2003.
WORD SEMANTICS 4 DAY 29 – NOV 4, 2011 Brain & Language LING NSCI Harry Howard Tulane University.
Introduction How do people recognize objects presented in pictorial form? The ERP technique has been shown to be extremely useful in studies where the.
Suprasegmentals Segmental Segmental refers to phonemes and allophones and their attributes refers to phonemes and allophones and their attributes Supra-
Comprehension of Grammatical and Emotional Prosody is Impaired in Alzheimer’s Disease Vanessa Taler, Shari Baum, Howard Chertkow, Daniel Saumier and Reported.
A prosodically sensitive diphone synthesis system for Korean Kyuchul Yoon Linguistics Department The Ohio State University.
10/13/10Psyc / Ling / Comm 525 Fall 10 Kim & Osterhout (2005) JML The independence of combinatory semantic processing: Evidence from event-related potentials.
Right hemisphere sensitivity to word & sentence level context: Evidence From Event-Related Brain Potentials. Coulson, S. Federmeier, K.D., Van Petten,
Evaluating prosody prediction in synthesis with respect to Modern Greek prenuclear accents Elisabeth Chorianopoulou MSc in Speech and Language Processing.
As expected, a large N400 effect was observed for all 3 word types in both experiments, |ts|≥7.69, ps
Background: Speakers use prosody to distinguish between the meanings of ambiguous syntactic structures (Snedeker & Trueswell, 2004). Discourse also has.
Ethogram Notes Charting Animal Behavior. Ethogram = behavior inventory used for quick recording of behaviors takes 1000’s of hours for a complete catalog.
N400-like semantic incongruity effect in 19-month-olds: Processing known words in picture contexts Manuela Friedrich and Angela D. Friederici J. of cognitive.
The effects of working memory load on negative priming in an N-back task Ewald Neumann Brain-Inspired Cognitive Systems (BICS) July, 2010.
1.) *Experiment* 2.) Quasi-Experiment 3.) Correlation 4.) Naturalistic Observation 5.) Case Study 6.) Survey Research.
3 Phonology: Speech Sounds as a System No language has all the speech sounds possible in human languages; each language contains a selection of the possible.
Nuclear Accent Shape and the Perception of Syllable Pitch Rachael-Anne Knight LAGB 16 April 2003.
Control of prosodic features under perturbation in collaboration with Frank Guenther Dept. of Cognitive and Neural Systems, BU Carrie Niziolek [carrien]
ANT Z=52 R ACUE - PASSIVE VCUE - PASSIVE 1300 msVoltageCSD.31uV.03uV/cm 2 AIM We investigate the mechanisms of this hypothesized switch-ERP.
ERPs in language acquisition
Chapter 11 Language. Some Questions to Consider How do we understand individual words, and how are words combined to create sentences? How can we understand.
Suprasegmental features and Prosody Lect 6A&B LING1005/6105.
Cortical evoked potentials to an auditory illusion: Binaural beats
© 2015 Psychology Press / Taylor & Francis
S. Kramer1, K. Tucker1, A.L. Moro1, E. Service1, J.F. Connolly1
SUPRASEGMENTAL PHONEME
RESULTS AND DISCUSSION Fall Level High-rising Fall Level High-rising
Kuiper and Allan Chapter 6.2
Studying Intonation Julia Hirschberg CS /21/2018.
Detecting Prosody Improvement in Oral Rereading
Kuiper and Allan Chapter 6.2
Turn-taking and Disfluencies
Comparative Studies Avesani et al 1995; Hirschberg&Avesani 1997
Using Time-Varying Motion Stimuli to Explore Decision Dynamics
Agustín Gravano1 · Stefan Benus2 · Julia Hirschberg1
Towards a neural basis of auditory sentence processing
Volume 59, Issue 5, Pages (September 2008)
Presentation transcript:

Prosody-driven Sentence Processing: An Event-related Brain Potential Study Ann Pannekamp, Ulrike Toepel, Kai Alter, Anja Hahne and Angela D. Friederici Presented by Laura Matzen, 9/1/2005

Pannekamp et al. (2005) Goal of study: –Determine what causes the closure positive shift (CPS) –Is this effect driven by prosody alone or by other factors?

Pannekamp et al. (2005) Basic Design: –Systematically reduce linguistic content of sentences –Record ERPs in each condition to see if CPS is present in all cases or to see how it changes

Background on Event-related Potentials (ERPs)

Background Closure Positive Shift (CPS) –seen in response to normal spoken sentences –somehow related to intonation contour Intonation contour –“sentence melody” –provides information about syntactic structure, sentence mode –fundamental frequency – F0

“Actually, we were the ones who said we didn’t want a regular cake, so you can blame it on us.” Frequency

“Actually, we were the ones who said we didn’t want a regular cake, so you can blame it on us.” Frequency

Intonational Phrases (IPh) One or more in each sentence Group and organize words into phrases Structure is usually determined by syntax

Intonational Phrases (IPh) “...defined as containing at least one nuclear accent and a boundary tone at their right edges” Pitch contour drops at the end, resets at start of the next IPh Last syllable usually lengthened Often followed by a pause

Examples She went to the store, then she picked up the dry cleaning on the way home. She went to the store, then she dropped the kids off at school, then she went home.

Intonational Phrases (IPh) Seem to be secondary to syntax in some ways –supporting role BUT, listeners can identify prosodic boundaries in the absence of semantic and syntactic information (de Rooij, 1975)

Intonational Phrases (IPh) Beckman (1996) “the prosodic structure of the utterance has to be seen as a full grammatical property also requiring its own parsing” However, it might be harder to process prosody in absence of other linguistic information –Off-line behavioral studies can’t address this issue

Steinhauer et al. (1999) Studied prosody with ERPs –Two sentence types: 1 or 2 IPh boundaries –Saw positive-going waveform in response to all boundaries – CPS –Possible Confound: Close relationship between prosody and syntax CPS could still be related to processing syntactic boundaries, not just prosodic boundaries

Steinhauer & Friederici (2001) ERP study with delexicalized speech –Filtered to removed phonemic, semantic and syntactic information –Only prosodic info left (pitch, amplitude, rhythm)  CPS in this case could only be caused by prosody

Steinhauer & Friederici (2001) Results –Strange-looking CPS at first boundary –No CPS at second boundary –Contingent negative variation (CNV) across whole sentence  Is this because the sentences are so unnatural? Is this task completely different from language processing?

Meyer et al. (2002) fMRI study –Very different responses to natural speech and delexicalized speech –Response to prosody stronger in right hemisphere “strong evidence that pitch processing in the absence of additional linguistic information such as syntax and/or semantics takes place in right hemisphere regions”

Current Experiments Motivations: –Investigate partial replication of CPS findings by Steinhauer & Friederici (2001) Why didn’t they get the expected results? Try using more natural stimuli –See if CPS shifts to the right hemisphere as segmental information is removed

Experiments Experiment 1: Normal sentences –Normal semantic, syntactic and phonemic info Experiment 2: Jabberwocky sentences –Remove semantic information Experiment 3: Pseudo sentences –Remove semantic and syntactic information Experiment 4: Hummed sentences –Remove semantic, syntactic, and phonemic info (only prosody left)

Expt 1: Normal Sentences A1: [Kevin promises mom to sleep] IPh1 [and to be a good boy for a while.] B1: [Kevin promises] IPh1 [mom to kiss] IPh2 [and to be a good boy for a while.]

Expt 1: Normal Sentences A1: one IPh boundary at 1950 msec B1: two IPh boundaries at 950 and 2700 msec “First part” (subject + verb) of sentence longer than A1 High boundary tone at end of first part Longer pause after first part Both conditions have IPh boundary after second verb (marked by high boundary tone)

Expt 2: Jabberwocky Sentences All content words replaced with pseudo words A2: [The bater rabels Onna to lubol] IPh1 [and the rado to nupe.] B2: [The bater rabels ] IPh1 [Onna to lubol] IPh2 [and the rado to nupe.]

Expt 2: Jabberwocky Sentences A2: one IPh boundary at 2100 msec B2: two IPh boundaries at 1100 and 2600 msec “First part” (subject + verb) of sentence longer than A2 High boundary tone at end of first part Longer pause after first part Both conditions have IPh boundary after second verb (marked by high boundary tone)

Expt 3: Pseudo Sentences All function and content words replaced with pseudo words A3: [Bater saklimm Onna ko labei keg ] IPh1 [nug som Rado lie nupes.] B3: [Bater saklimm] IPh1 [Onna ko labei keg] IPh2 [nug som Rado lie nupes.]

Expt 3: Pseudo Sentences A3: one IPh boundary at 2000 msec B3: two IPh boundaries at 920 and 2400 msec “First part” (subject + verb) of sentence longer than A3 High boundary tone at end of first part Longer pause after first part Both conditions have IPh boundary after second verb (marked by high boundary tone)

Expt 4: Hummed Sentences A4: [mm mmm mmmm mm mmmm] IPh1 [mmmm mmm mmm mmm mmmmm.] B4: [mm mmm] IPh1 [mmmm mm mmm] IPh2 [mmmm mmm mmm mmm mmmmm.]

Expt 4: Hummed Sentences A4: one IPh boundary at 1850 msec B4: two IPh boundaries at 850 and 2150 msec Longer pause after first part High boundary tone at end of first part (First part itself is NOT longer in this case) Both conditions have IPh boundary after second verb (marked by high boundary tone)

Procedure Subjects only saw one type of sentence Auditory presentation for all sentences Trial followed by a probe word –Subjects had to say whether word was in sentence or not (Words were placed randomly into hummed filler sentences)

Data Collection 23 Ag/AgCl electrodes 200 msec prestimulus baseline Averages computed over whole sentences – 4500 msec segments

Results

A1: IPh at 1950 ms Response at 2000 ms B1: IPh1 at 950 ms Response at 1500 ms IPh2 at 2700 ms Response at 2700 ms

A2: IPh at 2100 ms Response at 2200 ms B2: IPh1 at 1100 ms Response at 1500 ms IPh2 at 2600 ms Response at 2800 ms

A3: IPh at 2000 ms Response at ms B3: IPh1 at 920 ms Response at 1500 ms IPh2 at 2400 ms Response at 2500 ms

A4: IPh at 1850 ms Response at 2200 ms B4: IPh1 at 850 ms Response at 1000 ms IPh2 at 2150 ms Response at ms Negative peak from ms

Waveforms measured from offset of 1 st sentence fragment

Their interpretation... Positivity resembles CPS Observed in all experimental conditions –Related to processing of prosodic boundaries Scalp distribution changes across conditions

Their interpretation... CPS associated with first IPh seems related to the amount of segmental content in sentences –Moved forward and rightward as info decreased CPS associated with second IPh seems UNrelated to amount of segmental content

Scalp Distributions – IPh1 Experiment 1: CPS over whole head Experiment 2: CPS at midline and lateral sites, also moves to anterior sites Experiment 3: Moves to right anterior sites Experiment 4: CPS broadly distributed

Scalp Distributions First IPh: –CPS moved anterior sites as linguistic information decreased –CPS for psuedo sentences moved rightward, but not for hummed sentences (possible explanation – left hemisphere processing timing of hums?)

Scalp Distributions Second IPh: –CPS distributed broadly over midline sites also over lateral sites for normal and pseudo sentence (why not for other types of sentences?) –CPS moved rightward for hummed sentences ONLY (CPS not seen here at all in previous study- conclude that naturalness is important)

Negativity Early negativity in hummed sentences –Different processing mechanism for pure prosody? –Search for early accent (that isn’t there)? –Different task demands?

Their Conclusions CPS is independent of expt manipulations –Seems to be related to perception of prosodic boundaries –Dependent exclusively on prosody “The observed differences in the scalp distribution of the CPS as a function of the segmental content of the acoustic speech stream suggest that prosodic processing interacts with other information types involving different systems.”

Questions Why did scalp distribution change? (It doesn’t seem to change in a systematic way) Why are there different results for the 1 st and 2 nd IPh boundaries? Are the differences in timing meaningful? Why not compare sentences with no IPh boundaries? (one phrase) Is it reasonable to say that these are “natural” manipulations?