Stress, Focus, and the Roots of Segmental Variation Kenneth de Jong Indiana University Work reported here in collaboration with Eric Oglesbee, Kyoko Okamura,

Slides:



Advertisements
Similar presentations
An Adaptive, Dynamical Model of Linguistic Rhythm Sean McLennan Proposal Defense
Advertisements

Tom Lentz (slides Ivana Brasileiro)
Teaching Pronunciation
Sounds that “move” Diphthongs, glides and liquids.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Acoustic Characteristics of Consonants
Phonetic variability of the Greek rhotic sound Mary Baltazani University of Ioannina, Greece  Rhotics exhibit considerable phonetic variety cross-linguistically.
Perceptual Organization in Intonational Phonology: A Test of Parallelism J. Devin McAuley 1 & Laura C. Dilley 2 Department of Psychology Bowling Green.
SPEECH PERCEPTION 2 DAY 17 – OCT 4, 2013 Brain & Language LING NSCI Harry Howard Tulane University.
Speech perception 2 Perceptual organization of speech.
Speech Science XII Speech Perception (acoustic cues) Version
Splice: From vowel offset to vowel onset FIG 3. Example of stimulus spliced from the repetitive syllables. EXPERIMENT 2 (Voicing ID) METHOD Speech materials:
Syllables and Stress, part II October 22, 2012 Potentialities There are homeworks to hand back! Production Exercise #2 is due at 5 pm today! First off:
Prosodics, Part 1 LIN Prosodics, or Suprasegmentals Remember, from our first discussions in class, that speech is really a continuous flow of initiation,
Nuclear Accent Shape and the Perception of Prominence Rachael-Anne Knight Prosody and Pragmatics 15 th November 2003.
Speech and speaker normalization (in vowel normalization)
Evidence of a Production Basis for Front/Back Vowel Harmony Jennifer Cole, Gary Dell, Alina Khasanova University of Illinois at Urbana-Champaign Is there.
A cross-linguistic comparison of the coordination between hand gestures and phonological prominence Giorgos Tserdanelis.
Linguistic Stress in Language and Speech Kenneth de Jong Indiana University.
Digital Systems: Hardware Organization and Design
PHONETICS AND PHONOLOGY
Phonetics (Part 1) Dr. Ansa Hameed.
SPEECH PERCEPTION The Speech Stimulus Perceiving Phonemes Top-Down Processing Is Speech Special?
Syllables and Stress October 21, 2009 Syllables “defined” “Syllables are necessary units in the organization and production of utterances.” (Ladefoged,
What is Phonetics? Short answer: The study of speech sounds in all their aspects. Phonetics is about describing speech. (Note: phonetics ¹ phonics) Phonetic.
Sound and Speech. The vocal tract Figures from Graddol et al.
Chapter three Phonology
The Description of Speech
Phonology, phonotactics, and suprasegmentals
Phonetics HSSP Week 5.
Phonetics and Phonology
Speech Perception. Phoneme - a basic unit of a speech sound that distinguishes one word from another Phonemes do not have meaning on their own but they.
Speech Perception 4/6/00 Acoustic-Perceptual Invariance in Speech Perceptual Constancy or Perceptual Invariance: –Perpetual constancy is necessary, however,
Whither Linguistic Interpretation of Acoustic Pronunciation Variation Annika Hämäläinen, Yan Han, Lou Boves & Louis ten Bosch.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
Speech Perception1 Fricatives and Affricates We will be looking at acoustic cues in terms of … –Manner –Place –voicing.
Speech Science Fall 2009 Nov 2, Outline Suprasegmental features of speech Stress Intonation Duration and Juncture Role of feedback in speech production.
Stress as Attention Modulation and the Roots of its Formal Properties Kenneth de Jong Indiana University Jones.ling.indiana.edu/~kdejong &
Adaptive Design of Speech Sound Systems Randy Diehl In collaboration with Bjőrn Lindblom, Carl Creeger, Lori Holt, and Andrew Lotto.
Is phonetic variation represented in memory for pitch accents ? Amelia E. Kimball Jennifer Cole Gary Dell Stefanie Shattuck-Hufnagel ETAP 3 May 28, 2015.
Results 1.Boundary shift Japanese vs. English perceptions Korean vs. English perceptions 1.Category boundary was shifted toward boundaries in listeners’
Speech Science Fall 2009 Oct 28, Outline Acoustical characteristics of Nasal Speech Sounds Stop Consonants Fricatives Affricates.
Connected speech processes Coarticulation Suprasegmentals.
Speech Perception 4/4/00.
Results Tone study: Accuracy and error rates (percentage lower than 10% is omitted) Consonant study: Accuracy and error rates 3aSCb5. The categorical nature.
A prosodically sensitive diphone synthesis system for Korean Kyuchul Yoon Linguistics Department The Ohio State University.
5aSC5. The Correlation between Perceiving and Producing English Obstruents across Korean Learners Kenneth de Jong & Yen-chen Hao Department of Linguistics.
Ch 3 Slide 1 Is there a connection between phonemes and speakers’ perception of phonetic differences? (audibility of fine distinctions) Due to phonology,
SPEECH PERCEPTION DAY 16 – OCT 2, 2013 Brain & Language LING NSCI Harry Howard Tulane University.
Speech Science IX How is articulation organized? Version WS
A problem with linguistic explanations  A problem with linguistic explanations  Controlling articulatory movements  Memory for speech  The balance.
Automatic Identification and Classification of Words using Phonetic and Prosodic Features Vidya Mohan Center for Speech and Language Engineering The Johns.
Assessment of Phonology
Sensation & Perception
Phonetic Context Effects Major Theories of Speech Perception Motor Theory: Specialized module (later version) represents speech sounds in terms of intended.
Sounds and speech perception Productivity of language Speech sounds Speech perception Integration of information.
1 Cross-language evidence for three factors in speech perception Sandra Anacleto uOttawa.
Chapter II phonology II. Classification of English speech sounds Vowels and Consonants The basic difference between these two classes is that in the production.
Syllables and Stress October 21, 2015.
Speech Perception.
Stop + Approximant Acoustics
Nuclear Accent Shape and the Perception of Syllable Pitch Rachael-Anne Knight LAGB 16 April 2003.
Control of prosodic features under perturbation in collaboration with Frank Guenther Dept. of Cognitive and Neural Systems, BU Carrie Niziolek [carrien]
Suprasegmental Properties of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.
Suprasegmental features and Prosody Lect 6A&B LING1005/6105.
11 How we organize the sounds of speech 12 How we use tone of voice 2009 년 1 학기 담당교수 : 홍우평 언어커뮤니케이션의 기 초.
PHONETICS AND PHONOLOGY
The 157th Meeting of Acoustical Society of America in Portland, Oregon, May 21, pSW35. Confusion Direction Differences in Second Language Production.
What is Phonetics? Short answer: The study of speech sounds in all their aspects. Phonetics is about describing speech. (Note: phonetics ¹ phonics) Phonetic.
Speech Perception (acoustic cues)
Presentation transcript:

Stress, Focus, and the Roots of Segmental Variation Kenneth de Jong Indiana University Work reported here in collaboration with Eric Oglesbee, Kyoko Okamura, Noah Silbert and Bushra Zawaydeh, and supported by NSF: BCS Slides at:

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Segmental Variation: Problem or Solution? Everything varies, and the variation is the most interesting part EXAMPLE FROM MOTOR LITERATURE: Variation in production tells us about linguistic goals: –Abbs et al (1984, among others) -> trading relations in production tell us about motor synergies organized around consonantal closure –Perkell et al (1993); de Jong (1997) -> trading between lip rounding and tongue-body retraction suggests a common goal of acoustic timbre depression Current research: stress variation similarly may be informative about why and how stress and syllables go together

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Stress as Attentional Focus Stress is conventionalized Attentional Focus (de Jong, 2000) STRESS –Syllabic domain –Grammatical property of a language –Inheres in lexical items FOCUS –Can operate in various domains –Property of production –Inheres in particular utterances BOTH –Variation modeled in terms of local modulation along Lindblom’s H&H dimension

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Lindblom’s H r + H o Model (Lindblom 1990) Hyperarticulation (Hr) - Hypoarticulation (Ho) Hr = output oriented constraints Ho = production-oriented factors Not OR …; but AND

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Stress extension of H r +H o (de Jong, 1991; 1995; 1998) Stress effects specified as hyperarticulation At that time: This was Uhglee - super complicated Essentially stress is open-ended adaptation, how do we predict the effect of stress? Hyper-end: Requires a model of the phonological contrast space Hypo-end: Requires a model of motor constraints and optimization

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Stress extension of H r +H o (de Jong, 1991; 1995; 1998) TODAY the reverse: If H+H characterization is right; Stress & Focus variation can tell us about phonological contrast space and motor constraints Specifically: Variation suggests how phonological contrast expression and motor constraints might contribute to creating a relationship between stress and syllables

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Focus & Stress Effects -Connect contrastive dimensions to focus modulations: onset voicing and VOT & F0 in voicing contrasts (Oglesbee, 2008) -VOWELS: vowel quality (de Jong & Zawaydeh, 2002; de Jong, 2004) -CONSONANTS: fricative timbre (Silbert & de Jong, 2008) - More CONSONANTS: coda voicing and vowel duration (de Jong & Zawaydeh, 2002; de Jong, 2004)

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Focus & Stress Effects -Connect contrastive dimensions to focus modulations: onset voicing and VOT & F0 in voicing contrasts (Oglesbee 2008) -VOWELS: vowel quality (de Jong & Zawaydeh, 2002; de Jong, 2004) -CONSONANTS: fricative timbre (Silbert & de Jong, 2008) - More CONSONANTS: coda voicing and vowel duration (de Jong & Zawaydeh, 2002; de Jong, 2004)

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Oglesbee (2008) Measured perceptual sensitivities in a goodness estimation task to VOT, F0 contour, and intensity dynamics for /p/ and /b/ Compared productions of /p/ and /b/ in focus and non-focus positions Subjects created larger differences with focus in dimensions with perceptual sensitivity Dimensions: –All: VOT –S1: vowel amplitude ramp (/b/ sharper) –S2: F0 contour (/p/ higher)

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Focus & Stress Effects -VOT, F0, and onset amplitude cues to voicing become more extreme with focus, in parallel with perceptual sensitivity in goodness tasks -VOWELS: vowel quality (de Jong & Zawaydeh, 2002; de Jong, 2004) -CONSONANTS: fricative timbre (Silbert & de Jong, 2008) - More CONSONANTS: coda voicing and vowel duration (de Jong & Zawaydeh, 2002; de Jong, 2004)

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Focus & Stress Effects -VOT, F0, and onset amplitude cues to voicing become more extreme with focus, in parallel with perceptual sensitivity in goodness tasks -VOWELS: vowel quality (de Jong & Zawaydeh, 2002; de Jong, 2004) -CONSONANTS: fricative timbre (Silbert & de Jong, 2008) - More CONSONANTS: coda voicing and vowel duration (de Jong & Zawaydeh, 2002; de Jong, 2004)

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Non-high front vowels -> Vowel quality Stress by voicing Dashed arrows indicate voicing effects Solid arrows indicate effects going from: Unstressed -> secondary stressed Secondary -> primary stressed - F2 (Hz) - F1 (Hz)

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Focus & Stress Effects -VOT, F0, and onset amplitude cues to voicing become more extreme with focus, in parallel with perceptual sensitivity in goodness tasks -VOWELS: vowel quality becomes more extreme with focus and stress -> vowel space is a gradient contrast space with weak motor constraints -CONSONANTS: fricative timbre (Silbert & de Jong, 2008) - More CONSONANTS: coda voicing and vowel duration (de Jong & Zawaydeh, 2002; de Jong, 2004)

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Focus & Stress Effects -VOT, F0, and onset amplitude cues to voicing become more extreme with focus, in parallel with perceptual sensitivity in goodness tasks -VOWELS: vowel quality becomes more extreme with focus and stress -> vowel space is a gradient contrast space with weak motor constraints -CONSONANTS: fricative timbre (Silbert & de Jong, 2008) - More CONSONANTS: coda voicing and vowel duration (de Jong & Zawaydeh, 2002; de Jong, 2004)

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Silbert & deJong (JASA, 2008) Examined English /f/ /v/ /s/ & /z/ Prevocalic, post-vocalic & focus, non-focus Measured intensity, duration, spectral quality and dynamics of quality Focus affects only duration and intensity -> NO spectral effects

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Focus & Stress Effects -VOT, F0, and onset amplitude cues to voicing become more extreme with focus, in parallel with perceptual sensitivity in goodness tasks -VOWELS: vowel quality becomes more extreme with focus and stress -> vowel space is a gradient contrast space with weak motor constraints -CONSONANTS: fricative timbre is not affected by focus -> rigid motor constraints - More CONSONANTS: coda voicing and vowel duration (de Jong & Zawaydeh, 2002; de Jong, 2004)

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Focus & Stress Effects -VOT, F0, and onset amplitude cues to voicing become more extreme with focus, in parallel with perceptual sensitivity in goodness tasks -VOWELS: vowel quality becomes more extreme with focus and stress -> vowel space is a gradient contrast space with weak motor constraints -CONSONANTS: fricative timbre is not affected by focus -> rigid motor constraints - More CONSONANTS: coda voicing and vowel duration (de Jong & Zawaydeh, 2002; de Jong, 2004)

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Voicing -> Vowel duration Stress by voicing Focus by voicing

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Focus & Stress Effects -VOT, F0, and onset amplitude cues to voicing become more extreme with focus, in parallel with perceptual sensitivity in goodness tasks -VOWELS: vowel quality becomes more extreme with focus and stress -> vowel space is a gradient contrast space with weak motor constraints -CONSONANTS: fricative timbre is not affected by focus -> rigid motor constraints - More CONSONANTS: other consonantal cues are heavily affected

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Generalizations so far Stress effects strongly localized in vocalic nucleii Consonant effects similarly most readily apparent in nucleii Consonant acoustics affected in terms of durational and intensity properties All of which suggests a model of syllable nucleii as motorically fluid repositories of variation Consonantal margins are motorically constrained

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Types of Explanations (deJong, 2007) Hardware Physiological facts and constraints on control Activeware Facts about action and coordination. Motoric propensities which encourage certain articulatory coordinations OR … Shareware Facts about how talkers and listeners couple to share information. Requirements of communication with a listener OR … Ancestorware It’s the way mom & dad did it. Languages are shared repositories for an enormous amount of information.

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Stress variation - Attentional dynamics Many previous models of variation at the syllable level have Hardware or Activeware explanations (e.g. de Jong, 2003, where syllabic units are coodinative gestural constraints) The attentional focus model is a Shareware explanation for stress-related phonetic variability.

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Stress system A stress system = a convention in which both speakers and listeners pay more attentional focus to certain syllables than to others Productions of stressed items are given more attentional focus (de Jong, 1991; 1995; 1998) Perceptions of stressed items are more important for intelligibility (Cole, et al, 1978, Cole and Jakimik, 1980; Bond and Garnes, 1980; Terken and Nooteboom, 1987) Producers and perceivers are coupled during spoken communication. This is a shareware system.

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt But … not the whole story Okamura (ms); de Jong (2004); de Jong & Zawaydeh (2002) Stress differs across languages: Arabic and English focus effects concentrated in stressed syllables. Not true of (say) Japanese. Though stress and focus effects are similar; stress effects more consistent across subjects than focus effects

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt General Model Summary Suggests a model with (at least) two components 1)Shareware: Attentional variation happens according to speakers’ perceptions of listeners’ ability to perceive -This hyperarticulation in general is unevenly distributed in the signal 2)Ancestorware: Attentional variation gets contributed to the pool of experiences that the listener has with particular lexical items -Shareware factors gets conventionalized in Ancestorware

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Why syllables? Syllables might be implicated in both parts Shareware: syllables may act as integral units in the apportionment of attention. Syllabic nucleii are ‘acoustic objects’ which exhibit attentional integration. (See online slides for more on this part of model.) Ancestorware: production variation is more readily and systematically encoded in vocalic nucleii, punctuated by less gradiently variable consonantal margins. Focus variation along H+H lines is localized in syllable nucleii, so over time, nucleii get stress

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt End

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt ADDENDUM: Attentional Dynamics in Audition Auditory attention work by Mari Jones & others (Jones, 1976; Jones & Boltz, 1989; Jones & Yee, 1993; Large & Jones, 1997) –attentional selectivity -> focus –attentional capture -> prominence –attentional integration -> syllabic units –temporal expectancy -> rhythmic alignment

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Stress variation - Attentional dynamics Part 1. attentional selectivity: some parts of a stimulus are more readily acted upon than others Stress => some syllables are attentionally selected

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Stress variation - Attentional dynamics Part 2. attentional selectivity arises from ---> attentional capture: parts which suddenly change in salient dimensions tend to garner such selective advantages Attention is attracted to acoustic events where sudden changes take place F0: Pitch accents dock to stressed locations Amplitude: stress is a property of syllables, which are characterized by sonority rises

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Stress variation - Attentional dynamics Part 3. attentional selectivity also ---> may exhibit attentional integration: aspects which get attended to as a unit are those which work together to define an object or event works on portions of speech which are auditorially cohere Syllables might be such auditorily coherent units

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt Stress variation - Attentional dynamics Part 4. Attentional modulation is generally governed by temporal expectancy. High attention areas may, under appropriate conditions, come at temporally predictable intervals. Hence stress is sometimes governed by rhythmic, metrical patterning.

.//jones.ling.indiana.edu/~kdejong/ ASA_Paris3.ppt References Abbs et al (1984). Control of multimovement coordination: Sensorimotor mechanisms in speech motor programming. Journal of Motor Behavior, 16(2): Bond & Garnes (1980). Misperceptions of fluent speech. In R. Cole (ed.), Perception and Production of Fluent Speech. Hillsdale, NJ: Erlbaum, pp Cole et al (1978). Perceptibility of phoneic features in fluent speech. JASA, 64: Cole & Jakimik (1980). How are syllables used to recognize words? JASA, 67: De Jong (1991). The Oral Articulation of English Stress Accent. Ohio State University Ph.D. thesis. De Jong (1995) The supraglottal articulation of prominence in English: Linguistic stress as localized hyperarticulation. JASA, 97: De Jong (1997). Labio-velar compensation and acoustically-based motor equivalence. JASA, 101: De Jong (1998). Stress-related variation in the articulation of coda alveolar stops: Flapping revisited. Journal of Phonetics, 26: De Jong (2000). Attention modulation and the formal properties of stress systems. In J. Boyle, J-H. Lee, and A. Okrent (eds.), Chicago Linguistic Society 36, Vol. 1, pp. 71 – 91. Chicago: Chicago Linguistics Society. De Jong (2004). Stress, lexical focus, and segmental focus in English: Patterns of variation in vowel duration. Journal of Phonetics, 32: 493 – 516. De Jong & Zawaydeh (2002). Comparing stress, lexical focus, and segmental focus: Patterns of variation in Arabic vowel duration. Journal of Phonetics, 30: Jones (1976). Time, our lost dimension: Toward a new theory of perception, attention, and memory. Psychological Review, 83: Jones & Boltz (1989). Dynamic attending and responses to time. Psychological Review, 96: Jones & Yee (1993). Attending to auditory events: the role of temporal organization. In S. McAdams and E. Bigand (eds.), Thinking in Sound: the Cognitive Psychology of Human Audition. Oxford: Clarendon Press, pp Large & Jones (1997). The dynamics of attending: How we track time varying events. Psychological Review, 106: Lindblom (1990). Explaining phonetic variation: a sketch of the H&H Theory. In H.J. Hardcastle, and A. Marchal (eds.), Speech Production and Speech Modeling, NATO ASI Series D: Behavioural and Social Sciences, Vol. 55 (Kluwer A.P., Dordrecht). Oglesbee (2008). Multidimensional Stop Categorization in English, Spanish, Korean, Japanese, and Candian French. Indiana University Ph.D. thesis. Okamura (ms). The effects of lexical focus and phonological focus on pitch accent in Japanese. Information available from kokamura at Indiana dawt edu. Perkell et al (1993). Trading relations between tongue-body raising and lip rounding production in the vowel /u/: a pilot "motor equivalence" study. JASA, 93, Silbert & de Jong (2008). Focus, prosodic context, and phonological feature specification: Patterns of variation in fricative production. JASA, 123: 2769 – Terken & Nooteboom (1987). Opposite effects of accentuation and deaccentuation on verification latencies for given and new information. Language and Cognitive Processes, 2: