PHREND at UCSC Sepember 24, 2016 Sarah Bakst, UC Berkeley

Slides:



Advertisements
Similar presentations
Tom Lentz (slides Ivana Brasileiro)
Advertisements

Normal Aspects of Articulation. Definitions Phonetics Phonology Articulatory phonetics Acoustic phonetics Speech perception Phonemic transcription Phonetic.
Acoustic and Physiological Phonetics
Sounds that “move” Diphthongs, glides and liquids.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Glides (/w/, /j/) & Liquids (/l/, /r/) Degree of Constriction Greater than vowels – P oral slightly greater than P atmos Less than fricatives – P oral.
From Resonance to Vowels March 8, 2013 Friday Frivolity Some project reports to hand back… Mystery spectrogram reading exercise: solved! We need to plan.
“Connecting the dots” How do articulatory processes “map” onto acoustic processes?
Basic Spectrogram & Clinical Application Lab 9. Spectrographic Features of Vowels n 1st formant carries much information about manner of articulation.
Perturbation Theory March 11, 2013 Just So You Know The Fourier Analysis/Vocal Tract exercise is due on Wednesday. Please note: don’t make too much out.
A two dimensional kinematic mapping between speech acoustics and vocal tract configurations : WISP A.Hatzis, P.D.Green1 History of Vowel.
Acoustic Characteristics of Vowels
Hello, Everyone! Review questions  Give examples to show the following features that make human language different from animal communication system:
Speech Science XII Speech Perception (acoustic cues) Version
“Speech and the Hearing-Impaired Child: Theory and Practice” Ch. 13 Vowels and Diphthongs –Vowels are formed when sound produced at the glottal source.
Digital Systems: Hardware Organization and Design
Vocal Tract Physiology December 2, 2014 Almost There… The final interim course project report is due today! I’ll get your last graded homeworks back.
The Human Voice. I. Speech production 1. The vocal organs
ACOUSTICAL THEORY OF SPEECH PRODUCTION
Speech Perception Overview of Questions Can computers perceive speech as well as humans? Does each word that we hear have a unique pattern associated.
PHYSICAL PROPERTIES OF SPEECH SOUNDS
Speech Group INRIA Lorraine
Speech Anatomy and Articulation
Development of coarticulatory patterns in spontaneous speech Melinda Fricke Keith Johnson University of California, Berkeley.
Chapter 6 Features PHONOLOGY (Lane 335).
Introduction to Speech Production Lecture 1. Phonetics and Phonology Phonetics: The physical manifestation of language in sound waves. –How sounds are.
Auditory-acoustic relations and effects on language inventory Carrie Niziolek [carrien] may 2004.
Voice Transformations Challenges: Signal processing techniques have advanced faster than our understanding of the physics Examples: – Rate of articulation.
Linguistics I Chapter 4 The Sounds of Language.
Conclusions  Constriction Type does influence AV speech perception when it is visibly distinct Constriction is more effective than Articulator in this.
PHONETICS & PHONOLOGY COURSE WINTER TERM 2014/2015.
Present Experiment Introduction Coarticulatory Timing and Lexical Effects on Vowel Nasalization in English: an Aerodynamic Study Jason Bishop University.
Segmental factors in language proficiency: Velarization degree as a signature of pronunciation talent Henrike Baumotte and Grzegorz Dogil {henrike.baumotte,
Speech Production1 Articulation and Resonance Vocal tract as resonating body and sound source. Acoustic theory of vowel production.
Acoustic Phonetics 3/9/00. Acoustic Theory of Speech Production Modeling the vocal tract –Modeling= the construction of some replica of the actual physical.
Fricatives, part II November 21, 2012 Announcements For Friday: spectrogram matching exercise! Fricatives and possibly glides, too. Final exam has been.
Speech Science Fall 2009 Oct 26, Consonants Resonant Consonants They are produced in a similar way as vowels i.e., filtering the complex wave produced.
Phonological Theory.
Phonetics Class # 2 Chapter 6. Homework (Ex. 1 – page 268)  Judge [d ] or [ ǰ ]  Thomas [t]  Though [ ð ]  Easy [i]  Pneumonia [n]  Thought [ θ.
Speech Science Fall 2009 Oct 28, Outline Acoustical characteristics of Nasal Speech Sounds Stop Consonants Fricatives Affricates.
Sarah Bakst, UC Berkeley Phonetics and Phonology in Europe University of Cambridge June 29, 2015.
♥♥♥♥ 1. Intro. 2. VTS Var.. 3. Method 4. Results 5. Concl. ♠♠ ◄◄ ►► 1/181. Intro.2. VTS Var..3. Method4. Results5. Concl ♠♠◄◄►► IIT Bombay NCC 2011 : 17.
Speech Science IX How is articulation organized? Version WS
Fricatives, part 2 November 14, 2008 Who’s Next Today: some leftover notes on vowels Then: more fricatives Monday: fricative spectrogram matching.
From Resonance to Vowels March 13, 2012 Fun Stuff (= tracheotomy) Peter Ladefoged: “To record the pressure of the air associated with stressed as opposed.
Introduction to Language Phonetics 1. Explore the relationship between sound and spelling Become familiar with International Phonetic Alphabet (IPA )
LIN 3201 Sounds of Human Language Sayers -- Week 1 – August 29 & 31.
PHONETIC 1 MGSTER. RAMON GUERRA by: Mgster. Ramon Guerra.
Stop Acoustics and Glides December 2, 2013 Where Do We Go From Here? The Final Exam has been scheduled! Wednesday, December 18 th 8-10 am (!) Kinesiology.
Stop + Approximant Acoustics
Phonetics: A lecture Raung-fu Chung Southern Taiwan University
Introduction to English Pronunciation
GEPPETO 1 : A modeling approach to study the production of speech gestures Pascal Perrier (ICP – Grenoble) with Stéphanie Buchaillard (PhD) Matthieu Chabanas.
AN ANALOG INTEGRATED- CIRCUIT VOCAL TRACT PRESENTED BY: NIEL V JOSEPH S7 AEI ROLL NO-46 GUIDED BY: MR.SANTHOSHKUMAR.S ASST.PROFESSOR E&C DEPARTMENT.
PHONETICS AND PHONOLOGY
1 Probing the Big Bang with ultrasound: Retraction of /s/ in English Adam Baker, Jeff Mielke, Diana Archangeli University of Arizona Supported by James.
Phonetics : The Sounds of Language The science of phonetics attempts to describe all of the sounds used in all languages of the world. Knowing a language.
The effect of speech timing on velopharyngeal function
The Human Voice. 1. The vocal organs
Copyright © American Speech-Language-Hearing Association
Structure of Spoken Language
The Human Voice. 1. The vocal organs
English Phonetics and Phonology
Spoken language phonetics: Transcription, articulation, consonants
Phonetics.
Chapter 2 Phonology.
Speech Perception (acoustic cues)
Motor theory.
/r/ Place: palatal Articulatory phonetics Acoustics
A Japanese trilogy: Segment duration, articulatory kinematics, and interarticulator programming Anders Löfqvist Haskins Laboratories New Haven, CT.
Presentation transcript:

PHREND at UCSC Sepember 24, 2016 Sarah Bakst, UC Berkeley Modeling the effect of palate shape on the articulation and acoustics of American English /r/ PHREND at UCSC Sepember 24, 2016 Sarah Bakst, UC Berkeley

Overview What is the role of individual anatomy in speech production? Here: domedness of palate Mapping between articulation and acoustics Test case: American English /r/ This work models an answer to the question, “what is the role of individual anatomy in speech production?”

Role of palate shape Brunner et al. (2009): flatness correlated with reduced articulatory variability but not acoustic variability for front vowels Bakst (2016): extension to /r/

Articulatory variability Articulatory and acoustic variability compared by palate shape in /r/ production Articulatory variability (z-scored) pdf(‘rtic.pdf’, height = 6, width = 10) plot(dd$rscale~dd$alpha, main = 'Variability in production correlated with palate shape for /r/ ', ylim = c(-2,2), xlab = 'Flatness', ylab = 'Variability', type = 'n') points(dd$rscale~dd$alpha, pch = 15) abline(3.552,-1.728) dev.off() plot(dd$f3scale~dd$alpha, col = 'red') abline(1.4910,-0.7256, col = 'red', lty = 2) Acoustic variability (z-scored F3 s.d.) 1.8 2.0 2.2 2.4 2.6 Palate flatness

Quantal mapping arbitrary constriction along vocal tract (figure from Stevens & Keyser 2010)

Domedness as a factor Palate shape affects mapping between articulation and acoustics (Stevens 1972) Differences in width of zone II Differences in location and frequency of discontinuities

Motivation Observation: flat palates -> reduced articulatory variability What is the nature of this relationship between palate shape and articulatory/acoustic variability? Is the articulatory-acoustic mapping actually different for different palate shapes? Or is there just less room to move? Test with F3 in /r/ as acoustic parameter

Hypothesis Main hypothesis: Less linear mapping = Wider ‘zone II’ for flatter palates, hence constrained articulatory variability Less linear mapping = larger areas of local acoustic stability (domed) more discontinuities in F3 values for domed palates More linear mapping = more acoustic flexibility (flat) Greater range in F3 value for flatter palates

Modeling in Maeda Maeda synthesizer Default: one relatively domed-average palate four articulatory principal components based on real human: jaw/tube dorsum position dorsum shape apex position

Maeda revised New parameter: apex orientation ant post ant bunched /r/ retroflex /r/ bunched /r/

Tongue tip lips glottis

Palates Maeda comes with one default palate I additionally created a flat palate and a super-domed palate

New palates

Methods Settings for each parameter (below: dorsum shape) on a scale from -4 to 4 Component multiplied by setting

Method Four manipulated parameters: Dorsum position (anteriority) Dorsum shape (bunchiness) Apex shape (tip-up/-down) Lip protrusion Python script looped over values for each articulatory parameter Model plays through vocal tract Formant tracking in ifc formant Rejection of bad tokens: low rms amplitude

Area function Area taken at 32 places along vocal tract Synthesizer uses special formula taking into account domedness of palate: A (x) = αx^β α is a metric of domedness; x = width in sagittal plane α empirically-derived

Results: Zone width

Correlations with F3 Flat Default Domed Dorsum position -0.52 -0.64 -0.63 Dorsum shape -0.47 -0.32 -0.19 Lip protrusion -0.05 -0.14 -0.24 Tip curl -0.09

Correlations with F3 Flat Default Domed Dorsum position -0.52 -0.64 -0.63 Dorsum shape -0.47 -0.32 -0.19 Lip protrusion -0.05 -0.14 -0.24 Tip curl -0.09

Correlations with F3 Flat Default Domed Dorsum position -0.52 -0.64 -0.63 Dorsum shape -0.47 -0.32 -0.19 Lip protrusion -0.05 -0.14 -0.24 Tip curl -0.09

Correlations with F3 Flat Default Domed Dorsum position -0.52 -0.64 -0.63 Dorsum shape -0.47 -0.32 -0.19 Lip protrusion -0.05 -0.14 -0.24 Tip curl -0.09

Correlations with F3 Flat Default Domed Dorsum position -0.52 -0.64 -0.63 Dorsum shape -0.47 -0.32 -0.19 Lip protrusion -0.05 -0.14 -0.24 Tip curl -0.09

Lowest F3 configurations

Discussion Smallest “zone I” for flattest palate–theoretically a wider range of attainable acoustic output Effect of each articulator on F3 actually varies by palate shape Supports result linking reduced articulatory variability with flatter palate shape

Conclusions Palate shape does seem to have an influence over the articulatory-acoustic mapping Implications for sound change? Next step: add sublingual cavity expectations: sublingual cavity significant lowering effect (usually on F3) better articulatory predictions

Acknowledgements Ronald Sprouse Keith Johnson Susan Lin John Houde Rich Ivry Members of the UC Berkeley Phon Lab

Selected references B. S. Atal, J. J. Chang, M. V. Matthews, and J. W. Tukey, “Inver- sion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique,” Journal of the Acoustical Society of America, vol. 63, no. 5, pp. 1535–1555, 1978. Bakst, S. (2016). The role of palate shape in individual articulatory and acoustic variability. Laboratory Phonology 15. Brunner, J., S. Fuchs, and P. Perrier (2009). On the relationship between palate shape and articulatory behavior. Journal of the Acoustical Society of America 125(6), 3936–3949. Carre, R. and M. Mrayati (1988). Articulatory-acoustic-phonetic relations and modeling, regions, and modes. In A. Marchal and W. Hardcastle (Eds.), Speech production and speech modeling, NATO ASI Series. Dordrecht: Kluwer Academic Publishers. Espy-Wilson, C. Y., S. Boyce, M. Jackson, S. Narayanan, and A. Alwan (2000). Acoustic modeling of American English /r/. Journal of the Acoustical Society of America 108(1), 343–356. J. Mielke, A. Baker, and D. Archangeli, “Variability and homo- geneity in American English /r/ allophony and /s/ retraction,” Lab- oratory Phonology, vol. 10, pp. 699–729, 2010. Mrayati, M., R. Carre, and B. Guerin (1988). Distinctive regions and modes: a new theory of speech production. Speech Communication 7, 257–286. Stevens, K. N. and S. E. Blumstein (1975). Quantal aspects of consonant production and perception: a study of retroflex stop consonants. Journal of Phonetics 3, 215–233. Stevens, K. N. and S. J. Keyser (2010). Quantal theory, enhancement and overlap. Journal of Phonetics 38, 10-19. A.Watanabe,“Formantestimationmethodusinginversefiltercon- trol,” IEEE Transactions on Speech and Audio Processing, vol. 9, no. 4, pp. 317–26, 2001.