Laboratory for Experimental ORL KULeuven Perception of major acoustic cues Astrid van Wieringen 5th European Master school on Language and Speech Bonn,

Slides:



Advertisements
Similar presentations
Normal Aspects of Articulation. Definitions Phonetics Phonology Articulatory phonetics Acoustic phonetics Speech perception Phonemic transcription Phonetic.
Advertisements

Sounds that “move” Diphthongs, glides and liquids.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Basic Spectrogram & Clinical Application: Consonants
Acoustic Characteristics of Consonants
Speech Perception Dynamics of Speech
1 CS 551/651: Structure of Spoken Language Lecture 4: Characteristics of Manner of Articulation John-Paul Hosom Fall 2008.
Phonetics.
Hello, Everyone! Review questions  Give examples to show the following features that make human language different from animal communication system:
Nasal Stops.
Speech Science XII Speech Perception (acoustic cues) Version
Digital Systems: Hardware Organization and Design
The Human Voice. I. Speech production 1. The vocal organs
Introduction to linguistics – The sounds of German R21118 Dr Nicola McLelland.
Phonetics (Part 1) Dr. Ansa Hameed.
Speech Anatomy and Articulation
Speech sounds Articulation.
Unit 4 Articulation I.The Stops II.The Fricatives III.The Affricates IV.The Nasals.
English Phonetics and Phonology Lesson 3B
Chapter 6 Features PHONOLOGY (Lane 335).
Nasal Stops. Nasals Distinct vocal tract configuration Pharyngeal cavity Oral cavity (closed) Nasal cavity (open)
Chapter 2 Introduction to articulatory phonetics
Phonetics III: Dimensions of Articulation October 15, 2012.
Linguistics I Chapter 4 The Sounds of Language.
Natural classes and distinctive features
Speech Sounds of American English and Some Iranian Languages
The sounds of language Phonetics Chapter 4.
English Pronunciation Practice A Practical Course for Students of English By Wang Guizhen Faculty of English Language & Culture Guangdong University of.
Phonetics HSSP Week 5.
Phonetics Phonetics: It is the science of speech sounds. It is the study of the production and reception of speech sounds. It is concerned with the sounds.
1 4. Consonants  Consonants are produced ‘ by a closure in the vocal tract, or by a narrowing which is so marked that air cannot escape without producing.
Speech Production1 Articulation and Resonance Vocal tract as resonating body and sound source. Acoustic theory of vowel production.
Speech Science Fall 2009 Oct 26, Consonants Resonant Consonants They are produced in a similar way as vowels i.e., filtering the complex wave produced.
Speech Science VII Acoustic Structure of Speech Sounds WS
1 L103: Introduction to Linguistics Phonetics (consonants)
ECE 598: The Speech Chain Lecture 7: Fourier Transform; Speech Sources and Filters.
Phonological Theory.
Phonetics Class # 2 Chapter 6. Homework (Ex. 1 – page 268)  Judge [d ] or [ ǰ ]  Thomas [t]  Though [ ð ]  Easy [i]  Pneumonia [n]  Thought [ θ.
SPEECH ORGANS & ARTICULATION
Speech Science Fall 2009 Oct 28, Outline Acoustical characteristics of Nasal Speech Sounds Stop Consonants Fricatives Affricates.
Phonetics: Dimensions of Articulation October 13, 2010.
Phonetics 2. Phonology 2.1 The phonic medium of language Sounds which are meaningful in human communication constitute the phonic medium of language.
Linguistics The fourth week. Chapter 2 The Sounds of Language 2.1 Introduction 2.1 Introduction 2.2 Phonetics 2.2 Phonetics.
Phonetics Definition Speech Organs Consonants vs. Vowels
ACE TESOL Diploma Program – London Language Institute OBJECTIVES You will understand: 1. How each of the phonemes in English is articulated 2. The differences.
LIN 3201 Sounds of Human Language Sayers -- Week 1 – August 29 & 31.
Stop Acoustics and Glides December 2, 2013 Where Do We Go From Here? The Final Exam has been scheduled! Wednesday, December 18 th 8-10 am (!) Kinesiology.
Stop + Approximant Acoustics
Ch4 – Features Features are partly acoustic partly articulatory aspects of sounds but they are used for phonology so sometimes they are created to distinguish.
1 Acoustic Phonetics 3/28/00. 2 Nasal Consonants Produced with nasal radiation of acoustic energy Sound energy is transmitted through the nasal cavity.
Stop/Plosives.
Welcome to all.
PHONETICS AND PHONOLOGY
ARTICULATORY PHONETICS
ARTICULATORY PHONETICS
Phonetics Dimensions of Articulation
Linguistics: Phonetics
The Human Voice. 1. The vocal organs
Essentials of English Phonetics
Structure of Spoken Language
The Human Voice. 1. The vocal organs
Phonetics: The Sounds of Language
Speech is made up of sounds.
Spoken language phonetics: Transcription, articulation, consonants
Chapter 2 Phonology.
Phonetics: The Sounds of Language
Speech Perception (acoustic cues)
CONSONANTS ARTICULATORY PHONETICS. Consonants When we pronounce consonants, the airflow out of the mouth is completely blocked, greatly restricted, or.
PHONETICS AND PHONOLOGY INTRODUCTION TO LINGUISTICS Lourna J. Baldera BSED- ENGLISH 1.
Presentation transcript:

Laboratory for Experimental ORL KULeuven Perception of major acoustic cues Astrid van Wieringen 5th European Master school on Language and Speech Bonn, July 2004

Lab Exp ORL KULeuven Astrid van Wieringen July Content of Tutorial In order to understand why certain speech sounds are not perceived/recognized by hearing-impaired or automatic speech recogniser, one should understand: –major categories + acoustic properties of speech sounds –different types of tests & speech materials –how to assess transmission of robust spectral and temporal cues by means of analytical (phoneme) tests –data collection and analyses hearing loss (with focus on cochlear implants) Practical part: –Test perception of filtered speech sounds

Lab Exp ORL KULeuven Astrid van Wieringen July Speech sounds - major categories Vowel and consonant phonemes are classified in terms of Manner of articulation –concerns how the vocal tract restricts airflow completely stopping of airflow by an occlusion creates a plosive (stop consonant) vocal tract constrictions of varying degree occur in liquids, fricatives, glides and vowels lowering the velum causes nasal sounds Place of articulation refers to the location in the vocal tract Voicing presence/absence of vocal fold vibration

Lab Exp ORL KULeuven Astrid van Wieringen July Manner of articulation of most consonants Stop consonants (plosives): complete closure and subsequent release of a vocal tract obstruction. Pressure build-up followed by burst. Liquids: like vowels, but tongue is used for some degree of obstruction. For /l/ air escapes around the tip of tongue or dorsum. The /r/ has more variable articulation Nasals: a lowering of the velum. Airflow out of the nostrils. In English only nasalized consonants (oral tract completely closed), in French also nasalized vowels (air escapes through oral tract and nasal cavities). Vowels may be nasalized in English, but the distinction is not phonemic (= vowel identity does not change). In French there are pairs of vowels that differ only in the presence or absence of vowel nasalization. Fricatives: narrow constriction in the oral tract (for some language in the pharynx and in the glottis). If the pressure behind the constriction is high enough and the passage sufficiently narrow, airflow becomes fast enough to generate turbulence at the end of the constriction Strident fricatives : noise amplitude is enhanced by airflow striking a surface:  (shy) Affricate= stop + fricative: d  (gin)

Lab Exp ORL KULeuven Astrid van Wieringen July Place of articulation (varies per language)

Lab Exp ORL KULeuven Astrid van Wieringen July Place of articulation Labials –bilabial: if both lips constrict –labiodental: if the lower lip contacts the upper teeth Dental: the tongue tip or blade touches the edge or back of upper teeth –interdental: if the tip protrudes between the upper and lower teeth (‘the’) Alveolar: the tongue tip or blade touches the alveolar ridge Palatals: the tongue blade or dorsum constricts with the hard palate –retroflex: if the tongue tip curls up Velar: the dorsum approaches the soft palate Uvular: the dorsum approaches the uvula Pharyngeal: constriction in the pharynx Glottal: vocal folds close or constrict

Lab Exp ORL KULeuven Astrid van Wieringen July Dutch vowel triangle

Lab Exp ORL KULeuven Astrid van Wieringen July Major acoustic cues of stop consonants /p, t, k, b, d, g/ Phonetic features –Manner: stop (plosive) –Place (bilabial, alveolar, velar) –Acoustic cues Silence (corresponds to the period of oral constriction = stop gap) »Voiced stops: low energy, also called voice bar Burst: corresponds to the articulatory release of the oral constriciton and to aerodynamic release (due to build-up of pressure). Bursts occur in initial and medial position, rarely found in final position. Place of articulation may be signaled by spectrum of burst, but –Transition is also very important. Transition corresponds to the articulatory movement from oral constriction for the stop to the more open tract for a following sound (usually vowel). Easy to identify for voiced than for voiceless sounds. Most important features: stop gap release burst presence/absence of voice onset time transition voicing features

Lab Exp ORL KULeuven Astrid van Wieringen July Duration of stop consonants Stop gap: ms Burst: 5-40 ms (a ‘transient’ = disappears immediately, shortest event in speech!) CV (consonant - vowel) and VC (vowel consonant) transitions: ms. Reflects changes in the vocal tract. Very difficult to measure/analyze such a short event. However, perceptually very important!

Lab Exp ORL KULeuven Astrid van Wieringen July Time-signals of Dutch plosives

Lab Exp ORL KULeuven Astrid van Wieringen July Time (s) Onset of / b / in / aba / Time (s) Onset of / d / in / ada / Time (s) Onset of / t / in / ata / Time (s) Onset of / p / in / apa / Initial part of Dutch plosives

Lab Exp ORL KULeuven Astrid van Wieringen July Time (s) / aba / Time (s) / apa / Time (s) / ada / Time (s) / ata / Time (s) / aka / Spectrogram of Dutch plosives

Lab Exp ORL KULeuven Astrid van Wieringen July Fricatives Phonemes: –voiced / , , ,  / –voiceless: / , , , , ,  / Phonetic features: –manner: frication –place: labiodental, linguadental, alveolar, palatal, glottal Acoustic cues: –voicing –frication noise: noise generated as air is forced through a narrow constriction. Then filtered by the vocal tract. – transitions to and from the vowels due to changes in the vocal tract –sibilants/ stridents have intense noise energy –non sibilants: weak noise energy

Lab Exp ORL KULeuven Astrid van Wieringen July Spectrograms of a few Dutch fricatives Time (s) / afa / Time (s) / ava / Time (s) / asa / Time (s) / aza /

Lab Exp ORL KULeuven Astrid van Wieringen July Nasals Phonemes: /m, n,  / Phonetic features: –manner: nasal –place: bilabial, alveolar, velar Acoustic features: –murmur: as a result of nasal radiation of acoustical energy. The spectrum is dominated by low-freq. energy (< 500 Hz). Murmur cues of three different nasals are not exactly alike, but difficult as a distinctive cue –transitions: preceding and following vowels will be nasalized. Cues to place of articulation –voicing is always present (except during whispering) Spectrum of nasals reflects a combination of formants and antiformants

Lab Exp ORL KULeuven Astrid van Wieringen July Spectrograms of a few Dutch nasals

Lab Exp ORL KULeuven Astrid van Wieringen July Glides also called ‘ approximants ’ and semivowels: –gradual articulatory movement –vocal tract narrowed, not closed Phonemes: /j/ & /w/ Phonetic features –Manner: glide or semivowel –Place: palata l or labiovelar Acoustic cues –A relatively slow transition ( ms) –F1 of both sounds starts at very low value (a little higher than for stops) –F2 of /w/: 800 Hz Compare with /b/!!, F3 of /w/: 2200 Hz –F2 of /j/: 2200 Hz (compare wih /d/!!), F3 is 3000 Hz longer glides: vowel-vowel sequences!: –[bi] - [wi]- [ui] and –[du] - [ju] - [iu]

Lab Exp ORL KULeuven Astrid van Wieringen July Spectrograms of 2 Dutch glides

Lab Exp ORL KULeuven Astrid van Wieringen July Liquids Phonemes: /l/ & /r/ Phonetic features: –Manner: lateral or rhotic –Place: alveolar for /l/, palatal for /r/ Acoustic cues: rather complex: –both relatively fast formant transitions –similarity with glides: well-defined formant structure (less constriction than stops, fricatives, and affricates) –/l/: energy mainly in the low frequencies. Resonances and antiresonances due to divided vocal tract. Resembles /n/. F1: 360 Hz, F2: 1300 Hz, F3: 2700 Hz –/r/: similar for F1 F2 somewhat lower than for /l/ F3 especially lower (1650 Hz). Durations of formant transitions somewhat longer for /r/ than for /l/ –temporal cues: /r/: F1 has a short steady-state + relatively long transition /l/: F1 has a long steady-state + relatively short transition

Lab Exp ORL KULeuven Astrid van Wieringen July Spectrograms of 2 Dutch liquids no clear distinction between vowel and consonant F3 of /r/ lower

Lab Exp ORL KULeuven Astrid van Wieringen July Speech perception assessment for the hearing-impaired

Lab Exp ORL KULeuven Astrid van Wieringen July Speech perception assessment Required for diagnostic purposes monitoring progress in a rehabilitation programme comparison of different speech processing strategies (hearing aids and ochlear implants) understand “limited” technology/number of channels available for hearing impaired or implantees –hearing aid: speech divided into frequency bands. Acoustically enhanced –cochlear implant: acoustical sound is picked up by microphone, analyzed into frequency bands, coded and sent to limited number of electrode pairs in the inner ear (electrical stimulation)

Lab Exp ORL KULeuven Astrid van Wieringen July How a cochlear implant works... (MedEL) (1)Sounds are picked up by a microphone and turned into an electrical signal. (2) This signal goes to the speech processor where it is "coded" (turned into a special pattern of electrical pulses). (3) These pulses are sent to the coil and are then transmitted across the intact skin (by radio waves) to the implant. (4) The implant sends a pattern of electrical pulses to the electrodes in the cochlea.(5) The auditory nerve picks up these tiny electrical pulses and sends them to the brain. (6) The brain recognizes these signals as sound.

Lab Exp ORL KULeuven Astrid van Wieringen July Tutorial article on cochlear implants that appeared in the IEEE Signal Processing Magazine, pages , September Introduction to cochlear implants Philipos C. Loizou

Lab Exp ORL KULeuven Astrid van Wieringen July Figure of electrode array in the cochlea... Necessary to ‘map’ (fit) acoustical information to electrical information....

Lab Exp ORL KULeuven Astrid van Wieringen July Top: Output of the CIS algorithm for the word ‘som’. Pulse channels reflect the envelopes of the bandpass filter output time (ms) amplitude per channel A

Lab Exp ORL KULeuven Astrid van Wieringen July Transmission of AMA & ASA by a CI device

Lab Exp ORL KULeuven Astrid van Wieringen July Many types of speech tests to evaluate CI performance –detection of environmental sounds –identification of male/female voice –identification of vowels and consonants (V & C) in nonsense cont. –words –sentences Each type of test triggers a different level of performance. Why is a carefully balanced V & C test important? – /paat/, /pit/, /poot/, etc., or /apa/, /ara/, /ana/, –it gives important information on the transmission of speech features via the implant and hearing aid (e.g. voiced- voicelessness, nasality of /m/ or high frequency frication/turbulence of /s/) analytical: no contextual information therefore, information can guide the fitting of an implant Analytical tests: purpose and performance

Lab Exp ORL KULeuven Astrid van Wieringen July speech stimuli should be –carefully pronounced and, if possible, adjusted to the same RMS level (so that other cues are kept in hand) –presented via hard disc of PC, CD or tape (recorded at highest level of quality) –administered to the subject in a quiet room (if presented acoustically) –presented a sufficient number of times to obtain a reliable score –Note: an analytical test does not replace other tests, but it measures speech perception based on auditory information alone. Can be used for several languages. At the Lab. Exp. ORL recordings were made of Dutch vowels and consonants in different contexts. These were carefully selected from different tokens, segmented (with an additional hamming window to avoid on- and offset clicks), equalized in RMS (root mean square) and partly analyzed (with regard to their main spectral and temporal properties). –/aCa/: /p, t, k, b, d, r, l, m, n, s, f, z, v, w, j/. –/pVt/:/oe, ie, i, oo, o, ee, e, u, aa, a/ All speech sounds were analyzed (frequency, duration, energy, …) Choice of test depends on objectives, BUT

Lab Exp ORL KULeuven Astrid van Wieringen July Confusion matrix Distribution of errors even more interesting –not random –can be quantified by means of an information transmission algorithm (Miller and Nicely, 1955)

Lab Exp ORL KULeuven Astrid van Wieringen July Effect of filtering Loss of auditory information can be examined in normal-hearing persons by filtering away acoustical information: to allow certain frequencies to be transmitted while attenuating others. –a high-pass filter allow all components above a cutoff frequency to be transmitted –a band-pass filter allows frequencies within a certain band to pass –low-pass filter allow all components below a cutoff frequency to be transmitted Demonstration of loss of acoustical cues!