CS 551/651: Structure of Spoken Language
Lecture 4: Characteristics of Manner of Articulation
John-Paul Hosom, Fall 2010


2 Self-Study
If you want to look at spectrograms of your own voice, there are several programs available:
1. Matlab
   Use the “specgram” command; the color map can be changed using “colormap gray” or similar commands.
2. CSLU Toolkit
   Download from http://www.cslu.ogi.edu/toolkit
   Free for educational use, Windows only. Plot spectrograms with the “SpeechView” tool.
3. Praat
   Download from http://www.fon.hum.uva.nl/praat/
   Free and available for Windows, Linux, Apple, etc.
4. Wavesurfer
   Download from http://www.speech.kth.se/wavesurfer/
   Free and available for Windows, Linux, Apple, etc.
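For readers who want to see what these tools compute, the idea behind a spectrogram can be sketched in a few lines. This is a minimal pure-Python illustration, not what any of the programs above does internally; the function name `spectrogram`, the Hann window, and the frame sizes are assumptions chosen for the example.

```python
import cmath
import math


def spectrogram(samples, frame_len=64, hop=32):
    """Return one list of DFT magnitudes (bins 0..frame_len//2) per frame."""
    # Hann window to reduce leakage at frame edges (an assumed, common choice).
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = [samples[start + n] * window[n] for n in range(frame_len)]
        mags = []
        for k in range(frame_len // 2 + 1):
            # Direct DFT of a single bin: slow, but dependency-free.
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            mags.append(abs(acc))
        frames.append(mags)
    return frames


# Synthetic test signal: a 1000 Hz tone sampled at 8000 Hz.
# Each bin is 8000 / 64 = 125 Hz wide, so the tone falls in bin 8.
fs = 8000
tone = [math.sin(2 * math.pi * 1000 * n / fs) for n in range(256)]
spec = spectrogram(tone)
peak_bins = [frame.index(max(frame)) for frame in spec]
```

Plotting `spec` with time on the horizontal axis and bin index on the vertical axis gives the familiar spectrogram picture; the tools listed above add faster transforms, pre-emphasis, and display scaling.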

3 Self-Study
There’s a tutorial on the web that allows you to hear the effect of different formant values:
http://www.asel.udel.edu/speech/tutorials/synthesis/ceevees.html
You can enter start time, end time, amplitude, and formant values for the beginning, middle, and end of a “syllable”, then generate a waveform and hear the result.
A great website on spectrogram reading:
http://home.cc.umanitoba.ca/~robh/
It includes “how to” tips on spectrogram reading, a monthly “mystery spectrogram”, and archives of past months’ spectrograms.

4 Two Vowels: “preempt”

5 Two Vowels: “heavy oak”

6 Two Vowels: “reapply”

7 Acoustic-Phonetic Features: Manner of Articulation
Approximately 8 manners of articulation:

Name         Sub-Types          Examples
Vowel        vowel, diphthong   aa, iy, uw, eh, ow, …
Approximant  liquid, glide      l, r, w, y
Nasal                           m, n, ng
Plosive      unvoiced, voiced   p, t, k, b, d, g
Fricative    unvoiced, voiced   f, th, s, sh, v, dh, z, zh
Affricate    unvoiced, voiced   ch, jh
Aspiration                      h
Flap                            dx, nx

Change in manner of articulation is usually abrupt and visible; manner provides much information about the location of phonemes.
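The table above can be expressed as a lookup structure, which makes the point about manner changes concrete. This is a sketch only: the names `MANNER_OF_ARTICULATION`, `PHONEME_TO_MANNER`, and `manner_changes` are hypothetical, the vowel list contains just the examples given on the slide, and the transcription of “heavy oak” is an assumption.

```python
# Manner classes and example phonemes, copied from the slide's table.
MANNER_OF_ARTICULATION = {
    "vowel": ["aa", "iy", "uw", "eh", "ow"],  # the slide's list continues with "…"
    "approximant": ["l", "r", "w", "y"],
    "nasal": ["m", "n", "ng"],
    "plosive": ["p", "t", "k", "b", "d", "g"],
    "fricative": ["f", "th", "s", "sh", "v", "dh", "z", "zh"],
    "affricate": ["ch", "jh"],
    "aspiration": ["h"],
    "flap": ["dx", "nx"],
}

# Inverted index: phoneme label -> manner class.
PHONEME_TO_MANNER = {
    phoneme: manner
    for manner, phonemes in MANNER_OF_ARTICULATION.items()
    for phoneme in phonemes
}


def manner_changes(phonemes):
    """Return the indices at which the manner differs from the previous phoneme."""
    manners = [PHONEME_TO_MANNER[p] for p in phonemes]
    return [i for i in range(1, len(manners)) if manners[i] != manners[i - 1]]


# Assumed transcription of "heavy oak": /h eh v iy ow k/.
heavy_oak = ["h", "eh", "v", "iy", "ow", "k"]
boundaries = manner_changes(heavy_oak)
```

Every boundary except the one between /iy/ and /ow/ is a change of manner, which is why the vowel-to-vowel boundary in the “Two Vowels” examples is the hard one to locate.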

8 Acoustic-Phonetic Features: Manner of Articulation
Approximants (/l/, /r/, /w/, /y/):
- vowel-like properties, but more constriction
- /l/ has tongue tip touching alveolar ridge
- /r/ has tongue tip curled up/back (retroflex), raised and “bunched” dorsum, sides of tongue touching molars
- /w/ has tongue back and lips rounded
- /y/ has tongue toward front and very high
- glides (/w/, /y/) can be viewed as “extreme” production of a vowel (sometimes called semivowels): /w/ corresponds to /uw/, /y/ corresponds to /iy/

9 Acoustic-Phonetic Features: Manner of Articulation
Approximants (/l/, /r/, /w/, /y/):
- movement of tongue slower than other vowel-to-vowel or consonant-to-vowel transitions, but not as slow as diphthong movement
- sometimes voiceless when following a voiceless plosive (“play”)
- /l/ may have slight discontinuity when tongue makes/breaks contact with alveolar ridge; other approximants have no discontinuity

10 Acoustic-Phonetic Features: Manner of Articulation
Nasal (/m/, /n/, /ng/):
- produced with velic port open and obstruction in vocal tract
- sound travels through nasal cavities
- these cavities filter speech with both poles (resonances) and zeros (anti-resonances)
- longer pathway causes primary resonance to be low (220-300 Hz)
- anti-resonances cause higher frequencies to have lower power
[Figure: spectrum of /m/ with labels F1-F6, P1, P2, Z1, Z2]
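The effect of poles and zeros on a spectrum can be illustrated with a toy digital filter. The sketch below evaluates the magnitude response of one conjugate pole pair and one conjugate zero pair. The pole frequency of 250 Hz is taken from the 220-300 Hz range above; the zero frequency of 1000 Hz, the radius, the sampling rate, and the function name `pole_zero_magnitude` are assumptions for illustration and do not model a real nasal tract.

```python
import cmath
import math


def pole_zero_magnitude(freq_hz, fs, pole_hz, zero_hz, radius=0.97):
    """Magnitude of H(z) = Z(z) / P(z) on the unit circle at freq_hz.

    Z and P are second-order sections with a conjugate root pair at
    the given center frequency and the given radius.
    """
    z_inv = cmath.exp(-1j * 2 * math.pi * freq_hz / fs)

    def second_order(center_hz):
        theta = 2 * math.pi * center_hz / fs
        return 1 - 2 * radius * math.cos(theta) * z_inv + radius ** 2 * z_inv ** 2

    return abs(second_order(zero_hz) / second_order(pole_hz))


FS = 8000          # assumed sampling rate
POLE_HZ = 250      # within the 220-300 Hz range given on the slide
ZERO_HZ = 1000     # hypothetical anti-resonance frequency

at_pole = pole_zero_magnitude(250, FS, POLE_HZ, ZERO_HZ)    # resonance peak
at_zero = pole_zero_magnitude(1000, FS, POLE_HZ, ZERO_HZ)   # anti-resonance dip
elsewhere = pole_zero_magnitude(2000, FS, POLE_HZ, ZERO_HZ)
```

The response is largest near the pole and smallest near the zero, with other frequencies in between; this is the qualitative pattern of a strong low resonance and weakened higher frequencies described for nasals.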

11 Acoustic-Phonetic Features: Manner of Articulation
Nasal (/m/, /n/, /ng/):
- formant structure obscured by pole-zero pairs
- all three English nasals look and sound similar (place of articulation has little effect on spectrum); can be distinguished primarily by coarticulatory effects on adjacent vowel(s)
- sometimes very brief duration (“camp”, “winner”)
- occasional confusion with /w/, /l/ (if F3 not visible), and closure portion of voiced plosives
- often sharp discontinuity with adjacent vowel
- adjacent vowel may be nasalized

12 Acoustic-Phonetic Features: Manner of Articulation
Plosive (Oral Stop) (/p/, /t/, /k/, /b/, /d/, /g/):
1. closure along vocal tract (lips, alveolar ridge, velum)
2. buildup of air pressure behind closure
3. release of closure
4. burst of air
5. possible aspiration following burst
- complex process, several changes over brief time span
- some context-dependent attributes, some semi-invariant ones
- voiced bursts sometimes have “voice bar” in low-frequency region, caused by vocal fold vibration with complete oral and velic closure
- sometimes voice bar is excellent cue; sometimes can be confused with a nasal

13 Acoustic-Phonetic Features: Manner of Articulation
[Figure panels: /p ah p/, /t ah t/, /k ah k/]

14 Acoustic-Phonetic Features: Manner of Articulation
Plosive (Oral Stop) (/p/, /t/, /k/, /b/, /d/, /g/):
- closure and time required to build pressure results in “silence” region of spectrum prior to burst
- burst airflow is a step function, which becomes similar to an impulse, which has equal energy at all frequencies
- identity of a plosive contained in (at least) three areas:
  (1) voice-onset-time (VOT) / duration of aspiration
  (2) formant transitions in neighboring vowels/approximants
  (3) spectral shape of burst
- “voiced” plosives may not show any real voicing (!)
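The claim that an impulse has equal energy at all frequencies can be checked numerically with a discrete Fourier transform. The sketch below also shows that the first difference of a step is an impulse; treating differentiation as the mechanism by which the step “becomes similar to an impulse” is an assumption made here for illustration, since the slide does not name the mechanism. The function name `dft_magnitudes` and the signal length are arbitrary.

```python
import cmath
import math


def dft_magnitudes(x):
    """Return the magnitude of every DFT bin of the sequence x."""
    n = len(x)
    return [abs(sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
            for k in range(n)]


N = 16

# Unit impulse at sample 3.
impulse = [0.0] * N
impulse[3] = 1.0
impulse_mags = dft_magnitudes(impulse)

# Unit step that switches on at sample 3.
step = [0.0] * 3 + [1.0] * (N - 3)
step_mags = dft_magnitudes(step)

# First difference of the step (assumed stand-in for differentiation).
step_diff = [step[0]] + [step[t] - step[t - 1] for t in range(1, N)]
```

Every entry of `impulse_mags` is 1 (up to rounding), whereas `step_mags` is concentrated at low frequencies; `step_diff` is identical to `impulse`.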

15 Acoustic-Phonetic Features: Manner of Articulation
Fricative (/f/, /th/, /s/, /sh/, /v/, /dh/, /z/, /zh/):
- fricatives produced by forcing air through a constriction in the mouth
- constriction located anywhere from the labiodental region (/f/, /v/) to palato-alveolar region (/sh/, /zh/)
- all English fricatives come in voiced and unvoiced varieties
- voicing may not be present in voiced fricatives (!), making duration an important distinguishing cue (voiced fricatives are shorter)
- the location and type of the constriction create spectral anti-resonances as well as resonances
- the main difference between /s/ and /f/ is in frequencies above 4000 Hz; telephone-band speech has a limit of 4 kHz
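One way to quantify “energy above 4000 Hz” is the fraction of spectral power that lies above a cutoff. The sketch below uses two synthetic tones as stand-ins; they are not fricatives, and the function name `high_band_fraction`, the 16 kHz sampling rate, and the tone frequencies are assumptions for illustration.

```python
import cmath
import math


def high_band_fraction(samples, fs, cutoff_hz=4000.0):
    """Fraction of total spectral power in bins strictly above cutoff_hz."""
    n = len(samples)
    total = 0.0
    high = 0.0
    for k in range(n // 2 + 1):
        acc = sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        power = abs(acc) ** 2
        total += power
        if k * fs / n > cutoff_hz:
            high += power
    return high / total if total else 0.0


FS = 16000   # assumed wideband sampling rate
N = 128      # bin width is 16000 / 128 = 125 Hz

low_tone = [math.sin(2 * math.pi * 3000 * t / FS) for t in range(N)]
high_tone = [math.sin(2 * math.pi * 6000 * t / FS) for t in range(N)]

low_fraction = high_band_fraction(low_tone, FS)    # close to 0
high_fraction = high_band_fraction(high_tone, FS)  # close to 1
```

A signal limited to 4 kHz cannot carry the component measured for `high_tone`, which is the sense in which telephone-band speech loses the cue separating /s/ from /f/.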

16 Acoustic-Phonetic Features: Manner of Articulation
Fricative (/f/, /th/, /s/, /sh/, /v/, /dh/, /z/, /zh/):
Rules for distinguishing between /dh/ and /v/:
- /dh/: formant structure is clearly visible, OR frication is stronger at 5000 Hz and not so strong at low frequencies
- /v/: formants are not visible at the location of maximum frication, OR low-frequency energy is as strong as the energy at 5000 Hz
However, due to the difficulty of distinguishing /dh/ from /v/ and distinguishing /th/ from /f/, in the spectrogram reading exercises we will treat them as the same.
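The two rules can be written as a small decision function. Because each rule is an OR of two cues, the cues can disagree; the sketch below resolves this by checking formant visibility first and falling back on the energy comparison, which is an assumed tie-break and not part of the lecture. The function name `classify_dh_or_v`, the use of dB values, and the 3 dB tolerance for “as strong as” are also assumptions.

```python
def classify_dh_or_v(formants_visible, energy_5khz_db, energy_low_db, tolerance_db=3.0):
    """Return "dh" or "v" from hand-measured spectrogram cues.

    formants_visible: True if formant structure is clearly visible during frication.
    energy_5khz_db:   frication energy near 5000 Hz, in dB.
    energy_low_db:    low-frequency energy, in dB.
    tolerance_db:     how close the two energies must be to count as
                      "as strong as" (an assumed threshold).
    """
    # Assumed tie-break: visible formant structure decides for /dh/.
    if formants_visible:
        return "dh"
    # Formants not visible: compare low-frequency energy with energy at 5000 Hz.
    if energy_low_db >= energy_5khz_db - tolerance_db:
        return "v"
    return "dh"


# Usage with made-up measurements:
clear_formants = classify_dh_or_v(True, 40.0, 50.0)
flat_energy = classify_dh_or_v(False, 40.0, 40.0)
strong_high_band = classify_dh_or_v(False, 50.0, 30.0)
```

As the slide notes, these cues are unreliable enough that the exercises merge the two labels, so a function like this is best read as a restatement of the rules, not a working classifier.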

17 Acoustic-Phonetic Features: Manner of Articulation
Affricate (/ch/, /jh/):
- Affricates are conceptually like diphthongs: two separate phonemes considered as one.
- English has two affricates: /ch/ (/t/ followed by /sh/) and /jh/ (/d/ followed by /zh/).
- Sometimes the cue to an affricate is in the burst preceding the fricative, or in the closure between vowel and fricative.
- Sometimes the cue to an affricate is in voicing or duration.
- Affricates are phonemically distinct from a stop-fricative sequence in some cases, e.g. “light ship” vs. “lye chip”. Duration of the /t/ closure, and rate of increase of frication energy, may distinguish the two cases acoustically.

18 Acoustic-Phonetic Features: Manner of Articulation
Aspiration (/h/):
- like vowels, except usually no voicing
- can usually see formant structure
- formant patterns similar to surrounding vowel(s)
Example: /ah h aw s/ = “a house”

19 Acoustic-Phonetic Features: Manner of Articulation
Flaps (/dx/, /nx/):
- allophone of /t/, /d/, or /n/
- very brief duration; no closure for /dx/ indicated by dip in energy and F2 near 1800 Hz
Example: “write another”

