Download presentation
Presentation is loading. Please wait.
Published byRudolph Fleming Modified over 9 years ago
1
Digital Systems: Hardware Organization and Design
4/15/2017 Speech Recognition Speech Sounds of American English Architecture of a Respresentative 32 Bit Processor
2
Speech Sounds of American English
Digital Systems: Hardware Organization and Design 4/15/2017 Speech Sounds of American English There are over 40 speech sounds in American English which can be organized by their basic manner of production Vowels, glides, and consonants differ in degree of constriction Sonorant consonants have no pressure build up at constriction Nasal consonants have no pressure build up at constriction Continuant consonant do not block airflow in oral cavity Manner Class Number Vowel 18 Fricatives 8 Stops 6 Nasals 3 Semivowels 4 Affricates 2 Aspirant 1 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
3
Phonemes of American English
Digital Systems: Hardware Organization and Design 4/15/2017 Phonemes of American English 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
4
Phonetic Alphabets Reference
15 April 2017 Veton Këpuska
5
Digital Systems: Hardware Organization and Design
4/15/2017 Vowel Production No significant constriction in the vocal tract Usually produced with periodic excitation Acoustic characteristics depend on the position of the jaw, tongue, and lips 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
6
Vowels of American English
Digital Systems: Hardware Organization and Design 4/15/2017 Vowels of American English There are approximately 18 vowels in American English made up of monothongs, diphthongs, and reduced vowels (schwa’s) They are often described by the articulatory features: High/Low, Front/Back, Retroflexed, Rounded, and Tense/Lax 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
7
Spectrograms of the Cardinal Vowels
Digital Systems: Hardware Organization and Design 4/15/2017 Spectrograms of the Cardinal Vowels 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
8
Vowel Formant Averages
Digital Systems: Hardware Organization and Design 4/15/2017 Vowel Formant Averages Vowels are often characterized by the lower three formants: High/Low is correlated with the first formant, F1 Front/Back is correlated with the second formant, F2 Retroflexion is marked by a low third formant, F3 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
9
Digital Systems: Hardware Organization and Design
4/15/2017 Vowel Durations Each vowel has a different intrinsic duration Schwa’s have distinctly shorter durations (50ms) /I, ℇ, Λ, Ʊ/ are the shortest monothongs Context can greatly influence vowel duration 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
10
Happy Little Vowel Chart
Digital Systems: Hardware Organization and Design 4/15/2017 Happy Little Vowel Chart Robb's "So inaccurate, yet so useful." 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
11
Digital Systems: Hardware Organization and Design
4/15/2017 Fricative Production Turbulence produced at narrow constriction Constriction position determines acoustic characteristics Can be produced with periodic excitation 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
12
Fricatives of American English
Digital Systems: Hardware Organization and Design 4/15/2017 Fricatives of American English There are 8 fricatives in American English Four places of articulation: Labio-Dental (Labial), Interdental (Dental), Alveolar, and Palato-Alveolar (Palatal) They are often described by the features Voiced/Unvoiced, or Strident/Non-Strident (constriction behind alveolar ridge) 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
13
Spectrograms of Unvoiced Fricatives
Digital Systems: Hardware Organization and Design 4/15/2017 Spectrograms of Unvoiced Fricatives 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
14
Digital Systems: Hardware Organization and Design
4/15/2017 Fricative Energy Strident fricatives tend to be stronger than non-strident fricatives. 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
15
Digital Systems: Hardware Organization and Design
4/15/2017 Fricative Durations Voiced fricatives tend to be shorter than unvoiced fricatives. 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
16
Examples of Fricative Voicing Contrast
Digital Systems: Hardware Organization and Design 4/15/2017 Examples of Fricative Voicing Contrast 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
17
Friendly Little Consonant Chart
Digital Systems: Hardware Organization and Design 4/15/2017 Robb's Friendly Little Consonant Chart "Somewhat more accurate, yet somewhat less useful." 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
18
Digital Systems: Hardware Organization and Design
4/15/2017 What is this word? 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
19
Digital Systems: Hardware Organization and Design
4/15/2017 Stop Production Complete closure in the vocal tract, pressure build up Sudden release of the constriction, turbulence noise Can have periodic excitation during closure 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
20
Stops of American English
Digital Systems: Hardware Organization and Design 4/15/2017 Stops of American English There are 6 stop consonants in American English Three places of articulation: Labial, Alveolar, and Velar Each place of articulation has a voiced and unvoiced stop Unvoiced stops are typically aspirated Voiced stops usually exhibit a “voice-bar’’ during closure Information about formant transitions and release useful for classification 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
21
Spectrograms of Unvoiced Stops
Digital Systems: Hardware Organization and Design 4/15/2017 Spectrograms of Unvoiced Stops 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
22
Examples of Stop Voicing Contrast
Digital Systems: Hardware Organization and Design 4/15/2017 Examples of Stop Voicing Contrast 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
23
Singleton Stop Durations
Digital Systems: Hardware Organization and Design 4/15/2017 Singleton Stop Durations 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
24
Digital Systems: Hardware Organization and Design
4/15/2017 Voicing Cues for Stops 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
25
Digital Systems: Hardware Organization and Design
4/15/2017 /s/-Stop Durations 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
26
Examples of Front and Back Velars
Digital Systems: Hardware Organization and Design 4/15/2017 Examples of Front and Back Velars 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
27
Digital Systems: Hardware Organization and Design
4/15/2017 What is this word? 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
28
Digital Systems: Hardware Organization and Design
4/15/2017 Nasal Production Velum lowering results in airflow through nasal cavity Consonants produced with closure in oral cavity Nasal murmurs have similar spectral characteristics 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
29
Nasal of American English
Digital Systems: Hardware Organization and Design 4/15/2017 Nasal of American English Three places of articulation: Labial, Alveolar, and Velar Nasal consonants are always attached to a vowel, though can form an entire syllable in unstressed environments ([n], [m], [ŋ]) /ŋ/ is always post-vocalic in English Place identified by neighboring formant transitions 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
30
Spectrograms of Nasals
Digital Systems: Hardware Organization and Design 4/15/2017 Spectrograms of Nasals 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
31
Digital Systems: Hardware Organization and Design
4/15/2017 What is this word? 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
32
Digital Systems: Hardware Organization and Design
4/15/2017 Semivowel Production Constriction in vocal tract, no turbulence Slower articulatory motion than other consonants Laterals form complete closure with tongue tip, airflow via sides of constriction 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
33
Semivowels of American English
Digital Systems: Hardware Organization and Design 4/15/2017 Semivowels of American English There are 4 semivowels in American English Sometimes referred to as Liquids or Glides Glides are a more extreme articulation of a corresponding vowel Similar, though more extreme, formant positions Generally weaker due to narrower constriction Semivowels are always attached to a vowel, though /l/ can form an entire syllable in unstressed environments ([l]) 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
34
Spectrograms of Semivowels
Digital Systems: Hardware Organization and Design 4/15/2017 Spectrograms of Semivowels 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
35
Acoustic Properties of Semivowels
Digital Systems: Hardware Organization and Design 4/15/2017 Acoustic Properties of Semivowels /w/ and /l/ are the most confusable semivowels /w/ is characterized by a very low F1, F2 Typically a rapid spectral fallo above F2 /l/ is characterized by a low F1 and F2 Often presence of high frequency energy Postvocalic /l/ characterized by minimal spectral discontinuity, gradual motion of formants /y/ is characterized by very low F1, very high F2 /y/ only occurs in a syllable onset position (i.e., pre-vocalic) /r/ is characterized by a very low F3 Prevocalic F3 < medial F3 < postvocalic F3 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
36
Digital Systems: Hardware Organization and Design
4/15/2017 What is this word? 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
37
Digital Systems: Hardware Organization and Design
4/15/2017 Affricate Production There are two affricates in American English: Alveolar-stop palatal-fricative pairs Sudden release of the constriction, turbulence noise Can have periodic excitation during closure 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
38
Digital Systems: Hardware Organization and Design
4/15/2017 Aspirant Production There is one aspirant in American English: /h/ (e.g., “hat’’) Produced by generating turbulence excitation at glottis No constriction in the vocal tract, normal formant excitation Sub-glottal coupling results in little energy in F1 region Periodic excitation can be present in medial position 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
39
Spectrograms of Affricates and Aspirant
Digital Systems: Hardware Organization and Design 4/15/2017 Spectrograms of Affricates and Aspirant 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
40
Digital Systems: Hardware Organization and Design
4/15/2017 What is this word? 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
41
Phonotactic Constraints
Digital Systems: Hardware Organization and Design 4/15/2017 Phonotactic Constraints Phonotactics is the study of allowable sound sequences Analyses of word-initial and -final clusters reveal: 73 distinct initial clusters (about 10 “foreign’’ clusters) 208 distinct final clusters Can be used to eliminate impossible phoneme sequences: /tk/ can’t end a word, and /kt/ can’t begin a word, Therefore, */… t k t …/ is an impossible sequence 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
42
Word-Initial Consonants from MWP Dictionary
Digital Systems: Hardware Organization and Design 4/15/2017 Word-Initial Consonants from MWP Dictionary 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
43
Digital Systems: Hardware Organization and Design
4/15/2017 The Syllable Syllable structure captures many useful generalizations Phoneme realization often depends on syllabification Many phonological rules depend on syllable structure Syllable structure is predicated on the notion of ranking the speech sounds in terms of their sonority values 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
44
Syllables and Sonority
Digital Systems: Hardware Organization and Design 4/15/2017 Syllables and Sonority Utterances can be divided into syllables The number of syllables equals the number of sonority peaks Within any syllable, there is a segment constituting a sonority peak that is preceded and/or followed by a sequence of segments with progressively decreasing sonority values 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
45
Digital Systems: Hardware Organization and Design
4/15/2017 The Syllable Template Branches marked by “o” are optional Nucleus must contain a non-obstruent Sonority decreases away from nucleus Affix contains only coronals: Only the last syllable in a word can have an affix /sp/, /st/, and /sk/ are treated as single obstruents 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
46
Digital Systems: Hardware Organization and Design
4/15/2017 Some Examples 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
47
Words Containing /r/ and /l/
Digital Systems: Hardware Organization and Design 4/15/2017 Words Containing /r/ and /l/ 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
48
Acoustic Realizations of /r/
Digital Systems: Hardware Organization and Design 4/15/2017 Acoustic Realizations of /r/ 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
49
Acoustic Realizations of /l/
Digital Systems: Hardware Organization and Design 4/15/2017 Acoustic Realizations of /l/ 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
50
Allophonic Variations at Syllable Boundaries
Digital Systems: Hardware Organization and Design 4/15/2017 Allophonic Variations at Syllable Boundaries 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
51
Digital Systems: Hardware Organization and Design
4/15/2017 References “Acoustics of American English Speech – A Dynamic Approach”, J. Olive, A. Greenwood, and J. Coleman, Springer-Verlag 1993. “Articulatory-Acoustic-Auditory Relationships”, Kenneth Stevens, in The Handbook of Phonetic Sciences, Ed. William Hardcastle and John Laver, Blackwell Publishers, 1997. 15 April 2017 Veton Këpuska Architecture of a Respresentative 32 Bit Processor
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.