Effectiveness of Visual Biofeedback in Speech Training of Children with Hearing Impairment
Elizabeth Reid, BSLT and Emily Lin, PhD

Introduction

It is well known that the limitations of a hearing-impaired child’s perceptual system can prevent the child from perceiving differences between sounds, resulting in speech production that is delayed or disordered (Ruffin-Simon, 1983). To compensate for this lack of access to auditory cues, there has been a substantial increase in the development of real-time visual feedback displays such as spectrograms. Spectrograms provide a visual representation of the frequency, intensity, and time domains of an acoustic signal (Ertmer & Maki, 2000). Unlike many other visual feedback devices, which provide feedback on a single dimension of speech, spectrographic displays can show many segmental and suprasegmental speech features simultaneously. Spectrographic displays (SDs) provide immediate and objective feedback, allowing a child to compare his or her own speech production with a correct visual template from the clinician (Dagenais, Critz-Crosby, Fletcher & McCutcheon, 1994).

Despite the growing interest in visual feedback tools, few studies have objectively examined the effectiveness of such devices. More research on their effectiveness is needed before clinicians accept them as an effective treatment tool. Therefore, the main objective of this study was to evaluate, using objective measures, the effectiveness of spectrograms compared to traditional speech training approaches for hearing-impaired children. The second objective was to describe the temporal behaviour and formant characteristics of speech produced by hearing-impaired children and to examine how these acoustic properties are related to the perceived accuracy of their speech production.
Most studies describing the speech production of hearing-impaired children have been confined to perceptual analysis of phonetic and phonological errors and acoustic analyses of temporal aspects of the speech signal. A recent study by Uchanski & Geers (2003) used spectral moment analysis to examine the acoustic energy characteristics of fricatives spoken by hearing-impaired children. Their study provided an interesting basis for further exploration of hearing-impaired children’s consonant production.

Department of Communication Disorders, University of Canterbury, Christchurch, New Zealand

Method

Subjects: 3 subjects (S1 = 12 y; S2 = 9 y; S3 = 7 y) with bilateral moderate-severe sensorineural hearing losses.

Instrumentation: headset microphone (AKG C420, Austria), mixer (Eurorack MX602A, Behringer), 12-bit A/D converter (National Instruments DAQCard-AI-16E-4, USA), SCB pin shielded connector box, with a low-pass filter (cutoff frequency = 20 kHz), and a laptop installed with TF32 (Paul Milenkovic, 2000) and Praat (Boersma & Weenink, 2005).

Procedure: Recordings were made in a quiet room with the microphone 5 cm from the mouth. Initial recordings of the Goldman Fristoe Test of Articulation were obtained. Commonly occurring error processes were identified for each subject. Training targets were chosen (S1 = Deletion of Final Consonant (DFC) and Consonant Cluster Reduction (CCR); S2 = DFC; S3 = DFC). Probe lists were developed for each target and were recorded throughout the training period. Treatment sessions of 30 minutes were carried out over 12 weeks for subject 1, 4 weeks for subject 2, and 2 weeks for subject 3. Subject 1 received traditional therapy followed by visual therapy; subjects 2 and 3 received visual therapy only. Traditional therapy involved verbal instruction with visual and tactile cues. Visual therapy used spectrogram displays of a correct production, which the subjects were required to match using real-time pitch and intensity displays before judging their own accuracy.
Subjective analysis: phonemic transcriptions of each recording.

Acoustic analysis: vowel and consonant lengths, F1 and F2, and spectral moments 1 (mean, indicating …), 2 (standard deviation, indicating …), 3 (skewness, indicating …) and 4 (kurtosis, indicating …).

Statistical Analysis:

Abstract

The effectiveness of spectrograms in speech training of hearing-impaired children was examined and compared to traditional therapy approaches. Subjective and objective analyses suggested that spectrograms were effective in improving particular speech targets. The temporal and spectral properties of speech produced by the subjects were also examined, and acoustic cues related to the perceived accuracy of their speech productions were identified. These results have the potential to provide clues to the type of compensatory feedback needed in therapy.

Discussion

Individually, all three subjects showed positive but different effects of training with spectrograms. The acoustic measures were more sensitive than subjective measures in identifying changes and highlighting differences between training approaches.

Voice onset time (VOT) decreased for all three subjects over the training period. VOT provides an important cue for the phonemic contrast between voiced plosives and their voiceless counterparts. The distinction requires fast movements of the articulators and good coordination of motor control between the larynx and the upper articulators. The reduction in VOT therefore indicates that visual training improved all subjects’ coordination of phonation and articulation, which is likely to result in improved intelligibility.

Temporal measures showed an increase in consonant cluster length for the trained target /fl/ for subject 1, but no improvement for the untrained target /pr/.
This suggests that subject 1’s awareness and production of the two components of the consonant cluster improved; however, further treatment is necessary to facilitate generalisation to other consonant clusters. Subject 2 showed an increase in final consonant length over the training period, suggesting an improved awareness and production of final consonants. These results suggest that visual training is effective in improving subjects’ awareness of the targets and their production accuracy. Conversely, subject 3 showed a decrease in final consonant length. This may be due to the small number of measures taken or the fact that he received only one session of training.

Although vowels were not targeted, subjects 1 & 2 showed an increase in vowel space following visual training. This appeared to be largely due to the improved production range of Formant 2 for S1 and Formant 1 for S2. A reduced vowel space area represents a restriction of tongue elevation and front-back tongue movement (Liu, Tsao & Kuhl, 2005). The improved vowel space following training therefore suggests that subjects 1 & 2 were producing a greater range of formant frequencies, resulting in greater distinction between vowels. Subject 3 showed a decrease in vowel space following the training period, which may be due to the shorter training period he experienced compared to the other two subjects.

A number of acoustic properties were found to differentiate the correct and incorrect speech productions. Perception of vowel accuracy was found to be related to an increased vowel space as well as shorter vowel durations. Researchers (Monsen, 1974; Gulian et al., 1983) have identified vowel prolongation as one of the speech characteristics of the hearing-impaired. In this study, vowel durations for incorrect productions were prolonged compared to those for correct productions.
A smaller vowel space was seen for incorrect productions, indicating a more restricted articulation range than for correct productions. This result is similar to Angelocci et al.’s (1964) comparison between hearing-impaired and normal-hearing speakers, in which the vowel space derived from normal data was larger than that from the abnormal comparison groups. It suggests that training aimed at expanding the vowel space could be beneficial in improving the speech intelligibility of hearing-impaired children.

Perception of consonant accuracy was most closely related to VOT for plosives, and to moment 1 (mean) and moment 3 (skewness) for fricatives, affricates and plosives. Correct plosive productions fell within a normal range of VOT measures, whereas incorrect productions were more variable and many were prolonged beyond this range. As discussed previously, VOT is an important cue for the voiced-voiceless distinction. These results suggest that a reduced VOT improves the perceived intelligibility of speech production. M1 values for incorrect consonant productions tended to be much lower than those for correct productions, suggesting that tongue placement was more posterior in incorrect productions. Since the M1 measure appeared to be sensitive in differentiating correct and incorrect consonant productions, it could be used clinically to provide feedback in speech training and to monitor progress.

Acknowledgements: This research is part of a Masters thesis which is currently being completed by the first author and directed by the second author at the University of Canterbury. Support for this research was provided by the Oticon Foundation New Zealand.

Figure: Moment 1 (mean) and Moment 2 (standard deviation) for consonant productions perceived as correct vs incorrect.
Figure: Vowel space for correct and incorrect vowel productions.
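The spectral moments named in the acoustic analysis (M1 mean, M2 standard deviation, M3 skewness, M4 kurtosis) can be illustrated with a minimal sketch. It assumes the spectrum is supplied as paired lists of bin frequencies and non-negative power values, that power is used as the weighting, and that kurtosis is reported as excess kurtosis (a normal distribution gives 0). None of these choices is stated in the poster, the function name `spectral_moments` is hypothetical, and this is not the TF32 or Praat computation used in the study.

```python
import math


def spectral_moments(freqs_hz, power):
    """Return (M1, M2, M3, M4) for a spectrum treated as a distribution.

    M1 = mean frequency, M2 = standard deviation, M3 = skewness,
    M4 = excess kurtosis (assumption: 3 is subtracted).
    """
    if len(freqs_hz) != len(power):
        raise ValueError("freqs_hz and power must have the same length")
    total = sum(power)
    if total <= 0:
        raise ValueError("spectrum has no energy")

    # Normalise the power values so that they sum to 1, like probabilities.
    weights = [p / total for p in power]

    m1 = sum(f * w for f, w in zip(freqs_hz, weights))
    variance = sum((f - m1) ** 2 * w for f, w in zip(freqs_hz, weights))
    m2 = math.sqrt(variance)
    if m2 == 0:
        raise ValueError("all energy is in one bin; M3 and M4 are undefined")

    m3 = sum((f - m1) ** 3 * w for f, w in zip(freqs_hz, weights)) / m2 ** 3
    m4 = sum((f - m1) ** 4 * w for f, w in zip(freqs_hz, weights)) / m2 ** 4 - 3.0
    return m1, m2, m3, m4


# Illustrative (made-up) three-bin spectra, not data from the study.
bins = [1000.0, 2000.0, 3000.0]
low_weighted = spectral_moments(bins, [3.0, 1.0, 1.0])
high_weighted = spectral_moments(bins, [1.0, 1.0, 3.0])
```

In this toy example the spectrum with more energy in the lowest bin has the lower M1 and a positive M3, which is the direction of the M1 difference reported above for incorrect consonant productions.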
Results

A total of 180 values (3 pitch levels X 2 vowels X 3 groups X 10 subjects) for each measure were submitted to a one-way Analysis of Variance (ANOVA) to determine whether the three subject groups differed on each measure.

An increase in vowel space was seen for subjects 1 & 2 following the training period, while subject 3 showed a slight decrease in vowel space. The increase for subject 1 was attributed mostly to an increase in the range of F2 productions, while the increase for subject 2 was due to an increase in the range of F1 productions. Calculation of the vowel working space area encompassing /i/, /a/, and /u/ showed a smaller working space area for incorrect productions than for correct productions. There was a reduction in vowel space for subject 3, which may have been due to the small number of recordings taken.

Minimal change was seen with the Goldman Fristoe recordings, which was likely due to the small number of tokens for each target in the recording. For the probe list, subject 1 showed no improvement in target processes with traditional training; however, a clear trend of improvement was seen with visual training. The Goldman Fristoe recordings for subjects 2 and 3 showed minimal change in target accuracy scores over the training period, which was likely due to the small number of tokens as well as their high accuracy scores pre-training.

Figure: Percentage of deletion of final consonant targets correct for subjects 2 & 3.
Figure: Percentage of targets correct for subject 1.

Measures of VOT displayed a downward trend for all three subjects, indicating reduced VOT over the training period. For subject 1, a reduction in VOT was seen immediately with traditional training; however, the trend was variable, making comparisons between training approaches difficult.

Figure: VOT for subjects 2 & 3.
Figure: Voice Onset Time for subject 1.

Subject 1 showed an increase in consonant cluster length for the trained /fl/ target with traditional training.
During the visual training period, the length was maintained at a similar level, with a slight drop over the period. Measures for the untreated control were variable over the training period, suggesting no treatment effect. Final consonant length for subject 2 showed an upward trend over the training period; however, the improvements were not maintained in the follow-up recording. For subject 3, only three measures of final consonant length were taken. These showed a reduction in length, but the small number of recordings is likely to affect reliability.

Figure: Consonant cluster length for subject 1.
Figure: Final consonant length for subjects 2 & 3.
Figure: Vowel space pre & post-treatment for each subject.

Those vowel productions perceived to be correct (ABS = ) had larger vowel spaces compared to those perceived as incorrect (ABS = ). Most incorrect consonant productions consistently exhibited lower M1 values than correct consonant productions, which covered a greater frequency range. All fricatives had M1 values lower than those reported for normal-hearing speakers (Fry, 2001). Incorrectly produced fricatives exhibited lower M2 values, and incorrectly produced plosives higher M2 values, than their correct counterparts, indicating that incorrectly produced fricatives and plosives tended to deviate from a normal pattern.

Conclusion

Investigation of the effectiveness of spectrographic displays suggested that spectrograms can enhance the awareness and improve the production of particular speech targets that children with hearing impairment would otherwise miss with traditional training. Results of the acoustic-perceptual investigation highlighted the usefulness of acoustic analysis in establishing a link between the hearing-impaired children’s production and perceptual deficits, thus providing clues to the type of compensatory feedback needed for aural rehabilitation.
Results also emphasize the importance of using acoustic measures in research, as they provide more detailed information and are more sensitive to changes than subjective measures.

Figure: Vowel space, pre-training and post-training (legend: Demaris, Blake, Jack).
Figure: Final consonant length for subjects 2 & 3.

Figure panel titles: Final consonant length; Consonant cluster length; % targets correct for subject 1; % DFC targets correct for subjects 2 & 3; VOT for subjects 2 & 3; VOT for subject 1.
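The vowel working space area encompassing /i/, /a/, and /u/ that is reported in the Results can be sketched as the area of the triangle formed by the three corner vowels in the F1-F2 plane. The sketch below uses the shoelace formula. The function name `vowel_space_area` is hypothetical, the formant values are made-up illustrations rather than data from the study, and the poster does not state how the area was actually calculated.

```python
def vowel_space_area(corner_formants):
    """Return the area (in Hz^2) of the polygon whose vertices are
    (F1, F2) pairs listed in order around the vowel space.

    Uses the shoelace formula; with three corner vowels this is the
    area of the vowel triangle.
    """
    n = len(corner_formants)
    if n < 3:
        raise ValueError("at least three corner vowels are required")

    twice_area = 0.0
    for k in range(n):
        f1_a, f2_a = corner_formants[k]
        f1_b, f2_b = corner_formants[(k + 1) % n]  # wrap around to the first vertex
        twice_area += f1_a * f2_b - f1_b * f2_a
    return abs(twice_area) / 2.0


# Illustrative (F1, F2) values in Hz for /i/, /a/, /u/; assumptions only.
wide_space = [(300.0, 2300.0), (800.0, 1300.0), (350.0, 900.0)]
# A more centralised set of the same vowels, again purely illustrative.
restricted_space = [(400.0, 1900.0), (650.0, 1400.0), (420.0, 1100.0)]

wide_area = vowel_space_area(wide_space)
restricted_area = vowel_space_area(restricted_space)
```

With these illustrative values the centralised vowels give the smaller area, which mirrors the pattern described above for productions perceived as incorrect.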