Predicting the Intelligibility of Cochlear-implant Vocoded Speech from Objective Quality Measure(2) Department of Electrical Engineering, The University.

Slides:



Advertisements
Similar presentations
1Reliability Introduction to Communication Research School of Communication Studies James Madison University Dr. Michael Smilowitz.
Advertisements

Frequency-response-based Wavelet Decomposition for Extracting Children’s Mismatch Negativity Elicited by Uninterrupted Sound Department of Mathematical.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Simple Linear Regression and Correlation (Part II) By Asst. Prof. Dr. Min Aung.
Perception of syllable prominence by listeners with and without competence in the tested language Anders Eriksson 1, Esther Grabe 2 & Hartmut Traunmüller.
Analysis and Synthesis of Shouted Speech Tuomo Raitio Jouni Pohjalainen Manu Airaksinen Paavo Alku Antti Suni Martti Vainio.
Communications & Multimedia Signal Processing Meeting 6 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 6 July,
Communications & Multimedia Signal Processing Report of Work on Formant Tracking LP Models and Plans on Integration with Harmonic Plus Noise Model Qin.
Statistical Relationship Between Quantitative Variables
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Math 227 Elementary Statistics Math 227 Elementary Statistics Sullivan, 4 th ed.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Describing the Relation between Two Variables 4.
Real-Time Speech Recognition Thang Pham Advisor: Shane Cotter.
Statistics for CS 312. Descriptive vs. inferential statistics Descriptive – used to describe an existing population Inferential – used to draw conclusions.
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
Correlation and Regression Analysis
Relationships Among Variables
Computational Analysis of USA Swimming Data Junfu Xu School of Computer Engineering and Science, Shanghai University.
A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora Benjamin Arai Computer Science and Engineering Department.
Correlation and Linear Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling V. Karjigi , P. Rao Dept. of Electrical Engineering,
Correlation.
Fundamentals of Data Analysis Lecture 10 Management of data sets and improving the precision of measurement pt. 2.
Chapter 3: Introductory Linear Regression
METHODOLOGY INTRODUCTION ACKNOWLEDGEMENTS LITERATURE Low frequency information via a hearing aid has been shown to increase speech intelligibility in noise.
NORTEL NETWORKS CONFIDENTIAL 2 Evaluaton of Objective Quality Estimators: Methods used with Voice Models & Implications for Video Testing Leigh Thorpe.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
Department of Communication and Electronic Engineering University of Plymouth, U.K. Lingfen Sun Emmanuel Ifeachor New Methods for Voice Quality Evaluation.
Frequency-response-based Wavelet Decomposition for Extracting Children’s Mismatch Negativity Elicited by Uninterrupted Sound Department of Mathematical.
Study of Neural Correlates of Mandarin Tonal Production with Neural Network Model Department of Electrical Engineering, National Central University, Jhongli.
APA Results Section Results.
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
Predicting the Intelligibility of Cochlear-implant Vocoded Speech from Objective Quality Measure(12) Department of Electrical Engineering, The University.
Weakly Supervised Training For Parsing Mandarin Broadcast Transcripts Wen Wang ICASSP 2008 Min-Hsuan Lai Department of Computer Science & Information Engineering.
Katherine Morrow, Sarah Williams, and Chang Liu Department of Communication Sciences and Disorders The University of Texas at Austin, Austin, TX
1 Data Analysis Linear Regression Data Analysis Linear Regression Ernesto A. Diaz Department of Mathematics Redwood High School.
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Frequency-response-based Wavelet Decomposition for Extracting Children’s Mismatch Negativity Elicited by Uninterrupted Sound Department of Mathematical.
Predicting the Intelligibility of Cochlear-implant Vocoded Speech from Objective Quality Measure(1) Department of Electrical Engineering, The University.
Frequency-response-based Wavelet Decomposition for Extracting Children’s Mismatch Negativity Elicited by Uninterrupted Sound Department of Mathematical.
Scatter Plots. Scatter plots are used when data from an experiment or test have a wide range of values. You do not connect the points in a scatter plot,
Chapter 7: The Distribution of Sample Means
Free Powerpoint Templates ROHANA BINTI ABDUL HAMID INSTITUT E FOR ENGINEERING MATHEMATICS (IMK) UNIVERSITI MALAYSIA PERLIS.
Project VIABLE - Direct Behavior Rating: Evaluating Behaviors with Positive and Negative Definitions Rose Jaffery 1, Albee T. Ongusco 3, Amy M. Briesch.
Chapter 11 Simple Linear Regression and Correlation.
Objectives Fit scatter plot data using linear models.
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
Basic Estimation Techniques
Chindamanee School English Program
Correlational Studies
Regression Chapter 6 I Introduction to Regression
Speech Enhancement with Binaural Cues Derived from a Priori Codebook
Chapter 11: Simple Linear Regression
Chapter 5 STATISTICS (PART 4).
SIMPLE LINEAR REGRESSION MODEL
Understanding Standards Event Higher Statistics Award
Basic Estimation Techniques
2.6 Draw Scatter Plots and Best-Fitting Lines
Satoru Suzuki, Marcia Grabowecky  Neuron 
Introduction to Regression
Satoru Suzuki, Marcia Grabowecky  Neuron 
An Introduction to Correlational Research
Inferential Statistics
15.1 The Role of Statistics in the Research Process
Ch 4.1 & 4.2 Two dimensions concept
BEC 30325: MANAGERIAL ECONOMICS
Presentation transcript:

Predicting the Intelligibility of Cochlear-implant Vocoded Speech from Objective Quality Measure(2) Department of Electrical Engineering, The University of Texas at Dallas, Richardson, Texas75083, USA Received 8 Jan 2011; Accepted 27 May 2011; doi: /jmbe.885 Chairman:Hung-Chi Yang Presenter: Yu-Kai Wang Advisor: Dr. Yeou-Jiunn Chen Date:

Outline  Material and Methods  Results  Conclusions

Material and Methods  2.4 Procedure  Environment  In a sound-proof room , using a PC connected to a Tucker- Davis System 3 workstation.  Stimuli were played to listeners monaurally through  A Sennheiser HD 250 Linear II circumaural headphone at a comfortable listening level.

Material and Methods  Conditions  Participated in a 10-minute training and session(Prior to the test).  Listened to a set of tone-vocoded and EAS-vocoded stimuli to familiarize.  Subjects were asked to write down all thewords they had heard.

Material and Methods  Experiments  Experiments 1 and 2 listened to vocoded English stimuli  Total of 30 and 24 test conditions,respectively  Experiment 3 listened to vocoded Mandarin Chinese stimuli  total of 20 test conditions,respectively  Twenty sentences, and none of the sentences were repeated.  The order of the test conditions was randomized.  Subjects were given a 5-minute break every 30 minutes during the testing session..

Material and Methods  Used objective speech quality measures for predicting the intelligibility of vocoded speech in noisy conditions.  Vocoded English  Vocoded Mandarin Chinesein  This study investigated the performance.  The linear predictive coding (LPC)  The log-likelihood ratio (LLR)  The Itakura-Saito (IS)  The cepstrum (CEP) distance measures

Material and Methods  Evaluated the performance of the time-domain segmental.  SNR measure (SNRseg)  The frequency-weighted SNR measure (fwSNRseg)  The weighted spectral slope (WSS) measure  The above measures are primarily based on.  The premise that speech quality can be modeled in terms of differences in loudness between the original and processed signals.  In terms of differences in the spectral envelopes.  (e.g., as computed using an LPC model) between the original and processed signals.

Material and Methods  The PESQ measure.  Assesses speech quality by estimating the overall loudness difference between the noise-free and processed signals.

Results  Two statistical measures were used to assess the performance of the tested speech quality measures.  Pearson’s correlation coefficient (r)  An estimate of the standard deviation of the prediction error.  The average intelligibility scores obtained.  NH listeners were subjected to correlation analysis with the corresponding values.  The objective measures  Scores obtained for a total of 54 and 20 test conditions were included in the correlation analysis. (Shown in Table 1)Table 1  Vocoded English.  Vocoded Mandarin Chinese.

Results  Figure 1 shows sentence intelligibility scores  54 vocoded test conditions of English sentences.  20 vocoded test conditions of Mandarin Chinese sentences.

Results  Table 3 shows. Table 3  The resulting correlation coefficients and predictions errors  Vocoded English  The PESQ measure had the highest correlation (r = 0.91) (see scatter plot in Fig. 1(a)). Fig. 1(a)  vocoded Mandarin Chinese  Confirms the results of Chen and Loizou  The other conventional objective measures (IS, SNRseg, and fwSNRseg) was good (r = 0.63~0.69).  The LLR, CEP, and WSS measures was quite poor (r = -0.48~- 0.57).

Results  See scatter plot in Fig. 1(b)  The PESQ measure  A very low correlation (r = 0.48)  The WSS measure  A high intelligibility prediction with a correlation of (r = -0.87)  The other conventional objective measures  Poorly predicted the intelligibility of vocoded Mandarin Chinese (r = -0.40~0.42).

Results  Table 3 also shows the correlation coefficients  EAS-vocoder (EAS in the table)).  The PESQ measure consistently well predicts the intelligibility of both  tone-vocoded (r = 0.92)  EAS-vocoded (r = 0.90) English sentences.  The WSS measure yielded a high correlation with the intelligibility of tone-vocoded English sentences (r = -0.85).  This might be due to the relatively small number of tone-vocoder conditions

Results  The WSS measure was highly correlated with the intelligibility of Mandarin Chinese sentences  When processed by tone-vocoder(r = -0.90) and EAS-vocoder (r = -0.92).

Results

Conclusion  Assessed the performance of several objective measures of predicting the intelligibility of vocoded speech.  PESQ, highly correlated with the intelligibility score.  Most objective measures performed modestly well or poorly in predicting the intelligibility of vocoded speech.  The language effect was found to have an impact on the performance of predicting the intelligibility of vocoded speech.