Download presentation
Presentation is loading. Please wait.
Published byAubrey Roberts Modified over 9 years ago
1
CEPSTRAL ANALYSIS Cepstral analysis synthesis on the mel frequency scale, and an adaptative algorithm for it. Cecilia Caruncho Llaguno
2
Sources Cepstral analysis on the mel frequency scale – Satoshi Imai - Tokio Institute of Technology, 1983 An adaptative algorithm for mel-cepstral analysis of speech – Toshiako Fukada - Canon Inc. Kawasaki, 1992 – Keeichi Tokuda, Takao Kobayasi, and Satoshi Imai - Tokio Institute of Technology, 1992
3
Basic Concepts Cepstral Analysis – Definition Definition – Features Features Mel frequency scale
4
Cepstral analysis Main features – Good characteristics for representation – Log spectral envelope → accurate & efficient – Small sensitivity & quantization noise – Small spectral distortion – LMA filter → high quality speech synthesis
5
Cepstral analysis Complex logarithm Inverse Z transform In unit circle |z|<1
6
Mel frequency scale Human hearing sense → non-linear frequency scale Linear up to 1000 Hz, logarithmic above.
7
Mel cepstral analysis system
8
Spectral envelope extraction by the improved cepstral method Approximation of the mel scale
9
Spectral envelope extraction by the improved cepstral method Former method: – Fine structure → The spectral envelope is not suficiently separated from the pitch parameter Present method: – Can extract the envelope without being affected by the fine structure.
10
Mel Log Spectrum Approximation filter Why do we use it? – High quality – Simple – Coefficient sensitivities – Quantization characteristics Transfer function Quantization of the filter parameterfilter parameter
11
MLSA transfer function Ideal Basic filter:
12
MLSA transfer function Ideal MLSA filterNot realizablePadé approximation:
13
Filter parameters
14
Data rate Filter coefficients → bounded Digitalization → quantizer q → data amount b s (bits/frame)
15
Data rate Spectral envelope: b s bits/frame Pitch parameter: b p bits/frame Period of transmission: T seconds Averall bit rate of this system: B (bits/second)
16
Data rate Speech quality T (ms)MqBp (bit)B (kbits/s)Speech quality 15110.2574Very high 2080.572Fairly good 2550.561.2Still good
17
Spectral distortion Distortion caused by the interpolation Distortion caused by the quantization
18
Spectral estimation based on mel-cepstral representation Model spectrum
19
Spectral estimation based on mel-cepstral representation Unbiased Estimator of Log Spectrum by S. Imai and C. Furuichi → minimization of ε
20
Spectral estimation based on mel-cepstral representation Newton-Raphson method:
21
Adaptative mel-cepstral analysis algorithm H → Unit matrix → μ... adaptation step size ε (n)... estimate of ε at time n e(n) → output of the inverse filter 1/D(z) at time n →
22
Adaptative mel-cepstral analysis algorithm
23
Conclusions MLSA – Simple – Good stathistical features – Small spectral distortions Adaptative algorithm – Computationally efficient – Fast convergence properties
24
Questions? Thank you for your attention Muchas gracias por su atención
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.