Complete Discrete Time Model Complete model covers periodic, noise and impulsive inputs. For periodic input 1) R(z): Radiation impedance. It has been shown.

Slides:



Advertisements
Similar presentations
ECE 8443 – Pattern Recognition EE 3512 – Signals: Continuous and Discrete Objectives: Response to a Sinusoidal Input Frequency Analysis of an RC Circuit.
Advertisements

Acoustic Characteristics of Consonants
Digital Signal Processing IIR Filter IIR Filter Design by Approximation of Derivatives Analogue filters having rational transfer function H(s) can be.
Nonrecursive Digital Filters
Chapter 2 Data and Signals
Filtering Filtering is one of the most widely used complex signal processing operations The system implementing this operation is called a filter A filter.
ACOUSTICAL THEORY OF SPEECH PRODUCTION
Itay Ben-Lulu & Uri Goldfeld Instructor : Dr. Yizhar Lavner Spring /9/2004.
Time-domain Crosstalk The equations that describe crosstalk in time-domain are derived from those obtained in the frequency-domain under the following.
It was assumed that the pressureat the lips is zero and the volume velocity source is ideal  no energy loss at the input and output. For radiation impedance:
Properties of continuous Fourier Transforms
Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.
Anatomic Aspects Larynx: Sytem of muscles, cartileges and ligaments.
Analysis & Synthesis The Vocoder and its related technology.
Review last lectures.
F a c u l t é P o l y t e c h n i q u e d e M o n s Mixed-Phase Speech Modeling and Formant Estimation Using Differential Phase Spectrums Baris Bozkurt,
Chapter 15 Fourier Series and Fourier Transform
Power Amplifiers Power Amplifiers are used in the transmitter
Frequency Characteristics of AC Circuits
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
EE513 Audio Signals and Systems Digital Signal Processing (Systems) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Module 2: Representing Process and Disturbance Dynamics Using Discrete Time Transfer Functions.
Alternating Current Circuits
Lecture 1 Signals in the Time and Frequency Domains
Fundamentals of Electric Circuits Chapter 11
Kent Bertilsson Muhammad Amir Yousaf. DC and AC Circuit analysis  Circuit analysis is the process of finding the voltages across, and the currents through,
Sensitivity System sensitivity is defined as the available input signal level Si for a given (SNR)O Si is called the minimum detectable signal An expression.
MUSIC 318 MINI-COURSE ON SPEECH AND SINGING
Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.
Module 2 SPECTRAL ANALYSIS OF COMMUNICATION SIGNAL.
Wireless and Mobile Computing Transmission Fundamentals Lecture 2.
1 Linear Prediction. Outline Windowing LPC Introduction to Vocoders Excitation modeling  Pitch Detection.
1 Z-Transform. CHAPTER 5 School of Electrical System Engineering, UniMAP School of Electrical System Engineering, UniMAP NORSHAFINASH BT SAUDIN
Fundamentals of Electric Circuits Chapter 18 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Structure of Spoken Language
Speech Science VI Resonances WS Resonances Reading: Borden, Harris & Raphael, p Kentp Pompino-Marschallp Reetzp
Eeng Chapter 5 AM, FM, and Digital Modulated Systems  Phase Modulation (PM)  Frequency Modulation (FM)  Generation of PM and FM  Spectrum of.
Chapter 31 Lecture 33: Alternating Current Circuits: II HW 11 (problems): 30.58, 30.65, 30.76, 31.12, 31.26, 31.46, 31.56, Due Friday, Dec 11. Final.
1 TOPIC 4: FREQUENCY SELECTIVE CIRCUITS. 2 INTRODUCTION Transfer Function Frequency Selective Circuits.
More On Linear Predictive Analysis
Z Transform The z-transform of a digital signal x[n] is defined as:
By Sarita Jondhale 1 Signal preprocessor: “conditions” the speech signal s(n) to new form which is more suitable for the analysis Postprocessor: operate.
Topics 1 Specific topics to be covered are: Discrete-time signals Z-transforms Sampling and reconstruction Aliasing and anti-aliasing filters Sampled-data.
Linear Prediction.
A sinusoidal current source (independent or dependent) produces a current That varies sinusoidally with time.
Digital Signal Processing Lecture 6 Frequency Selective Filters
Eeng Chapter4 Bandpass Signalling  Bandpass Filtering and Linear Distortion  Bandpass Sampling Theorem  Bandpass Dimensionality Theorem  Amplifiers.
Small-Scale Fading Prof. Michael Tsai 2016/04/15.
Chapter 2 Ideal Sampling and Nyquist Theorem
Eeng Chapter 5 AM, FM, and Digital Modulated Systems  Phase Modulation (PM)  Frequency Modulation (FM)  Generation of PM and FM  Spectrum of.
Eeng Chapter4 Bandpass Signalling  Bandpass Filtering and Linear Distortion  Bandpass Sampling Theorem  Bandpass Dimensionality Theorem  Amplifiers.
Youngjune, Han Chapter 4 Time Response Youngjune, Han
Digital Control Systems (DCS)
TOPIC 3: FREQUENCY SELECTIVE CIRCUITS
EE Audio Signals and Systems
Fourier Transform Analysis of Signals and Systems
Chapter4 Bandpass Signalling Bandpass Filtering and Linear Distortion
Fundamentals of Electric Circuits Chapter 14
Chapter 2 Ideal Sampling and Nyquist Theorem
Fundamentals of Electric Circuits Chapter 18
Physical Layer Part 1 Lecture -3.
Finite Wordlength Effects
Linear Prediction.
Chapter 8 The Discrete Fourier Transform
Chapter4 Bandpass Signalling Bandpass Filtering and Linear Distortion
Chapter 4. Time Response I may not have gone where I intended to go, but I think I have ended up where I needed to be. Pusan National University Intelligent.
Chapter 8 The Discrete Fourier Transform
Speech Processing Final Project
Chapter 3 Modeling in the Time Domain
DIGITAL CONTROL SYSTEM WEEK 3 NUMERICAL APPROXIMATION
Presentation transcript:

Complete Discrete Time Model Complete model covers periodic, noise and impulsive inputs. For periodic input 1) R(z): Radiation impedance. It has been shown that R(z) can be approximated as R(z) = 1 - z -1 or R(z) = 1 -  z -1  differentiator.

Therefore in continuous time it can be written that  R(z) can be moved to the glottis in the previous figure 2) G(z): z-transform of glottal flow input, g[n] over one cycle. It can be approximated by If  <1  two identical poles outside the unit circle, two zeroes at infinity (maximum phase) 3) V(z): all pole vocal-tract function. Complete Discrete Time Model

Therefore V(z) and R(z) are minimum phase. G(z) is maximum phase. Some related work Zeros of Z-Transform (ZZT) Decomposition of Speech For Source-Tract Separation Baris Bozkurt, Boris Doval, Christophe D’Alessandro, Thierry Dutoit This study proposes a new spectral decomposition method for source-tract separation. It is based on a new spectral representation called the Zeros of Z-Transfor m (ZZT), which is an all-zero representation of the z-transform of the signal. We show that separate patterns exist in ZZT representations of speech signals for the glottal flow and the vocal tract contributions. The ZZT-decomposition is simply composed of grouping the zeros into two sets, according to their location in the z-plane. This type of decomposition leads to separating glottal flow contribution (without a return phase) from vocal tract contributions in z domain. Complete Discrete Time Model

A Method For Glottal Formant Frequency Estimation Baris Bozkurt, Boris Doval, Christophe D’Alessandro, Thierry Dutoit This study presents a method for estimation of glottal formant frequency (Fg) from speech signals. Our method is based on zeros of z-transform decomposition of speech spectra into two spectra : glottal flow dominated spectrum and vocal tract dominated spectrum. Peak picking is performed on the amplitude spectrum of the glottal flow dominated part. The algorithm is tested on synthetic speech. It is shown to be effective especially when glottal formantand first formant of vocal tract are not too close. In addition, tests on a real speech example are also presented where open quotient estimates from EGG signals are used as reference and correlated with the glottal formant frequency estimates. Improved Differential Phase Spectrum Processing For Formant Tracking Baris Bozkurt, Boris Doval, Christophe D’Alessandro, Thierry Dutoit This study presents an improved version of our previously introduced formant tracking algorithm. The algorithm is based on processing the negative derivative of the argument of the chirp-z transform (termed as the differential phase spectrum) of a given speech signal. No modeling is included in the procedure but only peak picking on differential phase spectrum. We discuss the effects of roots of z- transform to differential phase spectrum and the need to ensure that all Zeros are at some distance from the circle where chirp-z transform is computed. For that, we include an additional zero- decomposition step in our previously presented algorithm to improve its robustness. The final version of the algorithm is tested for analysis of synthetic speech and real speech signals and compared to two other formant tracking systems. Complete Discrete Time Model

If the differentiation at the otput (radiation impedance) is applied to the glottal flow. Derivative of glottal flow is more like pulse ! Complete Discrete Time Model Glottal flow Glottal flow derivative

NOISE INPUT IMPULSE INPUT The combination of three inputs may be linear or nanlinear ! Complete Discrete Time Model

OTHER ZEROS OF THE VOCAL-TRACT In the noise and impulse source states oral tract constrictions may give zeros as well as poles (absorption of energy by cavity anti-resonances)  V(z) may have zeros   Vocal tract function is generally mixed phase.  Maximum phase elements of the vocal tract can also contribute to a more gradual attack of the speech waveform. The modeling described here is called the SOURCE-FILTER MODEL of speech production. Complete Discrete Time Model

In the source filter model, it is assumed that glottal input is infinite and glottal airflow is not influenced by the vocal tract. However the pressure in the vocal tract cavity above glottis backs up (resists) against the glottal flow. Vocal Fold and Vocal Tract Interaction

Electrical analog is shown below P sg : subglottal (lung) pressure p(t): sound pressure corresponding to a single first formant in front of glottis.(because it has been found that other formants have negligible effect on glottal flow.) Z g (t): time varying impedance of the glottis. R,L,C: these parameters model first formant with formant frequency (center frequency) bandwidth (3dB) Vocal Fold and Vocal Tract Interaction

Z g (t) accounts for the interaction between the glottal flow and vocal tract. If Z g (t) is comparable to the impedance of 1 st formant then there will be considerable interaction and Ω 0, B 0 will be affected. Also Z g (t) has been found to be nonlinear: k=1.1 A(t) smallest time-varying area of glottal slit. Equations are nonlinear and time-varying. Vocal Fold and Vocal Tract Interaction

Numericval solution of the above equations reveals that the skewness of glottal flow is due to in part A(t) and in part to the loading effect of the first formant. Numerical solution also yielded a ripple component. Vocal Fold and Vocal Tract Interaction Glottal flow derivative

The problem can approximately be analyzed by linearizing the differential equation. Taylor series of if x<<1. Vocal Fold and Vocal Tract Interaction

where By differentiation Vocal Fold and Vocal Tract Interaction

Corresponding Norton equivalent circuit is Where U sc (t) is now time-varying source. Vocal Fold and Vocal Tract Interaction

Formant Frequency and Bandwidth Modulation Because we have a linear but time-varying equation, formant frequency and bandwidth are time-varying i.e. they are modulated  Laplace transform does not apply. But the equaton can be solved for each time instantas a constant coefficient equation. g 0 (t) is proportional to A(t) (glottal area)  bandwidth is proportional to glottal area ( B 1 (t)  B 0 since A(t)  0 )  formant is proportional to the derivative of glottal area ( Ω 1 (t) may be aboveor below Ω 0 ) Vocal Fold and Vocal Tract Interaction formant bandwidth

Glottal area Bandwidth Formant factor

In the minimum bandwidth modulation cases ( /i/, /u/ ) B 1 (t) increases by a factor of 3 to 4. Multiplier of Ω 1 (t)  0.8 ~ 1.2 Conclusions The increase of B 1 (t) within a glottal cycle yields the truncation of glottal flow (sharp closing of folds. It is due to a decrease in the impedance at the glottis as glottis opens. Reduced glottal impedance Z g (t) yields pressure drop accros glottis. Vocal Fold and Vocal Tract Interaction

Truncation Effect (Using Klatt Synthesiser)

Truncation Effect