Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone Joseph Tabrikian Dept. of Electrical and Computer Engineering Ben-Gurion.

Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone Joseph Tabrikian Dept. of Electrical and Computer Engineering Ben-Gurion University of the Negev Workshop on: Speech Enhancement and Multichannel Audio Processing Technion 22.2.2007 BGU

Outline  Motivation  Single source pitch estimation and tracking  Multiple source pitch estimation and tracking  Experiments  Conclusion BGU

Motivation  Speech enhancement  Sensitivity of many audio processing algorithms to interference. For example: Automatic speech/speaker recognition Speech/music compression  Single microphone blind source separation (BSS)  Karaoke BGU

Single Source - Modeling  Voice frames - harmonic model: additive Gaussian noise  In matrix notation: BGU

Single Source – Pitch Tracking  Maximum Likelihood (ML) estimator:  Pitch tracking: The data vector at the m th frame: - first-order Markov process:  Maximum A-posteriori Probability (MAP) pitch tracking via the Viterbi algorithm. (Tabrikian-Dubnov-Dickalov 2004) BGU

Single Source - Voicing Decision  Unvoiced model Colored Gaussian noise model:  Voiced/unvoiced decision by the Generalized Likelihood Ratio Test (GLRT): BGU (Fisher-Tabrikian-Dubnov 2006)

Multiple Sources  ML estimator of from under the model: with unknown signal and unknown (Gaussian) noise covariance: BGU (Harmanci-Tabrikian-Krolik 2000)

Multiple Sources  Voiced model: v includes other interferences. is unknown.  Using J overlapping subframes of size L s (2K+1<J< L s ): jth column of : BGU

Multiple Sources  Pitch tracking: The data vector at the m th frame: - first-order Markov process  Maximum A-posteriori Probability (MAP) pitch tracking via the Viterbi algorithm BGU

Multiple Sources - Voicing Decision  Unvoiced model Colored Gaussian noise model:  Voiced/unvoiced decision by the GLRT: BGU (Fisher-Tabrikian-Dubnov 2007)

Multiple Source Models  Exact ML for the strongest voiced signal, and “ locally ML ” for other voiced signals BGU Likelihood function

Experiments – Single Source BGU

Experiments - Two Sources BGU

Experiments – Voicing Decision BGU

Experiments - – Voicing Decision BGU

Conclusions  ML pitch estimation for single and multiple sources have been developed under the harmonic model for voiced frames.  The derived likelihood functions under the two models allow implementation of the Viterbi algorithm for MAP pitch tracking.  The GLRT for voicing decision is derived under the two models.  Future work: development of multiple hypothesis tracking methods for single microphone BSS. Adaptive estimation of the number of harmonics BGU

Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone Joseph Tabrikian Dept. of Electrical and Computer Engineering Ben-Gurion.

Similar presentations

Presentation on theme: "Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone Joseph Tabrikian Dept. of Electrical and Computer Engineering Ben-Gurion."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone Joseph Tabrikian Dept. of Electrical and Computer Engineering Ben-Gurion.

Similar presentations

Presentation on theme: "Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone Joseph Tabrikian Dept. of Electrical and Computer Engineering Ben-Gurion."— Presentation transcript:

Similar presentations

About project

Feedback