Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 1 IEEE Speech Coding Workshop Sept 17–20, 2000 Lake Lawn Resort Delavan, WI Jean-Marc Valin,

Slides:



Advertisements
Similar presentations
Wideband Speech Coding for CDMA2000® Systems
Advertisements

Lecture 4 Operational Amplifiers—Non-ideal behavior
Chapter 5 Analog Transmission.
Learning Introductory Signal Processing Using Multimedia 1 Outline Overview of Information and Communications Some signal processing concepts Tools available.
C.L.I.L. MODULE: ANALYSIS AND DESIGN OF FIRST ORDER RC FILTERS
E E 2415 Lecture 16 2 nd Order High Pass, Low Pass and Band Reject Filters.
Chapter 3: PCM Noise and Companding
Speech Coding Techniques
ON THE REPRESENTATION OF VOICE SOURCE APERIODICITIES IN THE MBE SPEECH CODING MODEL Preeti Rao and Pushkar Patwardhan Department of Electrical Engineering,
DAM-Île de France Département Analyse, Surveillance, Environnement 06/06/2014 DIF/DASE/B.ALCOVERRO 1 INFRASOUND SENSOR CALIBRATION DEVICE 13/11/2001 Infrasound.
SPEECH PROCESSING IN GSM SYSTEMS
UWB Channels – Capacity and Signaling Department 1, Cluster 4 Meeting Vienna, 1 April 2005 Erdal Arıkan Bilkent University.
VMR-WB – Operation of the 3GPP2 Wideband Speech Coding Standard M. Jelinek†, R. Salami‡ and S. Ahmadi * †University of Sherbrooke, Canada ‡VoiceAge Corporation,
Pulsed-RF S-Parameter Measurements Using a VNA. 2 Agenda Pulsed-RF Overview Pulsed-RF measurement techniques Wideband/synchronous Narrowband/asynchronous.
Doc.: IEEE /404r0 Submission July 2001 Georg Dickmann, BridgeCo AG.Slide 1 AV Timing Limits BridgeCo AG Georg Dickmann
Addition 1’s to 20.
SUPRESSION OF LORAN-C NAVIGATION SIGNAL IN DIGITAL CAVE RADIOS (AN EXPERIMENTAL APPROACH) Mr. Antonio Muñoz Group of Technologies in hostile Environments.
11,602,207,002,40 11,60 5,60 1,00 1,20 7,80 Practical Limitations of Wideband Terminals Workshop on Wideband Speech Quality in Terminals and Networks:
Part II (MPEG-4) Audio TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jörgen Ahlberg.
Pensinyalan (1) Sinyal Analog dan Sinyal Digital.
MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
Copyright 2001, Agrawal & BushnellVLSI Test: Lecture 181 Lecture 18 DSP-Based Analog Circuit Testing  Definitions  Unit Test Period (UTP)  Correlation.
Speech & Audio Coding TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jörgen Ahlberg.
Philippe Gournay, Bruno Bessette, Roch Lefebvre
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
Sampling and quantization Seminary 2. Problem 2.1 Typical errors in reconstruction: Leaking and aliasing We have a transmission system with f s =8 kHz.
A 12-WEEK PROJECT IN Speech Coding and Recognition by Fu-Tien Hsiao and Vedrana Andersen.
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
Overview of Adaptive Multi-Rate Narrow Band (AMR-NB) Speech Codec
Communications & Multimedia Signal Processing Meeting 7 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November,
Communications & Multimedia Signal Processing Formant Tracking LP with Harmonic Plus Noise Model of Excitation for Speech Enhancement Qin Yan Communication.
Communications & Multimedia Signal Processing Refinement in FTLP-HNM system for Speech Enhancement Qin Yan Communication & Multimedia Signal Processing.
Analysis & Synthesis The Vocoder and its related technology.
Angle Modulation Objectives
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Representing Acoustic Information
„Bandwidth Extension of Speech Signals“ 2nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd and 23rd June.
Acoustic Analysis of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.
1 PATTERN COMPARISON TECHNIQUES Test Pattern:Reference Pattern:
Speech Signal Representations I Seminar Speech Recognition 2002 F.R. Verhage.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Systems (filters) Non-periodic signal has continuous spectrum Sampling in one domain implies periodicity in another domain time frequency Periodic sampled.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
ITU-T G.729 EE8873 Rungsun Munkong March 22, 2004.
McGraw-Hill©The McGraw-Hill Companies, Inc., 2004 Physical Layer PART II.
1.INTRODUCTION The use of the adaptive codebook (ACB) in CELP-like speech coders allows the achievement of high quality speech, especially for voiced segments.
(Extremely) Simplified Model of Speech Production
Present document contains informations proprietary to France Telecom. Accepting this document means for its recipient he or she recognizes the confidential.
Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Sung-Won Yoon David Choi February 8 th, 2001.
A Very Low Bit Rate Protection Layer to Increase the Robustness of the AMR- WB+ Codec against Bit Errors Philippe Gournay Université de Sherbrooke Département.
Chapter 20 Speech Encoding by Parameters 20.1 Linear Predictive Coding (LPC) 20.2 Linear Predictive Vocoder 20.3 Code Excited Linear Prediction (CELP)
Institut für Nachrichtengeräte und Datenverarbeitung Prof. Dr.-Ing. P. Vary On the Use of Artificial Bandwidth Extension Techniques in Wideband Speech.
IEEE GlobalSIP, Orlando, FL, USA, December 14-16, 2015 Enhanced AMR-WB Bandwidth Extension in 3GPP EVS Codec Magdalena Kaniewska, Stéphane Ragot Orange.
2nd Workshop on Wideband Speech Quality - June nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd.
Codec 2 ● open source speech codec ● low bit rate (2400 bit/s and below) ● applications include digital speech for HF and VHF radio ● fills gap in open.
Motivation ● The (Ham) world needs an open source, patent free speech codec at bit rates of less than 5000 bit/s ● I know how to build one!
PATTERN COMPARISON TECHNIQUES
Scalable Speech Coding for IP Networks
Digital Communications Chapter 13. Source Coding
Vocoders.
Spread Spectrum Audio Steganography using Sub-band Phase Shifting
Cepstrum and MFCC Cepstrum MFCC Speech processing.
Audio Henning Schulzrinne Dept. of Computer Science
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
Speech and Audio Processing
1-channel 2-channel 4-channel 8-channel 16-channel Original
Chapter 3: PCM Noise and Companding
Vocoders.
Presentation transcript:

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 1 IEEE Speech Coding Workshop Sept 17–20, 2000 Lake Lawn Resort Delavan, WI Jean-Marc Valin, Roch Lefebvre University of Sherbrooke Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 2 Outline Problem statement Proposed solution System performance Discussion

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 3 Problem Statement Telephone Band: Hz AM Band: Hz How to make sound like with 500 bits/sec? We need to recover information from both low and high-frequency bands (G.729)

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 4 Proposed Solution 1) Do our best to recover the wideband information from narrowband speech 2) Use coding for the information that cannot be recovered –Recovered information : Low-frequency band High-frequency excitation –Coded information : High-frequency spectral envelope

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 5 System Overview Inverse IRM filter is optional –produces a flat response from Hz Inverse IRM Filter Low-frequency regeneration 22 High-frequency regeneration 8 kHz narrowband 16 kHz wideband Hz band Hz band Hz band Side information

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 6 Low-Frequency Regeneration (1/2) Assumptions : –Only pitch harmonics need to be recovered In general, no more than two pitch harmonics below 200 Hz –Absolute phase is not perceptually relevant Frequency of harmonics determined from pitch analysis Amplitudes determined from feed-forward multi- layer perceptron (output in log domain)

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 7 Low-Frequency Regeneration (2/2) Low-frequency harmonic synthesis Scale and sum 22 1 st harmonic 2 nd harmonic Pitch analysis MFCC calculation Multi-layer Perceptron Pitch delay Cepstral coefficients Scale factors Narrowband speech Low frequencies Pitch gain LP filter (1) (16)

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 8 High-Frequency Extension Excitation-filter model (16 ms frames) Problem is separated in two parts –Excitation extension (no side information) –Spectral envelope coding (side information) A(z)A(z) Narrowband speech LPC analysis Excitation extension B(z)B(z) 1 Spectral envelope Extension Side information (High-frequency spectral envelope) High- frequency band High pass

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 9 Excitation Extension Narrowband excitation Absolute value Whitening filter wideband excitation 22 High pass

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 10 Spectral Envelope Coding Spectral envelope calculated from the wideband LPC coefficients Quantization of the Hz range (40 points) –Log domain –8-bit Vector Quantization (500 bits/s side information, using 16 ms frames) Concatenation with envelope obtained from LPC analysis on narrowband speech

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 11 Objective results Low-frequency band –3 dB RMS error on harmonic amplitude High-frequency band –3.6 dB RMS error on envelope –No objective measure for excitation extension (perceptually very close to original)

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 12 Subjective Results femalemale Original wideband Recovered from original IRM-filtered speech Recovered from G.729 coded speech

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 13 Discussion Highlights –Expand IRM-filtered telephone-band speech to AM band –Very low side information rate (500 bits/s) Areas of improvement –Use high-band spectral estimation before coding –Use residual low-frequency information (below 300 Hz) –Noise robustness –Post-filtering