Signal Processing Technologies in Voice over IP
Eli Shoval, AudioCodes
© 2006 AudioCodes Ltd. All rights reserved. AudioCodes Confidential Proprietary

Scope
The purpose of this presentation is to provide an overview of the speech processing technologies used in AudioCodes VoIP products.

Outline
Signal processing technologies in VoIP
–Line Echo Cancellation
–Acoustic Echo Cancellation
–Speech Compression
–Wideband Speech Compression
–Background Noise Reduction
–Voice Quality Monitoring

Main challenges in VoIP system design
Bandwidth efficiency
–Need a vocoder with the lowest possible bit rate whose voice quality is suitable for the application
–Different vocoders suit different networks (LAN, WAN, wireless)
Inherent IP network problems
–Jitter
–Delay
–Packet loss
Voice quality issues and enhancements
–Echo
–Background noise
–Gain adjustment
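The jitter and packet loss items above can be quantified from packet timestamps and sequence numbers. The following is a minimal sketch, assuming the running jitter estimator defined for RTP in RFC 3550; the function names and the millisecond units are illustrative and do not come from the presentation.

```python
def interarrival_jitter(send_times, recv_times):
    """Running jitter estimate in the style of RFC 3550, section 6.4.1.

    send_times and recv_times are per-packet timestamps in the same unit
    (milliseconds here). Returns the estimate after the last packet.
    """
    jitter = 0.0
    for i in range(1, len(send_times)):
        # Change in transit time between two consecutive packets.
        d = (recv_times[i] - recv_times[i - 1]) - (send_times[i] - send_times[i - 1])
        # First-order smoothing with gain 1/16, as in the RFC.
        jitter += (abs(d) - jitter) / 16.0
    return jitter


def loss_rate(received_seq_numbers, first_seq, last_seq):
    """Fraction of packets in [first_seq, last_seq] that never arrived."""
    expected = last_seq - first_seq + 1
    received = len(set(received_seq_numbers))
    return (expected - received) / expected
```

With packets sent every 20 ms and a constant network delay, the jitter estimate stays at zero; any variation in arrival spacing raises it.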

Main challenges in VoIP system design (cont'd)
Handling of non-speech signals
–Fax
–Data modems
–Caller ID
–DTMF
Interoperability – VoIP equipment must be able to communicate smoothly with equipment from other vendors
Implementation efficiency

Basic DSP processing in VoIP

Line/Electrical Echo Phenomena
Line echo exists in both networks due to leakage in the 2/4-wire hybrid transformers
In the PSTN: the echo exists but is not perceptible (the delay is short, so it is masked)
In the IP network: the echo is perceptible due to the added IP delay

TDM Networks vs. IP Networks
Regular TDM network
IP network – the IP delay is added to the PSTN-side delay

Basic structure of Echo Canceler
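The slide's diagram is not preserved in this transcript. As a rough illustration of the usual structure (an adaptive FIR filter estimates the echo of the far-end signal and subtracts it from the near-end signal), here is a minimal sketch that assumes the textbook NLMS update. It is not the AudioCodes algorithm, and the function names, tap count, step size, and simulated echo path are all hypothetical.

```python
import random


def nlms_echo_canceller(far_end, near_end, taps=32, mu=0.5, eps=1e-6):
    """Cancel the echo of far_end contained in near_end.

    Returns (residual, weights). The residual is what would be sent back
    to the far-end talker after the echo estimate is subtracted.
    """
    w = [0.0] * taps        # adaptive FIR estimate of the echo path
    x_hist = [0.0] * taps   # most recent far-end samples, newest first
    residual = []
    for x, d in zip(far_end, near_end):
        x_hist = [x] + x_hist[:-1]
        y = sum(wi * xi for wi, xi in zip(w, x_hist))   # echo estimate
        e = d - y                                        # residual echo
        norm = sum(xi * xi for xi in x_hist) + eps       # input energy
        step = mu * e / norm                             # normalized step
        w = [wi + step * xi for wi, xi in zip(w, x_hist)]
        residual.append(e)
    return residual, w


def simulate(n=4000, seed=0):
    """Hypothetical setup: white-noise far end, 3-tap echo path, no near-end talker."""
    rng = random.Random(seed)
    far = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    path = {4: 0.5, 5: -0.3, 9: 0.1}   # delay in samples -> gain
    near = [sum(g * far[i - k] for k, g in path.items() if i - k >= 0)
            for i in range(n)]
    return far, near, path
```

Running `nlms_echo_canceller(*simulate()[:2])` should drive the filter weights toward the simulated path gains, with the residual shrinking as the filter converges. A real canceller also needs double-talk protection and a non-linear processor, which this sketch omits.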

Example of sparse FIR with 3 active windows to handle 3 hybrids

G.168 Test 2B - convergence

Challenges in Echo Cancellation
Double talk can cause the adaptive filter to diverge – adaptation in the AC49x/AC50x EC is robust to double talk
Nonlinearity in the echo path cannot be modeled by the linear FIR filter – the AC49x/AC50x EC has a proprietary NLP (non-linear processor) to reduce the residual echo
NLP attenuation can modulate the background noise level – the AC49x/AC50x EC supports comfort noise injection to overcome this issue
The echo path can change during the call – the AC49x/AC50x EC adapts quickly once such a change is detected
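One common way to protect an adaptive filter from double talk is to detect it and freeze adaptation. The sketch below assumes the classic Geigel detector from the literature; the presentation does not say how the AC49x/AC50x EC achieves its robustness, so this is only an illustration, and the function name, window length, and threshold are hypothetical.

```python
def geigel_double_talk(far_end, near_end, window=64, threshold=0.5):
    """Flag samples where near-end speech is probably present.

    Double talk is declared when |near| exceeds threshold times the largest
    recent far-end magnitude. threshold=0.5 assumes the hybrid attenuates
    the echo by at least 6 dB.
    """
    flags = []
    for n, d in enumerate(near_end):
        start = max(0, n - window + 1)
        recent_peak = max(abs(x) for x in far_end[start:n + 1])
        flags.append(abs(d) > threshold * recent_peak)
    return flags
```

An echo canceller would typically skip the weight update on flagged samples so that near-end speech does not pull the filter away from the true echo path.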

Acoustic Echo Cancellation
A similar problem to line echo cancellation, with some additional complications:
–Longer echo paths that are less sparse than line echo paths
–Worse ERL, which can even be negative; howling must be handled
–Worse SNR due to the hands-free interface
–Rapid echo path changes
–Higher nonlinearity in the echo path due to loudspeaker characteristics
–Must operate at both 8 kHz and 16 kHz sampling rates
The AC494/AC495 has an acoustic EC that can handle hands-free communication in IP phones.
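To make the "negative ERL" point concrete: echo return loss is the ratio of far-end signal power to returned echo power, in dB. The sketch below assumes simple average-power estimates over a block of samples; the function name is illustrative.

```python
import math


def erl_db(far_end, echo):
    """Echo return loss in dB: far-end power over returned echo power."""
    p_far = sum(x * x for x in far_end) / len(far_end)
    p_echo = sum(e * e for e in echo) / len(echo)
    if p_far == 0.0:
        raise ValueError("far-end block is silent; ERL is undefined")
    if p_echo == 0.0:
        return float("inf")   # no echo returned at all
    return 10.0 * math.log10(p_far / p_echo)
```

An echo at half the far-end amplitude gives about +6 dB, while an echo at twice the amplitude gives about -6 dB. A negative value means the microphone picks up the loudspeaker signal louder than it was played, which is the condition under which howling can develop.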

Speech Compression
Interoperability is a key issue in VoIP communication systems, so vocoders are usually standardized
AudioCodes products support a wide range of vocoders
Transcoding between different vocoders is supported

Narrowband Vocoders
G.711 – the most basic vocoder, 64 kbps
G.726 – ADPCM, 32 kbps
G.729A – the most popular LBR (low bit rate) vocoder, 8 kbps
G – developed by AudioCodes, 6.3 kbps, same quality at a lower bit rate than G.729
iLBC – very robust to packet loss, royalty-free, 13 kbps
AMR – used in UMTS, kbps
EVRC – used in CDMA, 8.55 kbps
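The codec bit rates listed above exclude packet headers, which matter a great deal at low bit rates. The following sketch estimates the one-way bit rate at the IP layer, assuming IPv4, UDP, and RTP headers with no header compression and a 20 ms packetization interval; the function name and the default interval are assumptions, not values from the presentation.

```python
IP_UDP_RTP_HEADER_BYTES = 20 + 8 + 12   # IPv4 + UDP + RTP, no header compression


def wire_bitrate_kbps(codec_kbps, packet_ms=20):
    """Bit rate at the IP layer for one direction of a call, in kbps."""
    payload_bytes = codec_kbps * packet_ms / 8    # kbit/s times ms gives bits
    packets_per_second = 1000 / packet_ms
    packet_bytes = payload_bytes + IP_UDP_RTP_HEADER_BYTES
    return packet_bytes * 8 * packets_per_second / 1000
```

Under these assumptions G.711 at 64 kbps occupies 80 kbps on the wire, while G.729A at 8 kbps occupies 24 kbps, so the headers account for two thirds of the low-bit-rate stream.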

Next generation speech compression - Wideband Vocoders
Bandwidth: 50 Hz to 7 kHz, 16-bit, 16 kHz sampling rate (vs. 300 Hz to 3.4 kHz, PCM, 8 kHz rate in NB speech)
Substantially higher MOS quality
Superior clarity
Better intelligibility (especially in noise)
Richer sound
Similar bit rates (and cost) as NB
Better speaker recognition (important in conferencing)
Better quality with music signals
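The sampling figures above imply how much compression a wideband vocoder must deliver to reach "similar bit rates" to narrowband. A small worked sketch follows; the function names are illustrative.

```python
def pcm_bitrate_kbps(sample_rate_hz, bits_per_sample):
    """Uncompressed bit rate of a mono PCM stream, in kbps."""
    return sample_rate_hz * bits_per_sample / 1000


def max_audio_bandwidth_hz(sample_rate_hz):
    """Nyquist limit: the highest frequency the sampling rate can represent."""
    return sample_rate_hz / 2
```

Wideband input at 16 kHz and 16 bits is 256 kbps before coding, compared with 64 kbps for 8 kHz, 8-bit companded narrowband PCM. A 16 kHz sampling rate can represent frequencies up to 8 kHz, which comfortably covers the 7 kHz upper edge of the wideband speech band.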

Comparison of NB and WB codecs
Source: ITU G.729.1 performance tests

Comparison of NB and WB codecs
Listeners perceive wideband speech as much higher in quality than narrowband speech. The difference is large: more than 1 point on the wideband MOS scale (about 4.5 compared to 3.5)
The MOS difference is even more dramatic when comparing a current narrowband codec at 8 kbps with a modern wideband codec at 32 kbps

Wideband Vocoders in AC49x/AC50xx
G.722 Sub Band Coding, 48, 56, 64 kbps – used in some high-end conferencing systems
G AMR-WB – kbps, used in UMTS networks
G – 8-32 kbps, used for VoIP
96 kbps, used for VoIP (*)
RTA – Microsoft proprietary vocoder (*)
Speex – royalty-free vocoder for internet applications
SILK – Skype proprietary vocoder (*)
(*) – roadmap

Background Noise Reduction
A new feature planned for AC release
Used to improve hands-free communication in IP phones
Optimal filtering is applied at each frequency to suppress the background noise with minimal effect on speech
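The presentation does not say which optimal filter is used. As one plausible reading, the sketch below assumes a Wiener-style gain computed independently for each frequency band from estimates of the noisy-signal power and the noise power. The function name, the power-subtraction SNR estimate, and the gain floor are assumptions for illustration only.

```python
def wiener_gains(noisy_power, noise_power, gain_floor=0.1):
    """Per-band suppression gains from noisy-signal and noise power estimates.

    The SNR in each band is approximated by power subtraction, clamped at zero.
    gain_floor limits the attenuation so that residual noise stays natural.
    """
    gains = []
    for p_noisy, p_noise in zip(noisy_power, noise_power):
        if p_noise <= 0.0:
            gain = 1.0                        # no noise estimated: pass through
        else:
            snr = max(p_noisy - p_noise, 0.0) / p_noise
            gain = snr / (1.0 + snr)          # Wiener gain for this band
        gains.append(max(gain, gain_floor))
    return gains
```

Bands dominated by speech get a gain close to 1, while bands that contain only noise are attenuated down to the floor. Each gain would then multiply the corresponding frequency bin of the noisy frame before resynthesis.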

Noise Reduction Block Diagram

Noise Reduction Demo
Male with car noise, SNR = 12 dB
–Noisy
–NR
Male with car noise, SNR = 6 dB
–Noisy
–NR
Female with office noise, SNR = 18 dB
–Noisy
–NR

Voice Quality Monitoring
Telchemy VQmon algorithm – estimates MOS from packet arrival statistics
RTCP-XR – a standard packet format that carries the quality parameters
MOS-CQ – conversational MOS, which also takes into account the influence of echo and delay on quality
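VQmon itself is proprietary and is not reproduced here. For orientation only, the sketch below shows the standard mapping from an E-model rating factor R to an estimated MOS, as given in ITU-T G.107. The `simplified_r` helper is a hypothetical simplification that subtracts delay and equipment impairments from the G.107 default rating of 93.2.

```python
def r_to_mos(r):
    """Map an E-model rating factor R to an estimated MOS (ITU-T G.107)."""
    if r <= 0:
        return 1.0
    if r >= 100:
        return 4.5
    return 1.0 + 0.035 * r + r * (r - 60.0) * (100.0 - r) * 7e-6


def simplified_r(delay_impairment=0.0, equipment_impairment=0.0):
    """Hypothetical simplification: default R minus two impairment terms."""
    return 93.2 - delay_impairment - equipment_impairment
```

With no impairments the estimate is about 4.41, and an R of 50 maps to roughly 2.58. A monitoring algorithm would derive the impairment terms from measured quantities such as packet loss, codec type, delay, and echo.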

Voice Quality Monitoring

Telchemy VQmon accuracy vs. MOS

Summary
We described some of the challenges in implementing speech processing algorithms in practical VoIP products
We described the solutions as implemented in the AudioCodes AC490x/AC50x VoIP processor products

Thank you for your time