What’s new in Wideband Audio?

Slides:



Advertisements
Similar presentations
© 2008 AudioCodes Ltd. All rights reserved. AudioCodes Confidential Proprietary Alan D. Percy Director or Market Development HD Communications Reaching.
Advertisements

Wideband Speech Coding for CDMA2000® Systems
International Telecommunication Union The Fully Networked Car Geneva, 4-5 March 2009 Wrap-up Session Conclusions for Session 5 Session 5: Voice and audiovisual.
VMR-WB – Operation of the 3GPP2 Wideband Speech Coding Standard M. Jelinek†, R. Salami‡ and S. Ahmadi * †University of Sherbrooke, Canada ‡VoiceAge Corporation,
HD Voice Unprecedented mobile voice quality
UBIFone & The Technology Ahead 25 th June 2006 This presentation is the property of UbiFone. Distributors or any other individuals or entities are not.
N Team 15: Final Presentation Peter Nyberg Azadeh Bararsani Adie Tong N N multicodec minisip.
Copyright © by Elliot Eichen. All rights reserved. RTP – Real Time Protocol (and RTCP)
VistaPlus TM AP15 Audio Processor Enhanced Contact Center Productivity October 2008.
AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.
Codec requirements update Michael Knappe Co-chair, codec WG 1Michael Knappe IETF 77.
Speech codecs and DCCP with TFRC VoIP mode Magnus Westerlund
Live Music Mode: Case Study and Development Performing Arts Production Workshop Trieste, Italy 14 July 2009 Stefan Karapetkov Emerging Technologies Director.
© 2006 AudioCodes Ltd. All rights reserved. AudioCodes Confidential Proprietary Signal Processing Technologies in Voice over IP Eli Shoval Audiocodes.
1 © NOKIA GPP2 Wideband Codec Presentation Interoperable Wideband Speech Coder for CDMA2000 and WCDMA Systems W-VRM: Wideband Variable-Rate Multi-Mode.
2nd Workshop on Wideband Speech Quality - June nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd.
1 Video for Live Music Performance and Education Stefan Karapetkov Emerging Technologies Director.
Voice over the Internet (the basics) CS 7270 Networked Applications & Services Lecture-2.
1 TAC2000/ IP Telephony Lab Perceptual Evaluation of Speech Quality (PESQ) Speaker: Wen-Jen Lin Date: Dec
VoIP on the iPhone: Imagine the Possibilities Jan Linden, VP of Engineering.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
CHAPTER 15 & 16 Service Provider VoIP Applications and Services Advanced Enterprise Applications.
Existing PBX Existing Phone Handsets Numbering Plan to digit Internal extensions 9 for an outside line 3 digits.
Leveraging Existing Application Processors in Mobile Devices to Implement VoIP Client.
The Importance of Quality VoIP for Web Conferencing and Collaboration Jan Linden, Vice president of Engineering Global IP Sound, Inc.
Colombo, Sri Lanka, 7-10 April 2009 Multimedia Service Delivery on Next Generation Networks Pradeep De Almeida, Group Chief Technology Officer Dialog Telekom.
Presence Applications in the Real World Patrick Ferriter VP of Product Marketing.
1 Visit us at: & Your Way. Communicate.
Mobile HD Voice January 6 th, 2010 Mahesh Makhijani.
The Fully Networked Car Geneva, 4-5 March Wideband Speech Communications: the Good, the Bad, and the Ugly Scott Pennock Sr. Hands-Free Standards.
Cisco Unified Communications Manager (CUCM)
Improving Voice Quality in International Mobile-to-Mobile Calls Aram Falsafi, Seattle, WA PIMRC September 2008.
Voice Over Packet Networks Getting the most from your voice codec Philippe Gournay VoiceAge Corp. 750 Lucerne Road, Suite 250 Montreal (Quebec) H3R 2H6.
Mobility. Creating Mobile Voice & Video Applications.
Sergei Hyppenen Supervisor: Professor Sven-Gustav Häggman
Leveraging Wideband Codecs for VoIP Development Laurent Amar President, VoiceAge Corporation.
January 23-26, 2007 Ft. Lauderdale, Florida Host media processing – revisited Faye McClenahan – Aculab.
Unified. Simplified. Unified Communications Launch 2007.
A path towards common quality assessment of narrowband and wideband voice Trond Ulseth, Telenor R&D Workshop on Wideband Speech.
Benefits of VoIP Peering in a Challenging Economy (SP-10) Tuesday - 02/03/09 4:00-4:45pm Mark Benisz, VP Americas, XConnect Global Networks.
Highlights of the Revised VMR-WB RTP Payload and Storage File Formats Sassan Ahmadi, Ph.D. Nokia Inc. USA May 1, 2004 For more information please refer.
Audio Henning Schulzrinne Dept. of Computer Science Columbia University Fall 2003.
Colombia, September 2013 The importance of models and procedures for planning, monitoring and control in the provision of communications services.
A Speech Processing Solution in a 3G Media Server Miikka Rautapää Nokia Networks Supervisor: Professor Raimo Kantola
Definition and Coordination of Signal Processing Functions for telephone connections involving automotive speakerphones Scott Pennock Senior Hands-Free.
© 2006 Cisco Systems, Inc. All rights reserved. Optimizing Converged Cisco Networks (ONT) Module 2: Cisco VoIP Implementations.
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
Pengantar Multimedia. Sound  Physical phenomenon – vibration.  Source = electrical – acoustic  Vibration – oscillation – wave  Wave periodical – song,
What is H.323? H.323 is standard providing a foundation for audio, video, and data communications across IP-based networks, including the Internet.
Université du Québec École de technologie supérieure Department of software and IT engineering Real-time multi-user transcoding for push to talk over cellular.
Slide title In CAPITALS 50 pt Slide subtitle 32 pt Static Call Admission Control and Dimensioning of Media Gateways in IP based Mobile Core Networks Mika.
Aug 25, 2005 page1 Aug 25, 2005 Integration of Advanced Video/Speech Codecs into AccessGrid National Center for High Performance Computing Speaker: Barz.
Adoption of IP in the Next Generation Contact Center Rupesh ChokshiGautham NatarajanDirector, AT&T.
January 23-26, 2007 Ft. Lauderdale, Florida Challenges in Deploying VoWLAN.
Minjie Xie, Dave Lindbergh, and Peter Chu
Voice Coding in 3G Networks
A Very Low Bit Rate Protection Layer to Increase the Robustness of the AMR- WB+ Codec against Bit Errors Philippe Gournay Université de Sherbrooke Département.
HD aka Wideband Audio G.722 codec/Wideband Audio:  Expands spectrum for voice comms to approx Hz  64kbps bandwidth (same as G.711)  G.722.
A UDIO B ANDWIDTH D ETECTION IN THE EVS C ODEC University of Sherbrooke, Canada VoiceAge Corporation, Montréal, Canada Fraunhofer IIS, Erlagen, Germany.
Improvements in speech services of GERAN Master’s Thesis presentation Author: Tommi Jokela Supervisor: Prof. Sven-Gustav Häggman Instructor: M.Sc. Benoist.
Institut für Nachrichtengeräte und Datenverarbeitung Prof. Dr.-Ing. P. Vary On the Use of Artificial Bandwidth Extension Techniques in Wideband Speech.
August 3-4, 2004 San Jose, CA Successfully Offering VoIP- Enabled Applications Services Jan Linden Vice President of Engineering.
HD Voice and Asterisk: Hearing the sirens song October 14, 2009.
2nd Workshop on Wideband Speech Quality - June nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd.
Opus SW codec RTLAB Ki Eun Seong. What is the Opus Codec? Real-time interactive audio codec Targets interactive audio over the internet Aims to be royalty-free,
Scalable Speech Coding for IP Networks
Audio Henning Schulzrinne Dept. of Computer Science
– Workshop on Wideband Speech Quality in Terminals and Networks
The Changing Role of DSPs in Media Gateway Design
Scott Pennock Senior Hands-Free Standards Specialist
Presentation transcript:

What’s new in Wideband Audio?

Wideband Audio VoIP is indeed a disruptive technology, but has it changed the life of the average consumer? Cost? Quality? Features? Wideband Audio codecs and improved handling of music could soon change this dynamic Let’s discuss Technology behind the codecs Real-world implementations

Telecom Audio Spectrum Human voice: 80 Hz to 14,00Hz Narrowband: 8 kHz sampling (300-3400 Hz bandwidth) Used in PSTN, mostly intelligible Wideband: 16 kHz sampling (50-7000 Hz bandwidth) Used in VoIP

Wideband Audio? Captures significantly more speech information Significant improvement in speech quality over traditional PSTN Improved naturalness & presence below 200Hz Increased intelligibility above 3,400Hz Improves user experience and satisfaction New applications – voice recognition Customer retention Fewer misunderstandings

Wideband Enablers Telecom was about minimizing transport cost Now about differentiation and enhancing the user experience Access bandwidth was limited Broadband access now a reality: high bandwidth delivered at low cost 1 - 10 Mbits/s Cost of WB is similar to NB @ 64kbps Endpoints and Network were not wideband capable Now: VoIP, Wideband DECT, Skype, Microsoft OCS Wireless deployments: wideband, music codecs Private / corporate networks, Tandem Free Operation (TFO), Wideband extension, Wideband SLICS

The Technology

Lossy Codec Classes Speech communication codecs (G.72X, AMR et.al) Designed for “real-time” speech, music handled poorly Low sampling rate (8-16KHz), low fidelity Low-medium delay (10-30 ms) Mostly time-domain (CELP is the most popular) Music codecs (MP3, AAC, Vorbis) Can encode any signal (not optimal for speech) – designed for entertainment Up to 48 kHz sampling rate (full bandwidth), high fidelity (“CD-quality” High delay (>100 ms) Mostly frequency domain (MDCT-based)

Speech Codec Spectrum Applications Deployed Bandwidth Example Codec More than 15Khz Full Band (20Khz) AAC-LD Presence (Video Conf) 14Khz Super Wideband G.722.1C (Siren14), SILK VoIP, Audio Conf 7Khz Wideband G.722.2 (AMR-WB), SVOPC BB VoIP & Audio Chat 3.5Khz Narrowband G.729, G.723.1 G.711, iSAC PSTN &VoIP

ITU and 3GPP codec roadmap Super -wideband EV-VBR 2008 G.722.2 AMR-WB 2002 G.729.1 2007 G.722 1988 G.722.1 1999 wideband AMR-NB 1999 GSM-FR 1987 G.728 1992 GSM-HR 1994 GSM-EFR 1995 G.726 1984 narrowband G.729 1995 Years ITU 3GPP 3GPP & ITU Legend:

Embedded Speech Codecs ITU-Super WB Provides extended bandwidth and stereo capabilities 16 KHz audible bandwidth Stereo extension Generic extension applicable to wideband codecs e.g.. ITU G.729.1 & EV-VBR 3GPP-EPS (evolved packet system) (aka LTE) ITU EV-VBR is well positioned to meet future EPS requirements Interoperable with 3GPP AMR-WB. Open Codecs Speex (4 to 42Kbps) Royalty free but limited to non patented techniques (ACELP for example)

Music Codecs MPEG-1 Layer III (aka MP3) AAC Vorbis Built on top of Layers I and II First-generation, very inefficient AAC Second generation, much better than MP3 Flexible, kitchen-sink type of approach Tons of tools and partially incompatible profiles Variants: AAC-LC, AAC-LD, AAC-HE, ... Vorbis Second-generation, similar quality to AAC Open-source, royalty-free (Xiph.Org Foundation)

Future of codecs Improving quality Reducing delay Super-wideband, coding of music The gap between speech and music codecs is closing AMR-WB+, G.722.1x moving to music, higher quality AAC-LD moving to lower delay Reducing delay Increasing robustness Shift from bit-error robustness to packet loss robustness

Improved Music Handling Background music is poorly handled Most speech codecs (AMR-NB, G.729, AMR-WB, Speex etc) are derivatives based on CELP CELP makes assumptions that are only valid for speech (and single-note music) CELP does not perform well on music – especially at low bit-rate Music codecs are not suitable for speech

Improved Music Handling How do you improve the handling of background music? Three strategies: Increase the bit-rate Dual-mode codecs (e.g. AMR-WB+) Use non-CELP codecs (AAC-LD, G.722.1x, G.711.1, CELT, …)

Wideband Extension (WEx) as an interim solution How do you provide a wideband experience when linking a wideband-capable client to the PSTN? Current solution: up-sample the narrowband speech to 16 kHz Better solution: Create wideband “artificially” from the narrowband speech Support becoming available WEx capable handsets (Philips for example) WEx enabled Media Gateway (Vocallo for example)

a.k.a The Role of the Media Gateway The Implementations a.k.a The Role of the Media Gateway

Wideband VoIP DECT - France Telecom Mobile Platform IAD Access Platform IP Network TDM Network IMS GW DLC Access Platform IAD

Wide Band Extension (WBE) Mobile Platform Wide Band Extension Expand the signal to create impression of wideband. AEC ANR NLE IP Network WBE IMS GW LEC IP/DLC TDM Network Access Platform DLC IAD

Improving the User Experience AEC ANR NLE Wideband Lite Acoustic Echo Canceller acts as a complement to badly designed handset Wideband Adaptive Noise Reduction reduces noise of mobile handset environment. Wideband Natural Level Enhancement, uses info from intensity of the voice and SNR to compensate for loud environment of the talker Mobile Platform IP Network IMS GW IP/DLC TDM Network DLC IAD Access Platform

The role of the MGW When selecting MGW solutions: Don’t just look for checklist of codecs! Look for solutions that provide wideband extension, wideband ECAN, ANR, etc. Select solutions that incur low latency when transcoding IP-to-IP communications

Summary Clear benefit to the users Skype changed expectation levels Technology enablers already in place VoIP deployment CODECS WB-enabled end-points and MGWs available