Sergei Hyppenen Supervisor: Professor Sven-Gustav Häggman

Slides:



Advertisements
Similar presentations
Wideband Speech Coding for CDMA2000® Systems
Advertisements

Chapter Thirteen: Multiplexing and Multiple- Access Techniques.
Speech Coding Techniques
VMR-WB – Operation of the 3GPP2 Wideband Speech Coding Standard M. Jelinek†, R. Salami‡ and S. Ahmadi * †University of Sherbrooke, Canada ‡VoiceAge Corporation,
A TDM realization TDM systems are critical in timing
High Performance 32 Channel ADPCM Codec File Number Here ® LogiCORE Products.
Multiple Access Techniques for wireless communication
4.2 Digital Transmission Pulse Modulation (Part 2.1)
IP Multimedia Subsystem (IMS) 江培文. Agenda Background IMS Definition IMS Architecture IMS Entities IMS-CS Interworking.
Speech codecs and DCCP with TFRC VoIP mode Magnus Westerlund
1 TAC2000/ IP Telephony Lab Perceptual Evaluation of Speech Quality (PESQ) Speaker: Wen-Jen Lin Date: Dec
20101 The Physical Layer Chapter Bandwidth-Limited Signals.
Overview.  UMTS (Universal Mobile Telecommunication System) the third generation mobile communication systems.
Data Communications Circuit Switching. Switching Networks Long distance transmission is typically done over a network of switched nodes Nodes not concerned.
Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.
A Software Defined Radio Implementation for Voice Transmission over Wireless Ad-hoc Networks Jason Tran SURF-IT 2009 Fellow Mentors: Dr. Homayoun Yousefi’zadeh.
Voice Quality Evaluation for Wireless Transmission with ROHC S. Rein and F.H.P. Fitzek and M. Reisslein Voice Quality Evaluation for Wireless Transmission.
© 2006 Cisco Systems, Inc. All rights reserved. 2.2: Digitizing and Packetizing Voice.
Slide title In CAPITALS 50 pt Slide subtitle 32 pt Frame Header Based Speech Quality Analysis Method in a Circuit-Switched Media Gateway Master’s Thesis.
Chapter Seven: Digital Communication
DIGITAL VOICE NETWORKS ECE 421E Tuesday, October 02, 2012.
Computer Networks Digitization. Spring 2006Computer Networks2 Transfer of an Analog Signal  When analog data (voice, pictures, video) are transformed.
© 2006 Cisco Systems, Inc. All rights reserved. QOS Lecture 2 - Introducing VoIP Networks.
Introduction.
Improving Voice Quality in International Mobile-to-Mobile Calls Aram Falsafi, Seattle, WA PIMRC September 2008.
Multiplexing.
1 9. Circuit Switching. Prof. Sang-Jo Yoo 2 Contents  Switching Networks  Circuit-Switching Networks  Switching Concepts  Routing in Circuit-Switching.
Data Comm. & Networks Instructor: Ibrahim Tariq Lecture 3.
William Stallings Data and Computer Communications 7 th Edition Chapter 8 Multiplexing.
CHAPTER #6  Introducti on to ATM. Contents  Introduction  ATM Cells  ATM Architecture  ATM Connections  Addressing and Signaling  IP over ATM.
A Speech Processing Solution in a 3G Media Server Miikka Rautapää Nokia Networks Supervisor: Professor Raimo Kantola
02 – Performance Basics 1CS Computer Networks.
Digital Transmission Outlines:- Multiplexing FDM TDM WDM
© 2006 Cisco Systems, Inc. All rights reserved. Optimizing Converged Cisco Networks (ONT) Module 2: Cisco VoIP Implementations.
D1 - 27/10/2015 The present document contains information that remains the property of France Telecom. The recipient’s acceptance of this document implies.
1 © 2006 Nokia pullola_ ppt / Extending Base Station Active Radio Link Set for Improved Uplink Scheduling Esa-Pekka Pullola Supervisor:
Rev A Mikko Suominen Enhancing System Capacity and Robustness by Optimizing Software Architecture in a Real-time Multiprocessor Environment.
Slide title In CAPITALS 50 pt Slide subtitle 32 pt Dynamic and Persistent Scheduling for Voice over IP Traffic in the Long-Term Evolution Uplink Master’s.
William Stallings Data and Computer Communications
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2009.
LOG Objectives  Describe some of the VoIP implementation challenges such as Delay/Latency, Jitter, Echo, and Packet Loss  Describe the voice encoding.
ITU-T G.729 EE8873 Rungsun Munkong March 22, 2004.
Data Comm. & Networks Instructor: Ibrahim Tariq Lecture 3.
3.2 Software Fundamentals. A protocol is a formal description of digital message formats and the rules for exchanging those messages in or between computing.
Chapter 6 Multiplexers and Switching Concepts
GPRS 1. Before GPRS: HSCSD  HSCSD or High Speed Circuit Switched Data was the first upgrade to be standardized by ETSI to bring high speed data to GSM.
Voice Coding in 3G Networks
A Very Low Bit Rate Protection Layer to Increase the Robustness of the AMR- WB+ Codec against Bit Errors Philippe Gournay Université de Sherbrooke Département.
Voice Sampling. Sampling Rate Nyquist’s theorem states that a signal can be reconstructed if it is sampled at twice the maximum frequency of the signal.
CDMA X EV-DO by S.Vidhya. CDMA 2000 CDMA2000 (also known as C2K or IMT Multi ‑ Carrier (IMT ‑ MC)) is a family of 3G[1] mobile technology standards,
Improvements in speech services of GERAN Master’s Thesis presentation Author: Tommi Jokela Supervisor: Prof. Sven-Gustav Häggman Instructor: M.Sc. Benoist.
Institut für Nachrichtengeräte und Datenverarbeitung Prof. Dr.-Ing. P. Vary On the Use of Artificial Bandwidth Extension Techniques in Wideband Speech.
Objective This presentation covers the Generation of Telecom Network Evolution. Basically the presentation aims on the evolution from 1G to 4G and some.
Time Division Multiplexing
2nd Workshop on Wideband Speech Quality - June nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd.
1 Wireless Networks Lecture 21 WCDMA (Part I) Dr. Ghalib A. Shah.
Overview Of 3G Mobile. 3G AWARENESS 3G is the next generation mobile communications systems. 3G is basically an ITU defined set of standards, which along.
Outline  Introduction (Resource Management and Utilization).  Compression and Multiplexing (Other related definitions).  The Most Important Access Utilization.
INTRODUCTION. Electrical and Computer Engineering  Concerned with solving problems of two types:  Production or transmission of power.  Transmission.
Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.
Multiplexing.
Point-to-Point Network Switching
5G MOBILE TECHNOLOGY By J.YOGESH 08M31A0425.
Cellular Wireless Networks
Name:Shivalila A H,Shima
Integrated Services Digital Network Prepared By: Muhammad Basheir Omer Tyseer Khalil ISDN.
EEL 4598/5718: Computer Communications
Which of the following is a digital communications mode?
Cellular Telephone Networks
Presentation transcript:

Seminar Presentation: Adaptive Multi-Rate Wideband Speech Codec deployment in 3G Core Network Sergei Hyppenen Supervisor: Professor Sven-Gustav Häggman HELSINKI UNIVERSITY OF TECHNOLOGY 11.04.2006 1 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Contents of the presentation Abbreviations Introduction AMR-WB speech codec Network architectures: GSM and 3G (Release 4) Speech transmission TrFO and TFO Out-of-Band Transcoder Control in TrFO TFO frames Lawful interception Signal interception simulation Test results: Noise floor values Test results: MOS quality values Conclusions 2 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Abbreviations 3G: 3rd Generation HR: Half Rate speech codec ACELP: Algebraic Code-Excited Linear Prediction AMR-WB: Adaptive Multi-Rate Wideband speech codec ATM: Asynchronous Transfer Mode BSS: Base Station Subsystem CN: Core network dB: decibel dBov: dB relative to the overload point of the digital system DTX: Discontinuous Transmission EDGE: Enhanced Data rates for Global Evolution G.711: PCM-based coding method with 8 kHz sampling frequency and 8-bit A- or µ-law weighting GSM: Global System for Mobile Communications HR: Half Rate speech codec IP: Internet Protocol LSB: Least Significant Bit MOS: Mean Opinion Score rated 1-5 NSS: Network Sub-System OoBTC: Out-of-Band Transcoder Control TC: Transcoder TDM: Time Division Multiplexing TFO: Tandem Free Operation TrFO: Transcoder Free Operation UMTS: Universal Mobile Telecommunications System VAD: Voice Activity Detection WB-PESQ: a tool for quality evaluation [ITU-T: P.862] 3 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Introduction Speech contains frequencies up to the 10 kHz Current fixed and mobile telecommunication systems operate with a narrow audio bandwidth: 300-3400 Hz (ITU-T G.711) 500-3000 Hz is sufficient for understanding The sampling frequency used in digital core networks is 8000 Hz → in theory enables transmitting signals up to 4000 Hz Codecs utilized in mobile systems lower the quality of narrowband speech even more than the G.711 AMR-WB speech codec improves the quality and especially the naturalness of speech In EDGE and UMTS all coding modes of the AMR-WB will be used, in GSM only coding modes till 12.65 kb/s 4 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

AMR-WB speech codec Process 50-7000 Hz Sampling: 16 kHz Precision: 14-bit Coding model: ACELP VAD and DTX Bad frame handler Bit rates: 6.60, 8.85, 12.65, 14.25, 15.85, 18.25, 19.85, 23.05, 23.85 kb/s Coding mode 12.65 kb/s produces better quality than G.711 (64 kb/s) 5 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Network architectures: GSM and 3G (Release 4) GSM: Transcoder (TC) is a part of Base Station Subsystem (BSS) In core Network Sub-Systems (NSS) speech signals are transferred in G.711 form 3G, Release 4: Core Network (CN) is divided to Packet Switched (PS) and Circuit Switched (CS) domains CS domain is separated to Control Plane (Signaling) and User Plane (Data) TC moved to core network, but still, the most common scheme to transfer speech in CN is G.711 6 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Speech transmission GSM In current telecommunication systems transcoding is performed at least twice In core networks speech signals are transferred in narrowband G.711 form and one one-way connection requires a 64 kb/s channel GSM Wideband speech cannot be transferred using the same technique Requires 16 kHz * 14 bit connection speeds, which are UNAXEPTABLY HIGH! → wideband speech should be transferred only in CODED FORM! 7 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

TrFO and TFO Transcoder Free Operation (TrFO) transfers coded speech frames in ATM- and IP-based networks as such Transcoder-free means that the same codec is used on the both sides of a connection → Out-of-Band Transcoder Control (OoBTC) is needed OoBTC requires the late assignment of a radio traffic channel with forward bearer establishment in CN (see the next slide for details) In Tandem Free Operation (TFO) coded frames are merged into least significant bits (LSB) of PCM-based signals The TFO is utilized in TDM networks TFO protocol negotiates with the distant partner a common codec to be used by sending messages in-band Message bits replace every 16th LSB When both mobile terminals switch to a compatible codec, coded speech frames can be merged into PCM-based stream that was decoded from those coded frames 8 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Out-of-Band Transcoder Control in TrFO In TrFO negotiation of the codec to be used during the call has to be performed before the bearer establishment procedures 9 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

TFO frames 1 When TFO is operational 1, 2 or 4 LSBs of every 8-bit PCM sample are replaced by TFO frames TFO frames requiring replacement of 4 LSBs consist of the main frame part (1st and 2nd LSBs) and the extension frame part (3rd and 4th LSBs). During the transmission through the core network TFO frames should not be modified by noise suppression, level control or other enhancement algorithms 10 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

TFO frames 2 TFO frames are different for each codec and each coding mode, if a multi-rate codec is in question TFO frames contain synchronization bits, control and error correction bits, time alignment bits, spare bits and actual data bits Synchronization and control bits are used only in the main part On the right is an example of the TFO frames specified for the AMR-WB, the coding mode is 23.85 kb/s 11 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Lawful interception But how bad the noise really is? Before an operator may launch a commercial telecommunication network, it has to provide the lawful interception service. The quality provided for the authorities has to be the same or better than the quality provided for the monitored target PCM-based intercepted signals are directed to the authorities as such Coded signals are converted into PCM form What to do if the intercepted signal contains TFO frames? After all, the signal is noisy The solution is utilization of the passive TFO protocol But how bad the noise really is? 12 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Signal interception simulation Theoretical noise floor values were calculated with the assumption that every bit in signal representation raises the dynamics of the signal 6 dB The results were verified by sending silence through the testing system Also the MOS quality values of the speech signals were evaluated using the WB-PESQ tool In tests the scheme presented on the right was simulated 13 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Test results: Noise floor values Linear notation of the A-law is 13 bits and the µ-law is 14 bits. The first bit is the sign bit and it is not one of the effective bits in representation In theory only half of the bits are really replaced → measured noise floor values are lower than the calculated ones 14 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Test results: MOS quality values The level of the original signals was -26 dBov and SNR 45 dB Decoded from TFO frames signals (2b) are slightly different than the originally decoded ones (2a), as TFO protocol needs approx 1 second time to establish a connection. During that time no coded speech frames are sent 15 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy

Conclusions → the passive TFO protocol is needed indeed! SNR values of the intercepted signals with AMR-WB-specific TFO frames were 15-25 dB (original signals -26 dBov) and MOS grades below two. If the original signals would have contained noise from the beginning, as it is usually in real phone-calls, the quality would have been lower Using in the tests signals with lower levels, -30 and -36 dBov, which corresponds to intensive whispering in real-world calls, the results would have been even worse → authorities will not be satisfied with the quality of the intercepted signal → the passive TFO protocol is needed indeed! 16 © 2006 Nokia AMRWB_depl.ppt / 2006-04-11 / SHy