Audio Coding

Presentation transcript:

1 Audio Coding

2 Digitization [Diagram labels: analog signal, sampling, quantization, signal encoder, digital data, storage, processing, signal decoder]

3 Overview of Today Sampling Techniques: PCM –Linear –μ-law Generic Coding Techniques: DPCM, ADPCM Psychoacoustic Coding: MPEG-1 Speech-Specific Techniques: Vocoding

4 Encoder Design Bandlimiting filter –Smooths analog signals Analog to digital converter (ADC) –Samples and quantizes analog signals.

5 Bandlimiting filter Passes only frequency components up to half the sampling rate (the Nyquist frequency).

6 Analog to digital converter

7 Sampling Pulse Amplitude Modulation (PAM) –Each sample's amplitude is represented by 1 ________ value Sampling theory (_________) –If input signal has ________ frequency (bandwidth) f, sampling frequency must be at least ____ –With a _____-pass filter to interpolate between samples, the input signal can be fully reconstructed

8 PCM Pulse Code Modulation (PCM) –Each sample's amplitude is represented by an ________ code-word –Each bit of resolution adds __ dB of dynamic range –Number of bits required depends on the amount of noise that is tolerated Quantization error (“noise”) n = SNR –

9 Linear PCM Quantization levels are _________ spaced. ___ bit samples provide plenty of dynamic range. Compact discs do this.
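A minimal sketch of a linear quantizer, assuming samples normalized to [-1, 1] and a symmetric signed integer code. The function names and the `bits` parameter are illustrative and do not come from the slides.

```python
def quantize_linear(x, bits):
    """Map x in [-1.0, 1.0] to a signed integer code with equal spacing between levels."""
    top = 2 ** (bits - 1) - 1          # largest positive code, e.g. 127 for 8 bits
    code = round(x * top)
    return max(-top, min(top, code))   # clamp inputs that stray outside [-1, 1]

def dequantize_linear(code, bits):
    """Map an integer code back to an amplitude in [-1.0, 1.0]."""
    top = 2 ** (bits - 1) - 1
    return code / top

# Round-trip one sample at two resolutions; more bits give a smaller error.
x = 0.3
err_8 = abs(dequantize_linear(quantize_linear(x, 8), 8) - x)
err_12 = abs(dequantize_linear(quantize_linear(x, 12), 12) - x)
```

The reconstruction error is bounded by half a quantization step, so it shrinks as the word length grows.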

10 Undersampling The sample rate is below the Nyquist rate. Alias components are added to the original signal and cause distortion. The bandlimiting filter is therefore also called an antialiasing filter.
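A small illustration of aliasing, assuming ideal sampling of pure cosines. The 8 kHz rate and 7 kHz tone are example values chosen here, not taken from the slides. Once sampled, the 7 kHz tone produces exactly the same sample values as a 1 kHz tone, which is why it must be filtered out before the ADC.

```python
import math

def sample_cosine(freq_hz, rate_hz, count):
    """Return `count` samples of a unit-amplitude cosine at freq_hz, sampled at rate_hz."""
    return [math.cos(2 * math.pi * freq_hz * n / rate_hz) for n in range(count)]

def alias_frequency(freq_hz, rate_hz):
    """Frequency in [0, rate_hz / 2] that a sampled tone at freq_hz is indistinguishable from."""
    folded = freq_hz % rate_hz
    return min(folded, rate_hz - folded)

rate = 8000
high = sample_cosine(7000, rate, 16)
low = sample_cosine(alias_frequency(7000, rate), rate, 16)
max_diff = max(abs(a - b) for a, b in zip(high, low))  # effectively zero
```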

11 Quantization intervals

12 Associated waveform set

13 μ-law companding (ITU Rec. G.711) Non-linear quantization of the signal's amplitude –Quantization step-size decreases logarithmically with signal ______ –Low-amplitude samples represented with ______ accuracy than high-amplitude samples –Humans are less sensitive to changes in “____” sounds than “_____” sounds

14 μ-law companding $f(x) = 127 \cdot \operatorname{sign}(x) \cdot \dfrac{\ln(1 + \mu |x|)}{\ln(1 + \mu)}$ (x normalized to [-1, 1]) Provides __-bit quality (dynamic range) with an _-bit encoding Used in North American & Japanese ISDN voice service Simple to compute encoding
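The companding formula on this slide can be evaluated directly. The sketch below assumes μ = 255 (the slide leaves μ symbolic) and adds an inverse function for illustration. G.711 itself specifies a segmented approximation of this curve, so this is the continuous formula only, not a G.711 codec.

```python
import math

MU = 255  # assumed value; not stated on the slide

def mu_law_compress(x, mu=MU):
    """Evaluate f(x) = 127 * sign(x) * ln(1 + mu*|x|) / ln(1 + mu) for x in [-1, 1]."""
    sign = (x > 0) - (x < 0)
    return 127 * sign * math.log1p(mu * abs(x)) / math.log1p(mu)

def mu_law_expand(y, mu=MU):
    """Invert mu_law_compress: recover x in [-1, 1] from y in [-127, 127]."""
    sign = (y > 0) - (y < 0)
    return sign * math.expm1(abs(y) / 127 * math.log1p(mu)) / mu

quiet = mu_law_compress(0.01)   # a quiet sample lands far above the linear value 1.27
loud = mu_law_compress(1.0)     # full scale maps to 127
```

Quiet samples are spread over many more output codes than a linear mapping would give them, which is the point of companding.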

15 Difference Encoding Differential-PCM (DPCM) –Exploit _________ redundancy in samples –___________ between 2 x-bit samples can be represented with significantly fewer than x-bits –Transmit the difference (rather than the ________)

16 DPCM Working Principle Previous sampling value
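A minimal sketch of the DPCM idea, assuming integer samples and no quantization of the differences, so the round trip is lossless. The function names are illustrative.

```python
def dpcm_encode(samples):
    """Return the difference between each sample and the previous one (first vs. 0)."""
    diffs = []
    previous = 0
    for s in samples:
        diffs.append(s - previous)
        previous = s
    return diffs

def dpcm_decode(diffs):
    """Rebuild the samples by accumulating the differences."""
    samples = []
    previous = 0
    for d in diffs:
        previous += d
        samples.append(previous)
    return samples

pcm = [100, 102, 105, 104, 101]
diffs = dpcm_encode(pcm)        # [100, 2, 3, -1, -3]
restored = dpcm_decode(diffs)   # [100, 102, 105, 104, 101]
```

After the first value, the differences are much smaller in magnitude than the samples, so they fit in fewer bits.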

17 Slope Overload Problem Differences in high frequency signals near the ___________ frequency cannot be represented with a smaller number of bits! –Error introduced leads to severe distortion in the ______ frequencies
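Slope overload can be reproduced with a toy tracker whose per-sample difference is clamped, standing in for a difference code with too few bits. The 8 kHz rate, the 0.1 clamp, and the test frequencies are assumptions made for this sketch.

```python
import math

def clamped_dpcm_error(freq_hz, rate_hz=8000, max_step=0.1, count=200):
    """Track a unit sine using differences clamped to +/-max_step; return the peak error."""
    estimate = 0.0
    peak = 0.0
    for n in range(count):
        target = math.sin(2 * math.pi * freq_hz * n / rate_hz)
        diff = max(-max_step, min(max_step, target - estimate))
        estimate += diff
        peak = max(peak, abs(target - estimate))
    return peak

slow_error = clamped_dpcm_error(50)     # slope stays under the clamp: tracked closely
fast_error = clamped_dpcm_error(3000)   # slope exceeds the clamp: large tracking error
```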

18 Adaptive DPCM (ADPCM) Use a larger step-size to encode differences between ______-frequency samples & a smaller step-size for differences between ____- frequency samples Use ________ sample values to estimate changes in the signal in the near future

19 ADPCM To ensure differences are always small... –Adaptively change the ____-size (quanta) –(Adaptively) attempt to _____ next sample value [Block diagram: the y-bit PCM sample minus the predicted sample gives a difference, which the quantizer turns into the x-bit ADPCM “difference”; a dequantizer, step-size adjuster, and predictor feed back the predicted PCM sample n+1]
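The feedback loop in the block diagram can be sketched as a toy codec. This is not the IMA or ITU-T G.726 algorithm: the previous-sample predictor, the doubling and halving rule, and all parameter values are assumptions chosen to keep the example short. The encoder and decoder share `_update`, so both derive the same step size from the transmitted codes alone.

```python
def _update(predicted, step, code, max_code):
    """Advance the shared state: new predicted sample and adapted step size."""
    predicted += code * step                # dequantize and accumulate
    if abs(code) >= max_code - 1:           # large codes: signal is changing fast
        step *= 2.0
    elif abs(code) <= 1:                    # small codes: signal is changing slowly
        step = max(1.0, step / 2.0)
    return predicted, step

def adpcm_encode(samples, bits=4, step=4.0):
    """Quantize the difference from the predicted sample into a small signed code."""
    max_code = 2 ** (bits - 1) - 1
    predicted = 0.0
    codes = []
    for s in samples:
        code = round((s - predicted) / step)
        code = max(-max_code, min(max_code, code))
        codes.append(code)
        predicted, step = _update(predicted, step, code, max_code)
    return codes

def adpcm_decode(codes, bits=4, step=4.0):
    """Rebuild samples by replaying the same state updates the encoder made."""
    max_code = 2 ** (bits - 1) - 1
    predicted = 0.0
    out = []
    for code in codes:
        predicted, step = _update(predicted, step, code, max_code)
        out.append(predicted)
    return out

pcm = [0, 10, 40, 100, 120, 125, 126, 126]
codes = adpcm_encode(pcm)
decoded = adpcm_decode(codes)
```

The step size grows while the signal rises steeply and shrinks again once it levels off, so the decoded signal lags during the ramp and then settles close to the input.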

20 Psychoacoustic Fundamentals Absolute threshold of hearing Critical band frequency analysis Frequency masking Temporal masking

21 Absolute Threshold of Hearing Human perception of sound is a function of ________ and signal __________ –(MPEG exploits this relationship.) Sampled segments of the source audio waveform are analyzed but only those features _____________ to the ear are transmitted. Psychoacoustic model is used to identify _________ masking and ________ masking and eliminate them from the transmitted signal. [Figure: sound level (dB) vs. frequency (kHz), showing the audible and inaudible regions and the maximum allowable energy level for coding distortion]

22 Auditory Masking The presence of tones at certain frequencies makes us unable to perceive tones at other “_________” frequencies –Humans cannot distinguish between tones within _____ Hz at low frequencies and _____kHz at high frequencies [Figure: sound level (dB) vs. frequency (kHz), showing the audible and inaudible regions, a masking tone, and a masked tone]

23 MPEG Encoder Block Diagram [Block diagram labels: PCM Audio Samples (32, 44.1, 48 kHz), Mapping, Quantizer, Coding, Frame Packing, Psychoacoustic Model, Ancillary Data, Encoded Bitstream]

24 Vo-coding Concept: Develop a __________ model of the vocal cords & throat –Derive/compute _____ parameters for a short interval and transmit to the decoder –Use the parameters to _______ speech at the decoder So what is a good model? –A “buzzer” in a “tube”! –The buzzer is characterized by its _________ & _______ –The tube is characterized by its ___________s

25 Vocoding - Basic Concepts Formant: frequency maxima & minima in the spectrum of the speech signal Vocoders code –_____ –Period –_________, and –signaling vocal tract _________ parameters Voiced sounds: m, v, and l. Unvoiced sounds: f and s. [Figure: amplitude vs. frequency (kHz)]

26 Linear Predictive Coding (LPC) –A sample is represented as a linear combination of ___ previous ________s $y(n) = \sum_{k=1}^{p} a_k \, y(n-k) + G \cdot x(n)$ “Buzzer” and “Tube” Model Vocoding principles: –voice = _________s + buzz ______ & intensity –voice – estimated ________s = “residue”
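The LPC recurrence on this slide, y(n) = Σ a_k y(n – k) + G x(n), can be run as a synthesis filter. In this sketch the coefficient list, the gain, and the pulse-train excitation standing in for the “buzzer” are made-up values for illustration, not parameters from any standard.

```python
def lpc_synthesize(coeffs, gain, excitation):
    """Compute y(n) = sum_k a_k * y(n - k) + G * x(n) for each excitation sample x(n)."""
    history = [0.0] * len(coeffs)   # holds y(n-1), y(n-2), ..., y(n-p)
    out = []
    for x in excitation:
        y = sum(a * h for a, h in zip(coeffs, history)) + gain * x
        out.append(y)
        history = [y] + history[:-1]   # shift the newest output into the history
    return out

# Impulse response of a one-coefficient "tube": each output is half the previous one.
impulse_response = lpc_synthesize([0.5], 1.0, [1.0, 0.0, 0.0, 0.0])

# A periodic pulse train as a stand-in "buzzer" (hypothetical period of 80 samples).
buzz = [1.0 if n % 80 == 0 else 0.0 for n in range(240)]
voiced = lpc_synthesize([0.5], 1.0, buzz)
```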

27 LPC Decoder artificially generates speech via _________ synthesis –A mathematical simulation of the _______ as a series of bandpass filters –Encoder codes & transmits filter _______, pitch period, gain factor, & nature of excitation

28 LPC Schematic

29 LPC Related Standards Standards: –Regular Pulse Excited Linear Predictive Coder (RPE-LPC) Digital cellular standard GSM 06.10 (___ kbps) –Code Excited Linear Predictive Coder (CELP) US Federal Standard 1016 (_____ kbps) Waveform template based to improve sound quality. –Linear Predictive Coder (LPC) US Federal Standard 1015 (______ kbps) Very synthetic and used primarily in military applications with very limited bandwidth.

30 Networking Concerns Audio bandwidth is actually quite small. But human sensitivity to loss and noise is quite ________. Networking concerns: –_______ concealment –________ control Especially for telephony applications.