LECTURE 5 5-1 Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.

Slides:



Advertisements
Similar presentations
Signal Encoding Techniques
Advertisements

T.Sharon-A.Frank 1 Multimedia Compression Basics.
Analogue to Digital Conversion (PCM and DM)
Motivation Application driven -- VoD, Information on Demand (WWW), education, telemedicine, videoconference, videophone Storage capacity Large capacity.
CHAPTER 4 DIGITAL MODULATION Part 1.
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
CELLULAR COMMUNICATIONS 5. Speech Coding. Low Bit-rate Voice Coding  Voice is an analogue signal  Needed to be transformed in a digital form (bits)
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
SWE 423: Multimedia Systems
Chapter 7 End-to-End Data
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
Spatial and Temporal Data Mining
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.
CMPT 365 Multimedia Systems
T.Sharon-A.Frank 1 Multimedia Image Compression 2 T.Sharon-A.Frank Coding Techniques – Hybrid.
Fundamental of Wireless Communications ELCT 332Fall C H A P T E R 6 SAMPLING AND ANALOG-TO-DIGITAL CONVERSION.
Department of Computer Engineering University of California at Santa Cruz Data Compression (2) Hai Tao.
Chapter 4 Digital Transmission
331: STUDY DATA COMMUNICATIONS AND NETWORKS.  1. Discuss computer networks (5 hrs)  2. Discuss data communications (15 hrs)
CSE679: MPEG r MPEG-1 r MPEG-2. MPEG r MPEG: Motion Pictures Experts Group r Standard for encoding videos/movies/motion pictures r Evolving set of standards.
4.2 Digital Transmission Pulse Modulation (Part 2.1)
Digital Communication Techniques
Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.
Fundamentals of Digital Communication
Chapter Seven: Digital Communication
ECE472/572 - Lecture 12 Image Compression – Lossy Compression Techniques 11/10/11.
Multimedia Data Video Compression The MPEG-1 Standard
Video Basics. Agenda Digital Video Compressing Video Audio Video Encoding in tools.
MPEG Motion Picture Expert Group Moving Picture Encoded Group Prateek raj gautam(725/09)
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
AUDIO COMPRESSION msccomputerscience.com. The process of digitizing audio signals is called PCM PCM involves sampling audio signal at minimum rate which.
Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.
10/10/04 L5/1/28 COM342 Networks and Data Communications Ian McCrumRoom 5D03B Tel: voice.
Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.
Speech Coding Submitted To: Dr. Mohab Mangoud Submitted By: Nidal Ismail.
SPEECH CODING Maryam Zebarjad Alessandro Chiumento.
Final Review by Amy Zhang Digital Media Computing.
MPEG Audio coders. Motion Pictures Expert Group(MPEG) The coders associated with audio compression part of MPEG standard are called MPEG audio compressor.
Image Processing and Computer Vision: 91. Image and Video Coding Compressing data to a smaller volume without losing (too much) information.
CIS679: Multimedia Basics r Multimedia data type r Basic compression techniques.
Chapter 4 Audio and video compression
Image Compression Supervised By: Mr.Nael Alian Student: Anwaar Ahmed Abu-AlQomboz ID: IT College “Multimedia”
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Outline Kinds of Coding Need for Compression Basic Types Taxonomy Performance Metrics.
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 1 ECE-C490 Image Processing Architecture MP-3 Compression Course Review Oleh Tretiak.
1 Speech Synthesis User friendly machine must have complete voice communication abilities Voice communication involves Speech synthesis Speech recognition.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
1 Audio Coding. 2 Digitization Processing Signal encoder Signal decoder samplingquantization storage Analog signal Digital data.
Digital Video Digital video is basically a sequence of digital images  Processing of digital video has much in common with digital image processing First.
SPEECH CODING Maryam Zebarjad Alessandro Chiumento Supervisor : Sylwester Szczpaniak.
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 11 COMPRESSION.
Voice Sampling. Sampling Rate Nyquist’s theorem states that a signal can be reconstructed if it is sampled at twice the maximum frequency of the signal.
Fundamentals of Multimedia Chapter 6 Basics of Digital Audio Ze-Nian Li and Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈.
1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.
JPEG Compression What is JPEG? Motivation
CSI-447: Multimedia Systems
Video Basics.
Digital Communications Chapter 13. Source Coding
Vocoders.
Digital Communication
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
CMPT 365 Multimedia Systems
Standards Presentation ECE 8873 – Data Compression and Modeling
Judith Molka-Danielsen, Oct. 02, 2000
Govt. Polytechnic Dhangar(Fatehabad)
Presentation transcript:

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information Data Speech Image Encoding of Speech Signals – Vocoders Makes use of special properties of speech Periodicity Distinction between voiced and unvoiced sounds Image Encoding Makes use of suitable transforms Uses special techniques Transmits only the difference between image frames Combines speech and image coding for video

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Analog Waveform Encoding x(t) t t t t PAM PWM PPM Observe Original Signal Amplitude of a train of pulses is modulated: Pulse Amplitude  Signal Amplitude Width of a train of pulses is modulated: Pulse Width  Signal Amplitude Position of a train of pulses is modulated: Pulse Position  Signal Amplitude

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Pulse Coded Modulation (PCM) x(t) t t PCM y(t) Pulse Coded Modulation Samples are digitized to n bits (this example uses 3 bits) Using more bits increases accuracy PCM has a significant DC component Modulating onto higher frequency carrier reduces DC component Other PCM Schemes Delta Modulation (DM) Differential PCM (DPCM) Adaptive DPCM (ADPCM) Digital Waveform Coding DSPs are ideal for implementing most PCM schemes PCM = Any Analog to Digital conversion where the result is a serial bit stream. Several methods of converting and transmitting PCM exist.

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Speech Coding – Vocoders Speech vocoders exploit special properties of speech Vocal Tract = Acoustic Tube Voiced sounds are periodic in nature, e.g., “A”, “E” sounds Unvoiced sounds are like random noise, e.g., “S”, “F” sounds Aim for maximum possible compression Understandable but not 100% faithful reproduction A Typical Vocoder – Synthesis RANDOM NOISE VOCAL- TRACT MODELER x PERIODIC EXCITATION GAIN SPEECH TIME - VARYING FILTER PITCH

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Channel Vocoder – Coder Speech is split into subbands for spectral envelope detection Envelope detection aids vocal tract modeling Pitch detector estimates the frequency and aids in distinguishing voiced and unvoiced segments Outputs are multiplexed to produce coded speech signal BANDPASS FILTER RECTIFIER LOWPASS FILTER BANDPASS FILTER RECTIFIER LOWPASS FILTER PITCH DETECTOR SPEECH IN CODED OUTPUT MUX 1 16 ADC

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Channel Vocoder - Synthesis Pitch information switches between “Voiced - Pulse Source” and “Unvoiced - Random Noise” sounds Pitch produces correct frequency for voiced sounds DSP is the ideal medium for implementing vocoders Filters may be implemented efficiently Speech spectrum can be analyzed easily Vocal tract can be modeled easily + SPEECH BANDPASS FILTER X BANDPASS FILTER X RANDOM NOISE PULSE SOURCE PITCH DE- MUX CODED INPUT DAC

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Image Coding Bandwidth required for current TV Image Resolution –NTSC: 484 x 427 pixels, Hz frame rate –PAL: 580 x 425 pixels, 25 Hz frame rate Screen has 4:3 aspect ratio Frames are interlaced to reduce flicker Black and white bandwidth –NTSC: 0.5 x 484 x 427 x = 3.1 M Hz –PAL: 0.5 x 580 x 425 x 25 = 3.1 M Hz

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Bandwidth for TV For black and white picture, bandwidth required is approximately 3 MHz Each pixel represents one sample so the required bandwidth is 6 MHz for a horizontal resolution of 3 MHz For color pictures, basic rate is about 150 MBits per second White Pixel Black Pixel 3 MHz 36 MHz A

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Transform Coding Transform coding of images reduces bandwidth requirements Most of the information in a picture is at low frequencies Transform coders preserve information at low frequencies Ignoring transformed signals with small coefficients Reduces bandwidth required Does not significantly degrade picture quality FFT is not very useful because it produces imaginary components Discrete cosine transform (DCT) is very popular in image processing Image is divided into 8x8 element blocks and each block is individually transformed A full-screen color image requires 200 Mbit/s channel By using transforms and SPCM, the same image can be transmitted over a 34 Mbit/s channel The resulting reduction is approximately 6 times Huffman coding may be used on transformed signals to further reduce the bandwidth requirements

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Video Compression H Series standards are most popular for video compression H.261 and H.320 standards describe compression algorithms H Series Coding: The difference between present and previous frames is transformed with DCT, Huffman coded and transmitted Motion detector produces displacement vectors indicating direction and displacement of movement between previous and present frame VIDEO IN DCT HUFFMAN CODER  MOTION DETECTOR PREVIOUS FRAME STORE COEFFICIENT VALUES DISPLACEMENT VECTORS + – IMAGE REGENERATION PRESENT FRAME Simplified Diagram of H.261 Coder

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Video Decompression DECODED PICTURE COEFFICIENT VALUES DISPLACEMENT VECTORS + + IDCT FRAME STORE  Simplified Block Diagram of H.261 Decoder H Series standards allow manufacturers to design for different applications with different performance levels Videoconferencing systems Videophones H.261 and more recent H.320 standards are computationally intensive DSPs provide the best implementation platform

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Joint Photographic Expert Group - JPEG COEFFICIENT CODER DCT HUFFMAN CODER QUANTIZER PICTURE ENCODED DATA IDCT INVERSE QUANTIZER COEFFICIENT DECODER HUFFMAN DECODER DECODED PICTURE ENCODED DATA Picture is transform-coded by DCT in 8x8 blocks Coefficients are quantized More bits are used for lower frequencies ensuring greater accuracy for higher information content Next stage codes and orders coefficients Finally, coefficients are Huffman encoded to reduce amount of data JPEG decoder reverses the coding process to produce a still picture

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Moving Pictures Expert Group - MPEG Each frame is split into small blocks Blocks are transform-coded by DCT Coefficients are coded with one of the following: Forward or Backward predictive coding or a combination of both This scheme makes use of the similarity between the present frame and either the previous or the next frame Finally, blocks are quantized for transmission QUANTIZE DCT HUFFMAN CODER FORWARD/ BACKWARD PREDICTIVE CODING MOVING PICTURE ENCODED DATA MPEG coding is similar to H Series (H.320) and JPEG standards It is primarily aimed at digital storage media such as CD-ROM

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Summary Variants of pulse coded modulation (PCM) are widely used in waveform encoding Speech coding makes use of its special properties such as: Periodicity of voiced sounds Exclusion of areas not detectable by human ear Digital images require an enormous amount of storage A single black and white TV frame needs approximately a quarter of a million bits Color frames need even more Image coders use transform coding FFT is not a suitable coder for images Discrete cosine transform (DCT) is used widely For moving images, coding systems exploit the similarity between frames Only changes to the previous frame are transmitted MPEG uses similarity to next as well as previous frame DSPs are ideal for medium implementation of most coding schemes