MPEG1 Coding Standard By: Richard M Tarbell. MPEG: Motion Picture Expert Group First devised in 1988 by a group of almost 1000 experts Primary motivations:

Slides:



Advertisements
Similar presentations
Multimedia System Video
Advertisements

MPEG Moving Picture Experts Group. What defines good video quality? Size of pictures Bitrate of channel medium (especially in real-time applications)
CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.
15 Data Compression Foundations of Computer Science ã Cengage Learning.
MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
MPEG: A Video Compression Standard for Multimedia Applications Václav Hlaváč CTU Prague, Initial material were slides of Didier.
Data Compression CS 147 Minh Nguyen.
Time-Frequency Analysis Analyzing sounds as a sequence of frames
Image Compression. Data and information Data is not the same thing as information. Data is the means with which information is expressed. The amount of.
AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
Chapter 7 End-to-End Data
School of Computing Science Simon Fraser University
Spring 2003CS 4611 Multimedia Outline Compression RTP Scheduling.
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
MPEG: A Video Compression Standard for Multimedia Applications Didier Le Gall Communications of the ACM Volume 34, Number 4 Pages 46-58, 1991.
SWE 423: Multimedia Systems Chapter 7: Data Compression (1)
Spatial and Temporal Data Mining
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
Multimedia Retrieval Architecture Anandi Giridharan Electrical Communication Engineering, Indian Institute of Science, Bangalore – , India Multimedia.
Why Compress? To reduce the volume of data to be transmitted (text, fax, images) To reduce the bandwidth required for transmission and to reduce storage.
1 JPEG Compression CSC361/661 Burg/Wong. 2 Fact about JPEG Compression JPEG stands for Joint Photographic Experts Group JPEG compression is used with.jpg.
Image Compression JPEG. Fact about JPEG Compression JPEG stands for Joint Photographic Experts Group JPEG compression is used with.jpg and can be embedded.
1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.
Image and Video Compression
Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.
Trevor McCasland Arch Kelley.  Goal: reduce the size of stored files and data while retaining all necessary perceptual information  Used to create an.
{ Lossy Compression William Dayton Nick Trojanowski.
MPEG-2 Standard By Rigoberto Fernandez. MPEG Standards MPEG (Moving Pictures Experts Group) is a group of people that meet under ISO (International Standards.
ECE472/572 - Lecture 12 Image Compression – Lossy Compression Techniques 11/10/11.
MPEG: A Video Compression Standard for Multimedia Applications Didier Le Gall Communications of the ACM Volume 34, Number 4 Pages 46-58, 1991.
1 Image Compression. 2 GIF: Graphics Interchange Format Basic mode Dynamic mode A LZW method.
Multimedia Data Video Compression The MPEG-1 Standard
LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.
Video Basics. Agenda Digital Video Compressing Video Audio Video Encoding in tools.
MPEG-1 and MPEG-2 Digital Video Coding Standards Author: Thomas Sikora Presenter: Chaojun Liang.
MPEG: (Moving Pictures Expert Group) A Video Compression Standard for Multimedia Applications Seo Yeong Geon Dept. of Computer Science in GNU.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
Video Compression: Performance evaluation of available codec software Sridhar Godavarthy.
Image Processing and Computer Vision: 91. Image and Video Coding Compressing data to a smaller volume without losing (too much) information.
CIS679: Multimedia Basics r Multimedia data type r Basic compression techniques.
Image Compression Supervised By: Mr.Nael Alian Student: Anwaar Ahmed Abu-AlQomboz ID: IT College “Multimedia”
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Digital Image Processing Image Compression
Outline Kinds of Coding Need for Compression Basic Types Taxonomy Performance Metrics.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
MPEG-1Standard By Alejandro Mendoza. Introduction The major goal of video compression is to represent a video source with as few bits as possible while.
IntroductiontMyn1 Introduction MPEG, Moving Picture Experts Group was started in 1988 as a working group within ISO/IEC with the aim of defining standards.
JPEG Image Compression Standard Introduction Lossless and Lossy Coding Schemes JPEG Standard Details Summary.
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 11 COMPRESSION.
MPEG CODING PROCESS. Contents  What is MPEG Encoding?  Why MPEG Encoding?  Types of frames in MPEG 1  Layer of MPEG1 Video  MPEG 1 Intra frame Encoding.
Submitted To-: Submitted By-: Mrs.Sushma Rani (HOD) Aashish Kr. Goyal (IT-7th) Deepak Soni (IT-8 th )
MP3 and AAC Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD
RICO HARTONO JAHJA SOURCE CODING: PART IV.
JPEG Compression What is JPEG? Motivation
Data Compression.
Algorithms in the Real World
Data Compression.
Video Compression - MPEG
Chapter 7.2: Layer 5: Compression
Data Compression CS 147 Minh Nguyen.
Standards Presentation ECE 8873 – Data Compression and Modeling
Judith Molka-Danielsen, Oct. 02, 2000
MPEG-1 Overview of MPEG-1 Standard
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Govt. Polytechnic Dhangar(Fatehabad)
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Presentation transcript:

MPEG1 Coding Standard By: Richard M Tarbell

MPEG: Motion Picture Expert Group First devised in 1988 by a group of almost 1000 experts Primary motivations: –High compression rate for video storage comparable to VHS quality –Random access capability Overall MPEG standard combines video and audio signal into one large compression algorithm

Standard Specs of MPEG1 Works at 1.5 megabits per second bitrate –Bitrate = (length * width * depth * fps) / (compression ratio) Makes use of the 8x8 discrete cosine transform (DCT) for intraframe compression Uses an algorithm to reduce both temporal redundancy and spatial redundancy The code can use several different variable length codebooks to achieve a higher compression ratio –the more codebooks used, the higher the potential compression ratio

Was specifically designed for digital storage media Did not initially lend itself to: Real time applications such as videophone and video-over-IP Applications that involve no long-term storage medium Uses a complex compression algorithm to allow for a simple decompression algorithm Simple decompression algorithm allows for real-time decompression MPEG1

MPEG1 Compression Aspects Interframe compression –Correlation/compression between like frames –Based on H.261 compression standard Intraframe compression –Correlation/compression within a frame –Based on “baseline” JPEG compression standard Audio compression –Three different layers (MP3) Lossless and Lossy compression are both used for a high compression rate

What defines good video quality? Size of pictures Bitrate of channel medium (especially in real-time applications) Resolution of the original images and/or frames Frame rate of source AND frame rate of reproduction medium (24 frames per second is standard movie quality) If any one of these factors is inferior, it can bottleneck the overall system and cause reduction in video quality: GIGO  garbage in = garbage out !!!

Intraframe Coding: Compression within each individual frame Intraframe Compression: Reduces spatial redundancy to reduce necessary transmission rate Encoding I blocks are practically identical to JPEG standard Makes use of the DCT transform along with zigzag ordering

Video Coloring Scheme Translate the RGB system into a YUV system Human perception is less sensitive to chrominance than to brightness –Translate brightness into chrominance and then the resolution does not have to be as good  lower necessary bitrate Coloring Scheme: Normal JPEG Coloring Blocks Red Blue Green Yellow Cr Cb Cg Translation formulas: Y = Wr*R + Wb*B + Wg*G Cr = Wr* (R - Y) Cb = Wb* (B - Y) Cg =Wg* (G - Y)

chro·mi·nance P Pronunciation Key (kr m -n ns) n. Pronunciation Key The difference between one color and a reference color of the same brightness and chromaticity. According to dictionary.com: Macroblock: composed of six blocks (4:2:0 or 4:1:1 format) Four blocks of yellow (luminance) One block of Cb (chrominance) One block of Cr (chrominance) GOB: Group of Blocks, composed of 33 macroblocks in an 11x3 arrangement

Encoding an Image Into a JPEG Block 1.Divide the image into blocks, the size of each block is 8x8 2.Level shift and use the 8x8 DCT transform If there exists a block or several blocks that are not of size 8x8, then force them to be by replicating the last column or row until proper dimension is achieved DC and low frequencies High frequencies

Once the 8x8 matrix has been formed, it is quantized…..

After quantization, the 8x8 matrix is transformed into a single vector using zigzag extraction: DataVector = Cat(1,2,3,4,5,…….63,64)

Why a zig-zag pattern? The zig-zag pattern starts with DC and low frequency values first and proceeds to high frequency values last Most individual frames and blocks have much more energy in the low frequency spectrum than in the high frequency spectrum Allows for better prediction to be made once the data vector has been assembled: …………………….. Data vector is generally monotonically decreasing in energy, most energy is in first few blocks

Data Compression of the New Vector Run-length code for grey: long “runs” of grey can be stored like so: (Grey color) (* 3) (regular color) (regular color) (Grey color) (* 4) Huffman coding: Lossless coding Use tree diagram to decide how to encode all values Assign the shortest codewords to the most frequent values Assign longest codewords to least frequent values

Random Access and Interframe Compression Prediction that does not depend upon the user accessing the first frame (skipping through movie scenes, arbitrary point pick-up Temporal Redundancy Only perform repeated encoding of the parts of a picture frame that are rapidly changing Do not repeatedly encode background elements and still elements  Random access capability

Decoding with non-random access Decoding and playing sub-frames located in section G, All frames before section G must be decoded as well Synchronization algorithm issues. If section G is far along in the movie, this could take a considerable amount of time.

Decoding with random access When decoding any frame after an I frame (frame G in this example) we only have to decode past frames until we reach an I frame saves time when skipping from frame to frame Introduce “I” frames, frames that are NOT predictively encoded by design Frames that are still encoded using a prediction algorithm are called “P” frames I frames are not predictively encoded reduction in compression ratio Depending on the concentration of I frames, there is a tradeoff: More I frames  faster random access time Less I frames  better compression ratio

Most MPEG1 implementations use a large number of I frames to ensure fast access -Somewhat low compression ratio by itself For predictive coding, P frames depend on only a small number of past frames -Using less past frames reduces the propagation error To further enhance compression in an MPEG1 file, introduce a third frame called the “B” frame  bi-directional frame B frames are encoded using predictive coding of only two other frames: a past frame and a future frame By looking at both the past and the future, this helps reduce prediction error due to rapid changes from frame to frame (i.e. a fight scene or fast-action scene)  IPP

Predictive coding hierarchy: I, P, and B frames I frames (black) do not depend on any other frame and are encoded separately Called “Anchor frame” P frames (red) depend on the last P frame or I frame (whichever is closer) Also called “Anchor frame” B frames (blue) depend on two frames: the closest past P or I frame, and the closest future P or I frame B frames are NOT used to predict other B frames, only P frames and I frames are used for predicting other frames

MPEG1 Temporal Order of Compression I frames are generated and compressed first Have no frame dependence P frames are generated and compressed second Only depend upon the past I frame values B frames are generated and compressed last Depend on surrounding frames Forward prediction needed

Differences Between MPEG1 and H.261 MPEG1 uses I, P, and B frames, H.261 uses I and P frames –Large gaps between I frames and P frames –Predicted frame and reference frame are not necessarily adjacent –Higher coding rate –More prediction error due to larger distance between predicted frame and reference frames Prediction of B frame uses two references –One past frame and one future frame  forward prediction MPEG1 was intended for motion pictures  not real time - H.261 was intended for video conferencing Computational cost - MPEG1 is more advanced  higher computation

Rate control and MPEG1 Although MPEG1 is for storage, rate control is possible in certain software packages/modifications Techniques to increase code rate Increase quantizer step size to reduce picture quality B frame quality is reduced first to reduce net error –No other frame depends on a B frame

MPEG Audio Compression First developed to compress 1.5 Mbits/sec (normal uncompressed audio stream) into 56 kbits/sec (the rate of a basic dialup modem) Can encode mono, stereo, and joint-stereo audio Designed for generic waveforms (not partial to speech) Many different standard sampling rates: 16kHz, 22.05kHz, 24kHz, 32 kHz, 44.1 kHz, 48 kHz Uses sub-band filtering 32 filter bands for layers I and II 32 filter bands and 18 sub-bands for layer III (576 bands total) Each is equidistant in bandwidth (width = Fs / 64)

Layers of MPEG Audio Layer I 12 samples per sub-band (384 total samples) Compression ratio: approx 4:1 Around 384kbps (depends on chosen sampling rate) Layer II 36 samples per sub-band (1152 total samples) Compression ratio: approx 6:1 to 8:1 Around 256kbps to 192kbps Layer III 12 samples per sub-band (384 total samples) Compression ratio: approx 10:1 to 12:1

Perceptual Coding and Psychoacoustics Similar concept to MPEG interframe coding: irrelevancy is identified and then removed –This is called auditory masking Input signal is quantized according to a level that meets both bitrate and masking requirements DFT of the input is taken, masking takes place on several levels

Two major types of masking take place for encoding: Temporal Masking: loud noises that occur close to each other in time (about 3 to 5 milliseconds) can be approximated by a single loud noise Frequency Masking: areas of the spectrum that have high energy and are close together can be filtered to eliminate “spikes” in the spectrum

Human Perception Smallest signal perception is 0dB (JBN) Largest signal perception is about 135dB (threshold of pain)  Dynamic range of about 5,000,000 to 1 (ratio of largest signal to smallest signal)

Normal Spectrum

Auditory Masking Spectrum

Encoding model for Layer I =========  ========  ========= 

MPEG1 Audio Encoding Process 1.Break the signal into frames (a frame is usually several milliseconds in length) 2. Determine spectral energy distribution by taking the DFT, then break this distribubtion into sub-bands via filtering 3. Consider the specified bit rate (this will determine the number of bits per frame that can be used)

4. Compare the DFT signal to the human psychoacoutic model, perform masking 5. Use a modified Huffman code to achieve compression 6. After each frame has been compressed, place a header on each frame 7. Reassemble the new frames with headers back into a continuous bit stream

MPEG1 Audio is Used: In CD format (first intentions were for the CD) VCD and DVD formats (even though MPEG1 picture format is obsolete for these kinds of media) As a basis for AAC audio (AAC was modeled to rival MPEG2 audio, AAC is considered to be superior to MP3 audio) MPEG1 layers I and II were based off of Sony MUSICAM technology

Primary References: Sayood, Khalid. Introduction to Data Compression, 2 nd Edition. 2000, Morgan Kaufmann Publishers: San Diego. 3.Berkely Multimedia Research Center