Download presentation
Presentation is loading. Please wait.
Published byMichael Smith Modified over 8 years ago
2
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 1 ECE-C490 Image Processing Architecture MP-3 Compression Course Review Oleh Tretiak March 13, 2003
3
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 2 Reference: MPEG Handbook: MPEG-1, MPEG-2, MPEG-4 by John Watkinson, available through Drexel Library, Books 24/7
4
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 3 Acoustics and Audition Acoustic fields are mechanical vibration in air Common measure: sound pressure of sinusoidal temporal variation Sound decibels are relative to 2x10 -5 Pascals rms, db(SPL) Weighted response attempts to mimic human hearing
5
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 4 Subjective Weighting
6
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 5 Why Compress? “With the rising density and falling cost of digital recording media, the use of compression in professional audio storage applications is hard to justify. However, where there is a practical or economic restriction on channel bandwidth compression becomes essential. Now that the technology exists to broadcast television pictures using compressed digital coding, it is obvious that the associated audio should be conveyed in the same way.”
7
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 6 How Big is Big Telephone type audio 8 Ksamples per second, 8 bit/sample 64 k bit/sec CD audio 40 k samples, 2 channels, 16 bits per sample 1.28 M bit/sec
8
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 7 Why 16 bits? In short term, 8 bits gives good audio quality Music intensity can vary over a large range Range from threshold to very loud is 120 dB = 10 6 !
9
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 8 Compression Technologies Companding DPCM Subband Coding
10
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 9 Companding Quantization Noise Q
11
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 10 DPCM (Delta Modulation)
12
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 11 Transform Coding Split waveform into segment Take transform Code transform coefficients To avoid artifacts, transforms of overlapping segments are used
13
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 12 Transform and Subbands Fourier transform coefficients correspond to frequency bands Only limited dynamic range is needed for each band
14
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 13 Sound Band Phenomena Two tones at adjacent frequencies can combine to make a beat pattern When the tones are separated in frequency, they may interfere or may be independent. If one of the tones has a small amplitude, it is masked
15
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 14 Examples of Tone Pairs 310320350400 -6 db -12 db -18 db -24 db -30 db -36 db Two second of 300 Hz sinewave plus second sinewave at adjacent frequency and at lower amplitude. Total power is constant.
16
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 15 MP3: MPEG Layer III 32 subband filters working over 1152 sample windows (~ 26 msec) Subband outputs processed with MDCT (modified DCT), 18 frequencies A total of 576 frequencies Four window types, more than one window size
17
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 16 Window Types The window functions of Layer III coding. At (a) is the normal long window, whereas (b) shows the short window used to handle transients. Switching between window sizes requires transition windows (c) and (d). An example of switching using transition windows is shown in (e).
18
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 17 MPEG Layer III coder The Layer III coder. Note the connection between the buffer and the quantizer which allows different frames to contain different amounts of data.
19
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 18 Examples of MP 3 Compression Audio (13 seconds) Music (13 seconds)
20
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 19 MP3 Compression Samples SpeechMusic wav 8 kbit/sec 16 kbit/sec 32 kbit/sec 64 kbit
21
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 20 Course Overview Lossless compression Huffman coding Lossy compression Image Predictive (DPCM) Block transform (DCT) Video Motion estimation Standards JPEG MPEG/ITU Performance evaluation Special Hardware
22
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 21 JPEG Subsampling component images MCU Zig-zag coding Quantization JPEG run-length coding Color conversion JPEG bit stream Not just still images!
23
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 22 MPEG/ITU Family of Standards I, P, B pictures Different amounts of compression Types of coding in pictures I, P, B, skipped blocks Bit stream structure GOP ≠ giga operations Standards and subsampling Display and coding resolutions are different Color conversion MPEG-2 profiles
24
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 23 Performance Evaluation Basic evaluation: instruction counts Where’s the beef? JPEG MPEG Implementation Computer RISC Cores DSP Special Hardware
25
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 24 Special Hardware Hardware for DCT RAC Hardware for Motion Estimation Linear arrays Issues Best algorithm for hardware ≠ best algorithm for software Hardware removes program overhead parallel ALU and address calculation Multiple processing units
26
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 25 Big Picture Imaging/Video is key technology Computer Communications Entertainment television, movies Games Photography Instrumentation Web Database System perspective hardware/software system
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.