1 Image Transcoding in the block DCT Space Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur,

1 Image Transcoding in the block DCT Space Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur, 721302, India jay@cse.iitkgp.ernet.in

2 Transcoding DWT to DCT

3 Analysis Synthesis Forward DWT and Inverse DWT Discrete Wavelet Transform

4 2-D DWT  1D DWT applied to vertical and horizontal direction.  For Multilevel DWT:-  The LL band is recursively decomposed, vertically and horizontally.  Filtering is performed in time domain. Image in spatial domain LH HL HH 2LL2HL 2LH2HH LL LH HL HH Decomposition 1 st Level 2 nd Level

5 DCT domain Upsampling with Zero Insertion Type-II DCT of upsampled signal as obtained through zero insertion of signal x(n) is computed by: Note:- DCT obtained is referred as upsampled DCT.

6 A typical conversion matrix 4x4 block to 8x8 upsampled type-II DCT For even sample xoxoxoxo… For odd sample oxoxoxox…

7 Computation of upsampled DCT

8 Wavelet synthesis in the DCT domain  Result is transcoded type-II DCT coefficients.  Operation is equivalent to IDWT + DCT.

9 Transcoding in the DCT domain Composite Operation

10 Wavelet to DCT Transcoding (WDT) Viswanath, Mukherejee and Biswas (2009), Springer Journal on SIVP,

11  WDT computes DCT blocks of larger size.  By decomposing, 8x8 blocks are obtained.  Cost of transcoding increases.  To reduce the cost, smaller blocks are to be considered. WBDT: * Three blocks of NxN size are used in a wavelet subband of size WxW. * Blocks are Upsampled, composed, filtered and decomposed. * The transcoded blocks of size 2Nx2N. Note:- N=4 for JPEG based applications. Wavelet to Block DCT Transcoding (WBDT) This work is published in Springer Journal on SIVP, Jan 2009

12 T L, T H are pre-computed Let and Wavelet to Block DCT Transcoding (WBDT)

13  WBDT computes DCT blocks of size 8x8,  Uses composing and decomposing of DCT blocks.  Requires Type-I DCT of filter and Type-II DCT of data.  Linear Filtering in the DCT domain:  Requires only Type-II DCT.  Blocks are processed separately and added together. The WBDTL, * Three adjacent NxN size subband blocks are used. * Upsampled blocks filtered separately and the results are added. * Computes the transcoded blocks of size 2Nx2N. Note:- For JPEG based applications, N=4. Wavelet to Block DCT Transcoding by Linear Filtering (WBDTL)

14 Where and are the even upsampled DCT blocks of wavelet approximation subband blocks and DCT matrices of the lowpass synthesis filter. are the odd upsampled DCT blocks of wavelet detail subband blocks and DCT matrices of the highpass synthesis filter. Where and Wavelet to Block DCT Transcoding by Linear Filtering (WBDTL)

15 Wavelet to Block DCT Transcoding by Linear Filtering (WBDTL)

16 Transcoding with Multilevel DWT LH HL HHLH HL HH 2LL2HL 2LH2HH LL 2LL2HL 2LH2HH 2 nd level Transcoding LH HL HH LL 1 st level Transcoding Reconstructed image DCT Blocks

17 For a tile size of 64 x 64 In JPEG2000 For blocks of size 8 x 8 Cost of Multilevel Transcoding Hybrid Approach

18 JPEG2000 9/7 wavelet filters Analysis Filter Coefficients Synthesis Filter Coefficients i Lowpass H(i) Highpass G(i) Lowpass H ’ (i) Highpass G ’ (i) 0 0.6029 -1.11511.1151-0.6029 0.2669 0.5913 0.2669 -0.0782 0.0575-0.0575 0.0782 -0.0169 -0.0913 -0.0169 0.0267-0.0267

19 Analysis Filters

20 Synthesis Filters

21 Wavelet filters and responses

22 Wavelet filters and responses

23 Other wavelet filters

24 Results PSNR close to ~300 dB

25 Results: Quantization of wavelet subbands For Lena image Large number of coefficients are zeros No. of zero coefficients

26 Observations  The proposed new algorithms for transcoding the DCT coefficients from wavelet coefficients are computationally efficient.  Both the approaches (WBDT and WBTL) are found to be equivalent.  Proposed transcoding achieves PSNR performances as same as the spatial domain technique.

27 Block DCT to wavelet transcoding  The technique is based on the combined step of block filtering and downsampling directly in the DCT domain.  Filtering matrices are computed using wavelet analysis filters.  Three adjacent DCT blocks are used in transcoding.

28 Block DCT to wavelet transcoding Composite operation

29 For LL subband For LH subband Downsampling from block DCT

30 Block DCT to Wavelet Transcoding (BWT) H L and H H are computed from Type-I DCT of the analysis filters. In 1-D: In 2-D:

31 Transcoding with Linear Filtering

32 Block DCT to Wavelet Transcoding with Linear Convolution (BWTL) This work is Accepted in IEEE ICIP 2009

33 T L Matrices using 9/7 analysis lowpass filter

34 DCT to Wavelet: Complexity

35 Results: DCT to Wavelet transcoding PSNR close to 300 dB Ref: ST subbands Almost equal Almost equal

36 Results: DCT Quantization effect on transcoding

37 Results: DCT Quantization effect on transcoding

38 Quantization effect on transcoded subbands More degradation of quality in lower frequency subbands This work is under revision in Springer Journal on SIVP

39 Performance comparison with spatial transcoding Reference: Original Image

40  Spatial transcoding and the BWTL transcoding techniques perform at par.  However, the BWT technique perform the best among them.  This is due to the fact that the rounding errors are accumulating in spatial domain transcoding and the BWTL techniques.  But the BWT technique operates with three blocks simultaneously, and thereby rounding errors are reduced. Observations

41 JPEG2000 to JPEG

42 JPEG2000 Compression Standard  JPEG2000 is an emerging image compression standard  Uses Discrete wavelet transform (DWT) as transform.  Transform coefficients are quantized an encoded.

43 Advantages:  JPEG 2000 permits high compression ratios (CR’s) with little degradation in image quality.  An image can either be compressed without any distortion or data loss, ( “reversible” or “lossless” ), or it can be compressed so that there are slight perceptible differences from the original under normal viewing conditions ( “lossy” ).  An image can contain multiple embedded image quality layers  ROI Certain parts of image can be coded in better quality than others and many other functionalities.  JPEG2000 likely to become the technology of choice in image servers, medical imaging, digital cinema, security systems and digital photography.  JPEG 2000 will be used in other areas like mobile imaging applications. JPEG 2000

44 JPEG Compression Standard  JPEG divides up the image into 8 by 8 pixel blocks, calculates the DCT of each block.  A quantizer rounds off the DCT coefficients according to the quantization matrix. This step produces the "lossy" nature of JPEG, but allows for large compression ratios.  Uses a variable length code on these coefficients, and then writes the compressed data stream to an output file (*.jpg).  For decompression, JPEG recovers the quantized DCT coefficients from the compressed data stream, takes the inverse transforms and displays the image.

45 Transcoding JPEG2000 to JPEG  Encoding of JPEG2000 usually gives 2-4 dB higher PSNR values at the same compression level.  Average decoding time for JPEG2000 is almost five times higher compared to JPEG  Current web browsers do not support JPEG 2000 natively.  Since JPEG is the most common compression standard and has very low complexity, transcoding to JPEG provides a smooth upgrade path.

46 Spatial transcoding

47 Transform Domain Transcoder

48  Since impulse responses of JPEG2000 wavelet filters are symmetric, DCT domain filtering can be used.  JPEG2000 encoded using 2-D wavelet transforms by incorporating 9-7 analysis filters.  The wavelet coefficients are encoded with different quantization levels to achieve different compression rate. Tanscoding…

49 Tanscoding… Quantization in JPEG2000[25,28] Where Quantized coefficients are given by -is the coefficient at subband b -is the quantization step, depends on the number of bits allocated for encoding (15) ‏

50 Tanscoding… Dequantizing in Decoding  Encoder encodes M b bit planes, we can decode N b bit planes.  Remaining bit planes are set to zero Instead of performing IDWT on the denormalized coefficients, TBFCD algorithm is used for transcoding. (16) ‏

51 Transcoding… Inputs  Wavelet coefficients bands LL,LH,HL,HH each of size WxW  L, M are two numbers N=8;  L =2(Width of wavelet band)/N  M=2(Height of wavelet band)/N  Type-I DCT of filter response for the corresponding band Output: NxN block type-II DCT coefficients of the reconstructed image.

52 Tanscoding… Begin{  Get the four decompositions-a low resolution approximation of image in approximation (LL) band, horizontal (HL), vertical (LH) and diagonal (HH) bands.  Compute the upsampled block type-II DCT for each band of size B LNxNM.  Get the filtered block of each band by point-wise multiplication of H and B.  Add the four filtered blocks to get transcoded type-II DCT block.  Decompose the transcoded DCT block to 8x8 DCT blocks. }End  Quantize the DCT blocks by quantization matrix and entropy encode to get JPEG image.  Here scaling the quantization matrix can vary the compression rates and reconstructed image quality.

53 Transform domain Transcoding

54 Computational complexity

55  Spatial Transcoding  IDWT cost  DCTcost  Transform Domain  Upsampled DCT cost  DCT domain Filtering cost  DCT block Decomposing cost Computational complexity

56 IDWT cost for filter length FL1 and FL2 IDWT Complexity depends on the length of the wavelet filter FL. Requires FL multiplications and (FL-1) additions.

57 Spatial domain cost 8x8 block DCT cost : Per pixel complexity: IDWT cost per pixel: Total cost

58 Computational Complexity comparison

59 Computational Complexity comparison

60 JPEG2000 Decoding Original Image PSNR=41.5460 at 1.044 bpp

61 Transcoding Results Spatial Domain bpp=0.720, PSNR=35.77 Transform Domain bpp=0.7208, PSNR=35.77

62 IMAGE to JPEG bpp=0.712 PSNR=35.8082Original Image

63 Transcoder Performance evaluation

64  Guaranteed PSNR(M g ): The minimum PSNR value with respect to the JPEG2000 decoded image to be achieved by the transcoder. Here we used a typical value of 40 dB.  Equivalent rate (ρ eq ): The minimum compression rate of the transcoded JPEG image providing at least the guaranteed PSNR with respect to the JPEG2000 decoded image. The equivalent rate is obtained at 40 dB point with respect to JPEG2000 decoded image.  Equivalent PSNR (M eq ): The transcoder PSNR with respect to the original image at the equivalent rate. We have defined three measures as follows:

65 JPEG2000 to JPEG Transcoder Performance

66 Transcoder Performance

67 Equivalent PSNR comparison

68 Thanks

69 where, Type I 2D-DCT Type-I 2-D DCT of a block

70       1,0,cos,)‏( ( 2, 1 0 1 0 2 12 2 12         Nk nmxk N kX N m N n N n N km II   where, Type II 2D-DCT Type-II 2-D DCT of a block

1 Image Transcoding in the block DCT Space Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur,

Similar presentations

Presentation on theme: "1 Image Transcoding in the block DCT Space Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur,"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

1 Image Transcoding in the block DCT Space Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur,

Similar presentations

Presentation on theme: "1 Image Transcoding in the block DCT Space Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur,"— Presentation transcript:

Similar presentations

About project

Feedback