Presentation is loading. Please wait.

Presentation is loading. Please wait.

A Nonlinear Loop Filter for Quantization Noise Removal in Hybrid Video Compression Onur G. Guleryuz DoCoMo USA Labs

Similar presentations


Presentation on theme: "A Nonlinear Loop Filter for Quantization Noise Removal in Hybrid Video Compression Onur G. Guleryuz DoCoMo USA Labs"— Presentation transcript:

1 A Nonlinear Loop Filter for Quantization Noise Removal in Hybrid Video Compression Onur G. Guleryuz DoCoMo USA Labs guleryuz@docomolabs-usa.com

2 2 Overview A loop filter based on denoising with an over-complete bank of transforms. Combats all types of quantization noise (blocking, ringing,...). Applicable to block or lapped compression transforms. Consistently better than h.264 loop filter in rate-distortion. Good visual quality, especially around edges and high frequency regions. Advanced signal processing ingredients in a tight package. Hardware friendly complexity.

3 3 MC Prediction In the loop, nonlinear filtering of video: Encoder Side Encoder + + Current frame + - + + Previously decoded frame Coded differential Nonlinear Denoising Filter

4 4 MC Prediction Decoder side Decoder + + + Previously decoded frame Decoded differential Nonlinear Denoising Filter Display

5 5 Objective of the Loop Filter Nonlinear Denoising Filter : Tries to make the decoded frame as close to original as possible Quantization noise removal. (In a rate-distortion sense)

6 6 Nonlinear Denoising Filter The nonlinear denoising filter adapts to nonstationary image statistics using localized linear transforms and hard-thresholding. (The filter automatically becomes high-pass, or low-pass, or band-pass, etc., depending on the region it is operating on) Signal processing ingredients: Statistically sound (cross-correlation robust) denoising. Overcomplete, translation invariant, transforms. Thresholding strategy based on conditional expectations. Weighted denoising. Compression mode based decisions. Properties of the Proposed Filter

7 7 Transform 1Transform 2 Transform 4 Transform 3 Transform 5 Transform 13 Transform 16 What are translation invariant transforms? E.g., Translation invariant 4x4 DCTs:

8 8 Get, and evaluate translation invariant transforms ( ). Denoise the resulting coefficients using per-coefficient thresholds and a denoising rule. Compute a weighted inverse for a denoised estimate. Use the computed estimate to modify the denoising rule, and re-do everything with the new rule. Put everything in a tight package so that complexity is manageable. Main Idea Suppose all images and transform coefficients are arranged into vectors : Original frame :Decoded frame

9 9 Coefficient denoising Weighted inverse Masking Transform (a) Executed for k=0. (b) Executed for k=1,2. Algorithm Flow

10 10 provide sparse decompositions for. For most pixels we will have at least one of the transforms providing sparsity. Ingredient: Translation Invariant Transforms We assume

11 11 Transform 1 Transform 7 E.g., piecewise smooth image with an edge Sparse DCT block (many small coefficients inside the block). High performance denoising. Non-sparse DCT block (many large coefficients inside the block). Low performance denoising. Legend:

12 12 Ingredient: Cross-Correlation Robust Coefficient Denoising Rule Onur G. Guleryuz, ``Linear, Worst-Case Estimators for Denoising Quantization Noise in Transform Coded Images,'' accepted, IEEE Transactions on Image Processing In removing quantization noise, the additive “noise” is correlated with data. ( Cannot pretend that noise is independent.) These cross correlations are unknown. Must use cross-correlation robust techniques ( We use techniques that are optimal for the worst-case cross correlation. ) Transform

13 13 Coefficient Denoising Rule Estimate using (Please ask the presenter or check the paper for what “condition1” is in general) Thresholding rule: min-max optimal estimate (Per-coefficient thresholds) (Since denoising transforms are block, thresholds are actually per- block. Note that denoising trf. blocks are different from coded- blocks)

14 14 Ingredient: Weighted Inverse for High Performance Around Edges Onur G. Guleryuz, ``Weighted Overcomplete Denoising,'‘ Proc. Asilomar Conference on Signals and Systems, Pacific Grove, CA, Nov. 2003 Transform 1 Transform 7 pixel n The denoised block using transform 7 should contribute more at pixel n, i.e., (Weighted denoising becomes cross-cor. robust when the weights are constrained to sum to one.) (Weight of each denoising block ~ 1/(16-number_of_zero_coeffs).)

15 15 Ingredient: Compression Mode Based Decisions Video frame Intra coded-block P coded-block with quantized data P coded-block with no quantized data, but with a motion difference. P coded-block with no quantized data, and no motion difference.... Similar to h.264 loop filter, different types of coded-block boundaries undergo denoising with different “strengths”. Modes determine per-coefficient (per denoising-block) thresholds. (Rationale: Different types of coded-blocks have different quantization error statistics. We also do not want to filter what we have filtered before....

16 16 Vertical coded- block boundary Example set of pixels influenced by the vertical boundary, forming a one pixel thick shell. Masking (Masking is useful at coded-block boundaries where there are no non-zero coefficients transmitted, i.e., cases of coded-block boundaries between motion-only coded-blocks.) Adjust thickness of shells based on coded-block modes. mask(n)=1 inside the shell, mask(n)=0 outside. Modes determine shell thickness.

17 17 Results We incorporated this work into h.264 reference software (JM9.5) to generate rate-distortion results on video sequences. We also provide INTRA-only results for comparion. h.264* : JM9.5 with loop filtering disabled. h.264 : JM9.5 with h.264 loop filter. Proposed : JM9.5 with denoising loop filter. Default encoder.cfg, except adaptive rounding is on. All sequences QCIF (Adaptive rounding gives the best R-D performance for all three codecs. The proposed filter is consistently better than h.264. Q: Why all QCIF? A: DoCoMo. Results are typical for CIF as well. Proposed provides ~ %10 improvements at typical bit-rates. How the encoder quantizes matters, please ask the presenter why. Please check the paper to see comparison against a windows-media type loop filter.)

18 18 Rate-Distortion Performance Foreman

19 19 Rate-Distortion Performance Silence

20 20 Rate-Distortion Performance Sleepy

21 21 Rate-Distortion Performance Car

22 22 INTRA-only Car - QP2024283236 h264* (dB)44.0840.6437.4034.0030.92 h264-.01+.09+.14+.21+.29 Proposed+.30+.44+.53+.54+.55 Foreman - QP2024283236 h264* (dB)43.7240.2137.1333.9230.98 h264-.06+.04+.07+.16+.28 Proposed+.26+.39+.53+.57 (h264 and Proposed rows are dB improvements over h264*) Sleepy - QP2024283236 h264* (dB)46.6843.5140.6437.5734.69 h264+.02+.16+.21+.32+.40 Proposed+.30+.52+.62+.67+.68 Silence - QP2024283236 h264* (dB)43.5739.8936.7233.5330.77 h264-.02+.05+.01+.11+.27 Proposed+.22+.34+.41+.49+.58

23 23 Visual Quality (Visual quality results around singularities are hard to see on printouts. Please ask the presenter to show visual quality results on notebook screen.) Very good visual quality especially around edges/singularities.

24 24 All integer (32-bit) algorithm with fast transforms (4x4-DCT) and fast overcomplete transforms. Bit-precise results (all devices that implement this algorithm generate exactly the same results). Faster no multiplies algorithm possible (Hadamard + shift-weights). Time complexity about 20% of our proprietary decoder (this decoder similar in complexity to windows media 9). Hardware friendly algorithm. Conditional code executions not required – worst case filtering complexity is for INTRA frames (this is less than 10% of h.264* INTRA encoding). Please ask the presenter for operation count. Complexity and Implementation Issues

25 25 Conclusion Combats all types of quantization noise (blocking, ringing,...). ~ %10 improvements in bit-rate at typical bit-rates/sequences. Applicable to all (I,P,B,*) frame types. ~ 0.5 dB improvements on I Frames (lap or block transform compressed) at typical bit-rates/sequences. Good visual quality, especially around edges and high frequency regions. Significant improvements are when video contains non-smooth regions, rapid scene changes,... ( ~ whenever there are a significant number of non-zero quantized transform coefficients). Hardware friendly.


Download ppt "A Nonlinear Loop Filter for Quantization Noise Removal in Hybrid Video Compression Onur G. Guleryuz DoCoMo USA Labs"

Similar presentations


Ads by Google