Presentation is loading. Please wait.

Presentation is loading. Please wait.

Communication & Multimedia C. -Y. Tsai 2005/8/17 1 MCTF in Current Scalable Video Coding Schemes Student: Chia-Yang Tsai Advisor: Prof. Hsueh-Ming Hang.

Similar presentations


Presentation on theme: "Communication & Multimedia C. -Y. Tsai 2005/8/17 1 MCTF in Current Scalable Video Coding Schemes Student: Chia-Yang Tsai Advisor: Prof. Hsueh-Ming Hang."— Presentation transcript:

1 Communication & Multimedia C. -Y. Tsai 2005/8/17 1 MCTF in Current Scalable Video Coding Schemes Student: Chia-Yang Tsai Advisor: Prof. Hsueh-Ming Hang Institute of Electronics, NCTU

2 Communication & Multimedia C. -Y. Tsai 2005/8/17 2 Outline Overview Overview MCTF in Interframe Wavelet MCTF in Interframe Wavelet MCTF in JSVM MCTF in JSVM Comparison Comparison References References

3 Communication & Multimedia C. -Y. Tsai 2005/8/17 3 Outline Overview Overview Scalable Video Coding Scalable Video Coding MCTF in Interframe Wavelet MCTF in Interframe Wavelet MCTF in JSVM MCTF in JSVM Comparison Comparison References References

4 Communication & Multimedia C. -Y. Tsai 2005/8/17 4 Scalable Video Coding Ability to adjust Ability to adjust Different client requirements Different client requirements Scalabilities Scalabilities Rate/SNR Rate/SNR Spatial Spatial Temporal Temporal

5 Communication & Multimedia C. -Y. Tsai 2005/8/17 5 MCTF MCTF = Motion Compensated Temporal Filtering

6 Communication & Multimedia C. -Y. Tsai 2005/8/17 6 Rate/SNR Scalability Progressive approximation Progressive approximation GOP HeaderMotion Info.Image Data 300kbps PSNR=32.2 dB 500kbps PSNR=34.6 dB 1000kbps PSNR=38.2 dB

7 Communication & Multimedia C. -Y. Tsai 2005/8/17 7 Spatial Scalability Wavelet decomposition provides spatial scalability Wavelet decomposition provides spatial scalability Bit-plane Coder

8 Communication & Multimedia C. -Y. Tsai 2005/8/17 8 HHHH H2H2 H2H2 H3H3 HHH H1H1 H2H2 H2H2 H3H3 L H4H4 15Hz Video Sequence 7.5Hz Video Sequence 30Hz Video Sequence 3.25Hz Video Sequence Temporal Scalability

9 Communication & Multimedia C. -Y. Tsai 2005/8/17 9 Scalable Video Coding History History 2004.3 2005 2004.7 MSRA (wavelet) RPI (wavelet) UNSW (wavelet) HHI (AVC-based)  JSVM

10 Communication & Multimedia C. -Y. Tsai 2005/8/17 10 Approaches An AVC/H.264-based approach (also DCT-based)

11 Communication & Multimedia C. -Y. Tsai 2005/8/17 11 Approaches A wavelet-based approach with t+2D structure.

12 Communication & Multimedia C. -Y. Tsai 2005/8/17 12 Approaches A wavelet-based approach with 2D+t structure

13 Communication & Multimedia C. -Y. Tsai 2005/8/17 13 Lifting Scheme 5/3 lifting scheme 5/3 lifting scheme

14 Communication & Multimedia C. -Y. Tsai 2005/8/17 14 Outline Overview Overview MCTF in Interframe Wavelet MCTF in Interframe Wavelet Barbell lifting Barbell lifting In-band MCTF In-band MCTF Base-layer structure Base-layer structure MCTF in JSVM MCTF in JSVM Comparison Comparison References References

15 Communication & Multimedia C. -Y. Tsai 2005/8/17 15 Barbell Lifting Scheme Purpose: Purpose: Improve the accuracy of motion field. Improve the accuracy of motion field. Methods: Methods: Take (5,3) wavelet kernel. Take (5,3) wavelet kernel. Use “ barbell function ” to generate prediction /update values. Use “ barbell function ” to generate prediction /update values.

16 Communication & Multimedia C. -Y. Tsai 2005/8/17 16 Barbell Lifting Scheme

17 Communication & Multimedia C. -Y. Tsai 2005/8/17 17 Barbell Lifting Scheme Prediction Stage Update Stage

18 Communication & Multimedia C. -Y. Tsai 2005/8/17 18 In-Band MCTF Purpose: Purpose: Improve coding performance with spatial scalability Improve coding performance with spatial scalability Methods: Methods: Leaky motion compensation Leaky motion compensation Mode-based temporal filtering Mode-based temporal filtering

19 Communication & Multimedia C. -Y. Tsai 2005/8/17 19 In-Band MCTF The forming of different quality reference of LL The forming of different quality reference of LL Low quality reference as IP_DIR Low quality reference as IP_DIR High quality reference as IP_LBS High quality reference as IP_LBS

20 Communication & Multimedia C. -Y. Tsai 2005/8/17 20 In-Band MCTF Leaky motion compensation Leaky motion compensation leaky factor leaky factor Attenuate the prediction based on the unknown information at the decoder Attenuate the prediction based on the unknown information at the decoder make a good trade-off between drifting errors and coding efficiency make a good trade-off between drifting errors and coding efficiency

21 Communication & Multimedia C. -Y. Tsai 2005/8/17 21 In-Band MCTF Mode-based temporal filtering Mode-based temporal filtering Mode I: Low quality reference Mode I: Low quality reference Mode 2: High quality reference Mode 2: High quality reference Mode is selected by RD cost Mode is selected by RD cost

22 Communication & Multimedia C. -Y. Tsai 2005/8/17 22 Base-Layer Structure Purpose: Purpose: Coding efficiency improvement in low rates Coding efficiency improvement in low rates AVC compatible AVC compatible Methods: Methods: Insert AVC encoding module into MCTF Insert AVC encoding module into MCTF

23 Communication & Multimedia C. -Y. Tsai 2005/8/17 23 Base-Layer Structure Encoder Decoder

24 Communication & Multimedia C. -Y. Tsai 2005/8/17 24 Outline Overview Overview MCTF in Interframe Wavelet MCTF in Interframe Wavelet MCTF in JSVM MCTF in JSVM Base layer structure Base layer structure Inter-layer prediction Inter-layer prediction Adaptive prediction/update steps Adaptive prediction/update steps Comparison Comparison References References

25 Communication & Multimedia C. -Y. Tsai 2005/8/17 25 Base Layer Structure Purpose Purpose Coding efficiency improvement in low rates Coding efficiency improvement in low rates Compatibility to AVC Compatibility to AVC Methods Methods Unrestricted MCTF (UMCTF) Unrestricted MCTF (UMCTF) Hierarchical B pictures Hierarchical B pictures

26 Communication & Multimedia C. -Y. Tsai 2005/8/17 26 Base Layer Structure UMCTF UMCTF Update step is omitted. Update step is omitted. Hierarchical B pictures Hierarchical B pictures Fully compatible to AVC Main profile Fully compatible to AVC Main profile Non-dyadic decomposition is available Non-dyadic decomposition is available

27 Communication & Multimedia C. -Y. Tsai 2005/8/17 27 Non-Dyadic Decomposition

28 Communication & Multimedia C. -Y. Tsai 2005/8/17 28 Inter-Layer Prediction Purpose Purpose Reduce redundancy between layers Reduce redundancy between layers Methods Methods Inter-layer texture prediction Inter-layer texture prediction Inter-layer motion prediction Inter-layer motion prediction

29 Communication & Multimedia C. -Y. Tsai 2005/8/17 29 Inter-Layer Prediction

30 Communication & Multimedia C. -Y. Tsai 2005/8/17 30 Adaptive Prediction/Update Steps Purpose: Purpose: Delay (Memory)control Delay (Memory)control Method: Method: Sub-partitioning of GOP Sub-partitioning of GOP

31 Communication & Multimedia C. -Y. Tsai 2005/8/17 31 Adaptive Prediction/Update Steps

32 Communication & Multimedia C. -Y. Tsai 2005/8/17 32 Outline Overview Overview MCTF in Interframe Wavelet video MCTF in Interframe Wavelet video MCTF in JSVM MCTF in JSVM Comparison Comparison Cons and pros Cons and pros Experimental results Experimental results References References

33 Communication & Multimedia C. -Y. Tsai 2005/8/17 33 Wavelet Based SVC Key features Key features 3D wavelet decomposition 3D wavelet decomposition Open-loop prediction structure Open-loop prediction structure Spatial-temporal resolution scalability Spatial-temporal resolution scalability SNR scalability SNR scalability

34 Communication & Multimedia C. -Y. Tsai 2005/8/17 34 Wavelet Based SVC Advantages Advantages Nature for multi-resolution scalability Nature for multi-resolution scalability Open-loop prediction structure Open-loop prediction structure Provides elegant SNR scalability without impairing full exploitation of spatial-temporal correlation Provides elegant SNR scalability without impairing full exploitation of spatial-temporal correlation Simplifies the R-D model of the bitstreams. Simplifies the R-D model of the bitstreams. Facilitates the bitstream truncation Facilitates the bitstream truncation each subband is independent with other subbands each subband is independent with other subbands

35 Communication & Multimedia C. -Y. Tsai 2005/8/17 35 Wavelet Based SVC Disadvantages Disadvantages Decomposition modes (coding modes) selection Decomposition modes (coding modes) selection Texture & side information trade off Texture & side information trade off Intra-prediction Intra-prediction Badly-matched blocks Badly-matched blocks Downsampling filter problems Downsampling filter problems

36 Communication & Multimedia C. -Y. Tsai 2005/8/17 36 AVC Based SVC Key features Key features MCTF/Hierarchical B structure for temporal scalability MCTF/Hierarchical B structure for temporal scalability Hierarchical B structure with close-loop structure for base layer Hierarchical B structure with close-loop structure for base layer Multiple spatial layers for spatial scalability Multiple spatial layers for spatial scalability Multiple FGS layers at each spatial resolution for SNR scalability Multiple FGS layers at each spatial resolution for SNR scalability DCT coding of all the frames DCT coding of all the frames

37 Communication & Multimedia C. -Y. Tsai 2005/8/17 37 AVC Based SVC Advantages Advantages All the RDO and intra-prediction can be used. All the RDO and intra-prediction can be used. It guarantees the quality of the first testing point. It guarantees the quality of the first testing point. MPEG filter for low resolution video MPEG filter for low resolution video the target low resolution video is visually good. the target low resolution video is visually good.

38 Communication & Multimedia C. -Y. Tsai 2005/8/17 38 AVC Based SVC Disadvantages Disadvantages Redundancy between spatial layers Redundancy between spatial layers

39 Communication & Multimedia C. -Y. Tsai 2005/8/17 39 Experiments

40 Communication & Multimedia C. -Y. Tsai 2005/8/17 40 Experiments

41 Communication & Multimedia C. -Y. Tsai 2005/8/17 41 References [1] “ Draft of joint scalable video model (JSVM)3.0 reference encoding algorithm description ”, ISO/IEC JTC1/SC29/WG11, N7311, Poznan, July 2005. [2] D. Zhang, J. Xu, H. Xiong, and F. Wu, “ Improvement for in-band video coding with spatial scalability ”, ISO/IEC JTC1/SC29/WG11, M11681, HongKong, Jan. 2005. [3] V. Bottreau, G. Pau, and J. Xu, “ Vidwav evaluation software manual ”, ISO/IEC JTC1/SC29/WG11, M12176, Poznan, July. 2005. [4] X. Ji, J. Xu, D. Zhao, and F. Wu, “ Responses of CE1d: base- layer ”, ISO/IEC JTC1/SC29/WG11, M11127, Redmond, July 2004. [5] R. Xiong, J. Xu, and F. Wu, “ Coding performance comparison between MSRA wavelet video coding and JSVM ”, ISO/IEC JTC1/SC29/WG11, M11975, Busan, April 2005. [6] R. Xiong, J. Xu, and F. Wu, “ Response to VidWav EE1 ”, ISO/IEC JTC1/SC29/WG11, M12286, Poznan, July 2005. [7] J. Reichel, K. Hanke and B. Popescu, “ Scalable Video Model V1.0 ”, ISO/IEC JTC1/SC29/WG11, N6372, Munich, March 2004.


Download ppt "Communication & Multimedia C. -Y. Tsai 2005/8/17 1 MCTF in Current Scalable Video Coding Schemes Student: Chia-Yang Tsai Advisor: Prof. Hsueh-Ming Hang."

Similar presentations


Ads by Google