Download presentation
Presentation is loading. Please wait.
Published byCuthbert Burke Modified over 8 years ago
1
FAST MODE DECISION ALGORITHM FOR INTRA PREDICTION IN HEVC FINAL REPORT Lanka Naga Venkata Sai Surya Teja Student ID 1000916473 Mail ID nvssurya.lanka@mavs.uta.edunvssurya.lanka@mavs.uta.edu Date 05/06/2014 UNDER THE GUIDANCE OF Dr. K. R. Rao
2
PROPOSAL To improve the coding efficiency of intra frame coding, up to 34 intra prediction modes are defined in High Efficiency Video Coding (HEVC) [1] The best mode among these pre-defined intra prediction modes is selected by rate-distortion optimization (RDO) for each block This project describes a new method to reduce the candidates in RDO process, as it will be time-consuming if all directions are tested in the RDO process when compared with the default method of intra prediction in HM 13.0. Also analysis of PSNR, BD- PSNR, BD- Bitrate can be done by comparing with the default encoding scheme in HM 13.0 [5].
3
INTRODUCTION HEVC standard [2] provides a highly flexible hierarchy of unit representation which consists of three block concepts: coding unit (CU), prediction unit (PU), and transform unit (TU). This separation of the block structure is helpful for each unit of optimization. CU is a macroblock-like unit of region splitting which is always square and its size can be from 8x8 luma samples up to the largest coding units (LCUs). The PU is used only for the CU which is the leaf node in the quadtree structure and the size of two PUs are 2Nx2N and NxN. The third block concept transform unit size cannot exceed that of the CU.
4
BLOCK DIAGRAM OF HEVC ENCODER[15]
5
BLOCK DIAGRAM OF H.264 ENCODER[20]
6
BLOCK DIAGRAM OF HEVC DECODER[15]
7
Since encoder needs to exhaust all the combinations of CU, PU and TU to find the optimal solutions, it is very time- consuming The encoder will not tolerate it if all the directions are employed in the rate-distortion optimization process To reduce the computational complexity of the encoder, a fast intra mode decision [7] was adopted in HM13.0 [5]
8
CODING TREE ALGORITHM
9
OVERVIEW OF INTRA PREDICTION In H.264, intra prediction [6][7][8][9] is based on spatial extrapolation of samples from previously decoded image blocks, followed by integer discrete cosine transform (DCT) [10] based coding HEVC utilizes the same principle, but further extends it to efficiently represent wider range of textural and structural information in images HEVC contains several elements improving the efficiency of intra prediction over earlier solutions
10
HEVC INTRA PREDICTION MODES [7]
11
H.264 INTRA PREDICTION MODES[21]
12
METHOD FOR FAST MODE DECISON ALGORITHM FOR INTRA PREDICTION The fast intra prediction consists of three steps: 1.Hadamard Transformed Coefficients Of Residual Signal[13] 2.Progressive Mode Search[13] 3.Early RDOQ Termination[13]
13
TEST SEQUENCES [1] BQSquare_416x240_60
14
[2] BQMall_ 832x480_60
15
[3] KristenAndSara_1280x720_60
16
Unmodified method is nothing but basic HM 13.0 and modified method is fast mode decision algorithm [13].
17
EXPERIMENTAL RESULTS of BQMall_832x480_60 Analysis of results for unmodified method with number of frames 10 QPBITRATE (kbps) PSNR (avg) db ENCODIN G TIME (sec) 2423754.81 60 41.137176.624 2816010.59 20 38.6744161.116 3210612.80 00 36.2830151.866 348447.760 0 34.9856134.619 Analysis of results for modified method with number of frames 10 QPBITRATE (kbps) PSNR (avg) db ENCODI NG TIME (sec) 2424046.1 849 40.9975134.234 2816227.4 525 38.5344120.837 3210718.9 28 36.1320112.380 348532.23 76 34.847398.271
18
EXPERIMENTAL RESULTS of BQMall_832x480_60 Analysis of results for unmodified method with number of frames 30 QPBITRATE (kbps) PSNR (avg) db ENCODIN G TIME (sec) 2423836.52 80 41.1352519.561 2816075.40 80 38.6716515.1675 3210704.52 80 36.2715426.831 348506.496 0 34.9803407.187 Analysis of results for modified method with number of frames 30 QPBITRATE (kbps) PSNR (avg) db ENCODI NG TIME (sec) 2424074.8 932 40.9944396.944 2816236.1 620 38.5300383.799 3210811.5 732 36.1225312.44 348591.56 09 34.8425297.246
19
EXPERIMENTAL RESULTS of KristenAndSara_1280x720_60 Analysis of results for unmodified method with number of frames 10 QP BITRATE (kbps) PSNR (avg) dB ENCODIN G TIME (sec) 2417865.552044.5196301.597 2811800.896042.6232272.229 327907.904040.6527277.846 346449.616039.5625262.815 Analysis of results for modified method with number of frames 10 QP BITRATE (kbps) PSNR (avg) dB ENCODIN G TIME (sec) 2417976.824041.1352232.229 2811939.905638.6716204.993 327984.642536.2715202.827 346498.884834.9803190.540
20
EXPERIMENTAL RESULTS of KristenAndSara_1280x720_60 Analysis of results for unmodified method with number of frames 30 QP BITRATE (kbps) PSNR (avg) dB ENCODIN G TIME (sec) 2417937.072044.5023910.946 2811845.024042.6064 885.948 327935.360040.6329 1090.753 346466.544039.54191148.507 Analysis of results for modified method with number of frames 30 QP BITRATE (kbps) PSNR (avg) dB ENCODIN G TIME (sec) 2417996.821144.4241696.873 2811992.264042.5682668.890 327989.068040.5349812.610 346496.236039.4498844.152
21
EXPERIMENTAL RESULTS of BQSquare_416x240_60 Analysis of results for unmodified method with number of frames 10 QP BITRATE (kbps) PSNR (avg) dB ENCODIN G TIME (sec) 2410885.872040.547159.774 287597.584037.412544.527 325324.400034.871040.159 344432.992033.596538.316 Analysis of results for modified method with number of frames 10 QP BITRATE (kbps) PSNR (avg) dB ENCODIN G TIME (sec) 2411305.324640.387045.428 287814.732937.2947 33.395 325379.611134.7557 29.717 344488.863233.458927.587
22
EXPERIMENTAL RESULTS of BQSquare_416x240_60 Analysis of results for unmodified method with number of frames 30 QP BITRATE (kbps) PSNR (avg) dB ENCODIN G TIME (sec) 2410971.088040.5244152.385 287672.032037.3928 133.030 325376.336034.8391123.324 344475.088033.5497 115.036 Analysis of results for modified method with number of frames 30 QP BITRATE (kbps) PSNR (avg) dB ENCODIN G TIME (sec) 2411387.521440.3846115.8126 287891.257737.327099.7725 325461.536034.682591.259 344529.620033.404782.825
23
Encoding Time Vs QP for BQMall_832x480_60 For number of frames 10
24
Encoding Time Vs QP for BQMall_832x480_60 For number of frames 30
25
Encoding Time Vs QP for KristenAndSara_1280x720_60 For number of frames 10
26
Encoding Time Vs QP for KristenAndSara_1280x720_60 For number of frames 30
27
Encoding Time Vs QP for BQSquare_416x240_60 For number of frames 10
28
Encoding Time Vs QP for BQSquare_416x240_60 For number of frames 30
29
PSNR (avg) vs Bitrate for BQMall_832x480_60 For number of frames 10For number of frames 30
30
PSNR (avg) vs Bitrate for KristenAndSara_1280x720_60 For number of frames 10For number of frames 30
31
PSNR (avg) vs Bitrate for BQSquare_412x240_60 For number of frames 10For number of frames 30
32
BD-PSNR and BD-BITRATE BQMall_832x480_60 for number of frames 10 BQMall_832x480_60 for number of frames 30 QPBD-PSNRBD-BITRATE 24-0.2912.18 28-0.3112.62 32-0.2713.43 34-0.2512.46 QPBD-PSNRBD-BITRATE 24-0.2612.04 28-0.2712.37 32-0.2513.268 34-0.25312.429
33
BD-PSNR and BD-BITRATE KristenAndSara_1280x720_60 for number of frames 10 KristenAndSara_1280x720_60 for number of frames 30 QPBD-PSNRBD-BITRATE 24-0.176.5 28-0.225.45 32-0.4323.08 34-0.207.1404 QPBD-PSNRBD-BITRATE 24-0.125.65 28-0.203.83 32-0.187.45 34-0.156.82
34
BD-PSNR and BD-BITRATE BQSquare_416x240_60 for number of frames 10 BQSquare_416x240_60 for number of frames 30 QPBD-PSNRBD-BITRATE 24-0.6515.99 28-0.4711.95 32-0.2410.03 34-0.2912.29 QPBD-PSNRBD-BITRATE 24-0.6214.32 28-0.427.84 32-0.1614.10 34-0.2912.89
35
BD-PSNR vs QP for BQMall_832x480_60 For number of frames 10For number of frames 30
36
BD-PSNR vs QP for KristenAndSara_1280x720_60 For number of frames 10For number of frames 30
37
BD-PSNR vs QP for BQSquare_416x240_60 For number of frames 10For number of frames 30
38
BD-BITRATE vs QP for BQMall_832x480_60 For number of frames 10For number of frames 30
39
BD-BiTRATE vs QP for KristenAndSara_1280x720_60 For number of frames 10For number of frames 30
40
BD-BITRATE vs QP for BQSquare_416x240_60 For number of frames 10For number of frames 30
41
CONCLUSIONS For the fast mode decision algorithm [13] compared to unmodified HM13.0, 0.1-0.6 dB loss in the PSNR 7 – 23 kbps increase in the bitrate 20-28 % reduction in encoding time WQVGA – SD sequences of 10 frames and 30 frames each were tested for QPs 24, 28,32,34. Also visual quality of the images was maintained.
42
ACRONYMS BD- Bitrate - Bjøntegaard Delta Bitrate BD- PSNR - Bjøntegaard Delta Peak Signal-to-Noise Ratio CU- Coding Unit DCT - Discrete Cosine Transform DST - Discrete Sine Transform HEVC - High Efficiency Video Coding JCT- VC- Joint Collaborative Team on Video Coding LCU - Largest Coding Unit MPM - Most Probable Mode PSNR - Peak Signal-to-Noise Ratio PU - Prediction Unit QP - Quantization Parameter RDOQ - Rate Distortion Optimization Quantization RDO - Rate- Distortion Optimization RMD - Rough Mode Decision SSIM - Structural Similarity Index TU - Transform Unit
43
REFERENCES [1] G.J. Sullivan et al, Overview of the high efficiency video coding (HEVC) standard ‖, IEEE Trans. circuits and systems for video technology, vol. 22, no.12, pp. 1649 – 1668, Dec 2012. [2] JCT-VC, “WD1: Working Draft 1 of High-Efficiency Video Coding”, JCTVC-C403, JCT-VC Meeting, Guangzhou, October 2010. [3] Coding tree structure - https://www.google.com/search?q=coding+tree+structure+in+hevchttps://www.google.com/search?q=coding+tree+structure+in+hevc [4] Y. Piao et al, “Encoder improvement of unified intra prediction,” JCTVC-C207, Guangzhou, October 2010. [5] Software for HEVC - https://hevc.hhi.fraunhofer.de/svn/svn_HEVCSoftware [6] T.L. Silva et al, ”HEVC intra coding acceleration based on tree inter-level mode correlation”, SPA 2013 Sep.2013, Poznan, Poland [7] H. Zhang and Z. Ma, ”Fast intra prediction for high efficiency video coding ”, Pacific Rim Conf. on Multimedia, PCM2012, Singapore, Dec. 2012. [8] M. Zhang et al, ”An adaptive fast intra mode decision in HEVC ”, IEEE ICIP 2012, pp.221-224, Orlando, FL, Sept.- Oct. 2012.
44
[9] Y. Kim et al, “A fast intra-prediction method in HEVC using rate-distortion estimation based on Hadamard transform”, ETRI Journal, vol.35, #2, pp.270-280, Apr. 2013. [10 ]A. Saxena and F. Fernanades, “Mode dependent DCT/DST for intra prediction in block based image/video coding”, IEEE ICIP, pp. 1685-1688, Sept. 2011. [11] M. Khan et al, “An adaptive complexity reduction scheme with fast prediction unit decision for HEVC Intra encoding”, IEEE ICIP, pp. 1578-1582, Sept. 2013. [12] P. Mehta, “Complexity reduction for intra mode selection in HEVC using OpenMP”, course website: http://www-ee.uta.edu/Dip/Courses/EE5359/ Section: previous projects, Sub section: Projects (Spring 2014). [13] S. Vasudevan, “Fast intra prediction and fast residual quadtree encoding implementation in HEVC”, course website: http://www- ee.uta.edu/Dip/Courses/EE5359/ Section: previous projects, Sub section: Projects (Spring 2014). [14] K.R.Rao, D. N. Kim and J.J. Hwang,” Video coding standards: AVS China, H.264/MPEG-4 Part10, HEVC, VP6, DIRAC and VC-1"´, Springer, 2014.
45
[15] G.Sullivan et al, “Standard Extensions of the High Efficiency Video Coding (HEVC) Standard” Journal of Special Topics in Signal Processing, vol.7, No. 6, pp. 1001-1016, Dec 2013. [16] Test Sequences: ftp://ftp.kw.bbc.co.uk/hevc/hm-11.0-anchors/testsequences/ftp://ftp.kw.bbc.co.uk/hevc/hm-11.0-anchors/testsequences/ [17] F. Bossen et al, "HM Software Manual", JCT-VC of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, AHG chairs, January 2014 [18] B. Bross et al, “High Efficiency Video Coding (HEVC) Text Specification Draft 10”, Document JCTVC- L1003, ITU-T/ISO/IEC Joint Collaborative Team on Video Coding (JCT-VC), Mar. 2013 available on http://phenix.it- sudparis.eu/jct/doc_end_user/current_document.php?id=7243http://phenix.it- sudparis.eu/jct/doc_end_user/current_document.php?id=7243 [19] JVT Draft ITU-T recommendation and final draft international standard of joint video specification (ITU-T Rec. H.264-ISO/IEC 14496-10 AVC), March 2003, JVT-G050 available on http://ip.hhi.de/imagecom_G1/assets/pdfs/JVT- G050.pdf
46
[22] Special issue on emerging research and standards in next generation video coding, IEEE Transactions on Circuits and Systems for Video Technology (CSVT), vol.22, pp. 1646-1909, Dec. 2012. [23] Special issue on emerging research and standards in next generation video coding, IEEE Transactions on Circuits and Systems for Video Technology (CSVT), vol.23, pp. 2009-2142, Dec. 2013. [24] Introduction to the issue on video coding HEVC and beyond. [25] IEEE Journal of Selected Topics in Signal Processing, Vol. 7, pp. 931 - 1151, Dec. 2013. [26] H. Samet, “The quadtree and related hierarchical data structures,” Comput. Survey, vol. 16, no. 2, pp. 187–260, Jun. 1984. [27] H. Zhang and Z. Ma, "Fast intra node decision for high efficiency video coding (HEVC)", IEEE Trans. on CSVT, vol. 24, pp.660-668, April 2014.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.