BY AMRUTA KULKARNI STUDENT ID : 1000666836 UNDER SUPERVISION OF DR. K.R. RAO Complexity Reduction Algorithm for Intra Mode Selection in H.264/AVC Video.

Slides:



Advertisements
Similar presentations
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Advertisements

MPEG-2 to H.264/AVC Transcoding Techniques Jun Xin Xilient Inc. Cupertino, CA.
H.264 Intra Frame Coder System Design Özgür Taşdizen Microelectronics Program at Sabanci University 4/8/2005.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
1 Video Coding Concept Kai-Chao Yang. 2 Video Sequence and Picture Video sequence Large amount of temporal redundancy Intra Picture/VOP/Slice (I-Picture)
Implementation and Study of Unified Loop Filter in H.264 EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala
H.264/AVC Baseline Profile Decoder Complexity Analysis Michael Horowitz, Anthony Joch, Faouzi Kossentini, and Antti Hallapuro IEEE TRANSACTIONS ON CIRCUITS.
CABAC Based Bit Estimation for Fast H.264 RD Optimization Decision
Ch. 6- H.264/AVC Part I (pp.160~199) Sheng-kai Lin
Overview of the H.264/AVC Video Coding Standard
Analysis, Fast Algorithm, and VLSI Architecture Design for H
2015/6/271 Intra-Prediction in H.264(JM82) Student : 林鴻志 Advisor : 杭學鳴 教授.
1 An Efficient Mode Decision Algorithm for H.264/AVC Encoding Optimization IEEE TRANSACTION ON MULTIMEDIA Hanli Wang, Student Member, IEEE, Sam Kwong,
Feature-Based Intra-/InterCoding Mode Selection for H.264/AVC C. Kim and C.-C. Jay Kuo CSVT, April 2007.
BY AMRUTA KULKARNI STUDENT ID : UNDER SUPERVISION OF DR. K.R. RAO Complexity Reduction Algorithm for Intra Mode Selection in H.264/AVC Video.
Optimizing Baseline Profile in H
H.264/AVC.
An Introduction to H.264/AVC and 3D Video Coding.
HARDEEPSINH JADEJA UTA ID: What is Transcoding The operation of converting video in one format to another format. It is the ability to take.
Conference title 1 A WYNER-ZIV TO H.264 VIDEO TRANSCODER José Luis Martínez, Pedro Cuenca, Gerardo Fernández-Escribano, Francisco José Quiles and Hari.
PROJECT PROPOSAL HEVC DEBLOCKING FILTER AND ITS IMPLIMENTATION RAKESH SAI SRIRAMBHATLA UTA ID: EE 5359 Under the guidance of DR. K. R. RAO.
Fast mode decision for Inter Mode Selection in H.264/AVC Video Coding
By Sudeep Gangavati ID EE5359 Spring 2012, UT Arlington
PROJECT INTERIM REPORT HEVC DEBLOCKING FILTER AND ITS IMPLEMENTATION RAKESH SAI SRIRAMBHATLA UTA ID:
Profiles and levelstMyn1 Profiles and levels MPEG-2 is intended to be generic, supporting a diverse range of applications Different algorithmic elements.
Priyadarshini Anjanappa UTA ID:
Vineeth Shetty Kolkeri University of Texas, Arlington
Santosh Kumar Muniyappa
By Abhishek Hassan Thungaraj Supervisor- Dr. K. R. Rao.
Outline JVT/H.26L: History, Goals, Applications, Structure
EE 5359 PROJECT PROPOSAL FAST INTER AND INTRA MODE DECISION ALGORITHM BASED ON THREAD-LEVEL PARALLELISM IN H.264 VIDEO CODING Project Guide – Dr. K. R.
By, ( ) Low Complexity Rate Control for VC-1 to H.264 Transcoding.
Performance Analysis and Comparison of H.264 based on JM and FFMPEG Softwares Guided by Dr K.R.Rao By Kiran Jonnavittula.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison of H.264/MPEG4.
Video Compression Standards for High Definition Video : A Comparative Study Of H.264, Dirac pro And AVS P2 By Sudeep Gangavati EE5359 Spring 2012, UT Arlington.
EE 5359 TOPICS IN SIGNAL PROCESSING PROJECT ANALYSIS OF AVS-M FOR LOW PICTURE RESOLUTION MOBILE APPLICATIONS Under Guidance of: Dr. K. R. Rao Dept. of.
By: Hitesh Yadav Supervising Professor: Dr. K. R. Rao Department of Electrical Engineering The University of Texas at Arlington Optimization of the Deblocking.
Low-Power H.264 Video Compression Architecture for Mobile Communication Student: Tai-Jung Huang Advisor: Jar-Ferr Yang Teacher: Jenn-Jier Lien.
Implementation and comparison study of H.264 and AVS China EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
Figure 1.a AVS China encoder [3] Video Bit stream.
IMPLEMENTATION OF H.264/AVC, AVS China Part 7 and Dirac VIDEO CODING STANDARDS Under the guidance of Dr. K R. Rao Electrical Engineering Department The.
-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.
Optimizing Baseline Profile in H.264/AVC Video Coding by Parallel Programming and Fast Intra and Inter Predictions BY Under the Guidance of VINOOTHNA GAJULA.
Fast motion estimation and mode decision for H.264 video coding in packet loss environment Li Liu, Xinhua Zhuang Computer Science Department, University.
Vineeth Shetty Kolkeri University of Texas, Arlington
Vamsi Krishna Vegunta University of Texas, Arlington
ADAPTIVE INTERPOLATION FILTER FOR H.264/AVC Bhavana Prabhakar Student Id: Department of Electrical Engineering.
IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇.
UNDER THE GUIDANCE DR. K. R. RAO SUBMITTED BY SHAHEER AHMED ID : Encoding H.264 by Thread Level Parallelism.
EE 5359 PROJECT PRESENTATION FAST INTER AND INTRA MODE DECISION
Transcoding from H.264/AVC to HEVC
Video Compression—From Concepts to the H.264/AVC Standard
Block-based coding Multimedia Systems and Standards S2 IF Telkom University.
COMPARATIVE STUDY OF HEVC and H.264 INTRA FRAME CODING AND JPEG2000 BY Under the Guidance of Harshdeep Brahmasury Jain Dr. K. R. RAO ID MS Electrical.
Time Optimization of HEVC Encoder over X86 Processors using SIMD Kushal Shah Advisor: Dr. K. R. Rao Spring 2013 Multimedia.
By: Santosh Kumar Muniyappa ( ) Guided by: Dr. K. R. Rao Final Report Multimedia Processing (EE 5359)
Implementation and comparison study of H.264 and AVS china EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala.
EE 5359 MULTIMEDIA PROCESSING PROJECT PROPOSAL SPRING 2016 STUDY AND PERFORMANCE ANALYSIS OF HEVC, H.264/AVC AND DIRAC By ASHRITA MANDALAPU
Multi-Frame Motion Estimation and Mode Decision in H.264 Codec Shauli Rozen Amit Yedidia Supervised by Dr. Shlomo Greenberg Communication Systems Engineering.
Computational Controlled Mode Selection for H.264/AVC June Computational Controlled Mode Selection for H.264/AVC Ariel Kit & Amir Nusboim Supervised.
Complexity varying intra prediction in H.264 Supervisors: Dr. Ofer Hadar, Mr. Evgeny Kaminsky Students: Amit David, Yoav Galon.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Quality Evaluation and Comparison of SVC Encoders
Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.
Study and Optimization of the Deblocking Filter in H
PROJECT PROPOSAL HEVC DEBLOCKING FILTER AND ITS IMPLIMENTATION RAKESH SAI SRIRAMBHATLA UTA ID: EE 5359 Under the guidance of DR. K. R. RAO.
Fast Decision of Block size, Prediction Mode and Intra Block for H
Standards Presentation ECE 8873 – Data Compression and Modeling
Optimizing Baseline Profile in H
Presentation transcript:

BY AMRUTA KULKARNI STUDENT ID : UNDER SUPERVISION OF DR. K.R. RAO Complexity Reduction Algorithm for Intra Mode Selection in H.264/AVC Video Coding

H.264/MPEG-4 Part 10 or AVC Is an advanced video compression standard, developed by ITU-T Video Coding Experts Group(VCEG) together with ISO/IEC Moving Picture Experts Group(MPEG). It is a widely used video codec in mobile applications, internet ( YouTube, flash players), set top box, DTV etc. A H.264 encoder converts the video into a compressed format(.264) and a decoder converts compressed video back into an uncompressed format.

How does H.264 codec work ? An H.264 video encoder carries out prediction, transform and encoding processes to produce a compressed H.264 bit stream. The block diagram of the H.264 video encoder is shown in Fig 1. A decoder carries out a complementary process by decoding, inverse transform and reconstruction to output a decoded video sequence. The block diagram of the H.264 video decoder is shown in Fig 2.

H.264 encoder block diagram Fig. 1 H.264 Encoder block diagram[7]

H.264 decoder block diagram Motion Compensation Entropy Decoding Intra Prediction Intra/Inter Mode Selection Inverse Quantization & Inverse Transform Deblocking Filter + + Bitstream Input Video Output Picture Buffering Fig.2 H.264 decoder block diagram [2]

Profiles in H.264 The H.264 standard defines sets of capabilities, which are also referred to as “Profiles”, targeting specific classes of applications. Fig. 3. Different features are supported in different profiles depending on applications. Table 1. lists some profiles and there applications. ProfileApplications BaselineVideo conferencing, Videophone MainDigital Storage Media, Television Broadcasting HighStreaming Video ExtendedContent distribution Post processing Table 1. List of H.264 Profiles and applications [2]

Profiles in H.264[9] Fig. 3 Profiles in H. 264[9]

Prediction The H.264 encoder forms a prediction of the current macro block – a) Based on the current frame using intra (spatial) prediction. b) Based on the previous frames that have already been coded using inter prediction.

Intra prediction Supports the following macro block sizes : a) 16x16 (for Luma) – [9] Fig. 4 H.264 Intra 16x16 prediction modes.

Intra Prediction H.264 Intra 16x16 prediction modes are shown in Fig 4.  Mode 0 (vertical): extrapolation from upper samples (H).  Mode 1 (horizontal): extrapolation from left samples (V).  Mode 2 (DC): mean of upper and left-hand samples (H+V).  Mode 3 (Plane): a linear “plane” function is fitted to the upper and left-hand side samples H and V. This works well in areas of smoothly-varying luminance. b) 8x8 (for Chroma) – DC, Plane, Horizontal and Vertical

Intra prediction c) 4x4 (for Luma) modes for intra prediction are shown if Fig.5 -[2] Fig 5. H.264 Intra 4x4 prediction modes[2]

Introduction In H.264/AVC intra-prediction is conducted for block sizes such as 4x4 luma blocks, 16x16 luma blocks, and 8x8 chroma blocks for baseline profile 4:2:0 format. The residual between the current block and its prediction is then transformed, quantized, and entropy coded. To obtain the best mode for these blocks, the H.264/AVC encoder performs the rate-distortion optimization (RDO) technique for each macro block.

RDO procedure for one macro block Set macro block parameters : QP(quantization parameter) and Lagrangian multiplier (λ) Calculate : λ Mode = 0.85*2(QP-12)/3 Then calculate cost, which determines the best mode, Cost = D + λ MODE * R, D – Distortion R - bit rate with given QP λ – Lagrange multiplier Distortion (D) is obtained by SATD (Sum of Absolute Transform Differences) between the original macro block and its reconstructed block. Bit rate(R) includes the bits for the mode information and transform coefficients for macro block. Quantization parameter (QP) can vary from (0-51) Lagrange multiplier (λ) a value representing the relationship between bit cost & quality.

RDO Procedure for one macro block 1. For a 4x4 luma block select the best mode, which minimizes the cost among 9 modes. 2. For a 16x16 luma block choose the mode with minimum SATD(Sum of absolute transformed differences) among the 4 modes as the best mode. 3. For an 8x8 chroma block choose the mode with minimum cost among 4 modes. 4. Choose the best one as the prediction mode of one macro block by comparing RD costs for 4x4 mode obtained from step (1) and 16x16 mode from step (2). 5. Choose the best one as the prediction mode of one chroma block.

RDO (Rate distortion optimization) Considering the RDO procedure for intra mode selection in H.264/AVC, the number of mode combinations in one macro block is N8x(16xN4 + N16)  N8 – number of modes of an 8x8 chroma block – 4 modes  N4 – number of modes of an 4x4 luma block - 9 modes  N16 – number of modes of an 16x16 luma block – 4 modes N8x(16xN4 + N16) = 4x(16x9 + 4) = 592 Thus, to select the best mode for one macro block in the intra prediction, the H.264/AVC encoder carries out 592 RDO calculations. As a result, the complexity of the encoder increases extremely.

SATD(sum of absolute transform differences) calculation Following steps are performed in JM 17.2 for calculation of SATD. a) Find the absolute difference (magnitude) between original 16x16 block and predicted 16x16 block. b) Apply Hadamard transform on every 4x4 block. c) Then take sum of every 4x4 block transform coefficients except the DC coefficient. d) Check if the sum_cost > max_cost. if yes return max_cost. e) If no, apply Hadamard transform on every 4x4 block and add sum of all 16 DC coefficients to sum_cost. f) Check if the sum_cost > max_cost if yes return max_cost else return sum_cost. where, sum_cost -> cost calculated for each block partition. max_cost -> maximum cost value allowed for each block size.

Introduction This project uses the baseline profile, as it provides simplicity in implementation. The important features are – a) I and P slice coding b) Enhanced error resilience such as FMO (flexible macro block ordering) and arbitrary slice ordering(ASO) and redundant slices (RS) c) Context adaptive variable length coding (CAVLC) Baseline profile is primarily used for low-cost applications, for data loss robustness. The joint model (JM 17.2) [6] implementation of the H.264 encoder is used in this project.

Complexity reduction algorithm for intra prediction This project will implement the complexity reduction algorithm for all the 3 block sizes 1) 16x16 luma 2) 4x4 luma 3) 8x8 chroma

Algorithm The proposed intra mode selection algorithm for a 16x16 luma block is summarized as follows: Step 1 - Examine sizes of adjacent blocks: if both blocks (upper block and left block) are 16x16, go to Step 2, otherwise go to Step 4. Step 2 - Examine modes of adjacent blocks: if both modes are same, go to Step 3, otherwise select the best mode for a 16x16 luma block, which results in the minimum SATD (sum of absolute transformed differences) between two adjacent modes of modeA and modeB. Step 3 - If both adjacent modes are DC mode, go to Step 4, otherwise select the best mode for a 16x16 luma block, which results in the minimum SATD between the adjacent mode and DC mode.

Algorithm Modes of upper and left blocks used as candidate modes[8] Indices used in 4x4 luma block[8]

Yes No

Algorithm Step 4 - Let ΔV be a vertical difference between upper boundary pixels of the current block and boundary pixels of the upper block, and ΔH be a horizontal difference between left boundary pixels of the current block and boundary pixels of the left block as follows. ΔV = Σ |u(i)-q(i)| for i =0 to 15. ΔH = Σ |l(i)-r(i)| for i =0 to 15. Where u(i) -> upper block boundary pixels q(i) -> upper boundary pixels of current block l(i) -> boundary pixels of the left block r(i) -> left boundary pixels of the current block T 2 -> Threshold level 2 (T 2 = 32)

Algorithm Obtain candidate modes as follows by using two difference values, ΔV and ΔH: if |ΔV − ΔH | is smaller than 2T 2, candidate modes are DC mode and plane mode; if (ΔV − ΔH) is larger than T 2, candidate modes are DC mode and horizontal mode; if (ΔV − ΔH) is smaller than −T 2, candidate modes are DC and vertical modes, where T 2 is a positive value. T 2 is the threshold value equal to 32. The calculation for ΔV and ΔH is shown in Fig. 8. Finally, select the best mode between each candidate mode by choosing the mode with minimum SATD.

Algorithm Fig. 6 Calculation of ΔV & ΔH [8]

Comparison of number of RDO computations Intra block sizesNumber of modes - Original JM 17.2 Implementation Number of modes - Complexity reduction algorithm 16 x x x After implementing the complexity reduction algorithms the number of mode decisions for every macro block size in intra prediction is reduced. Table 2 lists reduction in mode decisions. Table 2. Reduction in Number of mode decisions for Intra block prediction. [8]

CIF and QCIF sequences. CIF (Common Intermediate Format) is a format used to standardize the horizontal and vertical resolutions in pixels of Y, C b, C r sequences in video signals, commonly used in video teleconferencing systems. QCIF means "Quarter CIF". To have one fourth of the area as "quarter" implies the height and width of the frame are halved. The differences in Y, C b, C r of CIF and QCIF are as shown below in fig.6. [16] Fig.6 CIF and QCIF resolutions( Y, C b, C r ).

Results The following QCIF and CIF sequences were used to test the complexity reduction algorithm. [10] Akiyo Foreman Car phone Hall monitor Silent News Container Coastguard

Test Sequences News Foreman Akiyo CoastguardCar phone Bus

Test Sequences Hall monitor Silent

Results The complexity reduction algorithm was successfully implemented on the JM 17.2 software.  Profile used : Baseline  Total number of frames : 100 (I frames only)  QP (quantization parameter) : 28 Table 1 shows the simulation results for QCIF sequences. Table 2 shows the simulation results for CIF sequences. Computational efficiency is measured by the amount of time reduction, which is computed as follows: Δ Time is calculated as :

Results Δ MSE is calculated as: Δ Bit rate is calculated as: Δ PSNR is calculated as :

Results Sequence (QCIF) Δ Time (%) Δ PSNR(dB)Δ Bit rate (%) Δ MSE Akiyo Foreman Car phone Hall monitor Silent News Container Coastguard *Negative values indicate the gain (e.g. decrease in encoding time) *Positive values indicate the loss (e.g. increase in the bit rate) Table 1 : Test results for QCIF sequences

Results Sequence (CIF) Δ Time (%) Δ PSNR(dB)Δ Bit rate (%) Δ MSE Bus Container Coastguard *Negative values indicate the gain (e.g. decrease in encoding time) *Positive values indicate the loss (e.g. increase in the bit rate) Table 2 : Test results for CIF sequences

References: 1. ITU-T Rec. H.264 | ISO/IEC : Information Technology – Coding of Audio-visual Objects, Part 10: Advanced Video Coding T.Wiegand, et al,: “Overview of the H.264/AVC Video Coding Standard.” IEEE Trans. Circuits and Syst. for Video Technol., Vol. 13, pp , July Z.Chen, et al,: “Fast Integer Pixel and Fractional Pixel Motion Estimation for JVT.” Doc. #JVT-F017,Dec B.Hsieh, et al,:“Fast Motion Estimation for H.264/MPEG-4 AVC by Using Multiple Reference Frame Skipping Criteria.” VCIP 2003,Proceedings of SPIE, Vol. 5150, pp , Oct A.Puri et al, “Video coding using the H.264/MPEG-4 AVC compression standard”, Signal Processing: Image Communication, vol.19, pp , Oct

References 6. H.264/AVC JM software: 7. An overview of the H.264 encoder: 8. J. Kim, et al, “Complexity reduction algorithm for intra mode selection in H.264/AVC video coding” J. Blanc-Talon et al. (Eds.): ACIVS 2006, LNCS 4179, pp. 454 – 465, Springer-Verlag Berlin Heidelberg, I. Richardson, “The H.264 advanced video compression standard” – second edition, Wiley, YUV test video sequences :