Low complexity AVS-M by implementing data mining algorithm

Slides:

Advertisements

Similar presentations

Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.

Advertisements

Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC.

-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang

1 Video Coding Concept Kai-Chao Yang. 2 Video Sequence and Picture Video sequence Large amount of temporal redundancy Intra Picture/VOP/Slice (I-Picture)

H.264/AVC Baseline Profile Decoder Complexity Analysis Michael Horowitz, Anthony Joch, Faouzi Kossentini, and Antti Hallapuro IEEE TRANSACTIONS ON CIRCUITS.

Ch. 6- H.264/AVC Part I (pp.160~199) Sheng-kai Lin

DWT based Scalable video coding with scalable motion coding Syed Jawwad Bukhari.

BY AMRUTA KULKARNI STUDENT ID : UNDER SUPERVISION OF DR. K.R. RAO Complexity Reduction Algorithm for Intra Mode Selection in H.264/AVC Video.

BY AMRUTA KULKARNI STUDENT ID : UNDER SUPERVISION OF DR. K.R. RAO Complexity Reduction Algorithm for Intra Mode Selection in H.264/AVC Video.

An Introduction to H.264/AVC and 3D Video Coding.

HARDEEPSINH JADEJA UTA ID: What is Transcoding The operation of converting video in one format to another format. It is the ability to take.

EE 5359 H.264 to VC 1 Transcoding Vidhya Vijayakumar Multimedia Processing Lab MSEE, University of Arlington Guided.

EE 5359 TOPICS IN SIGNAL PROCESSING

IMPLEMENTATION AND PERFORMANCE ANALYSIS of Dirac VIDEO CODING STANDARD AND COMPARISON WITH AVS CHINA Under the guidance of Dr. K R. Rao Electrical Engineering.

By Sudeep Gangavati ID EE5359 Spring 2012, UT Arlington

EE 5359 TOPICS IN SIGNAL PROCESSING Interim Report ANALYSIS OF AVS-M FOR LOW PICTURE RESOLUTION MOBILE APPLICATIONS Under Guidance of: Dr. K. R. Rao Dept.

Low complexity H.264 Encoder using machine learning.

Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project.

Priyadarshini Anjanappa UTA ID:

Windows Media Video 9 Tarun Bhatia Multimedia Processing Lab University Of Texas at Arlington 11/05/04.

By:-Ramolia Pragnesh R. Guided by :-Dr. K.R.Rao. Term:-Spring

MULTIMEDIA PROCESSING (EE 5359) SPRING 2011 DR. K. R. RAO PROJECT PROPOSAL Error concealment techniques in H.264 video transmission over wireless networks.

By, ( ) Low Complexity Rate Control for VC-1 to H.264 Transcoding.

Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.

Sadaf Ahamed G/4G Cellular Telephony Figure 1.Typical situation on 3G/4G cellular telephony [8]

- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison of H.264/MPEG4.

Video Compression Standards for High Definition Video : A Comparative Study Of H.264, Dirac pro And AVS P2 By Sudeep Gangavati EE5359 Spring 2012, UT Arlington.

EE 5359 TOPICS IN SIGNAL PROCESSING PROJECT ANALYSIS OF AVS-M FOR LOW PICTURE RESOLUTION MOBILE APPLICATIONS Under Guidance of: Dr. K. R. Rao Dept. of.

Low-Power H.264 Video Compression Architecture for Mobile Communication Student: Tai-Jung Huang Advisor: Jar-Ferr Yang Teacher: Jenn-Jier Lien.

Sub pixel motion estimation for Wyner-Ziv side information generation Subrahmanya M V (Under the guidance of Dr. Rao and Dr.Jin-soo Kim)

Implementation and comparison study of H.264 and AVS China EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala.

- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.

Figure 1.a AVS China encoder [3] Video Bit stream.

PERFORMANCE ANALYSIS OF AVS-M AND ITS APPLICATION IN MOBILE ENVIRONMENT By Vidur Vajani ( ) Under the guidance of Dr.

IMPLEMENTATION OF H.264/AVC, AVS China Part 7 and Dirac VIDEO CODING STANDARDS Under the guidance of Dr. K R. Rao Electrical Engineering Department The.

-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.

Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp

Study and Optimization of the Deblocking Filter in H.265 and its Advantages over H.264 By: Valay Shah Under the guidance of: Dr. K. R. Rao.

Vamsi Krishna Vegunta University of Texas, Arlington

IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者：劉冠宇.

UNDER THE GUIDANCE DR. K. R. RAO SUBMITTED BY SHAHEER AHMED ID : Encoding H.264 by Thread Level Parallelism.

Study and Performance Comparison of H.264/AVC, Dirac and AVS China Part 7 EE5359 Project Fall 2010 Touseef Khan

Porting of Fast Intra Prediction in HM7.0 to HM9.2

Transcoding from H.264/AVC to HEVC

Video Compression and Standards

UNDER THE GUIDANCE DR. K. R. RAO SUBMITTED BY SHAHEER AHMED ID : Encoding H.264 by Thread Level Parallelism.

Study and Comparison of H.264, AVS- China and Dirac - by Jennie G. Abraham EE5359 – Multimedia Processing, Fall 2009 EE Dept., University of Texas at Arlington.

Instructor : Dr. K. R. Rao Presented by : Vigneshwaran Sivaravindiran

Time Optimization of HEVC Encoder over X86 Processors using SIMD Kushal Shah Advisor: Dr. K. R. Rao Spring 2013 Multimedia.

By: Santosh Kumar Muniyappa ( ) Guided by: Dr. K. R. Rao Final Report Multimedia Processing (EE 5359)

Introduction to MPEG Video Coding Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06

Implementation and comparison study of H.264 and AVS china EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala.

Multi-Frame Motion Estimation and Mode Decision in H.264 Codec Shauli Rozen Amit Yedidia Supervised by Dr. Shlomo Greenberg Communication Systems Engineering.

MPEG Video Coding I: MPEG-1 1. Overview  MPEG: Moving Pictures Experts Group, established in 1988 for the development of digital video.  It is appropriately.

Complexity varying intra prediction in H.264 Supervisors: Dr. Ofer Hadar, Mr. Evgeny Kaminsky Students: Amit David, Yoav Galon.

Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.

Present by 楊信弘 Advisor: 鄭芳炫

CSI-447: Multimedia Systems

Automatic Video Shot Detection from MPEG Bit Stream

Early termination for tz search in hevc motion estimation

Overview of the Scalable Video Coding

Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.

Supplement, Chapters 6 MC Course, 2009.

Study and Optimization of the Deblocking Filter in H

PROJECT PROPOSAL HEVC DEBLOCKING FILTER AND ITS IMPLIMENTATION RAKESH SAI SRIRAMBHATLA UTA ID: EE 5359 Under the guidance of DR. K. R. RAO.

Fast Decision of Block size, Prediction Mode and Intra Block for H

ENEE 631 Project Video Codec and Shot Segmentation

Standards Presentation ECE 8873 – Data Compression and Modeling

MPEG4 Natural Video Coding

CUI BIN AVS team of the MPL at UTA

Presentation transcript:

Low complexity AVS-M by implementing data mining algorithm By :- Ramolia Pragnesh R. Guided by :- Dr. K.R.Rao Dr. Dongil Han Term :- Fall-2009

Introduction to AVS-M Overview of AVS-M Complexity present in AVS-M encoder Various approaches to reduce complexity Introduction to data mining algorithm: C4.5 Project implementation steps AVS-M execution, and mode and attribute extraction. Future work.

Introduction to AVS-M AVS-M is the seventh part of video coding standard developed by AVS workgroup of China which aims at mobile applications. It has 9 different levels for different formats. It supports only progressive video coding hence codes frames only. It uses only 4:2:0 chroma sub-sampling format. It uses only I and P frames.

Different parts of AVS [10] Name 1 System 2 Video 3 Audio 4 Conformance test 5 Reference software 6 Digital media rights management 7 Mobile video 8 Transmit AVS via IP network 9 AVS file format 10 Mobile speech and audio coding Table 1: Different parts of AVS

Layered Data Structure G.O.P. Sequence Picture Slice Macro Block Block Sequence Picture Slice Block Macro block

AVS-M Codec Each MB needs to be intra or inter predicted. Switch S0(Fig. 1 ) is used to decide between inter and intra based type of MB. Unit size for intra prediction is block size of 4x4, and predictions are derived based on left and upper blocks. Inter predictions are derived on blocks of varying sizes: 16x16, 16x8, 8x16, 8x8, 8x4, 4x8, and 4x4 from locally reconstructed frames. Transform coefficients are coded by VLC. Deblocking filter is applied on reconstructed image.

Figure 1: Encoder of AVS-M [10]

Decoder Figure 2: Decoder of AVS-M [10]

Major and Minor tools of AVS-M Network abstraction layer (NAL). Supplemental enhancement information (SEI). Transform –4x4 integer transform. Adaptive quantization of step size varying from 0-63. Intra prediction –9 modes (Fig. 3), simple 4x4 intra prediction and direct intra prediction. Motion compensation –16x16, 16x8, 8x16, 8x8, 8x4, 4x8, and 4x4 block sizes. Quarter-pixel interpolation –8-tap horizontal interpolation filter and 4-tap vertical interpolation filter. Simplified in-loop de-blocking filter. Entropy coding. Error resilience .

Intra adaptive directional prediction [25] Figure 3: Intra adaptive directional prediction

Intra prediction Intra prediction scheme in AVS-M brings much simplicity as compared to H.264 baseline profile. It uses 4x4 block as the unit for intra-prediction. It uses 2 modes of prediction in intra prediction intra_4x4 and direct intra prediction. Intra_4x4 uses content based most probable intra mode decision as shown in Table 2 to save bits, where U and L represents the upper ad left blocks as shown in Fig. 4. Direct intra prediction brings much of the compression based on trade-off decision. Upper block[U] Left block[L] Current block Fig. 4 : Current block and neighboring block representation

Intra prediction U L -1 1 2 3 4 5 6 7 8 Table 2: Content based most probable mode decision table [25] Mode ‘-1’ is assigned to ‘L’ or “U’ when the current block does not have ‘Left’ or ‘Upper’ block respectively.

Inter-frame prediction Size of the blocks in inter-frame prediction can be 16x16, 16x8, 8x16, 8x8, 8x4, 4x8, and 4x4 depending on the amount of information present within the macro-block. Motion is predicted up to ¼ pixel accuracy. If the half_pixel_mv_flag is 1 then it is up to ½ pixel accuracy. 8-tap filter F1 = (−1,4,−12,41,41,−12,4,−1) and 4-tap filter F2 = (−1,5,5,−1) are used for horizontal and vertical interpolations respectively for ½ pixel MV search and averaging (liner interpolation) is used for ¼ pixel accuracy as shown in Fig. 6.

Inter frame block sizes: 7 block sizes are present in AVS-M for inter frame prediction [9]. Figure 5: Inter frame prediction block sizes

sub-pixel motion estimation by interpolation[16]: Figure 6: interpolation of sub-pixels (hatched lines show half-pixels, empty circles are quarter-pixels, and capital letters represent full-pixels.)

Error concealment and resilience 3- techniques are used for error concealment – forward, backward and interactive error concealment. For error resilience supplemental enhancement information (SEI) is sent along with the bit-stream which has details of 1) The frame number from which particular block motion starts and 2) Type of motion (zooming out/in, transversal motion in plane etc.). SEI helps to recover any information lost due to transmission error.

AVS-M encoder complexity variable block sizes in Inter Mode. It supports 9 intra_4*4 mode and 1 Direct_intra prediction mode. Full search for motion estimation gives the optimum result, but that comes along with implementation complexity. For example, assuming FS(full search) and M block types, N reference frames and a search range for each reference frame and block type equal to +/- W, we need to examine N x M x (2W + 1)^2 positions compared to only (2W + 1)^2 positions for a single reference/block type.

Continued… 7 inter prediction modes because of 7 different block sizes, 9 intra_4*4 modes and 1 direct intra prediction mode. ¼ and ½ pixel accuracy in motion vector estimation.

Various techniques to reduce complexity Intra mode selection algorithm[26]. Only intra spatial-prediction scheme[27]. Fast mode decision algorithm for intra prediction for H.264/AVC [28]. Dynamic control of motion estimation search parameters for low complexity H.264[29]. Adaptive algorithm for fast motion estimation [30]. Data mining algorithm for fast motion estimation [2].

Data mining algorithm C4.5 Extracts information from data automatically, by computational and statistical methods. Based on the information extracted, develops trees. These trees give the decision statement for mode decision Takes decision based on metrics such as MB mean, MB variance, amplitude of edge detection, residual variance etc.

Goal of this project:- Implement data mining algorithm c4.5 to decide the inter prediction block mode.

Implementation steps:- Select number of frames of a video sequence in QCIF as training sequences. Obtain the required attributes off line Encode the training sequence using full complexity AVS-M encoder Store the attributes calculated off line and mode decision taken by encoder in ARFF file Feed this ARFF file to weka tool, which will give decision tree similar to that of figure 10.

Continued… Mask the motion estimation part in the actual AVS-M encoder Overwrite that with if-else statements based on the decision tree Compare the performance of the simple codec with actual AVS-M

AVS-M execution and mode &attribute extraction:

Input parameters defined Akiyo_qcif Foreman_qcif Foreman_cif Frame size 176*144 352*288 No. of frames coded 60 Intra period 20 QP_first frame 28 QP_rest frames 40 Frames/sec 30 Output file name test.avs Recon file name Test_rec_.yuv Table 3: Parameters defined in encoder.cfg

Encoder performance: Table 4: AVS-M performance Parameter Akiyo_qcif Foreman_qcif Foreman_cif Original file size 2227.5 Kbytes 8910 Kbytes Encoded file size 20833 bytes 15376 bytes 54069 bytes 172468 bytes Reconstructed file size Decoded file size Compression Ratio 109.487 : 1 148.34 : 1 42.168 : 1 52.90 : 1 SNR(Y) 38.43 dB 38.74 dB 37.31 dB 37.33 dB SNR(U) 40.20 dB 40.33 dB 41.05 dB 40.99 dB SNR(V) 40.98 dB 41.21 dB 42.15 dB 43.31 dB SNR (YUV) 39.039 dB 39.31 dB 38.325 dB Encoding time 142.08 sec. 145.127 sec 211.07 sec 903.202 sec Decoding time 12.391 sec 20.54 sec 22.94 sec 171.87 sec Bit rate 83.9295 Kbps 61.105 Kbps 215.85 Kbps 689.87 Kbps Table 4: AVS-M performance

Encoder output for foreman_qcif sequence Figure 7: AVS-M output sceenshot

AVS-M decoder output: Figure 8: AVS-M decoder screenshot

AVS-M mode decisions: Figure 9: AVS-M mode decisions extracted

Encoded and decoded frame: a. Original Akiyo sequence b. Reconstructed Akiyo sequence c. Decoded Akiyo sequence Fig. 10: 45th frame: a. original frame b. reconstructed frame on the encoder side c. Decoded frame on the decoder side

Original, reconstructed and decoded foreman_cif frame: Figure 11:50th frame: a. original frame b. reconstructed frame on the encoder side c. Decoded frame on the decoder side

Original, reconstructed and decoded foreman_qcif frame: Figure 12:50th frame: a. original frame b. reconstructed frame on the encoder side c. Decoded frame on the decoder side

.arff file: Figure 13: .arff file look-how

Further plan: Get a decision tree from weka tool for attributes: mean, variance, and edge vector with mode decision as class. Embed the c++ code to extract attributes into AVS-M to extract attributes on line for all the test sequences. Mask the motion estimation part in AVS-M and implement the decision tree obtained in step-1.

Example of the decision tree generated by C4.5 Figure 13: Decision tree generated by weka tool

References: [1]http://ee.uta.edu/Dip/Courses/EE5359/Multimedia%20Processing%20project%20report%20final.pdf ; course website UTA [2]P. Carrillo, H.Kalva and T.Pin “Low complexity H.264 video encoding", SPIE. vol.7443, Paper # 74430A, Aug. 2009 [3]Kusrini1, Sri Hartati2”Implementation of C4.5 algorithm to evaluate the cancellation possibility of new student applicants at STMIK AMIKOM YOGYKARTA”, Proceedings of the International Conference on Electrical Engineering and Informatics Institute Teknologi Bandung, Indonesia June 17-19, 2007 [4]S. Saponara, et al “Adaptive algorithm for fast motion estimation in H.264/MPEG-4 AVC”, Proc. Eusipco2004, pp. 569 – 572, Wien, Sept. 2004 [5]Décisions tree basics : http://dms.irb.hr/tutorial/tut_dtrees.php [6]Weka tool software :http://www.cs.waikato.ac.nz/ml/weka/

Continued… [7]X. Jing and L. P. Chua, “An efficient inter mode decision approach for H.264 video coding” International Conference on Multimedia and Expo (ICME), pp. 1111-1114, July 2004. [8]Software download: ftp://159.226.42.57/public/avs_doc/avs_software [9]Power point slides by L.Yu, chair of AVS video : http://www-ee.uta.edu/dip/Courses/EE5351/ISPACSAVS.pdf [10]L.Fan, “Mobile multimedia broadcasting standards”, ISBN: 978-0-387-78263-8, Springer US, 2009 [11]AVS working group official website, http://www.avs.org.cn [12]Test sequences can be downloaded from the site http://trace.eas.asu.edu/yuv/index.html [13]Y.Xiang et al., “Perceptual evaluation of AVS-M based on mobile platform”, Congress on Image and Signal Processing, 2008, vol. 2, Issue, pp76 – 79, 27-30 May 2008.

Continued… [14]M.Liu and Z.Wei. “A fast mode decision algorithm for intra prediction in AVS-M video coding”, vol.1, ICWAPR apos; 07, Issue, 2-4, pp.326 – 331, Nov. 2007. [15]L.Yu et al., “Overview of AVS-Video: Tools, performance and complexity,” SPIE VCIP, vol. 5960, pp. 596021-1~ 596021-12, Beijing, China, July 2005. [16]L.Yu , S.Chen, J.Wang, “Overview of AVS-video coding standards” special issue on AVS, SP:IC, vol. 24, p. 247-262, April 2009. [17]Y.Shen, et. al., “A simplified intra prediction method”, AVS Doc. AVS-M 1419, 2004. [18]F.Yi, et al., “An improvement of intra prediction mode coding”, AVS Doc. AVS-M 1456, 2004. [19]L.Xiong, “Improvement of chroma intra prediction”, AVS Doc. AVS-M1379, 2004

Continued… [20]X.Mao, et al., “Adaptive block size coding for AVS-X profile”. AVS Doc. AVS-M2372, 2008. [21]R.Wang , et al., “Sub-pixel motion compensation interpolation filter in AVS”, 2004 IEEE International Conference on Multimedia and Expo, 1:93-96, 2004. [22]F.Yi et al., “Low-complexity tools in AVS Part 7”, J. Computer Science Technology, vol.21, pp. 345-353, May. 2006 [23]W.Gao and T.Huang “AVS Standard -Status and Future Plan”, Workshop on Multimedia New Technologies and Application, Shenzhen, China, Oct. 2007. [24]W.Gao et al., “AVS– the Chinese next-generation video coding standard,” National Association of Broadcasters, Las Vegas, 2004. [25] Z.Ma, et al., “Intra coding of AVS Part 7 video coding standard”, J. Computer Science Technology, vol.21, Feb.2006.

Continued… [26] Jongho Kim et.al , “H.264 Intra Mode Decision for Reducing Complexity Using Directional Masks and Neighboring Modes”, PSIVT 2006, LNCS 4319, pp. 959 – 968, 2006. [27]Xin, Vetro, “Fast Mode Decision for Intra-only H.264/AVC Coding”, TR 2006-034 May 2006. [28]Pan et. al “Fast Mode Decision Algorithm for Intraprediction in H.264/AVC Video Coding”, IEEE Transactions On Circuits And Systems For Video Technology. Vol 15, No. 7, July 2005 [29]S. Saponara et. al “Dynamic Control of Motion Estimation Search Parameters for Low Complex H.264 Video Coding”, IEEE Transactions on Consumer Electronics, Vol. 52, No. 1, FEBRUARY 2006. [30]Cheng-Chang Lien, Chung-Ping Yu, “A Fast Mode Decision Method for H.264/AVC Using the Spatial-Temporal Prediction Scheme”, ICPR 2006