EE 5359 TOPICS IN SIGNAL PROCESSING

Slides:



Advertisements
Similar presentations
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Advertisements

Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
Technion - IIT Dept. of Electrical Engineering Signal and Image Processing lab Transrating and Transcoding of Coded Video Signals David Malah Ran Bar-Sella.
1 Video Coding Concept Kai-Chao Yang. 2 Video Sequence and Picture Video sequence Large amount of temporal redundancy Intra Picture/VOP/Slice (I-Picture)
SWE 423: Multimedia Systems
H.264/AVC Baseline Profile Decoder Complexity Analysis Michael Horowitz, Anthony Joch, Faouzi Kossentini, and Antti Hallapuro IEEE TRANSACTIONS ON CIRCUITS.
Ch. 6- H.264/AVC Part I (pp.160~199) Sheng-kai Lin
Overview of Error Resiliency Schemes in H.264/AVC Standard Sunil Kumar, Liyang Xu, Mrinal K. Mandal, and Sethuraman Panchanathan Elsevier Journal of Visual.
Video Transmission Adopting Scalable Video Coding over Time- varying Networks Chun-Su Park, Nam-Hyeong Kim, Sang-Hee Park, Goo-Rak Kwon, and Sung-Jea Ko,
Overview of AVS Video Standard Kai-Chao Yang. Outline Audio Video Coding Standard (AVS) Audio Video Coding Standard (AVS) AVS Schedule AVS Schedule AVS.
Department of Computer Engineering University of California at Santa Cruz Video Compression Hai Tao.
H.264 / MPEG-4 Part 10 Nimrod Peleg March 2003.
BY AMRUTA KULKARNI STUDENT ID : UNDER SUPERVISION OF DR. K.R. RAO Complexity Reduction Algorithm for Intra Mode Selection in H.264/AVC Video.
H.264/AVC for Wireless Applications Thomas Stockhammer, and Thomas Wiegand Institute for Communications Engineering, Munich University of Technology, Germany.
HARDEEPSINH JADEJA UTA ID: What is Transcoding The operation of converting video in one format to another format. It is the ability to take.
MPEG-2 Digital Video Coding Standard
EE 5359 H.264 to VC 1 Transcoding Vidhya Vijayakumar Multimedia Processing Lab MSEE, University of Arlington Guided.
IMPLEMENTATION AND PERFORMANCE ANALYSIS of Dirac VIDEO CODING STANDARD AND COMPARISON WITH AVS CHINA Under the guidance of Dr. K R. Rao Electrical Engineering.
By Sudeep Gangavati ID EE5359 Spring 2012, UT Arlington
EE 5359 TOPICS IN SIGNAL PROCESSING Interim Report ANALYSIS OF AVS-M FOR LOW PICTURE RESOLUTION MOBILE APPLICATIONS Under Guidance of: Dr. K. R. Rao Dept.
Page 19/15/2015 CSE 40373/60373: Multimedia Systems 11.1 MPEG 1 and 2  MPEG: Moving Pictures Experts Group for the development of digital video  It is.
Video Coding. Introduction Video Coding The objective of video coding is to compress moving images. The MPEG (Moving Picture Experts Group) and H.26X.
M P E G A Presentation by Sebastian Graf and Thomas Winterscheid.
Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project.
MPEG-1 and MPEG-2 Digital Video Coding Standards Author: Thomas Sikora Presenter: Chaojun Liang.
Video Compression Techniques By David Ridgway.
Comparative study of various still image coding techniques. Harish Bhandiwad EE5359 Multimedia Processing.
Windows Media Video 9 Tarun Bhatia Multimedia Processing Lab University Of Texas at Arlington 11/05/04.
Concepts of Multimedia Processing and Transmission IT 481, Lecture 5 Dennis McCaughey, Ph.D. 19 February, 2007.
IMPLEMENTATION AND PERFORMANCE ANALYSIS of Dirac VIDEO CODING STANDARD AND COMPARISON WITH AVS CHINA Under the guidance of Dr. K R. Rao Electrical Engineering.
STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC) EE 5359-Multimedia Processing Spring 2012 Dr. K.R Rao By: Sumedha Phatak( )
MULTIMEDIA PROCESSING (EE 5359) SPRING 2011 DR. K. R. RAO PROJECT PROPOSAL Error concealment techniques in H.264 video transmission over wireless networks.
By, ( ) Low Complexity Rate Control for VC-1 to H.264 Transcoding.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison of H.264/MPEG4.
Codec structuretMyn1 Codec structure In an MPEG system, the DCT and motion- compensated interframe prediction are combined. The coder subtracts the motion-compensated.
June, 1999 An Introduction to MPEG School of Computer Science, University of Central Florida, VLSI and M-5 Research Group Tao.
Video Compression Standards for High Definition Video : A Comparative Study Of H.264, Dirac pro And AVS P2 By Sudeep Gangavati EE5359 Spring 2012, UT Arlington.
EE 5359 TOPICS IN SIGNAL PROCESSING PROJECT ANALYSIS OF AVS-M FOR LOW PICTURE RESOLUTION MOBILE APPLICATIONS Under Guidance of: Dr. K. R. Rao Dept. of.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
Implementation and comparison study of H.264 and AVS China EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
Watermarking Part 2: Future Work Electrical and Computer Engineering Department Villanova University 18 August 2004 Robert J. Berger II Michael P. Marcinak.
Figure 1.a AVS China encoder [3] Video Bit stream.
PERFORMANCE ANALYSIS OF AVS-M AND ITS APPLICATION IN MOBILE ENVIRONMENT By Vidur Vajani ( ) Under the guidance of Dr.
IMPLEMENTATION OF H.264/AVC, AVS China Part 7 and Dirac VIDEO CODING STANDARDS Under the guidance of Dr. K R. Rao Electrical Engineering Department The.
-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.
Fast motion estimation and mode decision for H.264 video coding in packet loss environment Li Liu, Xinhua Zhuang Computer Science Department, University.
Vamsi Krishna Vegunta University of Texas, Arlington
UNDER THE GUIDANCE DR. K. R. RAO SUBMITTED BY SHAHEER AHMED ID : Encoding H.264 by Thread Level Parallelism.
Study and Performance Comparison of H.264/AVC, Dirac and AVS China Part 7 EE5359 Project Fall 2010 Touseef Khan
Video Compression—From Concepts to the H.264/AVC Standard
Block-based coding Multimedia Systems and Standards S2 IF Telkom University.
Video Compression and Standards
COMPARATIVE STUDY OF HEVC and H.264 INTRA FRAME CODING AND JPEG2000 BY Under the Guidance of Harshdeep Brahmasury Jain Dr. K. R. RAO ID MS Electrical.
MPEG CODING PROCESS. Contents  What is MPEG Encoding?  Why MPEG Encoding?  Types of frames in MPEG 1  Layer of MPEG1 Video  MPEG 1 Intra frame Encoding.
Time Optimization of HEVC Encoder over X86 Processors using SIMD Kushal Shah Advisor: Dr. K. R. Rao Spring 2013 Multimedia.
Introduction to MPEG Video Coding Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06
Principles of Video Compression Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06
Implementation and comparison study of H.264 and AVS china EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala.
EE 5359 MULTIMEDIA PROCESSING PROJECT PROPOSAL SPRING 2016 STUDY AND PERFORMANCE ANALYSIS OF HEVC, H.264/AVC AND DIRAC By ASHRITA MANDALAPU
Multi-Frame Motion Estimation and Mode Decision in H.264 Codec Shauli Rozen Amit Yedidia Supervised by Dr. Shlomo Greenberg Communication Systems Engineering.
MPEG Video Coding I: MPEG-1 1. Overview  MPEG: Moving Pictures Experts Group, established in 1988 for the development of digital video.  It is appropriately.
H. 261 Video Compression Techniques 1. H.261  H.261: An earlier digital video compression standard, its principle of MC-based compression is retained.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.
ENEE 631 Project Video Codec and Shot Segmentation
Standards Presentation ECE 8873 – Data Compression and Modeling
MPEG4 Natural Video Coding
Presentation transcript:

EE 5359 TOPICS IN SIGNAL PROCESSING Final Report ANALYSIS OF AVS-M FOR LOW PICTURE RESOLUTION MOBILE APPLICATIONS Under Guidance of: Dr. K. R. Rao Dept. of Electrical Engineering , UT Arlington Submitted by: Aditya R Deshkar aditya.deshkar@mavs.uta.edu Student ID:1000848085

OBJECTIVE Audio video standard for Mobile (AVS-M) [1] is seventh part of the standard developed by Audio Video coding Standard (AVS) workgroup of China. Provide insight into AVS-M video coding standard (Jiben Profile) [2] Analyze its architecture, features and data formats for its use in low complexity and low picture resolution mobile applications.

History[1] Figure 1 : Audio video coding standards history [1]

AVS CHINA PROFILES[4] Table 1: AVS China Profiles and Applications[4]

Various AVS China Parts[3] Table 1: AVS China Parts [3]

Main Characteristics of AVS Video standards: Streamlined and highly efficient video coder Provide optimization between absolute coding performance and complexity of implementation Designed to provide near optimum performance Provide low cost implementations

Data Formats Used in AVS[5] 1) Progressive scan format It is a method of storing and transmitting images , where in , all lines of each frame is drawn in sequence 2) Interlace scan format It involves alternate drawing of odd and even lines

Advantages of Progressive Scan Format Efficiency in operation of motion estimation[11] Significantly lower bit rate required for encoding Less complexity involved in motion compensation[11]

Layered Structure[5] Figure 2 :Layered Structure of AVS China[5] http://www - ee.uta.edu/Dip/Courses/EE5359/index.html : UTA Multimedia Processing Website Figure 2 :Layered Structure of AVS China[5]

Sequence[3] Figure 3 : Video Sequence example[3] The Sequence layer provides an entry point into the coded video. Sequence headers should be placed in the bit stream to support user access appropriately for the given distribution medium.

Picture[3] The picture layer provides the coded representation of a video frame. It comprises a header with mandatory and optional parameters and optionally with user data. There are 3 types of pictures defined by the AVS: 1) I- Pictures (Intra Pictures) 2) P-Pictures (Predicted Pictures) 3) B-Pictures (Interpolated Pictures)

Figure 4 :SLICE STRUTURE FOR AVS PART 7[3] Slice comprises a series of Macro blocks. The Slice structure provides the lowest-layer mechanism for re-synchronizing the bit stream in case of transmission error Figure 4 :SLICE STRUTURE FOR AVS PART 7[3]

Macro blocks and Blocks[3] Picture is divided into macro blocks The upper left sample of each MB should not exceed picture boundary. Macro block partitioning is used for motion compensation. The number in each rectangle specifies the order of appearance of motion vectors. Figure 5 : MACROBLOCK PARTITIONING[3]

AVS-M Encoder[5] Figure 6 : AVS-M Encoder [5]

AVS-M Decoder[5] Figure 7: AVS-M Decoder [5]

Network Abstraction Layer(NAL) Unit[12] Packetization layer– Prefixes certain headers to encoded bit streams. NAL is designed for : Provide network friendly environment Address video related applications Covert AVS encoded raw bit stream into NAL unit for secure transfer over network

Figure 8 : NAL Unit Syntax [12]

Table 3: NAL Unit types[13]

Intra Prediction[4],[13] It significantly reduces the complexity and maintains a comparable performance. There are two types of Intra Prediction which are used. Intra _4x4 [13] Direct Intra Prediction (DIP) [4]

Intra_4x4 [13] Figure 9 : INTRA_4X4 PREDICTION [13] Prediction using prior decoded samples in adjacent block

For each 4x4 block, one of the nine predictions modes can be utilized to exploit spatial correlation Figure 10 : NINE INTRA_4X4 PREDICTION MODES OF AVS PART 7 [4]

Direct Intra Prediction [4] Direct intra prediction mainly contains 5 steps. Step 1: All 16 4×4 blocks in a MB use their MPMs to do Intra_4×4 prediction and calculate RDCost(DIP) of this MB. Step 2: Mode search of Intra_4×4, find the best intra prediction mode of each block, and calculate RDCost(Intra_4x4). Step 3: Compare RDCost(DIP) and RDCost(Intra_4x4). If RDCost(DIP) is less than RDCost(Intra_4x4), DIP flag equals to 1 then go to step 4, else DIP lag equals to 0 go to step5. Step 4: Encode the MB using DIP and finish encoding of this MB. Step 5: Encode the MB using ordinary Intra_ 4×4 and finish encoding of this MB  

Inter-frame Prediction [13] The positions of the integer, half and quarter pixel samples are shown in the figure. Capital letters indicate integer sample positions, while small letters indicate half and quarter sample positions. Figure 11 :The position of integer, half and quarter pixel samples[13]

Inter-Frame Prediction If the half_pixel_mv_flag is equal to 1, the precision of the motion vector is up to ½ pixel, otherwise the precision of motion vector is up to ¼ pixel. When half_pixel_mv_flag is not present in the bit stream, it shall be inferred to be 11. The interpolated values at half sample positions can be obtained using 8 tap filter F1 = (-1, 4,-12,41,41,-12,4,-1) and 4 tap filter F2 = (-1, 5,5, 1).

Entropy Coding[5] Reduce statistical correlation among syntax elements. Entropy coding involves mapping from a video signal after prediction and transforming to a variable length coded bit stream. AVS M uses Exp Golomb code, to encode syntax elements such as quantized coefficients, macroblock coding type, and motion vectors. 18 coding tables are used in quantized coefficients encoding. The encoder uses the run and the absolute value of the current coefficient to select the table

FLOW DIAGRAM OF ENTROPY CODING IN AVS PART 7 [5]

Context based Adaptive 2-DVariable Length Coding In AVS an efficient context based adaptive 2D variable length coding is designed for coding transform coefficients in a 4x4 block. The transform coefficients are mapped into one dimensional (level, run) sequence by the reverse zigzag scan. It employs 2D joint VLC to remove the redundancy between the levels and runs in transform coefficient blocks. It employs multiple conditionally trained 2D VLC tables to better match different (level, run) ‘s probability distributions at different coding phases by automatic table switching. It makes use of an improved table switching method and an improved escape coding method.

Error Concealment • To deal with the transmission error problem numerous techniques have been specified which are: Forward Error Concealment: Encoder plays the primary role. Backward error Concealment: Estimation of lost information due to transmission errors in which the decoder fulfills the error concealment task. Interactive error Concealment: Best reconstruction quality, but difficult to implement

Error Resilience With the purpose of error concealment, scene signaling in SEI illustrates two kinds of information: (1) frames in which the short change starts and ends; and (2) the type of the scene transition. If a part of the current picture with which a scene information SEI message is associated is lost or corrupted, the decoder may apply a spatial error concealment algorithm to construct the lost or corrupted parts of the current picture if the scene has changed since the previous received picture. Otherwise the decoder may use a spatiotemporal error concealment algorithm

Comparison between AVS Part 7 and H.264 baseline Profile

Experimental Results The software which has been used to perform for AVS China Part 7 it is RM 3.3.7 [9]. Microsoft Visual Studio Professional 2012 [14] has been used to run the code and build the project for the codec. After building the project, code will generate two application files namely encode.exe and decode.exe. We run these two files using appropriate and necessary parameters and obtain the final result which is a decoded file. The original file and decoded file are than evaluated using MSU video quality measurement tool. The values of PSNR[8], MSE and SSIM[3] are obtained from it.

Software used for Quality Measurement[15] Figure 12 : Screenshot of MSU Video Quality Measurement Tool software

Input sequence : mother-daughter_qcif.yuv[16] BIT RATE, PSNR Total No: of frames: 30 frames. Original file size : 1139Kb Width: 176. Height: 144. Frame rate: 30 fps Original Image

Video quality at various QP values

Results for mother-daughter_qcif.yuv Sequence Compressed file size, compression ratio, bit rate, PSNR and SSIM at various QP for mother-daughter_qcif sequence

Figure 13: PSNR vs Bit Rate

Figure 14: SSIM vs Bit Rate  

Input sequence : news_cif.yuv[16] BIT RATE, PSNR Input Sequence: news_cif.yuv Total No: of frames: 30 frames. Original file size : 14850Kb Width: 288. Height: 352. Frame rate: 25 fps Original Image

Video quality at various QP values QP = 31 QP = 63 Video quality at various QP values QP = 10

Results for Sequence Compressed file size, compression ratio, bit rate, PSNR and SSIM at various QP for news_cif sequence

Figure 15:PSNR vs Bit Rate

Figure 16:SSIM vs Bit Rate

Conclusion AVS part 7 targets low complexity and low picture resolution mobility applications. The AVS encoder and decoder are implemented using AVS M software. Tests are carried out on various QCIF and CIF sequences. The performance of AVS-china was analyzed by varying the quantization parameter (QP). The PSNR and bit rate and SSIM were calculated.

Acronyms AU Access Unit AVS Audio Video Standard AVS-M Audio Video Standard for mobile B-Frame Interpolated Frame CAVLC Context Adaptive Variable Length Coding CBP Coded Block Pattern CIF Common Intermediate Format DIP Direct Intra Prediction DPB Decoded Picture Buffer EOB End of Block HD High Definition HHR Horizontal High Resolution ICT Integer Cosine Transform IDR Instantaneous Decoding Refresh I-Frame Intra Frame IMS IP Multimedia Subsystem ITU-T International Telecommunication Union MB Macroblocks

MPEG Moving Picture Experts Group MPM Most Probable Mode MV Motion Vector NAL Network Abstraction Layer P-Frame Predicted Frame PIT Prescaled Integer Transform PPS Picture Parameter Set QCIF Quarter Common Intermediate Format QP Quantization Parameter RD Cost Rate Distortion Cost SAD Sum of Absolute Differences SD Standard Definition SEI Supplemental Enhancement Information SPS Sequence Parameter Set VLC Variable Length Coding

References: [1] AVS working group official website, http://www. avs References: [1] AVS working group official website, http://www.avs.org.cn [2] W. Gao et al, "AVS– the Chinese next-generation video coding standard," National Association of Broadcasters, Las Vegas, 2004 [3] L.Fan et al, "Overview of AVS Video Standard", IEEE International conference on multimedia and expo, Vol 1, pp. 423 - 426, June 2004. [4] B. Tang, Y. Chen and W. Ji "AVS Encoder Performance and Complexity Analysis Based on Mobile Video Communication", 2009 International Conference on Communications and Mobile Computing [5] L.Fan, "Mobile Multimedia Broadcasting Standards", Springer US, 2009 [6] AVS-M Reference Software, http://www.avs.org.cn/fruits/en/softList.asp [7] Y. Cheng et al, "Analysis and application of error concealment tools in AVS-M decoder", Journal of Zhejiang University –Science A, vol. 7, pp. 54-58, Jan 2006 [8] Website for PSNR, http://en.wikipedia.org/wiki/Peak_signal-to-noise_ratio [9] AVS China software: Part 7: ftp://124.207.250.92/incoming/video_codec/AVS1_P7 [10] S. Ma , S. Wang, W. Gao, "Overview of IEEE 1857 Video Coding Standards” IEEE ICIP, pp. 1500-1504, September 2013 , Melbourne, Australia (Several papers related to AVS China are in IEEE ICIP,2013) [11] Lu Yu et al, " Overview of AVS-video coding standards", Signal Processing: Image Communication, pp. 247-262, Nov 2009.

[12] Y. Wang ” AVS_M: From standards to Applications”, Journal of Computer Science and Technology - Special section on China AVS standard Vol.21. No.3 pp. 332-344, May 2006 [13] L. Yu, “AVS Project and AVS-Video Techniques”, http://www-ee.uta.edu/dip/Courses/EE5351/ISPACSAVS.pdf, Dec.13, 2005 ISPACS 2005 [14] Microsoft Visual Studio Professional 2012 : http://www.microsoft.com/en-us/download/details.aspx?id=34673 [15] MSU video quality measurement tool: http://www.softrecipe.com/Download/msu_video_quality_measurement_tool.html [16] Test video sequences : http://trace.eas.asu.edu/yuv/ [17] M. Liu and Z. Wei, “A fast mode decision algorithm for intra prediction in AVS-M video coding” Vol. 1, ICWAPR apos;07,Issue, 2-4, pp.326 -331, Nov. 2007. [18] Y. Cheng et al, “Analysis and application of error concealment tools in AVS-M decoder”, Journal of Zhejiang University –Science A, vol. 7, pp. 54-58, Jan 2006. [19] S.Hu, X.Zhang and Z.Yang, “Efficient Implementation of Interpolation for AVS”, Congress on Image and Signal Processing,2008. Vol 3, pp133 –138, 27-30 May 2008