Download presentation
Presentation is loading. Please wait.
1
EE 5359 TOPICS IN SIGNAL PROCESSING
Final Report ANALYSIS OF AVS-M FOR LOW PICTURE RESOLUTION MOBILE APPLICATIONS Under Guidance of: Dr. K. R. Rao Dept. of Electrical Engineering , UT Arlington Submitted by: Aditya R Deshkar Student ID:
2
OBJECTIVE Audio video standard for Mobile (AVS-M) [1] is seventh part of the standard developed by Audio Video coding Standard (AVS) workgroup of China. Provide insight into AVS-M video coding standard (Jiben Profile) [2] Analyze its architecture, features and data formats for its use in low complexity and low picture resolution mobile applications.
3
History[1] Figure 1 : Audio video coding standards history [1]
4
AVS CHINA PROFILES[4] Table 1: AVS China Profiles and Applications[4]
5
Various AVS China Parts[3]
Table 1: AVS China Parts [3]
6
Main Characteristics of AVS Video standards:
Streamlined and highly efficient video coder Provide optimization between absolute coding performance and complexity of implementation Designed to provide near optimum performance Provide low cost implementations
7
Data Formats Used in AVS[5]
1) Progressive scan format It is a method of storing and transmitting images , where in , all lines of each frame is drawn in sequence 2) Interlace scan format It involves alternate drawing of odd and even lines
8
Advantages of Progressive Scan Format
Efficiency in operation of motion estimation[11] Significantly lower bit rate required for encoding Less complexity involved in motion compensation[11]
9
Layered Structure[5] Figure 2 :Layered Structure of AVS China[5]
- ee.uta.edu/Dip/Courses/EE5359/index.html : UTA Multimedia Processing Website Figure 2 :Layered Structure of AVS China[5]
10
Sequence[3] Figure 3 : Video Sequence example[3]
The Sequence layer provides an entry point into the coded video. Sequence headers should be placed in the bit stream to support user access appropriately for the given distribution medium.
11
Picture[3] The picture layer provides the coded representation of a video frame. It comprises a header with mandatory and optional parameters and optionally with user data. There are 3 types of pictures defined by the AVS: 1) I- Pictures (Intra Pictures) 2) P-Pictures (Predicted Pictures) 3) B-Pictures (Interpolated Pictures)
12
Figure 4 :SLICE STRUTURE FOR AVS PART 7[3]
Slice comprises a series of Macro blocks. The Slice structure provides the lowest-layer mechanism for re-synchronizing the bit stream in case of transmission error Figure 4 :SLICE STRUTURE FOR AVS PART 7[3]
13
Macro blocks and Blocks[3]
Picture is divided into macro blocks The upper left sample of each MB should not exceed picture boundary. Macro block partitioning is used for motion compensation. The number in each rectangle specifies the order of appearance of motion vectors. Figure 5 : MACROBLOCK PARTITIONING[3]
14
AVS-M Encoder[5] Figure 6 : AVS-M Encoder [5]
15
AVS-M Decoder[5] Figure 7: AVS-M Decoder [5]
16
Network Abstraction Layer(NAL) Unit[12]
Packetization layer– Prefixes certain headers to encoded bit streams. NAL is designed for : Provide network friendly environment Address video related applications Covert AVS encoded raw bit stream into NAL unit for secure transfer over network
17
Figure 8 : NAL Unit Syntax [12]
18
Table 3: NAL Unit types[13]
19
Intra Prediction[4],[13] It significantly reduces the complexity and maintains a comparable performance. There are two types of Intra Prediction which are used. Intra _4x4 [13] Direct Intra Prediction (DIP) [4]
20
Intra_4x4 [13] Figure 9 : INTRA_4X4 PREDICTION [13] Prediction using prior decoded samples in adjacent block
21
For each 4x4 block, one of the nine predictions modes can be utilized to exploit spatial correlation
Figure 10 : NINE INTRA_4X4 PREDICTION MODES OF AVS PART 7 [4]
22
Direct Intra Prediction [4]
Direct intra prediction mainly contains 5 steps. Step 1: All 16 4×4 blocks in a MB use their MPMs to do Intra_4×4 prediction and calculate RDCost(DIP) of this MB. Step 2: Mode search of Intra_4×4, find the best intra prediction mode of each block, and calculate RDCost(Intra_4x4). Step 3: Compare RDCost(DIP) and RDCost(Intra_4x4). If RDCost(DIP) is less than RDCost(Intra_4x4), DIP flag equals to 1 then go to step 4, else DIP lag equals to 0 go to step5. Step 4: Encode the MB using DIP and finish encoding of this MB. Step 5: Encode the MB using ordinary Intra_ 4×4 and finish encoding of this MB
23
Inter-frame Prediction [13]
The positions of the integer, half and quarter pixel samples are shown in the figure. Capital letters indicate integer sample positions, while small letters indicate half and quarter sample positions. Figure 11 :The position of integer, half and quarter pixel samples[13]
24
Inter-Frame Prediction
If the half_pixel_mv_flag is equal to 1, the precision of the motion vector is up to ½ pixel, otherwise the precision of motion vector is up to ¼ pixel. When half_pixel_mv_flag is not present in the bit stream, it shall be inferred to be 11. The interpolated values at half sample positions can be obtained using 8 tap filter F1 = (-1, 4,-12,41,41,-12,4,-1) and 4 tap filter F2 = (-1, 5,5, 1).
25
Entropy Coding[5] Reduce statistical correlation among syntax elements. Entropy coding involves mapping from a video signal after prediction and transforming to a variable length coded bit stream. AVS M uses Exp Golomb code, to encode syntax elements such as quantized coefficients, macroblock coding type, and motion vectors. 18 coding tables are used in quantized coefficients encoding. The encoder uses the run and the absolute value of the current coefficient to select the table
26
FLOW DIAGRAM OF ENTROPY CODING IN AVS PART 7 [5]
27
Context based Adaptive 2-DVariable Length Coding
In AVS an efficient context based adaptive 2D variable length coding is designed for coding transform coefficients in a 4x4 block. The transform coefficients are mapped into one dimensional (level, run) sequence by the reverse zigzag scan. It employs 2D joint VLC to remove the redundancy between the levels and runs in transform coefficient blocks. It employs multiple conditionally trained 2D VLC tables to better match different (level, run) ‘s probability distributions at different coding phases by automatic table switching. It makes use of an improved table switching method and an improved escape coding method.
28
Error Concealment • To deal with the transmission error problem numerous techniques have been specified which are: Forward Error Concealment: Encoder plays the primary role. Backward error Concealment: Estimation of lost information due to transmission errors in which the decoder fulfills the error concealment task. Interactive error Concealment: Best reconstruction quality, but difficult to implement
29
Error Resilience With the purpose of error concealment, scene signaling in SEI illustrates two kinds of information: (1) frames in which the short change starts and ends; and (2) the type of the scene transition. If a part of the current picture with which a scene information SEI message is associated is lost or corrupted, the decoder may apply a spatial error concealment algorithm to construct the lost or corrupted parts of the current picture if the scene has changed since the previous received picture. Otherwise the decoder may use a spatiotemporal error concealment algorithm
30
Comparison between AVS Part 7 and H.264 baseline Profile
31
Experimental Results The software which has been used to perform for AVS China Part 7 it is RM [9]. Microsoft Visual Studio Professional 2012 [14] has been used to run the code and build the project for the codec. After building the project, code will generate two application files namely encode.exe and decode.exe. We run these two files using appropriate and necessary parameters and obtain the final result which is a decoded file. The original file and decoded file are than evaluated using MSU video quality measurement tool. The values of PSNR[8], MSE and SSIM[3] are obtained from it.
32
Software used for Quality Measurement[15]
Figure 12 : Screenshot of MSU Video Quality Measurement Tool software
33
Input sequence : mother-daughter_qcif.yuv[16] BIT RATE, PSNR
Total No: of frames: 30 frames. Original file size : 1139Kb Width: 176. Height: 144. Frame rate: 30 fps Original Image
34
Video quality at various QP values
35
Results for mother-daughter_qcif.yuv Sequence
Compressed file size, compression ratio, bit rate, PSNR and SSIM at various QP for mother-daughter_qcif sequence
36
Figure 13: PSNR vs Bit Rate
37
Figure 14: SSIM vs Bit Rate
38
Input sequence : news_cif.yuv[16]
BIT RATE, PSNR Input Sequence: news_cif.yuv Total No: of frames: 30 frames. Original file size : 14850Kb Width: 288. Height: 352. Frame rate: 25 fps Original Image
39
Video quality at various QP values
QP = 31 QP = 63 Video quality at various QP values QP = 10
40
Results for Sequence Compressed file size, compression ratio, bit rate, PSNR and SSIM at various QP for news_cif sequence
41
Figure 15:PSNR vs Bit Rate
42
Figure 16:SSIM vs Bit Rate
43
Conclusion AVS part 7 targets low complexity and low picture resolution mobility applications. The AVS encoder and decoder are implemented using AVS M software. Tests are carried out on various QCIF and CIF sequences. The performance of AVS-china was analyzed by varying the quantization parameter (QP). The PSNR and bit rate and SSIM were calculated.
44
Acronyms AU Access Unit AVS Audio Video Standard
AVS-M Audio Video Standard for mobile B-Frame Interpolated Frame CAVLC Context Adaptive Variable Length Coding CBP Coded Block Pattern CIF Common Intermediate Format DIP Direct Intra Prediction DPB Decoded Picture Buffer EOB End of Block HD High Definition HHR Horizontal High Resolution ICT Integer Cosine Transform IDR Instantaneous Decoding Refresh I-Frame Intra Frame IMS IP Multimedia Subsystem ITU-T International Telecommunication Union MB Macroblocks
45
MPEG Moving Picture Experts Group
MPM Most Probable Mode MV Motion Vector NAL Network Abstraction Layer P-Frame Predicted Frame PIT Prescaled Integer Transform PPS Picture Parameter Set QCIF Quarter Common Intermediate Format QP Quantization Parameter RD Cost Rate Distortion Cost SAD Sum of Absolute Differences SD Standard Definition SEI Supplemental Enhancement Information SPS Sequence Parameter Set VLC Variable Length Coding
46
References: [1] AVS working group official website, http://www. avs
References: [1] AVS working group official website, [2] W. Gao et al, "AVS– the Chinese next-generation video coding standard," National Association of Broadcasters, Las Vegas, 2004 [3] L.Fan et al, "Overview of AVS Video Standard", IEEE International conference on multimedia and expo, Vol 1, pp , June [4] B. Tang, Y. Chen and W. Ji "AVS Encoder Performance and Complexity Analysis Based on Mobile Video Communication", 2009 International Conference on Communications and Mobile Computing [5] L.Fan, "Mobile Multimedia Broadcasting Standards", Springer US, 2009 [6] AVS-M Reference Software, [7] Y. Cheng et al, "Analysis and application of error concealment tools in AVS-M decoder", Journal of Zhejiang University –Science A, vol. 7, pp , Jan 2006 [8] Website for PSNR, [9] AVS China software: Part 7: ftp:// /incoming/video_codec/AVS1_P7 [10] S. Ma , S. Wang, W. Gao, "Overview of IEEE 1857 Video Coding Standards” IEEE ICIP, pp , September 2013 , Melbourne, Australia (Several papers related to AVS China are in IEEE ICIP,2013) [11] Lu Yu et al, " Overview of AVS-video coding standards", Signal Processing: Image Communication, pp , Nov 2009.
47
[12] Y. Wang ” AVS_M: From standards to Applications”, Journal of Computer Science and Technology - Special section on China AVS standard Vol.21. No.3 pp , May 2006 [13] L. Yu, “AVS Project and AVS-Video Techniques”, Dec.13, 2005 ISPACS 2005 [14] Microsoft Visual Studio Professional 2012 : [15] MSU video quality measurement tool: [16] Test video sequences : [17] M. Liu and Z. Wei, “A fast mode decision algorithm for intra prediction in AVS-M video coding” Vol. 1, ICWAPR apos;07,Issue, 2-4, pp , Nov [18] Y. Cheng et al, “Analysis and application of error concealment tools in AVS-M decoder”, Journal of Zhejiang University –Science A, vol. 7, pp , Jan [19] S.Hu, X.Zhang and Z.Yang, “Efficient Implementation of Interpolation for AVS”, Congress on Image and Signal Processing,2008. Vol 3, pp133 –138, May 2008
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.