Pyramid Vector Quantization for Video Coding

Slides:



Advertisements
Similar presentations
Time-Frequency Analysis Analyzing sounds as a sequence of frames
Advertisements

Medical Image Registration Kumar Rajamani. Registration Spatial transform that maps points from one image to corresponding points in another image.
MPEG4 Natural Video Coding Functionalities: –Coding of arbitrary shaped objects –Efficient compression of video and images over wide range of bit rates.
INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, ICT '09. TAREK OUNI WALID AYEDI MOHAMED ABID NATIONAL ENGINEERING SCHOOL OF SFAX New Low Complexity.
Ray Tracing CMSC 635. Basic idea How many intersections?  Pixels  ~10 3 to ~10 7  Rays per Pixel  1 to ~10  Primitives  ~10 to ~10 7  Every ray.
Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.
MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.
CELLULAR COMMUNICATIONS 5. Speech Coding. Low Bit-rate Voice Coding  Voice is an analogue signal  Needed to be transformed in a digital form (bits)
Light Field Compression Using 2-D Warping and Block Matching Shinjini Kundu Anand Kamat Tarcar EE398A Final Project 1 EE398A - Compression of Light Fields.
Overview of Adaptive Multi-Rate Narrow Band (AMR-NB) Speech Codec
Department of Computer Engineering University of California at Santa Cruz Data Compression (3) Hai Tao.
Redundant Bit Vectors for the Audio Fingerprinting Server John Platt Jonathan Goldstein Chris Burges.
Spatial and Temporal Data Mining
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
1 Geometry A line in 3D space is represented by  S is a point on the line, and V is the direction along which the line runs  Any point P on the line.
3D orientation.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
CS :: Fall 2003 MPEG-1 Video (Part 1) Ketan Mayer-Patel.
CMPT 365 Multimedia Systems
Representation and Compression of Multi-Dimensional Piecewise Functions Dror Baron Signal Processing and Systems (SP&S) Seminar June 2009 Joint work with:
MPEG2 FGS Implementation ECE 738 Advanced Digital Image Processing Author: Deshan Yang 05/01/2003.
Image and Video Compression
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.
JPEG C OMPRESSION A LGORITHM I N CUDA Group Members: Pranit Patel Manisha Tatikonda Jeff Wong Jarek Marczewski Date: April 14, 2009.
Game Physics – Part IV Moving to 3D
Section 13.4 The Cross Product. Torque Torque is a measure of how much a force acting on an object causes that object to rotate –The object rotates around.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
10/6/2015 3:12 AM1 Data Encoding ─ Analog Data, Digital Signals (5.3) CSE 3213 Fall 2011.
INTERPOLATED HALFTONING, REHALFTONING, AND HALFTONE COMPRESSION Prof. Brian L. Evans Collaboration.
CE Digital Signal Processing Fall 1992 Waveform Coding Hossein Sameti Department of Computer Engineering Sharif University of Technology.
Layered Coding Basic Overview. Outline Pyramidal Coding Scalability in the Standard Codecs Layered Coding with Wavelets Conclusion.
1 PCM & DPCM & DM. 2 Pulse-Code Modulation (PCM) : In PCM each sample of the signal is quantized to one of the amplitude levels, where B is the number.
Image Compression Supervised By: Mr.Nael Alian Student: Anwaar Ahmed Abu-AlQomboz ID: IT College “Multimedia”
16/5/ :47 UML Computer Graphics Conceptual Model Application Model Application Program Graphics System Output Devices Input Devices API Function.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
ITU-T G.729 EE8873 Rungsun Munkong March 22, 2004.
Outline Transmitters (Chapters 3 and 4, Source Coding and Modulation) (week 1 and 2) Receivers (Chapter 5) (week 3 and 4) Received Signal Synchronization.
1 Polygonal Techniques 이영건. 2 Introduction This chapter –Discuss a variety of problems that are encountered within polygonal data sets The.
DIGITAL IMAGE. Basic Image Concepts An image is a spatial representation of an object An image can be thought of as a function with resulting values of.
MPEG-4: Multimedia Coding Standard Supporting Mobile Multimedia System -MPEG-4 Natural Video Coding April, 2001.
11/6/ :55 Graphics II Introduction to Parametric Curves and Surfaces Session 2.
Vector Quantization CAP5015 Fall 2005.
Rendering Pipeline Fall, D Polygon Rendering Many applications use rendering of 3D polygons with direct illumination.
CSE 681 Brief Review: Vectors. CSE 681 Vectors Basics Normalizing a vector => unit vector Dot product Cross product Reflection vector Parametric form.
Signal Prediction and Transformation Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD
Perceptually-Driven Video Coding with the Daala Video Codec
Singular Value Decomposition and its applications
Daala: A Perceptually-Driven Still Picture Codec
Pyramid Vector Quantization
JPEG Compression What is JPEG? Motivation
A New Technique for Sidelobe Suppression in OFDM Systems
- photometric aspects of image formation gray level images
High-Quality, Low-Delay Music Coding in the Opus Codec
Digital Communications Chapter 13. Source Coding
Vocoders.
ENEE 631 Project Video Codec and Shot Segmentation
FP1 Matrices Transformations
PCM & DPCM & DM.
MPEG4 Natural Video Coding
Foundation of Video Coding Part II: Scalar and Vector Quantization
Wavelet-based Compression of 3D Mesh Sequences
Mathematics for Computer Graphics
Image Filtering with GLSL
LSH-based Motion Estimation
Predictive Grayscale Image Coding Scheme Using VQ and BTC
Mathematics for Computer Graphics
GPAT – Chapter 7 Physics.
Templates and Image Pyramids
Presentation transcript:

Pyramid Vector Quantization for Video Coding Jean-Marc Valin Daala Coding Party Sep 2013

Motivations Pyramid vector quantization is a key technique used in Opus (both SILK and CELT parts) Investigate PVQ for a video codec (Daala) Potential advantages Preserves energy (details) even when details are imperfect (instead of blurring) Implicit activity masking Better representation of coefficients

Gain-Shape Quantization Represent a vector as magnitude multiplied by unit- norm vector (radius + point on sphere) Amount of texture vs exact details Code magnitude separately Adjust resolution of the sphere based on the magnitude

Pyramid Vector Quantizer (PVQ) Place K unit pulses in N dimensions Up to N = 1024 dimensions Normalize to unit norm (L2)

Codebook for N=3 and different K

Distortion, N and K Fewer pulses needed D = N2/(24K2)

PVQ vs Scalar Quantization -6 dB/bit

Prediction Unlike CELT, we want to predict the vectors PVQ on the residual loses energy preservation Apply prediction in the normalized vector Use Householder reflection to align prediction with one axis Encode magnitude of the residual as an angle

2-D Projection Input Input

2-D Projection Input+prediction Prediction Input

2-D Projection Input+prediction Compute reflection plane Prediction

2-D Projection Input+prediction Compute reflection plane Apply reflection Prediction Input

2-D Projection Input+prediction Compute reflection plane Apply reflection Compute/code angle Prediction θ Input

2-D Projection Input+prediction Compute reflection plane Apply reflection Compute/code angle Code other dimensions Prediction θ Input

Activity Masking Artefacts are easier to detect on flat areas they on textured areas Code unit-norm vector with a resolution that depends on the gain (texture) Code companded gain gc = g Implicit activity masking built into the bitstream

Open Questions How to split into bands Avoid wasting bits on still video Quantization matrix Take advantage of correlation/prediction in gain and angle Rate-Distortion Optimization Fast RDO PVQ search?