Page 15/18/2015 CSE 40373/60373: Multimedia Systems Bluray (http://www.blu-ray.com/faq)  MPEG-2 - enhanced for HD, also used for playback of DVDs and.

Slides:



Advertisements
Similar presentations
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Advertisements

Department of Computer Engineering University of California at Santa Cruz MPEG Audio Compression Layer 3 (MP3) Hai Tao.
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Data Compression CS 147 Minh Nguyen.
Time-Frequency Analysis Analyzing sounds as a sequence of frames
Digital Audio Compression
MPEG4 Natural Video Coding Functionalities: –Coding of arbitrary shaped objects –Efficient compression of video and images over wide range of bit rates.
Chapter 11.3 MPEG-2 MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Simple,
Practical and Scalable Transmission of Segmented Video Sequences to Multiple Players using H.264 Fabian Di Fiore, Panagiotis Issaris Expertise Centre for.
AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 
Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG Further.
MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.
EVENTS: INRIA Work Review Nov 18 th, Madrid.
CELLULAR COMMUNICATIONS 5. Speech Coding. Low Bit-rate Voice Coding  Voice is an analogue signal  Needed to be transformed in a digital form (bits)
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
SWE 423: Multimedia Systems
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
Lecture 14: Spring 2007 MPEG Audio Compression
1 Static Sprite Generation Prof ︰ David, Lin Student ︰ Jang-Ta, Jiang
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li, Fellow, IEEE.
CSc 461/561 Multimedia Systems Part B: 2. Lossy Compression
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
Automatic Key Video Object Plane Selection Using the Shape Information in the MPEG-4 Compressed Domain Berna Erol and Faouzi Kossentini, Senior Member,
MPEG-4 Applications Interactive TV (broadcast) Streaming media on the web (internet) Multimedia titles (CD-ROM) Network games Mobile multimedia (picture.
Multimedia for our time (For Dummies) ISO/IEC Visa Hyoungjune Yi.
Klara Nahrstedt Spring 2011
An Introduction to H.264/AVC and 3D Video Coding.
1 Video Compression 2: Bi-directional Coding Multimedia Systems (Module 4 Lesson 3) Summary: r MPEG Coding m Bi-directional Motion- Compensation r MPEG.
1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.
Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.
Moving PicturestMyn1 Moving Pictures MPEG, Motion Picture Experts Group MPEG is a set of standards designed to support ”Coding of Moving Pictures and Associated.
MPEG-2 Standard By Rigoberto Fernandez. MPEG Standards MPEG (Moving Pictures Experts Group) is a group of people that meet under ISO (International Standards.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 12 – MPEG-2/ MPEG-4 (Part 6) Klara Nahrstedt Spring 2012.
MPEG MPEG-VideoThis deals with the compression of video signals to about 1.5 Mbits/s; MPEG-AudioThis deals with the compression of digital audio signals.
LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.
MPEG: (Moving Pictures Expert Group) A Video Compression Standard for Multimedia Applications Seo Yeong Geon Dept. of Computer Science in GNU.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
CMPT365 Multimedia Systems 1 Media Compression - Video Coding Standards Spring 2015 CMPT 365 Multimedia Systems.
Speech Coding Submitted To: Dr. Mohab Mangoud Submitted By: Nidal Ismail.
Chapter 4 Audio and video compression
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
CS Spring 2011 CS 414 – Multimedia Systems Design Lecture 11 – MP3 Audio & Introduction to MPEG-4 (Part 6) Klara Nahrstedt Spring 2011.
Outline Kinds of Coding Need for Compression Basic Types Taxonomy Performance Metrics.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
Fundamentals of Multimedia Chapter 12 MPEG Video Coding II MPEG-4, 7 Ze-Nian Li & Mark S. Drew.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
MPEG-4 Systems Introduction & Elementary Stream Management
MPEG-4: Multimedia Coding Standard Supporting Mobile Multimedia System -MPEG-4 Natural Video Coding April, 2001.
MPEG-1Standard By Alejandro Mendoza. Introduction The major goal of video compression is to represent a video source with as few bits as possible while.
1 Audio Coding. 2 Digitization Processing Signal encoder Signal decoder samplingquantization storage Analog signal Digital data.
IntroductiontMyn1 Introduction MPEG, Moving Picture Experts Group was started in 1988 as a working group within ISO/IEC with the aim of defining standards.
Page 11/28/2016 CSE 40373/60373: Multimedia Systems Quantization  F(u, v) represents a DCT coefficient, Q(u, v) is a “quantization matrix” entry, and.
Video Compression and Standards
Blind Quality Assessment System for Multimedia Communications Using Tracing Watermarking P. Campisi, M. Carli, G. Giunta and A. Neri IEEE Transactions.
EE5359 Multimedia Processing Project Study and Comparison of AC3, AAC and HE-AAC Audio Codecs Dhatchaini Rajendran Student ID: Date :
RATE SCALABLE VIDEO COMPRESSION Bhushan D Patil PhD Research Scholar Department of Electrical Engineering Indian Institute of Technology, Bombay Powai,
UNIT V. Linear Predictive coding With the advent of inexpensive digital signal processing circuits, the source simply analyzing the audio waveform to.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
MPEG Video Coding II — MPEG-4, 7 and Beyond. Administrative Issues (04/26/2016)  Draft of Final Report is due on Tuesday, April 26, 2016  Submit a narrative.
Fundamentals of Multimedia 2 nd ed., Chapter 14 Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Audio Codecs 14.4 MPEG-7.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Data Compression.
MPEG4 Natural Video Coding
Govt. Polytechnic Dhangar(Fatehabad)
Presentation transcript:

page 15/18/2015 CSE 40373/60373: Multimedia Systems Bluray (  MPEG-2 - enhanced for HD, also used for playback of DVDs and HDTV recordings  MPEG-4 AVC - part of the MPEG-4 standard also known as H.264 (High Profile and Main Profile)  SMPTE VC-1 - standard based on Microsoft's Windows Media Video (WMV) technology  Video bitrate Mbps (vs ~10 Mbps for DVD)

page 25/18/2015 CSE 40373/60373: Multimedia Systems MPEG-4  MPEG-4 adopts a object-based coding:  Offering higher compression ratio, also beneficial for digital video composition, manipulation, indexing, and retrieval  The bit-rate for MPEG-4 video now covers a large range between 5 kbps to 10 Mbps  More interactive than MPEG-1 and MPEG-2

page 35/18/2015 CSE 40373/60373: Multimedia Systems Comparison between Block-based Coding and Object-based Coding

page 45/18/2015 CSE 40373/60373: Multimedia Systems Composition and manipulation of object

page 55/18/2015 CSE 40373/60373: Multimedia Systems Overview of MPEG-4 1.Video-object Sequence (VS)—delivers the complete MPEG-4 visual scene, which may contain 2-D or 3-D natural or synthetic objects 2.Video Object (VO) — a object in the scene, which can be of arbitrary shape corresponding to an object or background of the scene 3.Video Object Layer (VOL) — facilitates a way to support (multi-layered) scalable coding. A VO can have multiple VOLs under scalable coding, or have a single VOL under non-scalable coding 4.Group of Video Object Planes (GOV) — groups Video Object Planes together (optional level) 5.Video Object Plane (VOP) — a snapshot of a VO at a particular moment

page 65/18/2015 CSE 40373/60373: Multimedia Systems Object oriented  VOP – I-VOP, B-VOP, P-VOP  Objects can be arbitrary shape – need to encode the shape and the texture (object)  Need to treat MB inside object different than boundary blocks (padding, different DCT etc)

page 75/18/2015 CSE 40373/60373: Multimedia Systems Sprite Coding  A sprite is a graphic image that can freely move around within a larger graphic image or a set of images  To separate the foreground object from the background, we introduce the notion of a sprite panorama: a still image that describes the static background over a sequence of video frames  The large sprite panoramic image can be encoded and sent to the decoder only once at the beginning of the video sequence  When the decoder receives separately coded foreground objects and parameters describing the camera movements thus far, it can reconstruct the scene in an efficient manner

page 85/18/2015 CSE 40373/60373: Multimedia Systems

page 95/18/2015 CSE 40373/60373: Multimedia Systems Global Motion Compensation (GMC)  “Global” – overall change due to camera motions (pan, tilt, rotation and zoom)  Without GMC this will cause a large number of significant motion vectors  There are four major components within the GMC algorithm:  Global motion estimation  Warping and blending  Motion trajectory coding  Choice of LMC (Local Motion Compensation) or GMC.

page 105/18/2015 CSE 40373/60373: Multimedia Systems

page 115/18/2015 CSE 40373/60373: Multimedia Systems MPEG-7  The main objective of MPEG-7 is to serve the need of audio-visual content-based retrieval (or audiovisual object retrieval) in applications such as digital libraries

page 125/18/2015 CSE 40373/60373: Multimedia Systems MPEG-7 video segment

page 135/18/2015 CSE 40373/60373: Multimedia Systems A video summary

page 145/18/2015 CSE 40373/60373: Multimedia Systems Chapter 13: VOCODER  Voice only coder use aspects of human hearing  E.g. Formant Vocoder - voice is not equal represented in all frequencies because of vocal cord  They can produce good quality sound in 1,000 bps  concerned with modeling speech so that the salient features are captured in as few bits as possible  use either a model of the speech waveform in time (LPC (Linear Predictive Coding) vocoding  break down the signal into frequency components and model these (channel vocoders and formant vocoders)  Simulations are improving but still recognizable (automated phone calls)

page 155/18/2015 CSE 40373/60373: Multimedia Systems Phase insensitivity  A complete reconstituting of speech waveform is really unnecessary, perceptually: all that is needed is for the amount of energy at any time to be about right, and the signal will sound about right.  Solid line: Superposition of two cosines, with a phase shift. Dashed line: No phase shift. wave is different, yet the sound is the same, perceptually

page 165/18/2015 CSE 40373/60373: Multimedia Systems Linear Predictive Coding (LPC)  LPC vocoders extract salient features of speech directly from the waveform, rather than transforming the signal to the frequency domain  LPC Features:  uses a time-varying model of vocal tract sound generated from a given excitation  transmits only a set of parameters modeling the shape and excitation of the vocal tract, not actual signals or differences  small bit-rate

page 175/18/2015 CSE 40373/60373: Multimedia Systems Chapter 14: MPEG Audio  Psychoacoustics  Frequency: Remove audio that are masked anyway  A lower tone can effectively mask (make us unable to hear) a higher tone  The reverse is not true – a higher tone does not mask a lower tone well  The greater the power in the masking tone, the wider is its influence – the broader the range of frequencies it can mask  As a consequence, if two tones are widely separated in frequency then little masking occurs  Temporal Phenomenon: any loud tone will cause the hearing receptors in the inner ear to become saturated and require time to recover

page 185/18/2015 CSE 40373/60373: Multimedia Systems 14.2 MPEG Audio  MPEG audio compression takes advantage of psychoacoustic models, constructing a large multi- dimensional lookup table to transmit masked frequency components using fewer bits  Applies a filter bank to the input to break it into its frequency components  In parallel, a psychoacoustic model is applied to the data for bit allocation block  The number of bits allocated are used to quantize the info from the filter bank – providing the compression

page 195/18/2015 CSE 40373/60373: Multimedia Systems MPEG Layers  Each succeeding layer offering more complexity in the psychoacoustic model and better compression for a given level of audio quality  Layer 1 quality can be quite good provided a comparatively high bit-rate is available  Digital Audio Tape typically uses Layer 1 at around 192 kbps  Layer 2 has more complexity; was proposed for use in Digital Audio Broadcasting  Layer 3 (MP3) is most complex, and was originally aimed at audio transmission over ISDN lines  Most of the complexity increase is at the encoder, not the decoder – accounting for the popularity of MP3 players

page 205/18/2015 CSE 40373/60373: Multimedia Systems Summary  Apply different set of heuristics (than video), yet achieve the goal of attaining good compression by removing components that the human ear is not good at distinguishing  More complexity at the encoding phase leads to better compression  Humans are lot more sensitive to dropped “frames” in audio than in video. Audio should also be well synchronized - otherwise distracting  Humans also like audio better than video, TVs/DVDs send higher fidelity audio