Audio Coding Team Member: ChungMing Yan, Chun Tong.

Slides:



Advertisements
Similar presentations
RECORDING MECHANISM VOICE Competency : Principle of master making.
Advertisements

MP3 Overview John Ehrhardt Elena Silenok CSE228 – Spring 03.
Department of Computer Engineering University of California at Santa Cruz MPEG Audio Compression Layer 3 (MP3) Hai Tao.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 11 – MP3 and MP4 Audio (Part 7) Klara Nahrstedt Spring 2012.
Developement and Implementation of an MPEG1 Layer III Decoder on x86 and TMS320C6711 platforms Braidotti Enrico (Farina Simone)
MPEG/Audio Compression Tutorial Mike Blackstock CPSC 538a January 11, 2004.
CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.
Mar 2003 Ognen Paunovski :: Andon Dragomanov S3CTIT03 Modern Trends in Audio Compression presented by Ognen Paunovski Andon Dragomanov 2 nd International.
MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
Multimedia Authoring1 Introduction to Garageband Garageband is both a: MIDI sequencer Digital audio recorder Garageband: Real Instruments Tracks displayed.
Analysis of Audio Compression Algorithms Sanjeev Sharma.
Pro Tools 7 Session Secrets Chapter 6: After the Bounce or Life Outside of Pro Tools Life Outside of Pro Tools.
A stereo audio file 1. Audio Channels Number of audio channels determines number of waveforms in a recording Two relevant types of recording Stereo recording.
Streaming Media From the Web (and other alternative deliveries) The technicals, the processes, the formats, and the reasons.
4.1Different Audio Attributes 4.2Common Audio File Formats 4.3Balancing between File Size and Audio Quality 4.4Making Audio Elements Fit Our Needs.
Digital Audio Production Munsang College Information and Communication Technology S2.
1. Digitization of Sound What is Sound? Sound is a wave phenomenon like light, but is macroscopic and involves molecules of air being compressed and expanded.
.AAC and.MP3 By: Jared Hendricks & Billy Wolfram.
Digital Audio Compression
Free open source audio recording and editing software 1Using Audacity.
Technology ICT Option: Audio.
Image and Sound Editing Raed S. Rasheed Sound What is sound? How is sound recorded? How is sound recorded digitally ? How does audio get digitized.
Audiovisual digital documents Adolf Knoll National Library of the Czech Republic
MP3toFM Midterm Presentation February 21, About Us 2 Brandon Leatherwood CPE/SE MCU Firmware Ethernet Design Josh Wilson CPE MP3 Decoder MCU Firmware.
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
MPEG-3 For Audio Presented by: Chun Lui Sunjeev Sikand.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.
Audio Coding MPEG1 Layers I, II, III MPEG2MPEG4 Sherida Subrati Anthony Caliendo.
Multimedia for our time (For Dummies) ISO/IEC Visa Hyoungjune Yi.
AUDIO VIDEO FLASH DIGITAL MEDIA: COMMUNICATION AND DESIGN
Audio CompressiontMyn1 Audio Compression Audio compression has become well entrenched in consumer and professional digital audio products such as the compact.
An Overview of Perceptual Audio Coding and MPEG AAC
1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.
Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.
MPEG-2 Standard By Rigoberto Fernandez. MPEG Standards MPEG (Moving Pictures Experts Group) is a group of people that meet under ISO (International Standards.
Video Basics. Agenda Digital Video Compressing Video Audio Video Encoding in tools.
MPEG: (Moving Pictures Expert Group) A Video Compression Standard for Multimedia Applications Seo Yeong Geon Dept. of Computer Science in GNU.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
AVI File Format By : Jacob, Bab and Conor. Basic operation Presented By: Conor.
Allison Schein.  Adobe Audition (  Recommended program, metadata creation and manipulation is easy and complete.
CHAPTER SEVEN SOUND. CHAPTER HIGHLIGHTS Nature of sound – Sine waves, amplitude, frequency Traditional sound reproduction Digital sound – Sampled – Synthesized.
Audio Henning Schulzrinne Dept. of Computer Science Columbia University Fall 2003.
Dhatchaini Rajendran Student ID: Date :
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 9 This presentation © 2004, MacAvon Media Productions Sound.
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
AUDIO AND VIDEO COMPRESSION AND IT’S IMPORTANCE ON THE INTERNET Brian Dillinger May 3, 2010.
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 1 ECE-C490 Image Processing Architecture MP-3 Compression Course Review Oleh Tretiak.
Multimedia and weBLOGging Grade 7-9 | Cahaya Bangsa Classical School (C) 2010 Digital Media Production Facility 04 – Audio Basic.
Digital Audio III. Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent.
Guerino Mazzola (Fall 2015 © ): Introduction to Music Technology IIIDigital Audio III.5 (F Oct 30) MP3 and other digital audio file formats.
IntroductiontMyn1 Introduction MPEG, Moving Picture Experts Group was started in 1988 as a working group within ISO/IEC with the aim of defining standards.
By: Cheidalisse Feliciano & Ki’rah Howard. WMAMP3  Windows Media Audio (WMA) is an audio data compression technology developed by Microsoft.  The definition.
EE5359 Multimedia Processing Project Study and Comparison of AC3, AAC and HE-AAC Audio Codecs Dhatchaini Rajendran Student ID: Date :
How to Create a Podcast. Podcasting “is the distribution of audio or video files, such as radio programs or music videos, over the Internet using either.
MP3 and MP4 Audio By: Krunal Tailor
Sound Jan Růžička Institute of geoinformatics VSB-TU Ostrava 17.listopadu 15, Ostrava-Poruba,
Chapter 4 Fundamentals of Digital Audio
Video Basics.
III Digital Audio III.5 (W Oct 18) MP3 and other digital audio file formats.
Data Compression.
Video Compression - MPEG
Audio Henning Schulzrinne Dept. of Computer Science
Sound Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman
Technology ICT Option: Audio.
MPEG-1 Overview of MPEG-1 Standard
Technology ICT Option: Audio.
Govt. Polytechnic Dhangar(Fatehabad)
Digital Video Faraz Khan.
Presentation transcript:

Audio Coding Team Member: ChungMing Yan, Chun Tong

Overview Mp3, AAC, Ogg Vorbis Technical specifications Test Results Sample clips Conclusion

Mp3 MPEG1 Layer 3 Audio Coding A research project in EUREKA Digital Audio Broadcasting (DAB) in 1987 A power of data reduction algorithm Standardized as ISO-MPEG Audio-Layer 3

Mp3 (Continued) Pros: Fast Decoding Excellent hardware support ISO standard Cons: Quality varies widely between encoders Even at highest quality, quality still suffers

Mp3 (Continue) Bit Rate: Average 128 kbps or 192 kbps Sampling Frequency: 16-24KHz (MPEG2 Layer 3) 32-48KHz (MPEG1 Layer 3) Parameters: Birate: 1. CBR (Constant Bitrate) 2. VBR (Variable Bitrate) 3. ABR (Average Bitrate)

Mp3 (Continue) Compression Techniques Huffman coding Non-linear quantization M/S Matrixing (Mid/side matrixing) Intensity stereo MDCT

Mp3 (Continue) Encoders: Lame, Audio Catalysis Decoders: Winamp, Window Media Player, etc.

AAC MPEG2/MPEG4 Advanced Audio Coding Developed by MPEG group (Dolby, Frauhofer, AT&T, Sony…etc) More over of mp3

AAC (Continued) Pros: Competitive at low and mid bitrates against other formats Decoders/Encoders work on all platforms Cons: All high-quality implementations of AAC encoding are non-free and closed source. Relatively CPU intensive

AAC (Continue) Bit Rate: 96 kbps, 128kbps, 196kbps Sampling Frequency: 48 full-bandwidth (up to 96 KHz) Low Frequency Enhancement (LFE to 120 KHz)

AAC (Continue) Profiles LC (Low Complexity) Main Main LTP

AAC (Continue) Compression techniques Huffman coding Non-linear quantization and scaling Vector quantization M/S matrixing (middle/side channels) for high bitrates Intensity stereo for low bitrates TNS (temporal noise shaping) LTP(MPEG4 profile 2, reduce redundancy in successive frames) MDCT PNS (perceptual noise shaping)

AAC (Continue) Encoders: Psytel AacEnc, Nero Decoders: Winamp (with an AAC plug- in), QuickTime 6

Ogg Vorbis Open source project Free, open, unpatented from other audio coding format

Ogg Vorbis (Continue) Pros: Open source and patent free No loyalties even in commercial products Cons: No commercial hardware players High bitrates not fully tuned

Ogg Vorbis (Continue) Bit rate: ~64kbps Sampling Frequency: From 8 KHz (telephony) to 192 KHz (Digital Masters)

Ogg Vorbis (Continued) Compression techniques Huffman coding MDCT (Cosine + Sine) Wavelet in Vorbis II to improve quality

Ogg Vorbis (Continue) Encoders: Besweet, OggDrop Decoders: Winamp (with an Ogg Vorbis plug-in)

Test Result Three music clip used Orchestra Music with voice Voice only Different bitrate setting (switches) High bitrate Medium variable bitrate Low bitrate Additional switches (voice, pns)

Sample clips Wav Mp3 Ogg AAC

filesizes mp3: track 1track 2track 3 original2470KB2587KB318KB cbr 256(160 voice) r3mix (96) abr abr /60 aac: track 1track 2track 3 original2470KB2587KB318KB cbr abr abr 48(tape, 40-59) cbr cbr 32 resampled ogg: track 1track 2track 3 original2470KB2587KB318KB cbr 256/ abr 96/ abr 48/ abr 32-the encoder cannot encode lower bitrate

Switches used Mp3 GUI automatically writes the proper command line CBR "c:\EE3414\lame\lame.exe" -m s -b 256 -k "C:\EE3414\input.wav" "C:\EE3414\output.mp3“ r3mix - "c:\EE3414\lame\lame.exe" --nspsytune --vbr-mtrh -V1 -mj -h -b96 --lowpass athtype 3 --ns-sfb21 2 -Z --scale X0 "C:\EE3414\input.wav" "C:\EE3414\output.mp3" abr 48 - "c:\EE3414\lame\lame.exe" --abr 48 -b 32 -B 320/160 "C:\EE3414\input.wav“ "C:\EE3414\output.mp3“ abr 32 - "c:\EE3414\lame\lame.exe" --abr 32 -b 32 -B 320/160 "C:\EE3414\input.wav" "C:\EE3414\output.mp3" abr 32 voice - "c:\EE3414\lame\lame.exe" --voice --abr 32 -b 32 -B 160 "C:\EE3414\voice.wav" "C:\EE3414\voice(abr32-voice).mp3"

Switches used (Continued) AAC cbr 256 -production -low_ath -profile 0 -br 256 abr 96 -production -profile 0 -br 96 –vbrhi abr 48-tape abr 32-br 32 abr 32-br 32 -resample 22050

Switches used (continued) Ogg Vorbis Lacking control parameters besides “quality” cbr 256 GUI quality set to 8 abr 96GUI quality set to 2 abr 48GUI quality set to -1 abr 32such bitrate is not possible with given tools even with manual bitrate tweaking

Conclusion ACC = Ogg > MP3 There are very little differences, very hard to tell Depends on application Alternative Audio Coding Lossless encoding Monkey audio Speech specific Speez

Future research or improvements As technology improves, there will be newer coding schemes to be examined More extensive research of the parameters and encoding procedures Matlab waveform analysis (object analysis) Alternative Implementation

Resources Team website