AUDIOFILES Harika Basana ), Elizabeth Chan ), Nikolai ), Frank Zhang ) 6100.

Slides:

Advertisements

Similar presentations

Alex Chen Nader Shehad Aamir Virani Erik Welsh

Advertisements

| Page Angelo Farina UNIPR | All Rights Reserved | Confidential Digital sound processing Convolution Digital Filters FFT.

Audio Compression ADPCM ATRAC (Minidisk) MPEG Audio –3 layers referred to as layers I, II, and III –The third layer is mp3.

Department of Computer Engineering University of California at Santa Cruz MPEG Audio Compression Layer 3 (MP3) Hai Tao.

Introduction to MP3 and psychoacoustics Material from website by Mark S. Drew

Psycho-acoustics and MP3 audio encoding

Guerino Mazzola (Fall 2014 © ): Introduction to Music Technology IIIDigital Audio III.6 (Fr Oct 24) The MP3 algorithm with PAC.

MPEG/Audio Compression Tutorial Mike Blackstock CPSC 538a January 11, 2004.

Fourier Transforms and Their Use in Data Compression

Time-Frequency Analysis Analyzing sounds as a sequence of frames

Digital Audio Compression

Chapter 4: Representation of data in computer systems: Sound OCR Computing for GCSE © Hodder Education 2011.

Digital Audio Coding – Dr. T. Collins Standard MIDI Files Perceptual Audio Coding MPEG-1 layers 1, 2 & 3 MPEG-4.

Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.

1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 

Data Compression Michael J. Watts

Speech & Audio Processing

1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.

MPEG-3 For Audio Presented by: Chun Lui Sunjeev Sikand.

MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.

Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.

SWE 423: Multimedia Systems Chapter 7: Data Compression (3)

Digital Watermarking. Introduction Relation to Cryptography –Cryptography is Reversibility (no evidence) Established –Watermarking (1990s) Non-reversible.

Wavelet-based Coding And its application in JPEG2000 Monia Ghobadi CSC561 project

Audio Steganography Echo Data Hiding

Department of Computer Engineering University of California at Santa Cruz Data Compression (2) Hai Tao.

SWE 423: Multimedia Systems Chapter 7: Data Compression (5)

Warped Linear Prediction Concept: Warp the spectrum to emulate human perception; then perform linear prediction on the result Approaches to warp the spectrum:

Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06.

Introduction to Sound Sounds are vibrations that travel though the air or some other medium A sound wave is an audible vibration that travels through.

GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.

Digital Audio Watermarking: Properties, characteristics of audio signals, and measuring the performance of a watermarking system نيما خادمي کلانتري

Lecture 1 Signals in the Time and Frequency Domains

GG 313 Lecture 26 11/29/05 Sampling Theorem Transfer Functions.

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.

Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.

Multiresolution STFT for Analysis and Processing of Audio

CMPT 365 Multimedia Systems

Acoustic Analysis of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.

Digital Signal Processing

Preprocessing Ch2, v.5a1 Chapter 2 : Preprocessing of audio signals in time and frequency domain  Time framing  Frequency model  Fourier transform 

1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.

Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.

Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 1 ECE-C490 Image Processing Architecture MP-3 Compression Course Review Oleh Tretiak.

Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.

CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2009.

Introduction to Digital Signals

Automatic Equalization for Live Venue Sound Systems Damien Dooley, Final Year ECE Progress To Date, Monday 21 st January 2008.

HOW JEPG WORKS Presented by: Hao Zhong For 6111 Advanced Algorithm Course.

CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.

Chapter 8 Lossy Compression Algorithms. Fundamentals of Multimedia, Chapter Introduction Lossless compression algorithms do not deliver compression.

Multimedia Sound. What is Sound? Sound, sound wave, acoustics Sound is a continuous wave that travels through a medium Sound wave: energy causes disturbance.

WAVELET NOISE REMOVAL FROM BASEBAND DIGITAL SIGNALS IN BANDLIMITED CHANNELS Dr. Robert Barsanti SSST March 2010, University of Texas At Tyler.

Fletcher’s band-widening experiment (1940) Present a pure tone in the presence of a broadband noise. Present a pure tone in the presence of a broadband.

Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.

MP3 and AAC Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD

MP3 and MP4 Audio By: Krunal Tailor

Data Compression Michael J. Watts

Chapter 8 Lossy Compression Algorithms

Fletcher’s band-widening experiment (1940)

III Digital Audio III.6 (Fr Oct 20) The MP3 algorithm with PAC.

Spread Spectrum Audio Steganography using Sub-band Phase Shifting

Data Compression.

CSI-447: Multimedia Systems

III Digital Audio III.6 (Mo Oct 22) The MP3 algorithm with PAC.

1-D DISCRETE COSINE TRANSFORM DCT

Image Coding and Compression

Govt. Polytechnic Dhangar(Fatehabad)

Presentation transcript:

AUDIOFILES Harika Basana ), Elizabeth Chan ), Nikolai ), Frank Zhang ) 6100 Main Street, Rice University, Houston, Texas GOAL To explore the MP3 technology and to implement various audio data compression algorithms. Analyze This  Audio compression is to compress an audio file into a smaller-sized file.  People cannot differentiate between these two files by just hearing.  Due to its smaller size, the new file can be easily transferred via the Internet.  People try to find better audio compression algorithms that retain satisfying audio quality. Algorithms  Average Energy Algorithm Z eroes out selected high and low frequencies of the audio file. Procedure  Perform the Discrete Cosine Transform (DCT).  Calculate the signal ’ s energy.  Find the mean and the standard deviation of from the energy spectrum.  Keep all frequencies with energies within 1 standard deviation (std) from the mean.  Zero out frequencies with energies outside this range.  Similarly, keep frequencies with energies within 2 and 3 stds from the mean.  Perform the Inverse DCT and get the output. Results Amount of compression is insignificant. Algorithm would probably work better if the signal is very short, has monotonous tones, and has little noise. Ding.wav before compression Ding.wav with frequencies within 1 std from the mean Ding.wav with frequencies within 2 std from the mean Ding.wav with frequencies within 3 std from the mean  Psycho Acoustic Algorithm Linear, tangent or arctangent quantization of the signal. Procedure  Perform the Discrete Cosine Transform (DCT)  Quantize the signal in one of the following ways : Diagram of the quantization “ buckets ” for the three methods  Give certain frequency bands more bits (1000 – 5100 Hz and Hz).  Throw away frequencies below 20Hz and above 20,000Hz.  Perform the Inverse DCT. Results Compression is very significant. Quality is good for the amount of compression. Arctangent quantization yields the best quality. Original signal sampled at 44100Hz The x-axis DT sample and the y-axis is the amplitude Original signal sampled at 44100Hz The x-axis DT sample and the y-axis is the amplitude After linear quantization After arctangent quantization After tangent quantization  Masking Algorithm The presence of a signal at a particular frequency can raise the perceptual threshold of signals close to the the masking frequency. Procedure  Go through every sample and remove the following samples if they are below a certain threshold. Results No significant improvement. Need a better way of implementing to get good results. Conclusion  We didn’t create MP3 files.  Used the underlying concepts.  Produced much smaller files.  Psycho Acoustic Algorithm is the best, in terms of - amount of compression - sound quality of the output. Improvements  Implement windowing  Implement temporal masking Bibliography: tutorials/mp3/mp3how.php and more…