Digital Audio Processing

Slides:

Advertisements

Similar presentations

EET260 Introduction to digital communication

Advertisements

Digital Audio 1.

Digital Audio Teppo Räisänen LIIKE/OAMK. General Information Auditive information is transmitted by vibrations of air molecules The speed of sound waves.

Pro Tools 7 Session Secrets Chapter 6: After the Bounce or Life Outside of Pro Tools Life Outside of Pro Tools.

Sound in multimedia How many of you like the use of audio in The Universal Machine? What about The Universal Computer? Why or why not? Does your preference.

4.1Different Audio Attributes 4.2Common Audio File Formats 4.3Balancing between File Size and Audio Quality 4.4Making Audio Elements Fit Our Needs.

1. Digitization of Sound What is Sound? Sound is a wave phenomenon like light, but is macroscopic and involves molecules of air being compressed and expanded.

Time-Frequency Analysis Analyzing sounds as a sequence of frames

Dale & Lewis Chapter 3 Data Representation Analog and digital information The real world is continuous and finite, data on computers are finite  need.

Digital Audio Coding – Dr. T. Collins Standard MIDI Files Perceptual Audio Coding MPEG-1 layers 1, 2 & 3 MPEG-4.

I Power Higher Computing Multimedia technology Audio.

Speech Compression. Introduction Use of multimedia in personal computers Requirement of more disk space Also telephone system requires compression Topics.

ECE 4321: Computer Networks Chapter 3 Data Transmission.

DIGITAL COMMUNICATIONS.  The modern world is dependent on digital communications.  Radio, television and telephone systems were essentially analog in.

Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.

1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 

Audio signal processing. Audio signal processing, sometimes referred to as audio processing. It is the processing of a representation of auditory signals,

1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.

MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.

Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.

William Stallings Data and Computer Communications 7th Edition (Selected slides used for lectures at Bina Nusantara University) Data, Signal.

Chapter 1: Introduction Business Data Communications, 4e.

Advanced Lecture.  dynamic range The ratio of the loudest (undistorted) signal to that of the quietest (discernible) signal in a unit or system as expressed.

5. Multimedia Data. 2 Multimedia Data Representation  Digital Audio  Sampling/Digitisation  Compression (Details of Compression algorithms – following.

Data dan Teknologi Multimedia Sesi 08 Nofriyadi Nurdam.

Digital Audio Multimedia Systems (Module 1 Lesson 1)

1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.

Digital Communication Techniques

Introduction to Sound Sounds are vibrations that travel though the air or some other medium A sound wave is an audible vibration that travels through.

Representation of Data in Computer Systems

1 Live Sound Reinforcement Equalizers and other signal processing equipment.

Digital audio. In digital audio, the purpose of binary numbers is to express the values of samples that represent analog sound. (contrasted to MIDI binary.

Introduction to Interactive Media 10: Audio in Interactive Digital Media.

COMP Representing Sound in a ComputerSound Course book - pages

CE 4228 Data Communications and Networking

Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.

Inspire School of Arts and Science Jim White Compression.

Media Representations - Audio

CHAPTER SEVEN SOUND. CHAPTER HIGHLIGHTS Nature of sound – Sine waves, amplitude, frequency Traditional sound reproduction Digital sound – Sampled – Synthesized.

Dynamic Range and Dynamic Range Processors

Sound Part 2 Sound and how to make it. What is Sound? Sound is a compression wave.

1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.

8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.

COMPRESSION. Compression in General: Why Compress? So Many Bits, So Little Time (Space) CD audio rate: 2 * 2 * 8 * = 1,411,200 bps CD audio storage:

Digital Recording. Digital recording is different from analog in that it doesn’t operate in a continuous way; it breaks a continuously varying waveform.

Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.

Analogue & Digital. Analogue Sound Storage Devices.

Marwan Al-Namari 1 Digital Representations. Bits and Bytes Devices can only be in one of two states 0 or 1, yes or no, on or off, … Bit: a unit of data.

Data dan Teknologi Multimedia Sesi 09 Nofriyadi Nurdam.

Digital Audio III. Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent.

The Digital Revolution Changing information. What is Digital?  Discrete values used for  Input  Processing  Transmission  Storage  Display  Derived.

COMP135/COMP535 Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 2 Lecture 2 – Digital Representations.

Chapter Five Making Connections Efficient: Multiplexing and Compression Data Communications and Computer Networks: A Business User’s Approach Eighth Edition.

Digital Audio I. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul.

Today’s lesson Discovering and understanding compression Working on club dance portfolios.

Audio Formats. Digital sound files must be organized and structured so that your media player can read them. It's just like being able to read and understand.

By :- Ishank Ranjan Akash Gupta. Audio & Audio File Formats Audio is an electrical or other representation of sound. An audio file format is a file format.

Introduction to Digital Audio

Multimedia: making it Work

Analogue & Digital.

Data Compression.

Introduction to Digital Audio

Introduction to Digital Audio

Introduction to Digital Audio

Digital Music Scott Bennett 1/3/2019.

Assist. Lecturer Safeen H. Rasool Collage of SCIENCE IT Dept.

Audio Compression Techniques

Govt. Polytechnic Dhangar(Fatehabad)

Introduction to Digital Audio

Digital Audio Application of Digital Audio - Selected Examples

Presentation transcript:

Digital Audio Processing By: Eng. Mohanned Dawoud

Audio signal processing Audio signal processing, sometimes referred to as audio processing. It is the processing of a representation of auditory signals, or sound.

Analog signals An analog representation is usually electrical. A voltage level represents the air pressure waveform of the sound.

Digital signals A digital representation expresses the pressure wave-form as a sequence of symbols, usually binary numbers, which permits digital signal processing. human ears cannot perceive frequencies below approx. 20 Hz or above approx. 18 kHz (strongly depends on the age of the listener). Therefore, there is no significant loss of information when the analog signal is sampled using a high enough sampling rate.

Digital signals The signal-to-noise ratio (SNR), which measures the noise level, can be easily calculated thru this formula, where n is the number of bits used on the ADC: SNR = 6.02 x n + 1.76 dB The higher the SNR, the better. An 8-bit ADC provides a SNR of 49.9 dB, while a 16-bit SNR provides a SNR of 98 dB (which is, by the way, a virtually no-noise value). More than 130 dB Signal-to-noise ratio is almost impossible to achieve.

Application areas Processing methods and application areas include: Storage. level compression data compression. Transmission. Enhancement (e.g., equalization, filtering, noise cancellation, echo or reverb removal or addition, etc.)

Sound recording and reproduction Sound recording and reproduction is the electrical or mechanical inscription and re-creation of sound waves, usually used for the voice or for music.

Sound recording and reproduction The two main classes of sound recording technology are analog recording and digital recording. Analog recording is achieved by a small microphone diaphragm that can detect changes in atmospheric pressure (acoustic sound waves) and record them as graphic sound waves on a medium. Digital recording and reproduction uses the same analog technologies, with digitization of the sonographic data and signal.

Sound recording and reproduction The most recent and revolutionary developments have been in digital recording, with the invention of purely electronic consumer recording formats such as the WAV digital music file and the compressed file type, the MP3. use of computers has made editing operations faster and easier to execute with software, and the use of hard-drives for storage has made recording cheaper

Dynamic range compression In music, dynamic range is the difference between the quietest and loudest volume of an instrument, part or piece of music. Dynamic range compression, also called DRC (often seen in DVD player settings) or simply compression, is a process that reduces the dynamic range of an audio signal. Compression is used during sound recording, live sound reinforcement, and broadcasting to control the level of audio. A compressor is the device used to apply compression.

Dynamic range compression In simple terms, a compressor is an automatic volume control. Loud sounds over a certain threshold are reduced in level while quiet sounds remain untreated (this is known as downward compression, while the less common upward compression involves making sounds below the threshold louder while the louder passages remain unchanged). In this way it reduces the dynamic range of an audio signal. This may be done for aesthetic reasons, to deal with technical limitations of audio equipment, or to improve audibility of audio in noisy environments.

Dynamic range compression A compressor reduces the gain (level) of an audio signal if its amplitude exceeds a certain threshold. The amount of gain reduction is determined by a ratio. For example, with a ratio of 4:1, when the (time averaged) input level is 4 dB over the threshold, the output signal level will be 1 dB over the threshold. The gain (level) has been reduced by 3 dB. When the input level is 8 dB above the threshold, the output level will be 2 dB; a 6 dB gain reduction. A more specific example for a 4:1 ratio: Threshold = −10 dB Input = −6 dB (4 dB above the threshold) Output = −9 dB (1 dB above the threshold)

Dynamic range compression Design Feed-forward: is used today on all compressors. The signal entering a compressor is split, with one copy sent to a variable-gain amplifier and the other to a path called the side-chain. Control circuit calculates the required amount of gain reduction. The control-circuit outputs the requested gain-reduction amount to the amplifier. Feedback type: Early compressor designs were based on this type. the signal feeding the control circuit was taken after the amplifier.

Dynamic range compression Compressor features Threshold Threshold is the level above which the signal is reduced. It is commonly set in dB, where a lower threshold (e.g. -60 dB) means a larger portion of the signal will be treated (compared to a higher threshold of -5 dB). Ratio The ratio determines the input/output ratio for signals above the threshold. The highest ratio of ∞:1 is commonly achieved using a ratio of 60:1.

Dynamic range compression Attack and release A compressor might provide a degree of control over how quickly it acts. The 'attack phase' is the period when the compressor is increasing gain reduction to reach the level that is determined by the ratio. The 'release phase' is the period when the compressor is decreasing gain reduction to the level determined by the ratio, or, to zero, once the level has fallen below the threshold. The length of each period is determined by the rate of change and the required change gain reduction.

Dynamic range compression Soft and hard knees Another control a compressor might offer is hard/soft knee. This controls whether the bend in the response curve is a sharp angle or has a rounded edge. A soft knee slowly increases the compression ratio as the level increases and eventually reaches the compression ratio set by the user. A soft knee reduces the audible change from uncompressed to compressed, especially for higher ratios where the changeover is more noticeable.

Dynamic range compression Stereo Linking Stereo linking can be achieved in two ways: Either the compressor sums to mono the left and right channel at the input, then only the left channel controls are functional. The compressor still calculates the required amount of gain reduction independently for each channel and then apply the highest amount of gain reduction to both . Make up gain Because the compressor is reducing the gain (or level) of the signal, the ability to add a fixed amount of make-up gain at the output is provided so that an optimum level can be used.

Dynamic range compression Compressor usage high threshold and low compression ratio achieve dynamic range reduction with few obvious effects so that the source material is being compressed very slightly most of the time. To deliberately soften the attack of a snare drum, they might choose a fast attack time and a moderately fast release time combined with a higher threshold. To accentuate the attack of the snare, they might choose a slower attack time to avoid affecting the initial transient. It is easier to successfully apply these controls if the user has a basic knowledge of musical instrument acoustics. It should be noted that compression can also be used to lift the soft passages of a selection, pulling the sound toward a compressed "middle". Hence, loud sounds are pulled back and soft passages are boosted.

Dynamic range compression Limiting Compression and limiting are no different in process, just in degree and in the perceived effect. A limiter is a compressor with a higher ratio, and generally a fast attack time. Engineers sometimes refer to soft and hard limiting which are differences of degree. The "harder" a limiter, the higher its ratio and the faster its attack and release times.

Dynamic range compression Side-chaining Side-chaining uses the signal level of another input or an equalized version of the original input to control the compression level of the original signal. This is used by disc jockeys to lower the music volume automatically when speaking. A stereo compressor without a sidechain can be used as a mono compressor with a sidechain. The key or sidechain signal is sent to the first (main) input of the stereo compressor while the signal that is to be compressed is routed into and out of the second channel of the compressor.

Dynamic range compression Parallel compression parallel compression is to insert the compressor in a parallel signal path. give a measure of dynamic control without significant audible side effects, if the ratio is relatively low and the compressor's sound is relatively neutral. On the other hand, a high compression ratio with significant audible artifacts can be chosen in one of the two parallel signal paths.

Dynamic range compression Multiband compression Multiband compressors are compressors that can act differently on different frequency bands. It is as if each bandpass has its own compressor with its own threshold, ratio, attack, and release. Serial compression Serial compression is a technique used in sound recording and mixing. Serial compression is achieved by using two fairly different compressors in a signal chain. One compressor will generally stabilize the dynamic range while the other will more aggressively compress stronger peaks.

Dynamic range compression Common uses Public spaces Music production Voice Broadcasting Marketing

Audio compression (data) Audio compression is a form of data compression designed to reduce the size of audio files. Audio compression algorithms are implemented in computer software as audio codecs. Generic data compression algorithms perform poorly with audio data. Consequently, specific audio "lossless" and "lossy" algorithms have been created. In both lossy and lossless compression, information redundancy is reduced.

Audio compression (data) Lossless audio compression Lossless audio compression allows one to preserve an exact copy of one's audio files. Compression ratios are similar to those for generic lossless data compression (around 50–60% of original size). Compression ratios are substantially less for lossy compression (which typically yield 5–20% of original size).

Audio compression (data) Use of Lossless audio compression The primary use of lossless encoding are: Archives : For archival purposes, one naturally wishes to maximize quality. Audio quality: Being lossless, these formats completely avoid compression artifacts.

Audio compression (data) Use of Lossless audio compression (cont.) A specific application is to: Store lossless copies of audio. Produce lossily compressed versions for a digital audio player. As formats and encoders improve, one can produce updated lossily compressed files from the lossless master. As file storage and communications bandwidth have become less expensive and more available, lossless audio compression has become more popular.

Audio compression (data) Formats of Lossless audio compression Shorten was an early lossless format; newer ones include Free Lossless Audio Codec (FLAC), Apple's Apple Lossless, MPEG-4 ALS, Monkey's Audio, and TTA. Some audio formats feature a combination of a lossy format and a lossless correction; this allows stripping the correction to easily obtain a lossy file. Such formats include MPEG-4 SLS (Scalable to Lossless), WavPack, and OptimFROG DualStream. Some formats are associated with a technology, such as: Direct Stream Transfer, used in Super Audio CD Meridian Lossless Packing, used in DVD-Audio and Dolby TrueHD, used in in Blu-ray and HD DVD

Audio compression (data) Difficulties in lossless compression of audio data It is difficult to maintain all the data in an audio stream and achieve substantial compression. First, the vast majority of sound recordings are highly complex, recorded from the real world. Second, The values of audio samples change very quickly, so generic data compression algorithms don't work well for audio, and strings of consecutive bytes don't generally appear very often.

Audio compression (data) Evaluation criteria of Lossless audio compression Lossless audio codecs have no quality issues, so the usability can be estimated by Speed of compression and decompression Degree of compression Software and hardware support Robustness and error correction

Audio compression (data) Lossy audio compression Lossy audio compression is used in an extremely wide range of applications. Digitally compressed audio streams are used in many applications, for example: Video DVDs. Digital television. Streaming media on the internet. Satellite and cable radio. Increasingly in radio broadcasts. Lossy compression typically achieves far greater compression than lossless compression (5-20% of the original stream, rather than 50-60% in the Lossless.

Audio compression (data) Lossy audio compression (cont.) In lossy audio compression, data that can not be perceived by the human auditory system are coded with decreased accuracy or not coded at all. Most lossy compression reduces perceptual redundancy. Removing or reducing 'unhearable' sounds may account for a small percentage of bits saved in lossy compression.

Audio compression (data) Lossy audio compression (cont.) Reducing the number of bits used to code a signal increases the amount of noise in that signal (Recall SNR). The real key in lossy compression, is to 'hide' the noise generated by the bit savings in areas of the audio stream that cannot be perceived. This is done by, for instance, using very small numbers of bits to code the high frequencies of most signals. human ear can only perceive very loud signals in this region so that softer sounds 'hidden' there simply aren't heard.

Audio compression (data) Lossy audio compression (cont.) If reducing perceptual redundancy does not achieve sufficient compression for a particular application, it may require further lossy compression. Depending on the audio source, this still may not produce perceptible differences. Speech for example can be compressed far more than music. Most lossy compression schemes allow compression parameters to be adjusted to achieve a target rate of data, usually expressed as a bit rate.

Audio compression (data) Lossy audio compression (cont.) Because data is removed during lossy compression and cannot be recovered by decompression, some people may not prefer lossy compression for archival storage. People use lossy compression may wish to keep a losslessly compressed archive for other applications.

Audio compression (data) Applications of lossy compression Due to the nature of lossy algorithms, audio quality suffers when a file is decompressed and recompressed (digital generation loss). Lossy compression unsuitable for storing the intermediate results in professional audio engineering applications. Popular with end users (particularly MP3).

Audio compression (data) Evaluation criteria Usability of lossy audio codecs is determined by: Perceived audio quality Compression factor Speed of compression and decompression Inherent latency of algorithm (critical for real-time streaming applications; explained later) Software and hardware support

Audio compression (data) Inherent latency of algorithm Lossy formats are often used for the distribution of streaming audio, or interactive applications. In such applications, the data must be decompressed as the data flows. Not all audio codecs can be used for streaming applications. For such applications a codec designed to stream data effectively will usually be chosen.

Audio compression (data) Inherent latency of algorithm (cont.) Latency results from the methods used to encode and decode the data. Some codecs will analyze a longer segment of the data to optimize efficiency. Often codecs create segments called a "frame" to create discrete data segments for encoding and decoding.

Audio compression (data) Inherent latency of algorithm (cont.) Latency refers to the number of samples which must be analysed before a block of audio is processed. In the minimum case, latency is 0 zero samples (e.g., if the coder/decoder simply reduces the number of bits used to quantize the signal). Time domain algorithms have low latencies, hence their popularity in speech coding for telephony.