III Digital Audio III.9 (Wed Oct 24) Phase vocoder for tempo and pitch changes.

Slides:



Advertisements
Similar presentations
DFT & FFT Computation.
Advertisements

DCSP-12 Jianfeng Feng
Sound Synthesis Part V: Effects. Plan Overview of effects Chorus effect Treble & bass amplification Saturation Pitch vocoder Summary.
Guerino Mazzola (Fall 2014 © ): Introduction to Music Technology IIIDigital Audio III.3 (Fr Oct 10) Complex Fourier representation (preliminaries to FFT)
Guerino Mazzola (Fall 2014 © ): Introduction to Music Technology IIIDigital Audio III.6 (Fr Oct 24) The MP3 algorithm with PAC.
Digital Kommunikationselektronik TNE027 Lecture 5 1 Fourier Transforms Discrete Fourier Transform (DFT) Algorithms Fast Fourier Transform (FFT) Algorithms.
ACHIZITIA IN TIMP REAL A SEMNALELOR. Three frames of a sampled time domain signal. The Fast Fourier Transform (FFT) is the heart of the real-time spectrum.
1 Acoustic Sampling Of Instruments Dan Starr Capstone Design Project Advisors: Prof. Catravas Prof. Postow.
Time-scale and pitch modification Algorithms review Alexey Lukin.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
Chapter 8: The Discrete Fourier Transform
DEVON BRYANT CS 525 SEMESTER PROJECT Audio Signal MIDI Transcription.
Effects in frequency domain Stefania Serafin Music Informatics Fall 2004.
Continuous Time Signals All signals in nature are in continuous time.
Discrete Time Periodic Signals A discrete time signal x[n] is periodic with period N if and only if for all n. Definition: Meaning: a periodic signal keeps.
First semester King Saud University College of Applied studies and Community Service 1301CT.
Frequency Domain Representation of Sinusoids: Continuous Time Consider a sinusoid in continuous time: Frequency Domain Representation: magnitude phase.
Numerical algorithms for power system protection Prof. dr. sc. Ante Marušić, doc. dr. sc. Juraj Havelka University of Zagreb Faculty of Electrical Engineering.
Digital Audio What do we mean by “digital”? How do we produce, process, and playback? Why is physics important? What are the limitations and possibilities?
Where we’re going Speed, Storage Issues Frequency Space.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Basics of Signal Processing. SIGNALSOURCE RECEIVER describe waves in terms of their significant features understand the way the waves originate effect.
Digital Signal Processing
Chapter 16 Speech Synthesis Algorithms 16.1 Synthesis based on LPC 16.2 Synthesis based on formants 16.3 Synthesis based on homomorphic processing 16.4.
Wireless and Mobile Computing Transmission Fundamentals Lecture 2.
Zhongguo Liu_Biomedical Engineering_Shandong Univ. Chapter 8 The Discrete Fourier Transform Zhongguo Liu Biomedical Engineering School of Control.
Course Outline (Tentative) Fundamental Concepts of Signals and Systems Signals Systems Linear Time-Invariant (LTI) Systems Convolution integral and sum.
Real time DSP Professors: Eng. Julian S. Bruno Eng. Jerónimo F. Atencio Sr. Lucio Martinez Garbino.
Pre-Class Music Paul Lansky Six Fantasies on a Poem by Thomas Campion.
Fourier Series Fourier Transform Discrete Fourier Transform ISAT 300 Instrumentation and Measurement Spring 2000.
Time Frequency Analysis
Guerino Mazzola (Fall 2015 © ): Introduction to Music Technology IIIDigital Audio III.4 (We Oct 28) Fast Fourier Transform (FFT)
GG313 Lecture 24 11/17/05 Power Spectrum, Phase Spectrum, and Aliasing.
Time Compression/Expansion Independent of Pitch. Listening Dies Irae from Requiem, by Michel Chion (1973)
Learning from the Past, Looking to the Future James R. (Jim) Beaty, PhD - NASA Langley Research Center Vehicle Analysis Branch, Systems Analysis & Concepts.
بسم الله الرحمن الرحيم Digital Signal Processing Lecture 14 FFT-Radix-2 Decimation in Frequency And Radix -4 Algorithm University of Khartoum Department.
Fourier Analysis Patrice Koehl Department of Biological Sciences National University of Singapore
Lecture 19 Spectrogram: Spectral Analysis via DFT & DTFT
III Digital Audio III.2 (M Oct 02) Finite Fourier Theory.
Introduction to Audio Watermarking Schemes N. Lazic and P
Discrete Fourier Transform (DFT)
CS 591 S1 – Computational Audio
Spectrum Analysis and Processing
CS 591 S1 – Computational Audio
III Digital Audio III.6 (Fr Oct 20) The MP3 algorithm with PAC.
Fourier’s Theorem.
III Digital Audio III.4 (Mo Oct 16) Fast Fourier Transform (FFT)
MECH 373 Instrumentation and Measurements
III Digital Audio III.9 (Wed Oct 25) Phase vocoder for tempo and pitch changes.
FFT-based filtering and the
MECH 373 Instrumentation and Measurements
Continuous-Time Signal Analysis
III Digital Audio III.3 (M Oct 09) Complex Fourier representation (preliminaries to FFT)
EET 422 EMC & COMPLIANCE ENGINEERING
Basic Concepts of Audio Watermarking
Sinusoids: continuous time
Net 222: Communications and networks fundamentals (Practical Part)
Chapter 8 The Discrete Fourier Transform
Lecture 18 DFS: Discrete Fourier Series, and Windowing
Fourier Analyses Time series Sampling interval Total period
III Digital Audio III.2 (F Oct 05) Finite Fourier Theory.
III Digital Audio III.6 (Mo Oct 22) The MP3 algorithm with PAC.
7.2 Even and Odd Fourier Transforms phase of signal frequencies
Chapter 8 The Discrete Fourier Transform
C H A P T E R 21 Fourier Series.
Chapter 8 The Discrete Fourier Transform
INTRODUCTION TO THE SHORT-TIME FOURIER TRANSFORM (STFT)
Lec.6:Discrete Fourier Transform and Signal Spectrum
ENEE222 Elements of Discrete Signal Analysis Lab 9 1.
Geol 491: Spectral Analysis
Presentation transcript:

III Digital Audio III.9 (Wed Oct 24) Phase vocoder for tempo and pitch changes

Phase Vocoder This algorithm is built in order to enable tempo and pitch changes of a digital sound file. Tempo change: Same music, but played at a different tempo Pitch change: Same tempo, but transposed pitch. The algorithm was first described by James L. Flanagan and R.M. Golden in a paper “Phase Vocoder” published in Bell System Technical Journal, Vol. 45, No. 9, p. 1493, November 1966. James L. Flanagan

frequency’ = frequency. Δ/Δ’ Phase Vocoder The basic technique starts with so-called resampling. time amplitude Δ time amplitude Δ’ frequency’ = frequency. Δ/Δ’ problem: sound changes dramatically! Why???

Phase Vocoder The fundamental idea here is this: Construct a new sample with longer duration + same pitch ⇒ back to original duration + higher pitch via resampling Construct a new sample with shorter duration + same pitch ⇒ back to original + lower pitch via resampling. So basically we are dealing with the time change problem! The procedure is that we first cover the original signal by a sequence of sound frames of equal length, but in order to grasp their commonalities, we choose overlapping frames. Typically this is achieved by 75%, and the frame duration is typically 1/20 sec (corresponding to 20 Hz fundamental frequency for finite Fourier). time amplitude

Phase Vocoder The idea is to work on these frames, processing them on the frequency space, and then generating a synthesis sound by adding these new frames with different overlapping times and thereby changing the tempo of the overall signal:

Phase Vocoder A frame is generated from the original sample by multiplying it with a Hanning window function H(t) time amplitude time amplitude time amplitude

First step of the algorithm: Analysis Phase Vocoder time amplitude D First step of the algorithm: Analysis The frame is the transformed to frequency representation via FFT. The fundamental frequency is of course f = 1/frame duration = 1/D. The highest frequency is fs = n.f, so that we have n frequency intervals from 0 to fs(n-1)/n Hz. Attention: n has nothing to do with the original sample frequency of the signal!! The temporal delay of ¼ frame has then 2n/4 (temporal) samples; this number is called analytical hop size hopa. In other words, we have the equation hopa/2fs = 2n/(4.2nf) = D/4 = Da = analytical hop time between successive frames.

Phase Vocoder What is the problem now? The reproduction of the frames with different distances causes phase problems: Ds synthetical hop time Da analytical hop time

Second step (processing) : We look at the phase problem: Phase Vocoder sin(2πft) Second step (processing) : We look at the phase problem: sin(2πf(t+Da)) = sin(2πft+ΔΦ) Frame i-1      ΔΦ can be calculated, omit this! ΔΦ = 2πf.Da , f = ΔΦ/2πDa = “true frequency”. Third step (synthesis): Replace Da by the synthetic frame distance Ds and then set a new phase of frame i ΔΦs, i = ΔΦs, i-1 + 2πf.Ds where the (i-1)th phase has been calculated by recursion. Correct the complex coefficients of the FFT transform accordingly. FFTransform back, multiply each frame by a Hamming curve and add it all. Frame i      take sinusoidal signal component