CS 591 S1 – Computational Audio -- Spring, 2017

Slides:



Advertisements
Similar presentations
Frequency modulation and circuits
Advertisements

November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
Techniques in Signal and Data Processing CSC 508 Frequency Domain Analysis.
Physics 1251 The Science and Technology of Musical Sound Unit 1 Session 8 Harmonic Series Unit 1 Session 8 Harmonic Series.
DFT/FFT and Wavelets ● Additive Synthesis demonstration (wave addition) ● Standard Definitions ● Computing the DFT and FFT ● Sine and cosine wave multiplication.
3.1 Chapter 3 Data and Signals Computer Communication & Networks.
Math Review with Matlab:
Intro to Fourier Analysis Definition Analysis of periodic waves Analysis of aperiodic waves Digitization Time-frequency uncertainty.
Fourier Analysis D. Gordon E. Robertson, PhD, FCSB School of Human Kinetics University of Ottawa.
Chapter 25 Nonsinusoidal Waveforms. 2 Waveforms Used in electronics except for sinusoidal Any periodic waveform may be expressed as –Sum of a series of.
1 Chapter 8 The Discrete Fourier Transform 2 Introduction  In Chapters 2 and 3 we discussed the representation of sequences and LTI systems in terms.
Lecture 1 Signals in the Time and Frequency Domains
GG 313 Lecture 26 11/29/05 Sampling Theorem Transfer Functions.
Wireless and Mobile Computing Transmission Fundamentals Lecture 2.
Copyright 2004 Ken Greenebaum Introduction to Interactive Sound Synthesis Lecture 11: Modulation Ken Greenebaum.
Lecture 1B (01/07) Signal Modulation
Chapter 15 AC Fundamentals.
Vibrationdata 1 Unit 6a The Fourier Transform. Vibrationdata 2 Courtesy of Professor Alan M. Nathan, University of Illinois at Urbana-Champaign.
NASSP Masters 5003F - Computational Astronomy Lecture 12 Complex numbers – an alternate view The Fourier transform Convolution, correlation, filtering.
Chapter2 : SIGNALS 1st semester King Saud University
Chapter 8 © Copyright 2007 Prentice-HallElectric Circuits Fundamentals - Floyd Chapter 8.
Chapter 8 Introduction to Alternating Current and Voltage.
Fourier Analysis Patrice Koehl Department of Biological Sciences National University of Singapore
Digital Image Processing Lecture 8: Fourier Transform Prof. Charlene Tsai.
CS 591 S1 – Computational Audio -- Spring, 2017
Part II Physical Layer Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Trigonometric Identities
Electrical impedance Electrical impedance, or simply impedance, describes a measure of opposition to alternating current (AC). Electrical impedance extends.
CS 591 S1 – Computational Audio -- Spring, 2017
Sinusoidal Excitation of Circuits
CS 591 S1 – Computational Audio – Spring 2017
CS 591 S1 – Computational Audio
Introduction to Alternating Current and Voltage
CS 591 S1 – Computational Audio
Digital Logic & Design Dr. Waseem Ikram Lecture 02.
MECH 373 Instrumentation and Measurements
Data Processing As a science major, you will all eventually have to deal with data. All data has noise Devices do not give useful measurements; must convert.
CS 591 S1 – Computational Audio -- Spring, 2017
Addition and Subtraction Formulas
Unit 5 The Fourier Transform.
MECH 373 Instrumentation and Measurements
ALTERNATING CURRENT AND VOLTAGE
ECE 2202 Circuit Analysis II
CS 591 S1 – Computational Audio -- Spring, 2017
CHAPTER 3 DATA AND SIGNAL
Sinusoidal Waveform Phasor Method.
Fundamental Identities
Copyright 2012, 2008, 2004, 2000 Pearson Education, Inc.
Orthogonal Frequency Division Multiplexing
All about convolution.
UNIT II Analysis of Continuous Time signal
Trigonometric Identities
Fitting Curve Models to Edges
Electric Circuits Fundamentals
UNIT-8 INVERTERS 11/27/2018.
Copyright © Cengage Learning. All rights reserved.
Digital Logic & Design Lecture 02.
Advanced Digital Signal Processing
The Beauty of Mathematics in
Simple Harmonic Motion
Signal Processing First
Copyright © 2017, 2013, 2009 Pearson Education, Inc.
Sum and Difference Formulas
9.5 Least-Squares Digital Filters
9.4 Enhancing the SNR of Digitized Signals
Correlation, Energy Spectral Density and Power Spectral Density
8.6 Autocorrelation instrument, mathematical definition, and properties autocorrelation and Fourier transforms cosine and sine waves sum of cosines Johnson.
C H A P T E R 11 A.C. Fundamentals.
Data Processing Chapter 3
3.1 Chapter 3 Data and Signals Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Presentation transcript:

CS 591 S1 – Computational Audio -- Spring, 2017 Wayne Snyder Computer Science Department Boston University Today Multiplying (and squaring) Sine Waves – the foundation of frequency detection! F0 Determination Zero-Crossing Rate Correlation and Autocorrelation Next Time: F0 Determination continued: peak picking and refinements Pitch Tracking 1

Digital Audio Fundamentals: Multiplying/Squaring Signals We have spent nearly a month learning how to add sine waves together; we will spend the next month learning what happens when we multiply! In fact, sophisticated ways of detecting frequencies in a signal will depend on what happens when we multiply one signal by another. So let’s investigate! Ring Modulation consisted of multiplying a carrier wave by a modulating wave, generally of different frequencies: The result is a signal, whose spectrum can be calculated from the carrier and modulating signals: Carrier wave fc is split into two sideband frequencies fc ± fm of amplitude (Ac+Am) / 2. 2

Digital Audio Fundamentals: Multiplying/Squaring Signals Important Digression: In a musical signal, the values alternate between positive and negative. As long as there is an integral number of periods within the window, this means that the average sample value (or the sum of the samples) is very close to 0.0: sum = 0 for k in range(len(X)): sum += X[k] print( sum / len(X) ) Average Sample Value: -1.02695629778e-17 -3.26128013484e-17 -2.85882428841e-17 3

Digital Audio Fundamentals: Multiplying/Squaring Signals Definition: A (relatively) small slice of a signal is called a window and the window length W is the duration in samples or in seconds. Definition: For a window of length W samples, a window frequency is one whose period P is such that W = P * k for some integer k, i.e., an integral number of periods exactly fit within the window; alternately, it begins and ends at same instantaneous phase. Fact 1: For any particular window, the window frequencies form a harmonic series. 4

Digital Audio Fundamentals: Multiplying/Squaring Signals Fact 2: The sum (or average) of the samples in window of a sine wave which has a window frequency will be very close to 0.0. The phase does not matter. Examples: Windows of 1 sec with waves (3.0,1.0,0.0) and (3.0,1.0,2.0): Sum -1.1934897514720433e-15 -6.8885492510162294e-07 5

Digital Audio Fundamentals: Multiplying/Squaring Signals If the wave is not at a window frequency (i.e., it ends at a different instantaneous phase than it began), then the sum could be positive or negative, due to the last, incomplete period: If we take the average sample value over larger and larger windows of a sine wave, we get a gradual convergence to a limit of 0.0: Sum 49,645,112.1607 6

Digital Audio Fundamentals: Multiplying/Squaring Signals Fact 3: When you add together two window frequency components (i.e., Additive Synthesis), you will get a signal whose sum (and average) is very close to 0.0. Average Sample Value: 6.757896233037486e-13 -5.0684221747781142e-13 -3.5061281769045235e-12 + = 7

Digital Audio Fundamentals: Multiplying/Squaring Signals Fact 4: When you multiply two window frequencies fc and fm (i.e., Ring Modulation), you will get an average sample value very close to 0.0 as long as fc ≠ fm. Average Sample Value: 6.757896233037486e-13 -5.0684221747781142e-13 -9.9964391019162251e-08 * = 8 8

Digital Audio Fundamentals: Multiplying/Squaring Signals Fact 4: When you multiply two window frequencies fc and fm (i.e., Ring Modulation), you will get an average sample value very close to 0.0 as long as fc ≠ fm. WHY? Recall that Ring Modulation with (fc, Ac, _) * (fm, Am, _) removes the carrier frequency and replaces it with two sideband frequencies (fc ± fm, (Ac, Am) / 2, _ ); thus any Ring Modulation signal is identical to one produced by Additive Synthesis. * = 9 9

Digital Audio Fundamentals: Multiplying/Squaring Signals But what happens in multiplying two sine waves when the frequencies of the two sine waves is the same? Why does this result not hold? Multiplying two identical sine waves (i.e., squaring it) produces a sine wave of twice the frequency but shifted in phase and amplitude: Average Sample: 0.0 0.25 + - This is consistent with the “half angle formula” for cos: (Obvious) Note: All the values in the squared wave are ≥ 0. Average is > 0. + - + + 10

Digital Audio Fundamentals: Multiplying/Squaring Signals These results are consistent with the spectral analysis of Ring Modulation: Ring Modulation with carrier ( fc, Ac, 0 ) and modulator ( fm, Am, 0 ) with fc = fm produces: ( 0, (Ac*Am)/2, 0 ) and ( 2*fc, (Ac*Am)/2, 0 ) where the 0 Hz wave is a constant offset of (Ac*Am)/2 to the wave:

Digital Audio Fundamentals: Multiplying/Squaring Signals Ok, let’s summarize what happens….. Let’s consider the product of two sine waves ( 5.0, 1.0, 0.0) and ( f, 1.0, 0.0 ) for f in [0.0, 0.2, 0.4, …, 4.6, 4.8, 5.0, 5.2, 5.4, …, 9.8 ]: 12

Digital Audio Fundamentals: Multiplying/Squaring Signals Alternately, we could use the formula for the product of sine waves: sin(x)*sin(y) = ½ [ cos(x-y) – cos(x+y) ] so in the case where x and y differ by δ: sin(x+δ)*sin(x) = ½ [ cos(δ) + –cos(2x+δ) ] When δ = 0, we have ½ [ 1.0 + –cos(2x) ] Amplitude displacement in range [-0.5 . 0.5] twice-frequency wave with shifted phase and inverted amplitude 13

Digital Audio Fundamentals: Multiplying/Squaring Signals The MOST IMPORTANT thing to remember, however, is that when you multiply a sine wave by itself, all the values in the squared wave are ≥ 0 and the sum (or average) sample value is > 0. Average Sample: 0.0 0.25 14

Digital Audio Fundamentals: Multiplying/Squaring Signals What happens when the signal is composite (not a simple sine wave)? Let’s track the average sample value when multiplying a composite wave created from [ ( 3.0,0.5,0.0), (5.0,0.3,0.0), (10,0.2,0.0) ] by a “probe wave” created from [ ( f, 1.0, 0.0 ) ] for various frequencies f … 15

Digital Audio Fundamentals: Multiplying/Squaring Signals What happens when the signal is composite (not a simple sine wave)? Let’s track the average sample value when multiplying a composite wave created from [ ( 3.0,0.5,0.0), (5.0,0.3,0.0), (10,0.2,0.0) ] by a “probe wave” created from [ ( f, 1.0, 0.0 ) ] for various frequencies f … 16

Digital Audio Fundamentals: Multiplying/Squaring Signals What happens when the signal is composite (not a simple sine wave)? Let’s track the average sample value when multiplying a composite wave created from [ ( 3.0,0.5,0.0), (5.0,0.3,0.0), (10,0.2,0.0) ] by a “probe wave” created from [ ( f, 1.0, 0.0 ) ] for various frequencies f … 17

Digital Audio Fundamentals: Multiplying/Squaring Signals What happens when the signal is composite (not a simple sine wave)? Let’s track the average sample value when multiplying a composite wave created from [ ( 3.0,0.5,0.0), (5.0,0.3,0.0), (10,0.2,0.0) ] by a “probe wave” created from [ ( f, 1.0, 0.0 ) ] for various frequencies f … Note that average values are exactly half the amplitudes in the original spectrum; when a “probe wave” matches the frequency and phase exactly, you get a bump in the average value, due to the squared part! 18

Digital Audio Fundamentals: Multiplying/Squaring Signals What happens when the signal is composite (not a simple sine wave)? You can think of this as a simple application of (pair-wise) plus and multiplication on vectors, which obeys the typical arithemetic rules X3 = signal from [ ( 3.0, 0.5, 0.0 ) ] X5 = signal from [ ( 5.0, 0.3, 0.0 ) ] X10 = signal from [ ( 10.0, 0.2, 0.0 ) ] P = “probe wave” = signal from [ (3.0, 1.0, 0.0) ] X = X3 + X5 + X10 X5 * P average value will be close to 0.0 X3 * P average value will be close to 0.25 = half amplitude of X3 X * P = P * (X3 + X5 + X10) = ( P * X3 ) + (P * X5 ) + (P * X10) avg close to 0.25 avg: 0.25 0.0 0.0 19

Digital Audio Fundamentals: Multiplying/Squaring Signals Conclusions on Multiplying Signals in a Window: We (for now) assume that all sine waves are at window frequencies. If a “probe wave” EXACTLY matches a component of a signal in frequency AND phase, the average sample value will be half the amplitude of the component. If a “probe wave” does not match a component, the average sample value will be very close to 0.0. This gives us a way to detect frequencies in a signal…. sort of….. Problems: Frequency must match exactly Phase must match exactly Signal must be composed entirely of window frequencies. We will deal with these problems in two phases, one as part of pitch determination and one as part of understanding the Fourier Transform…. 20

Digital Audio Fundamentals: F0 (lowest frequency) Detection Detecting the fundamental frequency of a signal (“pitch detection”, more properly “F0 detection”) is a good place to start in our study of musical signal analysis. For some signals it is easy to see their fundamental frequency: 0.3305 – 0.326 = 0.0045 1/0.0045 = 220 Hz 100 Hz These signals have a clear sense of a dominant pitch. 21

Digital Audio Fundamentals: F0 Detection Other sounds are harder to determine or have no pitch: [Example: Beethoven’s Seventh Symphony.] [Higher love, percussion opening.] Interestingly, although we are not always conscious of it, speech sounds do usually have a dominant pitch, although some consonants (“s”) are practically pitchless. [Genesis01.wav] 22

Digital Audio Fundamentals: F0 Detection with ZCR The simplest pitch detection is the Zero-Crossing Rate (ZCR) Algorithm; since a sine wave crosses the 0 line twice in its period, you could just count the number of times the signal changes sign (+ to – or – to +) in a signal window and divide by 2*length of the window: 4/(2*0.1s) = 20 Hz 23

Digital Audio Fundamentals: F0 Detection with ZCR There are various problems with the approach, first, that it can be off by half a period: 4/(2*0.1s) = 20 Hz but the signal is 23.2 Hz 24

Digital Audio Fundamentals: F0 Detection with ZCR There are various problems with the approach, first, that it can be off by half a period: Works better when frequency is large relative to window size….. 25

Digital Audio Fundamentals: F0 Detection with ZCR The second problem is that higher harmonics in the signal may create too many zero crossings! The solution is typically to smooth the signal using a “low-pass filter” that filters out the higher harmonics….. this is the approach taken by cheap “chromatic tuners for guitars”…. 26

Digital Audio Fundamentals: Correlation and AutoCorrelation How do we find the period of a periodic signal? In fact, what we need is to determine at what distance the signal is periodic…. Recall: a signal is periodic with period P if X[k] == X[k+P] for all k < len(X) – P: P X[k] X[k+P] 27

Digital Audio Fundamentals: Correlation and AutoCorrelation In other words, we need to find the correlation between the signal, and the signal shifted by P samples: When we find the correlation of a signal with a shifted version of itself, this is called Auto-Correlation X[k] Signal shifted by P samples 28 X[k+P]

Digital Audio Fundamentals: Correlation and AutoCorrelation Correlation measures how similar two random variables (or vectors or signals) are: Pearson’s Product-Moment Coefficient for random variables X and Y: But for signals, there are several simplifying conditions: Since we will be interested only in comparing magnitudes (e.g., finding the maximum), we do not need to normalize to the range [-1..1], and so can eliminate the denominator; Since signals have a mean very close to 0, we do not need to standardize by subtracting the means (already standardized!) So we would have: average sample value = E( X * Y ) = Σ(Xi * Yi) / len(X) but, again, as in (1), we do not need the denominator! We have simply: Σ(Xi * Yi) = dotProduct(X,Y) = sum of product of X and Y 29

Digital Audio Fundamentals: Correlation and AutoCorrelation In fact, the correlation varies as a sine wave, with a maximum at 0 and 2π and a minimum at π, and 0 at π/2 and 3π/2 (effectively a cosine): 30

Digital Audio Fundamentals: Correlation and AutoCorrelation Autocorrelation is the same thing as correlation, but the Y signal is a time-shifted version of the X signal: Correlate X[ L : R ] with X[ L+lag: R+ lag ] Σ(Xi * Xi+lag) 31 Lag

Pitch Determination using Autocorrelation The code is (as usual) fairly simple: def acorr(X, lag): sum = 0 for i in range(len(X) - lag): sum += X[i]*X[i+lag] return sum/(len(X)-lag) 32 Lag

Pitch Determination using Autocorrelation When you have a simple sine wave and the perfect lag, you are essentially squaring the wave and this maximizes the sum of products. Lag 33

Pitch Determination using Autocorrelation When we graph the autocorrelation coefficient for various lag times, the peaks in the curve show where the correlation between the lagged signal is strongest, which corresponds to frequency F0. Let’s look at some (artificial) examples. Spectrum: [ ( 441, 1.0, 0 ) ] 34

Pitch Determination using Autocorrelation Spectrum: [ ( 441, 1.0, pi ) ] Note that the phase does not matter! 35

Pitch Determination using Autocorrelation Spectrum: [ ( 441, 1.0, 2.3 ) ] Note that the phase does not matter! 36

Pitch Determination using Autocorrelation Spectrum (with even harmonics): [ (441,0.6,0), (441*3,0.2,0), (441*5,0.2,0) ] 37

Pitch Determination using Autocorrelation Spectrum (with even harmonics): [ (441,0.6,0), (441*3,0.2,0), (441*5,0.2,0) ] Spectrum: [ (110, 1.0, 0 )] 38

Pitch Determination using Autocorrelation Bell Spectrum: 220 Hz 39

Pitch Determination using Autocorrelation Steel String Spectrum: 220 Hz 40

Pitch Determination using Autocorrelation Now let’s try some actual music signals….. Cello.wav at 11.35 41

Pitch Determination using Autocorrelation WTC1_01.wav at 0.3 sec 42

Pitch Determination using Autocorrelation SteelString.wav at 0.5 sec (220 Hz) Bell Spectrum: 220 Hz 43

Pitch Determination using Autocorrelation SteelString.wav at 0.006 sec (220 Hz) 44

Pitch Determination using Autocorrelation SteelString.wav at 0.006 sec for 0.015 seconds duration 45

Pitch Determination using Autocorrelation SteelString.wav at 0.006 sec for 0.01 seconds duration 46

Pitch Determination using Autocorrelation Genesis01.wav at 2.87 sec 47

Pitch Determination using Autocorrelation Beethoven.Seventh.wav at 3.125 48

Pitch Determination using Autocorrelation Bell.wav at 0.03 49

Pitch Determination using Autocorrelation Cymbal.wav at 0.3 sec 50

Pitch Determination using Autocorrelation HigherLove.wav at 1.143 sec 51

EXTRA SLIDES But what happens when the frequencies of the two sine waves is the same? Multiplying two sine waves of the same frequency but opposite phase produces the inverse of the product: Average Sample: 0.0 - 0.25 + - This makes sense if we think of signals as vectors which we can multiply pairwise: X * X = X2 X * - X = - (X * X) = - X2 + - - - 52

Digital Audio Fundamentals: Multiplying/Squaring Signals But what happens when the frequencies of the two sine waves is the same? Multiplying two sine waves of the same frequency but off by π/2 (e.g., sin and cos) produces a (normal) sine wave (of twice the frequency). This is consistent with the formula for the product of two sin expressions: sin(x)*sin(y) = ½ [ cos(x-y) – cos(x+y) ] so: sin(x+φ)*sin(x+ξ) = ½ [ cos(φ+ξ) – cos(2x+(φ+ξ) ) ] Amplitude displacement in range [-0.5 . 0.5] twice-frequency wave with shifted phase and inverted amplitude 53

Digital Audio Fundamentals: Multiplying/Squaring Signals The MOST IMPORTANT thing to remember, however, is that when you multiply a sine wave by itself, all the values in the squared wave are ≥ 0 and the sum (or average) sample value is > 0. Average Sample: 0.0 0.25 54