Presentation is loading. Please wait.

Presentation is loading. Please wait.

Digital Music Audio Processing

Similar presentations


Presentation on theme: "Digital Music Audio Processing"— Presentation transcript:

1 Digital Music Audio Processing
Grows out of DSP and speech recognition research Feature detection mostly from Fast Fourier Transforms (FFT) and Mel Frequency Cepstral Coefficients (MFCC)

2 Music Digital Audio

3 Audio: Two Domain Problem
Frequency domain Time domain

4 Our Hero Jean-Baptiste Joseph Fourier Mathematician and physicist
Born: 21 March 1768 Died: 16 May 1830 Most famous for his spiffy “Fourier Transform” and related “Fourier’s Law” Also noted for early “greenhouse effect” work! By User:Bunzil at en.wikipedia [Public domain], from Wikimedia Commons

5 From Wave to Data

6 What do we mean by “audio feature”?
Ideal: TRUE MEANING extracted from the audio signal

7 What do we mean by “audio feature”?
Ideal: TRUE MEANING extracted from the audio signal

8 What do we mean by “audio feature”?
Reality: something we can squint at & interpret a bit

9 “Low-level” and “high-level” features
Low-level: “mechanically recovered” from the audio e.g. amplitude, timbral descriptors, spectral features High-level: usually obtained from low-level features + lots of context (template matching, machine-learning, domain knowledge) e.g. key, pitch, tempo, notes, phrases, similarity

10 Vamp plugins Small files you can install that add new feature extractors. Once installed, can be used with several different “hosts”: Sonic Visualiser Audacity audio editor (simple feature extractors only) Sonic Annotator – batch audio feature extraction program Python Vamp host – use with scientific coding packages for analysis, search, plotting etc

11 Vamp plugins and audio features

12 What does a Vamp feature consist of?

13 Example: Chromagram Somewhat representative of time- varying harmonic content Made by “wrapping around” time- frequency spectrogram into a single octave Various ways to do this → lots of different chromagram plugins Good example of an almost intuitively meaningful feature

14 Chromagram Motivation Limitations Applications
Reduce spectrogram in a way informed by musical structure Limitations Time/frequency resolution tradeoff Misleading outcome of harmonic folding (different approaches to this) Intrinsic difficulties, e.g. with temperament Applications Chord and key estimation “Harmonic feature” for search, retrieval & similarity tasks


Download ppt "Digital Music Audio Processing"

Similar presentations


Ads by Google