Digital Music Audio Processing

Slides:



Advertisements
Similar presentations
Frequency analysis.
Advertisements

David Hansen and James Michelussi
Masters Presentation at Griffith University Master of Computer and Information Engineering Magnus Nilsson
1 Copyright 2011 G.Tzanetakis Music Information Retrieval George Tzanetakis Associate Professor, IEEE Senior Member.
Chapter 3 The Fourier Series EE 207 Adil S. Balghonaim.
AES 120 th Convention Paris, France, 2006 Adaptive Time-Frequency Resolution for Analysis and Processing of Audio Alexey Lukin AES Student Member Moscow.
F 鍾承道 Acoustic Features for Speech Recognition: From Mel-Frequency Cepstrum Coefficients (MFCC) to BottleNeck Features(BNF)
1 Speech Parametrisation Compact encoding of information in speech Accentuates important info –Attempts to eliminate irrelevant information Accentuates.
A STUDY ON SPEECH RECOGNITION USING DYNAMIC TIME WARPING CS 525 : Project Presentation PALDEN LAMA and MOUNIKA NAMBURU.
DEVON BRYANT CS 525 SEMESTER PROJECT Audio Signal MIDI Transcription.
The Spectrum Jean Baptiste Fourier ( ) discovered a fundamental tenet of wave theory All periodic waves are composed of a series of sinusoidal.
The Spectrum n Jean Baptiste Fourier ( ) discovered a fundamental tenet of wave theory n All periodic waves are composed of a series of sinusoidal.
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING MARCH 2010 Lan-Ying Yeh
Representing Acoustic Information
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Audio and Music Representations (Part 2) 1.
Digital Sound and Video Chapter 10, Exploring the Digital Domain.
Where we’re going Speed, Storage Issues Frequency Space.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling V. Karjigi , P. Rao Dept. of Electrical Engineering,
2007 Multimedia System Final Paper Presentation Music Recognition 蘇冠年 蔡尚穎.
Multiresolution STFT for Analysis and Processing of Audio
Jacob Zurasky ECE5526 – Spring 2011
Dan Rosenbaum Nir Muchtar Yoav Yosipovich Faculty member : Prof. Daniel LehmannIndustry Representative : Music Genome.
Machine Translation  Machine translation is of one of the earliest uses of AI  Two approaches:  Traditional approach using grammars, rewrite rules,
Fundamentals of Music Processing
Overview of Part I, CMSC5707 Advanced Topics in Artificial Intelligence KH Wong (6 weeks) Audio signal processing – Signals in time & frequency domains.
Seismic Reflection Data Processing and Interpretation A Workshop in Cairo 28 Oct. – 9 Nov Cairo University, Egypt Dr. Sherif Mohamed Hanafy Lecturer.
Signals And Systems Chapter 3 Fourier Transform 2.
Speech Recognition Feature Extraction. Speech recognition simplified block diagram Speech Capture Speech Capture Feature Extraction Feature Extraction.
Pre-Class Music Paul Lansky Six Fantasies on a Poem by Thomas Campion.
Audio processing methods on marine mammal vocalizations Xanadu Halkias Laboratory for the Recognition and Organization of Speech and Audio
CE Coding and Transformations Sept - Nov 2010.
Fourier Series Fourier Transform Discrete Fourier Transform ISAT 300 Instrumentation and Measurement Spring 2000.
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 27,
CS434/534: Mobile Computing and Wireless Networks Y. Richard Yang 08/30/2012.
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
Predicting Voice Elicited Emotions
Fourier and Wavelet Transformations Michael J. Watts
Speech Processing Using HTK Trevor Bowden 12/08/2008.
Presented by Michael Katic. Main Influence Most of my practices in this project were an attempt to recreate an Audio Search Algorithm developed by Shazam.
The Spectrum n Jean Baptiste Fourier ( ) discovered a fundamental tenet of wave theory.
LIGO-G Z S5 calibration: time dependent coefficients  Myungkee Sung, Gabriela González, Mike Landry, Brian O’Reilly, Xavier Siemens,…
CE Coding and Transformations April – June 2011.
Audio Processing Mitch Parry. Resource! Sound Waves and Harmonic Motion.
IIS for Speech Processing Michael J. Watts
 Carrier signal is strong and stable sinusoidal signal x(t) = A cos(  c t +  )  Carrier transports information (audio, video, text, ) across.
Speech Processing Dr. Veton Këpuska, FIT Jacob Zurasky, FIT.
CS 445/656 Computer & New Media
Ch. 2 : Preprocessing of audio signals in time and frequency domain
CS 591 S1 – Computational Audio
Spectrum Analysis and Processing
CS 591 S1 – Computational Audio
Fourier’s Theorem.
3.1 Introduction Why do we need also a frequency domain analysis (also we need time domain convolution):- 1) Sinusoidal and exponential signals occur.
ARTIFICIAL NEURAL NETWORKS
Speech Processing AEGIS RET All-Hands Meeting
Spoken Digit Recognition
Fourier and Wavelet Transformations
Fourier Analyses Time series Sampling interval Total period
Audio and Speech Computers & New Media.
Wavelet Transform Fourier Transform Wavelet Transform
AUDIO SURVEILLANCE SYSTEMS: SUSPICIOUS SOUND RECOGNITION
Fourier series Periodic functions and signals may be expanded into a series of sine and cosine functions
Presenter: Simon de Leon Date: March 2, 2006 Course: MUMT611
Scientific Python Introduction
ECE 791 Project Proposal Project Title: Developing and Evaluating a Tool for Converting MP3 Audio Files to Staff Music Project Team: Salvatore DeVito.
Advances in Deep Audio and Audio-Visual Processing
Measuring the Similarity of Rhythmic Patterns
BeatBox Hero Ankit Gupta Rohan Jain.
Music Signal Processing
Presentation transcript:

Digital Music Audio Processing Grows out of DSP and speech recognition research Feature detection mostly from Fast Fourier Transforms (FFT) and Mel Frequency Cepstral Coefficients (MFCC)

Music Digital Audio http://en.wikipedia.org/wiki/Digital_audio

Audio: Two Domain Problem Frequency domain Time domain

Our Hero Jean-Baptiste Joseph Fourier Mathematician and physicist   Born: 21 March 1768 Died: 16 May 1830 Most famous for his spiffy “Fourier Transform” and related “Fourier’s Law” Also noted for early “greenhouse effect” work! https://upload.wikimedia.org/wikipedia/commons/a/aa/Fourier2.jpg By User:Bunzil at en.wikipedia [Public domain], from Wikimedia Commons

From Wave to Data http://en.wikipedia.org/wiki/User:LucasVB/Gallery

What do we mean by “audio feature”? Ideal: TRUE MEANING extracted from the audio signal

What do we mean by “audio feature”? Ideal: TRUE MEANING extracted from the audio signal

What do we mean by “audio feature”? Reality: something we can squint at & interpret a bit

“Low-level” and “high-level” features Low-level: “mechanically recovered” from the audio e.g. amplitude, timbral descriptors, spectral features High-level: usually obtained from low-level features + lots of context (template matching, machine-learning, domain knowledge) e.g. key, pitch, tempo, notes, phrases, similarity

Vamp plugins Small files you can install that add new feature extractors. Once installed, can be used with several different “hosts”: Sonic Visualiser Audacity audio editor (simple feature extractors only) Sonic Annotator – batch audio feature extraction program Python Vamp host – use with scientific coding packages for analysis, search, plotting etc

Vamp plugins and audio features

What does a Vamp feature consist of?

Example: Chromagram Somewhat representative of time- varying harmonic content Made by “wrapping around” time- frequency spectrogram into a single octave Various ways to do this → lots of different chromagram plugins Good example of an almost intuitively meaningful feature

Chromagram Motivation Limitations Applications Reduce spectrogram in a way informed by musical structure Limitations Time/frequency resolution tradeoff Misleading outcome of harmonic folding (different approaches to this) Intrinsic difficulties, e.g. with temperament Applications Chord and key estimation “Harmonic feature” for search, retrieval & similarity tasks