Digital Music Audio Processing

Digital Music Audio Processing: grows out of DSP and speech recognition research. Feature detection draws mostly on Fast Fourier Transforms (FFT) and Mel-Frequency Cepstral Coefficients (MFCC).

Music Digital Audio http://en.wikipedia.org/wiki/Digital_audio

Audio: a two-domain problem. Frequency domain; time domain.
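To make the two domains concrete, here is a minimal sketch that builds a sine wave in the time domain and finds its frequency with a naive discrete Fourier transform. The sample rate, tone frequency and function name are illustrative assumptions, not values from the slides; an FFT produces the same numbers far more efficiently.

```python
import cmath
import math

def dft(samples):
    """Naive O(N^2) discrete Fourier transform (an FFT gives the same result faster)."""
    n = len(samples)
    return [
        sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]

# Assumed, illustrative values: one second of an 8 Hz sine sampled at 64 Hz.
sample_rate = 64
n = 64
tone_hz = 8

# Time domain: amplitude as a function of time.
signal = [math.sin(2 * math.pi * tone_hz * t / sample_rate) for t in range(n)]

# Frequency domain: magnitude per frequency bin, up to the Nyquist frequency.
spectrum = dft(signal)
magnitudes = [abs(c) for c in spectrum[: n // 2]]

# The strongest bin tells us which frequency dominates the signal.
peak_bin = max(range(len(magnitudes)), key=magnitudes.__getitem__)
peak_hz = peak_bin * sample_rate / n
print(peak_hz)
```

The same samples are described twice: once as a list of amplitudes over time, once as a list of magnitudes over frequency.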

Our Hero: Jean-Baptiste Joseph Fourier. Mathematician and physicist. Born: 21 March 1768. Died: 16 May 1830. Most famous for his spiffy “Fourier Transform” and the related “Fourier’s Law” of heat conduction. Also noted for early “greenhouse effect” work! Image: https://upload.wikimedia.org/wikipedia/commons/a/aa/Fourier2.jpg By User:Bunzil at en.wikipedia [Public domain], from Wikimedia Commons

From Wave to Data http://en.wikipedia.org/wiki/User:LucasVB/Gallery

What do we mean by “audio feature”? Ideal: TRUE MEANING extracted from the audio signal

What do we mean by “audio feature”? Reality: something we can squint at & interpret a bit

“Low-level” and “high-level” features
Low-level: “mechanically recovered” from the audio, e.g. amplitude, timbral descriptors, spectral features.
High-level: usually obtained from low-level features plus lots of context (template matching, machine learning, domain knowledge), e.g. key, pitch, tempo, notes, phrases, similarity.
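As a rough illustration of what “mechanically recovered” means, the sketch below computes two common low-level features from a single frame of samples: RMS amplitude and spectral centroid. The function names, frame length and test tone are assumptions made for this example; real extractors typically add windowing and overlapping frames.

```python
import cmath
import math

def rms(frame):
    """Root-mean-square amplitude of one frame of samples."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))

def spectral_centroid(frame, sample_rate):
    """Magnitude-weighted mean frequency (Hz) of one frame, via a naive DFT."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):  # bins from 0 Hz up to Nyquist
        c = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        mags.append(abs(c))
    total = sum(mags)
    if total == 0:
        return 0.0  # silent frame: centroid is undefined, return 0 by convention here
    return sum(k * sample_rate / n * m for k, m in enumerate(mags)) / total

# Assumed, illustrative frame: a 4 Hz sine of amplitude 0.5 sampled at 64 Hz.
sample_rate = 64
frame = [0.5 * math.sin(2 * math.pi * 4 * t / sample_rate) for t in range(64)]

frame_rms = rms(frame)
frame_centroid = spectral_centroid(frame, sample_rate)
print(frame_rms, frame_centroid)
```

Neither number needs any musical context to compute, which is what separates them from high-level features such as key or tempo.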

Vamp plugins
Small files you can install that add new feature extractors. Once installed, they can be used with several different “hosts”:
- Sonic Visualiser
- Audacity audio editor (simple feature extractors only)
- Sonic Annotator: batch audio feature extraction program
- Python Vamp host: use with scientific coding packages for analysis, search, plotting etc.

Vamp plugins and audio features

What does a Vamp feature consist of?

Example: Chromagram. Somewhat representative of time-varying harmonic content. Made by “wrapping around” a time-frequency spectrogram into a single octave. Various ways to do this → lots of different chromagram plugins. Good example of an almost intuitively meaningful feature.
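The “wrapping around” step can be sketched as follows for a single spectrum frame. This is only one of the many possible approaches: it assumes equal temperament with A4 = 440 Hz, assigns each bin to its nearest pitch class, and uses hypothetical function names and a synthetic spectrum chosen for illustration. It is not the method of any particular chromagram plugin.

```python
import math

def pitch_class(freq_hz, ref_hz=440.0):
    """Nearest pitch class (0 = C, ..., 9 = A), assuming equal temperament, A4 = ref_hz."""
    midi = 69 + 12 * math.log2(freq_hz / ref_hz)
    return int(round(midi)) % 12

def chroma_from_spectrum(magnitudes, sample_rate, n_fft, min_hz=27.5):
    """Fold one magnitude spectrum into 12 pitch-class bins (a single chroma frame)."""
    chroma = [0.0] * 12
    for k, mag in enumerate(magnitudes):
        freq = k * sample_rate / n_fft
        if freq < min_hz:
            continue  # skip DC and very low bins, where pitch is not meaningful
        chroma[pitch_class(freq)] += mag
    return chroma

# Assumed, synthetic spectrum: energy at 220, 440 and 880 Hz (the note A in three octaves).
sample_rate = 8000
n_fft = 800                       # bin width = 10 Hz
magnitudes = [0.0] * (n_fft // 2 + 1)
for hz in (220, 440, 880):
    magnitudes[hz * n_fft // sample_rate] = 1.0

chroma = chroma_from_spectrum(magnitudes, sample_rate, n_fft)
print(chroma)
```

All three octaves collapse into the same pitch class. Note that a partial at 660 Hz, the third harmonic of A at 220 Hz, would be counted under E rather than A, which is one way harmonic folding can mislead.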

Chromagram
Motivation: reduce the spectrogram in a way informed by musical structure.
Limitations: time/frequency resolution tradeoff; misleading outcome of harmonic folding (different approaches to this); intrinsic difficulties, e.g. with temperament.
Applications: chord and key estimation; “harmonic feature” for search, retrieval & similarity tasks.