Genre Classification of Music by Tonal Harmony
Carlos Pérez-Sancho, David Rizo
Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante


Genre Classification of Music by Tonal Harmony
Carlos Pérez-Sancho, David Rizo
Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante, Spain
Stefan Kersten, Rafael Ramirez
Music Technology Group, Universitat Pompeu-Fabra, Barcelona, Spain

Outline
- Introduction
- System architecture
- Audio to chord transcription
- Language modeling
- Experiments
- Conclusions and future work

Introduction
- Chord progressions as features for genre recognition
- In classical music, each period had its own harmonic framework for composition
- Typical chord progressions can be found in every style:
  - I-VI-ii-V (jazz)
  - I-IV-V (pop-rock)
- Chord progressions are symbolic features
- They can be used to classify audio data if a chord transcription system is applied first

System architecture
[Diagram: MUSIC AUDIO FILE → AUDIO TO CHORD TRANSCRIPTION SYSTEM (audio domain) → CHORD PROGRESSIONS TRANSPOSED to C/Am (Am Dm Em …) → LANGUAGE MODELS for CLASSICAL, JAZZ and POPULAR (symbolic domain) → GENRE EVALUATION]

Audio feature extraction
[Diagram: MUSIC AUDIO FILE → AUDIO TO CHORD TRANSCRIPTION SYSTEM, comprising FEATURE EXTRACTION (HARMONIC PITCH CLASS PROFILES), KEY ESTIMATION and BEAT TRACKING / CHORD WEIGHTING → CHORD PROGRESSIONS (Cm Fm Gm …) → CHORD PROGRESSIONS TRANSPOSED to C/Am (Am Dm Em …)]

Audio feature extraction (2)
- Frame-based extraction of the Harmonic Pitch Class Profile (HPCP)
- Spectral peak tracking and mapping of peaks into 36 pitch classes (enhanced chroma feature) [E. Gómez. “Tonal Description of Music Audio Signals”. PhD thesis, 2006]
- The most probable chord is selected for each frame
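The frame-level step above can be illustrated with a minimal Python sketch. It is not the HPCP implementation from the cited thesis: the reference frequency, the plain rounding to the nearest bin, the folding of 36 bins into 12 semitones, and the template-matching score used as a stand-in for the chord "probability" are all assumptions made for illustration, and every function name is hypothetical.

```python
import math

BINS_PER_SEMITONE = 3                 # 36 bins per octave, as on the slide
N_BINS = 12 * BINS_PER_SEMITONE
REF_FREQ = 261.63                     # assumption: C4, so bin 0 is pitch class C
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def pitch_class_bin(freq_hz):
    """Map a spectral-peak frequency to one of 36 pitch-class bins."""
    bins_from_ref = N_BINS * math.log2(freq_hz / REF_FREQ)
    return round(bins_from_ref) % N_BINS


def profile_from_peaks(peaks):
    """Accumulate (frequency, magnitude) peaks into a 36-bin profile."""
    profile = [0.0] * N_BINS
    for freq_hz, magnitude in peaks:
        profile[pitch_class_bin(freq_hz)] += magnitude
    return profile


def fold_to_semitones(profile):
    """Sum the 3 bins centred on each semitone into a 12-bin chroma vector."""
    chroma = [0.0] * 12
    for b, value in enumerate(profile):
        chroma[round(b / BINS_PER_SEMITONE) % 12] += value
    return chroma


def most_probable_chord(chroma):
    """Return (label, score) of the best of the 24 major/minor triads.

    The score is the share of chroma energy on the triad's three tones,
    an assumed stand-in for the chord probability mentioned on the slide.
    """
    total = sum(chroma) or 1.0
    best = None
    for root in range(12):
        for suffix, third in (("", 4), ("m", 3)):
            tones = (root, (root + third) % 12, (root + 7) % 12)
            score = sum(chroma[t] for t in tones) / total
            if best is None or score > best[1]:
                best = (NOTE_NAMES[root] + suffix, score)
    return best
```

For a frame whose peaks are A4, C5 and E5 with equal magnitude, `most_probable_chord(fold_to_semitones(profile_from_peaks(peaks)))` selects `"Am"`.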

Audio feature extraction (3)
- Beat detection using two algorithms:
  - Dixon's multiple agent algorithm (BeatRoot) [S. Dixon. “Evaluation of the audio beat tracking system BeatRoot”. Journal of New Music Research, 36(1): 39–50, 2007]
  - Ellis' dynamic programming algorithm [D. P. W. Ellis. “Beat tracking by dynamic programming”. Journal of New Music Research, 36(1): 51–60, 2007]
- Selection of the onset stream with the least mean variance in tempo period
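One way to read the selection rule above is sketched below in Python. It assumes that "variance in tempo period" means the variance of the intervals between consecutive beats, and that each tracker returns a list of beat times in seconds. The function names and the dictionary layout are hypothetical, and neither BeatRoot nor Ellis' tracker is implemented here.

```python
from statistics import pvariance


def tempo_period_variance(beat_times):
    """Population variance of the inter-beat intervals of one beat stream."""
    intervals = [b - a for a, b in zip(beat_times, beat_times[1:])]
    if len(intervals) < 2:
        # Too few beats to judge stability; treat the stream as worst case.
        return float("inf")
    return pvariance(intervals)


def select_beat_stream(candidates):
    """Return the name of the candidate stream with the steadiest tempo period.

    `candidates` maps a tracker name to its list of beat times in seconds.
    """
    return min(candidates, key=lambda name: tempo_period_variance(candidates[name]))
```

Given two streams for the same song, the one with evenly spaced beats is chosen over the one whose beat spacing fluctuates.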

Audio feature extraction (4)
- Computation of beat-level features from frame-level features by building a chord histogram
- Chords are weighted by their probabilities computed in the previous feature extraction step
- The histogram maximum is selected as the most salient chord among the frames in a given beat
- Key and mode of the song are determined from the HPCP in order to transpose the chord transcription
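The beat-level aggregation and the transposition to C major / A minor can be sketched as follows. This is a minimal Python illustration under stated assumptions: frames arrive as (time, chord, probability) tuples, each frame belongs to the most recent beat at or before its time, chord labels use sharps only and are limited to major/minor triads, and the key and mode are already known. All names are hypothetical.

```python
import bisect
from collections import defaultdict

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def beat_level_chords(frames, beat_times):
    """Pick the most salient chord per beat from a weighted chord histogram.

    frames: iterable of (time_sec, chord_label, probability).
    beat_times: sorted beat onsets in seconds.
    """
    histograms = [defaultdict(float) for _ in beat_times]
    for time_sec, chord, prob in frames:
        idx = bisect.bisect_right(beat_times, time_sec) - 1
        if idx >= 0:                      # frames before the first beat are ignored
            histograms[idx][chord] += prob
    return [max(h, key=h.get) if h else None for h in histograms]


def transpose_to_c_or_am(chords, key_root, mode):
    """Shift chord roots so a major key maps to C and a minor key to A minor."""
    target = 0 if mode == "major" else 9
    shift = (target - NOTE_NAMES.index(key_root)) % 12
    transposed = []
    for chord in chords:
        if chord is None:                 # beat with no frames
            transposed.append(None)
            continue
        is_minor = chord.endswith("m")
        root = chord[:-1] if is_minor else chord
        new_root = NOTE_NAMES[(NOTE_NAMES.index(root) + shift) % 12]
        transposed.append(new_root + ("m" if is_minor else ""))
    return transposed
```

With this sketch, the progression Cm Fm Gm in C minor is transposed to Am Dm Em, matching the example shown in the architecture diagram.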

Language models (LM)
Training:
- For each genre, we have a set of chord progression files
- We obtain a language model for each genre
- LMs can be constructed using different n-gram lengths (2-grams, 3-grams, 4-grams)
[Diagram, n = 2: chord progressions (… Am Dm Em …) → n-gram extraction (Am Dm, Dm Em, …) → 2-gram probability → LANGUAGE MODEL]
Test:
- A new problem file is evaluated against each LM
- The assigned genre is the one whose LM gives the file the highest probability
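The training and test procedure above can be sketched in Python as a per-genre n-gram model. The slides do not say how unseen n-grams are handled, so add-one smoothing is used here purely as an assumption. The class and function names are hypothetical, and the chord sequences in the usage note are toy data, not drawn from the datasets.

```python
import math
from collections import Counter


class NGramModel:
    """Chord n-gram model with add-one smoothing (smoothing is an assumption)."""

    def __init__(self, n, vocabulary):
        self.n = n
        self.vocab_size = len(vocabulary)
        self.ngrams = Counter()      # counts of full n-grams
        self.contexts = Counter()    # counts of their (n-1)-chord prefixes

    def train(self, sequences):
        for seq in sequences:
            for i in range(len(seq) - self.n + 1):
                gram = tuple(seq[i:i + self.n])
                self.ngrams[gram] += 1
                self.contexts[gram[:-1]] += 1

    def log_prob(self, seq):
        """Sum of smoothed log-probabilities of every n-gram in the sequence."""
        total = 0.0
        for i in range(len(seq) - self.n + 1):
            gram = tuple(seq[i:i + self.n])
            numerator = self.ngrams[gram] + 1
            denominator = self.contexts[gram[:-1]] + self.vocab_size
            total += math.log(numerator / denominator)
        return total


def classify(models, seq):
    """Return the genre whose model assigns the sequence the highest probability."""
    return max(models, key=lambda genre: models[genre].log_prob(seq))
```

As a toy usage example, a 2-gram model trained on `Dm G C Am Dm G C` and another trained on `C F G C F G C` will assign the query `Dm G C` to the first model and `C F G` to the second. Longer n-grams are obtained by changing `n`.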

Datasets
- DS1: Ground truth of chord progressions
  - 761 Band-in-a-Box (symbolic) files in three genres: classical, jazz and popular; synthesized into audio
  - Full chords: Fm7 Bb7 Em7 A7 Dm7 Am7
  - Triads: Fm Bb Em A Dm Am
- DS2: 12 audio files extracted from commercial CDs of the same genres

Chord transcription system proof of concept
[Diagram: DS1 chord progressions (… Am Dm Em …) are used for LANGUAGE MODELS TRAINING both directly and after SYNTHESIS and TRANSCRIPTION; each path is evaluated with 10-FOLD CROSS VALIDATION]

Chord transcription system proof of concept
- Poorer performance for triads due to the lack of full chord structure information
- Errors in the transcription process:
  - Bad chord recognition
  - Bad key estimation
- Good results despite the simplicity of the features; note that a vocabulary of only 24 chords (major/minor triads) was used

Evaluation with real audio data
[Diagram: DS1 chord progressions (… Am Dm Em …) → FEATURE EXTRACTION (n-grams) → LANGUAGE MODELS TRAINING; DS2 → TRANSCRIPTION → chord progressions (… C Em F …) → GENRE EVALUATION against the language models]

Evaluation with real audio data
- 3-grams and 4-grams perform the same, and slightly better than 2-grams
- Better results are obtained when classifying jazz against academic or popular music

Conclusions
- Chord progressions seem to be a suitable representation for genre classification
- State-of-the-art transcription systems can be used to obtain chord progressions from audio files for this task
- Results are limited by the small vocabulary size

Future work
- Improve the chord transcription system
  - Better chord recognition, including structures like 7th, dim, aug…
- Combination of the system with melody-based language models
