Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Speaker Associate Professor Ning-Han Liu. What’s MIR  Music information retrieval (MIR) is the interdisciplinary science of retrieving information from.
Retrieval Evaluation J. H. Wang Mar. 18, Outline Chap. 3, Retrieval Evaluation –Retrieval Performance Evaluation –Reference Collections.
Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.
Content-Based Classification, Search & Retrieval of Audio Erling Wold, Thom Blum, Douglas Keislar, James Wheaton Presented By: Adelle C. Knight.
Evaluating Search Engine
Information Retrieval in Practice
CSE466 Autumn ‘00- 1 System Functional Requirements  Children’s toy…comes with PC software. Child plays notes on the screen and the device makes corresponding.
Copyright Nov. 2002, George Tzanetakis Digital Music & Music Processing George Tzanetakis PostDoctoral Fellow Computer Science Department Carnegie Mellon.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
ADVISE: Advanced Digital Video Information Segmentation Engine
Designing Software for Personal Music Management and Access Frank Shipman & Konstantinos Meintanis Department of Computer Science Texas A&M University.
Distinguishing Photographic Images and Photorealistic Computer Graphics Using Visual Vocabulary on Local Image Edges Rong Zhang,Rand-Ding Wang, and Tian-Tsong.
Learning to Extract Form Labels Nguyen et al.. The Challenge We want to retrieve and integrate online databases We want to retrieve and integrate online.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
1 Music Classification Using SVM Ming-jen Wang Chia-Jiu Wang.
Overview of Search Engines
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING MARCH 2010 Lan-Ying Yeh
Audio Retrieval David Kauchak cs458 Fall Administrative Assignment 4 Two parts Midterm Average:52.8 Median:52 High:57 In-class “quiz”: 11/13.
Sound Applications Advanced Multimedia Tamara Berg.
JSymbolic and ELVIS Cory McKay Marianopolis College Montreal, Canada.
Polyphonic Queries A Review of Recent Research by Cory Mckay.
MACHINE LEARNING TECHNIQUES FOR MUSIC PREDICTION S. Grant Lowe Advisor: Prof. Nick Webb.
Multimedia Databases (MMDB)
Contactforum: Digitale bibliotheken voor muziek. 3/6/2005 Real music libraries in the virtual future: for an integrated view of music and music information.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
JSymbolic Cedar Wingate MUMT 621 Professor Ichiro Fujinaga 22 October 2009.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Detecting Semantic Cloaking on the Web Baoning Wu and Brian D. Davison Lehigh University, USA WWW 2006.
Student: Mike Jiang Advisor: Dr. Ras, Zbigniew W. Music Information Retrieval.
Music Information Retrieval -or- how to search for (and maybe find) music and do away with incipits Michael Fingerhut Multimedia Library and Engineering.
MULTIMEDIA DEFINITION OF MULTIMEDIA
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal VideoConference Archives Indexing System.
Multimedia Technology and Applications Chapter 2. Digital Audio
Audio Thumbnailing of Popular Music Using Chroma-Based Representations Matt Williamson Chris Scharf Implementation based on: IEEE Transactions on Multimedia,
CMSCDHN1114/CMSCD1011 Introduction to Computer Audio
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
AUDIO TONALITY MODE CLASSIFICATION WITHOUT TONIC ANNOTATIONS Zhiyao Duan 1,2, Lie Lu 1, and Changshui Zhang 2 1. Microsoft Research Asia (MSRA), China.
Music Information Retrieval Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
BRAZIL VOX POPULI is WINNER of The International Competition Dream Centenary Computer Graphics in Aizu, Japan.
A survey of different shape analysis techniques 1 A Survey of Different Shape Analysis Techniques -- Huang Nan.
Audio Tempo Extraction Presenter: Simon de Leon Date: February 9, 2006 Course: MUMT611.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
What Is Text Mining? Also known as Text Data Mining Process of examining large collections of unstructured textual resources in order to generate new.
Issues in Automatic Musical Genre Classification Cory McKay.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
1 Automatic Music Style Recognition Arturo Camacho.
Improving Music Genre Classification Using Collaborative Tagging Data Ling Chen, Phillip Wright *, Wolfgang Nejdl Leibniz University Hannover * Georgia.
1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik.
Classification of melody by composer using hidden Markov models Greg Eustace MUMT 614: Music Information Acquisition, Preservation, and Retrieval.
BASS TRACK SELECTION IN MIDI FILES AND MULTIMODAL IMPLICATIONS TO MELODY gPRAI Pattern Recognition and Artificial Intelligence Group Computer Music Laboratory.
Metamidi: a tool for automatic metadata extraction from MIDI files Tomás Pérez-García, Jose M. Iñesta, and David Rizo Computer Music Laboratory University.
A content-based System for Music Recommendation and Visualization of User Preference Working on Semantic Notions Dmitry Bogdanov, Martin Haro, Ferdinand.
A shallow description framework for musical style recognition Pedro J. Ponce de León, Carlos Pérez-Sancho and José Manuel Iñesta Departamento de Lenguajes.
Genre Classification of Music by Tonal Harmony Carlos Pérez-Sancho, David Rizo Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante,
Ionian University Department of Informatics
The Greek Audio Dataset
Tomás Pérez-García, Carlos Pérez-Sancho, José M. Iñesta
Korean version of GloVe Applying GloVe & word2vec model to Korean corpus speaker : 양희정 date :
Text-To-Speech System for English
Introduction to Music Information Retrieval (MIR)
Chapter 8 Arrays Objectives
Presented by Steven Lewis
Measuring the Similarity of Rhythmic Patterns
Chord Recognition with Application in Melodic Similarity
Presentation transcript:

Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification Implementation Future Work

Why do we classify? Increasing importance of digital music distribution Effectively navigating through large web-based music collections Structuring on-line music stores & radio stations Creating intelligent Internet music search engines and Peer-to-Peer systems Can be used in other type of analysis like similarity retrieval or summarization

Audio Classification Jazz Rock Classical Country Electronica Reggae World Folk New Age ? ? ? ? ? ? ? ? ? ? ? ? ?

Audio Classification (cont.)

Music Information Retrieval (MIR) The process of indexing and searching music collections. Symbolic MIR – Structured signals such as MIDI files are used. – Melodic information is typically utilized. Two different approaches: Query-by-melody (manual) and Query-by-humming Audio MIR – Arbitrary unstructured audio signals are used. – Timbral and rhythmic (beat) information is utilized.

What is MIDI? Musical Instrument Digital Interface A music definition language Communication protocol supports 128 different voices includes 16 channels

Classification Process Steps MIDI file Audio-from-MIDI fileArbitrary Audio file Pitch Histogram 4D Feature Vector (Pitch Content Feature Set) Multiple Pitch Detection Algorithm Labeled Feature Vectors used by Statistical Classifiers Histogram Construction Algorithm Timbral & Rhythmic Features Genre Classification Result by comparing the feature vectors

Pitch Histograms Unfolded Histogram –an array of 128 integer values (bins) indexed by MIDI note numbers –showing the frequency of occurrence of each note in a musical piece –contains information regarding the pitch range of the music Folded Histogram –All notes are transposed into a single octave and mapped to a circle of fifths –an array of 12 integer values –contains information regarding the pitch content of the music

Folded Pitch Histogram – Index Numbers Index Numbers

Unfolded Pitch Histograms Fig.1 - Unfolded Pitch Histograms of 2 Jazz pieces (left) and 2 Irish songs (right).

Pitch Histogram features Four dimensional feature vector –PITCH-Fold –AMPL-Fold –PITCH-Unfold –DIST-Fold

Pitch Histogram Calculation For MIDI files: –The algorithm increments the corresponding note’s frequency counter while using linear traversal over all MIDI events in the file. –Normalization For arbitrary audio files: –Multiple Pitch Detection Algorithm

Multiple Pitch Detection Algorithm Fig.2 – Multiple Pitch Detection Flow Chart

Experiment Details Types of music contents: –symbolic (refers to MIDI) –audio-from-MIDI (generated using a synthesizer playing a MIDI file) –audio (digital audio files like mp3’s found on the web) Five musical genres are used: –Electronica, Classical, Jazz, Irish Folk and Rock Experiment Set: –A set of 100 musical pieces in MIDI format for each genre –A set of 100 audio-from-MIDI pieces for each genre –A set of 100 general audio files KNN(3) Classifier

Classification Results in MIDI Fig.3 – Classification accuracy comparison of random and MIDI

Classification Results in MIDI

Fig.4 – Pair-wise evaluation in MIDI

Classification Results in MIDI Fig.5 – Average classification accuracy as a function of the length of input MIDI data

Classification Results in Audio-from-MIDI Fig.6 - Classification accuracy comparison of random and Audio-from-MIDI

Classification Results in Audio-from-MIDI

Comparison of Classification Results Fig.7 – Classification accuracy comparison

Implementation  MARSYAS –MusicAl Research SYstem for Analysis and Synthesis –the software used for audio Pitch Histogram calculation and musical genre classification. –Three distinct modes of visualization: Standard Pitch Histogram plots 3D pitch-time surfaces Projection of the pitch-time surfaces onto a 2D bitmap

MARSYAS Visualization Fig.8 – Examples of grayscale pitch-time surfaces. Jazz (top) and Irish Folk music (bottom)

Summary Symbolic representation is more preferable in the sense of computing Pitch Information. This work can be viewed as an attempt to bridge the two distinct MIR approaches by using Pitch Histograms. Pitch Histograms do carry a certain amount of genre- identifying information. Multiple Pitch Detection Algorithm is not perfect, but it works by a certain degree.

Future Work Real-time running version of Pitch Histogram. –for better classification performance. –to conduct more detailed harmonic analysis such as figured bass extraction, tonality recognition, and chord detection. The features derived from Pitch Histograms might be applicable to the problem of content-based audio identification or audio fingerprinting. Alternative feature sets are needed. Query-based retrieval mechanism for audio music signals.

Thanks Cosku Turhan for the art work on my slides… 4 Non Blondes for their song, “What's Up?” :)