MATCH A Music Alignment Tool Chest

Slides:

Advertisements

Similar presentations

Speaker Associate Professor Ning-Han Liu. What’s MIR  Music information retrieval (MIR) is the interdisciplinary science of retrieving information from.

Advertisements

Multimedia Database Systems

Word Spotting DTW.

In collaboration with Hualin Gao, Richard Duncan, Julie A. Baca, Joseph Picone Human and Systems Engineering Center of Advanced Vehicular System Mississippi.

LAM: Musical Audio Similarity Michael Casey Centre for Cognition, Computation and Culture Department of Computing Goldsmiths College, University of London.

Lyric alignment in popular songs Luong Minh Thang.

Dynamic Programming Tutorial Elaine Chew QMUL: ELE021/ELED021/ELEM March 2012.

Evaluation of the Audio Beat Tracking System BeatRoot By Simon Dixon (JNMR 2007) Presentation by Yading Song Centre for Digital Music

74 th EAGE Conference & Exhibition incorporating SPE EUROPEC 2012 Automated seismic-to-well ties? Roberto H. Herrera and Mirko van der Baan University.

LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.

Multimedia Search and Retrieval: New Concepts, System Implementation, and Application Qian Huang, Atul Puri, Zhu Liu IEEE TRANSACTION ON CIRCUITS AND SYSTEMS.

Distance Functions for Sequence Data and Time Series

Real-Time Speech Recognition Thang Pham Advisor: Shane Cotter.

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING MARCH 2010 Lan-Ying Yeh

Exact Indexing of Dynamic Time Warping

Variable Penalty Dynamic Time Warping For Aligning Chromatography Data David Clifford Research Scientist June 2009.

Educational Software using Audio to Score Alignment Antoine Gomas supervised by Dr. Tim Collins & Pr. Corinne Mailhes 7 th of September, 2007.

Paper by Craig Stuart Sapp 2007 & 2008 Presented by Salehe Erfanian Ebadi QMUL ELE021/ELED021/ELEM021 5 March 2012.

Audio Fingerprinting MUMT 611 Ichiro Fujinaga McGill University.

National Taiwan University

Polyphonic Music Transcription Using A Dynamic Graphical Model Barry Rafkind E6820 Speech and Audio Signal Processing Wednesday, March 9th, 2005.

Incorporating Dynamic Time Warping (DTW) in the SeqRec.m File Presented by: Clay McCreary, MSEE.

Voice Recognition (Presentation 2) By: Priya Devi A. S/W Developer, Xsys technologies Bangalore.

Fundamentals of Music Processing

Audio Thumbnailing of Popular Music Using Chroma-Based Representations Matt Williamson Chris Scharf Implementation based on: IEEE Transactions on Multimedia,

Rhythmic Transcription of MIDI Signals Carmine Casciato MUMT 611 Thursday, February 10, 2005.

Demos for QBSH J.-S. Roger Jang ( 張智星 ) CSIE Dept, National Taiwan University.

Search. Search issues How do we say what we want? –I want a story about pigs –I want a picture of a rooster –How many televisions were sold in Vietnam.

Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall /14.

Introduction to Onset Detection Functions HAO-HSUN LI 1/30.

Audio Tempo Extraction Presenter: Simon de Leon Date: February 9, 2006 Course: MUMT611.

By Danny Matthews Supervised by Dr Des Watson. 8 Bit 8 Bit console released in Million 60 Million Units Sold 1000 Released Titles Over 1000 Released.

Exact indexing of Dynamic Time Warping

March 31, 1998NSF IDM 98, Group F1 Group F Multi-modal Issues, Systems and Applications.

QBSH Corpus The QBSH corpus provided by Roger Jang [1] consists of recordings of children’s songs from students taking the course “Audio Signal Processing.

By Kejia Zhang PowerSpy: Location Tracking using Mobile Device Power Analysis Yan Michalevsky, Aaron Schulman, etc. Stanford University Published in USENIX.

Query by Singing and Humming System

DTW for Speech Recognition J.-S. Roger Jang ( 張智星 ) MIR Lab ( 多媒體資訊檢索實驗室 ) CS, Tsing Hua Univ. ( 清華大學.

DYNAMIC TIME WARPING IN KEY WORD SPOTTING. OUTLINE KWS and role of DTW in it. Brief outline of DTW What is training and why is it needed? DTW training.

Natural Language and Speech (parts of Chapters 8 & 9)

1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik.

Definition of the Hidden Markov Model A Seminar Speech Recognition presentation A Seminar Speech Recognition presentation October 24 th 2002 Pieter Bas.

Rashomon: Toolkit for Assembling and Analyzing Multi-Perspective Video Chronologies Rashomon Project (Under Construction - February 2012) About the Project.

1/16 Dynamic Programming Carmine Casciato MUMT 611 Thursday March 31 st 2005.

A Music Search Engine for Plagiarism Detection

A NONPARAMETRIC BAYESIAN APPROACH FOR

Time Series and Dynamic Time Warping

Online Signature Verification

David Sears MUMT November 2009

Rhythmic Transcription of MIDI Signals

OUTLINE Introduction Background Dataset Context Analysis Methodology

Catherine Lai MUMT-611 MIR February 17, 2005

OUTLINE Introduction Background Dataset Context Analysis Methodology

A review of audio fingerprinting (Cano et al. 2005)

Artificial Intelligence for Speech Recognition

Accelerometer-Based Character Recognition Pen

Genomic Data Clustering on FPGAs for Compression

Introduction to Music Information Retrieval (MIR)

Database Performance Tuning and Query Optimization

Distance Functions for Sequence Data and Time Series

Speech Database/Tool System And Preliminary Accent study.

Chapter 4: Representing sound

Lesson 7 Plan a Presentation

Presenter: Simon de Leon Date: March 2, 2006 Course: MUMT611

Chapter 11 Database Performance Tuning and Query Optimization

Biometric transaction confirmation with ComBiom.

Accelerometer-Based Character Recognition Pen

Using Animation and Multimedia

Measuring the Similarity of Rhythmic Patterns

Presentation transcript:

MATCH A Music Alignment Tool Chest by Simon Dixon & Gerhard Widmer (ISMIR2005) Presentation prepared by Richard Matthew Flanagan for QMUL ELE021 Music & Speech Processing 27 February 2012

MATCH : A Music Alignment Tool Chest Dr…WHO?? Developed in 2005 by Dr. Simon Dixon. Forms part of C4DM’s MIR research. “A toolkit for aligning audio recordings of different renditions of the same piece of music” ♩♩♩ ♫ ♩♩ ♬ ♪ ♩♫♩ ♪♫♫ ‘Time Lord’

MATCH : A Music Alignment Tool Chest WHAT Is The Point?? Current indexing does not cut the mustard… Content-based indexing of CDs is limited to the level of tracks (number of songs or movements). USER DEFINED INDEXING = HAPPY FACE

MATCH : A Music Alignment Tool Chest USECASE : Piano Student or Music Lover Comparison of how they play the same phrase Pianist 3 Pianist 2 Pianist 1 Requires manual search to find the exact phase in each recording. VERY LONG!!

MATCH : A Music Alignment Tool Chest HOW? Does It All MATCH Up Based on an efficient dynamic time warping algorithm. An idea first established in 1978 by H. Sakoe and S. Chiba. Heavily used in speech processing. Measures the similarity between two sequences which may vary in time or speed.

MATCH : A Music Alignment Tool Chest DYNAMIC TIME WARPING

MATCH : A Music Alignment Tool Chest MATCH IMPLEMENTATION The returned path by the DTW algorithm is used as a lookup table between the two audio files. Includes various functions for displaying the cost matrix, forward and backwards path and other meta data associated with the files. Alignment takes approximately 4% of the sum durations of the files Thus allowing for playback while matching takes place.

MATCH : A Music Alignment Tool Chest TIME FOR A FIDDLE…

MATCH : A Music Alignment Tool Chest References… S. Dixon and G. Widmer, “MATCH: A music alignment tool chest,” in 6th International Conference on Music Information Retrieval, 2005 S. Dixon. Live tracking of musical performances using on-line time warping. In Proceedings of the 8th International Conference on Digital Audio Effects, 2005 H. Sakoe and S. Chiba. Dynamic programming algorithm optimisation for spoken word recognition. IEEE Transactions on Acoustics, Speech and Signal Processing, 26:43–49, 1978 http://web.science.mq.edu.au/~cassidy/comp449/html/ch11s02.html