Discussions on Audio Melody Extraction (AME) J.-S. Roger Jang ( 張智星 ) MIR Lab, CSIE Dept. National Taiwan University.



2/6 Outline
- Dataset preparation for AME
- Suggestions to AME task in MIREX

3/6 Dataset Preparation for AME
Goals
- Large enough to have statistical significance
- Diversified contents for better generalization
- More instrumental music
- Should be full songs instead of excerpts
- Annotation procedure should be standardized and fully documented
- Music contents should be as professional as possible
Reference: J. Salamon and J. Urbano, "Current Challenges in the Evaluation of Predominant Melody Extraction Algorithms", ISMIR, 2012
How about two datasets?

4/6 Suggestions to AME Task in MIREX
Goal: to simplify the task in order to
- Reduce the entry barrier, since the basic task is already hard enough
- Encourage more people to participate
- Promote other tasks such as cover song ID
Directions for simplification
- Datasets for different lead instruments
  - Lead singer only: Subset of type A
  - Other lead instruments: Subset of type A
- About submissions
  - Different submissions for different datasets
  - Train/test procedures
- Simpler criteria
  - Get rid of +5dB and -5dB?

5/6 3 Definitions of Melody
Type A: The f0 curve of the most predominant melodic source in the recording, and only that source. So, for example, if there's a lead singer but also a guitar solo, the annotation will only include the lead singer. This is closest to the definition used in MIREX right now.
Type B: The f0 of the most predominant melodic source in the recording at any given point in time. In this more relaxed definition, the f0 curve can include the pitch of several sources (but only one source at any point in time). To create this we annotated all the pitch tracks that we considered melodic (e.g. lead voice, solos, etc.). Then we ranked them from most predominant to least predominant. Then the final f0 curve was generated by taking at every timestamp the f0 value from the most predominant source that was active at that time.
Type C: The multi-f0 curve of all melodic instruments. This is closer to multi-f0 tracking in the sense that there may be several active melodic f0 values at the same time. However, unlike multi-f0 tracking, we don't annotate all pitched instruments in the track (e.g. we don't annotate the bass line), only the tracks that are considered melodic. Under this definition, the algorithm's estimate would be considered correct if it matches any of the active melodic sources in the annotation.
Source: R. M. Bittner, J. Salamon, M. Tierney, M. Mauch, C. Cannam and J. P. Bello, "MedleyDB: A Multitrack Dataset for Annotation-Intensive MIR Research", ISMIR, 2014
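The Type B construction and the Type C correctness rule described on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the MedleyDB or MIREX implementation: the function names, the use of 0.0 to mark an inactive frame, and the 50-cent tolerance are all choices made for this sketch, and only voiced estimates are checked.

```python
import math


def type_b_curve(ranked_tracks):
    """Merge per-source f0 tracks into a single Type B curve.

    ranked_tracks: equal-length lists of f0 values in Hz, ordered from most
    to least predominant. 0.0 marks an inactive frame (assumed convention).
    """
    n_frames = len(ranked_tracks[0])
    merged = []
    for t in range(n_frames):
        # The first active source in rank order wins; 0.0 if none is active.
        merged.append(next((trk[t] for trk in ranked_tracks if trk[t] > 0), 0.0))
    return merged


def cents_apart(f_est, f_ref):
    """Absolute pitch distance between two positive frequencies, in cents."""
    return abs(1200.0 * math.log2(f_est / f_ref))


def type_c_correct(f_est, active_f0s, tol_cents=50.0):
    """Type C rule for one frame: a voiced estimate is correct if it matches
    ANY active melodic source within tol_cents (assumed tolerance)."""
    if f_est <= 0:
        return False  # unvoiced estimates are not handled in this sketch
    return any(cents_apart(f_est, f) <= tol_cents for f in active_f0s if f > 0)


# Hypothetical 4-frame example: the vocal outranks the guitar solo.
lead_vocal = [220.0, 220.0, 0.0, 0.0]
guitar_solo = [0.0, 330.0, 330.0, 0.0]
type_b = type_b_curve([lead_vocal, guitar_solo])
```

In the example, frame 1 has both sources active and the higher-ranked vocal is kept, while frame 2 falls back to the guitar because the vocal is silent. A Type A annotation of the same material would simply be `lead_vocal`.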

6/6 Discussions
How can we join forces to create an AME dataset that satisfies (almost) all the requirements?