Finding a single voice in music Christine Smit April 26, 2007.

Finding a single voice in music Christine Smit April 26, 2007

Outline Introduction Introduction Classification Strategies: Classification Strategies: Counting silent frequency bins Counting silent frequency bins Pitch cancellation Pitch cancellation MFCCs MFCCs Trading recall for precision Trading recall for precision What worked and what didn’t What worked and what didn’t

Introduction What am I doing?

What is a ‘single voice’? a single note sounding at a time a single note sounding at a time

Why do this? single voice finder + instrument identifier = instrument sample library

What are the data sets? training set: 10 1-minute samples training set: 10 1-minute samples test set: 10 1-minute test samples test set: 10 1-minute test samples 25% single voice, 75% multi-voice/silence 25% single voice, 75% multi-voice/silence mixture of classical and folk music mixture of classical and folk music

What characterizes a single voice? non-solo solonon-solo

What characterizes a single voice?

Strategies

Strategy #1: Silence detection find silence silent HMM? music silence counts raw classification Nothing really worked

Strategy #2: Pitch Cancellation music filtered music raw classification final classification filter pitch single voice? HMM

Strategy #3: MFCCs MFCC GMM HMM music 13 features likelihood final classification

Trading recall for precision

Quick reminder Precision = out of the stuff we got, how much of it was right? Precision = out of the stuff we got, how much of it was right? Are google’s results relevant? Recall = out of all the right stuff, how much did we get? Recall = out of all the right stuff, how much did we get? If I asked google for the UN, did I get all the UN’s websites?

Precision is important If I have a large enough database, I can afford to have relatively low recall. But I want high precision so what I do get is what I want. If I have a large enough database, I can afford to have relatively low recall. But I want high precision so what I do get is what I want.

Strategy #2: Pitch Cancellation music filtered music raw classification final classification filter pitch single voice? HMM

Strategy #3: MFCCs MFCC GMM HMM music 13 features likelihood final classification

Results

Strategy #1: Silence detection (just for comparison)

Strategy #2: Pitch Cancellation

Strategy #3: MFCCs

Conclusion Silence detection really didn’t work out. Silence detection really didn’t work out. MFCCs + GMM is really just as good as pitch cancellation MFCCs + GMM is really just as good as pitch cancellation At 90% precision, I get about 25% recall. At 90% precision, I get about 25% recall.

Acknowledgements Much thanks to Professor Ellis for his assistance on this project.

Questions?

Finding a single voice in music Christine Smit April 26, 2007.

Similar presentations

Presentation on theme: "Finding a single voice in music Christine Smit April 26, 2007."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Finding a single voice in music Christine Smit April 26, 2007.

Similar presentations

Presentation on theme: "Finding a single voice in music Christine Smit April 26, 2007."— Presentation transcript:

Similar presentations

About project

Feedback