Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chord Recognition EE6820 Speech and Audio Signal Processing and Recognition Mid-term Presentation JunHao Ip.

Similar presentations


Presentation on theme: "Chord Recognition EE6820 Speech and Audio Signal Processing and Recognition Mid-term Presentation JunHao Ip."— Presentation transcript:

1 Chord Recognition EE6820 Speech and Audio Signal Processing and Recognition Mid-term Presentation JunHao Ip

2 Chord Recognition:  Introduction and Background Information  Previous methods  Feature description  Experimental procedures

3 Introduction:  Music transcription >> Very difficult task >> Requires strong musical background and training  Chord transcription >> Some musical background >> Good ears >> Time consuming  Needed for automation >> Very challenging problem >> Limited successes  Investigate existing methods and try new techniques for Chord recognition

4 Musical Background:  Chord is a several tones played simultaneously.  It is usually played in a group of three tones called Triad.  Chord symbol is defined by root note and the key associated with it. Chord Families maj, min, maj7, min7, dom7, aug, dim RootsAb,Bb,Cb,Eb,Fb,Gb A,B,C,D,E,F,G A#,B#,C#,D#,E#,F#,G#

5 Previous Methods:  Manual transcription >> Currently the most accurate technique of all >> Strong musical training is needed >> Very time consuming  EM Trained Hidden Markov Model >> Compare performance of MFCC and Pitch Class Profile (PCP) >> PCP outperforms MFCC >> 83.3% accuracy in chord alignment, 26.4% accuracy in recognition  Chord Progression Hypothesis Model >> Attempt to make educational guess on chord progression >> Find keys, chord symbols, and beats concurrently >> Uses Chroma Vectors for chord estimation, similar to PCP >> 77% accuracy in recognition

6 Pitch Class Profile:  Combine pitches from different octaves to form a 12 semi- tones vector from (Ab to G#)  Feature is popular  Problem with data presentation when voice and multi instruments present.  Use 24 bins instead of 12 p [k] = floor( 24 log2( (k / N) (fs / f ref ) ) mod 24 PCP [p] = sum ( |X[k]| 2 ) k = frequency index p = bin index Fs = sampling frequencyFref = 440Hz (note A)

7

8

9 Use image processing technique?

10 Autocorrelation of Sub-band energy envelope:  New technique  Investigate correlation of sub-band energy envelope with long lag  The least common multiple of the different fundamental frequencies making up a chord Rxx[l] = ∑ x[n] x[n-l] = Rxx[-l]

11 Experiments:  Investigate the PCP feature  Sub-band Autocorrelation feature  Feature comparison  Train by Expectation Maximization Model? Nearest Neighbor?  Measure accuracy of Chord classification using test sets

12 Reference: 1. Chord Segmentation and Recognition using EM – Trained Hidden Markov Model, Alex Sheh and Daniel P.W. Ellis, 2003 2. Automatic Chord Transcription with Concurrent Recognition of Chord Symbols and Boundaries, Takuya Yoskioka, Tetsuro Kitahara, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno, 2004 3. A Chorus-section Detecting Method for Musical Audio Signals 4. Transcription Techniques – part 1, Lucas Pickford http://www.globalbass.com/archives/dec2000/transcription_techniques.htm http://www.globalbass.com/archives/dec2000/transcription_techniques.htm 5. Introductory Musicianship A Workbook 6 th Ed, Thomson Schirmer Inc, Theodore A. Lynn 6. Speech and Audio Signal Processing, John Wiley & Sons Inc, Ben Gold and Nelson Morgan

13


Download ppt "Chord Recognition EE6820 Speech and Audio Signal Processing and Recognition Mid-term Presentation JunHao Ip."

Similar presentations


Ads by Google