Lyric alignment in popular songs Luong Minh Thang
Outline Project description & techniques Some important knowledge Base-line system
Outline Project description & techniques Some important knowledge Base-line system
Project description Given: textual transcription of lyrics acoustic musical signal of a song Purpose: find the timestamps for the beginning and ending points for each line of the song.
Techniques to be investigated A repetition based technique for detecting self-similarities in both the audio and the text Dynamic programming (also known as dynamic time warping) will be employed to align the repetition analysis of both media together to produce an alignment
Outline Project description & techniques Some important knowledge Base-line system
Chroma vectors 1 octaves with 12 semitones 12-dimensional chroma vector C C# D … B C C# D…B C C# D…B C C# D…B Tone heights 1 octaves with 12 semitones
Dynamic time warping algorithm Actually, dynamic programming
Word similarity using phoneme dictionary Using CMU phoneme dictionary Each word is decomposed into phoneme sequence ALIGNMENT: AH0 L AY1 N M AH0 N T ALGORITHMS: AE1 L G ER0 IH2 DH AH0 M Z Similarity of 2 words is the similarity of the 2 phoneme sequences
Outline Project description & techniques Some important knowledge Base-line system
Base-line system: overview Musical signal inputText input Chroma-vector calculations Simplification Text processing Music notation sequence Word sequence Word self- similarity matrix Mapping Aligning Phoneme dictionary Symbol Self- similarity matrix
Base-line system: overview Musical signal inputText input Chroma-vector calculations Simplification Text processing Music notation sequence Word sequence Word self- similarity matrix Mapping Aligning Phoneme dictionary Symbol Self- similarity matrix
Base-line system: overview Musical signal inputText input Chroma-vector calculations Simplification Text processing Music notation sequence Word sequence Word self- similarity matrix Mapping Aligning Phoneme dictionary Symbol Self- similarity matrix
Base-line system: overview Musical signal inputText input Chroma-vector calculations Simplification Text processing Music notation sequence Word sequence Symbol Self- similarity matrix Mapping Aligning Phoneme dictionary Word self- similarity matrix
Base-line system: overview Musical signal inputText input Chroma-vector calculations Simplification Text processing Music notation sequence Word sequence Symbol Self- similarity matrix Mapping Aligning Phoneme dictionary Word self- similarity matrix
Thank you !