Presentation is loading. Please wait.

Presentation is loading. Please wait.

Special Indices for LaaLaLaa Lyric Analysis & Generation Framework

Similar presentations


Presentation on theme: "Special Indices for LaaLaLaa Lyric Analysis & Generation Framework"— Presentation transcript:

1 Special Indices for LaaLaLaa Lyric Analysis & Generation Framework
Dr. V. Madhan Karky, Tamil Computing Lab (TaCoLa), College of Engineering Guindy, Anna University, Chennai.

2 Overview Objective Introduction Background Rhyme Schemes in Tamil
Meter Pattern System Architecture Indexing Structure Indexing Algorithm Results and Analysis

3 Objective To build special indices for the LaaLaLaa lyric analysis and generation framework to facilitate faster retrieval based on Meter Pattern Rhyme

4 Introduction Tamil is a vibrant language with a rich grammar, vocabulary, an inherent poetic flavor. About 1000 lyrics are being written every year as private albums, jingles and as original soundtracks of mainstream movies.

5 Background WASP (Pablo Grevas (2000)) splits a given block of text, identifies patterns and fits words from the vocabulary to get verses of similar pattern. COLIBRI (Agudo, Grevas, Calero (2002)) follows a case based approach to poem generation.

6 Background Tra-la-Lyrics (Oliviera, Cardoso, Pereira (2005)) finds out the beat pattern of the midi file and places words with similar syllabic division and stress pattern. Automatic generation of tamil lyrics for melodies (Ramakrishnan, Kuppan, Devi (2009)) converts the midi file to KNM reprsn and fits words to it from a corpus. Phrases are meaningful as parts only. No edhugai, monai or iyaybu.

7 Background LaaLaLaa (Sowmiya, Karky (2010)) talks of splitting raw text from midi file to templates and filling them with words from a wordnet according to the pattern mined from an existing corpus of lyrics with due consideration to rhyme, meaning and flow.

8 Rhyme Schemes in Tamil Two words are said to rhyme in
Monai (மோனை) - first letters are the same. Edhugai (எதுகை) - second letters are the same. Iyaibu (இயைபு) - last letters are the same.

9 Rhyme Schemes in Tamil Examples: அருவி and விருப்பு rhyme in edhugai.
பறவை and பச்சை rhyme in monai. அருவி and விருப்பு rhyme in edhugai. யாக்கை and வாழ்க்கை rhyme in iyaibu. அருவி and குருவி rhyme in edhugai and iyaibu. கவிதைகள் and கவிஞர்கள் rhyme in all the three schemes.

10 Meter Pattern Maathirai (மாத்திரை) - time taken to wink an eyelid.
Maathirai based classification of Tamil alphabets. Nedil (N) (நெடில்) - Those alphabets which are pronounced for the time interval of 2 maathirai. Kuril (K) (குறில்) - Alphabets which take 1 maathirai to be pronounced. Mei (M) (மெய்) - Alphabets which are pronounced for 0.5 maathirai. Meter pattern of a word refers to its Kuril Nedil Mei pattern. For example, the Meter pattern of the word பாடல் is NKM as பா is a Nedil(N), ட is a Kuril(K) and ல் is a Mei(M).

11 System Architecture Lyric DB Word Object Convertor Rhyme Pattern
Extractor Rhyme Index Builder Rhyme Meter Index

12 Indexing Structure Part of Speech மோனை MeterPattern1 Letter1 Words
எதுகை இயைபு

13 Indexing Algorithm

14 Result and Analysis Retrieval Time (in milliseconds) Word

15 Results and Analysis Retrieval complexity of both the approaches tested using a dataset of 500 tamil words. The average retrieval time in Word indexed approach millisecond Meter Rhyme Indexed approach – 1.90millisecond

16 Results and Analysis The drastic decrease in retrieval time from O(α) to O(1) [α is the number of words in the database] is due to The use of hash-tables which are efficient for retrieval. Having separate hash-tables for the மோனை, எதுகை and இயைபு of each POS.

17 Thank You!!!


Download ppt "Special Indices for LaaLaLaa Lyric Analysis & Generation Framework"

Similar presentations


Ads by Google