special indices for laalalaa lyric analysis & generation framework special indices for laalalaa...

17
Special Indices for Special Indices for LaaLaLaa Lyric Analysis LaaLaLaa Lyric Analysis & Generation Framework & Generation Framework Dr. V. Madhan Karky, Tamil Computing Lab (TaCoLa), College of Engineering Guindy, Anna University, Chennai.

Upload: sybil-osborne

Post on 17-Dec-2015

225 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

Special Indices for LaaLaLaa Lyric Special Indices for LaaLaLaa Lyric Analysis & Generation FrameworkAnalysis & Generation Framework

Dr. V. Madhan Karky,Tamil Computing Lab (TaCoLa),College of Engineering Guindy,

Anna University, Chennai.

Page 2: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

OverviewOverview• Objective• Introduction• Background• Rhyme Schemes in Tamil• Meter Pattern• System Architecture• Indexing Structure• Indexing Algorithm• Results and Analysis

Page 3: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

ObjectiveObjective

• To build special indices for the LaaLaLaa lyric analysis and generation framework to facilitate faster retrieval based on – Meter Pattern – Rhyme

Page 4: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

IntroductionIntroduction

• Tamil is a vibrant language with a rich grammar,

vocabulary, an inherent poetic flavor.

• About 1000 lyrics are being written every year as

private albums, jingles and as original soundtracks of

mainstream movies.

Page 5: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

BackgroundBackground

• WASP (Pablo Grevas (2000)) splits a given block of

text, identifies patterns and fits words from the

vocabulary to get verses of similar pattern.

• COLIBRI (Agudo, Grevas, Calero (2002)) follows a

case based approach to poem generation.

Page 6: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

BackgroundBackground

• Tra-la-Lyrics (Oliviera, Cardoso, Pereira (2005))

finds out the beat pattern of the midi file and

places words with similar syllabic division and

stress pattern.

• Automatic generation of tamil lyrics for melodies

(Ramakrishnan, Kuppan, Devi (2009)) converts the

midi file to KNM reprsn and fits words to it from a

corpus. Phrases are meaningful as parts only. No

edhugai, monai or iyaybu.

Page 7: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

BackgroundBackground

• LaaLaLaa (Sowmiya, Karky (2010)) talks of splitting raw

text from midi file to templates and filling them with

words from a wordnet according to the pattern mined

from an existing corpus of lyrics with due consideration

to rhyme, meaning and flow.

Page 8: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

Rhyme Schemes in TamilRhyme Schemes in Tamil

• Two words are said to rhyme in

– Monai (மோ��னை�) - first letters are the same.– Edhugai (எதுனை�) - second letters are the same.– Iyaibu (இனைபு) - last letters are the same.

Page 9: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

Rhyme Schemes in TamilRhyme Schemes in Tamil

• Examples:– பறனை� and பச்னை� rhyme in monai.– அரு�� and ��ருப்பு rhyme in edhugai.– �க்னை� and ��ழ்க்னை� rhyme in iyaibu.–அரு�� and குரு�� rhyme in edhugai and

iyaibu.– ���னை��ள் and ���ஞர்�ள் rhyme in all

the three schemes.

Page 10: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

Meter PatternMeter Pattern

• Maathirai (��த்��னை�) - time taken to wink an eyelid.

• Maathirai based classification of Tamil alphabets.– Nedil (N) (நெ டில்) - Those alphabets which are pronounced for the

time interval of 2 maathirai.– Kuril (K) (குற#ல்) - Alphabets which take 1 maathirai to be

pronounced.– Mei (M) (நெ�ய்) - Alphabets which are pronounced for 0.5 maathirai.

• Meter pattern of a word refers to its Kuril Nedil Mei pattern. • For example, the Meter pattern of the word ப�டல் is NKM as ப�

is a Nedil(N), ட is a Kuril(K) and ல் is a Mei(M).

Page 11: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

System ArchitectureSystem Architecture

Lyric DB

Lyric DB

Word Object Convertor

Rhyme PatternExtractor

Rhyme Extractor

Index Builder

Rhyme Meter Index

Page 12: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

Indexing StructureIndexing StructurePart of Speech

மோ��னை�

MeterPattern1Letter1 Words

Letter2 Words

MeterPattern2Letter1 Words

Letter2 Words

எதுனை�

MeterPattern1Letter1 Words

Letter2 Words

MeterPattern2Letter1 Words

Letter2 Words

இனைபு

MeterPattern1Letter1 Words

Letter2 Words

MeterPattern2Letter1 Words

Letter2 Words

Page 13: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

Indexing AlgorithmIndexing Algorithm

Page 14: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

Result and AnalysisResult and Analysis

Word

Retrieval Time(in milliseconds)

Page 15: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

Results and AnalysisResults and Analysis

• Retrieval complexity of both the approaches tested using a dataset of 500 tamil words.

• The average retrieval time in – Word indexed approach - 875.47millisecond – Meter Rhyme Indexed approach – 1.90millisecond

Page 16: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

Results and AnalysisResults and Analysis

• The drastic decrease in retrieval time from O(α) to O(1) [α is the number of words in the database] is due to

– The use of hash-tables which are efficient for retrieval.

– Having separate hash-tables for the மோ��னை�, எதுனை� and இனைபு of each POS.

Page 17: Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky,

Thank You!!!Thank You!!!