punjabi wordnet development

36
Punjabi WordNet Development Thapar University & Punjabi University Patiala

Upload: monet

Post on 13-Jan-2016

70 views

Category:

Documents


0 download

DESCRIPTION

Punjabi WordNet Development. Thapar University & Punjabi University Patiala. Presentation Outline. Status of number of synsets completed Database Development Process Demonstration of Punjabi WordNet Site. Status of work completed Dec' 2011. Total Number of synset : 1347 - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Punjabi WordNet Development

Punjabi WordNet

Development

Thapar University & Punjabi University Patiala

Page 2: Punjabi WordNet Development

Presentation Outline

• Status of number of synsets completed

• Database Development Process• Demonstration of Punjabi WordNet

Site

Page 3: Punjabi WordNet Development

Status of work completed Dec' 2011

S. No. Record File Total Synsets Total Synsets Completed by Thapar University

Total Synsets Completed by Punjabi University

1. Pan Indian Record 1347(All Completed)

674 673

2. Universal Synsets (7168 rercords)

7168(All Completed)

3084 4084

3. Adverb Synsets 209(All Completed)

104 105

4. Verb Synsets 1798(Completed 1679)

976/991(Completed/Assigned)

703/807(Completed/Assigned)

5. Adjective Synsets 3605(Completed 3574)

1803

1771/1802(Completed/Assigned)

6. Remaining Noun Record File 22050(Completed 2988)

1190/11025(Completed/Assigned)

1798/11025(Completed/Assigned)

TOTAL 36177 (16965 Completed) 7831 9134

  

16965 (Completed By Thapar University & Punjabi University ).

Page 4: Punjabi WordNet Development

Pan Indian Synsets

Total Number of synset : 1347

Total Synsets Completed by Thapar University : 674

Total Synsets Completed by Punjabi University : 673

File Status : Completed

Page 5: Punjabi WordNet Development

Universal Synsets

Total Number of synset : 7168

Total Synsets Completed by Thapar University : 3084

Total Synsets Completed by Punjabi University : 4084

File Status : Completed

Page 6: Punjabi WordNet Development

Adverb Synsets File

Total Number of synset : 209

Total Synsets Completed by Thapar University : 104

Total Synsets Completed by Punjabi University : 105

File Status : Completed

Page 7: Punjabi WordNet Development

Verb Synsets

Total Number of synset : 1798

Completed : 1679

Total Synsets Completed by Thapar University : 976/991(Completed/Assigned)

Total Synsets Completed by Punjabi University : 703/807 (Completed/Assigned)

File Status : Ongoing

Page 8: Punjabi WordNet Development

Adjective Synsets

Total Number of synset : 3605

Completed : 3574

Total Synsets Completed by Thapar University : 1803

Total Synsets Completed by Punjabi University : 1771/1802 (Completed/Assigned)

File Status : Ongoing

Page 9: Punjabi WordNet Development

Remaining Noun Synsets

Total Number of synset : 22050

Completed : 2988

Total Synsets Completed by Thapar University:1190/11025(Completed/Assigned)

Total Synsets Completed by Punjabi University : 1771/1802 (Completed/Assigned)

File Status : Ongoing

Total Synsets Completed: 16965 (Completed By Thapar University & Punjabi University ).

Page 10: Punjabi WordNet Development

Synset Id's that have been transliterated during creation process

9629, 10024, 11063, 12113, 13954, 14168, 14256, 14384, 14602

Page 11: Punjabi WordNet Development
Page 12: Punjabi WordNet Development
Page 13: Punjabi WordNet Development
Page 14: Punjabi WordNet Development
Page 15: Punjabi WordNet Development
Page 16: Punjabi WordNet Development
Page 17: Punjabi WordNet Development
Page 18: Punjabi WordNet Development

Database Creation Process

Issues in the tool send by Goa University.

Page 19: Punjabi WordNet Development

Sample Synset file uploaded on the tool

Page 20: Punjabi WordNet Development

Ouput Sanpshot of the tool

Page 21: Punjabi WordNet Development

Data in different tableswn_word wn_synset_words

Page 22: Punjabi WordNet Development

Data in different tablesTable: wn_synset

Table: wn_synset _example

Page 23: Punjabi WordNet Development

Approached followed in creation of database

Port the whole synset file in a table with following structure:

*Synset_id

*Synset

*Gloss: stores both concept and examples

*Category

Page 24: Punjabi WordNet Development

Table Snapshot

Page 25: Punjabi WordNet Development

Logic for Insertion of Data into different tables

To insert data into wn_word and wn_synset_words

We select both synset_id and synsets fields from the table. After getting synsets value from a tbl_all_punjabi_synset_data table and their corresponding synset_id. We seperated the individual synset sepearted by commas(,) using tokenizer, and before insert we check whether that particular word is already exists in the data base or not. If the doesn't exists in database then the query automatically insert the word into data base. During insertion of words it also insert the priority of word into data table by counting the words under same synset_id the query sets priority one to the very first word of the synset_id, sets priority two to the next after first word and so on.

To insert data in wn_synset and wn_synset_example

Select both synset_id and gloss fields from the table. After getting all gloss value from a tbl_all_punjabi_synset_data table and their corresponding synset_id. We inserts the examples and gloss into “wn_synset_example” and “wn_synset. If there exists two or more examples or concepts for a particular id then we simply seperated the individual value sepearted by commas(/) using tokenizer, and insert the values into tables with same synset_id.

Page 26: Punjabi WordNet Development

Snapshot of wn_word after insertion of data

Page 27: Punjabi WordNet Development

Snapshot of wn_synset_words after insertion of data

Page 28: Punjabi WordNet Development

Snapshot of wn_synset after insertion of data

Page 29: Punjabi WordNet Development

Snapshot of wn_synset_example after insertion of data

Page 30: Punjabi WordNet Development

Punjabi WordNet Demonstartion

Site Address: http://125.19.69.26:8080/PunjabiWordNet/

Available over Internet

Page 31: Punjabi WordNet Development

Snapshot of Punjabi WordNet Website

Page 32: Punjabi WordNet Development

Snapshot of Punjabi WordNet Website

Page 33: Punjabi WordNet Development

Snapshot of Punjabi WordNet Website

Page 34: Punjabi WordNet Development

Snapshot of Punjabi WordNet Website

Page 35: Punjabi WordNet Development

Snapshot of Punjabi WordNet Website

Page 36: Punjabi WordNet Development

Thanks