Consortium Project on Development of Dravidian WordNet: An Integrated WordNet for Telugu, Tamil, Kannada and Malayalam.



Objective
Develop an integrated WordNet in four major Dravidian languages, viz. Tamil, Telugu, Kannada and Malayalam
  o Linked with Hindi and English WordNets
30-April PRSG Meeting

Consortium Members
Consortium Leader
  ▫ Prof. Pushpak Bhattacharyya, IIT Bombay
Consortium Members
  ▫ Dr. S. Baskaran, Tamil University (Tamil)
  ▫ Prof. K.P. Soman, Amrita Vishwa Vidyapeetham (Malayalam)
  ▫ Prof. C.S. Ramachandra, University of Mysore (Kannada)
  ▫ Dr. S. Arulmozi, Dravidian University (Co-Consortium Leader & Telugu)

Project Details
Total Outlay of the Project:
  o (amount missing) lakhs
Date of Commencement:
  o 26 Dec 2011
Duration of the Project:
  o 24 months

Project Deliverables
The integrated Dravidian WordNet will be linked with Hindi and English WordNets, with which the users will be able to
  ▫ Look up their language-specific words to obtain lexico-semantic relations like synonymy, hypernymy, meronymy etc.
  ▫ Query for cross-lingual lexical information
  ▫ Design and implement complex natural language applications like machine translation and cross-lingual search
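The first two deliverables above (language-specific lookup of lexico-semantic relations, and cross-lingual queries) can be illustrated with a minimal sketch. It assumes that synsets in different languages are linked through a shared synset ID; the slides do not describe the actual data model, so the `Synset` and `LinkedWordNet` classes, the language codes and the two toy entries are hypothetical and for illustration only.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Synset:
    synset_id: int                   # hypothetical shared ID linking all languages
    pos: str                         # part of speech, e.g. "noun"
    words: Dict[str, List[str]]      # language code -> member words of the synset
    hypernym_ids: List[int] = field(default_factory=list)


class LinkedWordNet:
    """Toy in-memory store; not the project's actual tool or schema."""

    def __init__(self, synsets: List[Synset]) -> None:
        self.by_id = {s.synset_id: s for s in synsets}

    def lookup(self, lang: str, word: str) -> List[Synset]:
        # All synsets that contain `word` in language `lang`.
        return [s for s in self.by_id.values() if word in s.words.get(lang, [])]

    def synonyms(self, lang: str, word: str) -> List[str]:
        # Other members of the same synset(s) in the same language.
        return [w for s in self.lookup(lang, word) for w in s.words[lang] if w != word]

    def translate(self, word: str, src: str, tgt: str) -> List[str]:
        # Cross-lingual query: follow the shared synset ID into the target language.
        return [w for s in self.lookup(src, word) for w in s.words.get(tgt, [])]

    def hypernyms(self, lang: str, word: str) -> List[str]:
        # Words of the parent synset(s), reported in the query language.
        return [w
                for s in self.lookup(lang, word)
                for hid in s.hypernym_ids
                for w in self.by_id[hid].words.get(lang, [])]


# Illustrative entries only (assumed data, not taken from the project).
wn = LinkedWordNet([
    Synset(1, "noun", {"en": ["plant"], "ta": ["தாவரம்"]}),
    Synset(2, "noun",
           {"en": ["tree"], "hi": ["पेड़", "वृक्ष"], "ta": ["மரம்"], "te": ["చెట్టు"]},
           hypernym_ids=[1]),
])

print(wn.translate("மரம்", "ta", "hi"))   # Tamil word -> Hindi members of the same synset
print(wn.synonyms("hi", "पेड़"))           # Hindi synonyms
print(wn.hypernyms("ta", "மரம்"))         # Tamil hypernym
```

Keying every language's entries on one synset ID keeps the cross-lingual query a plain dictionary lookup; a word with no entry in the target language simply yields an empty list.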

Organization and Distribution of Tasks
IIT-B
  ▫ Overall coordination of the project
  ▫ Providing guidance on the architecture and technology
  ▫ Making available existing tools and interfaces
  ▫ Computational tasks; algorithms on WordNets

Organization & Distribution of Tasks
Other Partners
  ▫ Creation of synsets
  ▫ Validation of synsets
  ▫ Adaptation of semantic relations and validation (each in Tamil, Telugu, Malayalam and Kannada)

Tamil WordNet
Commencement Date: 24 April 2012
Principal Investigator: Dr. S. Baskaran
Senior Linguist
  ▫ G. Vasuki, M.A. M.Phil (Ling.)
Computer Scientist
  ▫ G. Biju, MCA, M.Phil
Lexicographers
  ▫ D. Yoga, M.A. M.Phil (Ling), M.A. (Tamil)
  ▫ M. Ramasundari, M.A. M.Phil, Ph.D (Ling.)
  ▫ D. Vinodha, M.A. (Hindi), Dip. in Translation
  ▫ K. Bakkiyaraj, M.A. M.Phil (Ling.)

Malayalam WordNet
Commencement Date: 24 April 2012
Principal Investigator: Prof. K.P. Soman
Senior Linguist
  o N. Rajendran, M.A. Ph.D (Ling.)
Computer Scientist
  o K. Krishnakumar, MA, M.Phil, Ph.D (Ling.)
Lexicographers
  o S. Veera Alagiri, M.A. M.Phil, Ph.D (Ling)
  o Jyothi Ratnam, M.A. (Hindi)

Telugu WordNet
Commencement Date: 2 July 2012
Principal Investigator: Dr. S. Arulmozi
Co-PI: Dr. M.C. Kesava Murty
Senior Linguist
  ▫ Dr. S. Chandra Kiran, M.A. M.Phil (Tel.) Ph.D (Comp.Lit.)
Computer Scientist
  ▫ T. Swathi, MCA
Lexicographers
  ▫ S. Sravanti, M.A. (Telugu)
  ▫ K. Sukanya, M.A. (Telugu)
  ▫ K. Sampoorna, M.A. (Telugu)
  ▫ N. Silparani, M.A. (Telugu)

Kannada WordNet
Commencement Date: 23 July 2012
Principal Investigator: Prof. C.S. Ramachandra
Co-PI: Prof. G. Hemanthakumar
Senior Linguist
  o Dr. B.P. Hemananda, M.A. Ph.D (Ling.)
Lexicographers
  o Chaya Devi, M.A. Linguistics
  o R M Ramya, M.A. Kannada

Status of synset creation
[Table: synset counts per language (Kannada, Malayalam, Tamil, Telugu) for the Universal and Pan-Indian categories, with columns Nouns, Verbs, Adjectives, Adverbs and Total Synsets; the values are missing from the transcript.]

Total Synsets Developed
[Table: synsets developed per language (Kannada, Malayalam, Tamil, Telugu, Total), with columns Noun, Verb, Adjective, Adverb and Total; the values are missing from the transcript.]
Includes Pan-Indian, Universal, Remaining Synsets

Status on Tasks
Synset Creation
  o Pan-Indian, Universal – completed
  o Nouns – 40% completed
  o Verbs – 70% completed
  o Adjectives – completed
  o Adverbs – 70% completed
Language & Culture Specific synsets – initiated
Named Entity – to start
Web tool – Telugu is completed, others are in line.

Manpower Trained
Manpower                     Number
Consortium Leader            1
Co-Consortium Leader         1
Principal Investigator       5
Co-Principal Investigator    2
Project Manager              1
Senior Linguist              5
Lexicographer                12
Computer Scientist           5
Total                        32

Equipment Purchased
Equipment    Number
Desktop      10
Laptop       11
Scanner      1
Printer      3
Hard Disk    1
Total        26

Financial Details

Institute-wise Project Budget

Head-wise Fund Distribution
[Table: amount per head; only the Travel amount survives in the transcript.]
Head                      Amount
Capital Equipment
Consumable Stores
Manpower
Travel                    12.00
Workshop and Training
Contingencies
Overheads (15%)
Total

Amount Received & Expenditure (up to 28 Feb 2013)
[Table: Sr. No., Name of Institute, Amount Received, Interest, Expenditure and Balance for IIT Bombay; DU, Kuppam; TU, Thanjavur; UoM, Mysore; AU, Coimbatore; and Total. The values are missing from the transcript.]
Project commenced after 5 months of administrative approval

Man-power Details

Papers Published
  ▫ 'Tamil WordNet', Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S. Rajendran)
  ▫ 'Building a WordNet for Dravidian Languages', Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S. Rajendran, S. Gopakumar, V. Dhanalakshmi)
  ▫ 'Representation of Kinship in WordNet', Proceedings of the 9th International Tamil Internet Conference, Coimbatore, June 2010 (S. Arulmozi)
  ▫ 'Polysemy in Tamil and other Indian Languages', Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S. Arulmozi & Panchanan Mohanty)
  ▫ 'Telugu WordNet', Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S. Arulmozi)
  ▫ 'Augmenting IndoWordNet with Context', Proceedings of the ICON 2010 (S. Rajendran & S. Arulmozi)

Workshops Conducted
First Dravidian WordNet Workshop
  o March 2012
  o Amrita Vishwa Vidyapeetham
Second Dravidian WordNet Workshop
  o 5-6 October 2012
  o Dravidian University

Action Plan
  ▫ Hosting Web version
  ▫ Completion of synset creation
  ▫ Internal validation of synsets

Thank you.