CONTENTS Abstract Motivation Literature Survey Existing System

1 CONTENTS
Abstract
Motivation
Literature Survey
Existing System
Proposed System Framework
Modules Description
Comparative Analysis
Experimental Results
Conclusion
References

2 ABSTRACT Machine translation is one of the major areas of NLP. When translating from English to Tamil, prepositions in English sentences must be rendered as postpositions in Tamil to produce meaningful sentences. This project focuses on eliminating prepositional phrase attachment errors and orthographic errors.

3 MOTIVATION Machine translation quality has improved substantially in recent years. Prepositions play an important role in producing a meaningful translation in any language, and prepositional phrase errors remain a major issue. The motivation of this project is to improve English-Tamil translation quality by using semantic rules to correct prepositional errors.

4 LITERATURE SURVEY

Word Alignment Problem
S.No | Author & Year | Approach
1 | R. Harshawardhan et al., IJCSE, 2011 | Linear Programming
2 | S. Vetrivel and Diana Baby, ICN, 2010 | HMM-Viterbi Algorithm

Sentence Simplification Problem
S.No | Author & Year | Approach
1 | R. Harshawardhan et al., IJCA, 2011 | Concept Labeling
2 | Thiruumeni P G et al., IJCA, 2011 | Idioms and Phrasal Verbs
3 | C. Poornima et al., IJCA, 2011 | Rule Based

5 Contd…

Morphological Analyzer and Generator
S.No | Author & Year | Approach
1 | M. Selvam and A. M. Natarajan, IJCSE, 2009 | Rule Based
2 | V. Dhanalakshmi and S. Rajendran, IJCA, 2010 | SVM Based
3 | Anand Kumar M et al., IJCSE, 2010 | Sequence Labeling
4 | Antony P. J. and K. P. Soman, IJCSET, 2012 | Suffix Stripping

POS Tagging
S.No | Author & Year | Approach
1 | D. Chandrakanth, IJCE, 2012 | SVM Based
2 | Selvam M et al., IJCPL, 2008 | Phrase Structure Tree Bank
3 | Adam R. Teichert et al., EMNLP, 2010 | HMM Based

6 EXISTING SYSTEM

7 PROPOSED SYSTEM FRAMEWORK

8 POS TAGGING

9 WORD BY WORD TRANSLATION

10 WORD BY WORD TRANSLATION

11 MORPHOLOGICAL ANALYSIS

12 RULES OF PREPOSITIONAL PHRASE ATTACHMENT
Rules for the preposition “of”:
1. <NN><IN><DT> or <NN><IN><NN> = postposition is “udaiya/in”.
2. <NN><IN><JJ> = postposition is “kkaana”.
3. <RB><IN><NNP> = postposition is “il”.
4. <VBN><IN><NN> = postposition is “aal”.
Rules for the preposition “by”:
<POSP1><IN><POSP2> = postposition is “aal”.
Rules for the preposition “on”:
<POSP1><IN><POSP2> = postposition is “mele/il”.
Rules for the preposition “in”:
<POSP1><IN><POSP2> = postposition is “il”.
Rules for the preposition “to”:
<POSP1><IN><POSP2> = postposition is “kku”.
Rules for the preposition “from”:
<POSP1><IN><POSP2> = postposition is “irunthu”.
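
The rules on this slide can be read as a lookup from a preposition and the POS tags on either side of it to a Tamil postposition. The following is only a minimal sketch of that reading, not the project's implementation: the names `OF_RULES`, `CONTEXT_FREE_RULES` and `postposition_for` are hypothetical, the postpositions are kept in the romanized form used on the slide, and it assumes that <POSP1> and <POSP2> stand for any tag pair.

```python
from typing import Optional

# Rules for "of": (tag before the preposition, tag after it) -> postposition.
# Values are the romanized forms given on the slide.
OF_RULES = {
    ("NN", "DT"): "udaiya/in",
    ("NN", "NN"): "udaiya/in",
    ("NN", "JJ"): "kkaana",
    ("RB", "NNP"): "il",
    ("VBN", "NN"): "aal",
}

# Prepositions whose rule is written with <POSP1>/<POSP2> on the slide.
# Assumption: those placeholders match any tags, so the context is ignored.
CONTEXT_FREE_RULES = {
    "by": "aal",
    "on": "mele/il",
    "in": "il",
    "to": "kku",
    "from": "irunthu",
}


def postposition_for(prep: str, prev_tag: str, next_tag: str) -> Optional[str]:
    """Return the postposition for a preposition in context, or None if no rule applies."""
    prep = prep.lower()
    if prep == "of":
        return OF_RULES.get((prev_tag, next_tag))
    return CONTEXT_FREE_RULES.get(prep)


print(postposition_for("of", "NN", "JJ"))    # rule 2 for "of"
print(postposition_for("to", "VBD", "NN"))   # context-free rule for "to"
```

Returning `None` for an unmatched context leaves the choice of a fallback to the caller; the slides do not say what the system does in that case.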

13 PREPOSITIONAL PHRASE ATTACHMENT
Pattern: NN IN NN → உடைய/இன்
Example: A page of the book – புக்கினுடைய பக்கம்

14 PREPOSITIONAL PHRASE ATTACHMENT
Pattern: NN IN JJ → க்கான
Example: Cotton is a crop of subtropical climate – பருத்தி ஒரு மித வெப்ப மண்டல காலநிலைக்கான பயிராகும்

15 PREPOSITIONAL PHRASE ATTACHMENT
Pattern: RB IN NNP → இல்
Example: He lives south of London – அவர் தெற்கு லண்டனில் வசிக்கிறார்

16 PREPOSITIONAL PHRASE ATTACHMENT
Pattern: VBN IN NN → ஆல்
Example: Most tables are made of the wood – பெரும்பாலான மேஜைகள் மரத்தால் செய்யப்பட்டது

17 ORTHOGRAPHICAL RULES Rule 1: Rule 2: Rule 3:

18 WORDS REORDERING
English: He went to shop
Word by word: அவன் சென்றான் கடைக்கு
Reordered: அவன் கடைக்கு சென்றான்
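
The reordering step moves the verb to the end of the clause, since Tamil is verb-final while English places the verb before its object. The following is a deliberately naive sketch of that idea, not the system's algorithm: the function name `reorder_verb_final` is hypothetical, and it assumes a single clause whose translated words still carry the Penn Treebank-style tags of the English source.

```python
from typing import List, Tuple


def reorder_verb_final(tagged: List[Tuple[str, str]]) -> List[str]:
    """Move every verb (tag starting with 'VB') to the end, keeping relative order otherwise."""
    non_verbs = [word for word, tag in tagged if not tag.startswith("VB")]
    verbs = [word for word, tag in tagged if tag.startswith("VB")]
    return non_verbs + verbs


# Word-by-word output for "He went to shop", paired with the source tags.
word_by_word = [("அவன்", "PRP"), ("சென்றான்", "VBD"), ("கடைக்கு", "NN")]
print(" ".join(reorder_verb_final(word_by_word)))
```

A real system would need clause boundaries and handling for auxiliaries and modifiers; this sketch covers only the single-clause case shown on the slide.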

19 ENGLISH-TAMIL TRANSLATION

20 COMPARATIVE STUDY

21 EXPERIMENTAL RESULTS
System | Total No. of Sentences | Total No. of Words | Total No. of Translated Words | No. of Correct Sentences | No. of Correct Words | P | R | F
Proposed System | 200 | 1020 | 970 | 185 | 940 | 92% | 97% | 94%
TDIL Translate | | | | 120 | 610 | 60% | 63% | 61%
Google Translate | | | | 160 | 820 | 80% | 85% | 82%
P: Precision, R: Recall, F: F-Measure
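
The slide does not say how P, R and F were computed. The figures reported for the Proposed System are consistent with taking precision as correct words over total words, recall as correct words over translated words, and F as their harmonic mean. The sketch below works under that assumption; the function name `precision_recall_f` is hypothetical.

```python
from typing import Tuple


def precision_recall_f(correct_words: int, total_words: int,
                       translated_words: int) -> Tuple[float, float, float]:
    """Compute (P, R, F) under the assumed definitions described above."""
    precision = correct_words / total_words        # assumption: correct / total words
    recall = correct_words / translated_words      # assumption: correct / translated words
    f_measure = 2 * precision * recall / (precision + recall)  # harmonic mean
    return precision, recall, f_measure


# Proposed System row: 940 correct words, 1020 total words, 970 translated words.
p, r, f = precision_recall_f(940, 1020, 970)
print(f"P={p:.0%} R={r:.0%} F={f:.0%}")
```

If the TDIL Translate and Google Translate rows share the same word totals, the same function applied to their correct-word counts (610 and 820) also reproduces the percentages reported for them.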

22 CONCLUSION The proposed system shows a significant improvement over the existing systems. This work focuses on identifying the exact meaning of a preposition with respect to its context and position for English-Tamil translation. The proposed translation system achieves a precision of 92%, a recall of 97% and an F-measure of 94%.

23 REFERENCES
1. R. Harshawardhan, Mridula Sara Augustine and Dr. K. P. Soman (2011), “A simplified approach to word alignment algorithm for English-Tamil translation”, IJCSE, Vol. 2, No. 1, Pages:
2. S. Vetrivel and Diana Baby (2010), “English to Tamil statistical machine translation and alignment using HMM”, Proceedings of the 12th International Conference on Networking, VLSI and Signal Processing, Pages:
3. R. Harshawardhan, Mridula Sara Augustine and Dr. K. P. Soman (2011), “Phrase based English-Tamil translation system by concept labeling using translation memory”, IJCA, Vol. 20, No. 3, Pages: 1-6.
4. Thiruumeni P G, Anand Kumar M, Dhanalakshmi V and Soman K P (2011), “An approach to handle idioms and phrasal verbs in English-Tamil machine translation system”, IJCA, Vol. 26, No. 10, Pages: 36-41.
5. Poornima C, Dhanalakshmi V, Anand Kumar M and Soman K P (2011), “Rule based sentence simplification for English to Tamil Machine Translation system”, IJCA, Vol. 25, No. 8, Pages: 38-42.
6. M. Selvam and A. M. Natarajan (2009), “Improvement of Rule Based Morphological Analysis and POS Tagging in Tamil Language via Projection and Induction Techniques”, IJCSE, Vol. 3, Pages:
7. Dhanalakshmi and Rajendran (2010), “Natural Language processing tools for Tamil grammar learning and teaching”, IJCA, Vol. 8, No. 14, Pages: 26-30.
8. Lakshmana Pandian S and Kumanan Kadhirvelu (2012), “Machine translation from English to Tamil using Hybrid Technique”, IJCA, Vol. 46, No. 16, Pages: 36-42.
9. Anand Kumar M, Dhanalakshmi V, Soman K P and Rajendran (2010), “A Sequence labeling approach to morphological analyzer for Tamil language”, IJCSE, Vol. 2, No. 6, Pages:
10. Anand Kumar M, Dhanalakshmi V, Rekha R U, Soman K P and Rajendran (2010), “A Novel data driven algorithm for Tamil morphological generator”, IJCA, Vol. 6, No. 12, Pages: 52-56.

24 CONTD…
11. Antony P J and K P Soman (2012), “Computational Morphology and Natural language parsing for Indian languages: A literature Survey”, IJCSET, Vol. 3, No. 4, Pages:
12. D. Chandrakanth, M. Anand Kumar and S. Gunasekaran (2012), “Parts-of-Speech tagging for Tamil language”, IJCE, Vol. 6, No. 6, Pages: 88-93.
13. Dinesh Kumar and Gurpreet Singh Josan (2010), “Part of speech Taggers for morphologically rich Indian languages: A survey”, IJCA, Vol. 6, No. 5, Pages: 1-9.
14. Selvam M, Natarajan A M and Thangarajan R (2008), “Structural parsing of Natural Language text in Tamil using phrase structure Hybrid Language Model”, IJCPL, World Academy of Science, Engineering and Technology, Vol. 22, No. 3, Pages:
15. Adam R. Teichert and Hal Daume III (2010), “Unsupervised Part of Speech Tagging without a Lexicon”, Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, Pages: 1-6.
16. Antony P J and Soman K P (2011), “Parts of Speech tagging for Indian languages: A Literature Survey”, IJCA, Vol. 34, No. 8, Pages: 22-29.
17. S. Saraswathi, P. Kanivadhana, M. Anusiya and S. Sathiya (2011), “Bilingual Translation System”, IJCSE, Vol. 3, No. 3, Pages:
18. Matt Post, Chris Callison-Burch and Miles Osborne (2012), “Constructing parallel corpora for six Indian languages via crowd sourcing”, Proceedings of the 7th Workshop on Statistical Machine Translation, Pages:
19. Meera Subhash, Wilscy M and S A Shanavas (2012), “A Rule based approach for Root word identification in Malayalam language”, IJCSIT, Vol. 4, No. 3, Pages:
20. Pushpak Bhattacharyya (2012), “Natural Language processing: A perspective from computation in presence of Ambiguity, Resource constraint and Multilinguality”, CSI Journal of Computing, Vol. 1, No. 2, Pages: 1-13.

25 CONTD…
21. Kuang-hua Chen and Hsin-Hsi Chen (1996), “A Rule based and Corpus-Oriented approach to prepositional phrase attachment”, Proceedings of the 16th Conference on Computational Linguistics, Vol. 1, Pages:
22. Vincent Van Asch and Walter Daelemans (2009), “Prepositional phrase attachment in shallow parsing”, Proceedings of the 7th International Conference on Recent Advances in Natural Language Processing, Pages:
23. Rajat Kumar Mohanty, Ashish Francis Almeida and Pushpak Bhattacharyya (2005), “Prepositional Phrase attachment and Interlingua”, Research on Computing Science, Pages:
24. Sudip Kumar Naskar and Sivaji Bandyopadhyay (2006), “Handling of prepositions in English to Bengali Machine translation”, Proceedings of the Third ACL-SIGSEM Workshop on Prepositions, Association for Computational Linguistics, Pages: 89-94.
25. I. Dan Melamed, Ryan Green and Joseph P. Turian (2006), “Precision and Recall of Machine Translation”, 03 Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, Vol. 2, Pages:

26 Thank You !

