Khaled Shaalan Doaa Samy Marwa Magdy

Slides:



Advertisements
Similar presentations
Web Passive Voice Tutor: an Intelligent Computer Assisted Language Learning System over the WWW Maria Virvou & Victoria Tsiriga Department of Informatics,
Advertisements

Active and Passive Voice
Shariah Program Online - Lesson 2
Towards Adaptive Web-Based Learning Systems Katerina Georgouli, MSc, PhD Associate Professor T.E.I. of Athens Dept. of Informatics Tempus.
Morphology.
Are You Smarter Than a 5 th Grader? 1,000,000 5th Grade Topic 1 5th Grade Topic 2 4th Grade Topic 3 4th Grade Topic 4 3rd Grade Topic 5 3rd Grade Topic.
Tel More Telugu Morphological Generator
Intelligent Information Retrieval CS 336 –Lecture 3: Text Operations Xiaoyan Li Spring 2006.
ARLINGTON BAPTIST COLLEGE HEBREW STUDY TOOLS LNG 2304.
Morphology Chapter 7 Prepared by Alaa Al Mohammadi.
Brief introduction to morphology
Autosegmental Phonology
1 A Hidden Markov Model- Based POS Tagger for Arabic ICS 482 Presentation A Hidden Markov Model- Based POS Tagger for Arabic By Saleh Yousef Al-Hudail.
Chapter 6 Identifying Grammatical Morphemes Morphology Lane 333.
Grammatical frameworks Inflectional morphology. Grammar In the Middle Ages, grammatica […] chiefly meant the knowledge or study of Latin, and were hence.
Unit One: Parts of Speech
Getting started with Sanskrit grammar. Inflectional form: Root + Affix = Stem Stem + Inflectional ending = Word.
1 A Chart Parser for Analyzing Modern Standard Arabic Sentence Eman Othman Computer Science Dept., Institute of Statistical Studies and Research (ISSR),
I r r c u l c u u m 4 2 o 1 Presentation Title: Introduction Curriculum 2014.
Paradigm based Morphological Analyzers Dr. Radhika Mamidi.
Computational Methods to Vocalize Arabic Texts H. Safadi*, O. Al Dakkak** & N. Ghneim**
Computational Investigation of Palestinian Arabic Dialects
The Linguistics of Second Language Acquisition
Comparative study of various Machine Learning methods For Telugu Part of Speech tagging -By Avinesh.PVS, Sudheer, Karthik IIIT - Hyderabad.
LAS LINKS DATA ANALYSIS. Objectives 1.Analyze the 4 sub-tests in order to understand which academic skills are being tested. 2.Use sample tests to practice.
Reasons to Study Lexicography  You love words  It can help you evaluate dictionaries  It might make you more sensitive to what dictionaries have in.
Arabic Tokenization, Part-of-Speech Tagging and Morphological Disambiguation in One Fell Swoop Nizar Habash and Owen Rambow Center for Computational Learning.
Using a Lemmatizer to Support the Development and Validation of the Greek WordNet Harry Kornilakis 1, Maria Grigoriadou 1, Eleni Galiotou 1,2, Evangelos.
Introduction to CL & NLP CMSC April 1, 2003.
Morphology A Closer Look at Words By: Shaswar Kamal Mahmud.
Linguistics The ninth week. Chapter 3 Morphology  3.1 Introduction  3.2 Morphemes.
Morphology!. But puns first.  In partnership with Gabe  Have you seen Ken Burns' new documentary on the impact of yeast on agricultural societies? 
By: Jeremy Pagnotti.  Phonetic language (no silent letters)  No particular word order  Grammatical function of nouns and verbs displayed by endings.
Evaluating Python as an Introductory Programming Language A. Thomas and H.L. Liang UNISA.
The Greek Verb System: A Bird’s Eye View Chapter 2.
Shariah Program Online - Sarf 1 1 صرف الدرس الاول.
An Ambiguity-Controlled Morphological Analyzer for Modern Standard Arabic By: Mohammed A. Attia Abbas Al-Julaih Natural Language Processing ICS.
Morphological typology
Natural Language Processing Chapter 2 : Morphology.
Hybrid Method for Tagging Arabic Text Written By: Yamina Tlili-Guiassa University Badji Mokhtar Annaba, Algeria Presented By: Ahmed Bukhamsin.
Macmillan IELTS Improving students' answers in IELTS Writing Task 1 Sam McCarter.
Concord (Also called agreement) The word in a sentence must match or agree with one another.
BAMAE: Buckwalter Arabic Morphological Analyzer Enhancer Sameh Alansary Alexandria University Bibliotheca Alexandrina 4th International.
Word Study With Diverse Learners What? Why? How? 2009 IRA Regional Conference: Branson, MO Presenters: Jenifer Pastore and Brandi Clowers.
INTRODUCTION TO THE HEBREW VERB and QAL PERFECT
The Present Tense of Latin Verbs!
Latin Index Card Project You may give this first card whatever title and decoration you want.
Warm-Up Translate the following sentence into Latin. The master wanted to visit a mine and see the slaves.
Lesson 20 4 th Conjugation Verbs Third Conjugation – iō Verbs.
Learning to Generate Complex Morphology for Machine Translation Einat Minkov †, Kristina Toutanova* and Hisami Suzuki* *Microsoft Research † Carnegie Mellon.
DAARA Sarf 11 الصرف الدرس الاول. DAARA Sarf 12 علم الصرف الصرف is the science of Classical Arabic that deals with: الصرف is the science of Classical Arabic.
Lesson XXII.
CIS, Ludwig-Maximilians-Universität München Computational Morphology
Intro to Verbs and Present Active Indicative
Assessing Grammar Module 5 Activity 5.
Class XI: Introduction to Verbs Dr. Esa Autero
Assessing Grammar Module 5 Activity 5.
Verb Tense Tense denotes the time of the action indicated by a verb. The time is not always the same as that indicated by the name of the tense.
Shariah Program Online - Lesson 2
Chapter 6 Morphology.
Getting started with Sanskrit grammar
Agenda diēs Martis, a.d. xiv Kal. Oct. A.D. MMXVIII
Purpose of Study & Introduction to Sarf (Morphology)
A Systematic Framework for Language Analysis
Arabic Language Challenges
How To Answer Questions in Latin!
Agenda diēs Martis, a.d. xiv Kal. Oct. A.D. MMXVIII
Design and Implementation of a Computerized Arabic Braille Environment
Verb Synapses MarshLatin.wordpress.com.
HSP3 Grammar Review By: [Insert your name].
Presentation transcript:

Khaled Shaalan Doaa Samy Marwa Magdy Towards Resolving Morphological Ambiguity in Arabic Intelligent Language Tutoring Framework Khaled Shaalan Doaa Samy Marwa Magdy 1 1

Outlines Introduction. Arabic Morphological Ambiguity Problem The Proposed Disambiguation System Evaluation & Results Conclusions 2

Outlines Introduction Arabic Morphological Ambiguity Problem The Proposed Disambiguation System Evaluation & Results Conclusions 3

Introduction curriculum sequence Adaptive navigation Intelligent Language Tutoring System adaptive presentation A computer-based educational system that allows simulation of a human tutor. its objective is to enhance teaching and learning of foreign languages. error remediation Intelligent feedback to student solutions 4

Main Challenges Lack of resources, such as Learner corpus for Arabic language Lack of tools dealing with ill-formed input In ILTS, relaxing the constraints of the language to analyze learner’s answer results in handling more interpretations than systems designed for only well-formed input 5

Main Challenges (cont‘d) Use techniques, such as constraints relaxation Analyzes قالت (said) In well formed systems: - 3rd female sg past verb In ILTS: 3rd person female sg 1st person sg past verb 2nd person male sg past verb 2nd person female sg past verb Intelligent Language Tutoring System Erroneous Learner Answer More Interpretation 6

Outlines Introduction Arabic Morphological Ambiguity Problem The Proposed Disambiguation System Evaluation & Results Conclusions 7

Arabic Morphological Ambiguity Arabic language is one of the Semitic languages that is defined as a diacritized language. Unfortunately, diacritics are rarely used in current Arabic writing conventions. So two or more words in Arabic are homographic 8

Different Interpretations Homographic Example Different Interpretations Lemma Word يعِد (bring back) أعاد يعد يعُد (return) عاد يعِد (promise) وعد یَعُدّ (count) عد یُعِدّ (prepare) أعد 9

Factors of Arabic Ambiguity For example, the word أسد can be interpreted as أسد (lion) or أسٌدً (I-Block). For example, the deletion of the letter (و) in taking the present (imperfect) tense of the trilateral root و-ع-د /w-E-d/, it appears in written texts as يعد (promise). For example, the perfect verb suffix ت can indicate either: 1) first person singular, 2) second person singular masculine, 3) second person singular feminine, or 4) third person singular feminine. For example, the فعل /faEala/ and فعًل /faE~ala/. Main Factors Orthographic alteration operations such as deletion Some verb prefixes and suffixes can be homographic Ambiguity of Undiacritized verb Arabic patterns Prefixes and suffixes can produce a form homographic with another word class 10

Outlines Introduction Arabic Morphological Ambiguity Problem The Proposed Disambiguation System Evaluation & Results Conclusions 11

Disambiguation Module Error Classification Module The Proposed System Affix Error Representation: Added final letter قال+ ت و Tense Voice active Subject Features 1st sg Object Features ‘’ Feature Value Stem قال Prefix Suffix ت Lexical Category verb Verb Type hollow Pattern فعل perfect Stem Error Representation: Added middle letter ق ل ا Suffix ‘’ Tense Voice active Mood indicative Subject Features 1st sg Object Features Feature Value Answer أقول Root ق-و-ل Lexical Category verb Pattern فعل Verb Type hollow Prefix أ imperfect Arabic ILTS Possible Word Analyses Verb tense error Verb conjugation Vowel letters Question: Build a sentence using the following roots: ق-و-ل، ح-ق، د-و-م /q-w-l, H-q, d-w-m/ The question goal is: conjugation hollow verb in imperfect tense active voice Word Analyzer Module Disambiguation Module قالتو الحق دائما (I always said the truth) Incorrect use of perfect verb instead of imperfect Item banking Selected Word Analysis Learner Answer Train the rules till get maximum likelihood for the corpus. Feedback Message Error Type Error Classification Module Tutoring Module 12

Prioritized Conditions Disambguation Module Prioritized Conditions Affix Collection Multiple Analyses Pattern Collection Train the rules till get maximum likelihood for the corpus. Selected Analysis No Action Multiple Analyses 13

Prioritized Conditions Yes Select Passive Analysis The question goal is to test passive voice No Select Active Analysis Item banking Train the rules till get maximum likelihood for the corpus. Multiple Analyses 14

Prioritized Conditions Yes Select Imperative Verb Analysis The question goal is to test imperative tense No Select perfect or imperfect verb Analysis Item banking Train the rules till get maximum likelihood for the corpus. Multiple Analyses 15

Example If the learner writes the following sentence: تباع جدتي الارز (My-grandmother sells the-rice ) The system produces two analyses: Third person singular feminine imperfect verb in the active voice with converted middle letter Third person singular feminine imperfect verb in the passive voice Train the rules till get maximum likelihood for the corpus. 16

Prioritized Conditions Disambguation Module Prioritized Conditions Affix Collection Multiple Analyses Pattern Collection Train the rules till get maximum likelihood for the corpus. Selected Analysis No Action Multiple Analyses 17

Example If the learner writes the following sentence: محمد تورطت في جريمة قتل (Mohamed was-involved in murder crime ) The system produces four analyses: First person singular perfect verb in the active voice Second person singular masculine perfect verb in the active voice Singular perfect verb conjugation in the active voice Second person singular feminine perfect verb in the active voice Third person singular feminine perfect verb in the active voice Train the rules till get maximum likelihood for the corpus. 18

Prioritized Conditions Disambguation Module Prioritized Conditions Affix Collection Multiple Analyses Pattern Collection Train the rules till get maximum likelihood for the corpus. Selected Analysis No Action Multiple Analyses 19

Example If the learner writes the following sentence: جدي وجدتي نقلوا الي بيت جديد (my-grandfather and my-grandmother moved to a new house ) The system produces two analyses: Third person masculine plural perfect verb in the active voice following the pattern 'فعل'. Third person masculine plural perfect verb in the active voice instead of dual Third person masculine plural perfect verb in the active voice following the pattern 'فعًل'. Train the rules till get maximum likelihood for the corpus. 20

Outlines Introduction. Arabic Morphological Ambiguity Problem. The Proposed Disambiguation System. Evaluation & Results Conclusions 21

Evaluations & Results Evaluation & Results A real test set that consists of 116 real Arabic sentences is collected The system successfully solved 60% of the cases 22

Evaluation Problems Classification For example, the erroneous word أجوب: the imperfect verb أجيب />u-jiyb/ (I-answer), 2) or imperfect verb أجوب />a-juwb/ (I-explore). Problems For example, the word تناول; 1) the noun تناول /tanAwul/ (dealing with/ eating), 2) the perfect verb تناول /tanAwala/ (he/it-dealt with/ ate), or 3) the imperfect verb تناول /tu-nAwil/ (hand over/ deliver). Orthographic match between un-diacritized forms Additional- orthographic matches as a result of relaxing a constraint 23

Outlines Introduction Arabic Morphological Ambiguity Problem The Proposed Disambiguation System Evaluation & Results Conclusions 24

Conclusions The ambiguity problem presents a challenge to ILTS The preferred method in ILTS for disambiguating multiple readings should consider the likelihood of an error and the difficulty of concepts If a large tagged learner corpus exist then the ambiguity problem can be resolved by considering the likelihood of errors 25

Thank you 26