Morphology For Marathi POS-Tagger Veena Dixit 11/ 10 /2005.

Slides:



Advertisements
Similar presentations
Morphology.
Advertisements

Greenberg 1963 Some Universals of Grammar with Particular Reference to the Order of Meaningful Elements.
Linguistics, Morphology, Syntax, Semantics. Definitions And Terminology.
What is Morphology? The study of words and word parts
Statistical NLP: Lecture 3
Verb, Adverb, Preposition, Conjunction, Interjection
Used in place of a noun pronoun.
Morphology Chapter 7 Prepared by Alaa Al Mohammadi.
1 Words and the Lexicon September 10th 2009 Lecture #3.
Language is very difficult to put into words. -- Voltaire What do we mean by “language”? A system used to convey meaning made up of arbitrary elements.
Stemming, tagging and chunking Text analysis short of parsing.
Grammatical frameworks Inflectional morphology. Grammar In the Middle Ages, grammatica […] chiefly meant the knowledge or study of Latin, and were hence.
Paradigm based Morphological Analyzers Dr. Radhika Mamidi.
18 / 10 /2005Veena Dixit1 Marathi Verb Morphology and POS Tagger Veena Dixit.
Phonemes A phoneme is the smallest phonetic unit in a language that is capable of conveying a distinction in meaning. These units are identified within.
Quiz 1 Answers What is a morpheme? It is the smallest unit of meaning that can be free or must be bound. What is the difference between a free and bound.
08 / 11/ 2005VEENA DIXIT1 Marathi Noun Morphology for POS Tagger.
Dr. Monira Al-Mohizea MORPHOLOGY & SYNTAX WEEK 11.
Parts of Speech Project Language Arts
Morphology A Closer Look at Words By: Shaswar Kamal Mahmud.
_____________________ Definition Part of Speech (circle one) Picture Antonym (Opposite) Vocab Word Noun Pronoun Adjective Adverb Conjunction Verb Interjection.
Linguistic Essentials
Linguistics The eleventh week. Chapter 4 Syntax  4.1 Introduction  4.2 Word Classes.
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
Natural Language Processing
Deep structure (semantic) Structure of language Surface structure (grammatical, lexical, phonological) Semantic units have all meaning components such.
Parts of Speech Major source: Wikipedia. Adjectives An adjective is a word that modifies a noun or a pronoun, usually by describing it or making its meaning.
LANGUAGE ARTS LA WORKS UNIT 3 REVIEW STUDY GUIDE.
Morphological typology
Natural Language Processing Chapter 2 : Morphology.
Parts of Speech Review. A Noun is a person, place, thing, or idea.
GoBack definitions Level 1 Parts of Speech GoBack is a memorization game; the teacher asks students definitions, and when someone misses one, you go back.
MORPHOLOGY definition; variability among languages.
III. MORPHOLOGY. III. Morphology 1. Morphology The study of the internal structure of words and the rules by which words are formed. 1.1 Open classes.
◦ Process of describing the structure of phrases and sentences Chapter 8 - Phrases and sentences: grammar1.
The Building Blocks of Good Writing
1 LIN 1310B Introduction to Linguistics Prof: Nikolay Slavkov TA: Qinghua Tang CLASS 11, Feb 9, 2007.
A knowledge rich morph analyzer for Marathi derived forms Ashwini Vaidya IIIT Hyderabad.
MORPHOLOGY. PART 1: INTRODUCTION Parts of speech 1. What is a part of speech?part of speech 1. Traditional grammar classifies words based on eight parts.
Parts of Speech By: Miaya Nischelle Sample. NOUN A noun is a person place or thing.
Chapter 3 Word Formation I This chapter aims to analyze the morphological structures of words and gain a working knowledge of the different word forming.
Writing 2 ENG 221 Norah AlFayez. Lecture Contents Revision of Writing 1. Introduction to basic grammar. Parts of speech. Parts of sentences. Subordinate.
Morphology 1 : the Morpheme
Parts of Speech Review.
INTRODUCTION ADE SUDIRMAN, S.Pd ENGLISH DEPARTMENT MATHLA’UL ANWAR UNIVERSITY.
Grammar for Parents 20th October 2016 Welcome! Questions are welcome…
Introduction to Linguistics
Parts of Speech Review.
Lecture 7 Summary Survey of English morphology
Descriptive Grammar – 2S, 2016 Mrs. Belén Berríos Droguett
Morphology Morphology Morphology Dr. Amal AlSaikhan Morphology.
Appendix A: Basic Grammar and Punctuation Reference
عمادة التعلم الإلكتروني والتعليم عن بعد
Statistical NLP: Lecture 3
Revision Outcome 1, Unit 1 The Nature and Functions of Language
Chapter 3 Morphology Without grammar, little can be conveyed. Without vocabulary, nothing can be conveyed. (David Wilkins ,1972) Morphology refers to.
ENGLISH MORPHOLOGY Week 1.
Lecturer Ms. Abrar Mujaddidi LANE 321
Syntax.
A Systematic Framework for Language Analysis
FIRST SEMESTER GRAMMAR
English parts of speech
Linguistic Essentials
Latin: The Written Language
Chapter Six CIED 4013 Dr. Bowles
Introduction to English morphology
Introduction to Linguistics
Word phoneme SENTENCE PHRASE SUFFIX prefix PHRASE CLAUSE UTTERANCE PART OF SPEECH MICRO-LINGUISTICS Macro-linguistics Language dictionary LEXICON allophone.
A Link Grammar for an Agglutinative Language
Parts of Speech.
Presentation transcript:

Morphology For Marathi POS-Tagger Veena Dixit 11/ 10 /2005

Contents Word Morphology Marathi Morphology - definition of the task and difficulties thereto. Marathi Morphology - solutions to the challenges Different word classes Postpositions Particles Interjections Conjunctions Pronouns Adjectives Adverbs Verbs Nouns

Words are the orthographical strings separated by spaces and some punctuation marks. To syntax, words make sentences and to morphology, word has internal structure and has different inflectional forms. Inflectional forms of a root word form a paradigm based on a principle. Root word is the form which is stored in lexicons / dictionaries.

What is Morphology? Morphology is the study of forms of words in the language, especially the different forms used in declensions, conjugations, and word building. It deals with the morphemes. Morpheme is a term which refers to the smallest component of a word that (a) seems to contribute some sort of meaning, or a grammatical function to the word to which it belongs, and (b) cannot be decomposed into smaller morphemes.

Marathi Morphology Definition of the task and difficulties thereto Morphological analysis of Marathi plays significant role in natural language processing because Marathi, a pan Indian Language, is rich in morphology. Marathi, being the language of the area situated centrally, gets influenced by almost all language groups of India. This makes the Marathi morphology more complicated.

Marathi Morphology solutions to the challenges Morphological analysis is done category wise. Parameters for changes in the root word are identified. Rules are constructed in the tabular form to facilitate computation.

Marathi Word Classes Nouns Pronouns Adjectives Verbs Adverbs Postpositions Conjunctions Interjections Particles Punctuation Mark

Postpositions Postposition is the morpheme that follows the words and shows the relation between the word that is followed and other word in the sentences. Case markers and shabdayogi avyaya are classified as postpositions in Marathi because they show same behavior. (ref. ‘Classification of Words’, Veena Dixit, proceedings of 26 th AICL, Shilong, 2004 )

Postpositions (continued) In Marathi, postpositions are attached to all classes of words except interjection. examplesexamples When a postposition is attached to a stem it produces mainly adverb, but also, adjective and conjunction. Postpositions are handled along with other word classes. 5 subgroups of postpositions are identified on the basis of what is the possible order of their attachment and to which group of words they can be attached.

Particles Strings like ही – hi_also, च – cha_only, सुद्धा – suddha_also, are –sometimes attached to other words (e.g.. खाली – khaali _under – खालीसुद्धा - khaalisuddhaa_under also / झाड - jhaaDa _ tree - झाडसुद्धा - jhaaDasuddhaa _ tree also ) –or sometimes they are written separately (e.g.. झाडाखाली - jhaaDaakhaali_ under the tree – झाडाखाली सुद्धा - jhaaDaakhaalisuddhaa_ under the tree also). When such words are attached to other words, the word to which it is attached, does not get inflected.

Interjections Interjections are identified from the lexicon and stored to produce the tag. Conjunctions Conjunctions are identified from the lexicon and stored to produce the tag. Morphology also plays a role in the case of conjunctions.

Conjunctions (continued) When some of Marathi postpositions are attached to a pair of demonstrative pronouns, they produce a pair of conjunctions in some instances. जो – ज्यापासून (jo – jyaapaasuna --- which – from which) तो – त्यापासून (to – tyaapaasuna --- that – from that) ज्यापासून काल सुरुवात केली, त्यापासून आज नक्कीच सुरुवात करायला नको. – jyaapaasuna kaala suruvaata keli, tyaapaasun aaja nakkicha suruvaata karaayalaa nako_One should not start from the (same point) from which it was started yesterday.

Pronouns Number of inflected forms of a pronoun and the rules describing such inflection are almost equal in number. Number of pronouns and their respective inflected forms are finite and less when compared to verbs and nouns. All inflected forms of the pronouns will be stored to produce the tag for pronoun. Derivational morphology of pronoun is handled with rules.

Pronouns (continued) Inflectional forms of pronouns act either as adjectives ( माझा – maajhaa_my) or as adverbs ( मला – malaa_to me ) or as conjunctions ( जो – ज्यापासून (jo – jyaapaasuna --- which – from which) तो – त्यापासून (to – tyaapaasuna --- that – from that)).

Pronouns (continued) All together 29 pronouns have 526 inflectional forms, which are either words or stems. 21 paradigms are identified generating several rules.

Adjectives Adjectives are mainly, inflectional and non - inflectional. Adjectives inflect for gender, number and attachment of postposition to the noun they modify. Adjectives in Marathi agree in gender and number with the nouns they modify.

Adjectives (continued) All inflectional adjectives belong to one paradigm, which corresponds to several rules for generating inflectional and derivational forms from an adjective. Most of ‘aa’ ending adjectives agree with masculine nouns and further get inflected according to the gender and number of the noun they modify. ( मोकळा / मोकळी / मोकळे / मोकळ्या _mokaLaa / mokaLi / mokaLe / mokaLyaa_empty) There are some exceptions to this rule, such as, ( जादा - jaada_extra, नाना – naanaa_different, वाया vaayaa_wasted).

Adverbs Adverbs are mainly, inflectional and non - inflectional. Adverbs inflect for attachment of postpositions. खाली – ( khaali_under –-- खालपासून – khaalapaasuna _from the underneath )

Verbs and Nouns will be discussed in next sessions. Thank you. Veena Dixit 11/ 10 /2005