MALAGASY The indigenous language of Madagascar John Cadigan & Martin Horn.

Slides:



Advertisements
Similar presentations
September 13 th, Language of the ancient Latins and Romans. Spread to Europe together with the Roman Empire; foundation of “Romance” languages (i.e.
Advertisements

Syntax Lecture 10: Auxiliaries. Types of auxiliary verb Modal auxiliaries belong to the category of inflection – They are in complementary distribution.
Linguistics, Morphology, Syntax, Semantics. Definitions And Terminology.
Statistical NLP: Lecture 3
A Syntactic Translation Memory Vincent Vandeghinste Centre for Computational Linguistics K.U.Leuven
Chapter 4 Basics of English Grammar
1 A Hidden Markov Model- Based POS Tagger for Arabic ICS 482 Presentation A Hidden Markov Model- Based POS Tagger for Arabic By Saleh Yousef Al-Hudail.
1 Words and the Lexicon September 10th 2009 Lecture #3.
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
Elicitation Corpus April 12, Agenda Tagging with feature vectors or feature structures Combinatorics Extensions.
The Loanword Typology Project Measuring the Borrowability of Word Meanings Uri Tadmor and Martin Haspelmath Max Planck Institute for Evolutionary Anthropology.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 2.
Young Children Learn a Native English Anat Ninio The Hebrew University, Jerusalem 2010 Conference of Human Development, Fordham University, New York Background:
C SC 620 Advanced Topics in Natural Language Processing 3/9 Lecture 14.
Is It Plausible that Middle English is a Creole?
Subject Adjective Clauses & Adjective Phrases
Chapter 2 Words and word classes.
Top Ten Tips for teachers preparing students for the academic version of IELTS Sam McCarter Macmillan Online Conference 2013.
323 Morphology The Structure of Words 1.1 What is Morphology? Morphology is the internal structure of words. V: walk, walk+s, walk+ed, walk+ing N: dog,
LDMT MURI Data Collection and Linguistic Annotations November 4, 2011 Jason Baldridge, UT Austin Ulf Hermjakob, USC/ISI.
The Birth of Modern English Renaissance Language Renaissance Language.
Nan Connolly Stephanie Lancaster Emily McLoughlin Andrew Shaheen MORPHOLOGY PRESENTATION.
GRAMMAR APPROACH By: Katherine Marzán Concepción EDUC 413 Prof. Evelyn Lugo.
Grammar Skills Workshop
Chapter 4 Basics of English Grammar Business Communication Copyright 2010 South-Western Cengage Learning.
English-Persian SMT Reza Saeedi 1 WTLAB Wednesday, May 25, 2011.
DICTIONARY A dictionary is a reference book, containing an alphabetical list of words with information about them like pronunciation, functions and.
What are little verbs made of? What are little verbs made of? Deriving the English verbal system from underlying elements Jim Baker Trinity Hall McMenemy.
A Remedial English Grammar. CHAPTERS ARTICLES AGREEMENT OF VERB AND SUBJECT CONCORD OF NOUNS, PRONOUNS AND POSSESSIVE ADJECTIVES CONFUSION OF ADJECTIVES.
Phonemes A phoneme is the smallest phonetic unit in a language that is capable of conveying a distinction in meaning. These units are identified within.
English Review for Final These are the chapters to review. In Textbook: Chapter 1 Nouns Chapter 2 Pronouns Chapter 3 Adjectives Chapter 4 Verbs Chapter.
Language Learning Targets based on CLIMB standards.
English Review for Final These are the chapters to review. In Textbook: Chapter 1 Nouns Chapter 2 Pronouns Chapter 3 Adjectives Chapter 4 Verbs Chapter.
Final Exam Review Section V-XI on the Study Guide.
Nahuatl Language Project by Carmen Tirado-Paredes.
Morphology An Introduction to the Structure of Words Lori Levin and Christian Monson Grammars and Lexicons Fall Term, 2004.
Metalanguage Revision English language year
What you have learned and how you can use it : Grammars and Lexicons Parts I-III.
Grammars Grammars can get quite complex, but are essential. Syntax: the form of the text that is valid Semantics: the meaning of the form – Sometimes semantics.
Technical Communication A Practical Approach Chapter 17: Style in Technical Writing William Sanborn Pfeiffer Kaye Adkins.
WHAT IS LANGUAGE?. INTRODUCTION In order to interact,human beings have developed a language which distinguishes them from the rest of the animal world.
LANGUAGE ARTS LA WORKS UNIT 3 REVIEW STUDY GUIDE.
Auckland 2012Kilgarriff: NLP and Corpus Processing1 The contribution of NLP: corpus processing.
English Review for Final These are the chapters to review. In Textbook: Chapter 9 Nouns Chapter 10 Pronouns Chapter 11 Adjectives Chapter 12 Verbs Chapter.
LDMT MURI Data Collection and Linguistic Annotation November 2, 2012 Jason Baldridge, UT Austin Lori Levin, CMU.
Morphological typology
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Spring 2006-Lecture 2.
Leonid Iomdin Institute for Information Transmission Problems, Russian Academy of Sciences
Subject-Verb Agreement & Parallel Structure
Basic Syntactic Structures of English CSCI-GA.2590 – Lecture 2B Ralph Grishman NYU.
SYNTACTIC DEVELOPMENT ECSE 500 CLASS SESSION 6. REVIEW PHONOLOGY SEMANTICS MORPHOLOGY TODAY - SYNTAX.
Jeopardy Syntax Morphology Heading3Heading4 Heading5 Q $600 Q $700 Q $800 Q $900 Q $1000 Q $600 Q $700 Q $800 Q $900 Q $1000 Final Jeopardy.
PROCEDURES FOR THE STRUCTURE QUESTIONS (Paper TOEFL Test and Computer TOEFL Test) First, study the sentence. Your purpose is to determine what is needed.
Lecture 1 Sentences Verbs.
Or What You Need to Know to Survive Latin I
Statistical NLP: Lecture 3
Revision Outcome 1, Unit 1 The Nature and Functions of Language
Grammar Workshop Thursday 9th June.
Word Classes and Affixes
A Review of Words and Phrases
Chapter 4 Basics of English Grammar
Translation Problems.
Syntax.
GRAMMAR قواعد اللغــــــــــة الإنجليزية
Statistical n-gram David ling.
Linguistic Essentials
Chapter 4 Basics of English Grammar
Jeopardy Game Grammar Edition
Introduction to English morphology
Ms. McDaniel 6th Grade Language Arts
Presentation transcript:

MALAGASY The indigenous language of Madagascar John Cadigan & Martin Horn

PART OF THE AUSTRONESIAN FAMILY

GENUS BARITO

HISTORICAL PERSPECTIVE PAST Mainly an oral tradition until 1800’s Some use of Arabic alphabet until early 1800’s with the arrival of British missionaries; they switched to the Latin alphabet Colonial influences: French, Dutch, English “Frenchification:” nasality PRESENT DAY 18 million speakers Official language alongside French Zefaniasy Bemananjara and Suzy- Andree Ramamojisoa conclude that Malagasy is not popular in written literature

MORPHOLOGY Agglutinative (Andrzejewski 433) : Ifampandrenesana: re (heard) andrenesana (the situation in which one hears something) Ifampandrenesana (the situation in which speech is heard by two or more people) Words tend to be shortened: Miaraka Miara

MORPHOLOGY Inflection: Concatenative (like English) Verbs: 4-5 inflections No past or future tense No perfective Nouns: pronominal plurals no nominal plurals Case Not morphological

POSSESSIVE MARKING Marks possessive noun phrases in a unique way: (98) Dependent marking: English marks the dependent: owner + “‘s” owned (78) Head marking: Hungarian marks the head: owned (32) No marking: owner owned (22) Double marked: owner and owned both marked (6) Other: Malagasy demoted subject such as one you would find in a passive construction (Coene)

5 DISTANCE CONTRASTS IN DEMONSTRATIVES (127) Two-way contrast : English: this and that (88) Three-way contrast Spanish: este, eso, aquel (8) Four-way contrast (7)No distance contrast (4) Five or more: itý io iny iroa itsy near S near H close away far away

WORKS CITED Andrzejewski, Bogumil W., Stanislaw Pilaszewicz, and Witold Tyloch. Literatures in African languages: theoretical issues and sample surveys. Cambridge University Press, Coene, Martine, and Yves D’hulst. "From P to DP: The expression of possession in noun phrases." (2003). Dryer, Matthew S. & Haspelmath, Martin (eds.) The World Atlas of Language Structures Online. Leipzig: Max Planck Institute for Evolutionary Anthropology. (Available online at Accessed on ) Hammarström, Harald & Forkel, Robert & Haspelmath, Martin & Bank, Sebastian Glottolog 2.7. Jena: Max Planck Institute for the Science of Human History. (Available online at Accessed on )

SYNTAX VOS word order Other word orders allowed for emphasis Fronting of focused word Focus also marked with particle Demonstrative determiners precede and follow noun Complex voice system: Actor, object, beneficiary, instrument, etc. can be promoted to subject position (Each also has unique affix) Negation, Yes/No questions expressed with particle tsy (neg), ve (yes/no) before verb Definite marker (ny) precedes noun Adjectives, numerals, quantifiers, relative clauses follow noun Adverbs, some quantifiers precede verb Mamaky boky ny mpianatra “reads book the student” ity boky ity “this book this”

MT PAPERS One paper in the MT archives: Pivot-based Triangulation for Low-Resource Languages (Dholakia & Sarkar 2014) Another paper from EMNLP Beyond Parallel Data: Joint Word Alignment and Decipherment Improves Machine Translation (Dou, Vaswani, & Knight 2014) Paper from University of Washington & Microsoft: Improving Dependency Parsing with Interlinear Glossed Text and Syntactic Projection (Georgi, Xia, & Lewis 2012) Other non-MT ACL papers Morphological analysis, tokenization

MT RESOURCES Google Translate language 81,219 Wikipedia articles written in Malagasy Global Voices Malagasy-English Parallel Corpus 3,000 documents, 100,000 sentences, 2M English words Wortschatz web text corpus 90,791 sentences; 110,517 types; 1,479,752 tokens Unitex Verb dictionary (1,801 simple verbs) Corpus of news articles (2009)

CHALLENGES FOR MT Little prepared monolingual data Even less parallel corpus data Agglutinative = sparse data Variability of word order Complexity due to deixis (context-dependent words)

REFERENCES About World Languages: Qing Dou, Ashish Vaswani, and Kevin Knight Beyond Parallel Data: Joint Word Alignment and Decipherment Improves Machine Translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. Rohit Dholakia and Anoop Sarkar Pivot-based Triangulation for Low-Resource Languages.In Proceedings of AMTA MT Researchers. Ryan Georgi, Fei Xia, and William D. Lewis Improving Dependency Parsing with Interlinear Glossed Text and Syntactic Projection. In Proceedings of COLING Association for Computational Linguistics Global Voices Corpus: Dryer, Matthew S. & Haspelmath, Martin (eds.) The World Atlas of Language Structures Online. Leipzig: Max Planck Institute for Evolutionary Anthropology. Deutscher Wortschatz Corpus: Malagasy (mlg_web_2012). Leipzig University. Unitex/GramLab: