Challenges of Machine Translation

Presentation transcript:

Challenges of Machine Translation CSC 4598 Machine Translation Dr. Tom Way

Translation is hard
- Novels
- Word play, jokes, puns, hidden messages
- Concept gaps: go Greek, bei fen
- Other constraints: lyrics, dubbing, poems, …

Major challenges
- Getting the right words:
  - Choosing the correct root form
  - Getting the correct inflected form
  - Inserting “spontaneous” words
- Putting the words in the correct order:
  - Word order: SVO vs. SOV, …
  - Unique constructions
  - Divergence

Lexical choice
- Homonymy/polysemy: bank, run
- Concept gap: no corresponding concept in the other language: go Greek, go Dutch, feng shui, lame duck, …
- Coding (concept → lexeme mapping) differences:
  - More distinctions in one language: e.g., kinship vocabulary
  - Different division of conceptual space

Choosing the appropriate inflection
- Inflection: gender, number, case, tense, …
- Examples:
  - Number: Ch-Eng: all the concrete nouns: ch_book → book, books
  - Gender: Eng-Fr: all the adjectives
  - Case: Eng-Korean: all the arguments
  - Tense: Ch-Eng: all the verbs: ch_buy → buy, bought, will buy
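
The one-to-many mappings above can be sketched in a few lines of Python. This is only an illustration, not part of the original slides: the `CANDIDATES` table and the `expand` function are hypothetical names, and the table holds just the two examples given on the slide. It shows how quickly the number of candidate outputs grows when the source language leaves number and tense unmarked.

```python
from itertools import product

# Hypothetical toy lexicon: each uninflected source token maps to several
# inflected English candidates (entries copied from the slide's examples).
CANDIDATES = {
    "ch_book": ["book", "books"],
    "ch_buy": ["buy", "bought", "will buy"],
}

def expand(tokens):
    """Return every candidate English string for a sequence of source tokens."""
    # Tokens missing from the table are passed through unchanged.
    options = [CANDIDATES.get(tok, [tok]) for tok in tokens]
    return [" ".join(choice) for choice in product(*options)]

outputs = expand(["ch_buy", "ch_book"])
for line in outputs:
    print(line)
```

Two source tokens already yield six candidates (3 verb forms × 2 noun forms); a real system needs context or a language model to pick one.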

Inserting spontaneous words
- Function words:
  - Determiners: Ch-Eng: ch_book → a book, the book, the books, books
  - Prepositions: Ch-Eng: … ch_November → … in November
  - Relative pronouns: Ch-Eng: … ch_buy ch_book de ch_person → the person who bought /book/
  - Possessive pronouns: Ch-Eng: ch_he ch_raise ch_hand → He raised his hand(s)
  - Conjunctions: Eng-Ch: Although S1, S2 → ch_although S1, ch_but S2
  - …

Inserting spontaneous words (cont.)
- Content words:
  - Dropped arguments: Ch-Eng: ch_buy le ma → Has Subj bought Obj?
  - Chinese first names: Eng-Ch: Jiang … → ch_Jiang ch_Zemin …
  - Abbreviations, acronyms: Ch-Eng: ch_12 ch_big → the 12th National Congress of the CPC (Communist Party of China)
  - …

Major challenges: putting the words in the correct order
- Getting the right words:
  - Choosing the correct root form
  - Getting the correct inflected form
  - Inserting “spontaneous” words
- Putting the words in the correct order:
  - Word order: SVO vs. SOV, …
  - Unique constructions
  - Structural divergence

Word order
- SVO, SOV, VSO, …
- VP + PP → PP + VP
- VP + AdvP → AdvP + VP
- Adj + N → N + Adj
- NP + PP → PP + NP
- NP + S → S + NP
- P + NP → NP + P
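
Reordering rules of this kind can be sketched as swaps of sibling nodes in a parse tree. The code below is a minimal illustration under stated assumptions, not a method taken from the slides: trees are nested tuples of the form `(label, child, ...)`, leaves are `(label, "word")`, and the `SWAP` table, `reorder`, and `words` are hypothetical names. Only three of the listed rules are included.

```python
# Hypothetical rule table: (left label, right label) sibling pairs to swap.
SWAP = {("VP", "PP"), ("Adj", "N"), ("P", "NP")}

def label_of(node):
    """Return the label of a tree node, or None for a bare word."""
    return node[0] if isinstance(node, tuple) else None

def reorder(node):
    """Recursively swap two-child sibling pairs that appear in SWAP."""
    if isinstance(node, str):
        return node
    label, *children = node
    children = [reorder(child) for child in children]
    if len(children) == 2:
        left, right = children
        if (label_of(left), label_of(right)) in SWAP:
            children = [right, left]
    return (label, *children)

def words(node):
    """Flatten a tree into its left-to-right word sequence."""
    if isinstance(node, str):
        return [node]
    return [w for child in node[1:] for w in words(child)]

tree = ("PP", ("P", "in"), ("NP", ("Adj", "red"), ("N", "house")))
reordered = reorder(tree)
print(words(tree))
print(words(reordered))
```

In this toy case the preposition becomes a postposition and the adjective follows the noun. Real systems condition such rules on much richer context than a pair of labels.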

“Unique” constructions
- Overt wh-movement: Eng-Ch:
  - Eng: Why do you think that he came yesterday?
  - Ch: you why think he yesterday come ASP?
  - Ch: you think he yesterday why come?
- Ba-construction: Ch-Eng:
  - She ba homework finish ASP → She finished her homework.
  - He ba wall dig ASP CL hole → He dug a hole in the wall.
  - She ba orange peel ASP skin → She peeled the orange’s skin.

Translation divergences
- Source and target parse trees (dependency trees) are not identical.
- Example: Eng: I like Mary → Sp: María me gusta a mí (‘Mary pleases me’)
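
The like/gustar example can be sketched as a transfer rule over a flat dependency frame. This is a hypothetical illustration, not a description of any particular MT system: the dictionary layout and the `transfer_like` function are assumptions made for this sketch, and the argument words are left untranslated so that only the role swap is visible.

```python
def transfer_like(dep):
    """Map an English 'like' frame to a Spanish 'gustar' frame.

    The English object becomes the Spanish subject, and the English
    subject becomes the Spanish indirect object.
    """
    if dep.get("head") != "like":
        return dep  # rule does not apply; leave the frame unchanged
    return {"head": "gustar", "subj": dep["obj"], "iobj": dep["subj"]}

english = {"head": "like", "subj": "I", "obj": "Mary"}
spanish = transfer_like(english)
print(spanish)
```

The two frames have different shapes, which is why word-by-word translation of "I like Mary" does not produce the Spanish sentence.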