C SC 620 Advanced Topics in Natural Language Processing Lecture 19 4/6.

Slides:



Advertisements
Similar presentations
Approach, Methods, Techniques
Advertisements

The Chinese Room: Understanding and Correcting Machine Translation This work has been supported by NSF Grants IIS Solution: The Chinese Room Conclusions.
Machine Translation II How MT works Modes of use.
CS3773 Software Engineering Lecture 01 Introduction.
MT Evaluation: Human Measures and Assessment Methods : Machine Translation Alon Lavie February 23, 2011.
1 Linguistics and translation theory Mark Shuttleworth Teaching Translation Swansea, 20 January 2006.
C SC 620 Advanced Topics in Natural Language Processing Lecture 22 4/15.
January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.
Machine Translation (Level 2) Anna Sågvall Hein GSLT Course, September 2004.
Machine Translation Anna Sågvall Hein Mösg F
C SC 620 Advanced Topics in Natural Language Processing Lecture 20 4/8.
C SC 620 Advanced Topics in Natural Language Processing 3/11 Lecture 15.
C SC 620 Advanced Topics in Natural Language Processing Lecture 21 4/13.
C SC 620 Advanced Topics in Natural Language Processing Lecture 23 4/20.
Scientific Writing Skills Presented by Prof. Leon de Stadler US Writing Centre.
C SC 620 Advanced Topics in Natural Language Processing Lecture 16 3/23.
Finishing Up Comm Arts II Mr. Wreford. Finishing Up  You have a clear position on your topic.  You are now ready to create a review draft.  Argue your.
C SC 620 Advanced Topics in Natural Language Processing Lecture 24 4/22.
MT Summit VIII, Language Technologies Institute School of Computer Science Carnegie Mellon University Pre-processing of Bilingual Corpora for Mandarin-English.
C SC 620 Advanced Topics in Natural Language Processing 3/9 Lecture 14.
C SC 620 Advanced Topics in Natural Language Processing Lecture 10 2/19.
C SC 620 Advanced Topics in Natural Language Processing Lecture 17 3/25.
C SC 620 Advanced Topics in Natural Language Processing Lecture 13 3/4.
Linguistics and Language Teaching Lecture 9. Approaches to Language Teaching In order to improve the efficiency of language teaching, many approaches.
Teaching Grammar in the Communicative Classroom:
Models of Generative Grammar Smriti Singh. Generative Grammar  A Generative Grammar is a set of formal rules that can generate an infinite set of sentences.
VENDORS, CONSULTANTS AND USERS
An Agile View of Process
MACHINE TRANSLATION TRANSLATION(5) LECTURE[1-1] Eman Baghlaf.
FISH 521 Peer review. Peer review Mechanics Advantages Challenges Solutions.
M.Hosseinzadeh EDC Translation Art or Skill Session.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Grammar-Translation Approach Direct Approach
Machine Translation Dr. Radhika Mamidi. What is Machine Translation? A sub-field of computational linguistics It investigates the use of computer software.
Computational Methods to Vocalize Arabic Texts H. Safadi*, O. Al Dakkak** & N. Ghneim**
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Prepare Yourself for IR Research ChengXiang Zhai Department of Computer.
Scientific Writing Skills Presented by Prof. Leon de Stadler US Writing Centre
Designing and implementing of the NQF Tempus Project N° TEMPUS-2008-SE-SMHES ( )
FishBase Summary Page about Salmo salar in the standard Language of FishBase (English) ENBI-WP-11: Multilingual Access to European Biodiversity Sites through.
TRANSLATION & THE HIGH TECH INDUSTRY. INTRODUCTION Translation has been existing ever since mythology began, passed the prophethood, and now in modern.
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
Unit 3 The Direct Method.
Stages of Teaching an Oral Lesson
CHAPTER 1 Understanding RESEARCH
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Computational Linguistics. The Subject Computational Linguistics is a branch of linguistics that concerns with the statistical and rule-based natural.
Business Writing Thomas Tasker English Language Fellow Program
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
Machine Translation (Level 2) Anna Sågvall Hein GSLT Course, January 2003.
Agents that Reduce Work and Information Overload and Beyond Intelligent Interfaces Presented by Maulik Oza Department of Information and Computer Science.
TELETANDEM ORIENTATION SESSION. Foreign language learning in- tandem involves pairs of native (or competent) speakers of different languages working collaboratively.
1 Ch 1. VOCABULARY SIZE, TEXT COVERAGE & WORD LISTS Nation& Waring.
Let’s understand the eight sentences.(1-4)  The sender selects a message in the source language.  Encodes the message in signal 1 with the SL context.
Objective: Enabling students to translate from English into Arabic and vice versa. Why teach translation: It develops accuracy, fluency, clarity, and.
TYPE OF READINGS.
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
INTRODUCTION TO APPLIED LINGUISTICS
Questionnaire-Part 2. Translating a questionnaire Quality of the obtained data increases if the questionnaire is presented in the respondents’ own mother.
Vocabulary Acquisition in a Second Language: Do Learners Really Acquire Most Vocabulary by Reading? Some Empirical Evidence Batia Laufer.
Advanced Software Engineering Dr. Cheng
Human Computer Interaction Lecture 21 User Support
Glottodidactics Lesson 7.
Software & Software Engineering Pertemuan-4 Dosen :Kundang K Juman
PLANNING AND DESIGNING A RESEARCH STUDY
Statistical NLP: Lecture 13
VENDORS, CONSULTANTS AND USERS
Statistical NLP: Lecture 9
What is Research? A research study is a study conducted to collect and analyse information in order to increase our understanding of a topic or an issue.
Presentation transcript:

C SC 620 Advanced Topics in Natural Language Processing Lecture 19 4/6

Reading List Readings in Machine Translation, Eds. Nirenburg, S. et al. MIT Press Reading list: –12. Correlational Analysis and Mechanical Translation. Ceccato, S. –13. Automatic Translation: Some Theoretical Aspects and the Design of a Translation System. Kulagina, O. and I. Mel’cuk –16. Automatic Translation and the Concept of Sublanguage. Lehrberger, J. –17. The Proper Place of Men and Machines in Language Translation. Kay, M.

Paper 17. The Proper Place of Men and Machine in Language Translation. M. Kay Introduction –Fully Automatic High Quality Translation (FAHQT) not realistic –Main point: There is a great deal that computer scientists and linguists could contribute to the practical problem of producing translations, but, in their own interests as well as those of their customers, they should never be asked to provide an engineering solution to a problem that they only dimly understand By doing only what can be done with absolute surety and reliability now and by going forward from there in short, carefully measured steps, very considerable gains can be virtually guaranteed to all concerned

Paper 17. The Proper Place of Men and Machine in Language Translation. M. Kay Machine Translation and Linguists –Translation for a pronoun –Target sentence pair: Since the dictionary is constructed on the basis of the text that is being processed, it need refer to only a small amount of context to resolve ambiguities Since the dictionary is constructed by a native speaker of the language, he need refer to only a small amount of context to resolve ambiguities Il (French) -> {it, he} –Pronoun ambiguity Ad hoc solution (case-by-case, statistical) Not solved –Great number of ad hoc solutions built into existing MT systems and any enhancement in the future will require more and more of the same

Paper 17. The Proper Place of Men and Machine in Language Translation. M. Kay Machine Translation and Computer Science –Linguistic perspective: Unlikely to be adequate engineering where we know there is no adequate science –Programming style and techinque Efficiency important - large quantities of text to be handled Low level programming languages used (1980) –More likely to be ad hoc –Efficiency gains at most linear To be continually improvable, program needs to be perspicuous (no doubt about the role that each of its parts play) and robust (can be changed in important ways without fear of damage)

Paper 17. The Proper Place of Men and Machine in Language Translation. M. Kay The Statistical Defense –Linguistics requires of its practitioners remarkable virtuosity in constructing examples of problems such as no existing or proposed computer system could possibly solve –The claim is that we do not have to solve them so long as they do not crop up very often May not have an algorithm that will identify the antecedent of a pronoun whenever a human reader could, but if it can devise a method that will identify it most of the time, that will be good enough –Algorithm that works most of the time is of very little use unless there is some automatic way of deciding when it is and when it is not working If algorithm could draw the proofreader’s attention to all cases of pronominal reference that were in doubt, and if a high proportion of the cases were known to be correctly handled, then the utility of the technique is clear

Paper 17. The Proper Place of Men and Machine in Language Translation. M. Kay The Statistical Defense –If program works 99%, 90%, 80% or 50% of the time but we cannot tell which cases, the amount of work left to the repairman is essentially the same. –Real situation is worse: many decisions of the same difficulty must be made in the course of translating a single sentence. If there is reason to expect each of them to be correct 90% of the time, there need only be seven of them in a stretch of text to reduce the expectation to.9 7 =.48 The Sorcerer’s-Apprentice Defense –Incomplete theories is a worse base to build systems than no theories –The man looked at the girl with the telescope –French admits parallel ambiguities in PP attachment –The man looked at the girl with penetrating eyes –aux yeux or de ses yeux, avec yeux not acceptable

Paper 17. The Proper Place of Men and Machine in Language Translation. M. Kay The Sorcerer’s-Apprentice Defense –Ils signeront le document pourvu que leur gouvernement accepte –(I) They {will, are going to} sign the document{ supplied, furnished, provided} that their government accepts –(II) They {will, are going to} sign the document{provided that, on condition that, only if} that their government accepts –Intersect (I) and (II) The Translator’s Amanuensis –transcriber Text Editing Translation Aids –Dictionary –Idioms (frequently occurring items)

Paper 17. The Proper Place of Men and Machine in Language Translation. M. Kay Machine Translation –Translator under the tight control of a human translator –Human/machine collaboration –Consult the user –It is far better that the labor and ingenuity spent on developing the machine’s ability to make bad guesses should be employed more productively –Cascading errors common in machine-only systems –Keep track of words and phrases used in some special way

Readings Readings in Machine Translation –19. Montague Grammar and Machine Translation. Landsbergen, J. –20. Dialogue Translation vs. Text Translation – Interpretation Based Approach. Tsujii, J.-I. And M. Nagao –21. Translation by Structural Correspondences. Kaplan, R. et al. –22. Pros and Cons of the Pivot and Transfer Approaches in Multilingual Machine Translation. Boitet, C. –31. A Framework of a Mechanical Translation between Japanese and English by Analogy Principle. Nagao, M. –32. A Statistical Approach to Machine Translation. Brown, P. F. et al. Papers from recent MT Summit Conferences