DARPA TIDES MT Group Meeting Marina del Rey Jan 25, 2002 Alon Lavie, Stephan Vogel, Alex Waibel (CMU) Ulrich Germann, Kevin Knight, Daniel Marcu (ISI)

Slides:



Advertisements
Similar presentations
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Advertisements

Problems for Statistical MT Preprocessing Language modeling Translation modeling Decoding Parameter optimization Evaluation.
GIZA ++ A review of how to run GIZA++ By: Bridget McInnes
Arthur Chan Prepared for Advanced MT Seminar
Jing-Shin Chang National Chi Nan University, IJCNLP-2013, Nagoya 2013/10/15 ACLCLP – Activities ( ) & Text Corpora.
Multilinugual PennTools that capture parses and predicate-argument structures, and their use in Applications Martha Palmer, Aravind Joshi, Mitch Marcus,
Powerpoint Templates Page 1 Voices From the Field Rick Pardo & Donna Norley Board: Thames Valley DSB Learning Team Facilitator (Professional Learning Cycle)
Re-evaluating Bleu Alison Alvarez Machine Translation Seminar February 16, 2006.
TAP-ET: TRANSLATION ADEQUACY AND PREFERENCE EVALUATION TOOL Mark Przybocki, Kay Peterson, Sébastien Bronsart May LREC 2008 Marrakech, Morocco.
Dependency-Based Automatic Evaluation for Machine Translation Karolina Owczarzak, Josef van Genabith, Andy Way National Centre for Language Technology.
MT Evaluation CA446 Week 9 Andy Way School of Computing Dublin City University, Dublin 9, Ireland
BLEU, Its Variants & Its Critics Arthur Chan Prepared for Advanced MT Seminar.
June 2004 D ARPA TIDES MT Workshop Measuring Confidence Intervals for MT Evaluation Metrics Ying Zhang Stephan Vogel Language Technologies Institute Carnegie.
Measuring Confidence Intervals for MT Evaluation Metrics Ying Zhang (Joy) Stephan Vogel Language Technologies Institute School of Computer Science Carnegie.
The current status of Chinese- English EBMT -where are we now Joy (Ying Zhang) Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.
CAS-IA System Description Jinhua Du CNGL July 23, 2008.
Resources Distributors Linguistic Data Consortium NEMLAR (Network for Euro-Mediterranean LAnguage Resources)NEMLAR (Network for Euro-Mediterranean LAnguage.
Recent Trends in MT Evaluation: Linguistic Information and Machine Learning Jason Adams Instructors: Alon Lavie Stephan Vogel.
The current status of Chinese-English EBMT research -where are we now Joy, Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.
Minimum Error Rate Training in Statistical Machine Translation By: Franz Och, 2003 Presented By: Anna Tinnemore, 2006.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
MGMT 230 Lab 1 HTML Basics. 2 HTML Tags An HTML document contains both document content and tags. The tags are the HTML codes inserted in a document to.
Machine Translation A Presentation by: Julie Conlonova, Rob Chase, and Eric Pomerleau.
Symmetric Probabilistic Alignment Jae Dong Kim Committee: Jaime G. Carbonell Ralf D. Brown Peter J. Jansen.
MT Summit VIII, Language Technologies Institute School of Computer Science Carnegie Mellon University Pre-processing of Bilingual Corpora for Mandarin-English.
9/12/2003LTI Student Research Symposium1 An Integrated Phrase Segmentation/Alignment Algorithm for Statistical Machine Translation Joy Advisor: Stephan.
1 The Web as a Parallel Corpus  Parallel corpora are useful  Training data for statistical MT  Lexical correspondences for cross-lingual IR  Early.
Does Syntactic Knowledge help English- Hindi SMT ? Avinesh. PVS. K. Taraka Rama, Karthik Gali.
An Information Theoretic Approach to Bilingual Word Clustering Manaal Faruqui & Chris Dyer Language Technologies Institute SCS, CMU.
© 2014 The MITRE Corporation. All rights reserved. Stacey Bailey and Keith Miller On the Value of Machine Translation Adaptation LREC Workshop: Automatic.
Automatic Evaluation Philipp Koehn Computer Science and Artificial Intelligence Lab Massachusetts Institute of Technology.
University Admissions and Standardized Testing Prepared by: David Espinoza, Director, University of Oregon Testing Center
LDMT MURI Data Collection and Linguistic Annotations November 4, 2011 Jason Baldridge, UT Austin Ulf Hermjakob, USC/ISI.
English-Persian SMT Reza Saeedi 1 WTLAB Wednesday, May 25, 2011.
Achieving Domain Specificity in SMT without Over Siloing William Lewis, Chris Wendt, David Bullock Microsoft Research Machine Translation.
Direct Translation Approaches: Statistical Machine Translation
Large Language Models in Machine Translation Conference on Empirical Methods in Natural Language Processing 2007 報告者:郝柏翰 2013/06/04 Thorsten Brants, Ashok.
An Integrated Approach for Arabic-English Named Entity Translation Hany Hassan IBM Cairo Technology Development Center Jeffrey Sorensen IBM T.J. Watson.
Profile The METIS Approach Future Work Evaluation METIS II Architecture METIS II, the continuation of the successful assessment project METIS I, is an.
Arthur Chan Prepared for Advanced MT Seminar
Advanced Signal Processing 05/06 Reinisch Bernhard Statistical Machine Translation Phrase Based Model.
METEOR: Metric for Evaluation of Translation with Explicit Ordering An Automatic Metric for MT Evaluation with Improved Correlations with Human Judgments.
Information Retrieval and Web Search Cross Language Information Retrieval Instructor: Rada Mihalcea Class web page:
Enhanced Infrastructure for Creation & Collection of Translation Resources Zhiyi Song, Stephanie Strassel (speaker), Gary Krug, Kazuaki Maeda.
Coping with Surprise: Multiple CMU MT Approaches Alon Lavie Lori Levin, Jaime Carbonell, Alex Waibel, Stephan Vogel, Ralf Brown, Robert Frederking Language.
NUDT Machine Translation System for IWSLT2007 Presenter: Boxing Chen Authors: Wen-Han Chao & Zhou-Jun Li National University of Defense Technology, China.
Advanced MT Seminar Spring 2008 Instructors: Alon Lavie and Stephan Vogel.
Transfer-based MT with Strong Decoding for a Miserly Data Scenario Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work with:
Publications Vamshi Ambati, Stephan Vogel and Jaime Carbonell. "Collaborative Workflow for Crowdsourcing Translation”, To Appear in the 2012 ACM Conference.
Alignment of Bilingual Named Entities in Parallel Corpora Using Statistical Model Chun-Jen Lee Jason S. Chang Thomas C. Chuang AMTA 2004.
Improving Named Entity Translation Combining Phonetic and Semantic Similarities Fei Huang, Stephan Vogel, Alex Waibel Language Technologies Institute School.
LDMT MURI Data Collection and Linguistic Annotation November 2, 2012 Jason Baldridge, UT Austin Lori Levin, CMU.
1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.
Coping with Surprise: Multiple CMU MT Approaches Alon Lavie Lori Levin, Jaime Carbonell, Alex Waibel, Stephan Vogel, Ralf Brown, Robert Frederking Language.
Large Vocabulary Data Driven MT: New Developments in the CMU SMT System Stephan Vogel, Alex Waibel Work done in collaboration with: Ying Zhang, Alicia.
CMU MilliRADD Small-MT Report TIDES PI Meeting 2002 The CMU MilliRADD Team: Jaime Carbonell, Lori Levin, Ralf Brown, Stephan Vogel, Alon Lavie, Kathrin.
CALIFORNIA DEPARTMENT OF EDUCATION Jack O’Connell, State Superintendent of Public Instruction California English Language Development Test (CELDT) Program.
MEMT: Multi-Engine Machine Translation Guided by Explicit Word Matching Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work.
Review: Review: Translating without in-domain corpus: Machine translation post-editing with online learning techniques Antonio L. Lagarda, Daniel Ortiz-Martínez,
Named Entities in Domain Unlimited Speech Translation Alex Waibel, Stephan Vogel, Tanja Schultz Carnegie Mellon University Interactive Systems Labs.
LingWear Language Technology for the Information Warrior Alex Waibel, Lori Levin Alon Lavie, Robert Frederking Carnegie Mellon University.
2012 Grade 3 Reading Student Portfolio
METEOR: Metric for Evaluation of Translation with Explicit Ordering An Improved Automatic Metric for MT Evaluation Alon Lavie Joint work with: Satanjeev.
2017 Grade 3 Reading Student Portfolio
Urdu-to-English Stat-XFER system for NIST MT Eval 2008
Issues in Arabic MT Alex Fraser USC/ISI 9/22/2018 Issues in Arabic MT.
Computational Linguistics: New Vistas
2011 Grade 3 Reading Student Portfolio
Grade 3 Reading Student Portfolio
2013 Grade 3 Reading Student Portfolio
Presentation transcript:

DARPA TIDES MT Group Meeting Marina del Rey Jan 25, 2002 Alon Lavie, Stephan Vogel, Alex Waibel (CMU) Ulrich Germann, Kevin Knight, Daniel Marcu (ISI) Young-Suk Lee, Kishore Papineni, Salim Roukos (IBM) Franz Josef Och (RWTH Aachen) Moussa Bamba, Chris Cieri, Shudong Huang (LDC) Florence Reeder (Mitre) George Doddington (NIST)

Dec’01 Dry Run Chinese Review NIST-administered Testing on Xinhua, Zaobao, VOA data Training on HK News/Hansards, FBIS, dictionaries, Internet resources Humans beat machines on automatic scoring Data-driven research systems are competitive on Chinese “hard news” translation BLEU metric and NIST variations: –Statistical sensitivity experiments –Discussion of length penalty –Study of DARPA 94 data for automatic/manual score correlations Need to specify what “development test” data means!

Test Data Preparation LDC-produced 100 Chinese and Arabic hard news articles 10 human translations each Special instructions to translators Standards for formatting Human assessment of translations planned

June Chinese Evaluation: Training Data Available –LDC and public C/E dict (new version out in March) HK News, Hansards, Laws (not aligned!) –Publicly distributable (not yet at LDC) HK News alignment (Ulrich Germann’s ISI web site) Chinese treebank translations (100K words) –Martha Palmer at UPenn –Alignments done by Ulrich Germann at ISI –Might be public if donated HK Hansard alignment (BBN? Aachen?) –Will never be distributable Old FBIS data (TIDES only) Internet bilingual data Need to get all public data accessible through LDC!

Training Data (continued) Not yet available –UN data (50M words) Legal problems Alignment problems –New FBIS data (3m words?) –Xinhua bilingual data (possible 2m words): alignment –Various Internet bilingual data & dictionaries –Chem/bio dictionaries, State Dept, CDC

Testing –Chinese news plus DARPA discretion Administration & Scoring –NIST web page, mailing lists, proselytizing –BLEU/NIST scoring –Human assessment June Chinese Evaluation: Testing

Open track –Any resources/methods ok –No web crawling after March 15 Large common data track –No bilingual texts except LDC-available ones –No bilingual dicts except LDC-available ones –Any monolingual texts you want, tokenizers, etc, etc Small common data track –No bilingual texts except 100K-word Chinese treebank texts –Can’t use trees from treebank –10K dictionary to be released by S. Vogel in March –No corpus-trained tools like trained LDC tokenizer –Any English text, treebanks, etc, okay to use June Chinese Evaluation: Tracks

March 15 –no more web-crawling by participants –test data collection begins! June 3: test data out June 14: submissions due June 21: raw results distributed July 22-23: workshop June Chinese Evaluation: Timeline

June Arabic Evaluation No dry run for Arabic – the Chinese dry run should suffice for debugging the eval procedure Training –LDC aiming at 100K word POS tagged (when?) –LDC aiming at morph analyzer (when?) –LDC aiming at Ummah 100K word bilingual text (when?) –Koran used at IBM –UN data available at some sites (50M words) Testing – same as Chinese Administration & Scoring – same as Chinese Tracks – open track only, for first go-round Timeline – same as Chinese