Multi-Engine MT for Quick MT. Missing Technology for Quick MT LingWear ISI MT NICE Core Rapid MT - Multi-Engine MT - Omnivorous resource usage - Pervasive.

Slides:



Advertisements
Similar presentations
Plenary Panel: Building an Infrastructure through Training: Best Practices and Lessons Learned from Africa, Asia, Central-Eastern Europe and Latin America.
Advertisements

Inclusive Multicultural Schooling: How to enhance the participation of immigrant parents in the education system.
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Creating Healthy and Complete Communities Valerie Shuttleworth MCIP RPP Chief Planner Planning and Economic Development York Region.
Africa Forum on Science, Technology and Innovation (STI) for Youth Employment, Human Capital Development and Inclusive Growth Nairobi, Kenya, April 1-3,
The current status of Chinese- English EBMT -where are we now Joy (Ying Zhang) Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.
Open Statistics: Envisioning a Statistical Knowledge Network Ben Shneiderman Founding Director ( ), Human-Computer Interaction.
Flow Network Models for Sub-Sentential Alignment Ying Zhang (Joy) Advisor: Ralf Brown Dec 18 th, 2001.
NICE: Native language Interpretation and Communication Environment Lori Levin, Jaime Carbonell, Alon Lavie, Ralf Brown Carnegie Mellon University.
The current status of Chinese-English EBMT research -where are we now Joy, Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.
Automatic Rule Learning for Resource-Limited Machine Translation Alon Lavie, Katharina Probst, Erik Peterson, Jaime Carbonell, Lori Levin, Ralf Brown Language.
Machine Translation with Scarce Resources The Avenue Project.
Semi-Automatic Learning of Transfer Rules for Machine Translation of Low-Density Languages Katharina Probst April 5, 2002.
MT Summit VIII, Language Technologies Institute School of Computer Science Carnegie Mellon University Pre-processing of Bilingual Corpora for Mandarin-English.
Seminar on Endangered Languages Alan W Black, Robert Frederking, Lori Levin, Laura Tomokiyo Language Technologies.
SCCOOS K-16 Educational Outreach Building on the Existing Infrastructure of the Centers for Ocean Sciences Education Excellence SCCOOS Governance and Planning.
Language Technologies Institute School of Computer Science Carnegie Mellon University NSF August 6, 2001 NICE: Native language Interpretation and Communication.
Constructing Bilingual Resources for Digital Libraries Rim, Hae-Chang Korea University
Ethical considerations on HIV and Syphilis research in indigenous communities in Peru Instituto Nacional de Salud Ministerio de Salud PERÚ Centro Nacional.
CHILE 1 Adaptation process of an evidence based model of investigative interviewing: the Chilean case* Carolina Navarro, Decio Mettifogo, Andrés.
Eliciting Features from Minor Languages The elicitation tool provides a simple interface for bilingual informants with no linguistic training and limited.
Overview of the Language Technologies Institute and AVENUE Project Jaime Carbonell, Director March 2, 2002.
Terminology Across Borders : A Partnership to Build a Bilingual Tool for the Americas Lori J. Finch, Thesaurus Coordinator National Agricultural Library,
Coping with Surprise: Multiple CMU MT Approaches Alon Lavie Lori Levin, Jaime Carbonell, Alex Waibel, Stephan Vogel, Ralf Brown, Robert Frederking Language.
MAG Regional Council July 23, Why Do This? In 2006 the average hospital stay for Arizona residents with asthma was 3.2 days In 2006 the average.
Transfer-based MT with Strong Decoding for a Miserly Data Scenario Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work with:
AVENUE Automatic Machine Translation for low-density languages Ariadna Font Llitjós Language Technologies Institute SCS Carnegie Mellon University.
Documenting Endangered Languages A Partnership between the National Endowment for the Humanities and the National Science Foundation.
Ideas for 100K Word Data Set for Human and Machine Learning Lori Levin Alon Lavie Jaime Carbonell Language Technologies Institute Carnegie Mellon University.
Carnegie Mellon Goal Recycle non-expert post-editing efforts to: - Refine translation rules automatically - Improve overall translation quality Proposed.
Data Collection and Language Technologies for Mapudungun Lori Levin, Rodolfo Vega, Jaime Carbonell, Ralf Brown, Alon Lavie Language Technologies Institute.
Overview of the Language Technologies Institute and AVENUE Project Jaime Carbonell, Director March 2, 2002.
¨ Free-loan library of described and captioned educational media available to students who are deaf, blind, hard of hearing, visually impaired, or deaf-blind.
An Overview of the AVENUE Project Presented by Lori Levin Language Technologies Institute School of Computer Science Carnegie Mellon University Pittsburgh,
Designing a Machine Translation Project Lori Levin and Alon Lavie Language Technologies Institute Carnegie Mellon University CATANAL Planning Meeting Barrow,
A Trainable Transfer-based MT Approach for Languages with Limited Resources Alon Lavie Language Technologies Institute Carnegie Mellon University Joint.
Language Technologies Institute School of Computer Science Carnegie Mellon University NSF, August 6, 2001 Machine Translation for Indigenous Languages.
Coping with Surprise: Multiple CMU MT Approaches Alon Lavie Lori Levin, Jaime Carbonell, Alex Waibel, Stephan Vogel, Ralf Brown, Robert Frederking Language.
Semi-Automated Elicitation Corpus Generation The elicitation tool provides a simple interface for bilingual informants with no linguistic training and.
NICE: Native Language Interpretation and Communication Environment Lori Levin, Jaime Carbonell, Alon Lavie, Ralf Brown, Erik Peterson, Katharina Probst,
MEMT: Multi-Engine Machine Translation Guided by Explicit Word Matching Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work.
Error Analysis of Two Types of Grammar for the purpose of Automatic Rule Refinement Ariadna Font Llitjós, Katharina Probst, Jaime Carbonell Language Technologies.
Language Technologies Capability Demonstration Alon Lavie, Lori Levin, Alex Waibel Language Technologies Institute Carnegie Mellon University CATANAL Planning.
Unclassified//For Official Use Only 1 RAPID: Representation and Analysis of Probabilistic Intelligence Data Carnegie Mellon University PI : Prof. Jaime.
Bridging the Gap: Machine Translation for Lesser Resourced Languages
Avenue Architecture Learning Module Learned Transfer Rules Lexical Resources Run Time Transfer System Decoder Translation Correction Tool Word- Aligned.
Eliciting a corpus of word- aligned phrases for MT Lori Levin, Alon Lavie, Erik Peterson Language Technologies Institute Carnegie Mellon University.
CMU MilliRADD Small-MT Report TIDES PI Meeting 2002 The CMU MilliRADD Team: Jaime Carbonell, Lori Levin, Ralf Brown, Stephan Vogel, Alon Lavie, Kathrin.
AMTEXT: Extraction-based MT for Arabic Faculty: Alon Lavie, Jaime Carbonell Students and Staff: Laura Kieras, Peter Jansen Informant: Loubna El Abadi.
AVENUE: Machine Translation for Resource-Poor Languages NSF ITR
Developing affordable technologies for resource-poor languages Ariadna Font Llitjós Language Technologies Institute Carnegie Mellon University September.
FROM BITS TO BOTS: Women Everywhere, Leading the Way Lenore Blum, Anastassia Ailamaki, Manuela Veloso, Sonya Allin, Bernardine Dias, Ariadna Font Llitjós.
MEMT: Multi-Engine Machine Translation Guided by Explicit Word Matching Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work.
Automated Assistant for Crisis Management (Reflective Agent with Distributed Adaptive Reasoning) RADAR.
Machine Translation Overview Alon Lavie Language Technologies Institute Carnegie Mellon University LTI Immigration Course August 22, 2011.
Teaching through partnerships: A Community Outreach Model Deogracia Cornelio & Leonel Yoque Labor Occupational Safety & Health Program University of California,
Background of the NICE Project Lori Levin Jaime Carbonell Alon Lavie Ralf Brown.
Machine Language Computer languages cannot be directly interpreted by the computer – they are not in binary. All commands need to be translated into binary.
LingWear Language Technology for the Information Warrior Alex Waibel, Lori Levin Alon Lavie, Robert Frederking Carnegie Mellon University.
The AVENUE Project: Automatic Rule Learning for Resource-Limited Machine Translation Faculty: Alon Lavie, Jaime Carbonell, Lori Levin, Ralf Brown Students:
Traduciendo Culturas Perú / Translating Cultures Peru
Eliciting a corpus of word-aligned phrases for MT
RECENT TRENDS IN SMT By M.Balamurugan, Phd Research Scholar,
Multiple linguistic competencies
Language Technologies Institute Carnegie Mellon University
Introduction to Requirements Management
Multiple linguistic competencies
Code-Switching and Style Shifting
Presentation transcript:

Multi-Engine MT for Quick MT

Missing Technology for Quick MT LingWear ISI MT NICE Core Rapid MT - Multi-Engine MT - Omnivorous resource usage - Pervasive Machine Learning - Novel Approaches: * Max-Entropy models * Seeded Version Space Learning * Elicitation from native informants

NICE Carnegie Mellon University April 12, 2000

Project Members Ralf Brown: MT Jaime Carbonell: ML, MT Alon Lavie: ML, MT Lori Levin: Linguistics, MT Rodolfo Vega: International and Development Education, Information Technology in Education (IT-EDU)

Potential Collaborators Chile: Universidad de la Frontera Colombia: Ministry of Interior. (Ruth Connolly from OAS is looking into this.)

Universidad de la Frontera Instituto de Estudios Indigenas: Bilingual Multicultural Education Program Instituto de Informatica Educativa: ENLACES Project, rural component Both funded by the Ministry of Education

Mineduc Programs in Chile Education Quality Improvement Program, MECE ENLACES Austral Region Zonal Center: Instituto de Informatica Educativa Bilingual Multicultural Education Program La Araucania Region Projects: Instituto de Estudios Indigenas

Work in Year 1 Establish partnerships Collect data First version of Example-Based MT between Spanish and one indigenous language Develop elicitation corpus Build elicitation interface

Establishing Partnerships Identify a community that wants to work with us: design an MT application that fits in with their plans for community development or bilingual or monolingual education Identify scientists who want to work with us: linguists, computer scientists, etc. Identify non-U.S. funding sources for the indigenous community and scientists. Identify existing programs like ENLACES

Work in Year 2 Ongoing work from Year 1 Experiment with version space learning of translation rules Build a rule interpreter for running the translation rules