Speech-to-Speech MT: Design and Engineering. Alon Lavie and Lori Levin. MT Class, April 5, 2000.

Presentation transcript:


Outline
- Design and Engineering of the JANUS/C-STAR speech-to-speech MT system
- The C-STAR Travel Domain Interlingua (IF)
- Evaluation and User Studies
- Open Problems, Current and Future Research

Overview
- Fundamentals of our approach
- System overview
- Engineering a multi-domain system
- Evaluations and user studies
- Alternative translation approaches
- Current and future research

JANUS Speech Translation
- Translation via an interlingua representation
- Main translation engine is rule-based
- Semantic grammars
- Modular grammar design
- System engineered for multiple domains
- Incorporate alternative translation engines

The C-STAR Travel Planning Domain
- General Scenario: Dialogue between one traveler and one or more travel agents
- Focus on making travel arrangements for a personal leisure trip (not business)
- Free spontaneous speech

The C-STAR Travel Planning Domain
Natural breakdown into several sub-domains:
- Hotel Information and Reservation
- Transportation Information and Reservation
- Information about Sights and Events
- General Travel Information
- Cross Domain

Semantic Grammars
- Describe structure of semantic concepts instead of syntactic constituency of phrases
- Well suited for task-oriented dialogue containing many fixed expressions
- Appropriate for spoken language, which is often disfluent and syntactically ill-formed
- Faster to develop reasonable coverage for limited domains

Semantic Grammars
Hotel Reservation Example:
- Input: we have two hotels available
- Parse Tree: [give-information+availability+hotel] ( we have [hotel-type] ( [quantity=] ( two ) [hotel] ( hotels ) ) available )
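To make the bracketed notation concrete, here is a minimal sketch that reads a parse tree of this form into nested `(label, children)` tuples. The function names `parse_tree` and `words` are illustrative, not part of JANUS. The example string assumes that the [hotel-type] node closes right after "hotels"; the slide does not state this explicitly.

```python
import re

def parse_tree(text):
    """Parse '[concept] ( child ... )' notation into (label, children) tuples."""
    tokens = re.findall(r"\[[^\]]+\]|\(|\)|[^\s()\[\]]+", text)
    pos = 0

    def node():
        nonlocal pos
        label = tokens[pos][1:-1]          # strip the square brackets
        pos += 1
        if pos >= len(tokens) or tokens[pos] != "(":
            raise ValueError(f"expected '(' after [{label}]")
        pos += 1
        children = []
        while pos < len(tokens) and tokens[pos] != ")":
            if tokens[pos].startswith("["):
                children.append(node())    # nested concept
            elif tokens[pos] == "(":
                raise ValueError("'(' without a preceding [concept]")
            else:
                children.append(tokens[pos])  # plain input word
                pos += 1
        if pos >= len(tokens):
            raise ValueError(f"missing ')' for [{label}]")
        pos += 1                           # consume ')'
        return (label, children)

    tree = node()
    if pos != len(tokens):
        raise ValueError("unexpected trailing tokens")
    return tree

def words(tree):
    """Return the input words covered by a tree, in order."""
    _, children = tree
    out = []
    for child in children:
        out.extend(words(child) if isinstance(child, tuple) else [child])
    return out

# Assumed bracketing: [hotel-type] covers "two hotels" only.
EXAMPLE = ("[give-information+availability+hotel] ( we have [hotel-type] "
           "( [quantity=] ( two ) [hotel] ( hotels ) ) available )")
tree = parse_tree(EXAMPLE)
print(tree[0])
print(" ".join(words(tree)))
```

Reading the words back off the tree recovers the input sentence, which is a quick check that the bracketing is well-formed.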

The JANUS-III Translation System

The SOUP Parser
- Specifically designed to parse spoken language using domain-specific semantic grammars
- Robust: can skip over disfluencies in input
- Stochastic: probabilistic CFG encoded as a collection of RTNs (recursive transition networks) with arc probabilities
- Top-Down: parses from top-level concepts of the grammar down to matching of terminals
- Chart-based: dynamic matrix of parse DAGs indexed by start and end positions and head category
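The "robust" and "top-down" properties can be illustrated with a toy matcher. This is only a sketch under stated assumptions, not the SOUP algorithm: it has no RTNs, arc probabilities or chart, and the grammar, the `max_skip` parameter and the function names are all hypothetical. It expands a top-level concept down to terminals and lets each terminal skip a few unmatched words.

```python
# Hypothetical toy semantic grammar: concept -> list of alternative right-hand sides.
GRAMMAR = {
    "[give-information+availability+hotel]": [["we", "have", "[hotel-type]", "available"]],
    "[hotel-type]": [["[quantity=]", "[hotel]"]],
    "[quantity=]": [["one"], ["two"], ["three"]],
    "[hotel]": [["hotel"], ["hotels"]],
}
TOP = "[give-information+availability+hotel]"

def expand(symbol, words, start, max_skip):
    """Yield (tree, end, skipped) for each way `symbol` matches from `start`."""
    if symbol not in GRAMMAR:                      # terminal word
        for skip in range(max_skip + 1):           # allow skipping disfluencies
            i = start + skip
            if i < len(words) and words[i] == symbol:
                yield symbol, i + 1, skip
        return
    for rhs in GRAMMAR[symbol]:                    # top-down expansion
        partial = [([], start, 0)]
        for child in rhs:
            extended = []
            for kids, pos, skipped in partial:
                for tree, end, s in expand(child, words, pos, max_skip):
                    extended.append((kids + [tree], end, skipped + s))
            partial = extended
        for kids, end, skipped in partial:
            yield (symbol, kids), end, skipped

def parse(sentence, max_skip=2):
    """Return (tree, skipped_word_count) with the fewest skips, or None."""
    words = sentence.lower().split()
    candidates = list(expand(TOP, words, 0, max_skip))
    if not candidates:
        return None
    tree, _, skipped = min(candidates, key=lambda c: c[2])
    return tree, skipped

result = parse("we uh have um two hotels available")
print(result)
```

Here the filler words "uh" and "um" are skipped, so the disfluent input still yields the same tree as the clean sentence.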

The SOUP Parser
- Supports parsing with large multiple domain grammars
- Produces a lattice of parse analyses headed by top-level concepts
- Disambiguation heuristics rank the analyses in the parse lattice and select a single best path through the lattice
- Graphical grammar editor

SOUP Disambiguation Heuristics
- Maximize coverage (of input)
- Minimize number of parse trees (fragmentation)
- Minimize number of parse tree nodes
- Minimize the number of wild-card matches
- Maximize the probability of parse trees
- Find sequence of domain tags with maximal probability given the input words: P(T|W), where T = t_1, t_2, ..., t_n is a sequence of domain tags
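One simple way to combine the first five criteria is a strict priority order, sketched below. The slides list the heuristics but do not say how they are weighted or combined, so the lexicographic ordering, the `Analysis` fields and the sample numbers are all assumptions made for illustration.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Analysis:
    name: str
    covered_words: int      # input words covered by the analysis
    num_trees: int          # number of parse trees (fragmentation)
    num_nodes: int          # total parse tree nodes
    wildcard_matches: int   # wild-card matches used
    log_prob: float         # log probability of the parse trees

def rank_key(a):
    """Smaller key is better; criteria applied in the order listed (assumed)."""
    return (-a.covered_words, a.num_trees, a.num_nodes,
            a.wildcard_matches, -a.log_prob)

def best_analysis(analyses):
    return min(analyses, key=rank_key)

# Hypothetical candidates from a parse lattice.
candidates = [
    Analysis("fragmented", 5, 3, 9, 0, -4.0),
    Analysis("single", 5, 1, 7, 0, -6.0),
    Analysis("partial", 3, 1, 4, 0, -1.0),
]
print(best_analysis(candidates).name)
```

With this ordering, "partial" loses on coverage and "fragmented" loses on the number of trees, even though both have higher probability than "single". A weighted combination would be an equally plausible design.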

JANUS Generation Modules
Two alternative generation modules:
- Top-Down context-free based generator: fast, used for English and Japanese
- GenKit: unification-based generator augmented with Morphe morphology module, used for German

Modular Grammar Design
- Grammar development separated into modules corresponding to sub-domains (Hotel, Transportation, Sights, General Travel, Cross Domain)
- Shared core grammar for lower-level concepts that are common to the various sub-domains (e.g. times, prices)
- Grammars can be developed independently (using shared core grammar)
- Shared and Cross-Domain grammars significantly reduce effort in expanding to new domains
- Separate grammar modules facilitate associating parses with domain tags, which is useful for multi-domain integration within the parser

Translation with Multiple Domain Grammars
- Parser is loaded with all domain grammars
- Domain tag attached to grammar rules of each domain
- Previously developed grammars for other domains can also be incorporated
- Parser creates a parse lattice consisting of multiple analyses of the input into sequences of top-level domain concepts
- Parser disambiguation heuristics rank the analyses in the parse lattice and select a single best sequence of concepts
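Loading several grammar modules with domain tags can be pictured as follows. This is a minimal sketch: the rule format, the concept names, the "shared" tag for core rules and the helper names are hypothetical, chosen only to show how each rule can carry the tag of the module it came from.

```python
# Hypothetical shared core grammar for lower-level concepts (e.g. times).
SHARED_CORE = {
    "[time]": [["at", "[hour]"]],
    "[hour]": [["noon"], ["midnight"]],
}

# Hypothetical sub-domain modules, developed independently on top of the core.
DOMAIN_MODULES = {
    "hotel": {
        "[request-information+room]": [["is", "a", "room", "free", "[time]"]],
    },
    "transportation": {
        "[request-information+departure]": [["does", "the", "train", "leave", "[time]"]],
    },
}

def load_grammars(shared, modules):
    """Return (domain_tag, lhs, rhs) rules; core rules get the tag 'shared'."""
    rules = [("shared", lhs, rhs) for lhs, alts in shared.items() for rhs in alts]
    for domain, grammar in modules.items():
        for lhs, alts in grammar.items():
            rules.extend((domain, lhs, rhs) for rhs in alts)
    return rules

def domains_of(concept, rules):
    """Domain tags of all rules that define `concept`."""
    return sorted({tag for tag, lhs, _ in rules if lhs == concept})

rules = load_grammars(SHARED_CORE, DOMAIN_MODULES)
print(domains_of("[request-information+room]", rules))
print(domains_of("[time]", rules))
```

Because every rule keeps its tag, a parse headed by a given top-level concept can be associated with the domain that contributed it.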

Translation with Multiple Domain Grammars

A SOUP Parse Lattice

User Studies
- We conducted three sets of user tests
- Travel agent is played by an experienced system user
- Traveler is played by a novice who is given five minutes of instruction
- Traveler is given a general scenario, e.g., plan a trip to Heidelberg
- Communication only via the speech translation (ST) system, a multi-modal interface and a muted video connection
- Data collected is used for system evaluation, error analysis and then grammar development

System Evaluation Methodology
- End-to-end evaluations conducted at the SDU (sentence) level
- Multiple bilingual graders compare the input with translated output and assign a grade of: Perfect, OK or Bad
  - OK = meaning of SDU comes across
  - Perfect = OK + fluent output
  - Bad = translation incomplete or incorrect
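A tally of such grades might look like the sketch below. The slides do not say how judgments from several graders are combined, so pooling all judgments into one list is an assumption, as is reporting an "acceptable" figure defined as Perfect plus OK. The sample grades are made up.

```python
from collections import Counter

GRADES = ("perfect", "ok", "bad")

def summarize(grades):
    """Return the percentage of each grade plus 'acceptable' (perfect + ok)."""
    counts = Counter(grades)
    unknown = set(counts) - set(GRADES)
    if unknown:
        raise ValueError(f"unknown grades: {sorted(unknown)}")
    total = len(grades)
    if total == 0:
        raise ValueError("no grades to summarize")
    pct = {g: 100.0 * counts[g] / total for g in GRADES}
    pct["acceptable"] = pct["perfect"] + pct["ok"]   # assumed summary figure
    return pct

# Hypothetical pooled judgments for four SDUs.
summary = summarize(["perfect", "ok", "bad", "ok"])
print(summary)
```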

August-99 Evaluation
- Data from latest user study: traveler planning a trip to Japan
- 132 utterances containing one or more SDUs, from six different users
- Speech recognition (SR) word error rate: 14.7%
- 40.2% of utterances contain recognition error(s)
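For reference, word error rate is conventionally computed as the word-level edit distance (substitutions, deletions and insertions) between the recognizer output and a reference transcript, divided by the number of reference words. The sketch below shows that standard definition; the slides do not describe the scoring tool actually used, and the example sentences are made up. As a side calculation, 40.2% of 132 utterances corresponds to 53 utterances with at least one recognition error.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    prev = list(range(len(hyp) + 1))          # distances for an empty reference
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (r != h)))  # substitution or match
        prev = cur
    return prev[-1] / len(ref)

# Hypothetical recognizer output with one substituted word out of five.
wer = word_error_rate("we have two hotels available",
                      "we have to hotels available")
print(f"{wer:.1%}")
```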

Evaluation Results

Evaluation - Progress Over Time

Alternative Approaches: SALT
- SALT: Statistical Analyzer for Language Translation
- Combines machine-learning (ML) trainable and rule-based analysis methods for robustness and portability
- Rule-based parsing restricted to a well-defined set of argument-level phrases and fragments
- Trainable classifiers (neural networks, decision trees, etc.) used to derive the domain action (DA: speech act and concepts) from the sequence of argument concepts
- Phrase-level grammars are more robust and portable to new domains
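The second stage, going from argument concepts to a domain action, can be pictured with a deliberately simple stand-in classifier. The sketch below uses nearest-neighbour matching on concept overlap rather than the neural networks or decision trees named on the slide, and the training pairs, argument labels and DA labels are hypothetical.

```python
def jaccard(a, b):
    """Overlap between two collections of concept labels (0.0 to 1.0)."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 1.0

def classify(concepts, training):
    """Return the DA label of the training example most similar to `concepts`."""
    best = max(training, key=lambda example: jaccard(concepts, example[0]))
    return best[1]

# Hypothetical training data: (argument concepts from phrase-level parsing, DA).
TRAINING = [
    (["hotel-type", "availability"], "give-information+availability+hotel"),
    (["price", "room-type"], "request-information+price+room"),
    (["time", "transportation-type"], "request-information+departure+transportation"),
]

da = classify(["availability", "hotel-type", "quantity"], TRAINING)
print(da)
```

The point of the split is that only the small argument-level grammars are hand-written, while the mapping to a DA is learned from labelled examples.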

Alternative Approaches: Pangloss
- Glossary-based Translation
- Translates directly into target language (no IF)
- Based on Pangloss translation system developed at CMU
- Uses a combination of EBMT, phrase glossaries and a bilingual dictionary
- English/German system operational
- Good fall-back for uncovered utterances
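The idea of direct, glossary-based translation can be sketched as a greedy longest-match lookup with a word dictionary as fall-back. This is not the Pangloss algorithm (the EBMT component is omitted entirely), and the glossary entries, dictionary entries and `max_len` parameter are hypothetical.

```python
# Hypothetical English-to-German resources.
GLOSSARY = {"good morning": "guten Morgen", "thank you": "vielen Dank"}
DICTIONARY = {"and": "und", "hotel": "Hotel"}

def translate(sentence, glossary, dictionary, max_len=3):
    """Translate left to right, preferring the longest glossary phrase."""
    words = sentence.lower().split()
    out, i = [], 0
    while i < len(words):
        for n in range(min(max_len, len(words) - i), 0, -1):
            phrase = " ".join(words[i:i + n])
            if phrase in glossary:
                out.append(glossary[phrase])
                i += n
                break
        else:
            # No phrase matched: fall back to the word dictionary,
            # passing unknown words through unchanged.
            out.append(dictionary.get(words[i], words[i]))
            i += 1
    return " ".join(out)

print(translate("good morning and thank you", GLOSSARY, DICTIONARY))
```

Because nothing here depends on a grammar covering the input, an approach of this kind always produces some output, which is what makes it useful as a fall-back.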

Current and Future Work
- Expanding the travel domain: covering descriptive as well as task-oriented sentences
- Development of the SALT statistical approach and expanding it to other domains
- Full integration of multiple MT approaches: SOUP, SALT, Pangloss
- Task-based evaluation
- Disambiguation: improved sentence-level disambiguation; applying discourse contextual information for disambiguation

Students Working on the Project
- Chad Langley: improved SALT approach
- Dorcas Wallace: DA disambiguation using decision trees, English grammars
- Taro Watanabe: DA correction and disambiguation using Transformation-based Learning, Japanese grammars
- Ariadna Font-Llitjos: Spanish Generation

The JANUS/C-STAR Team
- Project Leaders: Lori Levin, Alon Lavie, Monika Woszczyna, Alex Waibel
- Grammar and Component Developers: Donna Gates, Dorcas Wallace, Taro Watanabe, Boris Bartlog, Marsal Gavalda, Chad Langley, Marcus Munk, Klaus Ries, Klaus Zechner, Detlef Koll, Michael Finke, Eric Carraux, Celine Morel, Alexandra Slavkovic, Susie Burger, Laura Tomokiyo, Takashi Tomokiyo, Kavita Thomas, Mirella Lapata, Matthew Broadhead, Cortis Clark, Christie Watson, Daniella Mueller, Sondra Ahlen