Recent Advances in Speech Translation Systems ESSLLI-2002 Tutorial Course August 12-16, 2002 Course Organizers: Alon Lavie – Carnegie Mellon University.

Slides:



Advertisements
Similar presentations
1 Preparing Future multiSensorial inTerAction Research – human language technologies PF-STAR – the project. HUMAINE WP5-WS, Belfast, Dec.
Advertisements

Pattern-Based French-to-IF Analysis Fill in the blanks IF-to-French Generation within NESPOLE! Hervé Blanchon CLIPS-IMAG
J. Kunzmann, K. Choukri, E. Janke, A. Kießling, K. Knill, L. Lamel, T. Schultz, and S. Yamamoto Automatic Speech Recognition and Understanding ASRU, December.
MULTI LINGUAL ISSUES IN SPEECH SYNTHESIS AND RECOGNITION IN INDIAN LANGUAGES NIXON PATEL Bhrigus Inc Multilingual & International Speech.
C SC 620 Advanced Topics in Natural Language Processing Lecture 22 4/15.
Languages & The Media, 4 Nov 2004, Berlin 1 Multimodal multilingual information processing for automatic subtitle generation: Resources, Methods and System.
Centro per la Ricerca Scientifica e Tecnologica Spoken language technologies: recent advances and future challenges Gianni Lazzari VIENNA July 26.
Speech Translation on a PDA By: Santan Challa Instructor Dr. Christel Kemke.
Center for Computational Learning Systems Independent research center within the Engineering School NLP people at CCLS: Mona Diab, Nizar Habash, Martin.
Spoken Language Generation Project II Synthesizing Emotional Speech in Fairy Tales.
Multiagent Systems and Societies of Agents
Machine Translation Prof. Alexandros Potamianos Dept. of Electrical & Computer Engineering Technical University of Crete, Greece May 2003.
Visual Speech Recognition Using Hidden Markov Models Kofi A. Boakye CS280 Course Project.
1 Distance education : What could technology offer ? Gérard CHOLLET ENST/CNRS-LTCI 46 rue Barrault PARIS cedex 13
Speech-to-Speech MT JANUS C-STAR/Nespole! Lori Levin, Alon Lavie, Bob Frederking LTI Immigration Course September 11, 2000.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
What will this presentation cover? Part 1: GCSE English Language Course outline Exam dates PART 2 : GCSE English Literature Course outline Exam dates.
Machine Learning CS 165B Spring 2012
NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls Udhay Nallasamy, Alan W Black, Tanja Schultz, Robert Frederking,
Building a Framework for Assessment. Four Components of Course Design.
Linguistic Representation of Finnish in the Medical Domain Spoken Language Translation System Marianne Santaholma, University of Geneva, TIM/ISSCO.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
DFKI GmbH, , R. Karger Indo-German Workshop on Language Technologies Reinhard Karger, M.A. Deutsches Forschungszentrum für Künstliche Intelligenz.
Direct Translation Approaches: Statistical Machine Translation
What is LANGUAGE?.
SIG IL 2000 Evaluation of a Practical Interlingua for Task-Oriented Dialogue Lori Levin, Donna Gates, Alon Lavie, Fabio Pianesi, Dorcas Wallace, Taro Watanabe,
© Copyright 2013 ABBYY NLP PLATFORM FOR EU-LINGUAL DIGITAL SINGLE MARKET Alexander Rylov LTi Summit 2013 Confidential.
Lessons Learned Mokusei: Multilingual Conversational Interfaces Future Plans Explore language-independent approaches to speech understanding and generation.
Analysis for Spoken Language Translation Using Phrase-Level Parsing and Domain Action Classification Chad Langley Language Technologies Institute Carnegie.
PART OF SPEECH DEFINITION SYNONYM SENTENCE WORD ORIGIN NAME TEACHER AND DATE1 WORDS OF THE WEEK.
Speech-to-Speech MT Design and Engineering Alon Lavie and Lori Levin MT Class April
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
A Multi-Perspective Evaluation of the NESPOLE! Speech-to-Speech Translation System Alon Lavie, Carnegie Mellon University Florian Metze, University of.
Advanced MT Seminar Spring 2008 Instructors: Alon Lavie and Stephan Vogel.
If parents and teachers work together, students are more likely to be successful in school.
Weak AI: Can Machines Act Intelligently? Some things they can do: –Computer vision: face recognition from a large set –Robotics: autonomous (mostly) car.
Nespole!’s Experiment on Multimodality (Summer 2001) Erica Costantini (University of Trieste) Fabio Pianesi (ITC-irst, Trento) Susanne Burger (CMU)
MT with an Interlingua Lori Levin April 13, 2009.
Speech-to-Speech MT in NESPOLE! Design and Engineering Alon Lavie, Lori Levin Work with: Chad Langley, Tanja Schultz, Dorcas Wallace, Donna Gates, Kay.
NESPOLE! is a project which aims at providing a system capable of supporting communication in the field of e-commerce and e-service by resorting to automatic.
FlashNormalize: Programming by Examples for Text Normalization International Joint Conference on Artificial Intelligence, Buenos Aires 7/29/2015FlashNormalize1.
Designing a Machine Translation Project Lori Levin and Alon Lavie Language Technologies Institute Carnegie Mellon University CATANAL Planning Meeting Barrow,
LREC 2004, 26 May 2004, Lisbon 1 Multimodal Multilingual Resources in the Subtitling Process S.Piperidis, I.Demiros, P.Prokopidis, P.Vanroose, A. Hoethker,
AVID Trisha Hamilton 8 th Katie Ehlers 7th What is AVID? AVID is an academic, regularly scheduled elective class based on writing as a tool of learning,
Rapid Development in new languages Limited training data (6hrs) provided by NECTEC from 34 speakers, + 8 spks for development and test Romanization of.
1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.
金聲玉振 Taiwan Univ. & Academia Sinica 1 Spoken Dialogue in Information Retrieval Jia-lin Shen Oct. 22, 1998.
Copyright © 2013 by Educational Testing Service. All rights reserved. Evaluating Unsupervised Language Model Adaption Methods for Speaking Assessment ShaSha.
Introduction A field survey of Dutch language resources has been carried out within the framework of a project launched by the Dutch Language Union (Nederlandse.
Language Technologies Capability Demonstration Alon Lavie, Lori Levin, Alex Waibel Language Technologies Institute Carnegie Mellon University CATANAL Planning.
Q1.8 Monday, October 5 LA Agenda: Good Things Assignments: Introduction, counterargument, and conclusion for argument essay Raters Thought for the day:
SEESCOASEESCOA SEESCOA Meeting Activities of LUC 9 May 2003.
Parsing into the Interlingua Using Phrase-Level Grammars and Trainable Classifiers Alon Lavie, Chad Langley, Lori Levin, Dorcas Wallace,Donna Gates and.
1 Sobah Abbas Petersen Adjunct Associate Professor, NTNU Researcher, Sintef TDT4252 Modelling of Information Systems Advanced Course TDT4252,
Carnegie Mellon IRST-itc Balancing Expressiveness and Simplicity in an Interlingua for Task Based Dialogue Lori Levin, Donna Gates, Dorcas Wallace, Kay.
Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers Chad Langley, Alon Lavie, Lori Levin, Dorcas Wallace, Donna Gates and Kay.
PART OF SPEECH DEFINITION SYNONYM SENTENCE NAME TEACHER AND DATE1 WORDS OF THE WEEK.
PART OF SPEECH DEFINITION SYNONYM SENTENCE NAME TEACHER AND DATE1 WORDS OF THE WEEK.
LingWear Language Technology for the Information Warrior Alex Waibel, Lori Levin Alon Lavie, Robert Frederking Carnegie Mellon University.
Predicting and Adapting to Poor Speech Recognition in a Spoken Dialogue System Diane J. Litman AT&T Labs -- Research
Arnar Thor Jensson Koji Iwano Sadaoki Furui Tokyo Institute of Technology Development of a Speech Recognition System For Icelandic Using Machine Translated.
HI 5354 – Cognitive Engineering
CORPORATE LEARNING COURSE Seminar -- Best Practices
CORPORATE LEARNING COURSE Seminar -- Best Practices
Advanced NLP: Speech Research and Technologies
Introduction to Scala Unit 2
PROJ2: Building an ASR System
Course Outline Advanced Introduction Expert Systems Topics Problem
Idiap Research Institute University of Edinburgh
Open Source SUMMA Platform
Presentation transcript:

Recent Advances in Speech Translation Systems ESSLLI-2002 Tutorial Course August 12-16, 2002 Course Organizers: Alon Lavie – Carnegie Mellon University Lori Levin – Carnegie Mellon University Fabio Pianesi – ITC-irst

August 12, 2002ESSLLI-02 Speech Translation Tutorial 2 Course Objectives and Format Present the NESPOLE! System and Research Project as a “Case Study” for state-of-the-art speech translation systems: –Survey the underlying language technology involved, design considerations: components and architecture –Challenges, capabilities and limitations –Tasks and methodologies involved –Lessons learned Format: –Each day devoted to a different “theme” aspect(s) –Presentations by topic experts among senior researchers working on the NESPOLE! Project

August 12, 2002ESSLLI-02 Speech Translation Tutorial 3 Course Outline and Schedule Monday, 12 August: –Introduction and System Overview (Lavie and Pianesi) –Nespole! System Architecture (Lavie) –Data Collection in Nespole! (Costantini) Tuesday, 13 August: –Interchange Format (Levin) Wednesday, 14 August: Speech Recognition Challenges and Solutions: –ASR and Scalability (Vaufreydaz) –ASR and Robustness (Metze) –Multilinguality in Automatic Speech Recognition Systems (Schultz)

August 12, 2002ESSLLI-02 Speech Translation Tutorial 4 Course Outline and Schedule Thursday, 15 August: Analysis and Generation Approaches: –Trainable Analysis Approach (Lavie) –French Analysis and Generation Approaches (Blanchon) –Italian Generation (Pianta) Friday, 16 August: –Experimenting with Direct Approaches: Statistical Machine Translation (Vogel) –Evaluation in Nespole! (Lavie, Levin, Costantini) –Conclusion and Future Directions (Pianesi and Lavie)

August 12, 2002ESSLLI-02 Speech Translation Tutorial 5 Introduction Evolution of Speech Translation Systems

August 12, 2002ESSLLI-02 Speech Translation Tutorial 6 NESPOLE! System Overview Human-to-human spoken language translation for e-commerce application (e.g. travel & tourism) (Lavie et al., 2002) English, German, Italian, and French Translation via interlingua Translation servers for each language exchange interlingua to perform translation –Speech recognition (Speech  Text) –Analysis (Text  Interlingua) –Generation (Interlingua  Text) –Synthesis (Text  Speech)

August 12, 2002ESSLLI-02 Speech Translation Tutorial 7 Interchange Format Interchange Format (IF) is a shallow semantic interlingua for task-oriented domains Utterances represented as sequences of semantic dialog units (SDUs) IF representation consists of four parts –Speaker –Speech Act –Concepts –Arguments speaker : speech act +concept* +arguments* } Domain Action

August 12, 2002ESSLLI-02 Speech Translation Tutorial 8 Example “Hello. I would like to take a vacation in Val di Fiemme.” hello i would like to take a vacation in val di fiemme c:greeting (greeting=hello) c:give-information+disposition+trip (disposition=(who=i, desire), visit-spec=(identifiability=no, vacation), location=(place-name=val_di_fiemme)) ENG: Hello! I want to travel for a vacation at Val di Fiemme. ITA: Salve. Io vorrei una vacanza in Val di Fiemme.