Lessons Learned Mokusei: Multilingual Conversational Interfaces Future Plans Explore language-independent approaches to speech understanding and generation.

Slides:

Advertisements

Similar presentations

SPOKEN LANGUAGE SYSTEMS Spoken Conversational Interaction for Language Learning Stephanie Seneff, Chao Wang, and Julia Zhang Spoken Language Systems Group.

Advertisements

The Practical Value of Statistics for Sentence Generation: The Perspective of the Nitrogen System Irene Langkilde-Geary.

10. Lexicalized and Probabilistic Parsing -Speech and Language Processing- 발표자 : 정영임 발표일 :

J. Kunzmann, K. Choukri, E. Janke, A. Kießling, K. Knill, L. Lamel, T. Schultz, and S. Yamamoto Automatic Speech Recognition and Understanding ASRU, December.

Development of Automatic Speech Recognition and Synthesis Technologies to Support Chinese Learners of English: The CUHK Experience Helen Meng, Wai-Kit.

CKY Parsing Ling 571 Deep Processing Techniques for NLP January 12, 2011.

MULTI LINGUAL ISSUES IN SPEECH SYNTHESIS AND RECOGNITION IN INDIAN LANGUAGES NIXON PATEL Bhrigus Inc Multilingual & International Speech.

Languages & The Media, 4 Nov 2004, Berlin 1 Multimodal multilingual information processing for automatic subtitle generation: Resources, Methods and System.

Speech Translation on a PDA By: Santan Challa Instructor Dr. Christel Kemke.

CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:

Bootstrapping a Language- Independent Synthesizer Craig Olinsky Media Lab Europe / University College Dublin 15 January 2002.

Spoken Language Systems: The Unfinished Agenda Raj Reddy School of Computer Science Carnegie Mellon University Pittsburgh September 21, 2006 The entire.

1 Contents Introduction A Simple Compiler Scanning – Theory and Practice Grammars and Parsing LL(1) Parsing LR Parsing Lex and yacc Semantic Processing.

SPOKEN LANGUAGE SYSTEMS MIT Computer Science and Artificial Intelligence Laboratory Mitchell Peabody, Chao Wang, and Stephanie Seneff June 19, 2004 Lexical.

Equal-party Conversation System for Language Learning Chih-yu Chao (advisor: Stephanie Seneff) April 14 th, 2006 Dialogs on Dialogs Reading Group.

Non-native Speech Languages have different pronunciation spaces

High-quality Speech Translation for Language Learning Chao Wang and Stephanie Seneff June 24, 2004 Spoken Language Systems Group MIT Computer Science and.

Table-driven parsing Parsing performed by a finite state machine. Parsing algorithm is language-independent. FSM driven by table (s) generated automatically.

A Framework For Developing Conversational User Interfaces

Overview of Search Engines

Text-To-Speech System for Marathi Miss. Deepa V. Kadam Indian Institute of Technology, Bombay.

SCILL: Spoken Conversational Interaction for Language Learning

Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.

DIVINES – Speech Rec. and Intrinsic Variation W.S.May 20, 2006 Richard Rose DIVINES SRIV Workshop The Influence of Word Detection Variability on IR Performance.

Natural Language Understanding

Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.

C omputer S cience and A rtificial I ntelligence L aboratory Multilingual Conversational Systems SPEECH RECOGNITION LANGUAGE UNDERSTANDING LANGUAGE GENERATION.

Speech Recognition Final Project Resources

A Survey of ICASSP 2013 Language Model Department of Computer Science & Information Engineering National Taiwan Normal University 報告者：郝柏翰 2013/06/19.

Linguistic Representation of Finnish in the Medical Domain Spoken Language Translation System Marianne Santaholma, University of Geneva, TIM/ISSCO.

Spoken Dialogue Systems and the GALAXY Architecture 29 October 2000 Advanced Technology Laboratories 1 Federal Street A&E Building 2W Camden, New Jersey.

MIT 6.893; SMA 5508 Spring 2004 Larry Rudolph Lecture Introduction Speechbuilder Tutorial.

1 Chapter 5 LL (1) Grammars and Parsers. 2 Naming of parsing techniques The way to parse token sequence L: Leftmost R: Righmost Top-down  LL Bottom-up.

Experiments on Building Language Resources for Multi-Modal Dialogue Systems Goals identification of a methodology for adapting linguistic resources for.

May 2006CLINT-CS Verbmobil1 CLINT-CS Dialogue II Verbmobil.

Winter 2007SEG2101 Chapter 71 Chapter 7 Introduction to Languages and Compiler.

Natural Language Processing Rogelio Dávila Pérez Profesor – Investigador

THE BIG PICTURE Basic Assumptions Linguistics is the empirical science that studies language (or linguistic behavior) Linguistics proposes theories (models)

PETRA – the Personal Embedded Translation and Reading Assistant Werner Winiwarter University of Vienna InSTIL/ICALL Symposium 2004 June 17-19, 2004.

Understanding Natural Language

Machine Translation  Machine translation is of one of the earliest uses of AI  Two approaches:  Traditional approach using grammars, rewrite rules,

Natural Language Processing Daniele Quercia Fall, 2000.

Unit-1 Introduction Prepared by: Prof. Harish I Rathod

Approaches to Machine Translation CSC 5930 Machine Translation Fall 2012 Dr. Tom Way.

TDDD55- Compilers and Interpreters Lesson 1 Zeinab Ganjei Department of Computer and Information Science Linköping University.

16.0 Spoken Dialogues References: , Chapter 17 of Huang 2. “Conversational Interfaces: Advances and Challenges”, Proceedings of the IEEE,

L C SL C S SpeechBuilder: Facilitating Spoken Dialogue System Creation Eugene Weinstein Project Oxygen Core Team MIT Laboratory for Computer Science

ELIS-DSSP Sint-Pietersnieuwstraat 41 B-9000 Gent Recognition of foreign names spoken by native speakers Frederik Stouten & Jean-Pierre Martens Ghent University.

ICS 482: Natural language Processing Pre-introduction

Interlingua Annotation Owen Rambow Advaith Siddharthan Kathleen McKeown

Rapid Development in new languages Limited training data (6hrs) provided by NECTEC from 34 speakers, + 8 spks for development and test Romanization of.

BY KALP SHAH Sentence Recognizer. Sphinx4 Sphinx4 is the best and versatile recognition system. Sphinx4 is a speech recognition system which is written.

金聲玉振 Taiwan Univ. & Academia Sinica 1 Spoken Dialogue in Information Retrieval Jia-lin Shen Oct. 22, 1998.

L C S Spoken Language Systems Group Stephanie Seneff Spoken Language Systems Group MIT Laboratory for Computer Science January 13, 2000 Multilingual Conversational.

Behrooz ChitsazLorrie Apple Johnson Microsoft ResearchU.S. Department of Energy.

Recent Advances in Speech Translation Systems ESSLLI-2002 Tutorial Course August 12-16, 2002 Course Organizers: Alon Lavie – Carnegie Mellon University.

Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:

Message Source Linguistic Channel Articulatory Channel Acoustic Channel Observable: MessageWordsSounds Features Bayesian formulation for speech recognition:

By Kyle McCardle.  Issues with Natural Language  Basic Components  Syntax  The Earley Parser  Transition Network Parsers  Augmented Transition Networks.

Presented By Sharmin Sirajudeen S7 CS Reg No :

Arnar Thor Jensson Koji Iwano Sadaoki Furui Tokyo Institute of Technology Development of a Speech Recognition System For Icelandic Using Machine Translated.

G. Anushiya Rachel Project Officer

Korean version of GloVe Applying GloVe & word2vec model to Korean corpus speaker : 양희정 date :

Context-free grammars, derivation trees, and ambiguity

Text-To-Speech System for English

3.0 Map of Subject Areas.

PROJ2: Building an ASR System

Word embeddings (continued)

COP4620 – Programming Language Translators Dr. Manuel E. Bermudez

Artificial Intelligence 2004 Speech & Natural Language Processing

Presentation transcript:

Lessons Learned Mokusei: Multilingual Conversational Interfaces Future Plans Explore language-independent approaches to speech understanding and generation Port human-language technologies for English conversational interfaces to Japanese Use existing Jupiter domain as test case –A telephone-only conversational interface for weather information –More than 500 cities worldwide (~350 in US) –On-line information from four Web sites –Use the Galaxy client server architecture Speech Recognition (SUMMIT: Glass et al., ICSLP ‘96) –Lexicon: >2,000 words with phonemic pronunciations –Phonological modeling: *Japanese specific phonological rules, e.g., desu ka  /d e s k a/ *Japanese phonetic units mapped into English ones –Acoustic modeling: *Used English models to generate forced transcriptions utterances *Retrained acoustic models to create hybrid models –Language modeling: *Class n-gram using 60 word classes. trained on ~3,500 read & spontaneous sentences *Also exploring a class n-gram derived automatically from TINA Speech Synthesis –NTT Fluet text-to-speech system Note: Sample sentences from Japanese speakers can be played from PC S. Seneff, J. Glass, T.J. Hazen, J. Polifroni, and V. Zue MIT Laboratory for Computer Science Y. Minami NTT Cyberspace Laboratories Language as Interface Language Understanding (TINA: Seneff, Comp Ling, ‘92 ) –Japanese grammar contains >900 unique non-terminals –Translation file maps Japanese words to English equivalent –Produces same semantic frame as for English inputs –Left recursive structure of Japanese requires look-ahead to resolve role of content words *Parse each content word into structure labeled “object” *Drop off “object” after next particle, which defines role and position in hierarchy Language Generation (GENESIS, Glass et al., ICSLP ‘94) –Used English language generation tables as template –Modified ordering of constituents –Provided translation lexicon for words –Many language specific challenges, including constituent ordering, quantifier translation, and multiple meanings Language as Content Use the same internal representation for Japanese and English Update from Web sites and satellite feeds at frequent intervals Parse all data into semantic frames to capture meaning Scan frames for semantic content and prepare new relational database table entries English:Some thunderstorms may be accompanied by gusty winds and hail Japanese: clause: weather_event topic: precip_act, name: thunderstorm, num: pl quantifier: some pred: accompanied_by adverb: possibly topic: wind, num: pl, pred: gusty and: precip_act, name: hail weather wind hail rain/storm Frame indexed under weather, wind, rain, storm, and hail Our approach to developing multilingual interfaces appears feasible A top-down approach to parsing can be made effective for left-recursive languages Word order divergence between English and Japanese motivated a redesign of our language generation component Novel technique of generating a class n-gram language model using the NL component appears promising Involvement of Japanese researcher is essential Additional data collection from native Japanese speakers –Nearly 1000 sentences were collected in December Improvement of individual components –Vocabulary coverage, acoustic and language models –Parse coverage –Continued development of a more sophisticated language generation component Expansion of weather content for Japan Research Objectives