Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval Johns Hopkins University Center of Language and Speech Processing Summer.

Slides:



Advertisements
Similar presentations
Jing-Shin Chang National Chi Nan University, IJCNLP-2013, Nagoya 2013/10/15 ACLCLP – Activities ( ) & Text Corpora.
Advertisements

Cross-Language Retrieval INST 734 Module 11 Doug Oard.
Multimedia Retrieval. Outline Audio Retrieval Spoken information Music Document Image Analysis and Retrieval Video Retrieval.
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan IEEE 2007 Min-Hsuan.
Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval Johns Hopkins University Summer Workshop 2000 Presented at the ANLP-NAACL.
Development of Automatic Speech Recognition and Synthesis Technologies to Support Chinese Learners of English: The CUHK Experience Helen Meng, Wai-Kit.
Combining Prosodic and Text Features for Segmentation of Mandarin Broadcast News Gina-Anne Levow University of Chicago SIGHAN July 25, 2004.
SPOKEN LANGUAGE SYSTEMS MIT Computer Science and Artificial Intelligence Laboratory Mitchell Peabody, Chao Wang, and Stephanie Seneff June 19, 2004 Lexical.
Issues in Pre- and Post-translation Document Expansion: Untranslatable Cognates and Missegmented Words Gina-Anne Levow University of Chicago July 7, 2003.
Context in Multilingual Tone and Pitch Accent Recognition Gina-Anne Levow University of Chicago September 7, 2005.
Inducing Information Extraction Systems for New Languages via Cross-Language Projection Ellen Riloff University of Utah Charles Schafer, David Yarowksy.
Reference Collections: Task Characteristics. TREC Collection Text REtrieval Conference (TREC) –sponsored by NIST and DARPA (1992-?) Comparing approaches.
The current status of Chinese-English EBMT research -where are we now Joy, Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.
Advance Information Retrieval Topics Hassan Bashiri.
Turn-taking in Mandarin Dialogue: Interactions of Tone and Intonation Gina-Anne Levow University of Chicago October 14, 2005.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
The Third Chinese Language Processing Bakeoff: Word Segmentation and Named Entity Recognition Gina-Anne Levow Fifth SIGHAN Workshop July 22, 2006.
MT Summit VIII, Language Technologies Institute School of Computer Science Carnegie Mellon University Pre-processing of Bilingual Corpora for Mandarin-English.
The Chinese University of Hong Kong Department of Computer Science and Engineering Lyu0202 Advanced Audio Information Retrieval System.
Improved Tone Modeling for Mandarin Broadcast News Speech Recognition Xin Lei 1, Manhung Siu 2, Mei-Yuh Hwang 1, Mari Ostendorf 1, Tan Lee 3 1 SSLI Lab,
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
1 The Web as a Parallel Corpus  Parallel corpora are useful  Training data for statistical MT  Lexical correspondences for cross-lingual IR  Early.
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 30, (2014) BERLIN CHEN, YI-WEN CHEN, KUAN-YU CHEN, HSIN-MIN WANG2 AND KUEN-TYNG YU Department of Computer.
Cross-Language Retrieval INST 734 Module 11 Doug Oard.
DIVINES – Speech Rec. and Intrinsic Variation W.S.May 20, 2006 Richard Rose DIVINES SRIV Workshop The Influence of Word Detection Variability on IR Performance.
Search is not only about the Web An Overview on Printed Documents Search and Patent Search Walid Magdy Centre for Next Generation Localisation School of.
The Evolution of Shared-Task Evaluation Douglas W. Oard College of Information Studies and UMIACS University of Maryland, College Park, USA December 4,
Mandarin-English Information (MEI) Johns Hopkins University Summer Workshop 2000 presented at the TDT-3 Workshop February 28, 2000 Helen Meng The Chinese.
August 21, 2002Szechenyi National Library Support for Multilingual Information Access Douglas W. Oard College of Information Studies and Institute for.
Automatic Spoken Document Processing for Retrieval and Browsing Zahra Ahmadi.
Nasal endings of Taiwan Mandarin: Production, perception, and linguistic change Student : Shu-Ping Huang ID No. : NA3C0004 Professor : Dr. Chung Chienjer.
An Integrated Approach for Arabic-English Named Entity Translation Hany Hassan IBM Cairo Technology Development Center Jeffrey Sorensen IBM T.J. Watson.
Automated Scoring of Picture- based Story Narration Swapna Somasundaran Chong Min Lee Martin Chodorow Xinhao Wang.
Translingual Topic Tracking with PRISE Gina-Anne Levow and Douglas W. Oard University of Maryland February 28, 2000.
World Languages Mandarin English Challenges in Mandarin Speech Recognition  Highly developed language model is required due to highly contextual nature.
Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval Johns Hopkins University Center of Language and Speech Processing Summer.
Information Retrieval and Web Search Cross Language Information Retrieval Instructor: Rada Mihalcea Class web page:
Enhanced Infrastructure for Creation & Collection of Translation Resources Zhiyi Song, Stephanie Strassel (speaker), Gary Krug, Kazuaki Maeda.
Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval Johns Hopkins University Center of Language and Speech Processing Summer.
IIIT Hyderabad’s CLIR experiments for FIRE-2008 Sethuramalingam S & Vasudeva Varma IIIT Hyderabad, India 1.
Rapid and Accurate Spoken Term Detection Michael Kleber BBN Technologies 15 December 2006.
Overview of the TDT-2003 Evaluation and Results Jonathan Fiscus NIST Gaithersburg, Maryland November 17-18, 2002.
Mandarin Beginner What we will do…  Introduce Chinese as a language and dialects  Introduce Pinyin (function, composing rules)
Automatic Identification and Classification of Words using Phonetic and Prosodic Features Vidya Mohan Center for Speech and Language Engineering The Johns.
Information Retrieval at NLC Jianfeng Gao NLC Group, Microsoft Research China.
Iterative Translation Disambiguation for Cross Language Information Retrieval Christof Monz and Bonnie J. Dorr Institute for Advanced Computer Studies.
Alignment of Bilingual Named Entities in Parallel Corpora Using Statistical Model Chun-Jen Lee Jason S. Chang Thomas C. Chuang AMTA 2004.
Subproject II: Robustness in Speech Recognition. Members (1/2) Hsiao-Chuan Wang (PI) National Tsing Hua University Jeih-Weih Hung (Co-PI) National Chi.
Improving Named Entity Translation Combining Phonetic and Semantic Similarities Fei Huang, Stephan Vogel, Alex Waibel Language Technologies Institute School.
From Text to Image: Generating Visual Query for Image Retrieval Wen-Cheng Lin, Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information.
Learning Phonetic Similarity for Matching Named Entity Translations and Mining New Translations Wai Lam Ruizhang Huang Pik-Shan Cheung Department of Systems.
Results of the 2000 Topic Detection and Tracking Evaluation in Mandarin and English Jonathan Fiscus and George Doddington.
Measuring How Good Your Search Engine Is. *. Information System Evaluation l Before 1993 evaluations were done using a few small, well-known corpora of.
Latent Topic Modeling of Word Vicinity Information for Speech Recognition Kuan-Yu Chen, Hsuan-Sheng Chiu, Berlin Chen ICASSP 2010 Hao-Chin Chang Department.
A Joint Source-Channel Model for Machine Transliteration Li Haizhou, Zhang Min, Su Jian Institute for Infocomm Research 21 Heng Mui Keng Terrace, Singapore.
Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,
Copyright © 2013 by Educational Testing Service. All rights reserved. Evaluating Unsupervised Language Model Adaption Methods for Speaking Assessment ShaSha.
Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach Yih-Cheng Chang Department of Computer Science and Information.
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
Keyword Translation Accuracy and Cross-Lingual Question Answering in Chinese and Japanese Teruko Mitamura Mengqiu Wang Hideki Shima Frank Lin In CMU EACL.
Multilingual Search Shibamouli Lahiri
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
STD Approach Two general approaches: word-based and phonetics-based Goal is to rapidly detect the presence of a term in a large audio corpus of heterogeneous.
ASSESSING SEARCH TERM STRENGTH IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone Institute for Signal and Information Processing, Temple University.
Cross-Dialectal Data Transferring for Gaussian Mixture Model Training in Arabic Speech Recognition Po-Sen Huang Mark Hasegawa-Johnson University of Illinois.
F. López-Ostenero, V. Peinado, V. Sama & F. Verdejo
Speaker : chia hua Authors : Long Qin, Ming Sun, Alexander Rudnicky
Helen Meng,' Sanjeev Khudanpur,2 Gina Levow,3 Douglas W
Rapidly Retargetable Translingual Detection
Translingual Knowledge Projection and Statistical Machine Translation
Presentation transcript:

Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval Johns Hopkins University Center of Language and Speech Processing Summer Workshop 2000 Opening Ceremony The MEI Team July 17, 2000

MEI Team Senior Members Students Helen Meng Chinese University of Hong Kong Erika Grams Advanced Analytic Tools Sanjeev Khudanpur Johns Hopkins University Gina-Anne Levow University of Maryland Douglas Oard University of Maryland Patrick Schone US Department of Defense Hsin-Min Wang Academia Sinica, Taiwan Berlin Chen National Taiwan University Wai-Kit Lo Chinese University of Hong Kong Karen Tang Princeton University Jianqiang Wang University of Maryland

Outline Motivation MEI project overview Research challenges System architecture

Motivation Monolingual speech retrieval applications are emerging, e.g. – source: Feb 2000 Internet-accessible Radio and Television Stations

Translingual Speech Retrieval Allow anyone to find information that is expressed in any spoken language

The Big Picture Search Refinements Speech-to-Speech Translation English Spoken Documents Retrieval Engine English Text Query English-to-Chinese Translation Mandarin News Broadcasts Mandarin Audio Indexing Retrieved Mandarin Spoken Documents

Related Work TREC Spoken Document Retrieval –Close coupling of recognition and retrieval TREC Cross-Language Retrieval –Close coupling of translation and retrieval TDT-3 Topic Tracking –Coupling recognition, translation and retrieval –Using speech recognition transcripts

Research Challenges Multi-scale audio indexing –Multiple feature sets capture more information Multi-scale translation –Lexicon and pronunciation are complementary Multi-scale retrieval –Combination of evidence can add robustness The MEI Project Closely couple recognition and translation –for the purpose of retrieval

Task and Corpus Query by Example English Example Newswire Stories Mandarin Audio Collection TDT Corpus Challenge: using an English story as the query, find related Mandarin audio documents

Multi-scale Mandarin Processing: Phonological Considerations Chinese is a syllable-based language Mandarin is the official Chinese dialect –~400 base syllables, 4 lexical tones + light tone Syllable structure (CG)V(X) –(CG): onset, optional, consonant+medial glide –V:nuclear vowel –X:coda, glide / alveolar nasal / velar nasal –~ 21 initials, 39 finals Circumvent the OOV problem

Multi-scale Mandarin Processing: Linguistic Considerations Characters (written) -> syllables (spoken) Degenerate mapping – /hang2/, /hang4/, /heng2/ or /xing2/ –/fu4 shu4/ (LDC’s CALLHOME lexicon) Tokenization / Segmentation –/zhe4 yi1 wan3 hui4 ru2 chang2 ju3 xing2/

Multi-scale Audio Indexing Initial/Final Preme/Core Final Phones /iang/ /ji/ /j/ /ang/ /i//a/ /ng/ /j/ Preme/Toneme, Initial/Tonal-Final syllable /jiang/, single-character word, part of a multi-char word

Previous Work on Audio Indexing Syllable lattice matching [Chen, Wang & Lee, 2000] Comparison of multi-scale acoustic models [Lo, Meng & Ching, 2000] Overlapping syllable n-grams [Meng et al., 1999] Skipped syllable pairs [Chen, Wang & Lee, 2000]

Word-scale –Phrase-based, e.g. “the White House” –Dictionary-based [Levow & Oard 00] –Lexical gaps: parallel corpora, comparable corpora Subword-scale –Named entities: transliteration by cross-lingual phonetic mapping –Northern Ireland /bei2 ai4 er3 lan2/ –Kosovo (/ke1-suo3-wo4/, /ke1-suo3-fo2/, /ke1-suo3- fu1/, /ke1-suo3-fu2/) Multi-scale Translation

Cross-Lingual Phonetic Mapping Named entity Jiang Zemin, Kosovo Syllabify Pinyin Spelling E.g. jiang ze min English Pronunciation Lookup or Letter-to-Phone Generation English Phones, e.g. k ao s ax v ow Cross-lingual Phonetic Mapping Chinese Phones, e.g. k e s u o w o Syllabification Chinese syllables, e.g. ke suo wo

Multi-scale Retrieval Word-scale exploits lexical knowledge –Enhances precision Subwords can achieve complete coverage –Enhances recall Combination of evidence may be best –If a good merging strategy can be found

Merging Strategies Loose coupling –Separate retrieval runs –Merge ranked lists [Voorhees 1995] Tight coupling –Unified indexing of words and subwords –Single ranked list –[Ng 2000]

Multi-scale Retrieval Techniques Subword-scale –Syllable lattice matching [Chen, Wang & Lee 2000] –Overlapping syllable n-grams [Meng et al. 1999] –Syllable confusion matrix [Meng et al. 1999] Word-scale –Structured queries [Pirkola 1998] –Structured translation [Sperer & Oard 2000] Robust Retrieval –speech recognition errors –translation / transliteration ambiguities

Research Plan “MEI: Mandarin-English Information”, Proceedings of the TDT Workshop, “Mandarin-English Information (MEI): Investigating Translingual Retrieval” Proceedings of the NAACL Workshop on Embedded Machine Translation, 2000

Named Entity Tagger Phrase tagging Unknown words and phrases English to Chinese translation dictionary Term Translation Spoken Mandarin document s Dragon Mandarin ASR Query Processing Document Processing Query to INQUERY Document to INQUERY Character n-gram generation Mandarin-English Information: Investigation Translingual Speech Retrieval Mandarin-English Information: Investigation Translingual Speech Retrieval Johns Hopkins University, Center for Language and Speech Processing, JHU/NSF Summer Workshop 2000 MEI Team :Helen MENG (CUHK), Berlin CHEN (National Taiwan University), Erika GRAMS (Advanced Analytic Tools), Sanjeev KHUDANPUR (JHU/CLSP), Gina-Anne LEVOW (University of Maryland), Wai-Kit LO (CUHK) Douglas OARD (University of Maryland), Patrick SCHONE (Department of Defense), Karen TANG (Princeton University), Hsin-Min WANG (Academia Sinica), Jianqiang WANG (University of Maryland) MEI Team :Helen MENG (CUHK), Berlin CHEN (National Taiwan University), Erika GRAMS (Advanced Analytic Tools), Sanjeev KHUDANPUR (JHU/CLSP), Gina-Anne LEVOW (University of Maryland), Wai-Kit LO (CUHK) Douglas OARD (University of Maryland), Patrick SCHONE (Department of Defense), Karen TANG (Princeton University), Hsin-Min WANG (Academia Sinica), Jianqiang WANG (University of Maryland) Word sequence Character n-gram sequence INQUERY Ranked List of Possibly Relevant Documents Translated words and phrases Relevance Assessments Figure of Merit Scoring Query Term Selectio n As of Sunday July 9, 2000 Word sequence Character n-gram sequence Segmented Chinese Text Input English text query

END