Document Expansion for Speech Retrieval (Singhal, Pereira) Teoman Toraman Çağrı Toraman Bilkent University, 2010
Problem Statement
  Speech File: news_today.wav
  Automatic (or Manual) Speech Recognition
  Reasonable Transcription File: news_today.rtf
Problem Statement
  Aboutness: Fatal train crash in Italy
  Query
  Indexing
  Results: D1, D2
Problem Statement
  Noisy / Dirty Sound File
  Automatic (or Manual) Speech Recognition
  Corrupted / Erroneous Transcription File
Problem Statement
  Same Query
  Corrupted / Erroneous Transcription File
  Indexing
  Results: D2 (Vocabulary Mismatch)
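The vocabulary mismatch on this slide can be illustrated with a small sketch. The transcripts below are invented for illustration (the slides do not show the contents of D1 and D2), and the boolean AND matching is an assumption chosen for simplicity, not the retrieval model used in the paper. `build_index` and `search` are hypothetical helper names.

```python
def build_index(docs):
    """Map each lowercase term to the set of document ids containing it."""
    index = {}
    for doc_id, text in docs.items():
        for term in set(text.lower().split()):
            index.setdefault(term, set()).add(doc_id)
    return index


def search(index, query):
    """Return ids of documents containing every query term (boolean AND)."""
    postings = [index.get(term, set()) for term in query.lower().split()]
    return sorted(set.intersection(*postings)) if postings else []


# Hypothetical clean transcripts; the wording is invented for this example.
clean = {
    "D1": "fatal train crash in italy kills several passengers",
    "D2": "italy train crash fatal investigation begins",
}

# Same documents, but D1's transcript now contains recognition errors.
noisy = dict(clean, D1="fate all rain crash in italy kills several passengers")

query = "fatal train crash"
print(search(build_index(clean), query))  # ['D1', 'D2']
print(search(build_index(noisy), query))  # ['D2']
```

With the clean transcripts the query finds both documents; once "fatal" and "train" are misrecognized in D1, the same query only reaches D2.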
Problem Statement
  Noisy / Dirty Sound File
  Automatic (or Manual) Speech Recognition
  Corrupted / Erroneous Transcription File
  Recognition Mistakes:
    Deletions
    Insertions
    Wrong term weighting
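One way to see deletions and insertions concretely is to align a reference transcript with the recognizer output at word level. The sketch below uses Python's standard `difflib.SequenceMatcher` for the alignment; the sentences are invented, `recognition_errors` is a hypothetical helper, and counting a substituted word as one deletion plus one insertion is an assumption made to stay within the categories listed on the slide.

```python
from difflib import SequenceMatcher


def recognition_errors(reference, hypothesis):
    """Return (deleted_words, inserted_words) between two transcripts."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    deletions, insertions = [], []
    matcher = SequenceMatcher(None, ref, hyp, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        # A 'replace' is treated as a deletion of the reference words
        # plus an insertion of the recognized words (an assumption).
        if tag in ("delete", "replace"):
            deletions.extend(ref[i1:i2])
        if tag in ("insert", "replace"):
            insertions.extend(hyp[j1:j2])
    return deletions, insertions


# Hypothetical reference and recognizer output.
deleted, inserted = recognition_errors(
    "fatal train crash in italy",
    "fatal crash in northern italy",
)
print(deleted)   # ['train']
print(inserted)  # ['northern']
```

Both kinds of error also distort term frequencies, which is one plausible source of the wrong term weighting named on the slide.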
Solution
  Corrupted / Erroneous
  Document Expansion
  Expanded
Solution: What is Document Expansion?
  Step 1) Corrupted / Erroneous transcription is matched against the RELATED CORPUS
  Step 2) 10 similar files ...
  Step 3) Reweighting & Adding New Terms
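The three steps can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: raw term counts and cosine similarity stand in for the real weighting scheme, the reweighting is a Rocchio-style combination whose `alpha`, `beta`, and `max_new_terms` values are hypothetical, and the corpus sentences are invented. Only the default of 10 similar files comes from the slide.

```python
import math
from collections import Counter


def tf_vector(text):
    """Raw term-frequency vector of a text (an assumed, simplistic weighting)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def expand_document(transcript, related_corpus, k=10,
                    alpha=1.0, beta=0.5, max_new_terms=5):
    """Expand an erroneous transcript using its nearest corpus neighbours."""
    doc = tf_vector(transcript)

    # Step 1: use the transcript itself as a query on the related corpus.
    scored = sorted(((cosine(doc, tf_vector(d)), d) for d in related_corpus),
                    key=lambda pair: pair[0], reverse=True)

    # Step 2: keep the k most similar files (10 on the slide).
    neighbours = [tf_vector(d) for score, d in scored[:k] if score > 0]

    # Step 3: reweight existing terms and add new ones (Rocchio-style).
    centroid = Counter()
    for vec in neighbours:
        for term, weight in vec.items():
            centroid[term] += weight / len(neighbours)
    expanded = {t: alpha * w + beta * centroid.get(t, 0.0)
                for t, w in doc.items()}
    new_terms = sorted((t for t in centroid if t not in doc),
                       key=lambda t: centroid[t], reverse=True)[:max_new_terms]
    for term in new_terms:
        expanded[term] = beta * centroid[term]
    return expanded


# Hypothetical erroneous transcript and related corpus.
transcript = "fate all rain crash in italy"
related = [
    "fatal train crash in italy kills passengers",
    "train crash in italy under investigation",
    "stock markets rally in asia",
]
expanded = expand_document(transcript, related, k=2)
print("train" in expanded, "fatal" in expanded)  # True True
```

In this toy run the misrecognized transcript regains "train" and "fatal" from its two nearest neighbours, so a query containing those words can match it again, while the unrelated finance document contributes nothing.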
Experiments & Results
Experiments & Results
  10-15% loss
  20-25% loss