Published by Marilynn Lyons. Modified over 9 years ago.
1
Automatic Transcript Generation
Helmer Strik
A²RT, Dept. of Language & Speech
University of Nijmegen
2
Problem & Solution
Problem:
–We have audio from radio & TV
–We need transcripts
Solution:
–ASR: Automatic Speech Recognition
3
History of ASR
It all started more than 100 years ago
4
History of ASR
1870 - Alexander Graham Bell: make speech visible, for the hearing impaired
1952 - AT&T Bell Laboratories: 1st ASR - ten English digits
2001 - ASR is ‘everywhere’:
–PC: dictation + ‘Command & Control’
–mobile phones (hands free)
–call-centers
–tap phone calls
5
First: A/D-conversion
Before ASR: A/D-conversion
Speech (analogue & continuous) -> Mic. + sound card -> WAV file (digital & discrete)
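The A/D step can be illustrated with a minimal sketch. It is not taken from the slides: the 440 Hz sine wave, the 16 kHz sample rate, the 16-bit depth, and the function name `sample_and_quantize` are all assumptions chosen for illustration. The sine wave stands in for the analogue signal; sampling makes time discrete and rounding makes amplitude discrete.

```python
import math

def sample_and_quantize(freq_hz, duration_s, sample_rate=16000, bits=16):
    """Sample a sine wave at discrete times and round each value to an integer level."""
    max_level = 2 ** (bits - 1) - 1  # 32767 for 16-bit audio
    n_samples = int(duration_s * sample_rate)
    samples = []
    for n in range(n_samples):
        t = n / sample_rate                      # discrete time step
        x = math.sin(2 * math.pi * freq_hz * t)  # stand-in for the analogue value, in [-1, 1]
        samples.append(round(x * max_level))     # discrete amplitude level
    return samples

# Hypothetical input: 10 ms of a 440 Hz tone
samples = sample_and_quantize(freq_hz=440, duration_s=0.01)
print(len(samples), min(samples), max(samples))
```

A real sound card does the same two things in hardware; the resulting integers are what a WAV file stores.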
6
What is ASR?
Answer: conversion from speech to text
X: unknown speech signal -> ASR -> W: a string of words
7
How: probabilistic approach
Find W that maximizes P(W|X)
P(W|X) = P(X|W) * P(W) / P(X)
P(W) - language model
P(X|W) - acoustic model
–Whole word models
–Phoneme models + Lexicon
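The decision rule can be sketched in a few lines. Because P(X) is the same for every candidate W, it can be dropped when searching for the maximum, so only P(X|W) * P(W) is compared. The candidate transcripts and all probability values below are made up for illustration, and `best_transcript` is a hypothetical helper, not part of any ASR toolkit.

```python
import math

def best_transcript(acoustic, language):
    """Return the W that maximizes log P(X|W) + log P(W).

    P(X) is constant over all candidates W, so it does not affect the argmax.
    """
    def score(w):
        return math.log(acoustic[w]) + math.log(language[w])
    return max(acoustic, key=score)

# Hypothetical toy values for one utterance X
acoustic = {"recognize speech": 0.0004, "wreck a nice beach": 0.0005}  # P(X|W)
language = {"recognize speech": 0.01, "wreck a nice beach": 0.0001}    # P(W)

best = best_transcript(acoustic, language)
print(best)
```

In this toy case the acoustic model slightly prefers the second candidate, but the language model makes the first one win overall. Working with log probabilities avoids numerical underflow when many small probabilities are multiplied.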
8
ASR
ASR =
–Phoneme models (HMMs) + Lexicon: P(X|W)
–Language model: P(W)
9
Training
HMMs & LMs are trained:
Speech + manual transcripts (lexicon) -> Training procedure -> ASR: HMMs (Hidden Markov Models), Language Models
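As an illustration of the language-model half of training, here is a minimal sketch that estimates bigram probabilities from manual transcripts by counting. The slides do not say which kind of language model is used; a bigram model without smoothing is an assumption made here for brevity, and the three transcripts are invented.

```python
from collections import Counter

def train_bigram_lm(transcripts):
    """Estimate P(word | previous word) by relative frequency (no smoothing)."""
    histories = Counter()
    bigrams = Counter()
    for line in transcripts:
        # <s> and </s> mark the start and end of each transcript
        words = ["<s>"] + line.lower().split() + ["</s>"]
        histories.update(words[:-1])               # every word that has a successor
        bigrams.update(zip(words[:-1], words[1:]))
    return {pair: count / histories[pair[0]] for pair, count in bigrams.items()}

# Hypothetical manual transcripts
transcripts = ["the news at nine", "the weather at nine", "the news today"]
lm = train_bigram_lm(transcripts)
print(lm[("the", "news")])
```

Training the HMMs works on the same principle (estimate parameters from speech paired with transcripts), but needs the audio and an iterative procedure, so it is not sketched here.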
10
Decoding
Automatic Transcript Generation:
X: unknown speech signal -> ASR -> W: the automatic transcripts
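The slides do not name a decoding algorithm. A common choice for HMM-based systems is the Viterbi algorithm, sketched below on a toy HMM as an assumption. The two states, the symbols "x" and "y", and all probabilities are invented; a real recognizer uses continuous acoustic features, log probabilities, and a far larger search space.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most probable state sequence for a discrete-observation HMM."""
    # best[t][s]: probability of the best path that ends in state s at time t
    best = [{s: start_p[s] * emit_p[s][observations[0]] for s in states}]
    back = [{}]  # back[t][s]: predecessor of s on that best path
    for t in range(1, len(observations)):
        best.append({})
        back.append({})
        for s in states:
            prev, p = max(
                ((r, best[t - 1][r] * trans_p[r][s]) for r in states),
                key=lambda item: item[1],
            )
            best[t][s] = p * emit_p[s][observations[t]]
            back[t][s] = prev
    # Trace back from the most probable final state
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    return path[::-1]

# Hypothetical toy model: two states, two observation symbols
states = ["a", "b"]
start_p = {"a": 0.9, "b": 0.1}
trans_p = {"a": {"a": 0.6, "b": 0.4}, "b": {"a": 0.1, "b": 0.9}}
emit_p = {"a": {"x": 0.8, "y": 0.2}, "b": {"x": 0.1, "y": 0.9}}

path = viterbi(["x", "x", "y", "y"], states, start_p, trans_p, emit_p)
print(path)
```

The function keeps, for each time step and state, only the best path so far, which is what makes the search tractable compared with scoring every possible state sequence.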
11
C-3PO - 6 million languages
12
MUMIS