Presentation is loading. Please wait.

Presentation is loading. Please wait.

Automatic Transcript Generation Helmer Strik A 2 RT Dept. of Language & Speech University of Nijmegen.

Similar presentations


Presentation on theme: "Automatic Transcript Generation Helmer Strik A 2 RT Dept. of Language & Speech University of Nijmegen."— Presentation transcript:

1 Automatic Transcript Generation Helmer Strik A 2 RT Dept. of Language & Speech University of Nijmegen

2 Problem & Solution Problem: –We have Audio from radio & TV –We need Transcripts Solution ASR: Automatic Speech Recognition

3 History of ASR It all started more than 100 years ago

4 History of ASR 1870 - Alexander Graham Bell: Make speech visible, for the hearing impaired 1952 - AT&T Bell Laboratories: 1st ASR - ten English digits 2001 - ASR is ‘everywhere’ : –PC: dictation + ‘Command & Control’ –mobile phones (hands free) –call-centers –tap phone calls

5 First: A/D-conversion Mic. + sound card Before ASR: A/D-conversion WAV file- digital & discrete Speech- analogue & continuous

6 What is ASR? Answer: conversion from speech to text ASR W: a string of words X: unknown speech signal

7 How: probabilistic approach Find W that max. P(W|X) P(W|X) = P(X|W) * P(W) / P(X) P(W) - language model P(X|W) - acoustic model –Whole word models –Phoneme models + Lexicon

8 ASR ASR = Phoneme models (HMMs) Lexicon Language model P(X|W) P(W)

9 Training HMMs & LMs are trained: Training procedure ASR: HMMs (Hidden Markov Models) Language Models Speech + manual transcripts (lexicon)

10 Decoding Automatic Transcript Generation: ASR W: the automatic transcripts X: unknown speech signal

11 C-3PO - 6 million languages

12 MUMIS


Download ppt "Automatic Transcript Generation Helmer Strik A 2 RT Dept. of Language & Speech University of Nijmegen."

Similar presentations


Ads by Google