Ewald van der Westhuizen Digital signal processing group

Slides:



Advertisements
Similar presentations
Many studies suggest that the parser makes predictions for upcoming linguistic structure (i.e., ‘look-ahead’), but it remains controversial how much (empty)
Advertisements

Improved ASR in noise using harmonic decomposition Introduction Pitch-Scaled Harmonic Filter Recognition Experiments Results Conclusion aperiodic contribution.
MAI Internship April-May MAI Internship 2002 Slide 2 of 14 What? The AST Project promotes development of speech technology for official languages.
Acoustic Model Adaptation Based On Pronunciation Variability Analysis For Non-Native Speech Recognition Yoo Rhee Oh, Jae Sam Yoon, and Hong Kook Kim Dept.
Spoken Vs Written Language. Introduction Languages are first spoken, then written, and then an understanding.
PHONEXIA Can I have it in writing?. Discuss and share your answers to the following questions: 1.When you have English lessons listening to spoken English,
Languages & The Media, 4 Nov 2004, Berlin 1 Multimodal multilingual information processing for automatic subtitle generation: Resources, Methods and System.
Comments on Guillaume Pitel: “Using bilingual LSA for FrameNet annotation of French text from generic resources” Gerd Fliedner Computational Linguistics.
Modern speech synthesis: communication aid personalisation Sarah Creer Stuart Cunningham Phil Green Clinical Applications of Speech Technology University.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Basic types of sound  Realistic sound – derived from actual sources (footsteps, voices, cars, etc.)  Synthetic sound – invented and have no counterpart.
Track: Speech Technology Kishore Prahallad Assistant Professor, IIIT-Hyderabad 1Winter School, 2010, IIIT-H.
Acoustic and Linguistic Characterization of Spontaneous Speech Masanobu Nakamura, Koji Iwano, and Sadaoki Furui Department of Computer Science Tokyo Institute.
CC 2007, 2011 attrbution - R.B. Allen Text and Text Processing.
What is discourse analysis?
MOOC on M4D 2013 S PEECH T ECHNOLOGY FOR M OBILE P HONES Rajesh Hegde Indian Institute of Technology Kanpur Commonwealth of Learning Vancouver.
Speech and Language Processing
Utterance Verification for Spontaneous Mandarin Speech Keyword Spotting Liu Xin, BinXi Wang Presenter: Kai-Wun Shih No.306, P.O. Box 1001,ZhengZhou,450002,
LISTENING & SPEAKING SESSION 2 / AUGUST TIPS FOR THE LISTENING & SPEAKING SECTION 1. Use the resources in your community to practice listening to.
Graphophonemic System – Phonics
A brief overview of Speech Recognition and Spoken Language Processing Advanced NLP Guest Lecture August 31 Andrew Rosenberg.
Thoughts on Validation hussein suleman uct cs honours 2007.
1 An Assessment of a Speech-Based Programming Environment Andrew Begel Microsoft Research (formerly UC Berkeley)
Quantitative Text Analysis Content and Interaction Analyses.
DIALOG SYSTEMS FOR AUTOMOTIVE ENVIRONMENTS Presenter: Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi.
Recognition of Speech Using Representation in High-Dimensional Spaces University of Washington, Seattle, WA AT&T Labs (Retd), Florham Park, NJ Bishnu Atal.
1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech.
Windows Movie Maker DAMS Tech Training August 2009.
Dirk Van CompernolleAtranos Workshop, Leuven 12 April 2002 Automatic Transcription of Natural Speech - A Broader Perspective – Dirk Van Compernolle ESAT.
Engineering Students Opening Meeting. Montgomery College  Analyzing speech signals What did he say? Who is speaking Now?  Synthesizing speech  Improving.
ENGLISH PROFICIENCY 3 BIU 2032 UNIT 1 : LISTENING DRAWING INFERENCES.
S PEECH T ECHNOLOGY Answers to some Questions. S PEECH T ECHNOLOGY WHAT IS SPEECH TECHNOLOGY ABOUT ?? SPEECH TECHNOLOGY IS ABOUT PROCESSING HUMAN SPEECH.
Introduction Part I Speech Representation, Models and Analysis Part II Speech Recognition Part III Speech Synthesis Part IV Speech Coding Part V Frontier.
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
Pragmatics and Text Analysis Chapter 6.  concerned with the how meaning is communicated by the speaker (writer) and interpreted by the listener (reader)
Copyright © 2013 by Educational Testing Service. All rights reserved. Evaluating Unsupervised Language Model Adaption Methods for Speaking Assessment ShaSha.
1 Preschool English Learners Principles and Practices to Promote Language, Literacy, and Learning A Resource Guide, Second Edition Published by the California.
Natural conversation “When we investigate how dialogues actually work, as found in recordings of natural speech, we are often in for a surprise. We are.
PHASE-BASED DUAL-MICROPHONE SPEECH ENHANCEMENT USING A PRIOR SPEECH MODEL Guangji Shi, M.A.Sc. Ph.D. Candidate University of Toronto Research Supervisor:
Copyright © Texas Education Agency, All rights reserved. Images and other multimedia content used with permission. Principles of AAVTC AV Production.
Research Paper Alternatives: Using Digital Storytelling in the Photostory Software to Enrich the Curriculum Rebecca Foster-Faith.
PREPARED BY MANOJ TALUKDAR MSC 4 TH SEM ROLL-NO 05 GUKC-2012 IN THE GUIDENCE OF DR. SANJIB KR KALITA.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
Language Identification and Part-of-Speech Tagging
BEC Business English Communication
SPEECH TECHNOLOGY An Overview Gopala Krishna. A
Jonatas Wehrmann, Willian Becker, Henry E. L. Cagnini, and Rodrigo C
Multiple linguistic competencies
The art of subtitling within the European institutions
ARTIFICIAL NEURAL NETWORKS
Artificial Intelligence for Speech Recognition
Course Projects Speech Recognition Spring 1386
Why Study Spoken Language?
--Mengxue Zhang, Qingyang Li
Automatic Speech Recognition
HUMAN LANGUAGE TECHNOLOGY: From Bits to Blogs
Why Study Spoken Language?
Automating Early Assessment of Academic Standards for Very Young Native and Non-Native Speakers of American English better known as The TBALL Project.
Maties Machine Learning meeting Emre Yılmaz
Sfax University, Tunisia
Advanced NLP: Speech Research and Technologies
The art of subtitling within the European institutions
Language and Community
عمادة التعلم الإلكتروني والتعليم عن بعد
Multiple linguistic competencies
Artificial Intelligence 2004 Speech & Natural Language Processing
Code-Switching The particular dialect or language that a person chooses to use on any occasion is a code, a system used for communication between two or.
Da-Rong Liu, Kuan-Yu Chen, Hung-Yi Lee, Lin-shan Lee
Huawei CBG AI Challenges
Emre Yılmaz, Henk van den Heuvel and David A. van Leeuwen
Presentation transcript:

Automatic speech recognition of code-switched speech for South African languages Ewald van der Westhuizen Digital signal processing group DSP lab, Rm E355, E&E Engineering Stellenbosch University

Introduction Automatic Speech Recognition – human speech to text ASR systems suited for monolingual speech Code-switching – multilingual speakers speak more than one language in a conversation or utterance Code-switching degrades ASR performance

Example Speech in South African soap operas Example video: Setswana, Zulu and English

Example Speech is natural, conversational, fast, emotion, accented Code-switching is spontaneous Filler words and hesitations Noisy environment (door, footsteps, paper rustle) Speech overlap

Modelling code-switched language Code-switching examples are sparse Synthesise artificial examples of code-switching Word embeddings Represent words as vectors in multidimensional space Capture semantic and syntactic relationships of words Synthesised examples enhance training data used for language modelling

Querying word embedding models Trigger i- ama- Query word album advertisers sim.wrd cos.scr Result words song 0.87 promotors 0.77 movie 0.81 creatives 0.73 film employers 0.72 soundtrack sponsors series 0.74 fans 0.70 i- song, i- movie ama- promotors, ama- creatives

Code-switching Inter-sentence Intra-sentence Intra-word Example: Insertional CS Alternational CS Intra-word Example: You’ve got no idea how vinnig I’ve been slaan-ing this bymekaar.