SPEECH SYNTHESIS --AusTalk Zhijie Shao Master of Computer Science Supervisor: Trent Lewis.

Slides:



Advertisements
Similar presentations
1 Using the HTK speech recogniser to analyse prosody in a corpus of German spoken learners English Toshifumi Oba, Eric Atwell University of Leeds, School.
Advertisements

Presented by Erin Palmer. Speech processing is widely used today Can you think of some examples? Phone dialog systems (bank, Amtrak) Computers dictation.
Entropy and Dynamism Criteria for Voice Quality Classification Applications Authors: Peter D. Kukharchik, Igor E. Kheidorov, Hanna M. Lukashevich, Denis.
Acoustic Model Adaptation Based On Pronunciation Variability Analysis For Non-Native Speech Recognition Yoo Rhee Oh, Jae Sam Yoon, and Hong Kook Kim Dept.
Pitch Prediction From MFCC Vectors for Speech Reconstruction Xu shao and Ben Milner School of Computing Sciences, University of East Anglia, UK Presented.
Outlines  Objectives  Study of Thai tones  Construction of contextual factors  Design of decision-tree structures  Design of context clustering.
Facial expression as an input annotation modality for affective speech-to-speech translation Éva Székely, Zeeshan Ahmed, Ingmar Steiner, Julie Carson-Berndsen.
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
This is an audio presentation. Please turn on your computer speakers. Press to start the presentation.
Speech Translation on a PDA By: Santan Challa Instructor Dr. Christel Kemke.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Profile-profile alignment using hidden Markov models Wing Wong.
Oytun Turk and Levent M.Arslan Subband Based Voice Conversion SESTEK Inc., R&D Dept. Istanbul, Turkey Bogazici University, Electrical-Electronics Eng.
Visual Speech Recognition Using Hidden Markov Models Kofi A. Boakye CS280 Course Project.
Modern speech synthesis: communication aid personalisation Sarah Creer Stuart Cunningham Phil Green Clinical Applications of Speech Technology University.
Authors: Anastasis Kounoudes, Anixi Antonakoudi, Vasilis Kekatos
Sunee Holland University of South Australia School of Computer and Information Science Supervisor: Dr G Stewart Von Itzstein.
Touching is Good Game Design for the Nintendo DS Nick Balogh Computer Science Senior Project Union College Prof. Burns.
Building High Quality Databases for Minority Languages such as Galician F. Campillo, D. Braga, A.B. Mourín, Carmen García-Mateo, P. Silva, M. Sales Dias,
May 20, 2006SRIV2006, Toulouse, France1 Acoustic Modeling of Accented English Speech for Large-Vocabulary Speech Recognition ATR Spoken Language Communication.
Speech Recognition Final Project Resources
H A N G Thank you. Integrating Computer-based Multimedia Instructional Design into Teaching International English Phonetic Symbols Mingzhu Qiu Ontario.
Segmental factors in language proficiency: Velarization degree as a signature of pronunciation talent Henrike Baumotte and Grzegorz Dogil {henrike.baumotte,
Utterance Verification for Spontaneous Mandarin Speech Keyword Spotting Liu Xin, BinXi Wang Presenter: Kai-Wun Shih No.306, P.O. Box 1001,ZhengZhou,450002,
CS280 – Introduction to Computing Welcome to CS 280A Introduction to Computer Science.
Prepared by: Waleed Mohamed Azmy Under Supervision:
HMM - Basics.
International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.
Copyright 2007, Toshiba Corporation. How (not) to Select Your Voice Corpus: Random Selection vs. Phonologically Balanced Tanya Lambert, Norbert Braunschweiler,
The role of prosody in dialect synthesis and authentication Kyuchul Yoon Division of English Kyungnam University Spring 2008 Joint Conference of KSPS.
Synthesis of Child Speech With HMM Adaptation and Voice Conversion Oliver Watts, Junichi Yamagishi, Member, IEEE, Simon King, Senior Member, IEEE, and.
Effects of Two Advance Organizers on Listening Comprehension in Video Viewing— Pictorial Contextual Cues versus Verbal Contextual Keys Spooky Chang July.
Vergina: A Modern Greek Speech Database for Speech Synthesis Alexandros Lazaridis Theodoros Kostoulas Todor Ganchev Iosif Mporas Nikos Fakotakis Artificial.
A Multimedia English Learning System Using HMMs to Improve Phonemic Awareness for English Learning Yen-Shou Lai, Hung-Hsu Tsai and Pao-Ta Yu Chun-Yu Chen.
Building a sentential model for automatic prosody evaluation Kyuchul Yoon School of English Language & Literature Yeungnam University Korea.
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
Q : Is this principle widely used in America, Japan, Korea?
Forschungszentrum Telekommunikation Wien [Telecommunications Research Center Vienna] Michael Pucher Speech Synthesis Overview.
22CS 338: Graphical User Interfaces. Dario Salvucci, Drexel University. Lecture 10: Advanced Input.
Evaluating prosody prediction in synthesis with respect to Modern Greek prenuclear accents Elisabeth Chorianopoulou MSc in Speech and Language Processing.
1 Hidden Markov Model 報告人:鄒昇龍. 2 Outline Introduction to HMM Activity of HMM Problem and Solution Conclusion Reference.
Korean Phoneme Discrimination Ben Lickly Motivation Certain Korean phonemes are very difficult for English speakers to distinguish, such as ㅅ and ㅆ.
How to Use Media Master to record your voice in the ESL Lab By Marsha Chan.
Audio-visual Speaker association Zhijie Shao Master of Computer Science Supervisor: Trent Lewis.
HMM-Based Synthesis of Creaky Voice
Audient: An Acoustic Search Engine By Ted Leath Supervisor: Prof. Paul Mc Kevitt School of Computing and Intelligent Systems Faculty of Engineering University.
Controlling Computer Using Speech Recognition (CCSR) Creative Masters Group Supervisor : Dr: Mounira Taileb.
HMM training strategy for incremental speech synthesis.
Speech Communication Lab, State University of New York at Binghamton Dimensionality Reduction Methods for HMM Phonetic Recognition Hongbing Hu, Stephen.
Project Demonstration Template Computer Science University of Birmingham.
1 CS 552/652 Speech Recognition with Hidden Markov Models Winter 2011 Oregon Health & Science University Center for Spoken Language Understanding John-Paul.
Performance Comparison of Speaker and Emotion Recognition
HMM-Based Speech Synthesis Erica Cooper CS4706 Spring 2011.
Automatic Pronunciation Scoring of Specific Phone Segments for Language Instruction EuroSpeech 1997 Authors: Y. Kim, H. Franco, L. Neumeyer Presenter:
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
By: Nicole Cappella. Why I chose Speech Recognition  Always interested me  Dr. Phil Show Manti Teo Girlfriend Hoax  Three separate voice analysts proved.
2014 Development of a Text-to-Speech Synthesis System for Yorùbá Language Olúòkun Adédayọ̀ Tolulope Department of Computer Science.
G. Anushiya Rachel Project Officer
Audio to Score Alignment for Educational Software
Mr. Darko Pekar, Speech Morphing Inc.
Introduction Intro Problem Materials Hypothesis Procedure Results
Statistical Models for Automatic Speech Recognition
Computational NeuroEngineering Lab
School Shop.
Unit One Introduction.
PatternHunter: faster and more sensitive homology search
Visual Recognition of American Sign Language Using Hidden Markov Models 문현구 문현구.
I am a student. We are students. You are a student. You are students. He is a student. She is a student. It is a student. They are students.
2017 APSIPA A Study on Landmark Detection Based on CTC and Its Application to Pronunciation Error Detection Chuanying Niu1, Jinsong Zhang1, Xuesong Yang2.
1-P-30 Speech-to-Speech Translation using Dual Learning and Prosody Conversion Zhaojie Luo, Yoichi Takashima, Tetsuya Takiguchi, and Yasuo Ariki (Kobe.
Presentation transcript:

SPEECH SYNTHESIS --AusTalk Zhijie Shao Master of Computer Science Supervisor: Trent Lewis

Application brain-implanted-speech-sythesizer-revealed-new-paper

Voice Import ToolBlizzard DataAusTalkModificationEvaluation and Conclusion Project Procedure

MARY (Modular Architecture for Research on speech sYnthesis) is the German text-to-speech system. MARY

Voice Import Tools being provided by MARY contains a set of Voice components and helps users to build new voices under the MARY Environment. MARY— Voice Import Tool

Unit Selection Synthesis

HMM- Based Synthesis

HMM- based Synthesis HMM: Hidden Markov model An introduction to HMM-based speech Synthesis by Junichi Yamagishi

Blizzard Data The speaker is known as ‘Nancy’ and is a native speaker of US English, professional female voice talent, voice coach, and singer hours of data was made available Unit Selection Voice: HMM-based Voice: Unit Selection Voice: Hi, I am Jacky, welcome to my presentation. Hi, Jacky again, welcome to my presentation.

AusTalk Data “7. Sentences (8mns) A set of 59 sentences is presented one at a time on the screen.” from BigASC-RA-Manual

Comparison Blizzard 59 VS AusTalk 59 Synthesis Blizzard Unit Selection: Blizzard HMM: AusTalk HMM: AusTalk Unit Selection:

Comparison Continue Blizzard Unit Selection: Blizzard HMM: AusTalk HMM: AusTalk Unit Selection: Welcome to the speech synthesis

Austalk vs Blizzard (1) Phoneme Alignment (2) Quality of wav files (3) Boundary of utterance Comparison

Evaluation and Conclusion Modification and Evaluation Create a quality Aussie voice Further Research