SPEECH SYNTHESIS --AusTalk Zhijie Shao Master of Computer Science Supervisor: Trent Lewis
Application brain-implanted-speech-sythesizer-revealed-new-paper
Voice Import ToolBlizzard DataAusTalkModificationEvaluation and Conclusion Project Procedure
MARY (Modular Architecture for Research on speech sYnthesis) is the German text-to-speech system. MARY
Voice Import Tools being provided by MARY contains a set of Voice components and helps users to build new voices under the MARY Environment. MARY— Voice Import Tool
Unit Selection Synthesis
HMM- Based Synthesis
HMM- based Synthesis HMM: Hidden Markov model An introduction to HMM-based speech Synthesis by Junichi Yamagishi
Blizzard Data The speaker is known as ‘Nancy’ and is a native speaker of US English, professional female voice talent, voice coach, and singer hours of data was made available Unit Selection Voice: HMM-based Voice: Unit Selection Voice: Hi, I am Jacky, welcome to my presentation. Hi, Jacky again, welcome to my presentation.
AusTalk Data “7. Sentences (8mns) A set of 59 sentences is presented one at a time on the screen.” from BigASC-RA-Manual
Comparison Blizzard 59 VS AusTalk 59 Synthesis Blizzard Unit Selection: Blizzard HMM: AusTalk HMM: AusTalk Unit Selection:
Comparison Continue Blizzard Unit Selection: Blizzard HMM: AusTalk HMM: AusTalk Unit Selection: Welcome to the speech synthesis
Austalk vs Blizzard (1) Phoneme Alignment (2) Quality of wav files (3) Boundary of utterance Comparison
Evaluation and Conclusion Modification and Evaluation Create a quality Aussie voice Further Research