Audio-visual Speaker association Zhijie Shao Master of Computer Science Supervisor: Trent Lewis
Project Schedule Open MARYVoice Import ToolAusTalkA New VoiceEvaluation
Open MARY What is MARY? MARY – Modular Architecture for Research on speech sYnthesis Why is MARY?
Voice Import Tool Import new voices under the MARY environment. Two formats of files: 1.Wave files 2.Text files in MARY format Import Blizzard competition Data into MARY. (including English audiobook data and training data)
AusTalk AVSP requires large datasets. the largest-ever auditory-visual database of Australian speech HCSvLab Human Communication Science Virtual Laboratory A platform for eResearch in HCS
A New Voice Text Analysis Text Normalization Homonym Disambiguation Grapheme-to-Phoneme (Letter-to-Sound) Intonation Waveform Generation Unit Selection Diphones
Evaluation Emotion Quality intonation Consistent
Conclusion Final Object: Create a new voice Time schedule