An Introduction to S3ML Beijing InfoQuick SinoVoice Speech Technology Corp. CHEN Ming, LV Shinan, LI Xiulin.

Slides:



Advertisements
Similar presentations
.NET Technology. Introduction Overview of.NET What.NET means for Developers, Users and Businesses Two.NET Research Projects:.NET Generics AsmL.
Advertisements

TEL: FAX: WEBSITE: © 2002 iFLYTEK. All rights reserved. This presentation is for informational.
The eXtensible Markup Language (XML) An Applied Tutorial Kevin Thomas.
Speech Synthesis Markup Language V1.0 (SSML) W3C Recommendation on September 7, 2004 SSML is an XML application designed to control aspects of synthesized.
Speech Synthesis Markup Language SSML. Introduced in September 2004 XML based Assists the generation of synthetic speech Specifies the way speech is outputted.
Applying the Pronunciation Lexicon Specification to ASR & TTS 1 Patrizio Bergallo 1 Monday, August 20, 2007 SpeechTEK ASTS - Advances in Text-to-Speech.
1 SSML The Internationalization of the W3C Speech Synthesis Markup Language SpeechTek 2007 – C102 – Daniel C. Burnett.
SSML extensions for multi-language usage Davide Bonardo W3C Workshop on Internationalizing SSML Crete, May 2006.
XML for Information Management – Day 2 Airi Salminen University of Erlangen-Nuremberg Computational Linguistics Instructor: Professor Airi Salminen
XML for Information Management – Day 2 Airi Salminen University of Erlangen-Nuremberg Computational Linguistics Instructor: Professor Airi Salminen
Unit 5 Talking About the Weather Unit 5 Talking About the Weather.
Create Photo-Realistic Talking Face Changbo Hu * This work was done during visiting Microsoft Research China with Baining Guo and Bo Zhang.
Extensible Markup Language XML MIS 520 – Database Theory Fall 2001 (Day) Lecture 14.
Outline IS400: Development of Business Applications on the Internet Fall 2004 Instructor: Dr. Boris Jukic XML.
XML October 24, Unit 6. What is XML? Stands for eXtensible Markup Language It is a markup language, like HTML But, –XML is designed to markup data –HTML.
11 Data Interface Standard for Accounting Software Project Progress Report China National Audit Office June, 2015.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
CONFIDENTIAL | © Nuance Communications, Inc. All rights reserved. ENTERPRISE SOLUTIONS 1 Parteek Singh.
Introduction to XML: Part I By Sandeep Jangity CS 157B, Section 2 Dr. Lee.
Position Paper for W3C Workshop on Internationalizing SSML The Usage of Part-Of-Speech for Resolving Multiple Pronunciations in SSML Myoung-Wan.
Speech Synthesis Markup Language -----Aim at Extension Dr. Jianhua Tao National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese.
1 SSML Extensions for TTS in Indian Languages II workshop on Internationalizing SSML May 2006, Greece Nixon Patel and Kishore Prahallad Bhrigus.
Inducing Communication: Strategies and Assessment The Use of Voki in The Chinese Classroom 李麗君 Caroline Lee-Thompson, Ph.D. Park Tudor School (K-12 Independent.
CLUE Framework IETF 84 July 30 – Aug 3, 2012 Mark Duckworth Allyn Romanow Brian Baldino Andy Pepperell.
Toshiba (China) R&D Center LOU Xiaoyan, LI Jian Research and Development Center, Toshiba China Suggestions on Tone and Word Boundary of Mandarin for SSML.
JEITA Speech Group1 Issues of SSML in Japanese Wataru IMATAKE (ANIMO LIMITED) Makoto AKABANE (Sony Computer Entertainment Inc.) Kazuyo TANAKA (Tsukuba.
Public 1 © 2005 Nokia V1-Filename.ppt / yyyy-mm-dd / Initials Development Challenges of Multilingual Text-to-Speech Systems Kimmo Pärssinen
How IPA is Used in SSML and PLS Paolo Baggia, Loquendo Wed. August 9 th, 2006.
Introduction to XML cs3505. References –I got most of this presentation from this site –O’reilly tutorials.
Conversational Applications Workshop Introduction Jim Larson.
XML The Overview. Three Key Questions What is XML? What Problems does it solve? Where and how is it used?
XML Language Family Detailed Examples Most information contained in these slide comes from: These slides are intended.
PrepTalk a Preprocessor for Talking book production Ted van der Togt, Dedicon, Amsterdam.
1 W3C Workshop on Internationalizing SSML SSML Extension for Korean Workshop : 2005/11/02 (Wed) Sang-Jin Kim
1 Web Developer Foundations: Using XHTML Chapter 2 Key Concepts.
Chapter 7. BEAT: the Behavior Expression Animation Toolkit
SSML 1.1: The Internationalization of SSML Daniel C. Burnett August 9, 2006.
Spoken dialog for e-learning supported by domain ontologies Dario Bianchi, Monica Mordonini and Agostino Poggi Dipartimento di Ingegneria dell’Informazione.
Language Resources College 11 th ECESS meeting 11th ECESS Meeting College Language Resources 0. Minutes making for College ‘Language Resources’ 1. Goal.
Exploring XML-based Technologies and Procedures for Quality Evaluation from a Real-life Case Perspective Folkert de Vriend 1 & Giulio Maltese 2 1 Speech.
Designing the Team-oriented Ontology Management System with Ajax Technology Ze Li, Johannes Keizer, Zhong Wang, Margherita Sini, Yelu Zheng The Institute.
Overview of CSSML Yan Jun, Department Manager Anhui USTC iFLYTEK Co., Ltd University of Science & Tech of China.
10/15/20151 XML. 10/15/20152 Mark-up Languages Digitalizing information Content Format SGML HTML XML RDF OWL …
Biweekly Wednesday classes Taught by Dr. Xiaobo Yu and other Chinese faculty/Staff members at Kean.
PLS Considerations on using PLS for Slovenian Pronunciation Lexicon Construction Jerneja Žganec Gros Alpineon d.o.o., Ljubljana, Slovenia
1 1 The Why & How of IDN Generic Domain Names Presented by: Chuck Gomes Date: 13 May 2010.
Web Development & Design Foundations with XHTML Chapter 2 HTML/XHTML Basics.
Introduction to XML This presentation covers introductory features of XML. What XML is and what it is not? What does it do? Put different related technologies.
1 Robust Endpoint Detection and Energy Normalization for Real-Time Speech and Speaker Recognition Qi Li, Senior Member, IEEE, Jinsong Zheng, Augustine.
1 Introduction to XML XML stands for Extensible Markup Language. Because it is extensible, XML has been used to create a wide variety of different markup.
© 2013 by Larson Technical Services
A Fully Annotated Corpus of Russian Speech
Internet & World Wide Web How to Program, 5/e. © by Pearson Education, Inc. All Rights Reserved.2.
Irene Li 07/12/2011. Greeting and Welcome Review pīnyīn [p ʰ ín ín]) is the official system to transcribe Chinese characters into the Roman alphabet.
Title Authors Introduction Text, text, text, text, text, text Background Information Text, text, text, text, text, text Observations Text, text, text,
13 Information and Reservation Module (lesson 55-56) Information quiz review Chinese IIAB (IIA +IIB)
UML Profile BY RAEF MOUSHEIMISH. Background Model is a description of system or part of a system using well- defined language. Model is a description.
Presentation by : ZhangCarolyn (Zhang, Yajie --- 张雅杰 ) Online Project.
Quick Overview on Tones
RDFa Primer Bridging the Human and Data webs Presented by: Didit ( )
Presentation on Presentation I will talk on some “WH” question about presentation.
XML Notes taken from w3schools. What is XML? XML stands for EXtensible Markup Language. XML was designed to store and transport data. XML was designed.
PLS for SSML Paolo Baggia Loquendo Workshop II on Internationalizing SSML.
T Network Application Frameworks and XML Web Services and WSDL Sasu Tarkoma Based on slides by Pekka Nikander.
Kenwood Academy Mandarin 2 Test Modification, Administration & Analysis By Lindsay Decker.
Introduction to XHTML.
Preparing Conference Papers (1)
Preparing Conference Papers (1)
Technical Communication
Presentation transcript:

An Introduction to S3ML Beijing InfoQuick SinoVoice Speech Technology Corp. CHEN Ming, LV Shinan, LI Xiulin

Outline Background PinYin Support Definition Domain Support Conclusion

Background SSML Speech Synthesis Markup Language Now is W3C Recommendation SinoVoice Famous Speech Technology and Service Provider Leading Chinese TTS Technology and Product Deploy Real Systems

Background S3ML (SinoVoice SSML) Since the launching of jTTS 4.0, March 2004 Based on SSML Specification Defines some extensions aiming at Chinese TTS Defines the detail of some elements which SSML does not define precisely Provide maximum compatibility with newest SSML version

PinYin Support PinYin Phoneme annotation method for Chinese characters in SSML The phoneme element provides a phonemic/phonetic pronunciation for the contained text. Two attributes: alphabet and ph

PinYin Support alphabet The alphabet attribute is an optional attribute that specifies the phonemic/phonetic alphabet. Use ‘ py ’ as value of ‘ alphabet ’ to specify that PinYin will be used ph The ph attribute is a required attribute that specifies the phoneme/phone string. Use PinYin string as value of ‘ ph ’

PinYin Support Example More about PinYin string Conformed to “ Chinese Mandarin PinYin Specification “ Series of PinYin for several characters Tone information 1~4: high flat, rising, diving and falling tone 0, 5: light tone 查 良镛 查良镛 先生

PinYin support When PinYin string is included in normal text? Comparing with CSSML We think is not for such purpose, is more suitable We think extension in S3ML is more compatible with SSML Next station is di4 tan2 zha1 良镛 他姓 查

Definition The detail of element When SinoVoice define S3ML, the detail values of the attributes of this element is not defined in SSML. Now, “ SSML 1.0 say-as attribute values ” is proposed but it is still on progress / SinoVoice will support this proposal, so I will only talk about some additional values

Definition Name and address, especially person name because of the polyphone Chinese characters Math, some mathematic expression is confused with other info 张朝阳 朝阳区

Definition Net address Phoneme, useful for character/phoneme mixed text The pronunciation of ‘ tomato ’ is tɒmɑtoʊ Next station is di4 tan2

Domain Support Important for real system Customized TTS is used more and more popular Better voice quality than general version One possibility in SSML Use element and define special values of ‘ name ’ attribute But it is not natural because it is normal to support several different domains by using a same name (voice library)

Domain Support element The ‘ name ’ attribute is required to specify the customized TTS package used The value of ‘ name ’ attribute will be a vendor-specific name will not change voice If a voice library does not support this domain, this element will be just ignored.

Domain Support If we want TTS System select the best voice for this domain automatically Extended ‘ domain ’ attribute of ‘ domain ’ is still in least priority 今天白天 ,晴转多云,最高温度 26 度 今天白天 ,晴转多云,最高温度 26 度

Conclusion Summarize extension of S3ML name / address / math / phoneme / net We hope it will be helpful to define the standard for internationalizing SSML

Thank You!