Download presentation
Presentation is loading. Please wait.
Published byAndrea Angers Modified over 10 years ago
1
Speech Synthesis Markup Language SSML
2
Introduced in September 2004 XML based Assists the generation of synthetic speech Specifies the way speech is outputted Often embedded in VoiceXML Can also be used alone SSML
3
SYNTHESIS PROCESSOR (Text-to-Speech System) SSML Document Synthesis Processor Spoken Output 6 Major Processing Steps
4
PROCESSING STEPS Extracts Document Tree and Content Structure Tags Attributes 1) XML Parse SSML document Synthesis Processor Document Tree Content
5
PROCESSING STEPS Paragraphs - element Sentences - element Used to indicate structure of document 2) Structure Analysis Document Tree Structure Synthesis Processor Content
6
PROCESSING STEPS Influences how the document will be read Synthesis processor can infer structure Uses punctuation etc. 2) Structure Analysis Document Tree Structure Synthesis Processor Content
7
PROCESSING STEPS Converting written form into spoken form $200 is spoken as two hundred dollars Ambiguous constructs such as 1/2 3) Text Normalization Document Tree Text in Spoken Form Synthesis Processor Content
8
PROCESSING STEPS Acronyms and abbreviations and elements Errors likely without mark up 3) Text Normalization Document Tree Text in Spoken Form Synthesis Processor Content
9
PROCESSING STEPS Derives pronunciation for each word Some words have different pronunciations “Read” can be pronounced “red” or “reed” 4) Text-to-Phoneme Conversion Document Tree Pronunciation Information Synthesis Processor Content
10
PROCESSING STEPS Foreign names and places element Errors likely without mark up 4) Text-to-Phoneme Conversion Document Tree Pronunciation Information Synthesis Processor Content
11
PROCESSING STEPS Pitch, timing, pausing Speaking rate, emphasis on words Other features that affect speech 5) Prosody Analysis Document Tree Prosody Synthesis Processor Content
12
PROCESSING STEPS Elements,, Not needed to generate suitable prosody Can use document structure, sentence syntax etc. 5) Prosody Analysis Document Tree Prosody Synthesis Processor Content
13
PROCESSING STEPS Information used to produce audio waveform and elements 6) Waveform Production Information from previous steps Waveforms Synthesis Processor
14
FUTURE OF SSML Internationalizing
15
FUTURE OF SSML Decomposition of words into syllables Tone assignment Pitch assignment Duration assignment Dialects and Styles need to be supported
16
USES OF SSML Access Web Services using Telephone Voice Browser SSML Synthetic Speech
17
USES OF SSML Airline Reservations Voicemail Cinema Schedules Bank Information Weather Forecasts
18
SSML EXAMPLES <speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.w3.org/2001/10/synthesis http://www.w3.org/TR/speech-synthesis/synthesis.xsd" xml:lang="en-US"> You have 4 new messages. The first is from Stephanie Williams and arrived at 3:45pm. The subject is ski trip
19
SSML EXAMPLES Today we preview the latest music from Example. Hear what the Software Reviews said about Example's newest hit. He sings about issues that touch us all. Here's a sample. Would you like to buy it?
20
SSML EXAMPLES This sentence is spoken fast This sentence is spoken low pitch This sentence is spoken with stress This sentence will be spoken out in ten seconds
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.