Speech Synthesis Markup Language SSML. Introduced in September 2004 XML based Assists the generation of synthetic speech Specifies the way speech is outputted.

Slides:



Advertisements
Similar presentations
Proposals for Extending SSML 1.0 from the Point-of-View of Hungarian TTS Developers Géza Németh, Géza Kiss, Bálint Tóth Laboratory of Speech Technology,
Advertisements

HTML popo.
Introduction to HTML & CSS
Speech Synthesis Markup Language V1.0 (SSML) W3C Recommendation on September 7, 2004 SSML is an XML application designed to control aspects of synthesized.
HTML. The World Wide Web Protocols Addresses HTML.
XML and Enterprise Computing. What is XML? Stands for “Extensible Markup Language” –similar to SGML and HTML –document “tags” are used to define content.
Sub-Project I Prosody, Tones and Text-To-Speech Synthesis Sin-Horng Chen (PI), Chiu-yu Tseng (Co-PI), Yih-Ru Wang (Co-PI), Yuan-Fu Liao (Co-PI), Lin-shan.
1 SSML The Internationalization of the W3C Speech Synthesis Markup Language SpeechTek 2007 – C102 – Daniel C. Burnett.
XSLT Stylesheets Some more advanced examples (adapted from the Edinburgh LT site)
Tutorial 9 Working with XHTML
Tutorial 9 Working with XHTML. XP Objectives Describe the history and theory of XHTML Understand the rules for creating valid XHTML documents Apply a.
Creating a Well-Formed Valid Document. 2 Objectives Introducing XHTML Creating a Well-Formed Document Creating a Valid Document Creating an XHTML Document.
Screen Readers. What are they Screen readers are audio interfaces. Rather than displaying web content visually for users in a "window" or screen on the.
Text-To-Speech System for Marathi Miss. Deepa V. Kadam Indian Institute of Technology, Bombay.
AUDACITY a tool in vocal and pronunciation training by Assoc. Prof. Ainol Haryati Ibrahim
CNIT 133 Interactive Web Pags – JavaScript and AJAX Review HTML5.
Natural Language Processing and Speech Enabled Applications by Pavlovic Nenad.
Speech Synthesis Markup Language -----Aim at Extension Dr. Jianhua Tao National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese.
Tutorial 1: Getting Started with HTML5
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
Pemrograman Berbasis WEB XML part 2 -Aurelio Rahmadian- Sumber: w3cschools.com.
Toshiba (China) R&D Center LOU Xiaoyan, LI Jian Research and Development Center, Toshiba China Suggestions on Tone and Word Boundary of Mandarin for SSML.
Public 1 © 2005 Nokia V1-Filename.ppt / yyyy-mm-dd / Initials Development Challenges of Multilingual Text-to-Speech Systems Kimmo Pärssinen
XP Tutorial 9New Perspectives on Creating Web Pages with HTML, XHTML, and XML 1 Working with XHTML Creating a Well-Formed Valid Document Tutorial 9.
Copyright © 2011 Educational Testing Service. All rights reserved.Copyright © 2013 Educational Testing Service. All rights reserved. Audio Guidelines &
Conversational Applications Workshop Introduction Jim Larson.
XP 1 CREATING AN XML DOCUMENT. XP 2 INTRODUCING XML XML stands for Extensible Markup Language. A markup language specifies the structure and content of.
WORKING WITH XSLT AND XPATH
DEFINING COMMUNICATION CHAPTER 8- MARKETING EDUCATION.
Supervisor: Dr. Eddie Jones Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification System for Security.
SSML 1.1: The Internationalization of SSML Daniel C. Burnett August 9, 2006.
Integrating VoiceXML with SIP services
Tutorial 1: XML Creating an XML Document. 2 Introducing XML XML stands for Extensible Markup Language. A markup language specifies the structure and content.
1 What is HTML? Standardized codes Web pages SGML Descriptive markup Tags.
How do I use HTML and XML to present information?.
CITA 330 Section 6 XSLT. Transforming XML Documents to XHTML Documents XSLT is an XML dialect which is declared under namespace "
E0262 – MIS – Multimedia Storage Techniques XML (Extensible Markup Language  XML is a markup language for creating documents containing structured information.
Speech Technology. HOT! What are the big players in the area up to? Google – technology.htmlhttp://googleblog.blogspot.com/2010/12/can-we-talk-better-speech-
The Voice-Enabled Web: VoiceXML and Related Standards for Telephone Access to Web Applications 14 Feb Christophe Strobbe K.U.Leuven - ESAT-SCD-DocArch.
XML 2nd EDITION Tutorial 1 Creating An Xml Document.
XHTML and CSS Session 1 Intro, (X)HTML, CSS, W3C, browsers, webpage, structure, tags, attributes, elements, web development process, basic XHTML elements.
Acknowledgements Prof. Mctear, Natural Language Processing, University of Ulster.
XP 1 Creating an XML Document Developing an XML Document for the Jazz Warehouse XML Tutorial.
XML Design Goals 1.XML must be easily usable over the Internet 2.XML must support a wide variety of applications 3.XML must be compatible with SGML 4.It.
Creating User Interfaces Directed Speech. XML. VoiceXML Classwork/Homework: Sign up to be Voxeo developer. Do tutorials.
© 2013 by Larson Technical Services
English Phonetics 许德华 许德华. Objectives of the Course This course is intended to help the students to improve their English pronunciation, including such.
HTML.
Internet & World Wide Web How to Program, 5/e © by Pearson Education, Inc. All Rights Reserved.
Getting Started with Marking Up Page Content. Tag defines a paragraph Automatically creates some space before and after itself Code Browser Display.
Jozef Goetz, STEM Summer Camp Dr. Jozef Goetz.
HTML Overview Part 5 – JavaScript 1. Scripts 2  Scripts are used to add dynamic content to a web page.  Scripts consist of a list of commands that execute.
XP Tutorial 9New Perspectives on HTML and XHTML, Comprehensive 1 Working with XHTML Creating a Well-Formed Valid Document Tutorial 9.
TEACHING PRONUNCIATION
Tutorial 9 Working with XHTML. XP Objectives Describe the history and theory of XHTML Understand the rules for creating valid XHTML documents Apply a.
XML Extensible Markup Language
Rendering XML Documents ©NIITeXtensible Markup Language/Lesson 5/Slide 1 of 46 Objectives In this session, you will learn to: * Define rendering * Identify.
 Insight into our world of images, sound, color, patterns, forms, and movements.
Presented By Sharmin Sirajudeen S7 CS Reg No :
Prosody and Non- Verbal Communication
NSF Grant Number: IIS PI: Joseph Picone Institution: Mississippi State University Title: Integrating Prosody, Speech Recognition, Parsing In Spoken-Language.
Natural Language Processing and Speech Enabled Applications
Kuiper and Allan Chapter 6.2
Software and Multimedia
Software and Multimedia
Kuiper and Allan Chapter 6.2
Internationalising SSML Perspectives from the Local Language Speech Technology Initiative Ksenia Shalonova & Roger Tucker Outside Echo Ltd 19 November.
گزارش اجرای سامانه ملی مدیریت اطلاعات علمی و فناوری
Getting Started with Marking Up Page Content
Presentation transcript:

Speech Synthesis Markup Language SSML

Introduced in September 2004 XML based Assists the generation of synthetic speech Specifies the way speech is outputted Often embedded in VoiceXML Can also be used alone SSML

SYNTHESIS PROCESSOR (Text-to-Speech System) SSML Document Synthesis Processor Spoken Output 6 Major Processing Steps

PROCESSING STEPS Extracts Document Tree and Content Structure Tags Attributes 1) XML Parse SSML document Synthesis Processor Document Tree Content

PROCESSING STEPS Paragraphs - element Sentences - element Used to indicate structure of document 2) Structure Analysis Document Tree Structure Synthesis Processor Content

PROCESSING STEPS Influences how the document will be read Synthesis processor can infer structure Uses punctuation etc. 2) Structure Analysis Document Tree Structure Synthesis Processor Content

PROCESSING STEPS Converting written form into spoken form $200 is spoken as two hundred dollars Ambiguous constructs such as 1/2 3) Text Normalization Document Tree Text in Spoken Form Synthesis Processor Content

PROCESSING STEPS Acronyms and abbreviations and elements Errors likely without mark up 3) Text Normalization Document Tree Text in Spoken Form Synthesis Processor Content

PROCESSING STEPS Derives pronunciation for each word Some words have different pronunciations “Read” can be pronounced “red” or “reed” 4) Text-to-Phoneme Conversion Document Tree Pronunciation Information Synthesis Processor Content

PROCESSING STEPS Foreign names and places element Errors likely without mark up 4) Text-to-Phoneme Conversion Document Tree Pronunciation Information Synthesis Processor Content

PROCESSING STEPS Pitch, timing, pausing Speaking rate, emphasis on words Other features that affect speech 5) Prosody Analysis Document Tree Prosody Synthesis Processor Content

PROCESSING STEPS Elements,, Not needed to generate suitable prosody Can use document structure, sentence syntax etc. 5) Prosody Analysis Document Tree Prosody Synthesis Processor Content

PROCESSING STEPS Information used to produce audio waveform and elements 6) Waveform Production Information from previous steps Waveforms Synthesis Processor

FUTURE OF SSML Internationalizing

FUTURE OF SSML Decomposition of words into syllables Tone assignment Pitch assignment Duration assignment Dialects and Styles need to be supported

USES OF SSML Access Web Services using Telephone Voice Browser SSML Synthetic Speech

USES OF SSML Airline Reservations Voic Cinema Schedules Bank Information Weather Forecasts

SSML EXAMPLES <speak version="1.0" xmlns=" xmlns:xsi=" xsi:schemaLocation=" xml:lang="en-US"> You have 4 new messages. The first is from Stephanie Williams and arrived at 3:45pm. The subject is ski trip

SSML EXAMPLES Today we preview the latest music from Example. Hear what the Software Reviews said about Example's newest hit. He sings about issues that touch us all. Here's a sample. Would you like to buy it?

SSML EXAMPLES This sentence is spoken fast This sentence is spoken low pitch This sentence is spoken with stress This sentence will be spoken out in ten seconds