1 W3C Workshop on Internationalizing SSML SSML Extension for Korean Workshop : 2005/11/02 (Wed) Sang-Jin Kim

Slides:



Advertisements
Similar presentations
Ch. 1: Structure of English
Advertisements

Chapter 2: Structure of Spanish DewEtta Moss. What? Phonetically, Spanish is an easier language to learn than English because there are 22 phonemes and.
Suggested Sequencing of Graphophonemic Skills
Applying the Pronunciation Lexicon Specification to ASR & TTS 1 Patrizio Bergallo 1 Monday, August 20, 2007 SpeechTEK ASTS - Advances in Text-to-Speech.
2. Accents, Syllables, and English Grammar
1 SSML The Internationalization of the W3C Speech Synthesis Markup Language SpeechTek 2007 – C102 – Daniel C. Burnett.
Syllable. Syllable When talking about stress, we refer to the degree of force and loudness with which a syllable is uttered. When talking about stress,
SSML extensions for multi-language usage Davide Bonardo W3C Workshop on Internationalizing SSML Crete, May 2006.
Introduction to Linguistics Ms. Suha Jawabreh Lecture 10.
Decoding Lesson 4 VCV and VCCV Syllable Pattern
Syllable.
Talking Letters Consonants Lessons 1 - 5
Phonology Phonology is essentially the description of the systems and patterns of speech sounds in a language. It is, in effect, based on a theory of.
The sound patterns of language
Phonetics The study of productive sounds within a language 2 Basic types of sounds in English: Consonants (C): restriction on airflow Vowels (V): no restriction.
Linguisitics Levels of description. Speech and language Language as communication Speech vs. text –Speech primary –Text is derived –Text is not “written.
Adding “ed” and “ing”.
Phonetics and Phonology.
Unit 1 Meeting People September 2007 Nice to know you all!
Revision: What are pure vowel sounds?
PHONETICS (3) Dr. Ansa Hameed.
PHONETICS Introduction. P HONETICS Definition : The scientific study of speech. Speech? Represents words and other units of language. There are some sounds.
The Description of Speech
Position Paper for W3C Workshop on Internationalizing SSML The Usage of Part-Of-Speech for Resolving Multiple Pronunciations in SSML Myoung-Wan.
1 SSML Extensions for TTS in Indian Languages II workshop on Internationalizing SSML May 2006, Greece Nixon Patel and Kishore Prahallad Bhrigus.
Decoding/Word Attack Use Decoding Strategies 5 th Grade Lesson 29.
Toshiba (China) R&D Center LOU Xiaoyan, LI Jian Research and Development Center, Toshiba China Suggestions on Tone and Word Boundary of Mandarin for SSML.
JEITA Speech Group1 Issues of SSML in Japanese Wataru IMATAKE (ANIMO LIMITED) Makoto AKABANE (Sony Computer Entertainment Inc.) Kazuyo TANAKA (Tsukuba.
Public 1 © 2005 Nokia V1-Filename.ppt / yyyy-mm-dd / Initials Development Challenges of Multilingual Text-to-Speech Systems Kimmo Pärssinen
How IPA is Used in SSML and PLS Paolo Baggia, Loquendo Wed. August 9 th, 2006.
Phonetics and Phonology
W3C Workshop, Beijing, 2nd of November 2005 An extension to the SSML for diacritics auto-completion R&D Centre Vocal Services Section.
SSML 1.1: The Internationalization of SSML Daniel C. Burnett August 9, 2006.
Language and Orthography Instructor: Tsueifen Chen.
Korea Maritime and Ocean University NLP Jung Tae LEE
Introduction to Linguistics Ms. Suha Jawabreh Lecture 9.
A Balanced Introduction to Computer Science, 3/E David Reed, Creighton University ©2011 Pearson Prentice Hall ISBN Chapter 15 JavaScript.
PLS Considerations on using PLS for Slovenian Pronunciation Lexicon Construction Jerneja Žganec Gros Alpineon d.o.o., Ljubljana, Slovenia
The Sounds of English: an Introduction to English Phonetics.
Instruction For the Chinese vocabulary: Record your pronunciation of the character or word. Provide the English meaning. Use the mouse write the character.
1 Linguistics week 6 Phonetics 4. 2 Parameters for describing consonants So far (this is not complete yet) we have – Airstream (usually the same for all.
PRODUCTION OF SPEECH SOUND Pertemuan 1 Matakuliah: G0332/English Phonology Tahun: 2007.
Phonetics, part III: Suprasegmentals October 19, 2012.
An Introduction to S3ML Beijing InfoQuick SinoVoice Speech Technology Corp. CHEN Ming, LV Shinan, LI Xiulin.
Virtual Agent 1 Dialog Manager Resources Input Technologies Output Technologies Data User © 2013 by Larson Technical Services Pronunciation Lexicon Pronunciation.
Phonetic / phonological typology
TEACHING PRONUNCIATION
How We Organize the Sounds of Speech 김종천 김완제 위이.
Chapter 2: The variation problem 1: Inter-speaker variation J. Jenkins The phonology of English as an international language Presented by: Carrie Newdall.
Introduction to English Pronunciation
 What is one fun thing that you did this summer?  Think about this question and be prepared to share aloud.
Soran University- College of Education English Department Phonology Talib M. Sharif Omer Assistant lecturer
PLS for SSML Paolo Baggia Loquendo Workshop II on Internationalizing SSML.
King Faisal University [ ] 1 E-learning and Distance Education Deanship Department of English Language College of Arts King Faisal University Introduction.
The Invention of Writing
Improving voice and diction Introduction
Chinese Language 华 文 huá wén
Ancient Japanese Language
SPOKEN english.
an Introduction to English
Manner of articulation is the way in which a speech sound
Week 4 – English Vowels Monophthongs Diphthongs Triphthongs One sound
Midterm Review (closed book)
English Phonetics and Phonology
What is the characteristic of an alphabetic writing system?
Phonetics.
Phonetics & Phonology John Corbett: USP-CAPES International Fellow
ENGLISH PHONETICS AND PHONOLOGY Week 2
Research on the Modeling of Chinese Continuous Speech Recognition
Presentation transcript:

1 W3C Workshop on Internationalizing SSML SSML Extension for Korean Workshop : 2005/11/02 (Wed) Sang-Jin Kim

2 Contents  Characteristic of Korean  SSML Extension for Chinese Characters in Korean  SSML Extension for Homograph Words in Korean  Conclusion

3 Characteristic of Korean  Hangul, The Korean Character Consists of forty letters  21 vowels (including 13 diphthongs), and 19 consonants Syllable  V, CV, VC, and CVC (C : consonant, V : vowel) Eojeol, the word phrase is different from a phrase in English Completely different from Japanese except for the grammatical structure Completely different from Chinese although Korean has borrowed many Chinese words and some Chinese characters

4 Characteristic of Korean  Vowels in Hangul, The Korean Character Monothong vowels classified according to tongue position and height

5 Characteristic of Korean  Consonants in Hangul, The Korean Character Consonants classified according to place and manner of articulation

6 SSML Extension for Chinese Characters in Korean  Chinese Characters in Korean Present Korean and Japanese use many Chinese Characters But, pronunciation of the characters is different Same characters is represented differently according to the country These simplified characters are not used in Korea

7 SSML Extension for Chinese Characters in Korean  Chinese Characters in Korean We can write text only with Korean characters Not unusual to use Chinese characters as well The pronunciation of the are exactly same

8 SSML Extension for Chinese Characters in Korean  Chinese Characters in Korean TTS The input text for text-to-speech(TTS) system has to be converted into a phonetic list If Chinese characters are mixed with Korean characters, they have to be substituted to Korean We don’t use all Chinese characters, rather there is a frequently-used-Chinese-character-list recommended by our Korean government and its size is 2000 We need to utilize this list and their pronunciations in the Korean TTS system, since the pronunciations of them are different from Chinese and Japanese

9 SSML Extension for Chinese Characters in Korean  SSML Extension for Chinese Characters in Korean Same characters but different pronunciation in Chinese Characters according to the country

10 SSML Extension for Homograph Words in Korean  Homograph Words in Korean Same word, different pronunciation, different meaning The difference is “duration”

11 SSML Extension for Homograph Words in Korean  SSML Extension for Homograph Words in Korean Only the difference for these words is the duration in pronunciation necessary to give the duration information to a TTS system for these kinds of words SSML recommendation supports “say-as” element and “sub” element, these elements cannot handle the above problem successfully

12 SSML Extension for Homograph Words in Korean  SSML Extension for Homograph Words in Korean We suggest “tone” tag for this problem Attribute values for tone element are ‘long’, ‘short’ and ‘default’ would be enough for Korean.

13 Conclusion  SSML Extension for Chinese Characters in Korean lexicon element doesn’t support “xml:lang” tag We suggest xml:lang=“ko”, xml:lang=“ko-CN”, xml:lang=“ja- KR”, xml:lang=“cn-KR” tags  SSML Extension for Homograph Words in Korean “say-as” and “sub” elements cannot handle homograph problem successfully We suggest “tone” element Attribute values, type=“long”, type=“short”, and type=“default” would be enough for Korean