영어교육에 있어서의 영어억양의 역할 (The role of prosody in English education) Korea Nazarene University 2006. 5.19 Kyuchul Yoon English Division Kyungnam University.

Slides:



Advertisements
Similar presentations
Acoustic/Prosodic Features
Advertisements

The Role of F0 in the Perceived Accentedness of L2 Speech Mary Grantham O’Brien Stephen Winters GLAC-15, Banff, Alberta May 1, 2009.
Tools for Speech Analysis Julia Hirschberg CS4995/6998 Thanks to Jean-Philippe Goldman, Fadi Biadsy.
High Level Prosody features: through the construction of a model for emotional speech Loic Kessous Tel Aviv University Speech, Language and Hearing
Suprasegmentals The term suprasegmental refers to those properties of an utterance which aren't properties of any single segment. The following are usually.
Communicating with Robots using Speech: The Robot Talks (Speech Synthesis) Stephen Cox Chris Watkins Ibrahim Almajai.
Prosodics, Part 1 LIN Prosodics, or Suprasegmentals Remember, from our first discussions in class, that speech is really a continuous flow of initiation,
Comprehensive Stuttering Program By Einer Boberg & Deborah Kully (1985)
Effectiveness of spatial cues, prosody, and talker characteristics in selective attention C.J. Darwin & R.W. Hukin.
Dr. O. Dakkak & Dr. N. Ghneim: HIAST M. Abu-Zleikha & S. Al-Moubyed: IT fac., Damascus U. Prosodic Feature Introduction and Emotion Incorporation in an.
Combining Prosodic and Text Features for Segmentation of Mandarin Broadcast News Gina-Anne Levow University of Chicago SIGHAN July 25, 2004.
Bootstrapping a Language- Independent Synthesizer Craig Olinsky Media Lab Europe / University College Dublin 15 January 2002.
Spoken Language Technologies: A review of application areas and research issues Analysis and synthesis of F0 contours Agnieszka Wagner Department of Phonetics,
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Looking at Spectrogram in Praat cs4706, Jan 30 Fadi Biadsy.
Automatic Prosody Labeling Final Presentation Andrew Rosenberg ELEN Speech and Audio Processing and Recognition 4/27/05.
Research on teaching and learning pronunciation
Text-To-Speech Synthesis An Overview. What is a TTS System  Goal A system that can read any text Automatic production of new sentences Not just audio.
Praat Fadi Biadsy.
Producing Emotional Speech Thanks to Gabriel Schubiner.
Text-To-Speech System for Marathi Miss. Deepa V. Kadam Indian Institute of Technology, Bombay.
A Text-to-Speech Synthesis System
Track: Speech Technology Kishore Prahallad Assistant Professor, IIIT-Hyderabad 1Winter School, 2010, IIIT-H.
Toshiba Update 04/09/2006 Data-Driven Prosody and Voice Quality Generation for Emotional Speech Zeynep Inanoglu & Steve Young Machine Intelligence Lab.
Senior Project – Electrical Engineering Tool for Improving Non-Native French Speech Pronunciation Joseph Ciaburri Advisor – Professor Catravas,
[1] Processing the Prosody of Oral Presentations Rebecca Hincks KTH, The Royal Institute of Technology Department of Speech, Music and Hearing The Unit.
Automatic Pitch Tracking January 16, 2013 The Plan for Today One announcement: Starting on Monday of next week, we’ll meet in Craigie Hall D 428 We’ll.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
Speech Perception1 Fricatives and Affricates We will be looking at acoustic cues in terms of … –Manner –Place –voicing.
Efficacy of Computer-based Phonetic Training on Students’ Boundary Tone Zhang Yan, Nanjing University.
Tone sensitivity & the Identification of Consonant Laryngeal Features by KFL learners 15 th AATK Annual Conference Hye-Sook Lee -Presented by Hi-Sun Kim-
Copyright 2007, Toshiba Corporation. How (not) to Select Your Voice Corpus: Random Selection vs. Phonologically Balanced Tanya Lambert, Norbert Braunschweiler,
The role of prosody in dialect synthesis and authentication Kyuchul Yoon Division of English Kyungnam University Spring 2008 Joint Conference of KSPS.
SPEECH CONTENT Spanish Expressive Voices: Corpus for Emotion Research in Spanish R. Barra-Chicote 1, J. M. Montero 1, J. Macias-Guarasa 2, S. Lufti 1,
Building a sentential model for automatic prosody evaluation Kyuchul Yoon School of English Language & Literature Yeungnam University Korea.
Acoustic Properties of Taiwanese High School Students ’ Stress in English Intonation Advisor: Dr. Raung-Fu Chung Student: Hong-Yao Chen.
A prosodically sensitive diphone synthesis system for Korean Kyuchul Yoon Linguistics Department The Ohio State University.
Segmental encoding of prosodic categories: A perception study through speech synthesis Kyuchul Yoon, Mary Beckman & Chris Brew.
Bernd Möbius CoE MMCI Saarland University Lecture 7 8 Dec 2010 Unit Selection Synthesis B Möbius Unit selection synthesis Text-to-Speech Synthesis.
Evaluating prosody prediction in synthesis with respect to Modern Greek prenuclear accents Elisabeth Chorianopoulou MSc in Speech and Language Processing.
1 Current Interests 2007~2008 (Unfinished papers & Premature ideas) 1.Identifying frication & aspiration noise in the frequency domain: The case of Korean.
Frequency, Pitch, Tone and Length October 16, 2013 Thanks to Chilin Shih for making some of these lecture materials available.
Using Technology to Teach Pronunciation A review of the research from Melike Yücel Eleonora Frigo Laurie Wayne Ling 578, Winter 2010, Dr. Arnold.
Frequency, Pitch, Tone and Length February 12, 2014 Thanks to Chilin Shih for making some of these lecture materials available.
Imposing native speakers’ prosody on non-native speakers’ utterances: Preliminary studies Kyuchul Yoon Spring 2006 NAELL The Division of English Kyungnam.
Roman Kálecký UČO: Segmental features  Sounds  Speach Trainer 3D Suprasegmental features and accents  Speak English  SpeakAP  Accentuate!
Phonetics, part III: Suprasegmentals October 19, 2012.
What vocal cues indicate sarcasm? By: Jack Dolan Rockwell, P. (2000). Lower, slower, louder: Vocal cues of sarcasm. Journal of Psycholinguistic Research,
The role of prosody in dialect authentication Simulating Masan dialect with Seoul speech segments Kyuchul Yoon Division of English, Kyungnam University.
Suprasegmental Properties of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.
Dialect Simulation through Prosody Transfer: A preliminary study on simulating Masan dialect with Seoul dialect Kyuchul Yoon Division of English, Kyungnam.
Phonetics, part III: Suprasegmentals October 18, 2010.
Outline  I. Introduction  II. Reading fluency components  III. Experimental study  1) Method and participants  2) Testing materials  IV. Interpretation.
Speech in the DHH Classroom A new perspective. Speech in the DHH Bilingual Classroom Important to look beyond the traditional view of speech Think of.
Suprasegmental features and Prosody Lect 6A&B LING1005/6105.
INTONATION FRANCESCA FRASER. INTONATION “The Music of Speech” (Thornbury (2006) Alters Meaning of Speech Intonation. Intonation! Intonation? Thornbury,
Praat: doing phonetics by computer Introductory tutorial Kyuchul Yoon Division of English Kyungnam University.
Teaching Listening Why teach listening?
Teaching pronunciation
Suprasegmental features and Prosody
Investigating Pitch Accent Recognition in Non-native Speech
an Introduction to English
Text-To-Speech System for English
PHONETICS.
Speech and Language Processing
EXPERIMENTS WITH UNIT SELECTION SPEECH DATABASES FOR INDIAN LANGUAGES
A System for Hybridizing Vocal Performance
Tools for Speech Analysis
Looking at Spectrogram in Praat cs4706, Jan 30
Automatic Prosodic Event Detection
Presentation transcript:

영어교육에 있어서의 영어억양의 역할 (The role of prosody in English education) Korea Nazarene University Kyuchul Yoon English Division Kyungnam University

2 Contents A pop quiz Acquiring prosody in language learning…..7 Previous approaches……………………….8 A new tool…………………………………9 Technical details………………………….10 Implications………………...…………….23

3 Pop quiz Speech sound is composed of consonants and vowels.

4 Answer WRONG!!

5 Speech sound = segments + suprasegmentals (consonants & vowels) (intonation, pause, length, loudness)

6 Suprasegmentals (or Prosody) (1) pause = phrase breaks (2) intonation = fundamental frequency (F0 contour) (3) length = segmental durations (4) loudness = intensity (or loudness) contour

7 Acquiring prosody in language learning Prosody as non-segmental features of speech 1. phrase breaks 2. intonation (F0) contour 3. segmental durations 4. intensity contour

8 Previous approaches Explicit teaching of prosodic features such as the intonation contours, segmental durations, etc. Audio aid Listen and repeat! Visual aid Visual display of suprasegmentals (Chun,89; Spaai & Hermes, 92). Dr.Speaking ® : F0 contour comparison between native speaker and non-native speaker

9 A new tool A new kind of audio aid in the form of a non-native speaker’s utterance with the prosodic features of a native speaker’s utterance How this works 1. Software presents a native speaker’s utterance 2. A non-native speaker repeats the utterance 3. Software records the non-native speaker’s utterance 4. Software imposes the native speaker’s prosody onto the non-native speaker’s utterance 5. Software presents the processed non-native utterance

10 Technical details Manipulation of 1. segmental durations, including phrase breaks 2. F0 contours 3. intensity contours For 1 and 2 PSOLA (Pitch Synchronous OverLap and Add), developed by Moulines & Charpentier, 1990 implemented in Praat For 3 Intensity swap in Praat

11 Technical details Moulines & Charpentier, 1990 original waveform windowed waveform shortened waveform waveform with lower F0

12 Technical details 1 Segmental durations Segment alignment & PSOLA processing of durations : Alignment can be manual or automatic (with the help of speech recognition) keIeI min “…came in…”native keIeI in non-native m stretch shrink

13 Technical details 1+2 Segmental durations + F0 contour PSOLA processing of F0 on duration-treated utterance keIeI min native non-native keIeI min native F0 non-native F0

14 Technical details Segmental durations + F0 contour + intensity contour Mathematically “neutralize” non-native speaker’s intensity contour and transfer native speaker’s intensity contour in Praat – Holger Miterer (personal communication) keIeI min native non-native keIeI min native intensity non-native intensity

15 Technical details 1+3 Segmental durations + intensity contour Segment alignment & PSOLA processing of duations followed by intensity contour transfer keIeI min native keIeI in non-native m native intensity non-native intensity stretch shrink

16 Technical details 2+3 F0 contour + intensity contour “Reverse” segment alignment & PSOLA processing of F0 followed by intensity contour transfer keIeI min native keIeI in non-native m stretch shrink native F0 non-native F0 native intensity non-native intensity

17 Technical details Weakness 1. Voiceless segments can be made “voiced” in the windowing process (pitch-synchronous technique) 2. Excessive handling results in unnatural synthesis Segment alignment should be fine-tuned according to the voiced/voicless status of the (sub-)segments for better results

18 Technical details Examples Praat script native utterance non-native utterance synthetic non-native (durations+F0+intensity) synthetic non-native (durations+intensity) synthetic non-native (F0+intensity)

19 Technical details Comparison before synthesis – duration, F0 & intensity native utterance non-native utterance (blue & yellow)

20 Technical details Comparison after synthesis – duration, F0 & intensity native utterance synthetic non-native (blue & yellow)

21 Technical details Comparison after synthesis – duration & intensity native utterance synthetic non-native (blue & yellow)

22 Technical details Comparison after synthesis –F0 & intensity native utterance synthetic non-native (blue & yellow)

23 Implications The technique could be used (1) In second language education to facilitate/motivate acquisition of the target language prosody to emphasize the importance of prosody in achieving native speaker fluency (2) For patients with vocal disorders to help achieve the prosody of a normal voice ASR (Automatic Speech Recognition) can be employed to automate the segment aligning stage

24 References E. Moulines and F. Charpentier (1990) “Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones” Speech Communication 9, D. Chun (1989) “Teaching tone and intonation with microcomputers” CALICO Journal 6(3),21-47 W. Spaai and D. Hermes (1992) “A visual display for the teaching of intonation” CALICO Journal 10(3),