VOICE QUALITY AND F0 CUES FOR AFFECT EXPRESSION By I. Yanushevskaya, C. Gobl and N. Chasaide.

Presentation transcript:

VOICE QUALITY AND F0 CUES FOR AFFECT EXPRESSION By I. Yanushevskaya, C. Gobl and N. Chasaide

OUTLINE
- Introduction
- Synthetic stimuli
- Experiment setup
- Result
- Conclusion

INTRODUCTION
- F0 cues are crucial for emotional speech. What about voice quality?
- Based on previous work:
  - Adding voice quality cues enhances speech synthesis.
  - Several voice quality stimuli give similar results: tense ~= harsh, breathy ~= whispery.
  - Varying voice quality can influence listeners' judgments.
- Goal: determine the effect of varying voice quality alone.

SYNTHETIC STIMULI
- 15 synthetic stimuli based on the Swedish utterance "Ja adjö" (literally "Yes, goodbye")
- KLSYN88 used as the formant synthesizer
- 3 groups of stimuli: "VQ", "F0", "VQ+F0"

KLSYN88

VQ ONLY STIMULI
- Modal, breathy, whispery, lax-creaky and tense stimuli
- Harsh and creaky, which were included in previous work, are omitted
- Modal: the natural utterance copied into KLSYN88
- Breathy: lower AV, higher OQ, lower SQ, higher TL, wider B1
- Whispery: aspiration noise
- Lax-creaky: creaky + breathy - whispery
- Tense: lower OQ, higher SQ, lower TL, narrower B1, higher F0
- NOT normalized for F0
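The breathy and tense settings above can be read as signed adjustments to the modal stimulus. The sketch below is only an illustration of that reading: the directions come from the slide, but the function name `apply_quality`, the `MODAL` baseline and the `STEP` sizes are hypothetical placeholders, not values from the study or from KLSYN88 documentation.

```python
# Direction of each parameter change relative to the modal stimulus,
# as listed on the slide: +1 = higher/wider, -1 = lower/narrower.
ADJUSTMENTS = {
    "breathy": {"AV": -1, "OQ": +1, "SQ": -1, "TL": +1, "B1": +1},
    "tense": {"OQ": -1, "SQ": +1, "TL": -1, "B1": -1, "F0": +1},
}

# Hypothetical placeholder baseline and step sizes (NOT from the study).
MODAL = {"AV": 60, "OQ": 50, "SQ": 200, "TL": 5, "B1": 60, "F0": 110}
STEP = {"AV": 5, "OQ": 15, "SQ": 50, "TL": 5, "B1": 30, "F0": 5}


def apply_quality(modal, quality, step):
    """Return a copy of `modal` with each listed parameter shifted by
    step[param] in the direction the slide gives for `quality`."""
    settings = dict(modal)  # leave the modal baseline untouched
    for param, direction in ADJUSTMENTS[quality].items():
        settings[param] = modal[param] + direction * step[param]
    return settings


breathy = apply_quality(MODAL, "breathy", STEP)
tense = apply_quality(MODAL, "tense", STEP)
```

Whispery and lax-creaky are left out because the slide describes them only qualitatively.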

F0 ONLY STIMULI

VQ+F0 STIMULI
- Are these good pairs? We'll see...

EXPERIMENT SETUP
- 20 native speakers
- 10 of 15 stimuli presented
- Responses given on pairs of opposite affective attributes:
  - Sad-happy
  - Intimate-formal
  - Relaxed-stressed
  - Bored-interested
  - Apologetic-indignant
  - Fearless-scared
- Analysis: ANOVA
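The slide names ANOVA without saying which design was used. As a rough sketch, assuming a plain one-way ANOVA across stimulus groups (the study may well have used a repeated-measures design), the F statistic can be computed as below. The function name `one_way_anova` and the ratings are made-up placeholders, not data from the experiment.

```python
def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of rating lists."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total

    # Between-group and within-group sums of squares.
    ss_between = 0.0
    ss_within = 0.0
    for g in groups:
        mean = sum(g) / len(g)
        ss_between += len(g) * (mean - grand_mean) ** 2
        ss_within += sum((x - mean) ** 2 for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Made-up placeholder ratings on one attribute scale (e.g. sad-happy),
# one list per hypothetical stimulus group. NOT data from the study.
ratings = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat, df_between, df_within = one_way_anova(ratings)
```

A large F relative to the critical value for (df_between, df_within) would suggest that the groups differ on that attribute scale.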

RESULT

CONCLUSION
- Some voice qualities are more strongly associated with particular emotions than others.
- Intimacy and sadness: associated with lax-creaky voice (O) rather than breathy voice (X).
- On average, voice quality cues work better than F0 cues in speech synthesis.
- This may be because the voice quality stimuli already carry F0 information.

THANKS FOR YOUR ATTENTION