Emotional Speech Julia Hirschberg CS 6998 12/8/2018.


Today
  Defining emotional speech
  Emotional categories
  Eliciting judgments
  Producing emotional speech
  Detecting emotional speech

Cowie ‘00
  Is there a good theoretical or practical definition of emotional speech?
  “Full-blown” emotion vs. emotional state
  Cause and effect descriptions
  Primary and secondary (second order)
  Everyday descriptions
  Representations
  Biological

Dimensions in continuous space, e.g.
  Valence: positive or negative
  Activation level: how disposed to take action
Structural models: different ways of appraising situation that evokes emotion
  e.g. positive or negative? Does situation help agent to achieve his/her goals?
Timing as a key variable
  sadness vs. grief vs. depression vs. gloominess
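A minimal sketch of the dimensional view, assuming each emotion can be placed as a point on a valence axis and an activation axis, each scaled to [-1, 1]. The class name, the label set, and all coordinates are hypothetical values chosen for illustration; they are not taken from Cowie or from the slides.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class EmotionPoint:
    valence: float     # -1 = negative, +1 = positive
    activation: float  # -1 = passive, +1 = strongly disposed to act


# Hypothetical prototype coordinates, for illustration only.
PROTOTYPES = {
    "joy": EmotionPoint(0.8, 0.6),
    "anger": EmotionPoint(-0.7, 0.8),
    "sadness": EmotionPoint(-0.7, -0.6),
    "contentment": EmotionPoint(0.7, -0.5),
}


def nearest_label(point: EmotionPoint) -> str:
    """Return the prototype label closest to `point` in Euclidean distance."""
    def distance(name: str) -> float:
        proto = PROTOTYPES[name]
        return math.dist(
            (point.valence, point.activation),
            (proto.valence, proto.activation),
        )
    return min(PROTOTYPES, key=distance)


# A negative, highly activated state falls nearest the "anger" prototype.
label = nearest_label(EmotionPoint(valence=-0.5, activation=0.9))
```

A continuous representation like this lets nearby states (e.g. sadness vs. gloominess) differ by degree rather than by category, although it does not by itself capture timing.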

How are emotions expressed?
  Display rules?
  In speech?
  Mixing
  Simulation

Schroeder ‘01: Emotion in Synthesis
  How is a given emotion expressed in speech?
  What are the properties of the emotion to be expressed?
  How are they related to those of other emotions?
  What kind of synthesizer works best?
    Formant
    Diphone
    Unit selection

Prosody rules: what to modify?
How do we evaluate the results?
  Forced choice
  Free response
  Recognition rate
  Perceived naturalness
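One way the recognition rate of a forced-choice listening test might be computed is sketched below. It assumes each synthesized stimulus has one intended emotion and each listener response picks exactly one label from a closed set. The function names and the response data are hypothetical, made up for illustration.

```python
from collections import Counter


def recognition_rate(intended, chosen):
    """Fraction of responses where the chosen label matches the intended one."""
    if len(intended) != len(chosen) or not intended:
        raise ValueError("need two non-empty sequences of equal length")
    hits = sum(1 for i, c in zip(intended, chosen) if i == c)
    return hits / len(intended)


def per_emotion_rates(intended, chosen):
    """Recognition rate computed separately for each intended emotion."""
    totals = Counter(intended)
    hits = Counter(i for i, c in zip(intended, chosen) if i == c)
    return {emotion: hits[emotion] / totals[emotion] for emotion in totals}


# Hypothetical responses: intended emotion of each stimulus vs. listener choice.
intended = ["anger", "anger", "sadness", "sadness", "joy", "joy"]
chosen = ["anger", "joy", "sadness", "sadness", "joy", "anger"]

overall = recognition_rate(intended, chosen)
by_emotion = per_emotion_rates(intended, chosen)

# With a closed label set, guessing uniformly at random would score 1/k.
chance_level = 1 / len(set(intended))
```

Comparing the overall rate against the chance level is what makes a forced-choice score interpretable; free response and perceived naturalness need different scoring and are not covered by this sketch.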

Ten Bosch ‘00: Emotion Recognition
  How hard is the problem?
  Is ‘standard’ ASR technology well-suited to it?
    Acoustic and language models target short local events
    Feature extraction normalizes/excludes e.g. pitch, rate, amplitude -- why?
  Interaction: emotional speech and ASR performance
  Synthesis needs one good example but...
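A small sketch of one possible answer to the "why?" above, assuming the normalization in question resembles per-utterance mean and variance normalization (an assumption, not something the slide states). Such normalization helps a recognizer ignore speaker and channel differences, but it also erases the pitch level and range that may signal emotion. The F0 values below are invented for illustration.

```python
from statistics import mean, pstdev


def z_normalize(values):
    """Subtract the mean and divide by the standard deviation of one utterance."""
    m = mean(values)
    s = pstdev(values)
    if s == 0:
        return [0.0 for _ in values]
    return [(v - m) / s for v in values]


# Hypothetical F0 contours in Hz: same shape, different level and range.
neutral_f0 = [110.0, 115.0, 120.0, 115.0, 110.0]
excited_f0 = [220.0, 240.0, 260.0, 240.0, 220.0]

neutral_norm = z_normalize(neutral_f0)
excited_norm = z_normalize(excited_f0)

# After normalization the two contours are numerically indistinguishable,
# so the raised pitch level and wider range are no longer available as cues.
same_after_norm = all(
    abs(a - b) < 1e-9 for a, b in zip(neutral_norm, excited_norm)
)
```

Under this assumption, an emotion recognizer would need features taken before normalization, or normalized against a speaker baseline rather than per utterance.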
