Cross modal and linguistic interactions in speech perception

Slides:



Advertisements
Similar presentations
Investigating motor-related sounds in the brain Zarinah Agnew, Carolyn McGettigan, Sophie Scott UCL Institute of Cognitive Neuroscience.
Advertisements

Nonword repetition: effects of item length and complexity Carolyn McGettigan Frank Eisner Chloe Marshall Sophie Scott.
BUCNI Meeting BUCNI Meeting Aug Effects of spectral detail and tonal variation on speech intelligibility Kyong, Scott, Eisner and Rosen.
Acoustic versus perceptual accounts of speaker differentiation within anterior auditory cortex Nicolas J. Abreu Carolyn McGettigan Sophie K. Scott Institute.
Delayed auditory feedback: a study into vocal motor patterns UCL Institute of Cognitive Neuroscience Dr Zarinah Agnew Dr Carolyn McGettigan Briony Banks.
1 Speech Sounds Introduction to Linguistics for Computational Linguists.
THE ROLE OF EMOTION-SPECIFIC RESOURCES IN CROSS-MODAL PROCESSING OF EMOTIONAL STIMULI Marie Bayot – Asp FNRS 2012 In the following slides, you will find.
Cognitive Systems, ICANN panel, Q1 What is machine intelligence, as beyond pattern matching, classification and prediction. What is machine intelligence,
Visual speech speeds up the neural processing of auditory speech van Wassenhove, V., Grant, K. W., & Poeppel, D. (2005) Proceedings of the National Academy.
Chapter 12 Speech Perception. Animals use sound to communicate in many ways Bird calls Bird calls Whale calls Whale calls Baboons shrieks Baboons shrieks.
The Neuroscience of Language. What is language? What is it for? Rapid efficient communication – (as such, other kinds of communication might be called.
Designing Facial Animation For Speaking Persian Language Hadi Rahimzadeh June 2005.
Vineel Pratap Girish Govind Abhilash Veeragouni. Human listeners are capable of extracting information from the acoustic signal beyond just the linguistic.
Facial expression as an input annotation modality for affective speech-to-speech translation Éva Székely, Zeeshan Ahmed, Ingmar Steiner, Julie Carson-Berndsen.
Speech Perception Overview of Questions Can computers perceive speech as well as humans? Does each word that we hear have a unique pattern associated.
 INTRODUCTION  STEPS OF GESTURE RECOGNITION  TRACKING TECHNOLOGIES  SPEECH WITH GESTURE  APPLICATIONS.
Large-Scale, Real-World Face Recognition in Movie Trailers Week 2-3 Alan Wright (Facial Recog. pictures taken from Enrique Gortez)
Lip Feature Extraction Using Red Exclusion Trent W. Lewis and David M.W. Powers Flinders University of SA VIP2000.
FMRI: Biological Basis and Experiment Design Lecture 16: Final projects Experiment design Data simulation Data analysis 1 light year = 5,913,000,000,000.
A Corpus Search Methodology for Focus Realization Jonathan Howell and Mats Rooth Linguistics and CIS Cornell University.
Neural correlates of facial expression perception: Using adaptation to shift the emotion perceived in unambiguous expressions Phil Pell Anne Richards Marty.
Efficiency – practical Get better fMRI results Dummy-in-chief Joel Winston Design matrix and.
Clinical Applications of Speech Technology Phil Green Speech and Hearing Research Group Dept of Computer Science University of Sheffield
DEFINING COMMUNICATION CHAPTER 8- MARKETING EDUCATION.
Supervisor: Dr. Eddie Jones Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification System for Security.
The Neural Basis of Speech Perception – a view from functional imaging Sophie Scott Institute of Cognitive Neuroscience, University College London.
Compressed Sensing Based UWB System Peng Zhang Wireless Networking System Lab WiNSys.
Damaris Escobar What is neurolinguistics?  It is the study of the neural mechanisms in the human brain that control the comprehension, production,
Clinical Applications of Speech Technology Phil Green Speech and Hearing Research Group Dept of Computer Science University of Sheffield
SIGNAL DETECTION IN FIXED PATTERN CHROMATIC NOISE 1 A. J. Ahumada, Jr., 2 W. K. Krebs 1 NASA Ames Research Center; 2 Naval Postgraduate School, Monterey,
Учитель МОУ гимназии № 13 Комарова Инна Викторовна.
Language. Phonetics is the study of how elements of language are physically produced.
COPYRIGHT © All rights reserved by Sound acoustics Germany The averaged quality measures over all test cases indicate the real influence of a test object.
Инвестиционный паспорт Муниципального образования «Целинский район»
Variation of aspect ratio Voice section Correct voice section Voice Activity Detection by Lip Shape Tracking Using EBGM Purpose What is EBGM ? Experimental.
(x – 8) (x + 8) = 0 x – 8 = 0 x + 8 = x = 8 x = (x + 5) (x + 2) = 0 x + 5 = 0 x + 2 = x = - 5 x = - 2.
1 Cross-language evidence for three factors in speech perception Sandra Anacleto uOttawa.
Language Perception.
The Relation Between Speech Intelligibility and The Complex Modulation Spectrum Steven Greenberg International Computer Science Institute 1947 Center Street,
Fundamentals of Sensation and Perception EXAM REVIEW ERIK CHEVRIER OCTOBER 20 TH, 2015.
Understanding early visual coding from information theory By Li Zhaoping Lecture at EU advanced course in computational neuroscience, Arcachon, France,
Non Verbal Communication. What Is Paralanguage? DEFINITION Paralanguage is the voice intonation that accompanies speech, including voice pitch, voice.
By : Y N Jagadeesh Trainer – Soft skills Blue HR Solutions.
Motion Perception Deficits and Reading Impairment It’s the noise, not the motion A. Sperling, Z-L. Lu, F. Manis & M. Seidenberg.
Objectives Identify the image quality characteristics that apply to all medical imaging modalities Understand the concept of image optimization Review.
Unit 11 Teaching Reading Anything in common? Something in common! listening & speaking-receptive skills speaking & writing -productive skills.
What can we expect of cochlear implants for listening to speech in noisy environments? Andrew Faulkner: UCL Speech Hearing and Phonetic Sciences.
Presented By Meet Shah. Goal  Automatically predicting the respondent’s reactions (accept or reject) to offers during face to face negotiation by analyzing.
照片档案整理 一、照片档案的含义 二、照片档案的归档范围 三、 卷内照片的分类、组卷、排序与编号 四、填写照片档案说明 五、照片档案编目及封面、备考填写 六、数码照片整理方法 七、照片档案的保管与保护.
공무원연금관리공단 광주지부 공무원대부등 공적연금 연계제도 공무원연금관리공단 광주지부. 공적연금 연계제도 국민연금과 직역연금 ( 공무원 / 사학 / 군인 / 별정우체국 ) 간의 연계가 이루어지지 않고 있 어 공적연금의 사각지대가 발생해 노후생활안정 달성 미흡 연계제도 시행전.
Жюль Верн ( ). Я мальчиком мечтал, читая Жюля Верна, Что тени вымысла плоть обретут для нас; Что поплывет судно громадней «Грейт Истерна»; Что.
Detection Of Anger In Telephone Speech Using Support Vector Machine and Gaussian Mixture Model Prepared By : Siti Marahaini Binti Mahamood.
PLUS.
Functional Neuroimaging of Perceptual Decision Making
GSM Speech Coding To send a voice across a radio network, we have to turn our voice into a digital signal. GSM uses a method called RPE-LPC (Regular Pulse.
Mr. Darko Pekar, Speech Morphing Inc.
Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments Good morning, My name is Guan-Lin Chao, from Carnegie Mellon.
Vocoders.
Copyright © American Speech-Language-Hearing Association
The General Linear Model (GLM): the marriage between linear systems and stats FFA.
مبررات إدخال الحاسوب في رياض الأطفال
A Gentle Introduction to Bilateral Filtering and its Applications
STRENGTHS & WEAKNESSES
Building Positive Teacher-Child Relationships
21twelveinteractive.com/ twitter.com/21twelveI/ facebook.com/21twelveinteractive/ linkedin.com/company/21twelve-interactive/ pinterest.com/21twelveinteractive/
Non-local Means Filtering
Dual Adaptive Control for Trajectory Tracking of Mobile Robots
Benedikt Zoefel, Alan Archer-Boyd, Matthew H. Davis  Current Biology 
For More Details:
End-to-End Speech-Driven Facial Animation with Temporal GANs
Presentation transcript:

Cross modal and linguistic interactions in speech perception Sophie Scott, Carolyn McGettigan, Irene Altarelli, Jonas Obleser, Andy Faulkner, Stuart Rosen

Speech comprehension is influenced…. By concurrent visual information By higher order linguistic information by the acoustics of the speech signal

The neural basis of this varies 1 2 3 4 8 16 3R 16R Effect size (%) Acoustic detail 0 1 2 4 8 0 1 2 4 8 0 1 2 4 8 30 pixels 18 pixels 6 pixels channels C Face and voice interaction Predictability effects

Proposed study Vary these three factors within one study Acoustic detail, facial blurring, linguistic predictability Sparse scanning, 3X2X2 design (2,4,6 channels, 15 and 45 levels of Gaussian blur, and low and high predictability), plus a severe blur/continuous noise condition. TR 9 seconds 260 trials in total, 20 sentences per condition, and 20 ‘baseline’ trials - 2 blocks of 20 mins?