Audio Workgroup Neuro-inspired Speech Recognition.

Slides:

Advertisements

Similar presentations

Neuro-inspired Speech Recognition

Advertisements

Audio Workgroup Neuro-inspired Speech Recognition.

Audio Workgroup Neuro-inspired Speech Recognition Group Members Ismail UysalYoojin Chung Ramin Pichevar Rich Hammett Tarek Massoud Ross Gaylor David Anderson.

Purpose The aim of this project was to investigate receptive fields on a neural network to compare a computational model to the actual cortical-level auditory.

The Auditory Nervous System Classical Ascending Pathway.

cells in cochlear nucleus

Confidence Measures for Speech Recognition Reza Sadraei.

Natural Language Processing - Speech Processing -

Special Senses. Retina A B C D E F G H I A: Inner limiting membrane. B: Optic nerve fiber layer. C: Ganglionic cell layer. D: Inner plexiform. E: Inner.

Embedded vs. PC Application Programming. Overview  The software design cycle  Designing differences  Code differences  Test differences.

Neural mechanisms of sound localization How the brain calculates interaural time and intensity differences.

To Understand, Survey and Implement Neurodynamic Models By Farhan Tauheed Asif Tasleem.

Voice-enabled Image Identification System Design Aashish P. Shrestha Ming Ming Zheng Multimedia Signal Processing, University of Bridgeport, Connecticut.

To Understand, Survey and Implement Neurodynamic Models By Farhan Tauheed Asif Tasleem.

AUDITORY PERCEPTION Pitch Perception Localization Auditory Scene Analysis.

Welcome To The Odditory System! Harry I. Haircell: Official Cochlea Mascot K+K+ AIR FLUID amplification.

Hearing Part 2. Tuning Curve Sensitivity of a single sensory neuron to a particular frequency of sound Two mechanisms for fine tuning of sensory neurons,

AIM: How do we hear?. Opponent Process Theory Hering proposed that we process four primary colors combined in pairs of red-green, blue- yellow, and black-white.

Process: Create Account Record Create Account Record Process Input Calc. Process Process Output Account Record First Name Last Name Company Address.

Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.

Hearing and Deafness Anatomy & physiology. Protection Impedance match Capture; Amplify mid-freqs Vertical direction coding Frequency analysis Transduction.

Speech Recognition Application

Artificial Neural Networks An Overview and Analysis.

Senior Design Fall 06 and Spring 07 Speech Strategy for the Cochlear Implant.

Figure 13.1 The periodic condensation and rarefaction of air molecules produced by a tuning fork neuro4e-fig jpg.

Methods Neural network Neural networks mimic biological processing by joining layers of artificial neurons in a meaningful way. The neural network employed.

Virtual Worlds: Audio and Other Senses. VR Worlds: Output Overview Visual Displays: –Visual depth cues –Properties –Kinds: monitor, projection, head-based,

LML Speech Recognition Speech Recognition Introduction I E.M. Bakker.

Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.

From last time …. ASR System Architecture Pronunciation Lexicon Signal Processing Probability Estimator Decoder Recognized Words “zero” “three” “two”

‘Missing Data’ speech recognition in reverberant conditions using binaural interaction Sue Harding, Jon Barker and Guy J. Brown Speech and Hearing Research.

Harmonicity Winner-Take-All The cochlea is a sensitive membrane structure in the inner ear that performs a particular type of frequency analysis. While.

A NEW FEATURE EXTRACTION MOTIVATED BY HUMAN EAR Amin Fazel Sharif University of Technology Hossein Sameti, S. K. Ghiathi February 2005.

Artificial Intelligence 2004 Speech & Natural Language Processing Speech Recognition acoustic signal as input conversion into written words Natural.

By Sarita Jondhale 1 The process of removing the formants is called inverse filtering The remaining signal after the subtraction of the filtered modeled.

A Model of Binaural Processing Based on Tree-Structure Filter-Bank

Combining Speech Attributes for Speech Recognition Jeremy Morris November 9, 2006.

Introduction to Neural Networks Introduction to Neural Networks Applied to OCR and Speech Recognition An actual neuron A crude model of a neuron Computational.

7/6/99 MITE1 Fully Parallel Learning Neural Network Chip for Real-time Control Students: (Dr. Jin Liu), Borte Terlemez Advisor: Dr. Martin Brooke.

CS 351/ IT 351 Modeling and Simulation Technologies HPC Architectures Dr. Jim Holten.

Neural Networks. Molecules Levels of Information Processing in the Nervous System 0.01  m Synapses 1m1m Neurons 100  m Local Networks 1mm Areas /

SPHSC 462 HEARING DEVELOPMENT Overview Review of Hearing Science Introduction.

Energy, Stereoscopic Depth, and Correlations. Molecules Levels of Information Processing in the Nervous System 0.01  m Synapses 1m1m Neurons 100 

Hallucinations in Auditory Perception!!! Malcolm Slaney Yahoo! Research Stanford CCRMA.

Dynamically Reconfigurable Neurons. This presentation summaries the progression achieved up to date. Artificial Neural Networks Implementing the ANNs.

Ghent University Compact hardware for real-time speech recognition using a Liquid State Machine Benjamin Schrauwen – Michiel D’Haene David Verstraeten.

Final Year Project Eoin Culhane. MIDI Guitar Guitar with 6 outputs 1 output for each string Each individual string output will be converted to MIDI.

Audio Books for Phonetics Research CatCod2008 Jiahong Yuan and Mark Liberman University of Pennsylvania Dec. 4, 2008.

How do you get here?

1 Neural Networks MUMT 611 Philippe Zaborowski April 2005.

Adaptive Median Filter

Audiograms Degree, Type and Configuration

ANN-based program for Tablet PC character recognition

Artificial Intelligence for Speech Recognition

3) determine motion and sound perceptions.

Speech Recognition Christian Schulze

Audio Books for Phonetics Research

Tuning in the basilar membrane

Histology Slides for Special Senses

Research on the Modeling of Chinese Continuous Speech Recognition

3 primary cues for auditory localization: Interaural time difference (ITD) Interaural intensity difference Directional transfer function.

Volume 74, Issue 1, Pages (April 2012)

Copyright © 2014 Elsevier Inc. All rights reserved.

Increased network ensembles and altered Ca2+ dynamics in Panx1 KO cortical neurons. Increased network ensembles and altered Ca2+ dynamics in Panx1 KO cortical.

Week 13: Neurobiology of Hearing Part 2

VoiceXML An investigation Author: Mya Anderson

Presentation transcript:

Audio Workgroup Neuro-inspired Speech Recognition

Audio Workgroup Localization Effort Interaural Time Difference (ITD) Estimated from time difference between spikes of two matching channels. Interaural Intensity Difference (IID) Difference of spike counts between two cochleae. Azimuth: Combination of ITD and ILD

Audio Workgroup Localization Effort

Audio Workgroup Relational Network (Simple) X Y Z M M X M Y M Z m Patches of neurons Each measure one quantity Bidirectional relations for feedback/feedforward

Audio Workgroup Relational Network (example) Input here Relation specification Relational feedback Relation Feedback

Audio Workgroup ASR Relational Network Cochlea Delay Phone Recogniz er Word Recogniz er A patch of neurons (one of N output) We dont know how to represent time

Audio Workgroup ASR Advantages Not an HMM Top-Down, Bottom-Up Hypothesis Hallucinate

Audio Workgroup Silicon Cochlea Ganglion cells Basilar membrane high frequency low frequency Inner hair cells (van Schaik, Liu, 2004) BASILAR MEMBRANE INNER HAIR CELLS GANGLION CELLS

Audio Workgroup Silicon Cochlea Tone raster plots Vowel Rate Profiles

Audio Workgroup Learning Chip Architecture Tone Rasters? Vowel Rasters Learning Algorithm Alternative Learning Statistics LeastSquares

Audio Workgroup LSM Recognizer

Audio Workgroup Infrastruture Difficulties Remapper Replace with Matlab Power ? Sharing chips? PC replacement

Audio Workgroup FPAA/Mote

Word Recognizer Four example raster plot (silence, A_, A_ with relational, AI)

Audio Workgroup Software Simulation

Audio Workgroup Behind the Curtain

Audio Workgroup Hardware Overview Cochlea Remapper (in Matlab) Learning Giacomo Phoneme Word skype PCI- AER (for remappi ng)

Audio Workgroup