Audio Workgroup Neuro-inspired Speech Recognition Group Members Ismail UysalYoojin Chung Ramin Pichevar Rich Hammett Tarek Massoud Ross Gaylor David Anderson.

Slides:



Advertisements
Similar presentations
Neuro-inspired Speech Recognition
Advertisements

Audio Workgroup Neuro-inspired Speech Recognition.
Audio Workgroup Neuro-inspired Speech Recognition.
An AER Analog Silicon Cochlea Model using Pseudo Floating Gate Transconductors Master Thesis in Electronics and Computer Science, Microelectronics Programme,
Purpose The aim of this project was to investigate receptive fields on a neural network to compare a computational model to the actual cortical-level auditory.
MIMICKING THE HUMAN EAR Philipos Loizou (author) Oliver Johnson (me)
WIMS Capstone Proposal DSP Demo Abigail Fuentes Rivera Esteban Valentin Lugo Michael Ortiz Sanchez ICOM 5047 Prof Nayda Santiago.
Cognitive Computing…. Computational Neuroscience Jerome Swartz The Swartz Foundation May 10, 2006.
Neural NetworksNN 11 Neural Networks Teacher: Elena Marchiori R4.47 Assistant: Kees Jong S2.22
To Understand, Survey and Implement Neurodynamic Models By Farhan Tauheed Asif Tasleem.
To Understand, Survey and Implement Neurodynamic Models By Farhan Tauheed Asif Tasleem.
Structure and function
Neural Networks Primer Dr Bernie Domanski The City University of New York / CSI 2800 Victory Blvd 1N-215 Staten Island, New York 10314
Real-Time Speech Recognition Thang Pham Advisor: Shane Cotter.
SOMTIME: AN ARTIFICIAL NEURAL NETWORK FOR TOPOLOGICAL AND TEMPORAL CORRELATION FOR SPATIOTEMPORAL PATTERN LEARNING.
Hearing.
Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.
Authored By :- Rachit Kr. Rastogi Computer Sc. & Engineering Deptt., College Of Technology, G.B.P.U.A.T. Pantnagar, India
Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.
Introduction to Neural Networks. Neural Networks in the Brain Human brain “computes” in an entirely different way from conventional digital computers.
IE 585 Introduction to Neural Networks. 2 Modeling Continuum Unarticulated Wisdom Articulated Qualitative Models Theoretic (First Principles) Models Empirical.
Wireless Integrated Microsystems Cochlear Team Team Members Feliut Guzman Wilmarie Rios Luis Calderon Guillermo Martinez Michael Ortiz Esteban Valentin.
Methods Neural network Neural networks mimic biological processing by joining layers of artificial neurons in a meaningful way. The neural network employed.
ECE 488 Computer Engineering Design I Fall 2005 Hau Ngo Ming Zhang.
LML Speech Recognition Speech Recognition Introduction I E.M. Bakker.
ICASSP Speech Discrimination Based on Multiscale Spectro–Temporal Modulations Nima Mesgarani, Shihab Shamma, University of Maryland Malcolm Slaney.
Pencil-and-Paper Neural Networks Prof. Kevin Crisp St. Olaf College.
Gammachirp Auditory Filter
Harmonicity Winner-Take-All The cochlea is a sensitive membrane structure in the inner ear that performs a particular type of frequency analysis. While.
A NEW FEATURE EXTRACTION MOTIVATED BY HUMAN EAR Amin Fazel Sharif University of Technology Hossein Sameti, S. K. Ghiathi February 2005.
GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO: , M.E(MBCBS),COM SYSTEM,VI.
Quiz 1 Review. Analog Synthesis Overview Sound is created by controlling electrical current within synthesizer, and amplifying result. Basic components:
A Model of Binaural Processing Based on Tree-Structure Filter-Bank
Kirchhoff Institute for Physics Johannes Schemmel Ruprecht-Karls-Universität Heidelberg 1 Accelerated Neuromorphic Hardware : Hybrid Plasticity - The Next.
Computational Modeling of the Auditory Periphery:
Center for Wireless Integrated MicroSystems (WIMS) Code for Cochlear Implant with Low-Power Constraints Luis Calderón, Axel Claudio, Jomayra Marrero, Guillermo.
7/6/99 MITE1 Fully Parallel Learning Neural Network Chip for Real-time Control Students: (Dr. Jin Liu), Borte Terlemez Advisor: Dr. Martin Brooke.
Introduction to psycho-acoustics: Some basic auditory attributes For audio demonstrations, click on any loudspeaker icons you see....
Dr.Abeer Mahmoud ARTIFICIAL INTELLIGENCE (CS 461D) Dr. Abeer Mahmoud Computer science Department Princess Nora University Faculty of Computer & Information.
CSC321 Lecture 5 Applying backpropagation to shape recognition Geoffrey Hinton.
Communication Process Trading Symbols.
Hallucinations in Auditory Perception!!! Malcolm Slaney Yahoo! Research Stanford CCRMA.
Artificial Intelligence CIS 342 The College of Saint Rose David Goldschmidt, Ph.D.
July 23, BSA, a Fast and Accurate Spike Train Encoding Scheme Benjamin Schrauwen.
FPGA Cochlea Andre Van Schaik & Chetan Singh Thakur.
Ghent University Compact hardware for real-time speech recognition using a Liquid State Machine Benjamin Schrauwen – Michiel D’Haene David Verstraeten.
Final Year Project Eoin Culhane. MIDI Guitar Guitar with 6 outputs 1 output for each string Each individual string output will be converted to MIDI.
Message Source Linguistic Channel Articulatory Channel Acoustic Channel Observable: MessageWordsSounds Features Bayesian formulation for speech recognition:
語音訊號處理之初步實驗 NTU Speech Lab 指導教授: 李琳山 助教: 熊信寬
How do you get here?
March 31, 2016Introduction to Artificial Intelligence Lecture 16: Neural Network Paradigms I 1 … let us move on to… Artificial Neural Networks.
An Introduction To The Backpropagation Algorithm.
Chapter 13 Artificial Intelligence. Artificial Intelligence – Figure 13.1 The Turing Test.
Memory Network Maintenance Using Spike-Timing Dependent Plasticity David Jangraw, ELE ’07 Advisor: John Hopfield, Department of Molecular Biology 12 T.
1 Neural Networks MUMT 611 Philippe Zaborowski April 2005.
Artificial Intelligence for Speech Recognition
Cognitive Computing…. Computational Neuroscience
Presenter: Prof.Dr.-Eng. Gheorghe-Daniel Andreescu
Artificial Intelligence (CS 370D)
Presenter: Artur M. KUCZAPSKI
CSSE463: Image Recognition Day 17
OVERVIEW OF BIOLOGICAL NEURONS
The Brain as an Efficient and Robust Adaptive Learner
Design for Simple Spiking Neuron Model
Demonstration of STDP based Neural Networks on an FPGA
Emre O. Neftci  iScience  Volume 5, Pages (July 2018) DOI: /j.isci
CSSE463: Image Recognition Day 17
CSSE463: Image Recognition Day 17
The Brain as an Efficient and Robust Adaptive Learner
VoiceXML An investigation Author: Mya Anderson
Presentation transcript:

Audio Workgroup Neuro-inspired Speech Recognition Group Members Ismail UysalYoojin Chung Ramin Pichevar Rich Hammett Tarek Massoud Ross Gaylor David Anderson Shihab Shamma Hynek HermanskiShih-Chii Liu Giacomo Indiveri Malcolm Slaney

Audio Workgroup Audio Projects Localization Speech Recognition More ASR

Audio Workgroup Localization Effort

Audio Workgroup FPAA/Mote – Word Recognition

Audio Workgroup FPAA/Mote – Word Recognition Field Programmable Analog Array (FPAA)based analog cochlea (non-spiking) with envelope detection. MOTEbased pattern matching using matched filtering with receptive fields Robosapien listens to the spoken commands….

Audio Workgroup FPAA/Mote – Word Recognition Status: FPAA – (we are using a new FPAA) 2 nd -order sections synthesized but a full auditory filter bank is not yet up. MOTE – real-time communication with Matlab and sampling operational.

Audio Workgroup Relational Network (Simple) X Y Z M M X M Y M Z m Patches of neurons Each measure one quantity Bidirectional relations for feedback/feedforward Thanks to Rodney Douglas

Audio Workgroup Relational Network (example) Input here Relational Feedback Relational specification Relational feedback

Audio Workgroup ASR Relational Network Cochlea Delay Phone Recognizer Word Recognizer A patch of neurons (one of N output) Note: We dont know how to represent delays Phone Recognizer Bidirectional links enforce phoneme/word constraints

Audio Workgroup Relational Advantages Not an HMM HMMs are great, but… Incorporate other knowledge Bottom-up perception Top-down word hypothesis Hallucinate Based on experience Hear ba.. and know that Bad, bat, bar, bass, band follow >

Audio Workgroup Inner hair cells Silicon Cochlea Ganglion cells Basilar membrane high frequency low frequency (van Schaik, Liu, 2004) BASILAR MEMBRANE INNER HAIR CELLS GANGLION CELLS

Audio Workgroup Silicon Frequency Response Tone ramps into two cochleas

Audio Workgroup Cochlear Rate Profiles Left CochleaRight Cochlea Spikes per utterance

Audio Workgroup Learning Algorithms Statistical SAS (Pick best channels for decision) Least squares (for software demo) Liquid State Machine Take input to high dimensions with spiking net Spike Timing Dependent Plasticity (STDP) Giocomo/Srinjoy Chip Brader/Fusi Vowel 1 Vowel 2 LSM Spiking Output

Audio Workgroup Phoneme 1Phoneme 2 Learning Chip Architecture Immediate Cochlea Plastic synapses Delayed Cochlea Phoneme 1 Cochlea Chip Learning Chip Neurons Relational Network Nonplastic synapses Excit. Inhib. Binary synaptic weights:,,

Audio Workgroup Tone Results Tone recognition Spike input from silicon cochlea Training Two tones Duplicated input Positive and negative examples Testing

Audio Workgroup Phoneme recognition Spike input from silicon cochlea Training Two phonemes Duplicated inputs Positive and negative examples Testing Phoneme Results

Audio Workgroup Behind the Curtain

Audio Workgroup Hardware Overview Cochlea Learning Phoneme Word PCI-AER (for remapping) Cochlea Shih-Chii Liu Giacomo Indiveri Implemented in M ATLAB

Audio Workgroup Infrastructure Difficulties Remapper Ensuing the problems surrounding AER mapper boards, remapping the AER data from silicon cochlea to the learning chip had to be done in Matlab. (very slow) Power The unpredictable problem caused by the variation in supply voltage as much as 1V. Sharing chips The learning chip had to be shared with two other workgroups. PC replacement

Audio Workgroup Impedance Difficulties Cochlear firing rates Cochlea: 6M spikes/second 30k channels, 200 spikes/second Silicon Cochlea: 30k spikes/second 30 channels, 1k spike/second Learning Chip: 3k spikes/second 30 channels, 200 spikes/second Dynamic range

Audio Workgroup Simulation

Simulation 2

Audio Workgroup Simulation 3

Audio Workgroup Great Job! Student Members Ismail UysalYoojin Chung Ramin Pichevar Rich Hammett Tarek Massoud Ross Gaylor

Audio Workgroup

Silicon Cochlea Raster plot for two different tone inputs Mean firing rates for two different vowel inputs Channel Number Time in microseconds

Audio Workgroup Word Recognizer Four example raster plot (silence, A_, A_ with relational, AI)

Audio Workgroup Software Simulation

Audio Workgroup Software Simulation

Audio Workgroup Behind the Curtain