The Time Dimension for Scene Analysis DeLiang Wang Perception & Neurodynamics Lab The Ohio State University, USA.

Slides:



Advertisements
Similar presentations
Read this article for Friday next week [1]Chelazzi L, Miller EK, Duncan J, Desimone R. A neural basis for visual search in inferior temporal cortex. Nature.
Advertisements

ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS
Auditory scene analysis 2
Multipitch Tracking for Noisy Speech
An Auditory Scene Analysis Approach to Speech Segregation DeLiang Wang Perception and Neurodynamics Lab The Ohio State University.
What is vision Aristotle - vision is knowing what is where by looking.
Read this article for Friday next week [1]Chelazzi L, Miller EK, Duncan J, Desimone R. A neural basis for visual search in inferior temporal cortex. Nature.
5/16/2015Intelligent Systems and Soft Computing1 Introduction Introduction Hebbian learning Hebbian learning Generalised Hebbian learning algorithm Generalised.
Artificial neural networks:
Auditory Scene Analysis (ASA). Auditory Demonstrations Albert S. Bregman / Pierre A. Ahad “Demonstration of Auditory Scene Analysis, The perceptual Organisation.
Impact of Correlated inputs on Spiking Neural Models Baktash Babadi Baktash Babadi School of Cognitive Sciences PM, Tehran, Iran PM, Tehran, Iran.
Cocktail Party Processing DeLiang Wang (Jointly with Guoning Hu) Perception & Neurodynamics Lab Ohio State University.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Cognitive Processes PSY 334 Chapter 2 – Perception June 30, 2003.
Un Supervised Learning & Self Organizing Maps Learning From Examples
COGNITIVE NEUROSCIENCE
Michigan State University 1 “Saliency-Based Visual Attention” “Computational Modeling of Visual Attention”, Itti, Koch, (Nature Reviews – Neuroscience.
Cognitive Processes PSY 334 Chapter 2 – Perception.
Cognitive Processes PSY 334 Chapter 2 – Perception.
Speech Segregation Based on Sound Localization DeLiang Wang & Nicoleta Roman The Ohio State University, U.S.A. Guy J. Brown University of Sheffield, U.K.
Speech Perception in Noise and Ideal Time- Frequency Masking DeLiang Wang Oticon A/S, Denmark On leave from Ohio State University, USA.
Studying Visual Attention with the Visual Search Paradigm Marc Pomplun Department of Computer Science University of Massachusetts at Boston
Functional Brain Signal Processing: Current Trends and Future Directions Kaushik Majumdar Indian Statistical Institute Bangalore Center
Dynamic Link Matching Hamid Reza Vaezi Mohammad Hossein Rohban Neural Networks Spring 2007.
Active Vision Key points: Acting to obtain information Eye movements Depth from motion parallax Extracting motion information from a spatio-temporal pattern.
The search for organizing principles of brain function Needed at multiple levels: synapse => cell => brain area (cortical maps) => hierarchy of areas.
From Auditory Masking to Supervised Separation: A Tale of Improving Intelligibility of Noisy Speech for Hearing- impaired Listeners DeLiang Wang Perception.
Cognition, Brain and Consciousness: An Introduction to Cognitive Neuroscience Edited by Bernard J. Baars and Nicole M. Gage 2007 Academic Press Chapter.
Monaural Speech Segregation: Representation, Pitch, and Amplitude Modulation DeLiang Wang The Ohio State University.
Changju Lee Visual System Neural Network Lab. Department of Bio and Brain Engineering.
Computational Auditory Scene Analysis and Its Potential Application to Hearing Aids DeLiang Wang Perception & Neurodynamics Lab Ohio State University.
VIEWING THE WORLD IN COLOR. COLOR A psychological interpretation Based on wavelength, amplitude, and purity Humans can discriminate among c. 10 million.
National Taiwan A Road Sign Recognition System Based on a Dynamic Visual Model C. Y. Fang Department of Information and.
Cycle 6: Oscillations and Synchrony
Auditory Segmentation and Unvoiced Speech Segregation DeLiang Wang & Guoning Hu Perception & Neurodynamics Lab The Ohio State University.
The Function of Synchrony Marieke Rohde Reading Group DyStURB (Dynamical Structures to Understand Real Brains)
Neural Network Basics Anns are analytical systems that address problems whose solutions have not been explicitly formulated Structure in which multiple.
Binding problems and feature integration theory. Feature detectors Neurons that fire to specific features of a stimulus Pathway away from retina shows.
Chapter 2. From Complex Networks to Intelligent Systems in Creating Brain-like Systems, Sendhoff et al. Course: Robots Learning from Humans Baek, Da Som.
Rhythms and Cognition: Creation and Coordination of Cell Assemblies Nancy Kopell Center for BioDynamics Boston University.
Neural Networks Teacher: Elena Marchiori R4.47 Assistant: Kees Jong S2.22
An Oscillatory Correlation Approach to Scene Segmentation DeLiang Wang The Ohio State University.
Ch 9. Rhythms and Synchrony 9.7 Adaptive Cooperative Systems, Martin Beckerman, Summarized by M.-O. Heo Biointelligence Laboratory, Seoul National.
Sensation & Perception. Motion Vision I: Basic Motion Vision.
Neural Networks 2nd Edition Simon Haykin
Speech Segregation Based on Oscillatory Correlation DeLiang Wang The Ohio State University.
How we actively interpret our environment..  Perception: The process in which we understand sensory information.  Illusions are powerful examples of.
Dimensions of Neural Networks Ali Akbar Darabi Ghassem Mirroshandel Hootan Nokhost.
Robot Intelligence Technology Lab. 10. Complex Hardware Morphologies: Walking Machines Presented by In-Won Park
Ch. 13 A face in the crowd: which groups of neurons process face stimuli, and how do they interact? KARI L. HOFFMANN 2009/1/13 BI, Population Coding Seminar.
The Neural Code Baktash Babadi SCS, IPM Fall 2004.
National Taiwan Normal A System to Detect Complex Motion of Nearby Vehicles on Freeways C. Y. Fang Department of Information.
A Neurodynamical Cortical Model of Visual Attention and Invariant Object Recognition Gustavo Deco Edmund T. Rolls Vision Research, 2004.
BIOLOGICALLY MOTIVATED OSCILLATORY NETWORK MODEL FOR DYNAMICAL IMAGE SEGMENTATION Margarita Kuzmina, Eduard Manykin Keldysh Institute of Applied Mathematics.
Speech and Singing Voice Enhancement via DNN
Cognitive Processes PSY 334
Cocktail Party Problem as Binary Classification
Brain States: Top-Down Influences in Sensory Processing
Perceiving and Recognizing Objects
Local Binary Patterns (LBP)
Andy Dykstra HST.722 November 1, 2007
Mind, Brain & Behavior Wednesday February 12, 2003.
OCNC Statistical Approach to Neural Learning and Population Coding ---- Introduction to Mathematical.
Cognitive Processes PSY 334
Multisensory integration: perceptual grouping by eye and ear
Thomas Akam, Dimitri M. Kullmann  Neuron 
Brain States: Top-Down Influences in Sensory Processing
The Temporal Correlation Hypothesis of Visual Feature Integration
Jennifer K. Bizley, Ross K. Maddox, Adrian K.C. Lee 
Margarita Kuzmina, Eduard Manykin
Presentation transcript:

The Time Dimension for Scene Analysis DeLiang Wang Perception & Neurodynamics Lab The Ohio State University, USA

Presentation outline l Introduction l Scene analysis and temporal correlation theory l Oscillatory Correlation l LEGION network l Oscillatory Correlation Approach to Scene Analysis l Image segmentation l Object selection l Cocktail party problem l Concluding remarks

Scene analysis problem

Binding problem Feature binding (integration) is a fundamental problem in neuroscience and perception (and perceptrons) Binding problem in Rosenblatt’s perceptrons

Temporal correlation theory l Temporal correlation theory proposes a solution to the nervous integration problem (von der Malsburg’81; also Milnor’74) l Application to cocktail party processing (von der Malsburg & Schneider’86)

Physiological evidence (Gray et al.’89)

Oscillatory correlation theory Oscillators represent feature detectors Binding is encoded by synchrony within an oscillator assembly and desynchrony between different assemblies

Computational requirements Need to synchronize locally coupled oscillator population Need to desynchronize different populations, when facing multiple objects l Synchrony and desynchrony must be achieved rapidly

LEGION architecture l LEGION - Locally Excitatory Globally Inhibitory Oscillator Network (Terman & Wang’95)

Relaxation oscillator as building block Typical x trace (membrane potential) With stimulus Without stimulus

Analytical results l Theorem 1. (Synchronization). The oscillators in a connected block synchronize at an exponential rate l Theorem 2. (Multiple patterns) If at the beginning all the oscillators of the same block synchronize and different blocks desynchronize, then synchrony within each block and the ordering of activations among different blocks are maintained l Theorem 3. (Desynchronization) If at the beginning all the oscillators of the system lie not too far away from each other, then the condition of Theorem 2 will be satisfied after some time. Moreover, the time it takes to satisfy the condition is no greater than N cycles, where N is the number of blocks

Connectedness problem l Minsky-Papert connectedness problem is a long- standing problem in perceptron learning l The problem exposes fundamental limitations of supervised learning, and illustrates the importance of proper representations

Connectedness problem: LEGION solution l Basic idea: Synchronization within a connected pattern and desynchronization between different ones

Presentation outline l Introduction l Scene analysis and temporal correlation theory l Oscillatory Correlation l LEGION network l Oscillatory Correlation Approach to Scene Analysis l Image segmentation l Object selection l Cocktail party problem l Concluding remarks

Oscillatory correlation approach to scene segmentation l Feature extraction first takes place l An visual feature can be pixel intensity, depth, local image patch, texture element, optic flow, etc. l An auditory feature can be a pure tone, amplitude and frequency modulation, onset, harmonicity, etc. l Connection weights between neighboring oscillators are set to be proportional to feature similarity l Global inhibitor controls granularity of segmentation l Larger inhibition results in more and smaller regions l Segments pop out from LEGION in time

Image segmentation example: Demo Input image

Image segmentation example Input imageSegmentation result

Object selection l The slow inhibitor keeps trace of each pattern, which can be overcome by only more salient (larger) patterns l Unlike traditional winner- take-all dynamics, selection (competition) takes place at the object level l Consistent with object- based attention theory l Binding precedes attention, rather than attention precedes binding (Treisman & Gelade’80)

Results of object selection Input image LEGION output Selection output Input LEGION segmentation Selection

Cocktail party problem In a natural environment, target speech is usually corrupted by acoustic interference, creating a speech segregation problem l Popularly known as cocktail-party problem (Cherry’53); also ball- room problem (Helmholtz, 1863) Human listeners organize sound in a perceptual process called auditory scene analysis (Bregman’90) l Auditory scene analysis (ASA) takes place in two conceptual stages: l Segmentation. Decompose the acoustic signal into ‘sensory elements’ (segments) l Grouping. Combine segments into groups, so that segments in the same group likely originate from the same sound source

Oscillatory correlation for ASA (Wang & Brown’99) Frequency

Auditory periphery: Cochleagram l Cochleagram representation of the utterance: “Why were you all weary?” mixed with phone ringing

Grouping layer: Example l Two streams emerge from the group layer l Foreground: left (original mixture ) l Background: right l More recent results (Hu & Wang’04):

Back to physiology l Chattering cells recorded by Gray & McCormick’96 l Burst oscillations are best modeled by relaxation oscillators

Versatility and time dimension l The principle of universality: “Give me a concrete problem and I will devise a network that solves it.” (von der Malsburg’99) l It characterizes artificial intelligence l The principle of versatility: “Given the network, learn to cope with situations and problems as they arise.” (von der Malsburg’99) l It characterizes natural intelligence l Time dimension is necessary for versatility l Flexible and infinitely extensible l Irreplaceable by spatial organization

Conclusion l Advances in dynamical analysis overcome computational obstacles of oscillatory correlation theory l Major progress is made towards solving the scene analysis problem l From Hebb’s cell assemblies to von der Malsburg’s correlation theory, time is an indispensable dimension for scene analysis