Spatio-Temporal Sequence Learning of Visual Place Cells for Robotic Navigation presented by Nguyen Vu Anh date: 20 th July, 2010 Nguyen Vu Anh, Alex Leng-Phuan.

Slides:



Advertisements
Similar presentations
Flexible Shaping: How learning in small steps helps Hierarchical Organization of Behavior, NIPS 2007 Kai Krueger and Peter Dayan Gatsby Computational Neuroscience.
Advertisements

Object recognition and scene “understanding”
Sparse Coding in Sparse Winner networks Janusz A. Starzyk 1, Yinyin Liu 1, David Vogel 2 1 School of Electrical Engineering & Computer Science Ohio University,
Intelligent Agents Russell and Norvig: 2
Template design only ©copyright 2008 Ohio UniversityMedia Production Spring Quarter  A hierarchical neural network structure for text learning.
An Overview of Machine Learning
HMAX Models Architecture Jim Mutch March 31, 2010.
Patch to the Future: Unsupervised Visual Prediction
Kohonen Self Organising Maps Michael J. Watts
My Group’s Current Research on Image Understanding.
CMPT-884 Jan 18, 2010 Video Copy Detection using Hadoop Presented by: Cameron Harvey Naghmeh Khodabakhshi CMPT 820 December 2, 2010.
Optimizing number of hidden neurons in neural networks
[1].Edward G. Jones, Microcolumns in the Cerebral Cortex, Proc. of National Academy of Science of United States of America, vol. 97(10), 2000, pp
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Self-Organizing Hierarchical Neural Network
1 Abstract This paper presents a novel modification to the classical Competitive Learning (CL) by adding a dynamic branching mechanism to neural networks.
Fourth International Symposium on Neural Networks (ISNN) June 3-7, 2007, Nanjing, China A Hierarchical Self-organizing Associative Memory for Machine Learning.
Robust Object Recognition with Cortex-Like Mechanisms Thomas Serre, Lior Wolf, Stanley Bileshi, Maximilian Riesenhuber, and Tomaso Poggio, Member, IEEE.
Unsupervised discovery of visual object class hierarchies Josef Sivic (INRIA / ENS), Bryan Russell (MIT), Andrew Zisserman (Oxford), Alyosha Efros (CMU)
Christian Siagian Laurent Itti Univ. Southern California, CA, USA
Associative Learning in Hierarchical Self Organizing Learning Arrays Janusz A. Starzyk, Zhen Zhu, and Yue Li School of Electrical Engineering and Computer.
Study on Mobile Robot Navigation Techniques Presenter: 林易增 2008/8/26.
Michael Arbib & Laurent Itti: CS664 – USC, spring Lecture 6: Object Recognition 1 CS664, USC, Spring 2002 Lecture 6. Object Recognition Reading Assignments:
Overview of Long-Term Memory laura leventhal. Reference Chapter 14 Chapter 14.
AN ANALYSIS OF SINGLE- LAYER NETWORKS IN UNSUPERVISED FEATURE LEARNING [1] Yani Chen 10/14/
Spatial Pyramid Pooling in Deep Convolutional
Lecture 09 Clustering-based Learning
SOMTIME: AN ARTIFICIAL NEURAL NETWORK FOR TOPOLOGICAL AND TEMPORAL CORRELATION FOR SPATIOTEMPORAL PATTERN LEARNING.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Multiclass object recognition
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Hurieh Khalajzadeh Mohammad Mansouri Mohammad Teshnehlab
Interactive Discovery and Semantic Labeling of Patterns in Spatial Data Thomas Funkhouser, Adam Finkelstein, David Blei, and Christiane Fellbaum Princeton.
Learning Decompositional Shape Models from Examples Alex Levinshtein Cristian Sminchisescu Sven Dickinson Sven Dickinson University of Toronto.
Representations for object class recognition David Lowe Department of Computer Science University of British Columbia Vancouver, Canada Sept. 21, 2006.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Images Similarity by Relative Dynamic Programming M. Sc. thesis by Ady Ecker Supervisor: prof. Shimon Ullman.
Dynamic Captioning: Video Accessibility Enhancement for Hearing Impairment Richang Hong, Meng Wang, Mengdi Xuy Shuicheng Yany and Tat-Seng Chua School.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
Levels of Image Data Representation 4.2. Traditional Image Data Structures 4.3. Hierarchical Data Structures Chapter 4 – Data structures for.
Effective Reranking for Extracting Protein-protein Interactions from Biomedical Literature Deyu Zhou, Yulan He and Chee Keong Kwoh School of Computer Engineering.
381 Self Organization Map Learning without Examples.
Chapter 10. The Explorer System in Cognitive Systems, Christensen et al. Course: Robots Learning from Humans On, Kyoung-Woon Biointelligence Laboratory.
Face Image-Based Gender Recognition Using Complex-Valued Neural Network Instructor :Dr. Dong-Chul Kim Indrani Gorripati.
A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
© 2009 Robert Hecht-Nielsen. All rights reserved. 1 Andrew Smith University of California, San Diego Building a Visual Hierarchy.
Using decision trees to build an a framework for multivariate time- series classification 1 Present By Xiayi Kuang.
Object Recognition by Discriminative Combinations of Line Segments and Ellipses Alex Chia ^˚ Susanto Rahardja ^ Deepu Rajan ˚ Maylor Leung ˚ ^ Institute.
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
3D Face Recognition Using Range Images Literature Survey Joonsoo Lee 3/10/05.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
Activity Recognition Journal Club “Neural Mechanisms for the Recognition of Biological Movements” Martin Giese, Tomaso Poggio (Nature Neuroscience Review,
A Hierarchical Deep Temporal Model for Group Activity Recognition
National Taiwan Normal A System to Detect Complex Motion of Nearby Vehicles on Freeways C. Y. Fang Department of Information.
Action-Grounded Push Affordance Bootstrapping of Unknown Objects
School of Computer Science & Engineering
Recognizing Deformable Shapes
Generating Natural Answers by Incorporating Copying and Retrieving Mechanisms in Sequence-to-Sequence Learning Shizhu He, Cao liu, Kang Liu and Jun Zhao.
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Paper Presentation Aryeh Zapinsky
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
John H.L. Hansen & Taufiq Al Babba Hasan
Heterogeneous convolutional neural networks for visual recognition
Topological Signatures For Fast Mobility Analysis
Recognizing Deformable Shapes
Presentation transcript:

Spatio-Temporal Sequence Learning of Visual Place Cells for Robotic Navigation presented by Nguyen Vu Anh date: 20 th July, 2010 Nguyen Vu Anh, Alex Leng-Phuan Tay, Wooi-Boon Goh School of Computer Engineering Nanyang Technological University Singapore Janusz A. Starzyk School of Electrical Engineering Ohio University Athens, USA IJCNN, WCCI, Barcelona, Spain, 2010

Outline Introduction HMAX Feature Building and Extraction Spatio-Temporal Learning and Recognition Empirical Results Conclusion and future directions

Introduction Robotic navigation: Localization and Mapping. –Topological map & Place cells –Scope: Topological Visual Localization Challenges: –High dimension and uncertainty of visual features –Perceptual aliasing –Complex probabilistic frameworks e.g. HMM Approach: –Structural organization of human memory architecture. –Short-Term Memory (STM) and Long-Term Memory(LTM) Interaction

Introduction System Architecture Classifier Sequence Storage Symbol Quantization Feature Building and Extraction

Introduction Existing Works: –Autonomous navigation (SLAM): Mapping, Localization and Path Planning Topological vs metric representation Human employs mainly topological representation of environment [O’Keefe (1976), Redish(1999), Eichenbaum (1999), etc] –Visual Place-cell model: [Torralba (2001) ; Renninger&Malik (2004) ; Siagian&Itti (2007)] Hierarchical feature building and extraction (HMAX Model) [Serre et al (2007)] –Spatio-Temporal sequence learning: [Wang&Arbib (1990) (1993), Wang&Yowono (1995)] Our previous works: [Starzyk&He, (2007);Starzyk&He (2009);Tay et al (2007);Nguyen&Tay (2009)]

HMAX Feature Building and Extraction Interleaving simple (S) and complex (C) layers with increasing spatial invariance (Retina - LGN – V1 – V2,V4) 2 Stages: –Feature Construction –Feature Extraction Feature Significance:

HMAX Feature Building and Extraction Prototypes Ref: Riesenhuber & Poggio (1999), Serre et al (2007) Spatial Invariance Processing Dot-Product Matching

Spatio-Temporal Learning Architecture STM Structure: –Quantization of input using KFLANN with vigilance ρ See: Tay, Zurada,Wong and Xu, TNN, 2007

Spatio-Temporal Learning Architecture STM Structure: See: Tay, Zurada,Wong and Xu, TNN, 2007

Spatio-Temporal Learning Architecture LTM Cell Structure: –Each LTM is learnt by one-shot mechanism. –Each long training sequence is segmented into N overlapping subsequences of the same length M. –Each subsequence is dedicated permanently to an LTM cell.

Spatio-Temporal Learning Architecture LTM Cell Structure: Dual Neurons – STM Primary Neurons – Primary Excitation

Spatio-Temporal Learning Architecture Storage –One-shot learning Recognition Input feature vector Primary Excitation Computation Dual Neurons Update – Evidence Accumulation Output Matching Score from the last DN

Empirical Results ICLEF Competition 2010 Dataset –9 classes of places –2 sets of images with the same trajectory (Set S and SetC) (~4000 images each set) C K L O

Empirical Results Task –1 sequence (Set S) as training set and 1 sequence as testing set (Set R). Features: –10% of the training sequence Training –ρ=0.7. –Segmentation into consecutive subsequences of equal length (100) with overlapping portion (>50%). –Each subsequence is stored as a LTM cell. –The label of each LTM cell is the majority label of individual components. Testing –The label is assigned as the label of the maximally activated LTM cell. –If the activation of the maximal activated LTM cell is below ө, the system refuses to assign the label.

Empirical Results Table: LTM listing with training set S

Empirical Results Accuracy without threshold Accuracy with threshold ө=0.4 Robust testing: missing elements

Empirical Results Figure: LTM cells’ activation during recall stage

Empirical Results Intersection case:

Conclusion A hierarchical spatio-temporal learning architecture –HMAX hierarchical feature construction and extraction –STM clustering by KFLANN –Sequence storage and retrieval by LTM cells. Application in appearance-based topological localization

Future Directions Automatic tolerance estimation –E.g. Signal-to-noise ratio figure of features [Liu&Starzyk 2008] Hierarchical episodic memory which characterizes the interaction between STM and LTM –Other embodied intelligence components –Goal creation system [Starzyk 2008] Application in other domains: –Human Action Recognition

Thank you!