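Image Recognition using Hierarchical Temporal Memory
Radoslav Škoviera
Ústav merania SAV (Institute of Measurement Science, Slovak Academy of Sciences)
Fakulta matematiky, fyziky a informatiky UK (Faculty of Mathematics, Physics and Informatics, Comenius University)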


Image Recognition
- Applications: digital image databases, surveillance, industry, medicine
- Tasks: object recognition, automatic annotation, content-based image search
- Input: digital image
  - Single object
  - Scene (multiple objects: clutter, occlusion, merging)
- Output: description of the input image (keywords, scene semantics, similar images)
- Subtasks: image segmentation, feature extraction, classification

Motivation
- Image recognition is very easy for humans (and other animals)
- Computers cannot yet do it quickly or accurately enough
- This is good motivation for AI researchers to study bio-inspired models

Hierarchical Temporal Memory (HTM)
- Developed by Jeff Hawkins and Dileep George (Numenta)
- Hierarchical, tree-shaped network
- Bio-inspired: based on a large-scale model of the neocortex
- Consists of basic operational units called nodes
  - Each node uses the same two-stage learning algorithm: 1) spatial learning (pooling), 2) temporal learning (pooling)
  - Learning is performed layer by layer
  - Nodes have receptive fields: each node except the top one can look only at a portion of the input image

Spatial Learning
- Observe common patterns in the input space (training images)
- Group them into clusters of spatially similar patterns
- Use only one representative of each cluster to generate a "codebook"
- Reduces the input space and spatial noise
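The codebook idea above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the Euclidean distance and the `max_distance` threshold rule are hypothetical choices, not the spatial pooler used in the presentation.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def build_codebook(patterns, max_distance):
    """Keep one representative per cluster of spatially similar patterns.

    Assumed rule: a pattern becomes a new representative only if it is
    farther than `max_distance` from every representative stored so far.
    """
    codebook = []
    for pattern in patterns:
        if all(euclidean(pattern, rep) > max_distance for rep in codebook):
            codebook.append(list(pattern))
    return codebook

# Toy "image patches" as 2-D vectors: two tight clusters.
training_patches = [[0.0, 0.0], [0.1, 0.0], [1.0, 1.0], [0.9, 1.1], [0.0, 0.05]]
codebook = build_codebook(training_patches, max_distance=0.5)
print(codebook)
```

With this toy input, five patterns collapse to two representatives, which is the input-space reduction the slide refers to.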

Temporal Learning
- Uses time sequences to learn correlations between spatial patterns

Temporal Learning
- In each training step, the TAM is increased at the locations corresponding to the co-occurring codebook patterns, according to an update function (the formula is not preserved in this transcript)
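Since the update function itself is missing here, the following is only a minimal sketch of what such an update could look like. It assumes the TAM is a square matrix indexed by codebook pattern, that "co-occurring" means two patterns seen in consecutive time steps, and that each co-occurrence adds a constant increment. All of these are assumptions, and the helper names are hypothetical.

```python
def update_tam(tam, previous_index, current_index, increment=1.0):
    """Increase the TAM cell for a pair of consecutively observed patterns.

    The constant increment is an assumption; the original update function
    is not available.
    """
    tam[previous_index][current_index] += increment

def learn_tam(sequence, codebook_size):
    """Accumulate a TAM from a time sequence of codebook indices."""
    tam = [[0.0] * codebook_size for _ in range(codebook_size)]
    for previous_index, current_index in zip(sequence, sequence[1:]):
        update_tam(tam, previous_index, current_index)
    return tam

# Codebook indices of the closest pattern at each time step (toy data).
sequence = [0, 1, 1, 2, 0, 1]
tam = learn_tam(sequence, codebook_size=3)
for row in tam:
    print(row)
```

Cells with large values mark patterns that tend to follow each other in time, which is the correlation that temporal learning groups together.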

Inference & Classification
- Uses a similar dataflow to learning
- Two stages of inference in each node:
  - Spatial inference: find the closest pattern in the codebook
  - Temporal inference: calculate membership in the temporal groups
- Classification
  - HTM itself does not classify images; it only transforms the input space into another (hopefully more invariant) space
  - An external classifier must be used
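The two inference stages of a single node can be sketched as follows. The Gaussian similarity, the `sigma` parameter and the rule "group membership is the best similarity among the group's members" are assumptions made for illustration; the presentation does not specify them.

```python
import math

def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def spatial_inference(pattern, codebook):
    """Return the index of the closest codebook pattern."""
    distances = [squared_distance(pattern, rep) for rep in codebook]
    return min(range(len(codebook)), key=distances.__getitem__)

def temporal_inference(pattern, codebook, groups, sigma=1.0):
    """Return one membership value per temporal group.

    Assumed rule: similarity to each codebook pattern is Gaussian in the
    distance, and a group's membership is the maximum over its members.
    """
    similarity = [
        math.exp(-squared_distance(pattern, rep) / sigma ** 2)
        for rep in codebook
    ]
    return [max(similarity[i] for i in group) for group in groups]

codebook = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
groups = [[0, 2], [1]]  # hypothetical temporal groups of codebook indices
pattern = [0.1, 0.1]

closest = spatial_inference(pattern, codebook)
membership = temporal_inference(pattern, codebook, groups)
print(closest, membership)
```

The `membership` vector is the transformed representation; an external classifier such as k-NN or an SVM would be trained on vectors like this rather than on raw pixels.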

ATM Security
- Semi-automatic fraud detection system for ATMs (automated teller machines)
  - Detects masked individuals interacting with the ATM through the ATM's camera, which indicates possible illegal activity
- Pilot system implemented and tested in an experimental environment
- Uses Kinect as the input device

Kinect
- RGB camera developed for the Xbox game console
  - Can provide a depth image of the scene and a "skeleton" if a person is detected in the scene

Experiment Setup

Face Image Segmentation using Kinect

Two image classes: normal and anomalous faces

ATM Security: Results
- Image set inflated with translated, rotated and mirrored copies of the original images
- A k-NN classifier in the input space was compared with HTM combined with k-NN and HTM combined with an SVM classifier
- Scenario 1: the whole data set was used
- Scenario 2: translated images were excluded from the training set
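Inflating an image set with transformed copies can be sketched as below. Images are plain nested lists of pixel values, the rotation is restricted to 90 degrees and the translation to a horizontal shift to keep the example short. The actual angles and offsets used in the experiment are not given, so these choices and the function names are assumptions.

```python
def mirror(image):
    """Flip the image horizontally."""
    return [row[::-1] for row in image]

def rotate90(image):
    """Rotate the image by 90 degrees clockwise."""
    return [list(row) for row in zip(*image[::-1])]

def translate(image, dx, fill=0):
    """Shift the image right by dx pixels (left if dx is negative)."""
    width = len(image[0])
    shifted = []
    for row in image:
        if dx >= 0:
            shifted.append(([fill] * dx + row)[:width])
        else:
            shifted.append((row + [fill] * (-dx))[-width:])
    return shifted

def inflate(images):
    """Return each original image followed by its transformed copies."""
    inflated = []
    for image in images:
        inflated.append(image)
        inflated.append(mirror(image))
        inflated.append(rotate90(image))
        inflated.append(translate(image, 1))
    return inflated

image = [[1, 2], [3, 4]]
augmented = inflate([image])
for copy in augmented:
    print(copy)
```

Scenario 2 would correspond to leaving the `translate` copies out of the training set while keeping them in the test set.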

New features and algorithms for the HTM
- New temporal pooler
- Images transformed to different image spaces (different image features)
- Various settings for the temporal pooler
- SOM (self-organizing map) as spatial pooler

Testing of new image features
- Dataset: selected images from Caltech 256 (10 classes, 30 testing and 30 training images per class)
- Single-layer network
  - 1-NN classifier as the top node
  - Image features extracted from image patches corresponding to the receptive fields of the nodes
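A rough sketch of this setup: patches are cut from the image with a sliding window (one patch per receptive field), and a 1-NN classifier picks the label of the closest training vector. The square patch shape, the `step` parameter and the function names are assumptions for illustration; real features would come from the chosen image space rather than raw pixel values.

```python
def extract_patches(image, patch_size, step):
    """Cut square patches from the image using a sliding window.

    Each patch is flattened row by row into a feature vector.
    """
    height, width = len(image), len(image[0])
    patches = []
    for top in range(0, height - patch_size + 1, step):
        for left in range(0, width - patch_size + 1, step):
            patches.append([
                image[top + r][left + c]
                for r in range(patch_size)
                for c in range(patch_size)
            ])
    return patches

def nearest_neighbour_label(query, examples):
    """1-NN: return the label of the training vector closest to `query`."""
    _, label = min(
        examples,
        key=lambda example: sum((q - v) ** 2 for q, v in zip(query, example[0])),
    )
    return label

# 4x4 toy image with pixel values 0..15.
image = [[4 * r + c for c in range(4)] for r in range(4)]
patches_step2 = extract_patches(image, patch_size=2, step=2)
patches_step1 = extract_patches(image, patch_size=2, step=1)

examples = [([0, 1, 4, 5], "dark"), ([10, 11, 14, 15], "bright")]
label = nearest_neighbour_label(patches_step2[0], examples)
print(len(patches_step2), len(patches_step1), label)
```

A smaller step produces more, overlapping patches, so the step size trades the amount of training data against computation.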

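Results (% TE) by window step size in pixels

Features        s1      s2      s4      s8
RGB     CA      42.87   41.61   40.86   38.00
RGB     med     42.50   41.33   41.00   38.17
Grey    CA      40.13   39.63   38.41   34.68
Grey    med     39.67   39.33   37.83   35.67
Canny   CA      40.35   42.33   43.66   43.55
Canny   med     40.50, 41.83, 43.00 (only three values survive; column assignment unclear)
Lab     CA      44.92   44.17   44.23   43.17
Lab     med     44.83, 44.50, 43.67 (only three values survive; column assignment unclear)
GLD     CA      45.95   46.01   46.43   46.10
GLD     med     46.00   46.12   46.17   46.00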

Problems: background

Thank you for your attention