Evaluating Pre-Processing Pipelines for Thermal-Visual Smart Camera

Slides:

Advertisements

Similar presentations

Object Recognition with Features Inspired by Visual Cortex T. Serre, L. Wolf, T. Poggio Presented by Andrew C. Gallagher Jan. 25, 2007.

Advertisements

For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.

LBVC: Towards Low-bandwidth Video Chats on Smartphones Xin Qi, Qing Yang, David T. Nguyen, Gang Zhou, Ge Peng College of William and Mary 1.

Learning Convolutional Feature Hierarchies for Visual Recognition

Characterization Presentation Neural Network Implementation On FPGA Supervisor: Chen Koren Maria Nemets Maxim Zavodchik

Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.

Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.

3. Introduction to Digital Image Analysis

A Wrapper-Based Approach to Image Segmentation and Classification Michael E. Farmer, Member, IEEE, and Anil K. Jain, Fellow, IEEE.

A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications Lucia Maddalena and Alfredo Petrosino, Senior Member, IEEE.

A Fast and Efficient VOP Extraction Method Based on Watershed Segmentation Alireza Tavakkoli Dr. Shohreh Kasaei Gholamreza Amayeh Sharif University of.

Feature extraction Feature extraction involves finding features of the segmented image. Usually performed on a binary image produced from.

Image Subtraction for Real Time Moving Object Extraction Shahbe Mat Desa, Qussay A. Salih, CGIV’04.

EEL 6935 Embedded Systems Long Presentation 2 Group Member: Qin Chen, Xiang Mao 4/2/20101.

© 2013 IBM Corporation Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments Presented by Shashank Mujumdar IBM Research,

Dynamic Hardware/Software Partitioning: A First Approach Authors -Greg Stitt, Roman Lysecky, Frank Vahid Presented By : Aditya Kanawade Guru Sharan 1.

SensEye: A Multi-Tier Camera Sensor Network by Purushottam Kulkarni, Deepak Ganesan, Prashant Shenoy, and Qifeng Lu Presenters: Yen-Chia Chen and Ivan.

Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.

Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.

Seok-Won Seong and Prabhat Mishra University of Florida IEEE Transaction on Computer Aided Design of Intigrated Systems April 2008, Vol 27, No. 4 Rahul.

Operated by Los Alamos National Security, LLC for NNSA U N C L A S S I F I E D Slide 1 Sensor network based vehicle classification and license plate identification.

Image Recognition using Hierarchical Temporal Memory Radoslav Škoviera Ústav merania SAV Fakulta matematiky, fyziky a informatiky UK.

Real-Time Human Posture Reconstruction in Wireless Smart Camera Networks Chen Wu, Hamid Aghajan Wireless Sensor Network Lab, Stanford University, USA IPSN.

ICPR/WDIA-2012 High Quality Novel View Synthesis Based on Low Resolution Depth Image and High Resolution Color Image Jui-Chiu Chiang, Zheng-Feng Liu, and.

Sluzek 142/MAPLD Development of a Reconfigurable Sensor Network for Intrusion Detection Andrzej Sluzek & Palaniappan Annamalai Intelligent Systems.

“Low-Power, Real-Time Object- Recognition Processors for Mobile Vision Systems”, IEEE Micro Jinwook Oh ; Gyeonghoon Kim ; Injoon Hong ; Junyoung.

LoCaF: Detecting Real-World States with Lousy Wireless Cameras Benjamin Meyer, Richard Mietz, Kay Römer 1.

Presented by Leland Holmquest Privacy Sensitive Surveillance for Assisted Living – A Smart Camera Approach Sven Fleck & Wolfgang Straβer CS895: Software.

“Fingerprint Recognition based on Silicon Chips”.

Group No 5 1.Muhammad Talha Islam 2.Karim Akhter 3.Muhammad Arif 4.Muhammad Umer Khalid.

Tracking with CACTuS on Jetson Running a Bayesian multi object tracker on a low power, embedded system School of Information Technology & Mathematical.

AUTOMATIC TARGET RECOGNITION OF CIVILIAN TARGETS September 28 th, 2004 Bala Lakshminarayanan.

Video Segmentation Prepared By M. Alburbar Supervised By: Mr. Nael Abu Ras University of Palestine Interactive Multimedia Application Development.

Design & Implementation VHDL ET062G & ET063G Lecture 7 Najeem Lawal 2012.

Map image compression for real-time applications UNIVERSITY OF JOENSUU DEPARTMENT OF COMPUTER SCIENCE Image Compression Research group:

ECE472/572 - Lecture 14 Morphological Image Processing 11/17/11.

Robust Segmentation of Freight Containers in Train Monitoring Videos Qing-Jie Kong*, Avinash Kumar**, Narendra Ahuja**,Yuncai Liu* **Department of Electrical.

Accelerating K-Means Clustering with Parallel Implementations and GPU Computing Janki Bhimani Miriam Leeser Ningfang Mi

SMART CAMERAS AS EMBEDDED SYSTEM SMVEC. SMART CAMERA  See, think and act  Intelligent cameras  Embedding of image processing algorithms  Can be networked.

Authors: Christos Stergiou Andreas P. Plageras Kostas E. Psannis

Naifan Zhuang, Jun Ye, Kien A. Hua

Reza Yazdani Albert Segura José-María Arnau Antonio González

Hiba Tariq School of Engineering

University of Granada UGR

Experimental Demonstration of High-Order

Instruction Packing for a 32-bit Stack-Based Processor Witcharat Lertteerawattana and Prabhas Chongstitvatana Department of Computer Engineering Chulalongkorn.

Implementing Localization

Monitoring Robot Prepared by: Hanin Mizyed ,Abdalla Melhem

Gait Recognition Gökhan ŞENGÜL.

Realizing the potential of mobile devices as experimental devices: Human computer interface and performance considerations Chiung Ching Ho & C. Eswaran.

Vesa Klumpp, Knowtion Applications of Intelligent Control in Industry and Adaption to Space Missions Vesa Klumpp, Knowtion

Ajita Rattani and Reza Derakhshani,

Real-Time Human Pose Recognition in Parts from Single Depth Image

Supporting Fault-Tolerance in Streaming Grid Applications

Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science

Image Processing for Physical Data

Digital Processing Platform

Learning to See in the Dark

RGB-D Image for Scene Recognition by Jiaqi Guo

A Distributed System for Real-time Volume Reconstruction

Degree-aware Hybrid Graph Traversal on FPGA-HMC Platform

Visual Tracking on an Autonomous Self-contained Humanoid Robot

Pose Estimation for non-cooperative Spacecraft Rendevous using CNN

Declarative Transfer Learning from Deep CNNs at Scale

LANMC: LSTM-Assisted Non-Rigid Motion Correction

Neural Network Pipeline CONTACT & ACKNOWLEDGEMENTS

High Capacity Data Hiding for Grayscale Images

The Image The pixels in the image The mask The resulting image 255 X

PROJECT IDENTIFICATION

Presentation transcript:

Evaluating Pre-Processing Pipelines for Thermal-Visual Smart Camera Authors: Irida Shallari, Muhammad Imran, Najeem Lawal, Mattias O’Nils

Embedded processing in smart camera Benefits Reduced data for communication and analysis Real-time monitoring Challenges Energy consumption Performance Limited resources Computational Memory Energy Communication bandwidth

Embedded processing in smart camera

Surveillance of vulnerable areas

Alternative approach High frequency Low frequency

Problem statement Low level Design exploration in pixel-based image pre-processing pipeline for multi-sensor smart camera with respect to data communication vs classification accuracy. Intermediate High level -Spatial filtering -Temporal filtering -Segmentation -Morphology -Labelling -Classfication -Recognition Xmend trade-off communication vs performance

Classification algorithms 1280×1024 320×240 Capturing Capturing ROI Segmented ROI Segmented ROI ROI Classification algorithms SURF, SIFT Pre-processing architecture for smart camera

Image dataset Human and cyclist

Experimental setup Nexys 4 board which includes Artix-7 FPGA NVIDIA Tegra TK1 An IDS CMOS µEye visual camera focal length 12 mm, a resolution of 1280×1024 and a pixel pitch of 5.3 μm A Tamarisk 320 thermal camera focal length 19 mm, a sensor resolution of 320 x 240 and a pixel pitch of 17 μm

Design exploration Thermal Thermal binary Visual Visual binary

Architecture Camera node Client device 320×240 Human Animal Cyclist Capture Background subtract Camera node Segmentation Morphology Binary ROI coding Decoding Classification Client device Human Animal Cyclist

Classification accuracy vs Communication cost

Alternative approach High frequency Low frequency

Proposed architecture

Classification accuracy

Data compression (1280×1024) 3840 (200×300) 1406 (250×500) 2930 3 4 59 Image type Res./Size Kbytes (Raw data) Kbytes (RAW_ROI Kbytes (JPEG_ROI) KBytes (Bin_ROI) KBytes (G4_ROI) Human Cycl. Visual (1280×1024) 3840 (200×300) 1406 (250×500) 2930 3 4 59 122 2 Thermal (320×240) 75 (100×150) 117 (100×250) 20 1 15

Conclusions We propose an architectural approach in which: Thermal images are used as a mask to extract ROI Frequently transmitted ROI visual data Compressed visual data transmitted rarely for situational awareness Visual compressed ROI offers: 13%-64% better classification accuracy than binary visual ROI or thermographic images (raw_ROI, bin_ROI) New applications requiring situational awareness.

Questions?

Thank you!