Evaluating Pre-Processing Pipelines for Thermal-Visual Smart Camera

Slides:



Advertisements
Similar presentations
Object Recognition with Features Inspired by Visual Cortex T. Serre, L. Wolf, T. Poggio Presented by Andrew C. Gallagher Jan. 25, 2007.
Advertisements

For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
LBVC: Towards Low-bandwidth Video Chats on Smartphones Xin Qi, Qing Yang, David T. Nguyen, Gang Zhou, Ge Peng College of William and Mary 1.
Learning Convolutional Feature Hierarchies for Visual Recognition
Characterization Presentation Neural Network Implementation On FPGA Supervisor: Chen Koren Maria Nemets Maxim Zavodchik
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
3. Introduction to Digital Image Analysis
A Wrapper-Based Approach to Image Segmentation and Classification Michael E. Farmer, Member, IEEE, and Anil K. Jain, Fellow, IEEE.
A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications Lucia Maddalena and Alfredo Petrosino, Senior Member, IEEE.
A Fast and Efficient VOP Extraction Method Based on Watershed Segmentation Alireza Tavakkoli Dr. Shohreh Kasaei Gholamreza Amayeh Sharif University of.
Feature extraction Feature extraction involves finding features of the segmented image. Usually performed on a binary image produced from.
Image Subtraction for Real Time Moving Object Extraction Shahbe Mat Desa, Qussay A. Salih, CGIV’04.
EEL 6935 Embedded Systems Long Presentation 2 Group Member: Qin Chen, Xiang Mao 4/2/20101.
© 2013 IBM Corporation Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments Presented by Shashank Mujumdar IBM Research,
Dynamic Hardware/Software Partitioning: A First Approach Authors -Greg Stitt, Roman Lysecky, Frank Vahid Presented By : Aditya Kanawade Guru Sharan 1.
SensEye: A Multi-Tier Camera Sensor Network by Purushottam Kulkarni, Deepak Ganesan, Prashant Shenoy, and Qifeng Lu Presenters: Yen-Chia Chen and Ivan.
Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.
Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.
Seok-Won Seong and Prabhat Mishra University of Florida IEEE Transaction on Computer Aided Design of Intigrated Systems April 2008, Vol 27, No. 4 Rahul.
Operated by Los Alamos National Security, LLC for NNSA U N C L A S S I F I E D Slide 1 Sensor network based vehicle classification and license plate identification.
Image Recognition using Hierarchical Temporal Memory Radoslav Škoviera Ústav merania SAV Fakulta matematiky, fyziky a informatiky UK.
Real-Time Human Posture Reconstruction in Wireless Smart Camera Networks Chen Wu, Hamid Aghajan Wireless Sensor Network Lab, Stanford University, USA IPSN.
ICPR/WDIA-2012 High Quality Novel View Synthesis Based on Low Resolution Depth Image and High Resolution Color Image Jui-Chiu Chiang, Zheng-Feng Liu, and.
Sluzek 142/MAPLD Development of a Reconfigurable Sensor Network for Intrusion Detection Andrzej Sluzek & Palaniappan Annamalai Intelligent Systems.
“Low-Power, Real-Time Object- Recognition Processors for Mobile Vision Systems”, IEEE Micro Jinwook Oh ; Gyeonghoon Kim ; Injoon Hong ; Junyoung.
LoCaF: Detecting Real-World States with Lousy Wireless Cameras Benjamin Meyer, Richard Mietz, Kay Römer 1.
Presented by Leland Holmquest Privacy Sensitive Surveillance for Assisted Living – A Smart Camera Approach Sven Fleck & Wolfgang Straβer CS895: Software.
“Fingerprint Recognition based on Silicon Chips”.
Group No 5 1.Muhammad Talha Islam 2.Karim Akhter 3.Muhammad Arif 4.Muhammad Umer Khalid.
Tracking with CACTuS on Jetson Running a Bayesian multi object tracker on a low power, embedded system School of Information Technology & Mathematical.
AUTOMATIC TARGET RECOGNITION OF CIVILIAN TARGETS September 28 th, 2004 Bala Lakshminarayanan.
Video Segmentation Prepared By M. Alburbar Supervised By: Mr. Nael Abu Ras University of Palestine Interactive Multimedia Application Development.
Design & Implementation VHDL ET062G & ET063G Lecture 7 Najeem Lawal 2012.
Map image compression for real-time applications UNIVERSITY OF JOENSUU DEPARTMENT OF COMPUTER SCIENCE Image Compression Research group:
ECE472/572 - Lecture 14 Morphological Image Processing 11/17/11.
Robust Segmentation of Freight Containers in Train Monitoring Videos Qing-Jie Kong*, Avinash Kumar**, Narendra Ahuja**,Yuncai Liu* **Department of Electrical.
Accelerating K-Means Clustering with Parallel Implementations and GPU Computing Janki Bhimani Miriam Leeser Ningfang Mi
SMART CAMERAS AS EMBEDDED SYSTEM SMVEC. SMART CAMERA  See, think and act  Intelligent cameras  Embedding of image processing algorithms  Can be networked.
Authors: Christos Stergiou Andreas P. Plageras Kostas E. Psannis
Naifan Zhuang, Jun Ye, Kien A. Hua
Reza Yazdani Albert Segura José-María Arnau Antonio González
Hiba Tariq School of Engineering
University of Granada UGR
Experimental Demonstration of High-Order
Instruction Packing for a 32-bit Stack-Based Processor Witcharat Lertteerawattana and Prabhas Chongstitvatana Department of Computer Engineering Chulalongkorn.
Implementing Localization
Monitoring Robot Prepared by: Hanin Mizyed ,Abdalla Melhem
Gait Recognition Gökhan ŞENGÜL.
Realizing the potential of mobile devices as experimental devices: Human computer interface and performance considerations Chiung Ching Ho & C. Eswaran.
Vesa Klumpp, Knowtion Applications of Intelligent Control in Industry and Adaption to Space Missions Vesa Klumpp, Knowtion
Ajita Rattani and Reza Derakhshani,
Real-Time Human Pose Recognition in Parts from Single Depth Image
Supporting Fault-Tolerance in Streaming Grid Applications
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Image Processing for Physical Data
Digital Processing Platform
Learning to See in the Dark
RGB-D Image for Scene Recognition by Jiaqi Guo
A Distributed System for Real-time Volume Reconstruction
Degree-aware Hybrid Graph Traversal on FPGA-HMC Platform
Visual Tracking on an Autonomous Self-contained Humanoid Robot
Pose Estimation for non-cooperative Spacecraft Rendevous using CNN
Declarative Transfer Learning from Deep CNNs at Scale
LANMC: LSTM-Assisted Non-Rigid Motion Correction
Neural Network Pipeline CONTACT & ACKNOWLEDGEMENTS
High Capacity Data Hiding for Grayscale Images
The Image The pixels in the image The mask The resulting image 255 X
PROJECT IDENTIFICATION
Presentation transcript:

Evaluating Pre-Processing Pipelines for Thermal-Visual Smart Camera Authors: Irida Shallari, Muhammad Imran, Najeem Lawal, Mattias O’Nils

Embedded processing in smart camera Benefits Reduced data for communication and analysis Real-time monitoring Challenges Energy consumption Performance Limited resources Computational Memory Energy Communication bandwidth

Embedded processing in smart camera

Surveillance of vulnerable areas

Alternative approach High frequency Low frequency

Problem statement Low level Design exploration in pixel-based image pre-processing pipeline for multi-sensor smart camera with respect to data communication vs classification accuracy. Intermediate High level -Spatial filtering -Temporal filtering -Segmentation -Morphology -Labelling -Classfication -Recognition Xmend trade-off communication vs performance

Classification algorithms 1280×1024 320×240 Capturing Capturing ROI Segmented ROI Segmented ROI ROI Classification algorithms SURF, SIFT Pre-processing architecture for smart camera

Image dataset Human and cyclist

Experimental setup Nexys 4 board which includes Artix-7 FPGA NVIDIA Tegra TK1 An IDS CMOS µEye visual camera focal length 12 mm, a resolution of 1280×1024 and a pixel pitch of 5.3 μm A Tamarisk 320 thermal camera focal length 19 mm, a sensor resolution of 320 x 240 and a pixel pitch of 17 μm

Design exploration Thermal Thermal binary Visual Visual binary

Architecture Camera node Client device 320×240 Human Animal Cyclist Capture Background subtract Camera node Segmentation Morphology Binary ROI coding Decoding Classification Client device Human Animal Cyclist

Classification accuracy vs Communication cost

Alternative approach High frequency Low frequency

Proposed architecture

Classification accuracy

Data compression (1280×1024) 3840 (200×300) 1406 (250×500) 2930 3 4 59 Image type Res./Size Kbytes (Raw data) Kbytes (RAW_ROI Kbytes (JPEG_ROI) KBytes (Bin_ROI) KBytes (G4_ROI) Human Cycl. Visual (1280×1024) 3840 (200×300) 1406 (250×500) 2930 3 4 59 122 2 Thermal (320×240) 75 (100×150) 117 (100×250) 20 1 15

Conclusions We propose an architectural approach in which: Thermal images are used as a mask to extract ROI Frequently transmitted ROI visual data Compressed visual data transmitted rarely for situational awareness Visual compressed ROI offers: 13%-64% better classification accuracy than binary visual ROI or thermographic images (raw_ROI, bin_ROI) New applications requiring situational awareness.

Questions?

Thank you!