Real-Time VLSI Architecture for Detection of Moving Object Using Wronskian Determinant R. Aguilar-Ponce, J. Tessier, C. Emmela, A. Baker, J. Das, J.L.

Slides:

Advertisements

Similar presentations

Chapter 8-Video.

Advertisements

1 A HIGH THROUGHPUT PIPELINED ARCHITECTURE FOR H.264/AVC DEBLOCKING FILTER Kefalas Nikolaos, Theodoridis George VLSI Design Lab. Electrical & Computer.

Sumitha Ajith Saicharan Bandarupalli Mahesh Borgaonkar.

Change Detection C. Stauffer and W.E.L. Grimson, “Learning patterns of activity using real time tracking,” IEEE Trans. On PAMI, 22(8): , Aug 2000.

1 KU College of Engineering Elec 204: Digital Systems Design Lecture 9 Programmable Configurations Read Only Memory (ROM) – –a fixed array of AND gates.

INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, ICT '09. TAREK OUNI WALID AYEDI MOHAMED ABID NATIONAL ENGINEERING SCHOOL OF SFAX New Low Complexity.

Design of a Power-Efficient Interleaved CIC Architecture for Software Defined Radio Receivers By J.Luis Tecpanecatl-Xihuitl, Ruth Aguilar-Ponce, Ashok.

Motivation Application driven -- VoD, Information on Demand (WWW), education, telemedicine, videoconference, videophone Storage capacity Large capacity.

Vision Based Control Motion Matt Baker Kevin VanDyke.

Adviser ： Ming-Yuan Shieh Student ID ： M Student ： Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.

Image-Based Target Detection and Tracking Aggelos K. Katsaggelos Thrasyvoulos N. Pappas Peshala V. Pahalawatta C. Andrew Segall SensIT, Santa Fe January.

VIPER DSPS 1998 Slide 1 A DSP Solution to Error Concealment in Digital Video Eduardo Asbun and Edward J. Delp Video and Image Processing Laboratory (VIPER)

Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,

Video enhances, dramatizes, and gives impact to your multimedia application. Your audience will better understand the message of your application.

UNIVERSITY OF MASSACHUSETTS Dept

International Conference on Image Analysis and Recognition (ICIAR’09). Halifax, Canada, 6-8 July Video Compression and Retrieval of Moving Object.

Chapter 2 Computer Imaging Systems. Content Computer Imaging Systems.

High Dynamic Range Emeka Ezekwe M11 Christopher Thayer M12 Shabnam Aggarwal M13 Charles Fan M14 Manager: Matthew Russo 6/26/

Effective Gaussian mixture learning for video background subtraction Dar-Shyang Lee, Member, IEEE.

Sample rate conversion At times, it will be necessary to convert the sampling rate in a source signal to some other sampling rate Consider converting from.

Illumination Normalization with Time-Dependent Intrinsic Images for Video Surveillance Yasuyuki Matsushita, Member, IEEE, Ko Nishino, Member, IEEE, Katsushi.

Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques

Xinqiao LiuRate constrained conditional replenishment1 Rate-Constrained Conditional Replenishment with Adaptive Change Detection Xinqiao Liu December 8,

Pipelining By Toan Nguyen.

CS 105 Digital Logic Design

EE392J Final Project, March 20, Multiple Camera Object Tracking Helmy Eltoukhy and Khaled Salama.

1 REAL-TIME IMAGE PROCESSING APPROACH TO MEASURE TRAFFIC QUEUE PARAMETERS. M. Fathy and M.Y. Siyal Conference 1995: Image Processing And Its Applications.

SensEye: A Multi-Tier Camera Sensor Network by Purushottam Kulkarni, Deepak Ganesan, Prashant Shenoy, and Qifeng Lu Presenters: Yen-Chia Chen and Ivan.

Juanjo Noguera Xilinx Research Labs Dublin, Ireland Ahmed Al-Wattar Irwin O. Irwin O. Kennedy Alcatel-Lucent Dublin, Ireland.

Tricolor Attenuation Model for Shadow Detection. INTRODUCTION Shadows may cause some undesirable problems in many computer vision and image analysis tasks,

Lecture 03 Fasih ur Rehman

H.264 Deblocking Filter Irfan Ullah Department of Information and Communication Engineering Myongji university, Yongin, South Korea Copyright © solarlits.com.

Power Reduction for FPGA using Multiple Vdd/Vth

: Chapter 12: Image Compression 1 Montri Karnjanadecha ac.th/~montri Image Processing.

© 2011 The McGraw-Hill Companies, Inc. All rights reserved Chapter 6: Video.

Chapter 8 Problems Prof. Sin-Min Lee Department of Mathematics and Computer Science.

DIGITAL Video. Video Creation Video captures the real world therefore video cannot be created in the same sense that images can be created video must.

Implementing Codesign in Xilinx Virtex II Pro Betim Çiço, Hergys Rexha Department of Informatics Engineering Faculty of Information Technologies Polytechnic.

Sensing for Robotics & Control – Remote Sensors R. R. Lindeke, Ph.D.

A Camera-Projector System for Real-Time 3D Video Marcelo Bernardes, Luiz Velho, Asla Sá, Paulo Carvalho IMPA - VISGRAF Laboratory Procams 2005.

Towards real-time camera based logos detection Mathieu Delalandre Laboratory of Computer Science, RFAI group, Tours city, France Osaka Prefecture Partnership.

MACHINE VISION Machine Vision System Components ENT 273 Ms. HEMA C.R. Lecture 1.

Ch5: TELEVISION.

Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp

1 by: Ilya Melamed Supervised by: Eyal Sarfati High Speed Digital Systems Lab.

Expectation-Maximization (EM) Case Studies

Computer Architecture Lecture 32 Fasih ur Rehman.

Topics covered: Arithmetic CSE243: Introduction to Computer Architecture and Hardware/Software Interface.

LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA Project Guide: Smt. Latha Dept of E & C JSSATE, Bangalore. From: N GURURAJ M-Tech,

Student Name: Honghao Chen Supervisor: Dr Jimmy Li Co-Supervisor: Dr Sherry Randhawa.

Digital Video Digital video is basically a sequence of digital images  Processing of digital video has much in common with digital image processing First.

Spatiotemporal Saliency Map of a Video Sequence in FPGA hardware David Boland Acknowledgements: Professor Peter Cheung Mr Yang Liu.

IntroductiontMyn1 Introduction MPEG, Moving Picture Experts Group was started in 1988 as a working group within ISO/IEC with the aim of defining standards.

1 Fundamentals of Computer Science Combinational Circuits.

Recursive Architectures for 2DLNS Multiplication RESEARCH CENTRE FOR INTEGRATED MICROSYSTEMS - UNIVERSITY OF WINDSOR 11 Recursive Architectures for 2DLNS.

ECE DIGITAL LOGIC LECTURE 15: COMBINATIONAL CIRCUITS Assistant Prof. Fareena Saqib Florida Institute of Technology Fall 2015, 10/20/2015.

Senior Project Poster Day 2006, CIS Dept. University of Pennsylvania One if by land… Yosef Weiner, David Charles Pollack Faculty Advisor: C.J. Taylor,

EEL 5722 FPGA Design Fall 2003 Digit-Serial DSP Functions Part I.

Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons.

Motion tracking TEAM D, Project 11: Laura Gui - Timisoara Calin Garboni - Timisoara Peter Horvath - Szeged Peter Kovacs - Debrecen.

CONTENTS:  Introduction.  Face recognition task.  Image preprocessing.  Template Extraction and Normalization.  Template Correlation with image database.

IMAGE PROCESSING APPLIED TO TRAFFIC QUEUE DETECTION ALGORITHM.

Understanding Analogue and Digital Video Lesson 1

Parallelizing an Image Compression Toolbox

UNIVERSITY OF MASSACHUSETTS Dept

Chapter 6: Video.

DESIGN AND IMPLEMENTATION OF DIGITAL FILTER

A Parallel Algorithm for Hardware Implementation of Inverse Halftoning

DSP Architectures for Future Wireless Base-Stations

Presentation transcript:

Real-Time VLSI Architecture for Detection of Moving Object Using Wronskian Determinant R. Aguilar-Ponce, J. Tessier, C. Emmela, A. Baker, J. Das, J.L. Tecpanecatl-Xihuitl, A. Kumar and Magdy Bayoumi Center for Advance Computer Studies University of Louisiana at Lafayette

MWSCAS 2005Center for Advanced Computer Studies 2 Agenda 1. Introduction 2. Proposed Architecture 3. Results 4. Conclusion

MWSCAS 2005Center for Advanced Computer Studies 3 Introduction  Change detection takes one or several references frames and models the background and foreground of the image. Background Foreground Detect: Moving objects Appearing objects Disappearing objects Discard: Changes due to global illumination variations Shadow cast by moving objects Background Subtraction Technique

MWSCAS 2005Center for Advanced Computer Studies 4 Introduction  Applications that extract high level information from raw data, i.e. video stream require accurate and robust Change Detection Systems.  Such applications include:  Video surveillance  Remote sensing  Object-based video coding  Smart cameras

MWSCAS 2005Center for Advanced Computer Studies 5 Introduction  Video Surveillance Systems must determined when an intruder has appear on the scene  Tracking of moving automobiles and persons are issues of interests on these systems.  In order to achieve these task, change detection must be performed

MWSCAS 2005Center for Advanced Computer Studies 6 Introduction  Handheld devices such as cellular phones or PDAs include acquisition, storage and/or transmission of images. In order to achieve these operations, images must be compressed.  In an Object-based Video Coding approach a scene is represented as a composition of objects, which can be independently processed and coded.  In the object-based approach, the moving objects in the video scene are extracted, and each object is represented by its shape, motion, and texture.

MWSCAS 2005Center for Advanced Computer Studies 7 Introduction  While today’s digital cameras capture images, smart cameras capture high-level descriptions of the scene and analyze what they see  A smart camera combines video sensing, high-level video processing and communication within a single embedded device.

MWSCAS 2005Center for Advanced Computer Studies 8 Goal  Change detection has been performed purely in software.  The problem of object detection, however, becomes critical in the upcoming wireless visual sensors because of size and power constraints.  The need for low-power, small size, hardware implementations is greatly felt.  This paper introduces a VLSI architecture for Wronskian Change Detector (WCD).

MWSCAS 2005Center for Advanced Computer Studies 9 Background Subtraction Techniques  The most instinctive technique is Frame Differencing followed by thresholding.  Change is detected if the difference of the corresponding pixels exceeds a preset threshold.  The advantage of this technique is its low computational complexity, however it is very susceptible to noise and illumination changes.

MWSCAS 2005Center for Advanced Computer Studies 10 Background Subtraction Techniques  Median filter is one of the most popular background subtraction techniques.  Median of each pixel of all the frames in the buffer constitutes the background estimation.  Background pixels are considered to be those that stay on more than half of the frames on the buffer.  However, this technique requires a buffer large enough to store L frames.

MWSCAS 2005Center for Advanced Computer Studies 11 Background Subtraction Techniques  Mixture of Gaussian is a recursive background technique, that recursively updates the background model based on each input frame.  This method models each background pixel by a mixture of K Gaussian distributions (K is a number between 3 and 5).  Different Gaussians are assumed to represent different colors. The probable background colors are the ones that stay longer and more static.  This technique is computationally intensive; its parameters require careful tuning and it is very sensitive to sudden changes in global illumination. Any error in the background estimation can remain for a long period due to its recursive nature

MWSCAS 2005Center for Advanced Computer Studies 12 Background Subtraction Techniques  Wronskian Change Detector employs the Wronskian of intensity ratios as a measure of change.  A large mean or large variance of the intensity ratios increases the Wronskian value.  This method can detect object interiors and structural changes. Also, WCD is robust against illumination changes.  WCD is a suitable algorithm to be implemented in real-time due to its low complexity. Also, this technique requires only one previous frame; therefore it is appropriate for applications where resources are limited

MWSCAS 2005Center for Advanced Computer Studies 13 Background Subtraction Techniques Method AdaptabilityPrecisionComplexityTuning Global Illumination Changes Storage Requirement Frame Differencing HighLow SimpleSensitive 1 Previous Frame Median FilterHighMedium Simple Less Sensitive L Previous Frames Mixture of Gaussian LowHigh Complex SensitiveNone Wronskian Change Detector HighMediumLowSimpleRobust 1 Previous Frame

MWSCAS 2005Center for Advanced Computer Studies 14 Background Subtraction Techniques

MWSCAS 2005Center for Advanced Computer Studies 15 Background Subtraction Techniques Frame DifferencingMedian Filter Wronskian Change Detector

MWSCAS 2005Center for Advanced Computer Studies 16 Wronskian Change Detector  In order to determine if a change has occurred, a region of support is assigned to each pixel.  The size of the region of support can vary from 3 × 3, 5 × 5 and 9 × 9 pixels x1x1 x2x2 x3x3 x4x4 x5x5 x6x6 x7x7 x8x8 x9x9

MWSCAS 2005Center for Advanced Computer Studies 17 Wronskian Change Detector Window size 3 × 3 Window size 5 × 5 Window size 9 × 9

MWSCAS 2005Center for Advanced Computer Studies 18 Wronskian Change Detector  Wronskian Change Detector employs the following equation  W(x/y) detects changes corresponding to dark zones, while its inverse ration W(y/x) finds if a change has occurred in bright zones. Therefore, computing both values allows robust detection against global illumination changes.  In our simulations, sizes of region of support larger than 3 do not provide better results but increases the computational complexity. Therefore a fixed value of 3 is employed in our approach.

MWSCAS 2005Center for Advanced Computer Studies 19 NTSC and PAL Standards  American Video standard, National Television System Committee (NTSC).  The NTSC standard displays 60 fields per second. Each field is composed by even and odd lines.  The NTSC signal transmits the odd fields first and then the even fields  The even and odd fields are displayed sequentially, thus interlacing the full frame.  PAL (Phase Alternation by Line) standard is the dominant television standard in Europe.  The distinction between these standards is that color is handled differently.

MWSCAS 2005Center for Advanced Computer Studies 20 NTSC/PAL Odd Field Even Field x1x1 x2x2 x3x3 x4x4 x5x5 x6x6 x7x7 x8x8 x9x9 x1x1 x2x2 x3x3 x4x4 x5x5 x6x6 x7x7 x8x8 x9x9 Odd Field Even Field

MWSCAS 2005Center for Advanced Computer Studies 21 y1y1 y2y2 y3y3 y4y4 y5y5 y6y6 y7y7 y8y8 y9y9 Wronskian Change Detector x1x1 x2x2 x3x3 x4x4 x5x5 x6x6 x7x7 x8x8 x9x9 where Previous FrameCurrent Frame x2x2 x3x3 x 10 x5x5 x6x6 x 11 x8x8 x9x9 x12x12

MWSCAS 2005Center for Advanced Computer Studies 22 Proposed Architecture  Proposed architecture is composed by three units:  Processing unit  Main Controller  Memory Unit  Decoder and encoder are used to process both standards NTSC and PAL Frame Buffer 300 Kb Output Buffer 300 Kb Memory Unit Processing Unit Pipeline Processing Element Adder Tree Queue 1 Queue 2 Main Controller Decoder NTSC/PAL Encoder VGA Output

MWSCAS 2005Center for Advanced Computer Studies 23 Pipeline Processing Element  To achieve a low-power implementation a 8-bit unsigned integer arithmetic was used.  There are two main concerns:  The first one is how to capture the range of the function with only 8-bit unsigned arithmetic.  The second concern is guaranteeing precision, considering that threshold values are in the range of 0.6 to 0.7 to detect a change

MWSCAS 2005Center for Advanced Computer Studies 24 Pipeline Processing Element  The PE must be designed to capture the range of D(xi,yi) that could indicate a change.  Therefore, the equation must be scaled so that an unsigned 8-bit integer threshold can be used and all overflows are saturated.  Only the partial range of D(xi,yi) where THmin ≤ D(xi,yi) ≤ nTHmax is significant, where THmin and THmax are the minimum and maximum threshold to be used

MWSCAS 2005Center for Advanced Computer Studies 25 Pipeline Processing Element  This solved the problem of precision, but creates results that are too large to add n times.  For that reason, the five least significant bit of the product are discarded after multiplication, and the rest of the bits are employed as the result 8-bit Division 1 st stage Division (2 nd stage) and subtraction 8-Bit Multiplication First stage Multiplication (2 nd stage) Latch x y D(x,y)

MWSCAS 2005Center for Advanced Computer Studies 26 Pipeline Processing Element  The implementation of the system is done with a fixed region of support size of 3 × 3.  The main components of the PE are divider, adder and multiplier.  Multiplication is done by Booth algorithm because it represents a good trade-off between speed and power for 8 bit fixed point arithmetic  Integer division using 8 conditional subtractors is simple and fast enough for our application  The architecture is capable of analyze frame size of 640 × 480 pixels

MWSCAS 2005Center for Advanced Computer Studies 27 Processing Unit  The design uses control signals to pad the image by grounding the bus whenever it is required.  The adder trees sum PE outputs to produce the final results.  These results are compared to the threshold, and the change/no change bits are then stored into the output frame Processing Unit Pipeline Processing Element Adder Tree Queue 1 Queue 2

MWSCAS 2005Center for Advanced Computer Studies 28 Main Controller  The system is managed by the control unit. The controller has three states:  Process, the system input and calculates Wronskian value  Display shows the output through the encoder  Idle, the process unit does not performed any action  The maximum frame rate is 15 frames per second. If the application require less than 15 fps, the system will remain idle for the rest of the frames of a second

MWSCAS 2005Center for Advanced Computer Studies 29 Memory Unit  For storing the preceding frame data, we used a 300Kb memory.  Another memory of same size is required to store the output values. T  The memory is addressed by 19 bits that includes field index for a frame, vertical addressing and horizontal addressing.  The values for 2-pixels, i.e., 16 bits of data is read and stored at a time.

MWSCAS 2005Center for Advanced Computer Studies 30 Implementation  Implementation of the proposed architecture was done in VHDL using Mentor Graphic Modelsim Simulator.  Synthesis was done using Synopsis Synthesis Tools targeting Xilinx Virtex II XCV800 FPGA.  The XSV-800 board can accept PAL, SECAM, or NTSC video with up to 9-bits of resolution on the red, green, and blue channels and can output video images through a 110 MHz, 24-bit digital to analog converter.  Two independent banks of 512K x 16 SRAM are provided for local buffering of signals and data

MWSCAS 2005Center for Advanced Computer Studies 31 Simulation Results

MWSCAS 2005Center for Advanced Computer Studies 32 Simulation Results  Most of the area and power consumption is occupied by the processing unit.  The adder tree is fully asynchronous and its maximum delay of critical path is 12 ns.  The PE is synchronous with four stages of asynchronous logic.  One stage's maximum delay of critical path is 27 ns.  The total power consumption of the system is 121 mW. The total area of the system in LUT is 2312 (7247 slices).

MWSCAS 2005Center for Advanced Computer Studies 33 Conclusion  A background subtraction architecture using the Wronskian Change Detector algorithm has been presented for VLSI realization in wireless visual applications.  The proposed architecture consists of three units: processing, memory and controller.  The processing unit is composed by pipeline processing element that performs the basic operation.

MWSCAS 2005Center for Advanced Computer Studies 34 Conclusion  Partial results are stored and used on the adder tree to obtain final results.  Memory unit consists of two buffers, one stored the previous frame (Frame Buffer) and the other stored partial results and output.  The architecture is capable of computing Wronskian, Conjugate Wronskian and both.  The maximum frame rate is 15 fps. The power dissipated by the whole system is 121 mW. The total area of the system in LUT is 2312.

MWSCAS 2005 Center for Advanced Computer Studies 35 Thank you