James Pittman February 9, 2011 EEL 6788 Automatic Collection of Fuel Prices from a Network of Mobile Cameras A. Dong, S. S. Kanhere, C. T. Chou and N.

Slides:



Advertisements
Similar presentations
Patient information extraction in digitized X-ray imagery Hsien-Huang P. Wu Department of Electrical Engineering, National Yunlin University of Science.
Advertisements

ARTIFICIAL PASSENGER.
Applications of one-class classification
1 ECE 495 – Integrated System Design I Introduction to Image Processing ECE 495, Spring 2013.
By: Mani Baghaei Fard.  During recent years number of moving vehicles in roads and highways has been considerably increased.
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Word Spotting DTW.
1 Autonomous Registration of LiDAR Data to Single Aerial Image Takis Kasparis Nicholas S. Shorter
IntroductionIntroduction AbstractAbstract AUTOMATIC LICENSE PLATE LOCATION AND RECOGNITION ALGORITHM FOR COLOR IMAGES Kerem Ozkan, Mustafa C. Demir, Buket.
Simple Face Detection system Ali Arab Sharif university of tech. Fall 2012.
Managing Redundant Content in Bandwidth Constrained Wireless Networks Tuan Dao, Amit K. Roy- Chowdhury, Srikanth V. Krishnamurthy U.C. Riverside Harsha.
Object Inter-Camera Tracking with non- overlapping views: A new dynamic approach Trevor Montcalm Bubaker Boufama.
Virtual Dart: An Augmented Reality Game on Mobile Device Supervisor: Professor Michael R. Lyu Prepared by: Lai Chung Sum Siu Ho Tung.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Brent Dingle Marco A. Morales Texas A&M University, Spring 2002
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Rodent Behavior Analysis Tom Henderson Vision Based Behavior Analysis Universitaet Karlsruhe (TH) 12 November /9.
Ensemble Tracking Shai Avidan IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE February 2007.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.
A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
TelosCAM: Identifying Burglar Through Networked Sensor-Camera Mates with Privacy Protection Presented by Qixin Wang Shaojie Tang, Xiang-Yang Li, Haitao.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Distributed and Efficient Classifiers for Wireless Audio-Sensor Networks Baljeet Malhotra Ioanis Nikolaidis Mario A. Nascimento University of Alberta Canada.
Real-Time Face Detection and Tracking Using Multiple Cameras RIT Computer Engineering Senior Design Project John RuppertJustin HnatowJared Holsopple This.
Oral Defense by Sunny Tang 15 Aug 2003
Digital Images The nature and acquisition of a digital image.
© 2013 IBM Corporation Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments Presented by Shashank Mujumdar IBM Research,
Internet GIS. A vast network connecting computers throughout the world Computers on the Internet are physically connected Computers on the Internet use.
Automatic Collection of Fuel Prices from a Network of Mobile Cameras
Knowledge Systems Lab JN 8/24/2015 A Method for Temporal Hand Gesture Recognition Joshua R. New Knowledge Systems Laboratory Jacksonville State University.
Convolutional Neural Networks for Image Processing with Applications in Mobile Robotics By, Sruthi Moola.
Presented by: Kamakhaya Argulewar Guided by: Prof. Shweta V. Jain
Satellites in Our Pockets: An Object Positioning System using Smartphones Justin Manweiler, Puneet Jain, Romit Roy Choudhury TsungYun
1. Introduction Motion Segmentation The Affine Motion Model Contour Extraction & Shape Estimation Recursive Shape Estimation & Motion Estimation Occlusion.
Università degli Studi di Modena and Reggio Emilia Dipartimento di Ingegneria dell’Informazione Prototypes selection with.
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Vehicle License Plate Detection Algorithm Based on Statistical Characteristics in HSI Color Model Instructor : Dr. K. R. Rao Presented by: Prasanna Venkatesh.
Participatory Sensing in Commerce: Using Mobile Phones to Track Market Price Dispersion Nirupama Bulusu (Portland State University) Chun Tung Chou, Salil.
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
Expectation-Maximization (EM) Case Studies
NTIT IMD 1 Speaker: Ching-Hao Lai( 賴璟皓 ) Author: Hongliang Bai, Junmin Zhu and Changping Liu Source: Proceedings of IEEE on Intelligent Transportation.
VIP: Finding Important People in Images Clint Solomon Mathialagan Andrew C. Gallagher Dhruv Batra CVPR
1.Research Motivation 2.Existing Techniques 3.Proposed Technique 4.Limitations 5.Conclusion.
Autonomous Robots Vision © Manfred Huber 2014.
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
Building a contactless university examination system using NFC Speaker : Chih-Ching Chen Advisor : Dr. Ho-Ting Wu 2013/12/2 1.
Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉 教授 : 許毅然 作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.
Eric Minner & James Pittman. Outline Project Statement / Motivation Concept overview Quick computer vision overview Demo Lessons Learned Future Work.
May 2003 SUT Color image segmentation – an innovative approach Amin Fazel May 2003 Sharif University of Technology Course Presentation base on a paper.
POSTER TEMPLATE BY: Background Objectives Psychophysical Experiment Photo OCR Design Project Pipeline and outlines ❑ Deep Learning.
1 A Statistical Matching Method in Wavelet Domain for Handwritten Character Recognition Presented by Te-Wei Chiang July, 2005.
Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
ParkNet: Drive-by Sensing of Road-Side Parking Statistics Irfan Ullah Department of Information and Communication Engineering Myongji university, Yongin,
Automatic License Plate Recognition for Electronic Payment system Chiu Wing Cheung d.
Histograms CSE 6363 – Machine Learning Vassilis Athitsos
Contents Team introduction Project Introduction Applicability
Efficient Image Classification on Vertically Decomposed Data
DIGITAL SIGNAL PROCESSING
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Factors that Influence the Geometric Detection Pattern of Vehicle-based Licence Plate Recognition Systems Martin Rademeyer Thinus Booysen, Arno Barnard.
Efficient Image Classification on Vertically Decomposed Data
What Is Spectral Imaging? An Introduction
RGB-D Image for Scene Recognition by Jiaqi Guo
Speaker: YI-JIA HUANG Date: 2011/12/08 Authors: C. N
Presentation transcript:

James Pittman February 9, 2011 EEL 6788 Automatic Collection of Fuel Prices from a Network of Mobile Cameras A. Dong, S. S. Kanhere, C. T. Chou and N. Bulusu, Automatic Collection of Fuel Prices from a Network of Mobile Cameras, in Proceedings of the 4th IEEE International Conference on Distributed Computing in Sensor Systems (DCOSS), June 2008

Outline Introduction Background System Design Computer Vision Algorithm Evaluation Related Work Conclusions 2

Introduction Wireless sensor network (WSN) technology has been applied to many different domains ◦ This paper presents a concept where WSNs are used for collecting consumer pricing information ◦ The specific target for this paper’s effort is in gathering pricing from fuel (gasoline) stations 3

Introduction Two stations on the same corner of an intersection can have greatly different prices for fuel Currently websites such as Gaswatch, GasBuddy, and others either: ◦ Send workers out every day multiple times to collect and track fuel pricing data. ◦ Rely on input from volunteer site users This is highly labor intensive and inaccurate since stations often update prices at different times of the day 4

Background - SenseMart The authors are re-using the Sensing Data Market (SenseMart) framework they proposed in an earlier paper The SenseMart concept is similar to participatory sensing. ◦ It leverages existing infrastructure (WSNs) for data collection and encourages the users to share their data to accomplish some high level task. 5

Background - SenseMart The SenseMart framework facilitates the data exchange using a “BitTorrent” style concept ◦ They incentivize the system by giving a return to the users proportionate to their contributions to encourages data sharing. ◦ They did not detail what the ‘return’ was other than access to accurate data on gas prices. 6

System Design The proposed system has 2 methods of operation 1.Fuel price collection 2.User query The first is the focus of this paper. ◦ Automatic triggering of users phones ◦ Use of computer vision algorithms + GPS/GIS contextual information to extract the pricing info 7

System Design 8

System Design – Camera Sensor Primary function – automatic capture of images of fuel price boards ◦ Assumed that participating users have cameras mounted in car on dashboard on passenger side (in Australia) ◦ System could also interface with built in car camera vision systems and transfer data via Wi-Fi or Bluetooth to mobile phones 9

System Design – Camera Sensor A control unit in the mobile phone oversees the capturing operations. ◦ It periodically polls the GPS receiver to obtain the current location ◦ A GIS (geographic information system) app such as Google maps or TomTom is required on the phone ◦ The GIS on the phone is then queried (using GPS location) to gather local contextual information 10

System Design – Camera Sensor ◦ If a gas station is known to be close, the control unit estimates viability for image capture (camera facing, distance to target) ◦ If the situation is deemed viable the camera is activated, images are captured and the camera is deactivated ◦ The resulting images along with the associated meta-data (location, time of capture, and any GIS data such as station brand) are passed to the “data-upload unit” for upload to the central server 11

System Design – Camera Sensor 12

System Design – Data Transport Any data captured by the camera along with the meta-data is transferred to the data upload unit ◦ This “unit” is generally the ability of the mobile phone to access the internet via 3G or Wi-Fi. ◦ The device establishes a TCP connection with the server and uploads the data. ◦ The backup/alternative method is to use multimedia SMS for the data transfer 13

System Design – Central Server The central server stores all of the data and runs the computer vision algorithms. ◦ Processes the images ◦ Extracts the fuel prices The server also handles the reception of the images, and processing / storing of the associated meta data 14

System Design – Central Server The server processes all of the data in steps 1.Detect a fuel board 2.Detect the section with the numbers 3.Crop the image to the numbers and normalize to a standard size & resolution 4.Extract the numbers 5.Classify the values 6.Report fuel prices 15

Computer Vision Algorithm 16 Pictorial overview of the algorithm

Computer Vision Algorithm 17 Challenges to overcome 1.Objects obscuring the fuel price boards 2.Background color similar / identical to the price board

Computer Vision Algorithm 18 3.Blurred or unfocused image captures (often due to capturing while sensor in motion) 4.Sections of the board that share characteristics with the prices (adds, borders)

Fuel Price Board Detection Detecting an fuel board and identifying its location in any given image is challenging ◦ Authors use GPS and GIS information to reduce the difficulty of the problem ◦ Each fuel brand has a generally unique color scheme ◦ Meta – data from GPS/GIS can be used to tag incoming images with fuel brands to guide the system in identifying color information 19

Fuel Price Board Detection There are 2 prominent color schemes for representing images: RGB and HIS (Red- Green-Blue and Hue-Intensity-Saturation) ◦ HIS is illumination independent but computationally complex ◦ RGB is illumination sensitive, but computationally efficient ◦ Authors work with RGB due to targeting mobile applications 20

Fuel Price Board Detection RGB is an additive color space, making it easy to extract a single color component ◦ Each pixel is represented by red, green and blue color components ◦ They extract a single color by boosting that channel and subtracting the other components To extract the blue color: 21

Fuel Price Board Detection The first step used by the authors is color thresholding ◦ The objective is to classify all pixels as either  Object pixels (in areas potentially containing an object)  Background pixels (everything else) ◦ This produces a binary image ◦ The difficulty in this is selecting the correct threshold to separate Object and Background pixels. 22

Fuel Price Board Detection They were unable to design a single color threshold to work with all images ◦ To resolve this they took all of the images and classified them into groups based on lighting conditions ◦ Each group was then analyzed and a threshold was derived based on the average intensity 23

Fuel Price Board Detection The next step merges adjacent regions together to form a complete price board Finally connected component labeling is employed to connect pixels into components (all pixels in a component share a physical connection and an intensity range) 24

Fuel Price Board Detection 25

Dimension Comparison Using a priori knowledge about general fuel board dimensions, overly large and overly small regions are excluded Ratios were devised to take in account differences in angles, and distance to sensor Variables correspond with: W – width, H – height Of the region in the image 26

Color Histogram Comparison The second part of the post processing is a histogram comparison algorithm ◦ Compares histogram distribution of candidate region to a template of the price board. ◦ Compute a χ squared distance between the candidate histogram (h i ) and the reference histogram (h j ). K = number of histogram bins. 27

Color Histogram Comparison Since χ turns out to be a large number they normalize it against the number of bins (K) and the width of the image (W i ) Based on test results they have come up with a threshold of D norm ≤ 2.5 ◦ If the value of D norm is under this threshold, the region is very likely a price board 28

Color Histogram Comparison 29 Template (a) Histogram(d) Example (b) Histogram(e) Other Region in Image (c) Histogram(f)

Fuel Price Classification Character Extraction ◦ Once the fuel board has been detected and located in the image the price must be extracted ◦ Due to the nature of the board (color choice, low noise) it can be converted to a binary image. ◦ This significantly reduces the complexity of character extraction 30

Character Extraction A bounding box algorithm is applied to the image to crop each character The cropped characters are normalized to a 50x70 pixel standard size image Each character is broken up into 35 10x10 pixel images that are used to create a 35x1 feature vector of the average intensity 31

Feed-forward Back-propagation Neural Networks (FFBPNN) are used in the character recognition Trained on characters from 20 sample fuel boards A priori knowledge of price placement is used as a base to know what price corresponds to what fuel type. 32 Character Recognition

Evaluation Data Set ◦ 52 images from 5 Mobil and 3 BP stations ◦ Captured with a 5-megapixel Nokia N95 phone or 4-megapixel Canon IXUS 400 camera ◦ Cameras held by passenger in front seat ◦ Images captured in a range of distances, weather and lighting conditions ◦ Each image has 1 fuel price board with 3 prices, and 11 numerals are expected (Australian stations) 33

Evaluation - Detection Range Definitions ◦ Board is “close” if it occupies > 1/8 of the image ◦ Otherwise it is “far away” Results metric ◦ “hit” if board correctly detected ◦ “miss” otherwise 34

Evaluation - Detection Detection Results 35

Evaluation - Classification Data Set ◦ 48 successfully classified images from detection phase ◦ 15 contain board to blurry even for humans to ID ◦ 33 images (15 Mobil, 18 BP) with 330 total characters and 99 fuel prices ◦ Issue with Mobil is always lower than BP as classification algo counted a white border as a “1” 36

Evaluation - Classification Results Nearly 90% combined correct classification! 37

Related Work The authors mention related work in both WSNs and detection/recognition of objects in images Point out that many WSN researchers are starting to look to a variety of tools (beyond traditional WSNs) such as mobile phones, vehicles, GPS, cameras etc. as everyday data collection devices 38

Related Work They also mention the difficulty (or near impossibility) of using commercial OCR (optical character recognition) software to do the task they have presented ◦ Issues include  lack of standard layout  lack of standard fonts  Other unstructured variables (lighting, weather, distance) 39

Conclusions 3 Key factors to the authors work ◦ Offering a “BitTorrent” style platform for sensing data, encouraging users to contribute ◦ Proposed system leverages existing sensing and communication infrastructure (lowering the barrier for a volunteer to participate) ◦ Use of computer vision algorithms for the extraction of data 40

41 References Chou, C.T., Bulusu, N., Kanhere, S.: Sensing data market. In: Proceedings of Poster Papers of 3 rd IEEE International Conference on Distributed Computing in Sensor Systems (DCOSS 2007). (June 2007) Yuan, B., Kwoh, L.K., Tan, C.L.: Finding the best-fit bounding-boxes. Document Analysis Systems VII 3872/2006 (2006)

Example GasBuddy.com App 42