Business Identification: Local Neighborhood Alexander Darino.



Outline
The "Where Am I?" project obtains the geolocation of the camera from an image
Objective: Obtain the geolocation and address of businesses in the image
– Assume the business is nearby, e.g. < 100 m from the camera
– Compare methods of obtaining this information
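The "nearby" assumption can be sketched as a distance filter around the camera position. This is a minimal illustration, not the project's code: the function names, the `(name, lat, lon)` record layout, and the coordinates in the usage note are all hypothetical, and the haversine formula is simply one common way to get a distance in metres from two latitude/longitude pairs.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius in metres


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def nearby(camera, businesses, max_m=100.0):
    """Keep (name, lat, lon) records within max_m metres of the camera (assumed layout)."""
    cam_lat, cam_lon = camera
    return [b for b in businesses
            if haversine_m(cam_lat, cam_lon, b[1], b[2]) <= max_m]
```

For example, with a made-up camera position `(40.0, -80.0)`, a record at `(40.0005, -80.0)` is roughly 56 m away and is kept, while one at `(40.002, -80.0)` is roughly 220 m away and is dropped.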

Outline
[Pipeline diagram. Nodes: Latitude/Longitude, Geocoding, Reverse Geocoding, Nearby Businesses, Image, OCR, Detected Text, Business Name Matching, Business Identification]

Outline
This Week:
– Finding Local Businesses via Geocode Search
– Finding Local Addresses via Reverse Geocoding
– Extracting Identifying Text (i.e. store names) via Optical Character Recognition (OCR)
– Matching OCR Text to Business Names
Next Steps/Weekend Objectives
Acknowledgements

Obtaining Business Names ✓

Local Businesses: Geocode Search
Used three place-search APIs:
– Yelp API: detailed yellow-page-type results
– Google Places API: "skinny" results plus a reference to more information
– CityGrid API: minimal yellow-page-type results; used by Yellow Pages, Super Pages
At present, only interested in business names
Aggregated names from all three APIs
Example (next slide)

Local Businesses: Geocode Search
Denham & Company Salon; Ullrich's Shoe Repairing; Nicholas Coffee Co; Bella Sera On the Square; A & J Ribs; Starbucks Coffee; Jenny Lee Bakery; Galardi's 30 Minute Cleaners; Jimmy John's Gourmet Sandwiches; Charley's Grilled Subs; Fresh Corner; Lagondola Pizzeria & Restaurant; Camera Repair Service Inc; Pittsburgh Cigar Bar; Original Oyster House; MixStirs; 1902 Tavern; Costanzo's; Pittsburgh Silver Llc; Graeme St; Galardi's 30 Minute Cleaners; Denham & Co Salon; Bruegger's Bagel Bakery; Nicholas Coffee Co; Market Square; Fat Tommy's Pizzeria; Mixstirs Cafe; Giggles; Rycon Construction Inc; Garbera, Dennis C, Dds - Emmert Dental Assoc; Bella Sera on the Square; Mancini's Bread Co; Las Velas; Ciao Baby; Washington Reprographics Inc; Highmark Life Insurance Co; Fischer, Donald R, Md - Highmark Life Insurance Co; Jimmy John's; Lynx Energy Partners Inc; Emmert Dental Assoc
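The aggregated list contains the same business more than once (for instance "Bella Sera On the Square" and "Bella Sera on the Square"). A minimal sketch of merging name lists while collapsing such trivial variants is shown here. The `normalize` and `aggregate` helpers are hypothetical, the slides do not say how aggregation was actually done, and which API returned which name is not stated, so the grouping in the usage note is invented.

```python
import re


def normalize(name):
    """Lowercase, drop punctuation, collapse whitespace: a crude comparison key."""
    key = re.sub(r"[^a-z0-9 ]+", "", name.lower())
    return " ".join(key.split())


def aggregate(*result_lists):
    """Merge name lists, keeping the first spelling seen for each normalized key."""
    seen = {}
    for names in result_lists:
        for name in names:
            seen.setdefault(normalize(name), name)
    return list(seen.values())
```

Calling `aggregate(["Nicholas Coffee Co", "Bella Sera On the Square"], ["Bella Sera on the Square", "Galardi's 30 Minute Cleaners"])` yields three names rather than four. Abbreviation variants such as "Denham & Company Salon" versus "Denham & Co Salon" are not merged by this key and would need fuzzier comparison.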

Local Businesses: Geocode Search
Results: 12 Success, 3 Partial
– Q9: First Presbyterian Church [turns out it wasn't a cathedral] (SUCCESS)
– Q28: Moe's (SUCCESS)
– Q34: Bruegger's Bagels (SUCCESS)
– Q35: Bruegger's, Tavern, Nicholas (SUCCESS)
– Q42: Tavern, Nicholas, Costanzo's [in distance] (SUCCESS)
– Q57: Tambellini (SUCCESS)
– Q63: Benedum Center (SUCCESS)
– Q141: Roberts/7-Eleven (PARTIAL - misses Roberts)
– Q200: Goodyear (SUCCESS)
– Q238: Far from Bruegger's, Tavern, Nicholas (PARTIAL - misses Tavern)
– Q246: Some theater (can't read it) (SUCCESS)
– Q249: George Aikens (SUCCESS)
– Q260: Dogs Dun Wright, Cherrie's diner (SUCCESS)
– Q300: Giggles, Bruegger's, Tavern (in distance) (SUCCESS)
– Q318: Fifth Avenue Place, Wine & Spirits (PARTIAL - misses Wine & Spirits)

Local Businesses: Geocode Search
Strengths
– Aggregated results almost always found the business of interest
Weaknesses
– Each API limits query result set size; this is why we aggregate
– Contacted Yelp, Google, and CityGrid for extended API access. Heard back from CityGrid; conference call next week.
– Only businesses are listed
– Not all businesses are listed: all but one "Partial" result were for unlisted businesses
Limitations
– Have only tested on 15 Pittsburgh images; result quality for rural areas is unknown.

Local Businesses: Geocode Search
✓ ✓* ✓* ✓
* Implicitly verified: APIs can search by latitude/longitude OR address

Local Addresses: Reverse Geocoding
Used two reverse-geocoding APIs:
– Google: provides a range of addresses on the same road
  – Usually the road is correct, but sometimes it's slightly off
  – Sometimes the road is correct, but the actual address number is not in the range
– Bing: provides one or two proximate addresses
  – Rates its own confidence; even 'Medium' confidences are very accurate
  – Address is never exact, but is almost always adjacent to the correct address
  – Results are not consistent: it returns one, the other, or both of the two addresses regardless of confidence level

Local Addresses: Reverse Geocoding
Intent: Get up to ~500 nearby addresses
No Address Search API Available
✓ ✓* ✓* ✓ ✗

Extracting Identifying Text: OCR
[Pipeline diagram. Nodes: Latitude/Longitude, Geocoding, Reverse Geocoding, Nearby Businesses, Image, OCR, Detected Text, Business Name Matching, Business Identification]

Extracting Identifying Text: OCR
Given:
– List of nearby businesses (names, addresses, etc.)
– Image containing businesses with visible names
Objective:
– Extract names of businesses from the image
– Identify businesses located in the image: match names extracted from the image to names in the business list

Extracting Identifying Text: OCR
Used two OCR APIs:
– GNU OCR (Ocrad)
– GOCR
OCR APIs are highly sensitive to:
– Font (only works well with roman font)
– Perspective
– Scale
– Binarization threshold
– Dark on light vs. light on dark (inversion)

Extracting Identifying Text: OCR
OCR API evaluations
– Ocrad: yielded no meaningful output across more than 200 scale/threshold/inversion combinations
– GOCR: produced good results across 10 scales, with and without inversion, using a threshold determined automatically by Otsu's method
Examples of GOCR output (next slides)
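For readers unfamiliar with Otsu's method, the following is a minimal pure-Python sketch of how such a threshold can be computed and applied, with an optional inversion for light-on-dark signs. It assumes the image is already available as a flat list of 8-bit grey values; the function names are hypothetical, and this is not the code used in the project or inside GOCR.

```python
def otsu_threshold(pixels):
    """Return the grey level t in 0..255 that maximises between-class variance.

    Pixels <= t form one class and pixels > t the other.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with grey level <= t
    sum0 = 0  # sum of grey levels of those pixels
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0:
            continue  # lower class still empty
        if w1 == 0:
            break     # upper class empty from here on
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (mean0 - mean1) ** 2
        if var_between > best_var:
            best_t, best_var = t, var_between
    return best_t


def binarize(pixels, threshold, invert=False):
    """Map pixels to 0/1; invert=True swaps the classes for light-on-dark text."""
    return [int((p > threshold) != invert) for p in pixels]
```

In a sweep like the one described, each rescaled image would be binarized twice, once with `invert=False` and once with `invert=True`, before being handed to the OCR engine.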

Extracting Identifying Text: OCR

n. c o.a...u oU..D.oa..e......_RuEGGE..KERy..J...w L M. II.....c.....i l..J.t...llt...l SHA.P. It..tllt _. l...J y. _.c _.... _tt.._....t.._.r t.t_t.._.._.l.. J.r.r.I.

Extracting Identifying Text: OCR

u _nq......eoR.E.l.e...í....e...n....n....n.e.R.E...e....o. _....E.R.E.IKE I.ltlO rE..o......E.....I.K.E.o..... J.n....c...E.R.E.I.E M..E.R.E...E...a J...Gu. ge..ge E.F.._.....E..gE.D... fUlI..lll.lll.IIi.l..Xl..

Extracting Identifying Text: OCR

..e_..w.. _......D u J.....J n......n n _..r.l_d..J.ec.m._..n J.n.._...tn.. ct..._ D.u.v... e. n.... u.. Y.._ w. n. n....Jn G..o..r..._ J...m l.t..l.tt.l.._w _....l....t j..i lI.i..

Extracting Identifying Text: OCR

__. ncu_.l..._..._J...ne _n._..v.....ra......d_..._ i..n..U ll REsT.unAN...r. c.....r...T t.rJll......m...c.....n J n. I..c...r.r ESTAU.ANT.r.O....c.cc.

Note: Even though "Tambellini" is a roman font, it is too stretched to be picked up by GOCR

Extracting Identifying Text: OCR
Strengths
– Applicable to expected input of orthogonal images
– Output can be run through word similarity matching algorithms
Weaknesses
– Only works well(-ish) for strictly roman font
Limitations
– Will perform poorly for artistic fonts and business signs
Conclusion
– By itself, OCR is not the best approach to business identification (poor recognition, franchises, perspective, etc.)
– OCR could be used as part of a business identification voting scheme

Matching OCR Text to Business Names
[Pipeline diagram. Nodes: Latitude/Longitude, Geocoding, Reverse Geocoding, Nearby Businesses, Image, OCR, Detected Text, Business Name Matching, Business Identification]

Matching OCR Text to Business Names
Fuzzy string matching: TRE package
– Approximate regular expression matching
– Returns edit distance of matched text
Filter OCR text
– Trimming
– Chunking
– Discarding useless chunks (i.e. fewer than two letters)
Developing an algorithm to rate the confidence that a business name appears in the image
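To illustrate what "edit distance of matched text" means, here is a small sketch of approximate substring matching using the classic dynamic-programming formulation (often attributed to Sellers). It is a stand-in written for illustration and does not use or reproduce the TRE package's API; the function name is hypothetical.

```python
def best_substring_distance(pattern, text):
    """Minimum Levenshtein distance between pattern and any substring of text.

    Comparison is case-insensitive. The result is len(pattern) for empty text.
    """
    pattern, text = pattern.lower(), text.lower()
    # prev[i]: cheapest way to align pattern[:i] with a substring of text
    # ending at the previous text position.
    prev = list(range(len(pattern) + 1))
    best = prev[-1]
    for ch in text:
        cur = [0]  # a match may start at any text position at no cost
        for i, pc in enumerate(pattern, start=1):
            cur.append(min(
                prev[i] + 1,               # extra character in text
                cur[i - 1] + 1,            # pattern character missing from text
                prev[i - 1] + (pc != ch),  # match (0) or substitution (1)
            ))
        best = min(best, cur[-1])
        prev = cur
    return best
```

Applied to a fragment of the GOCR output shown earlier, `best_substring_distance("bruegger", "_RuEGGE..KERy")` returns 2: the leading "B" and the trailing "r" were misread. Dividing the distance by the length of the business name is one possible way to turn it into a rough confidence score, although the slides do not say how the confidence rating was computed.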

Matching OCR Text to Business Names

Next Steps/Weekend Objectives
– Implement 'chunking' of OCR output
– Evaluate and refine the algorithm against multiple inputs
– Detect the location of text in the image
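One possible reading of 'chunking' is sketched here: split the raw OCR string on whitespace and on runs of two or more filler characters, trim leftover filler from each piece, and drop pieces with fewer than two letters. Treating "." and "_" as filler is an assumption based on the sample output, and `chunk_ocr` is a hypothetical helper rather than the planned implementation.

```python
import re


def chunk_ocr(raw, min_letters=2):
    """Split raw OCR output into candidate word chunks.

    Single dots inside a chunk are kept because they may stand for one
    unrecognised character (e.g. "ESTAU.ANT").
    """
    chunks = []
    for piece in re.split(r"\s+|[._]{2,}", raw):
        piece = piece.strip("._")  # trim filler at both ends
        letters = sum(c.isalpha() for c in piece)
        if letters >= min_letters:
            chunks.append(piece)
    return chunks
```

On the tail of the last GOCR sample, `chunk_ocr("I..c...r.r ESTAU.ANT.r.O....c.cc.")` gives `["r.r", "ESTAU.ANT.r.O", "c.cc"]`, leaving one chunk that a fuzzy matcher could compare against "Restaurant".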

Acknowledgements
Subh
– Directed us to the Ocrad and GOCR OCR packages
– Provided feedback on how to calibrate OCR packages to extract meaningful text (e.g. scaling, inversion, etc.)

Thank You