Business Identification: Spatial Detection Alexander Darino Week 8
Weaknesses of Current Approach. Diagram labels: Latitude, Longitude, Geocoding, Reverse Geocoding, Nearby Businesses, Image, OCR, Detected Text, Business Name Matching, Business Identification, Business Spatial Detection
STR Implementation: “Automatic Detection and Recognition of Signs From Natural Scenes”. Pipeline stages: Multiresolution-based potential characters detection; Character/layout geometry and color properties analysis; Local affine rectification; Refined detection
Multiresolution-based potential characters detection
STR Implementation. Original next step: replace with a readily available text detector. Finding: text detectors are not readily available (will revisit later).
TEMPLATE-IMAGE SIFT MATCHING After many technical difficulties…
Template: Name George; Font Trebuchet MS; # Levels 3; Peak Threshold 0; Edge Threshold 10; Scale Cutoff (95%) | Image: Name George …; # Levels 3; Peak Threshold 0; Edge Threshold 10 | Statistics: Good 1; Bad 0; Total (% G) 1 (100%)
Template: Name George; Font Trebuchet MS; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff (95%) | Image: Name George …; # Levels 3; Peak Threshold 0; Edge Threshold 10 | Statistics: Good 1; Bad 0; Total (% G) 1 (100%)
Template: Name George; Font Trebuchet MS; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff (95%) | Image: Name George …; # Levels 1; Peak Threshold 0; Edge Threshold 10 | Statistics: Good 2; Bad 0; Total (% G) 2 (100%)
Template: Name George; Font Trebuchet MS; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff (95%) | Image: Name George …; # Levels 1; Peak Threshold 20; Edge Threshold 10 | Statistics: Good 2; Bad 0; Total (% G) 2 (100%)
Template: Name George; Font Trebuchet MS; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff (95%) | Image: Name George …; # Levels 1; Peak Threshold 20; Edge Threshold 6 | Statistics: Good 2; Bad 0; Total (% G) 2 (100%)
Template: Name Aiken; Font Trebuchet MS; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff (95%) | Image: Name George …; # Levels 1; Peak Threshold 20; Edge Threshold 6 | Statistics: Good 3; Bad 0; Total (% G) 3 (100%)
Template: Name Delicious; Font Trebuchet MS; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff | Image: Name George …; # Levels 1; Peak Threshold 20; Edge Threshold 6 | Statistics: Good 0; Bad 0; Total (% G) 0 (0%)
Template: Name Prepared; Font Trebuchet MS; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff | Image: Name George …; # Levels 1; Peak Threshold 20; Edge Threshold 6 | Statistics: Good 0; Bad 0; Total (% G) 0 (0%)
Template: Name Foods; Font Trebuchet MS; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff | Image: Name George …; # Levels 1; Peak Threshold 20; Edge Threshold 6 | Statistics: Good 0; Bad 0; Total (% G) 0 (0%)
Template: Name Bruegger’s; Font Arial Rounded MT Bold; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff | Image: Name Bruegger’s …; # Levels 1; Peak Threshold 20; Edge Threshold 6 | Statistics: Good 0; Bad 0; Total (% G) 0 (0%)
Template: Name Bakery; Font Arial Rounded MT Bold; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff | Image: Name Bruegger’s …; # Levels 1; Peak Threshold 20; Edge Threshold 6 | Statistics: Good 0; Bad 0; Total (% G) 0 (0%)
Template: Name Bakery; Font Arial Rounded MT Bold; # Levels 5; Peak Threshold 0; Edge Threshold 10; Scale Cutoff | Image: Name Bruegger’s …; # Levels 1; Peak Threshold 20; Edge Threshold 6 | Statistics: Good 0; Bad 0; Total (% G) 0 (0%)
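The Statistics entries on the preceding slides can be tallied with a small helper. This is only a sketch: it assumes that Total is Good plus Bad and that a run with no matches is reported as 0%, which is what the slides show; the function names are hypothetical and not taken from the original code.

```python
def match_statistics(good, bad):
    """Return (total, percent_good) for one template/image run.

    Assumption: total = good + bad, and a run with zero matches is
    reported as 0% rather than raising a division-by-zero error.
    """
    total = good + bad
    percent_good = 100.0 * good / total if total else 0.0
    return total, percent_good


def format_total(good, bad):
    """Format the result the way the slides print it, e.g. '2 (100%)'."""
    total, percent_good = match_statistics(good, bad)
    return f"{total} ({percent_good:.0f}%)"


# Rows taken from the slides: (template name, good, bad).
runs = [("George", 2, 0), ("Aiken", 3, 0), ("Delicious", 0, 0)]
for name, good, bad in runs:
    print(name, format_total(good, bad))
```

Note that a 0 (0%) row means no keypoint matches were found at all, which is different from matches being found and judged bad.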
SCENE TEXT RECOGNITION: Moving away from SIFT and revisiting STR
Scene Text Recognition: Did not hear back from the individuals contacted for an STR implementation. Returning to the STR implementation: further reading indicates that patches are necessary for the subsequent algorithms, so text detection alone is not enough and the specified text detector must be implemented.
Color Properties Analysis: Implemented a Gaussian Mixture Model (GMM) to obtain μ and σ of the foreground and background for each of R, G, B, H, and I. Calculated the confidence that each component (R, G, B, H, I) can be used to recognize characters.
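The GMM step can be illustrated with a minimal sketch: a two-component, one-dimensional mixture fitted by expectation-maximization to the values of a single channel, giving a μ and σ for each of the two pixel populations. The function names, the initialization, and in particular the separation score used as a "confidence" are assumptions made for illustration; the paper's actual confidence formula is not reproduced here.

```python
import math


def fit_two_gaussians(values, iters=50):
    """Fit a 2-component 1-D Gaussian mixture with EM.

    Returns two (weight, mu, sigma) tuples sorted by mu.
    """
    lo, hi = min(values), max(values)
    mu = [lo, hi]                                  # start means at the extremes
    sigma = [max((hi - lo) / 4.0, 1e-3)] * 2       # one broad initial sigma
    weight = [0.5, 0.5]

    def pdf(x, m, s):
        return math.exp(-0.5 * ((x - m) / s) ** 2) / (s * math.sqrt(2 * math.pi))

    for _ in range(iters):
        # E-step: responsibility of each component for each sample.
        resp = []
        for x in values:
            p = [weight[k] * pdf(x, mu[k], sigma[k]) for k in range(2)]
            total = sum(p) or 1e-300
            resp.append([pk / total for pk in p])
        # M-step: re-estimate weight, mean, and sigma of each component.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            if nk < 1e-9:
                continue                           # component has no support
            weight[k] = nk / len(values)
            mu[k] = sum(r[k] * x for r, x in zip(resp, values)) / nk
            var = sum(r[k] * (x - mu[k]) ** 2 for r, x in zip(resp, values)) / nk
            sigma[k] = max(math.sqrt(var), 1e-3)   # floor avoids a zero sigma
    return sorted(zip(weight, mu, sigma), key=lambda c: c[1])


def separation_confidence(comp_a, comp_b):
    """Hypothetical score (NOT the paper's formula): mean gap over summed sigmas."""
    (_, mu_a, sd_a), (_, mu_b, sd_b) = comp_a, comp_b
    return abs(mu_a - mu_b) / (sd_a + sd_b)


# Synthetic channel values: a dark population and a smaller bright one.
channel = [38, 40, 42] * 50 + [195, 200, 205] * 20
dark, bright = fit_two_gaussians(channel)
print("dark:", dark, "bright:", bright)
print("confidence:", separation_confidence(dark, bright))
```

Running the fit once per channel (R, G, B, H, I) and comparing the scores gives one way to rank channels. Which component is foreground and which is background still has to be decided separately, for example by assuming the text covers the smaller share of the patch.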
Original
Red
Green
Blue
Hue
Intensity
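The five channel views on the preceding slides (R, G, B, H, I) can be derived per pixel as sketched below. This is an assumption-laden illustration: intensity is taken as the mean of R, G, and B (the usual HSI definition), and hue comes from the standard library's `colorsys.rgb_to_hsv`, whose hexagonal hue may differ slightly from the formula used in the original implementation. The function names are hypothetical.

```python
import colorsys


def split_channels(r, g, b):
    """Return the R, G, B, H, I values for one 8-bit RGB pixel.

    Assumptions: H is the HSV hue in degrees (0-360) and
    I is the mean of the three colour components (0-255).
    """
    hue, _sat, _val = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
    intensity = (r + g + b) / 3.0
    return {"R": r, "G": g, "B": b, "H": hue * 360.0, "I": intensity}


def channel_plane(image, name):
    """Extract one channel from an image given as rows of (r, g, b) tuples."""
    return [[split_channels(*pixel)[name] for pixel in row] for row in image]


# A tiny 1x2 "image": a pure red pixel next to a pure green pixel.
image = [[(255, 0, 0), (0, 255, 0)]]
print("Hue plane:", channel_plane(image, "H"))
print("Intensity plane:", channel_plane(image, "I"))
```

The two pixels in the example have the same intensity but clearly different hues, which is one way letters can vanish in the Intensity view while staying visible in Hue.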
Evaluation: The highest confidence was found in Intensity, even though most letters vanish in that channel, whereas in Hue the letters are easily distinguishable. This suggests that text recognition should occur individually per character. The paper further suggests that it needs the patches around the individual characters.
Next Step
Thank You