© Devi Parikh 2008 Localization and Segmentation of 2D High Capacity Color Barcodes Gavin Jancke Microsoft Research, Redmond Devi Parikh Carnegie Mellon.

Slides:



Advertisements
Similar presentations
Zhengyou Zhang Vision Technology Group Microsoft Research
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Miroslav Hlaváč Martin Kozák Fish position determination in 3D space by stereo vision.
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science TU München, Germany A Person and Context.
QR Code Recognition Based On Image Processing
Presented by Xinyu Chang
--- some recent progress Bo Fu University of Kentucky.
Learning Techniques for Video Shot Detection Under the guidance of Prof. Sharat Chandran by M. Nithya.
Beyond Mindless Labeling: Really Leveraging Humans to Build Intelligent Machines Devi Parikh Virginia Tech.
Proactive Learning: Cost- Sensitive Active Learning with Multiple Imperfect Oracles Pinar Donmez and Jaime Carbonell Pinar Donmez and Jaime Carbonell Language.
Object Recognition using Invariant Local Features Applications l Mobile robots, driver assistance l Cell phone location or object recognition l Panoramas,
Computer Vision Lecture 16: Region Representation
The Free Safety Problem Using Gaze Estimation as a Meaningful Input to a Homing Task Albert Goldfain CSE 668: Animate Vision Principles Final Project Presentation.
Facial feature localization Presented by: Harvest Jang Spring 2002.
Foreground Modeling The Shape of Things that Came Nathan Jacobs Advisor: Robert Pless Computer Science Washington University in St. Louis.
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
1Ellen L. Walker Edges Humans easily understand “line drawings” as pictures.
Detecting Pedestrians by Learning Shapelet Features
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
Computer Vision REU Week 2 Adam Kavanaugh. Video Canny Put canny into a loop in order to process multiple frames of a video sequence Put canny into a.
A New Block Based Motion Estimation with True Region Motion Field Jozef Huska & Peter Kulla EUROCON 2007 The International Conference on “Computer as a.
Lecture 5 Template matching
MRI Image Segmentation for Brain Injury Quantification Lindsay Kulkin 1 and Bir Bhanu 2 1 Department of Biomedical Engineering, Syracuse University, Syracuse,
Using Error-Correcting Codes For Text Classification Rayid Ghani Center for Automated Learning & Discovery, Carnegie Mellon University.
CSSE463: Image Recognition Day 30 Due Friday – Project plan Due Friday – Project plan Evidence that you’ve tried something and what specifically you hope.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
1/20 Document Segmentation for Image Compression 27/10/2005 Emma Jonasson Supervisor: Dr. Peter Tischer.
A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.
Perceived video quality measurement Muhammad Saqib Ilyas CS 584 Spring 2005.
Distinctive Image Features from Scale-Invariant Keypoints David G. Lowe – IJCV 2004 Brien Flewelling CPSC 643 Presentation 1.
Visual Screen: Transforming an Ordinary Screen into a Touch Screen Zhengyou Zhang & Ying Shan Vision Technology Group Microsoft Research
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science Technische Universität München Adaptive.
Face Alignment Using Cascaded Boosted Regression Active Shape Models
Using Error-Correcting Codes For Text Classification Rayid Ghani Center for Automated Learning & Discovery, Carnegie Mellon University.
Deep Green System for real-time tracking and playing the board game Reversi Nadav Erell Intro to Computational and Biological Vision, CS department, Ben-Gurion.
Unsupervised Learning of Hierarchical Spatial Structures Devi Parikh, Larry Zitnick and Tsuhan Chen.
GM-Carnegie Mellon Autonomous Driving CRL TitleAutomated Image Analysis for Robust Detection of Curbs Thrust AreaPerception Project LeadDavid Wettergreen,
CSSE463: Image Recognition Day 30 This week This week Today: motion vectors and tracking Today: motion vectors and tracking Friday: Project workday. First.
EE 492 ENGINEERING PROJECT LIP TRACKING Yusuf Ziya Işık & Ashat Turlibayev Yusuf Ziya Işık & Ashat Turlibayev Advisor: Prof. Dr. Bülent Sankur Advisor:
X-ray Image Segmentation using Active Shape Models
CSE 185 Introduction to Computer Vision Pattern Recognition 2.
December 4, 2014Computer Vision Lecture 22: Depth 1 Stereo Vision Comparing the similar triangles PMC l and p l LC l, we get: Similarly, for PNC r and.
DISCERN: Cooperative Whitespace Scanning in Practical Environments Tarun Bansal, Bo Chen and Prasun Sinha Ohio State Univeristy.
Data Extraction using Image Similarity CIS 601 Image Processing Ajay Kumar Yadav.
An Efficient Search Strategy for Block Motion Estimation Using Image Features Digital Video Processing 1 Term Project Feng Li Michael Su Xiaofeng Fan.
Bryan Willimon, Stan Birchfield, Ian Walker Department of Electrical and Computer Engineering Clemson University IROS 2010 Rigid and Non-Rigid Classification.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Vehicle Segmentation and Tracking From a Low-Angle Off-Axis Camera Neeraj K. Kanhere Committee members Dr. Stanley Birchfield Dr. Robert Schalkoff Dr.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
Figure ground segregation in video via averaging and color distribution Introduction to Computational and Biological Vision 2013 Dror Zenati.
Corner Detection & Color Segmentation CSE350/ Sep 03.
A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER
Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.
Fine Granularity Scalability in MPEG-4 Video by Weiping Li Presentation by Warren Cheung.
© Devi Parikh 2008 Devi Parikh and Tsuhan Chen Carnegie Mellon University April 3, ICASSP 2008 Bringing Diverse Classifiers to Common Grounds: dtransform.
IEEE International Conference on Multimedia and Expo.
Yannick FranckenChris HermansPhilippe Bekaert Hasselt University – tUL – IBBT Expertise Centre for Digital Media, Belgium
Predicting Consensus Ranking in Crowdsourced Setting Xi Chen Mentors: Paul Bennett and Eric Horvitz Collaborator: Kevyn Collins-Thompson Machine Learning.
Learning Kernel Classifiers 1. Introduction Summarized by In-Hee Lee.
Extracting Adaptive Contextual Cues From Unlabeled Regions Congcong Li +, Devi Parikh *, Tsuhan Chen + + Cornell University * Toyota Technological Institute.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
EE368 Final Project Spring 2003
Can Computer Algorithms Guess Your Age and Gender?
Image Segmentation Classify pixels into groups having similar characteristics.
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
A Tutorial on HOG Human Detection
Vehicle Segmentation and Tracking in the Presence of Occlusions
Multiple Instance Learning: applications to computer vision
Sign Language Recognition With Unsupervised Feature Learning
Presentation transcript:

© Devi Parikh 2008 Localization and Segmentation of 2D High Capacity Color Barcodes Gavin Jancke Microsoft Research, Redmond Devi Parikh Carnegie Mellon University

© Devi Parikh 2008 Motivation UPC Barcode QR CodeDatamatrix

© Devi Parikh 2008 HCCB Microsoft’s High Capacity Color Barcode

© Devi Parikh 2008 Application Uniquely identifying commercial audiovisual works such as motion pictures, video games, broadcasts, digital video recordings and other media

© Devi Parikh 2008 Goal Locate and Segment the barcode from consumer images

© Devi Parikh 2008 Overview  Design specifications of Microsoft’s HCCB  Approach  Localization  Segmentation  Progressive Strategy  Results  Conclusions

© Devi Parikh 2008 Microsoft’s HCCB 4 or 8 colors Triangles String of colors palette

© Devi Parikh 2008 Microsoft’s HCCB

© Devi Parikh 2008 Microsoft’s HCCB

© Devi Parikh 2008 Microsoft’s HCCB

© Devi Parikh 2008 Microsoft’s HCCB R rows S symbols per row S = (r+1)*R Aspect ratio: r

© Devi Parikh 2008 Approach Thresholding Orientation prediction Corner localization Row localization Symbol localization Color assignments Barcode localization Barcode segmentation point inside the barcode is known

© Devi Parikh 2008 Localization: Thresholding  Identify thick white band and row separators  Normalization  Adaptive

© Devi Parikh 2008 Localization: Orientation orientation distance summation

© Devi Parikh 2008 Localization: Corners  Rough estimates whiteness masknon-texture maskcombined mask

© Devi Parikh 2008 Localization: Corners  Gradient based refinement

© Devi Parikh 2008 Localization: Corners  Line based refinement

© Devi Parikh 2008 Segmentation: Rows Summation Flip?

© Devi Parikh 2008 Segmentation: Symbols S E Local search Number of symbols per row q(S,E) =  q(samples|S,E)

© Devi Parikh 2008 Segmentation: Colors Palette

© Devi Parikh 2008  Segmentation results given accurate localization  Satisfactory  Corner localization  Unsatisfactory  No one strategy works well on all images  However (1) Errors of different strategies are complementary (2) Results are verifiable with decoder in the loop! Observations

© Devi Parikh 2008 Progressive strategy  Hence – progressive strategy!  Similar to ensemble of weak classifiers  Or hypothesize-and-test  Multiple strategies:  Rough + gradient + line, or rough + line, or rough + gradient, or rough alone  Different values of threshold during rough corner detection  Total 12  Order of strategies

© Devi Parikh 2008 Results  Dataset of 500 images  Performance metric: % barcodes successfully decoded  Decoder model: Barcode successfully decoded if 80% of symbols are correctly identified

© Devi Parikh 2008 Results Allows for explicit trade-off between accuracy and computational time

© Devi Parikh 2008 Results

© Devi Parikh 2008 Results

© Devi Parikh 2008 Results

© Devi Parikh 2008 Results

© Devi Parikh 2008 Results

© Devi Parikh 2008 Results

© Devi Parikh 2008 Results

© Devi Parikh 2008 Results

© Devi Parikh 2008 Results

© Devi Parikh 2008 Results

© Devi Parikh 2008 Conclusions  2D High Capacity Color Barcode (HCCB)  Successful localization and segmentation of HCCB from consumer images  Varying densities, aspect ratios, lighting, color balance, image quality, etc.  Simple computer vision and image processing techniques  Progressive strategy

© Devi Parikh 2008 Acknowledgements Microsoft Research  Larry Zitnick  Andy Wilson  Zhengyou Zhang Carnegie Mellon University  Advisor: Tsuhan Chen

© Devi Parikh 2008 Thank you!