How to Evaluate Foreground Maps ?

Slides:

Advertisements

Similar presentations

Recognizing Human Actions by Attributes CVPR2011 Jingen Liu, Benjamin Kuipers, Silvio Savarese Dept. of Electrical Engineering and Computer Science University.

Advertisements

Viral Marketing – Learning Influence Probabilities.

Chapter 4 Pattern Recognition Concepts: Introduction & ROC Analysis.

Learning Algorithm Evaluation

Capital Budgeting Processes And Techniques

Evaluation of segmentation. Example Reference standard & segmentation.

© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Other Classification Techniques 1.Nearest Neighbor Classifiers 2.Support Vector Machines.

The Ethics of Image Analysis Martin Peterson,TU/e.

S I E M E N S C O R P O R A T E R E S E A R C H 1 1 A Seeded Image Segmentation Framework Unifying Graph Cuts and Random Walker Which Yields A New Algorithm.

Modeling Relationship Strength in Online Social Networks Rongjian Xiang 1, Jennifer Neville 1, Monica Rogati 2 1 Purdue University, 2 LinkedIn WWW 2010.

What Makes a Patch Distinct?

Biased Normalized Cuts 1 Subhransu Maji and Jithndra Malik University of California, Berkeley IEEE Conference on Computer Vision and Pattern Recognition.

Extraction of Vessels from X-Ray Angiograms Titus Rosu Prof. Dr. Rupert Lasser Andreas Keil.

Assessing and Comparing Classification Algorithms Introduction Resampling and Cross Validation Measuring Error Interval Estimation and Hypothesis Testing.

Model Evaluation Metrics for Performance Evaluation

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.

A Study of Approaches for Object Recognition

Abstract We present a model of curvilinear grouping using piecewise linear representations of contours and a conditional random field to capture continuity.

1 I256: Applied Natural Language Processing Marti Hearst Sept 27, 2006.

Efficient fault-tolerant scheme based on the RSA system Author: N.-Y. Lee and W.-L. Tsai IEE Proceedings Presented by 詹益誌 2004/03/02.

Decision Theory Naïve Bayes ROC Curves

Interactive Matting Christoph Rhemann Supervised by: Margrit Gelautz and Carsten Rother.

Optimized Numerical Mapping Scheme for Filter-Based Exon Location in DNA Using a Quasi-Newton Algorithm P. Ramachandran, W.-S. Lu, and A. Antoniou Department.

An Iterative Optimization Approach for Unified Image Segmentation and Matting Hello everyone, my name is Jue Wang, I’m glad to be here to present our paper.

Image Subtraction for Real Time Moving Object Extraction Shahbe Mat Desa, Qussay A. Salih, CGIV’04.

Computer-Aided Diagnosis of Solid Breast Nodules: Use of an Artificial Neural Network Based on Multiple Sonographic Features Segyeong Joo, Yoon Seok Yang,

Introduction to Machine Learning Approach Lecture 5.

Virginia Image and Video Analysis Pavement conditions, fighter jets, and psychophysics! a.k.a. “Did you hear that???” Andrea Vaccari.

Evaluation of Image Retrieval Results Relevant: images which meet user’s information need Irrelevant: images which don’t meet user’s information need Query:

Tomer Sagi and Avigdor Gal Technion - Israel Institute of Technology Non-binary Evaluation for Schema Matching ER 2012 October 2012, Florence.

Face Alignment Using Cascaded Boosted Regression Active Shape Models

Identifying Computer Graphics Using HSV Model And Statistical Moments Of Characteristic Functions Xiao Cai, Yuewen Wang.

Link Reconstruction from Partial Information Gong Xiaofeng, Li Kun & C. H. Lai

Abstract This poster presents results of three studies dealing with application of ARTMAP neural networks for classification of remotely sensed multispectral.

Data Analysis 1 Mark Stamp. Topics  Experimental design o Training set, test set, n-fold cross validation, thresholding, imbalance, etc.  Accuracy o.

When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.

Video Segmentation Prepared By M. Alburbar Supervised By: Mr. Nael Abu Ras University of Palestine Interactive Multimedia Application Development.

Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.

Stable Multi-Target Tracking in Real-Time Surveillance Video

Computer-based identification and tracking of Antarctic icebergs in SAR images Department of Geography, University of Sheffield, 2004 Computer-based identification.

Model Evaluation l Metrics for Performance Evaluation –How to evaluate the performance of a model? l Methods for Performance Evaluation –How to obtain.

VIP: Finding Important People in Images Clint Solomon Mathialagan Andrew C. Gallagher Dhruv Batra CVPR

Corpus-based evaluation of Referring Expression Generation Albert Gatt Ielka van der Sluis Kees van Deemter Department of Computing Science University.

A Critique and Improvement of an Evaluation Metric for Text Segmentation A Paper by Lev Pevzner (Harvard University) Marti A. Hearst (UC, Berkeley) Presented.

Non-Ideal Iris Segmentation Using Graph Cuts

Threshold Setting and Performance Monitoring for Novel Text Mining Wenyin Tang and Flora S. Tsai School of Electrical and Electronic Engineering Nanyang.

Paper Title Authors names Conference and Year Presented by Your Name Date.

Irfan Ullah Department of Information and Communication Engineering Myongji university, Yongin, South Korea Copyright © solarlits.com.

Classifying Covert Photographs CVPR 2012 POSTER. Outline  Introduction  Combine Image Features and Attributes  Experiment  Conclusion.

On Using SIFT Descriptors for Image Parameter Evaluation Authors: Patrick M. McInerney 1, Juan M. Banda 1, and Rafal A. Angryk 2 1 Montana State University,

Classification Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 24, 2015.

ISA Kim Hye mi. Introduction Input Spectrum data (Protein database) Peptide assignment Peptide validation manual validation PeptideProphet.

Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.

Evaluation of Gender Classification Methods with Automatically Detected and Aligned Faces Speaker: Po-Kai Shen Advisor: Tsai-Rong Chang Date: 2010/6/14.

7. Performance Measurement

Evaluating Classifiers

Real-Time Soft Shadows with Adaptive Light Source Sampling

A Malware Similarity Testing Framework

Structure-measure: A New Way to Evaluate Foreground Maps

Enhanced-alignment Measure for Binary Foreground Map Evaluation

Data Mining Classification: Alternative Techniques

Structure-measure: A New Way to Evaluate Foreground Maps

Text Detection in Images and Video

Dingding Liu* Yingen Xiong† Linda Shapiro* Kari Pulli†

By: Mohammad Qudeisat Supervisor: Dr. Francis Lilley

Enhanced-alignment Measure for Binary Foreground Map Evaluation

More on Maxent Env. Variable importance:

Presentation transcript:

How to Evaluate Foreground Maps ? CVPR2014 Poster

Outline Introduction Limitation of Current Measures Solution Experiment Conclusions

Introduction The comparison of a foreground map against a binary ground-truth is common in various computer-vision problems salient object detection object segmentation foreground-extraction Several measures have been suggested to evaluate the accuracy of these foreground maps. AUC measure AP measure F-measure PASCAL First, multiple thresholds are applied to it, to obtain multiple binary maps. Then, these binary maps are compared to the ground-truth.

Introduction But the most commonly-used measures for evaluating both non-binary maps and binary maps do not always provide a reliable evaluation. [9] K. Chang, T. Liu, H. Chen, and S. Lai. Fusing generic objectness and visual saliency for salient object detection. In ICCV, pages 914–921, 2011. [12] S. Goferman, L. Zelnik-Manor, and A. Tal. Context-aware saliency detection. In CVPR, 2010. [13] H. Jiang, J. Wang, Z. Yuan, T. Liu, N. Zheng, and S. Li. Automatic salient object segmentation based on context and shape prior. In BMVC, volume 3, page 7, 2012.

Introduction Our contributions: Identifying three assumptions in commonly-used measures. We proceed to amend each of these flaws and to suggest a novel measure that evaluates foreground maps at an increased accuracy . Proposing four meta-measures to analyze the performance of evaluation measures. 三個主要貢獻

Introduction Two appealing properties of our measure are: being a generalization of the FB –measure providing a unified evaluation to both binary and non-binary maps.

Limitation of Current Measures Three flawed assumptions : Interpolation flaw Dependency flaw Equal-important flaw

Limitation of Current Measures Current Evaluation Measures Evaluation of binary maps: 4 basic quantities : TP (true-positive) TN (true-negative) FP (false-positive) FN (false-negative)

Limitation of Current Measures Current Evaluation Measures Evaluation of binary maps: Common score : TPR= FPR= 區分binary map 與 non-binary map Binary map 0 or 1 Non-binary map [ 0, 1] ->屬於前景的機率

Limitation of Current Measures Current Evaluation Measures Evaluation of non-binary maps: AUC (Area-Under-the-Curve) AP (Average-Precision) Image Source: http://zh.wikipedia.org/wiki/File:Curvas.png

Interpolation flaw The source of the interpolation flaw is the thresholding of the non-binary maps. Both AUC and AP assume that the interpolated curve (between binary maps) is a valid tool for evaluating non-binary maps. Should be better than (b) , but their scores are the same . Since both AUC and AP rely solely on the interpolated curve, ignoring the distribution of points along the curves, they deem (b) as perfect as (a)

Dependency flaw dependency between false-negatives Current measures assume that the pixels are independent of each other. Fig4. (a)FP集中 ,(b)散在true-positive中 are not of the same quality and should not receive the same score.

Equal-important flaw the location of the false-positives all erroneous detections have equal importance.

Solution Resolving the Interpolation Flaw Resolving the Dependency Flaw & the Equal-Importance Flaw The New Measure – -measure

Resolving the Interpolation Flaw The key idea is to extend the four basic quantities: TP, TN, FP and FN , to deal with non-binary values. G1xN : the column-stack representation of the binary ground-truth, where N is the number of pixels in the image. D1xN : the non-binary map to be evaluated against the ground-truth.

Resolving the Interpolation Flaw For binary map, pixel i correct G[ i ] = D[ i ] incorrect G[ i ] ≠ D[ i ] For non-binary Note that when D is binary, these definitions are identical to the conventional ones.

Resolving the Dependency Flaw & the Equal-Importance Flaw Assumptions deal with detection errors. Our key idea is to attribute different importance to different errors. Reformulate the basic quantities:

Resolving the Dependency Flaw & the Equal-Importance Flaw We suggest applying a weighting function to the errors. ANxN : captures the dependency between pixels BNx1 : represents the varying importance of the pixels

Resolving the Dependency Flaw & the Equal-Importance Flaw

Resolving the Dependency Flaw & the Equal-Importance Flaw Reformulate the basic quantities with weight: Note that when independency and equal-importance are assumed (i.e. A = I and B = 1), these definitions are identical to the conventional ones.

The New Measure – -measure Having dealt with all three flaws, we proceed to construct our evaluation measure.

Experiments Meta-measure : The ranking of an evaluation measure should agree with the preferences of an application that uses the map as input. A measure should prefer a good result by an algorithm that considers the content of the image, over an arbitrary map.

Experiments meta-measure : The score of a map should decrease when using a wrong ground-truth map. The ranking of an evaluation measure should not be sensitive to inaccuracies in the manually marked boundaries in the ground-truth maps.

Experiments :Meta-measure(1) Application Ranking

Experiments :Meta-measure(2) State-of-art vs. Generic

Experiments :Meta-measure(3) Ground-truth Switch

Experiments :Meta-measure(4) Annotation errors

Conclusions We analyzed the currently-used evaluation measures that suffer from three flawed assumptions: interpolation, dependency and equal-importance. We suggested an evaluation measure that amends these assumptions, and it offers a unified solution to the evaluation of non-binary and binary maps. The advantages of our measure were shown via four different meta-measures, both qualitatively and quantitatively.