Scalable Multi-Label Annotation Jia Deng Olga Russakovsky Jonathan Krause, Michael Bernstein Alexander Berg Li Fei-Fei.

Slides:



Advertisements
Similar presentations
Números.
Advertisements

Panos Ipeirotis Stern School of Business
Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.
Elastic Hierarchies: Combining Treemaps and Node-Link Diagrams
Get Another Label? Improving Data Quality and Data Mining Using Multiple, Noisy Labelers New York University Stern School Victor Sheng Foster Provost Panos.
AGVISE Laboratories %Zone or Grid Samples – Northwood laboratory
Balanced Scorecard Report includes 7 day(s) from to Created by Expert_KPI at This report was generated with BSC Designer.
ALAK ROY. Assistant Professor Dept. of CSE NIT Agartala
Welfare Rights Training 2009 Employment & Support Allowance & Incapacity Benefit (For Youth) Taxable: Yes (Short Term Rate No) Means Tested: No (But.
Fill in missing numbers or operations
EuroCondens SGB E.
Worksheets.
STATISTICS Linear Statistical Models
Presented by The Coal Rush Revisited: R. W. Beck, Inc. IPED COAL POWER CONFERENCE January 18-19, 2007 St. Petersburg, FL Nicholas P. Guarriello An Economic.
Power of Evidence Review
Addition and Subtraction Equations
Disability status in Ethiopia in 1984, 1994 & 2007 population and housing sensus Ehete Bekele Seyoum ESA/STAT/AC.219/25.
By John E. Hopcroft, Rajeev Motwani and Jeffrey D. Ullman
AYP Changes for 2007 K-20 Videoconference June 11, 2007 Presented by: JoLynn Berge OSPI Federal Policy Coordinator.
CALENDAR.
2.11.
Andrew Smith Describing childhood diet with cluster analysis Young Statisticians meeting. 12th April 2011.
Identifying money correctly 1

The 5S numbers game..
突破信息检索壁垒 -SciFinder Scholar 介绍
Ming Mao, Jie Li, Marty Humphrey eScience Group CS Department, University of Virginia Grid 2010 – Oct 27, 2010.
A Fractional Order (Proportional and Derivative) Motion Controller Design for A Class of Second-order Systems Center for Self-Organizing Intelligent.
$100 $200 $300 $400 $100 $200 $300 $400 $100 $200 $300 $400 $100 $200 $300 $400 $100 $200 $300 $400.
The basics for simulations
Analysis of Large Scale Visual Recognition
Connecticut Mastery Test (CMT) and the Connecticut Academic Achievement Test (CAPT) Spring 2013 Presented to the Guilford Board of Education September.
Rewarding Crowdsourced Workers Panos Ipeirotis New York University and Google Joint work with: Jing Wang, Foster Provost, Josh Attenberg, and Victor Sheng;
MM4A6c: Apply the law of sines and the law of cosines.
National Association of Farm Broadcasters 2005 Rural Lifestyles Farms Research Commissioned by NAFB and conducted by AMR in August 2005, this report is.
Figure 3–1 Standard logic symbols for the inverter (ANSI/IEEE Std
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science TU München, Germany A Person and Context.
TCCI Barometer March “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”
Dynamic Access Control the file server, reimagined Presented by Mark on twitter 1 contents copyright 2013 Mark Minasi.
Statistics Review – Part I
Progressive Aerobic Cardiovascular Endurance Run
Evelyn CP School Foundation Stage Results (Specific Learning Goals – Reading, Writing and Number) 2013 Reading (Expected) 77% Writing (Expected) 43% Number.
Charging at 120 and 240 Volts 120-Volt Portable Vehicle Charge Cord 240-Volt Home Charge Unit.
2011 WINNISQUAM COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=1021.
Before Between After.
Benjamin Banneker Charter Academy of Technology Making AYP Benjamin Banneker Charter Academy of Technology Making AYP.
2011 FRANKLIN COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=332.
2.10% more children born Die 0.2 years sooner Spend 95.53% less money on health care No class divide 60.84% less electricity 84.40% less oil.
Foundation Stage Results CLL (6 or above) 79% 73.5%79.4%86.5% M (6 or above) 91%99%97%99% PSE (6 or above) 96%84%100%91.2%97.3% CLL.
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
Static Equilibrium; Elasticity and Fracture
Resistência dos Materiais, 5ª ed.
Solanco School District Preliminary Budget April 28, 2014.
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.
Lial/Hungerford/Holcomb/Mullins: Mathematics with Applications 11e Finite Mathematics with Applications 11e Copyright ©2015 Pearson Education, Inc. All.
Tom had a set of colored pencils. He saw that 15 of the pencils in the box were a shade of blue. Which decimal is equivalent to 15 ? A B C.
Chapter 8: Dialysis Providers 2014 ANNUAL DATA REPORT VOLUME 2: E ND -S TAGE R ENAL D ISEASE.
Chart Deception Main Source: How to Lie with Charts, by Gerald E. Jones Dr. Michael R. Hyman, NMSU.
Introduction Embedded Universal Tools and Online Features 2.
Incentivize Crowd Labeling under Budget Constraint
Schutzvermerk nach DIN 34 beachten 05/04/15 Seite 1 Training EPAM and CANopen Basic Solution: Password * * Level 1 Level 2 * Level 3 Password2 IP-Adr.
3 Small Comments Alex Berg Stony Brook University I work on recognition: features – action recognition – alignment – detection – attributes – hierarchical.
Recognition: A machine learning approach
Kuan-Chuan Peng Tsuhan Chen
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li and Li Fei-Fei Dept. of Computer Science, Princeton University, USA CVPR ImageNet1.
Recognition Using Visual Phrases
Modeling Annotator Accuracies for Supervised Learning
From Vision to Grasping: Adapting Visual Networks
Image Classification.
Presentation transcript:

Scalable Multi-Label Annotation Jia Deng Olga Russakovsky Jonathan Krause, Michael Bernstein Alexander Berg Li Fei-Fei

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird Task: Crowdsource object labels for images. Generalization: musical attributes of songs actions in movies sentiments in documents Application: Benchmarking, training, modeling Multi-label annotation Data Item Labels

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Current focus: 200 Category Detection (~100,000 fully labeled images) Large-Scale Visual Recognition Challenge ILSVRC

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Naïve approach: ask for each object cost: estimation: use the crowd-machine diagram show UI. TableChairHorseDogCatBird ?????? Answer Question MachineCrowd Is there a table? Yes

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Naïve approach: ask for each object cost: estimation: use the crowd-machine diagram show UI. TableChairHorseDogCatBird +????? Answer Question MachineCrowd Is there a table? Yes

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Naïve approach: ask for each object cost: estimation: use the crowd-machine diagram show UI. TableChairHorseDogCatBird ++???? Answer Question MachineCrowd Is there a chair? Yes

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Naïve approach: ask for each object cost: estimation: use the crowd-machine diagram show UI. TableChairHorseDogCatBird ++-??? Answer Question MachineCrowd Is there a horse? No

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Naïve approach: ask for each object cost: estimation: use the crowd-machine diagram show UI. TableChairHorseDogCatBird ++--?? Answer Question MachineCrowd Is there a dog? No

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Naïve approach: ask for each object cost: estimation: use the crowd-machine diagram show UI. TableChairHorseDogCatBird ++---? Answer Question MachineCrowd Is there a cat? No

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Naïve approach: ask for each object cost: estimation: use the crowd-machine diagram show UI. TableChairHorseDogCatBird Answer Question MachineCrowd Is there a bird? No

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird Naïve approach: ask for each object Cost: O(NK) for N images and K objects

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird Furniture Mammal Animal Hierarchy

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird Sparsity Correlation Furniture Mammal Animal Hierarchy

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird ?????? Answer Question Machine Crowd Furniture Mammal Animal Better approach: exploit label structure

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird ?????? Answer Question Machine Crowd Is there an animal? No Furniture Mammal Animal Better approach: exploit label structure

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird ??---- Answer Question Machine Crowd Is there an animal? No Furniture Mammal Animal Better approach: exploit label structure

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird ??---- Answer Question Machine Crowd Is there furniture? Yes Mammal Better approach: exploit label structure Animal Furniture

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Answer Question Machine Crowd Is there a table? Yes Mammal Better approach: exploit label structure Animal TableChairHorseDogCatBird ??---- Furniture

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird +?---- Answer Question Machine Crowd Is there a chair? Yes Mammal Better approach: exploit label structure Animal Furniture

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Answer Question Machine Crowd Is there a chair? Yes Mammal Better approach: exploit label structure Animal TableChairHorseDogCatBird Furniture

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Selecting the Right Question Goal: Get as much utility (new labels) as possible, for as little cost (worker time) as possible, given a desired level of accuracy

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Accuracy constraint User-specified accuracy threshold, e.g., 95% Majority voting assuming uniform worker quality [GAL: Sheng, Provost, Ipeirotis KDD ‘08] Might require only one worker, might require several based on the task

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Cost: worker time (time = money) Question (is there …)Cost (second) a thing used to open cans/bottles14.4 an item that runs on electricity (plugged in or using batteries) 12.6 a stringed instrument3.4 a canine2.0 expected human time to get an answer with 95% accuracy

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Utility: expected # of new labels TableChairHorseDogCatBird ?????? Is there a table? Yes No TableChairHorseDogCatBird +????? TableChairHorseDogCatBird -????? utility = 1

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Utility: expected # of new labels TableChairHorseDogCatBird ?????? Is there a table? Yes No TableChairHorseDogCatBird +????? TableChairHorseDogCatBird -????? TableChairHorseDogCatBird ?????? Is there an animal? TableChairHorseDogCatBird ?????? TableChairHorseDogCatBird ??---- utility = 1 utility = 0.5 * * 4 = 2 Pr(Y) = 0.5 Pr(N) = 0.5

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Pick the question with the most labels per second Query: Is there a... Utility (num labels) Cost (worker time in secs) Utility-Cost Ratio (labels per sec) mammal with claws or fingers living organism mammal creature without legs land or avian creature Selecting the Right Question

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Dataset: 20K images from ImageNet Challenge Labels: 200 basic categories (dog, cat, table…), 64 internal nodes in hierarchy Results

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Dataset: 20K images from ImageNet Challenge Labels: 200 basic categories (dog, cat, table…), 64 internal nodes in hierarchy Setup: – training test split – Estimate parameters on training, simulate on test – Future work: online estimation Results

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Results: accuracy Accuracy Threshold per question (parameter) Accuracy (F1 score) Naïve approach Accuracy (F1 score) Our approach (75.67)99.75 (76.97) (60.17)99.62 (60.69) Annotating 10K images with 200 objects

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Results: cost Accuracy Threshold per question (parameter) Cost saving (our approach compared to naïve approach) x x Annotating 10K images with 200 objects

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Results: cost Accuracy Threshold per question (parameter) Cost saving (our approach compared to naïve approach) x x Annotating 10K images with 200 objects 6 times more labels per second

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Speeds up crowdsourced multi-label annotation by exploiting the structure and distribution of labels. Could be a bargain for you! TableChairHorseDogCatBird Furniture Mamma l Animal Hierarchy Correlation Sparsity Conclusions

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei

TableChairHorseDogCatBird ?????? Answer Question MachineCrowd Is there an animal? No Furniture Mammal Animal Better approach: exploiting hierarchy, sparsity and correlation

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird ??---- Answer Question MachineCrowd Is there an animal? No Furniture Mammal Animal Better Approach

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird ??---- Answer Question MachineCrowd Is there furniture? Yes Furniture Mammal Animal Better Approach

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird +?---- Answer Question MachineCrowd Is there a table? Yes Furniture Mammal Animal Better Approach

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei TableChairHorseDogCatBird Answer Question MachineCrowd Is there a chair? Yes Furniture Mammal Animal Better Approach

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Algorithm Initialize all labels to ‘missing’ While there are missing labels do – Select a question Q from all possible questions – Obtain an answer A to question Q from the crowd – Set values of some labels to +1 or -1

Deng, Russakovsky, Krause, Bernstein, Berg, Fei-Fei Algorithm Initialize all labels to ‘missing’ While there are missing labels do – Select a question Q from all possible questions – Obtain an answer A to question Q from the crowd – Set values of some labels to +1 or -1