Visual Recognition With Humans in the Loop Steve Branson Catherine Wah Florian Schroff Boris Babenko Serge Belongie Peter Welinder Pietro Perona ECCV 2010,

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

2 Casa 15m Perspectiva Lateral Izquierda.
1 A B C
AGVISE Laboratories %Zone or Grid Samples – Northwood laboratory
Simplifications of Context-Free Grammars
Variations of the Turing Machine
AP STUDY SESSION 2.
1
Select from the most commonly used minutes below.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Copyright © 2013 Elsevier Inc. All rights reserved.
Objectives: Generate and describe sequences. Vocabulary:
David Burdett May 11, 2004 Package Binding for WS CDL.
Local Customization Chapter 2. Local Customization 2-2 Objectives Customization Considerations Types of Data Elements Location for Locally Defined Data.
Create an Application Title 1Y - Youth Chapter 5.
CALENDAR.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt BlendsDigraphsShort.
The 5S numbers game..
Photo Slideshow Instructions (delete before presenting or this page will show when slideshow loops) 1.Set PowerPoint to work in Outline. View/Normal click.
Media-Monitoring Final Report April - May 2010 News.
Welcome. © 2008 ADP, Inc. 2 Overview A Look at the Web Site Question and Answer Session Agenda.
Break Time Remaining 10:00.
This module: Telling the time
The basics for simulations
EE, NCKU Tien-Hao Chang (Darby Chang)
Turing Machines.
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
Bellwork Do the following problem on a ½ sheet of paper and turn in.
1 The Royal Doulton Company The Royal Doulton Company is an English company producing tableware and collectables, dating to Operating originally.
Operating Systems Operating Systems - Winter 2010 Chapter 3 – Input/Output Vrije Universiteit Amsterdam.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
TESOL International Convention Presentation- ESL Instruction: Developing Your Skills to Become a Master Conductor by Beth Clifton Crumpler by.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
Adding Up In Chunks.
FAFSA on the Web Preview Presentation December 2013.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
1 Termination and shape-shifting heaps Byron Cook Microsoft Research, Cambridge Joint work with Josh Berdine, Dino Distefano, and.
Artificial Intelligence
1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.
Before Between After.
Subtraction: Adding UP
Bell Busters! Unit 1 #1-61. Purposes of Government 1. Purposes of government 2. Preamble to the Constitution 3. Domestic tranquility 4. Common defense.
: 3 00.
5 minutes.
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Speak Up for Safety Dr. Susan Strauss Harassment & Bullying Consultant November 9, 2012.
1 Titre de la diapositive SDMO Industries – Training Département MICS KERYS 09- MICS KERYS – WEBSITE.
Essential Cell Biology
Converting a Fraction to %
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
Clock will move after 1 minute
famous photographer Ara Guler famous photographer ARA GULER.
PSSA Preparation.
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.
Physics for Scientists & Engineers, 3rd Edition
Energy Generation in Mitochondria and Chlorplasts
Select a time to count down from the clock above
Murach’s OS/390 and z/OS JCLChapter 16, Slide 1 © 2002, Mike Murach & Associates, Inc.
Copyright Tim Morris/St Stephen's School
1.step PMIT start + initial project data input Concept Concept.
1 Dr. Scott Schaefer Least Squares Curves, Rational Representations, Splines and Continuity.
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
CS 1699: Intro to Computer Vision Active Learning Prof. Adriana Kovashka University of Pittsburgh November 24, 2015.
Presentation transcript:

Visual Recognition With Humans in the Loop Steve Branson Catherine Wah Florian Schroff Boris Babenko Serge Belongie Peter Welinder Pietro Perona ECCV 2010, Crete, Greece 1

What type of bird is this? 2

3 …? Field Guide

What type of bird is this? 4 Computer Vision ?

What type of bird is this? 5 Bird? Computer Vision

What type of bird is this? 6 Chair? Bottle? Computer Vision

Parakeet Auklet 7 Field guides difficult for average users Computer vision doesnt work perfectly (yet) Research mostly on basic- level categories

Visual Recognition With Humans in the Loop Parakeet Auklet What kind of bird is this? 8

Levels of Categorization Airplane? Chair? Bottle? … Basic-Level Categories 9 [Griffin et al. 07, Lazebnik et al. 06, Grauman et al. 06, Everingham et al. 06, Felzenzwalb et al. 08, Viola et al. 01, … ]

Levels of Categorization American Goldfinch? Indigo Bunting? … Subordinate Categories 10 [Belhumeur et al. 08, Nilsback et al. 08, …]

Levels of Categorization Yellow Belly? Blue Belly?… Parts and Attributes 11 [Farhadi et al. 09, Lampert et al. 09, Kumar et al. 09]

Visual 20 Questions Game Blue Belly? no Cone-shaped Beak? yes Striped Wing? yes American Goldfinch? yes Hard classification problems can be turned into a sequence of easy ones 12

Recognition With Humans in the Loop Computer Vision Cone-shaped Beak? yes American Goldfinch? yes Computer Vision Computers: reduce number of required questions Humans: drive up accuracy of vision algorithms 13

Research Agenda Heavy Reliance on Human Assistance More Automated Computer Vision Improves Blue belly? no Cone-shaped beak? yes Striped Wing? yes American Goldfinch? yes Striped Wing? yes American Goldfinch? yes Fully Automatic American Goldfinch? yes

Field Guides 15

Field Guides 16

Example Questions 17

Example Questions 18

Example Questions 19

Example Questions 20

Example Questions 21

Example Questions 22

Basic Algorithm Input Image ( ) Question 1: Is the belly black? Question 2: Is the bill hooked? Computer Vision A: NO A: YES Max Expected Information Gain … 23

Without Computer Vision Input Image ( ) Question 1: Is the belly black? Question 2: Is the bill hooked? Class Prior A: NO A: YES Max Expected Information Gain … 24

Basic Algorithm Select the next question that maximizes expected information gain: Easy to compute if we can to estimate probabilities of the form: Object Class Image Sequence of user responses 25

Basic Algorithm Model of user responses Computer vision estimate Normalization factor 26

Basic Algorithm Model of user responses Computer vision estimate Normalization factor 27

Modeling User Responses Assume: Estimate using Mechanical Turk grey red black white brown blue grey red black white brown blue grey red black white brown blue Definitely Probably Guessing What is the color of the belly? Pine Grosbeak 28

Incorporating Computer Vision Use any recognition algorithm that can estimate: p(c|x) We experimented with two simple methods: 1-vs-all SVMAttribute-based classification [Lampert et al. 09, Farhadi et al. 09] 29

Incorporating Computer Vision [Vedaldi et al. 08, Vedaldi et al. 09 ] Self Similarity Color Histograms Color Layout Bag of Words Spatial Pyramid Geometric Blur Color SIFT, SIFT Multiple Kernels Used VLFeat and MKL code + color features 30

Birds 200 Dataset 200 classes, images, 288 binary attributes Why birds? Black-footed Albatross Groove-Billed Ani Parakeet AukletField SparrowVesper Sparrow Arctic TernForsters TernCommon Tern Bairds Sparrow Henslows Sparrow 31

Birds 200 Dataset 200 classes, images, 288 binary attributes Why birds? Black-footed Albatross Groove-Billed Ani Parakeet AukletField SparrowVesper Sparrow Arctic TernForsters TernCommon Tern Bairds Sparrow Henslows Sparrow 32

Birds 200 Dataset 200 classes, images, 288 binary attributes Why birds? Black-footed Albatross Groove-Billed Ani Parakeet AukletField SparrowVesper Sparrow Arctic TernForsters TernCommon Tern Bairds Sparrow Henslows Sparrow 33

Results: Without Computer Vision Comparing Different User Models 34

Results: Without Computer Vision if users answers agree with field guides… Perfect Users: 100% accuracy in 8log 2 (200) questions 35

Results: Without Computer Vision Real users answer questions MTurkers dont always agree with field guides… 36

Results: Without Computer Vision Real users answer questions MTurkers dont always agree with field guides… 37

Results: Without Computer Vision Probabilistic User Model: tolerate imperfect user responses 38

Results: With Computer Vision 39

Results: With Computer Vision Users drive performance: 19% 68% Just Computer Vision 19% 40

Results: With Computer Vision Computer Vision Reduces Manual Labor: questions 41

Examples Without computer vision: Q #1: Is the shape perching-like? no (Def.) With computer vision: Q #1: Is the throat white? yes (Def.) Western Grebe Different Questions Asked w/ and w/out Computer Vision perching-like 42

Examples computer vision Magnolia Warbler User Input Helps Correct Computer Vision Is the breast pattern solid? no (definitely) Common Yellowthroat Magnolia Warbler Common Yellowthroat 43

Recognition is Not Always Successful Acadian Flycatcher Least Flycatcher Parakeet Auklet Least Auklet Is the belly multi- colored? yes (Def.) Unlimited questions 44

Summary questions Computer vision reduces manual labor Users drive up performance 19% 45 Recognition of fine-grained categories More reliable than field guides

Summary questions Computer vision reduces manual labor Users drive up performance 19% 46 Recognition of fine-grained categories More reliable than field guides

Summary questions Computer vision reduces manual labor Users drive up performance 19% 47 Recognition of fine-grained categories More reliable than field guides

Summary questions Computer vision reduces manual labor Users drive up performance 19% 48 Recognition of fine-grained categories More reliable than field guides

Future Work Extend to domains other than birds Methodologies for generating questions Improve computer vision 49

Questions? Project page and datasets available at:

Categories of Recognition Easy for Humans Airplane? Chair? Bottle? … Hard for Humans American Goldfinch? Indigo Bunting?… Easy for Humans Yellow Belly? Blue Belly?… Basic-LevelSubordinateParts & Attributes Hard for computers 51

Related Work 20 Questions Game [20q.net] oMoby [IQEngines.com] Many Others: Crowdsourcing, Information Theory, Relevance Feedback, Active Learning, Expert Systems, … Field Guides [whabird.com] Botanists Electronic Field Guide [Belhumeur et al. 08] Oxford Flowers [Nilsback et al. 08] [Lampert et al. 09] [Farhadi et al. 09] [Kumar et al. 09] Attributes 52

Traditional Approach to Object Recognition Research Easy ProblemMore Difficult Problem Computer Vision Improves Stuck on basic-level categories Performance still too low for practical application 53

Missing Attributes Indigo Bunting Blue Grosbeak 54

Overview Solve difficult vision problems Leverage existing computer vision Make field guides more automated and reliable 55

Research Agenda 56 Reliance on Computers Reliance on Humans (# questions)

Research Agenda 57 Reliance on Computers Reliance on Humans (# questions) Field guides No use of computer vision

Research Agenda 58 Reliance on Computers Reliance on Humans (# questions) This paper

Research Agenda 59 Reliance on Computers Reliance on Humans (# questions) Computer vision improves Fewer questions required

Research Agenda 60 Reliance on Computers Reliance on Humans (# questions) Computer vision solved Fully automatic system