Presentation transcript:

Rethinking the ESP Game Stephen Robertson, Milan Vojnovic, Ingmar Weber* Microsoft Research & Yahoo! Research *This work was done while I was a visiting researcher at MSRC.

The ESP Game – Live Demo
Show it live (2 min). Alternative version.

The ESP Game – Summary
– Two players try to agree on a label to be added to an image
– No way to communicate: entered labels are only revealed at the end
– Known labels are “off-limits”
– ESP refers to “extrasensory perception”: read the other person’s mind

The ESP Game – History
– Developed by Luis von Ahn and Laura Dabbish at CMU in 2004
– Goal: improve image search
– Licensed by Google in 2006
– A prime example of harvesting human intelligence for difficult tasks
– Many variants (music, shapes, …)

The ESP Game – Strengths and Weaknesses
Strengths
– Creative approach to a hard problem
– Fun to play
– Vast majority of labels are appropriate
– Difficult to spam
– Powerful idea: reaching consensus with little or no communication

The ESP Game – Strengths and Weaknesses
Weaknesses
– The ultimate objective is ill-defined
– Finds mostly general labels
– There are already millions of images for these
– “Lowest common denominator” problem
– Human time is used sub-optimally

A “Robot” Playing the ESP Game
Video of recorded play.

The ESP Game – Labels are Predictable
– Synonyms are redundant: “guy” => “man” for 81% of images
– Co-occurrence reduces “new” information: “clouds” => “sky” for 68% of images
– Colors are easy to agree on: “black” is 3.3% of all occurrences

How to Predict the Next Label
T = {“beach”, “water”}, next label t = ??

How to Predict the Next Label
Want to know:
– P(“blue” next label | {“beach”, “water”})
– P(“car” next label | {“beach”, “water”})
– P(“sky” next label | {“beach”, “water”})
– P(“bcn” next label | {“beach”, “water”})
Problem of data sparsity!

How to Predict the Next Label
Want to know: P(“t” next label | T) = P(T | “t” next label) · P(“t”) / P(T)
Use conditional independence …
Give a random topic to two people. Ask them each to think of 3 related terms.
Bayes’ Theorem: P(A,B) = P(A|B) · P(B) = P(B|A) · P(A)

Conditional Independence
[Figure: two word clouds, one per player p1 and p2 – terms suggested for the topic “Spain” (Madrid, sun, paella, beach, soccer, flamenco) and for the topic “blue” (sky, water, eyes, azul, blau, bleu).]
P(A,B|C) = P(A|C) · P(B|C)
P(“p1: sky”, “p2: azul” | “blue”) = P(“p1: sky” | “blue”) · P(“p2: azul” | “blue”)
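To make the assumption concrete, a tiny Python illustration (all probabilities are invented, not from the paper): given the hidden topic, the probability that the two players suggest a particular pair of words factorizes into a product of per-player probabilities.

```python
# Toy illustration of the conditional-independence (C.I.) assumption.
# All probabilities below are invented for illustration.
p_word_given_blue = {"sky": 0.4, "water": 0.3, "azul": 0.2, "eyes": 0.1}

# Under C.I.: P(p1 says "sky", p2 says "azul" | topic = "blue")
#           = P("sky" | "blue") * P("azul" | "blue")
joint = p_word_given_blue["sky"] * p_word_given_blue["azul"]
print(joint)  # 0.08
```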

How to Predict the Next Label
P({s1, s2} | “t”) · P(“t”) / P(T) = P(s1 | “t”) · P(s2 | “t”) · P(“t”) / P(T)
P(s | “t”) will still be zero very often → smoothing:
P(s | “t”) = (1 − λ) P(s | “t”) + λ P(s)   (non-zero background probability)
The C.I. assumption is violated in practice, but “close enough”.

How to Predict the Next Label
P(“t” next label | T already present) = ∏_{s ∈ T} P(s | “t”) · P(“t”) / C, where C is a normalizing constant
λ chosen using a “validation set”; λ = 0.85 in the experiments.
Model trained on ~13,000 tag sets.
Also see: Naïve Bayes classifier (conditional independence assumption + Bayes’ Theorem).
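A minimal sketch of this predictor in Python, assuming the formulation above (smoothed naïve Bayes over co-occurring labels). The toy tag sets, helper names, and candidate set are made up for illustration; this is not the authors’ implementation.

```python
from collections import Counter, defaultdict

# Toy training data: each tag set is the list of labels one image received.
# The real model was trained on ~13,000 tag sets; these four are invented.
tag_sets = [
    ["beach", "water", "sky", "blue"],
    ["beach", "sand", "sky"],
    ["car", "street", "red"],
    ["water", "blue", "sky"],
]

LAMBDA = 0.85  # weight of the background distribution, as on the slide

# Background label frequencies P(t) and co-occurrence counts for P(s | t).
label_counts = Counter(t for ts in tag_sets for t in ts)
total_labels = sum(label_counts.values())
cooc = defaultdict(Counter)
for ts in tag_sets:
    for t in ts:
        for s in ts:
            if s != t:
                cooc[t][s] += 1

def p_background(s):
    return label_counts[s] / total_labels

def p_s_given_t(s, t):
    # Smoothed estimate: (1 - lambda) * ML co-occurrence estimate + lambda * background.
    ml = cooc[t][s] / sum(cooc[t].values()) if cooc[t] else 0.0
    return (1 - LAMBDA) * ml + LAMBDA * p_background(s)

def predict_next(T, candidates):
    """P(t next | T) is proportional to P(t) * prod_{s in T} P(s | t); normalize over candidates."""
    scores = {}
    for t in candidates:
        if t in T:
            continue
        score = p_background(t)
        for s in T:
            score *= p_s_given_t(s, t)
        scores[t] = score
    norm = sum(scores.values()) or 1.0
    return {t: round(v / norm, 3) for t, v in scores.items()}

print(predict_next({"beach", "water"}, candidates=list(label_counts)))
```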

Experimental Results: Part 1
– Games played: 205; images encountered: 1,335; images w/ OLT (off-limits terms): 1,105
– Percentage w/ match: all images 69%; only images with OLTs 81%; all entered tags 17%
– Av. number of labels entered: per image 4.1; per game 26.7
– Agreement index: mean 2.6; median 2.0
The “robot” plays reasonably well. The “robot” plays human-like.

Quantifying “Predictability” and “Information”
So, labels are fairly predictable. But how can we quantify “predictability”?

Quantifying “Predictability” and “Information”
Examples:
– “sunny” vs. “cloudy” tomorrow in BCN
– The roll of a cubic die
– The next single letter in “barcelo*”
– The next single letter in “re*”
– The clicked search result for “yahoo research”

Entropy and Information
An event occurring with probability p corresponds to an information of −log2(p) bits …
… the number of bits required to encode it in an optimally compressed encoding.
Example: compressed weather forecast:
– P(“sunny”) = 0.5 → “0” (1 bit)
– P(“cloudy”) = 0.25 (2 bits)
– P(“rain”) = 0.125 (3 bits)
– P(“thunderstorm”) = 0.125 (3 bits)
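A small sketch of the −log2(p) information measure applied to the weather example above (probabilities as reconstructed from the stated code lengths):

```python
import math

def info_bits(p):
    """Information (surprise) of an event with probability p: -log2(p) bits."""
    return -math.log2(p)

# Weather-forecast example (probabilities reconstructed from the stated code lengths).
forecast = {"sunny": 0.5, "cloudy": 0.25, "rain": 0.125, "thunderstorm": 0.125}
for outcome, p in forecast.items():
    print(f"{outcome}: {info_bits(p):g} bit(s)")

# Expected code length = Shannon entropy of the distribution (see the next slides).
entropy = sum(p * info_bits(p) for p in forecast.values())
print(f"expected bits per forecast: {entropy}")  # 1.75
```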

Entropy and Information
– p = 1 → 0 bits of information (e.g. a cubic die showed a number in [1,6])
– p ≈ 0 → many, many bits of information (e.g. the numbers for the lottery)
“Information” = “amount of surprise”

Entropy and Information
Expected information for p_1, p_2, …, p_n: Σ_i −p_i · log2(p_i) = (Shannon) entropy.
We might not know the true p_1, p_2, …, p_n, but believe they are p̂_1, p̂_2, …, p̂_n. Then, w.r.t. p̂, you observe Σ_i −p_i · log2(p̂_i), which is minimized for p̂ = p.
Here p̂ is given by the earlier model; p is then observed.
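A short sketch of the comparison described above, with illustrative distributions: coding events from p with a model p̂ costs Σ_i −p_i · log2(p̂_i) bits (the cross-entropy), which is smallest when p̂ = p.

```python
import math

def cross_entropy(p, p_hat):
    """Expected bits when outcomes follow p but are coded with model p_hat: sum_i -p_i * log2(p_hat_i)."""
    return sum(-pi * math.log2(qi) for pi, qi in zip(p, p_hat))

p = [0.5, 0.25, 0.125, 0.125]            # observed ("true") distribution
perfect_model = [0.5, 0.25, 0.125, 0.125]
uniform_model = [0.25, 0.25, 0.25, 0.25]

print(cross_entropy(p, perfect_model))  # 1.75 bits = entropy of p (the minimum)
print(cross_entropy(p, uniform_model))  # 2.0 bits: a mismatched model costs extra bits
```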

Experimental Results: Part 2
[Plot: average information per position of a label in the tag set – later labels are more predictable. For reference: equidistribution = 12.3 bits; “static” distribution = 9.3 bits.]
[Plot: average information per position of human suggestions – the human thinks harder and harder.]

Improving the ESP Game
– Score points according to −log2(p): the number of bits of information added to the system (see the sketch below)
– Have an activation time limit for “obvious” labels: removes the immediate satisfaction of simple matches
– Hide off-limits terms: players have to be more careful to avoid “obvious” labels
– Try to match “experts”: use previous tags or meta information
– Educate players: use previously labeled images to unlearn behavior
– Automatically expand the off-limits list: easy, but 10+ terms is not practical
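As a sketch of the first suggestion, a hypothetical scoring rule that rewards a matched label by the bits of information it adds under the prediction model (the point scale and cap are invented, not from the talk):

```python
import math

def score_for_match(predicted_prob, max_points=100):
    """Hypothetical scoring rule: reward a matched label by -log2 of the probability
    the prediction model assigned to it, so predictable labels earn few points.
    The 10-points-per-bit scale and the cap are invented."""
    bits = -math.log2(predicted_prob)
    return min(round(10 * bits), max_points)

print(score_for_match(0.5))    # an "obvious" label: 1 bit   -> 10 points
print(score_for_match(0.001))  # a surprising label: ~10 bits -> ~100 points
```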

Questions? Thank you!