Internet Enabled Human Computation CSE 454 Daniel Weld.


To do
Challenge - Mechanisms for deterring vandals
  Reputation
  Gold standard answers
  Randomized redundancy
Balloon challenge
More on Foldit
Game design, plateaus & levels
Aardvark & Quora

Crowdsourcing “a neologistic compound of Crowd and Outsourcing for the act of taking tasks traditionally performed by an employee or contractor, and outsourcing them to a group of people or community, through an "open call" to a large group of people (a crowd) asking for contributions” ---[Wikipedia]

Built in 1770 by Wolfgang von Kempelen

6/17/2015

Powerset

Your sentence is: The term silver dollar is often used for any large white metal coin issued by the United States with a face value of one dollar ; although purists insist that a dollar is not silver unless it contains some of that metal. Enter one term per box. $0.05

Fast & Cheap, but is it Good? [Snow et al. EMNLP-08]

How Cheap + Fast? In our experiment we ask for 10 annotations each of the full 30 word pairs, at an offered price of $0.02 for each set of 30 annotations (or, equivalently, at the rate of 1500 annotations per USD). The most surprising aspect of this study was the speed with which it was completed; the task of 300 annotations was completed by 10 annotators in less than 11 minutes … 1724 annotations / hour. [Snow et al. EMNLP-08]
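The figures quoted above can be checked with a few lines of arithmetic. This is a minimal sketch using only the numbers on the slide; the function names are illustrative and not taken from the paper.

```python
def annotations_per_usd(annotations_per_set: int, price_per_set_usd: float) -> float:
    """Annotations bought per dollar at the offered price."""
    return annotations_per_set / price_per_set_usd


def minutes_at_rate(total_annotations: int, annotations_per_hour: float) -> float:
    """Minutes needed to finish a task at a given hourly throughput."""
    return total_annotations / annotations_per_hour * 60.0


# 10 annotations for each of 30 word pairs.
total_annotations = 10 * 30

# $0.02 buys one set of 30 annotations.
rate_per_usd = annotations_per_usd(30, 0.02)

print("total annotations:", total_annotations)
print("annotations per USD:", round(rate_per_usd))
print("minutes at 1724 annotations/hour:", round(minutes_at_rate(total_annotations, 1724), 2))
```

At the quoted 1724 annotations per hour, the 300 annotations take between 10 and 11 minutes, which is consistent with "less than 11 minutes".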

Turker Demographics March, 2008 (Panos Ipeirotis)

Turker Demographics February, 2010 (Panos Ipeirotis)

Turker Demographics May, 2010 (Crowdflower)

Complex Jobs TurKit [Little et al. 09] Casting Words

TurKit [Little et al. 09]
Determine a fixed allowance
  Money spent in a problem
Each improvement iteration
  Ask two workers to vote
  A third is asked if the first two disagree
  Keep the artifact by majority vote
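The fixed workflow on this slide can be sketched as a short loop. This is an illustrative sketch, not TurKit's actual API: `improve` and `prefers_new` are hypothetical stand-ins for posting an improvement HIT and a voting HIT, and the costs (in cents) are made-up values.

```python
from typing import Callable


def iterative_improvement(
    initial: str,
    improve: Callable[[str], str],            # hypothetical: one improvement HIT
    prefers_new: Callable[[str, str], bool],  # hypothetical: one vote, True if new is better
    allowance_cents: int,
    improve_cost_cents: int,
    vote_cost_cents: int,
) -> str:
    """Spend a fixed allowance on improve-then-vote iterations."""
    best = initial
    budget = allowance_cents
    # Worst case per iteration: one improvement plus three votes.
    worst_case = improve_cost_cents + 3 * vote_cost_cents
    while budget >= worst_case:
        candidate = improve(best)
        budget -= improve_cost_cents

        # Ask two workers to vote.
        votes = [prefers_new(best, candidate), prefers_new(best, candidate)]
        budget -= 2 * vote_cost_cents

        # A third worker is asked only if the first two disagree.
        if votes[0] != votes[1]:
            votes.append(prefers_new(best, candidate))
            budget -= vote_cost_cents

        # Keep the artifact preferred by the majority.
        if sum(votes) * 2 > len(votes):
            best = candidate
    return best


# Simulated workers: each improvement appends "!", voters prefer longer text.
result = iterative_improvement(
    "draft",
    improve=lambda text: text + "!",
    prefers_new=lambda old, new: len(new) > len(old),
    allowance_cents=100,
    improve_cost_cents=10,
    vote_cost_cents=5,
)
print(result)
```

Note that the number of iterations depends only on the allowance and the per-HIT costs, never on how good the artifact already is.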

Iterative Improvement ?

Iterative Improvement Version 7 “A close-up photograph of the following items: A CASIO multi-function, solar-powered scientific calculator. A blue ball point pen with a blue rubber grip and the tip extended. British coins, two of £1 value, three of 20p value and one of 1p value. Seems to be a theme illustration for a brochure or document cover treating finance – probably personal finance.”

Limitation: Workflow is Fixed
Number of iterations is determined
  By the allowance
  Not by the quality of the answers or the workers
Number of votes / iter is almost fixed
  Not based on the difficulty of the job

TurKontrol [Dai AAAI10]
[Diagram: Problem, Learner, Model, Planner, HITs, Answers, Solution]
Input: a picture, an initial description
Output: a high quality description

TurKontrol Workflow
[Flowchart: Improvement needed? (Y/N); Generate improvement HIT; Generate ballot HIT; More voting needed? (Y/N)]
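The two decision points in this flowchart can be sketched as an adaptive loop. This is a simplified sketch, not the TurKontrol implementation: the decisions are passed in as hypothetical callables (`improvement_needed`, `more_voting_needed`) rather than computed by a planner, each round is assumed to collect at least one ballot, and `max_hits` is an added safety cap.

```python
from typing import Callable, List


def adaptive_workflow(
    artifact: str,
    improvement_needed: Callable[[str], bool],        # hypothetical decision rule
    improvement_hit: Callable[[str], str],            # hypothetical: one improvement HIT
    ballot_hit: Callable[[str, str], bool],           # hypothetical: True if new is better
    more_voting_needed: Callable[[List[bool]], bool], # hypothetical decision rule
    max_hits: int = 100,
) -> str:
    """Let the controller decide how many iterations and ballots to use."""
    hits = 0
    while hits < max_hits and improvement_needed(artifact):
        candidate = improvement_hit(artifact)
        hits += 1

        # Collect ballots until the controller says no more are needed.
        ballots: List[bool] = []
        while hits < max_hits:
            ballots.append(ballot_hit(artifact, candidate))
            hits += 1
            if not more_voting_needed(ballots):
                break

        # Keep the candidate only if a strict majority preferred it.
        if ballots and sum(ballots) * 2 > len(ballots):
            artifact = candidate
    return artifact


# Toy run: stop once the text has 8 characters, take 2 ballots per round.
final = adaptive_workflow(
    "abc",
    improvement_needed=lambda text: len(text) < 8,
    improvement_hit=lambda text: text + "+",
    ballot_hit=lambda old, new: True,
    more_voting_needed=lambda ballots: len(ballots) < 2,
)
print(final)
```

Unlike a fixed workflow, both the number of iterations and the number of ballots per iteration are decided at run time.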

Evaluation Measures
Quality measure
  Quality improvement probability (QIP)
  An artifact has QIP q = 1 - Pr(an average worker improves the artifact)
  Never exactly known
  Can be estimated by a random variable Q
Utility function U(q)

Control Problem is a POMDP

Comparison with Fixed Workflows Cost = (30,10) Allowance of TurKit = …

How to Motivate People to Help? Money

DARPA Network Challenge $40k 10 Moored Weather Balloons 10am ET Saturday 12/5/09

Winner MIT Red Balloon Challenge Team All 10 Balloons – 8:52 Also notable: Groundspeak Geocachers 7 Balloons – 6:02

Selected competitors

The MIT Media Lab team was the winning team, correctly identifying the locations of all 10 balloons in 8 hrs and 52 min. The MIT Media Lab team was organized within Professor Alex “Sandy” Pentland’s Human Dynamics Laboratory. The team designed and launched a recursive incentive recruiting method that reached almost 5,400 individuals in approximately 36 hours. The ingenuity of the recruiting method was that the incentive to join the effort was transferred undiminished with each subsequent layer of network nodes. MIT also enjoyed name recognition and mass media coverage (CNN Headline News) on execution day that helped them become one of the preferred sources to receive balloon reports. MIT collected extensive network structure data during the Challenge and plans several scientific studies of human dynamics and social networks using data from the DNC.

George Hotz
George Hotz learned about the Challenge the day before the balloon launch. He announced his personal effort and website in a Tweet an hour before the start of the DNC. Hotz has an existing Twitter network of almost 50,000 followers, due in no small part to his fame as a hacker (including the first untethering of the iPhone when he was 17 years old). With only an hour of preparation before the Challenge, Hotz was able to locate 8 balloons (4 from direct reports of his existing Twitter network, 4 through trades with other teams).

The Groundspeak team mobilized their extensive, pre-existing network of active geocachers using alerts one and two days prior to balloon launch. Groundspeak is the largest geocache coordinator with an estimated active network of premium users in the hundreds of thousands (plus several hundred thousand additional free content members). Groundspeak was able to use their member database to do very effective geographic targeting of reported balloon locations for verification.

Successful Tools
Marketing + media broadcast strategies to get team members
Recursive, incentivized recruiting of networks to build team
Extraction of reported locations from open Internet sources (e.g. Twitter)
Automated means of extracting data, e.g. Twitter crawler
Deployment of automatic reporting capability, e.g. iPhone apps
Dispatching team members as spotters to confirm
Website design that motivates, encourages recruitment, or allows easy, secure reporting
Search engine rank optimization of website

Recursive Incentivizing
The team designed and launched a recursive incentive recruiting method that reached almost 5,400 individuals in approximately 36 hours. The ingenuity of the recruiting method was that the incentive to join the effort was transferred undiminished with each subsequent layer of network nodes. MIT also enjoyed name recognition and mass media coverage (CNN Headline News) on execution day.
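One way to picture a recursive incentive is as a payout that flows up the chain of invitations from the person who finds a balloon. The slide does not state the payout rule, so everything numeric here is an assumption: the sketch uses a halving rule and a hypothetical finder reward of 2000, chosen only for illustration. With halving, the total paid per balloon stays below twice the finder reward no matter how long the chain is.

```python
from typing import Dict, List, Optional


def referral_chain(invited_by: Dict[str, Optional[str]], finder: str) -> List[str]:
    """Walk from the finder up through everyone who recruited them."""
    chain: List[str] = []
    seen = set()
    person: Optional[str] = finder
    while person is not None and person not in seen:
        chain.append(person)
        seen.add(person)  # guard against accidental cycles in the data
        person = invited_by.get(person)
    return chain


def recursive_payouts(
    invited_by: Dict[str, Optional[str]],
    finder: str,
    finder_reward: float = 2000.0,  # assumed value, not from the slides
    decay: float = 0.5,             # assumed halving rule, not from the slides
) -> Dict[str, float]:
    """Pay the finder, then a decayed share to each recruiter up the chain."""
    payouts: Dict[str, float] = {}
    reward = finder_reward
    for person in referral_chain(invited_by, finder):
        payouts[person] = reward
        reward *= decay
    return payouts


# Hypothetical recruitment tree: alice invited bob, bob invited carol.
invited_by = {"alice": None, "bob": "alice", "carol": "bob"}
payouts = recursive_payouts(invited_by, finder="carol")
print(payouts)
print("total paid:", sum(payouts.values()))
```

Because recruiters are paid when their recruits succeed, every participant has a reason to pass the invitation on.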

How to Motivate People to Help? Money, Altruism, Esteem, Self-Interest, Fun

Altruism Self-Esteem

Collaborative Geomapping
State Troopers Reaction to Trapster
Motivation & Vandalism Control
Other Applications
  North Korea Uncovered (Google Earth)
  DARPA Network Challenge

Self-Interest

Hybrid Models

StackOverflow

StackOverflow Optional Reputation
Answer voted up: +10
Question voted up: +5
Answer accepted: +15 (+2 to acceptor)
Post voted down: -2 (-1 to voter)
Max 30 votes / user / day
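The point values on this slide amount to a small lookup table. The sketch below totals a user's reputation from a list of events; the event names are invented for illustration, and the point values are the ones listed on the slide, which may differ from the site's current rules.

```python
from typing import Iterable

# Point values as listed on the slide; event names are illustrative.
REPUTATION_DELTAS = {
    "answer_voted_up": 10,
    "question_voted_up": 5,
    "answer_accepted": 15,
    "accepted_an_answer": 2,   # bonus to the user who accepts
    "post_voted_down": -2,
    "cast_down_vote": -1,      # cost to the user who votes down
}

MAX_VOTES_PER_DAY = 30


def reputation(events: Iterable[str], start: int = 0) -> int:
    """Sum the reputation changes for a sequence of events."""
    return start + sum(REPUTATION_DELTAS[event] for event in events)


def votes_remaining(votes_cast_today: int) -> int:
    """How many more votes a user may cast today under the daily cap."""
    return max(0, MAX_VOTES_PER_DAY - votes_cast_today)


history = ["answer_voted_up", "answer_accepted", "post_voted_down"]
print(reputation(history))
print(votes_remaining(28))
```

Charging the voter a point for each down vote makes casual or malicious down-voting costly.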

Reputation → Privileges
15: vote up
15: flag offensive
50: leave comments
100: edit community wiki posts
125: vote down (costs 1 rep)
500: retag questions
1000: create new tags
2000: edit other people’s posts
Etc…
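The threshold list can be expressed as data plus a one-line filter. This is a minimal sketch using only the thresholds shown on the slide (the "Etc…" entries are not filled in); the function name is illustrative.

```python
from typing import List, Tuple

# (minimum reputation, privilege) pairs as listed on the slide.
PRIVILEGE_THRESHOLDS: List[Tuple[int, str]] = [
    (15, "vote up"),
    (15, "flag offensive"),
    (50, "leave comments"),
    (100, "edit community wiki posts"),
    (125, "vote down"),
    (500, "retag questions"),
    (1000, "create new tags"),
    (2000, "edit other people's posts"),
]


def privileges(rep: int) -> List[str]:
    """Return every privilege unlocked at the given reputation."""
    return [name for threshold, name in PRIVILEGE_THRESHOLDS if rep >= threshold]


print(privileges(100))
```

Tying privileges to reputation means the abilities most open to abuse, such as editing other people's posts, go only to users with a long record of useful contributions.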

Motivating People: Money, Fun

IMAGE SEARCH ON THE WEB USES FILENAMES AND HTML TEXT Slides by Luis von Ahn

ACCESSIBILITY: LESS THAN 10% OF THE WEB IS ACCESSIBLE TO THE VISUALLY IMPAIRED. REASON: MOST IMAGES DON’T HAVE A CAPTION. Slides by Luis von Ahn

LABELING IMAGES WITH WORDS STILL A COMPLETELY OPEN PROBLEM FACE MAN SUPER SEXY Slides by Luis von Ahn

DESIDERATA A METHOD THAT CAN LABEL ALL IMAGES ON THE WEB FAST AND CHEAP Slides by Luis von Ahn

THE ESP GAME
TWO-PLAYER ONLINE GAME
PARTNERS DON’T KNOW EACH OTHER AND CAN’T COMMUNICATE
OBJECT OF THE GAME: TYPE THE SAME WORD
THE ONLY THING IN COMMON IS AN IMAGE
Slides by Luis von Ahn

THE ESP GAME
PLAYER 1: GUESSING: CAR, GUESSING: KID, GUESSING: HAT. SUCCESS! YOU AGREE ON CAR
PLAYER 2: GUESSING: BOY, GUESSING: CAR. SUCCESS! YOU AGREE ON CAR
Slides by Luis von Ahn
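The agreement rule in this example can be sketched as follows: a round succeeds as soon as one player types a word the other player has already typed. This is an illustrative sketch, not the game's actual code; the function name and the (player, word) input format are assumptions.

```python
from typing import Dict, Iterable, Optional, Set, Tuple


def first_agreement(guesses: Iterable[Tuple[int, str]]) -> Optional[str]:
    """Return the first word both players have typed, or None.

    `guesses` is a sequence of (player, word) pairs in the order typed,
    where player is 1 or 2.
    """
    seen: Dict[int, Set[str]] = {1: set(), 2: set()}
    for player, word in guesses:
        if player not in seen:
            raise ValueError("player must be 1 or 2")
        word = word.strip().lower()
        other = 2 if player == 1 else 1
        # Agreement: the partner already typed this word.
        if word in seen[other]:
            return word
        seen[player].add(word)
    return None


# The guesses from the slide, in one possible typing order.
round_guesses = [(1, "CAR"), (2, "BOY"), (1, "KID"), (1, "HAT"), (2, "CAR")]
label = first_agreement(round_guesses)
print(label)
```

Since the partners cannot communicate, the only reliable way to match is to type something that describes the image, which is why an agreed word is a plausible label.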

© 2004 Carnegie Mellon University, all rights reserved. Patent Pending. Slides by Luis von Ahn

THE ESP GAME IS FUN
3.2 MILLION LABELS WITH 22,000 PLAYERS
MANY PEOPLE PLAY OVER 20 HOURS A WEEK
Slides by Luis von Ahn

LABELING THE ENTIRE WEB
INDIVIDUAL GAMES IN YAHOO! AND MSN AVERAGE OVER 10,000 PLAYERS AT A TIME
5000 PEOPLE PLAYING SIMULTANEOUSLY CAN LABEL ALL IMAGES ON GOOGLE IN 30 DAYS!
Slides by Luis von Ahn

9 BILLION MAN-HOURS OF SOLITAIRE WERE PLAYED IN 2003
EMPIRE STATE BUILDING: 7 MILLION MAN-HOURS (6.8 HOURS OF SOLITAIRE)
PANAMA CANAL: 20 MILLION MAN-HOURS (LESS THAN A DAY OF SOLITAIRE)
Slides by Luis von Ahn
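The comparisons on this slide follow from spreading the yearly solitaire total evenly over the hours in a year. This is a minimal sketch of that arithmetic, using only the slide's figures; the even-spread assumption and the function name are mine.

```python
HOURS_PER_YEAR = 365 * 24


def solitaire_hours_equivalent(
    project_man_hours: float,
    solitaire_man_hours_per_year: float = 9e9,
) -> float:
    """Clock hours of worldwide solitaire play that equal a project's effort."""
    man_hours_per_clock_hour = solitaire_man_hours_per_year / HOURS_PER_YEAR
    return project_man_hours / man_hours_per_clock_hour


empire_state = solitaire_hours_equivalent(7e6)
panama_canal = solitaire_hours_equivalent(20e6)
print("Empire State Building:", round(empire_state, 1), "hours of solitaire")
print("Panama Canal:", round(panama_canal, 1), "hours of solitaire")
```

Both results match the slide: roughly 6.8 hours for the Empire State Building and under 24 hours for the Panama Canal.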

GWAP Problem?

PhotoCity Reconstructing the World in 3D Bringing Games with a Purpose Indoors

PhotoCity Gameplay

30 Photo Seed with Holes

Mobile App

Hybrid Models Revisited Effect of Pay on Job Completion

Hybrid Models Revisited

Hybrids What else could you add to an MT task? Leaderboards, Raffles, ????

Motivation: Money, Altruism, Esteem, Self-Interest, Fun