Human Computation and Crowdsourcing: Survey and Taxonomy Uichin Lee Sept. 7, 2011.



Human Computation: A Survey and Taxonomy of a Growing Field Alexander J. Quinn, Benjamin B. Bederson CHI 2011

Human Computation
Computer scientists (in the artificial intelligence field) have been trying to emulate human-like abilities, e.g., language, visual processing, and reasoning
Alan Turing wrote in 1950: “The idea behind digital computers may be explained by saying that these machines are intended to carry out any operations which could be done by a human computer.”
L. von Ahn wrote a doctoral thesis on human computation in 2005
The field is now thriving: business, art, R&D, HCI, databases, artificial intelligence, etc.

Definition of Human Computation
The term dates back to 1938 in the philosophy and psychology literature, and to 1960 in the computer science literature (by Turing)
Modern usage was inspired by von Ahn’s 2005 dissertation, titled “Human Computation”, and the work leading to it
– “…a paradigm for utilizing human processing power to solve problems that computers cannot yet solve.”

Definition of Human Computation
“…the idea of using human effort to perform tasks that computers cannot yet perform, usually in an enjoyable manner.” (Law, von Ahn, 2009)
“…a new research area that studies the process of channeling the vast internet population to perform tasks or provide data towards solving difficult problems that no known efficient computer algorithms can yet solve” (Chandrasekar, et al., 2010)
“…a technique that makes use of human abilities for computation to solve problems.” (Yuen, Chen, King, 2009)
“…a technique to let humans solve tasks, which cannot be solved by computers.” (Schall, Truong, Dustdar, 2008)
“A computational process that involves humans in certain steps…” (Yang, et al., 2008)
“…systems of computers and large numbers of humans that work together in order to solve problems that could not be solved by either computers or humans alone” (Quinn, Bederson, 2009)
“…a new area of research that studies how to build systems, such as simple casual games, to collect annotations from human users.” (Law, et al., 2009)

Related Ideas Crowdsourcing Social computing Data mining Collective intelligence

Crowdsourcing
“Crowdsourcing is the act of taking a job traditionally performed by a designated agent (usually an employee) and outsourcing it to an undefined, generally large group of people in the form of an open call.” (Jeff Howe)
Human computation replaces computers with humans, whereas crowdsourcing replaces traditional human workers with members of the public
– HC: replacement of computers with humans
– CS: replacement of insourced workers with crowdsourced workers
Some crowdsourcing tasks can be considered human computation tasks
– Hiring crowdsourced workers for translation jobs
– Machine translation (fast, but low quality) vs. human translation (slow, high quality)

Social Computing
Wikipedia:
– “… supporting any sort of social behavior in or through computational systems” (e.g., blogs, email, IM, SNS, wikis, social bookmarking)
– “… supporting computations that are carried out by groups of people” (e.g., collaborative filtering, online auctions, prediction markets, reputation systems)
Some other definitions:
– “… applications and services that facilitate collective action and social interaction online with rich exchange of multimedia information and evolution of aggregate knowledge…” (Parameswaran, Whinston, 2007)
– “… the interplay between persons' social behaviors and their interactions with computing technologies” (Dryer, Eisbach, Ark, 1999)

Data Mining
“Data mining is defined broadly as the application of specific algorithms for extracting patterns from data.” (Fayyad, Piatetsky-Shapiro, Smyth, 1996)
While data mining deals with human-created data, it does not involve human computation
– Google PageRank “only” uses human-created data (links)
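To illustrate the point that PageRank only consumes human-created links and asks no human to do any work, here is a minimal sketch of the textbook power-iteration form of PageRank. It is a simplification for illustration, not Google's production algorithm; the function name `pagerank`, the damping value, and the tiny link graph are all assumptions made for this example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook power-iteration PageRank over a dict: page -> list of linked pages.

    Illustrative sketch only; the parameters and graph format are assumptions.
    """
    # Collect every page that appears as a source or as a link target.
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    if n == 0:
        return {}
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page receives the same baseline share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its damped rank evenly among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its rank over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Hypothetical link graph: the links are the only "human" input.
example_links = {"a": ["c"], "b": ["c"], "c": ["a"]}
example_ranks = pagerank(example_links)
```

In this toy graph, page `c` is linked to by both other pages and ends up with the highest score, while `b`, which nobody links to, ends up with the lowest.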

Collective Intelligence
Overarching notion: large groups of loosely organized people can accomplish great things working together
– Traditional study focused on “decision making capabilities by a large group of people”
Taxonomical “genome” of collective intelligence
– “… groups of individuals doing things collectively that seem intelligent” (Malone, 2009)
Collective intelligence generally encompasses human computation and social computing

Relationship Diagram
[Figure: diagram relating Collective Intelligence, Data Mining, Crowdsourcing, Social Computing, and Human Computation]

Classifying Human Computation
Motivation
– What motivates people to perform HC?
Human skill
– What kinds of human skills do HC tasks require?
Aggregation
– How are the results of HC tasks combined?
Quality control
– How is the quality of HC task results controlled?
Processing order of different roles
– Roles (requester, worker, computer)
Task-request cardinality
– Requester vs. worker cardinality

Motivation: Examples
Pay (financial rewards): Mechanical Turk (online labor marketplace), ChaCha (mobile Q&A), LiveOps (a distributed call center)
Altruism (just helping other people for good): helpfindjim.com (Jim Gray), Naver KiN, Yahoo! Answers
Enjoyment (fun): Games With A Purpose (GWAP), e.g., ESP Game, Tag-a-Tune
Reputation (recognition): Volunteer translators at childrenslibrary.org, Naver KiN, Yahoo! Answers
Implicit work: reCAPTCHA

Quality Control: Examples
Output agreement: ESP Game (a game for labeling images); an answer is accepted if the pair of players agree on the same answer
Input agreement: Tag-a-Tune; two players each listen to an input (music), describe it, and try to decide whether they are listening to the same music or different music
Economic models: When money is a motivating factor, some economic models can be used to elicit quality answers (e.g., a game-theoretic model of the worker’s rating to reduce the incentive to cheat)
Defensive task design: Design tasks so that it is difficult to cheat (e.g., comprehension questions)
Redundancy: Each task is given to multiple people to separate the wheat from the chaff
Statistical filtering: Filter or aggregate the data in some way that removes the effects of irrelevant work
Multilevel review: One set of workers does the work; a second set reviews the results and rates the quality (e.g., Soylent: Find-Fix-Verify)
Automatic check: fold.it (protein folding game); answers are easy to check by computer but hard to find
Reputation system: Workers are motivated to provide quality answers by a reputation scoring system; Mechanical Turk, Naver KiN, etc.
Expert check: A trusted expert skims or cross-checks results for relevance and apparent accuracy
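Two of the quality-control mechanisms listed here, redundancy and output agreement, are simple enough to sketch in a few lines. The following is an illustrative sketch under stated assumptions, not how any of the named systems is actually implemented: the function names `majority_vote` and `output_agreement`, the agreement threshold, and the sample answers are all hypothetical.

```python
from collections import Counter


def majority_vote(answers, min_agreement=0.5):
    """Redundancy: give one task to several workers and keep the most common
    answer, but only if its share of the votes exceeds min_agreement.

    Returns None when there are no answers or no answer is dominant enough.
    The threshold is an assumption chosen for this sketch.
    """
    if not answers:
        return None
    top_answer, top_count = Counter(answers).most_common(1)[0]
    if top_count / len(answers) > min_agreement:
        return top_answer
    return None


def output_agreement(labels_a, labels_b):
    """Output agreement: accept only the labels that both players
    entered independently for the same input."""
    return set(labels_a) & set(labels_b)


# Hypothetical worker answers for a single image-labeling task.
votes = ["cat", "cat", "dog", "cat", "bird"]
accepted = majority_vote(votes)

# Hypothetical labels typed by two players looking at the same image.
agreed = output_agreement(["dog", "park", "grass"], ["park", "tree"])
```

Requiring a strict majority means that an evenly split task yields no answer at all, which a requester could then reissue to more workers; that is one possible design choice among many.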

Aggregation: Examples
Collection (to build a knowledge base): Artificial intelligence research; to build a large DB of common-sense facts (e.g., people can’t brush their hair with a table). Examples: ESP Game, reCAPTCHA, FACTory, Verbosity, etc.
Wisdom of crowds (statistical processing of data): The average guess of ordinary people can be very close to the actual outcome; e.g., Ask500people, News Futures, Iowa Electronic Markets
Search: A large number of volunteers sift through photos or videos, searching for some desired scientific phenomenon, person, or object, e.g., helpfindjim.com, project
Iterative improvement: Giving the answers of a previous worker to the next worker to elicit better answers, e.g., MonoTrans
Active learning: Classifier training; selects the samples that could potentially give the best training benefit and sends them for manual annotation
Genetic algorithm (search/optimization): Free Knowledge Exchange, PicBreeder
None (if independent tasks are performed): VizWiz (a mobile app that lets a blind user take a photo and ask a question)
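The "wisdom of crowds" row describes statistical aggregation of many independent guesses. A minimal sketch of that idea follows; the function name `aggregate_guesses` and the guesses themselves are made up for illustration, and the median is included alongside the mean only as one common way to limit the effect of outliers, not as something the listed systems are known to use.

```python
from statistics import mean, median


def aggregate_guesses(guesses):
    """Combine independent numeric guesses into crowd-level estimates."""
    return {"mean": mean(guesses), "median": median(guesses)}


# Hypothetical guesses for one quantity; one worker is far off.
guesses = [950, 1020, 1100, 980, 5000]
estimate = aggregate_guesses(guesses)
```

With these made-up numbers the single outlier pulls the mean well above most individual guesses, while the median stays near the bulk of the answers.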

Human Skills, Processing Order, Task-Request Cardinality

Human Skills: Examples
Visual recognition: ESP Game
Language understanding: Soylent
Basic human communication: ChaCha

Processing Order: Examples
Computer → Worker (>> Requester): reCAPTCHA
Worker (player) → Requester → Computer (aggregation): ESP Game (image labeling)
Computer → Worker → Requester → Computer: Cyc infers a large number of common-sense facts; FACTory is a GWAP in which workers (players) solve problems; Cyc performs aggregation
Requester → Worker: Mechanical Turk

Task-Request Cardinality: Examples
One-to-one (one worker to one task): ChaCha
Many-to-many (many workers to many tasks): ESP Game
Many-to-one (many workers to one task): helpfindjim.com (Jim Gray)
Few-to-one (few workers to one task): VizWiz
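One way to make the six classification dimensions concrete is to treat each system as a record with one field per dimension. The sketch below does that for a few of the systems named in these slides, filling in only the values the slides state and leaving the rest as `None`. The class name `HCSystem`, the helper `filter_systems`, and the lowercase string encoding are assumptions made for this example; the processing order is stored as a tuple of roles in the sequence the slide lists them.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class HCSystem:
    """One human computation system, described along the six dimensions.

    A field is None when the slides give no value for that system.
    """
    name: str
    motivation: Optional[str] = None
    human_skill: Optional[str] = None
    aggregation: Optional[str] = None
    quality_control: Optional[str] = None
    processing_order: Optional[Tuple[str, ...]] = None
    cardinality: Optional[str] = None


SYSTEMS = [
    HCSystem("ESP Game", motivation="enjoyment", human_skill="visual recognition",
             aggregation="collection", quality_control="output agreement",
             processing_order=("worker", "requester", "computer"),
             cardinality="many-to-many"),
    HCSystem("reCAPTCHA", motivation="implicit work", aggregation="collection",
             processing_order=("computer", "worker")),
    HCSystem("ChaCha", motivation="pay", human_skill="basic human communication",
             cardinality="one-to-one"),
    HCSystem("Mechanical Turk", motivation="pay", quality_control="reputation system",
             processing_order=("requester", "worker")),
    HCSystem("VizWiz", aggregation="none", cardinality="few-to-one"),
]


def filter_systems(systems, **criteria):
    """Return the names of systems whose fields match every given criterion."""
    return [s.name for s in systems
            if all(getattr(s, field) == value for field, value in criteria.items())]


paid = filter_systems(SYSTEMS, motivation="pay")
collectors = filter_systems(SYSTEMS, aggregation="collection")
```

Querying by one dimension at a time, as in the last two lines, shows how systems that look unrelated (a game and a CAPTCHA service) can land in the same cell of the taxonomy.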

Crowdsourcing Systems on the World-Wide Web Anhai Doan, Raghu Ramakrishnan, Alon Y. Halevy Communications of the ACM Vol. 54 No. 4, Pages

Crowdsourcing Systems (CS)
Defining crowdsourcing systems is tricky
– Explicit vs. implicit collaboration to solve something?
A CS system “enlists a crowd to help solve a problem defined by the system owners”
Such a system must address the following issues:
– How to recruit and retain users?
– What contributions can users make?
– How to combine user contributions to solve the target problem?
– How to evaluate users and their contributions?

Explicit Crowdsourcing Systems

Implicit Crowdsourcing Systems

Summary
Definition of human computation and crowdsourcing
Relationship with other related issues
Classifying human computation and crowdsourcing systems
– Motivation, human skill, aggregation, quality control, processing order, task-request cardinality
– Nature of collaboration, architecture, recruitment, human skill