Human Computation and Crowdsourcing Uichin Lee May 8, 2011.

Slides:



Advertisements
Similar presentations
Modelling with expert systems. Expert systems Modelling with expert systems Coaching modelling with expert systems Advantages and limitations of modelling.
Advertisements

Xchange-Park: A Crowd-sourcing based parking reservation system Aakash Therani Ankit Jasuja Manish Shah
DECISION THEORIES 1 Problem solving –Collaboration, GAME THEORY –Asymmetric information, AGENCY THEORY –Optimization, OPERATIONAL RESEARCH 2 Problem finding.
Databases vs the Internet Coconino Community College Revised August 2010.
Human Computation and Crowdsourcing: Survey and Taxonomy Uichin Lee Sept. 7, 2011.
Applying Crowd Sourcing and Workflow in Social Conflict Detection By: Reshmi De, Bhargabi Chakrabarti 28/03/13.
Estimating the Completion Time of Crowdsourced Tasks using Survival Analysis Jing Wang, New York University Siamak Faridani, University of California,
Introduction to Mechanized Labor Marketplaces: Mechanical Turk Uichin Lee KAIST KSE.
Presenter: Chien-Ju Ho  Introduction to Amazon Mechanical Turk  Applications  Demographics and statistics  The value of using MTurk Repeated.
Collective Intelligence Dr. Frank McCown Intro to Web Science Harding University This work is licensed under a Creative Commons Attribution-NonCommercial-
When to ask for help: Evaluating Projects for Crowdsourcing Peter Organisciak University of Illinois at Urbana-Champaign.
Web 2.0 and Collective Intelligence Mark Levene (Follow the links to learn more!)
Crowdsourcing research data UMBC ebiquity,
MP3 / MD740 Strategy & Information Systems Oct. 13, 2004 Databases & the Data Asset, Types of Information Systems, Artificial Intelligence.
Oozing out knowledge in human brains to the Internet Lada Adamic School of Information University of Michigan
Intelligent Agent Systems. Artificial Intelligence Systems that think like humans Systems that think rationally Systems that act like humans Systems that.
Collaborative Filtering Shaun Kaasten CPSC CSCW.
New Technologies Are Surfacing Everyday. l Some will have a dramatic affect on the business environment. l Others will totally change the way you live.
McGraw-Hill/Irwin ©2005 The McGraw-Hill Companies, All rights reserved ©2005 The McGraw-Hill Companies, All rights reserved McGraw-Hill/Irwin.
SEARCHING ON THE INTERNET
The Market Research Process
The Internet vs. The Online Database What’s the difference?
Presenters: Title:. CONTENTS What is Crowdsourcing? How Crowdsourcing works? Types of Crowdsourcing Applications of Crowdsourcing Benefits & Problems.
What kind of media institution might distribute your media product and why? By Jess Knight Question 3.
Social Media Marketing & Management Mrs. Piotrowski 1.
The Market Research Process
MANAGEMENT OF MARKETING
2 Read to Learn How the workplace is affected by forces such as changing technology and the global economy How to evaluate job outlooks when making career.
Crowdsourcing. What is it? Image credit: Alexander Kesselaar.
Alexander Consulting Enterprise 9/7/2015 Marketing Research in Global Markets Why Bother?
Introduction to Mechanized Labor Marketplaces: Mechanical Turk Uichin Lee KAIST KSE.
Uichin Lee, Jihyoung Kim *, Eunhee Yi **, Juyup Sung, Mario Gerla * KAIST Knowledge Service Engineering * UCLA Computer Science ** LG UX R&D Lab
DATA-CENTERED CROWDSOURCING WORKSHOP PROF. TOVA MILO SLAVA NOVGORODOV TEL AVIV UNIVERSITY 2014/2015.
Aardvark Anatomy of a Large-Scale Social Search Engine.
Gathering Information and Scanning the Environment Marketing Management, 13 th ed 3.
BLOG. WHAT IS A BLOG ? We have a lot of definition of blog.. A blog is a personal diary. A daily pulpit. A collaborative space. A political soapbox. A.
11 C H A P T E R Artificial Intelligence and Expert Systems.
What is AI:-  Ai is the science of making machine do things that would requires intelligence.  Computers with the ability to mimic or duplicate the.
INTRODUCTION TO RESEARCH. Learning to become a researcher By the time you get to college, you will be expected to advance from: Information retrieval–
Systematization of Crowdsoucing for Data Annotation Aobo, Feb
Overview Crowd-Powered Systems Alexander J. Quinn January 15, 2015.
KSE631: Content Networking Uichin Lee KAIST KSE March 5, 2013.
Internet Skills The World Wide Web (Web) consists of billions of interconnected pages of information from a wide variety of sources. In this section: Web.
KSE631: Content Networking Uichin Lee KAIST KSE Feb. 07, 2012.
Crowdsourcing Color Perceptions using Mobile Devices Jaejeung Kim 1, Sergey Leksikov 1, Punyotai Thamjamrassri 2, Uichin Lee 1, Hyeon-Jeong Suk 2 1 Dept.
Crowdsourcing: Ethics, Collaboration, Creativity KSE 801 Uichin Lee.
Fundamentals of Information Systems, Third Edition2 Principles and Learning Objectives Artificial intelligence systems form a broad and diverse set of.
Chapter 1 Basic Economic Concepts. Vocabulary Wants Needs Goods Services Resources Business Profit Competition Market Research Consumer Determine Identify.
Website Evaluation Workshop Ms. Janek, MLIS International Academy 2012.
KSE631: Content Networking Uichin Lee Feb. 07, 2011.
Computing Fundamentals Module Lesson 6 — Using Technology to Solve Problems Computer Literacy BASICS.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
EXAM PREP WORKSHOP MANAGEMENT INFORMATION SYSTEMS.
Chapter 9: Tapping the Crowd for Fast Innovation ISTO SIPILÄ.
Databases vs the Internet. QUESTION: What is the main difference between using library databases and search engines? ANSWER: Databases are NOT the Internet.
Artificial Intelligence, simulation and modelling.
Christoph F. Eick: Thoughts on the Rook Project Challenges of Playing Bridge Well 
Chapter 8 The Social Enterprise: From Recruiting to Problem Solving and Collaboration.
Conducting Marketing Research Chapter 29. Sec – The Marketing Research Process The steps in conducting marketing research The difference between.
Chapter 29 Conducting Market Research. Objectives  Explain the steps in designing and conducting market research  Compare primary and secondary data.
Big Data: Every Word Managing Data Data Mining TerminologyData Collection CrowdsourcingSecurity & Validation Universal Translation Monolingual Dictionaries.
C ROWD SOURCING F ARHEEN M OSHARRAF. W HAT IS C ROWD SOURCING ? Crowd sourcing is the process of obtaining needed services, ideas, or content by soliciting.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Data-Centered Crowdsourcing Workshop
Crowdsourcing: A New Work Style
Intelligent Agent Solution
Future Technologies FTC 2016 Future Technologies Conference December 2016 San Francisco, United States.
Overview The promotion of products or brands via Digital media Digital Media  Search Engine Marketing Search Engine Marketing  Social Media Marketing.
Introduction to Artificial Intelligence Instructor: Dr. Eduardo Urbina
Presentation transcript:

Human Computation and Crowdsourcing Uichin Lee May 8, 2011

Content Networking Human intelligence: – Distributed human computation, crowdsourcing Mobile device intelligence: – Sensing (camera, GPS) Network intelligence: – Internet, mobile networks (w/ advanced services) Application intelligence: – Agent, processing, mining Internet Smart home/office On the move Applications Content provider Fixed access Content Networking Radio access Device Network Application Human Crowd

Contents Overview Genres of distributed human computation – Games with a purpose, mechanized labor, wisdom of crowds, crowdsourcing, dual-purpose work, grand search, human-based genetic algorithms, knowledge collection from volunteer contributors Dimensions – Motivation, quality, human skill, participation time, cognitive load Analyzing Amazon Mechanical Turk Marketplace

Overview Distributed human computation (DHC) aims at solving rich computation problems through collaboration between humans and computers – Particularly, in some problem domains where humans could be much better than machines – Examples: artificial intelligence, natural language processing, and computer visions Well artificial intelligence has been trying hard to solve these problems using machines – But its quality may be not satisfactory.. DHC offers the possibility of combining humans and computers: faster than individual human efforts, and quality is as good as human efforts (or even better) Taxonomy of Distributed Human Computation Alexander J. Quinn, Benjamin B. Bederson, 2009

Overview How? The system has global knowledge of the problem and forms small sub-problems that take advantage of humans’ special abilities – Delegating sub-problems to a large number of people connected via Internet (could be geographically dispersed) Examples: – Searching for a person in a large number of satellite photos covering thousands of square miles of ocean (e.g., Jim Gray) – Image labeling (e.g., ESP game) – Human-computer interaction, cryptograph, business, genetic algorithms, etc. (and many others!)

DHC Genres Games with a purpose Mechanized labor Wisdom of crowds Crowdsourcing Dual-purpose work Grand search Human-based genetic algorithms Knowledge collection from volunteer contributors People sensing

Games with a purpose Game that requires the player to perform some computation to gain points or to succeed Defining factor: people are motivated by the fun of a game

Mechanized labor Crowdsourcing with monetary rewards Amazon’s Mechanical Turk, ChaCha (paid per micro task) – Cf) Mturk was lunched in 2005 by the needs of Amazon; they wanted to eliminate all the duplicate pages as much as possible which couldn’t be done using automated algorithms

Wisdom of crowds Crowd intelligence: very difficult when done individually, but very easy when aggregated (asking opinions of crowds) Example services: online polling, prediction markets

Crowdsourcing Coined by Jeff Howe in a Wired magazine article – Displacement of usual internal labor by soliciting unpaid help from the general public – Motivated by curiosity or serendipity while browsing the web (e.g., online product reviews) Examples: – Question answering services: Naver KiN, Yahoo Answer, Askville, Aardvark – (finding elusive particles from space images)

Dual-purpose work Translating a computation into an activity that many people were already doing frequently

Grand search Finding a solution (instead of aggregation) Examples: finding an image that contains something (e.g., search for a missing person, or for elusive particles as in

Human-based genetic algorithms Humans contribute solutions to problems and subsequent participants by performing functions such as initialization, mutation, and recombinant crossover Defining factor is that solutions consists of a sequence of small parts and that they evolve in a way that is controlled by human evaluation

Knowledge collection from volunteer contributors Aims to advance artificial intelligence research by using humans to build large databases of common sense facts – E.g., “people cannot brush their hair with a table” Common methods have been using data mining, e.g., Cyc Human-based methods could help, e.g., FACTory, Verbosity, 1001 Paraphrases, etc.

People sensing Community awareness (participatory sensing) Emergency/rescue operations Safecast.org seeks to aggregate worldwide sensor information Geiger counter; 방사능측정기 Pictures from

Dimensions Motivation – Pay (e.g., Mturk), altruism (e.g., Naver KiN, Wikipedia), fun (e.g., games), implicit (e.g., embedded in regular activities) Quality – Mechanisms: forced agreement (e.g., games), economic models (when money is involved), defensive task design, redundancy – Checking: statistical, redundant work, multilevel review, expert review, forced agreement, automatic check, reputation systems Aggregation – Knowledge base, statistical, grand search, unit tasks (ChaCha, Mturk) Human skill – Language understanding, vision, communications, reasoning, common knowledge/sense Participation time: 10min Cognitive load (affecting contributor’s willingness to help)

Analyzing the Amazon Mechanical Turk Marketplace Panagiotis G. Ipeirotis (NYU)

AMT Screenshot

Screenshot

AMT questions Who are the workers that complete these tasks? What type of tasks can be completed in the marketplace? How much does it cost? How fast can I get results back? How big is the AMT market place?

Demographics Countries: 46.80% US, India: 34%, Misc: 19.2% (from 66 different countries) 1

Demographics Why do you complete tasks in Mechanical Turk? Please check any of the following that applies: – [1] Fruitful way to spend free time and get some cash (e.g., instead of watching TV) – [2] I find the tasks to be fun – [3] To kill time – [4] For "primary" income purposes (e.g., gas, bills, groceries, credit cards) – [5] For "secondary" income purposes, pocket change (for hobbies, gadgets, going out) – [6] I am currently unemployed, or have only a part time job 123

Demographics Why do you complete tasks in Mechanical Turk? Please check any of the following that applies: – [1] Fruitful way to spend free time and get some cash (e.g., instead of watching TV) – [2] I find the tasks to be fun – [3] To kill time – [4] For "primary" income purposes (e.g., gas, bills, groceries, credit cards) – [5] For "secondary" income purposes, pocket change (for hobbies, gadgets, going out) – [6] I am currently unemployed, or have only a part time job 456

Type of tasks

Requester distribution

Price distribution

Keywords vs. Ranks

Posting vs. completion rate

Completion time