Data-Centered Crowdsourcing Workshop

Slides:



Advertisements
Similar presentations
Working Smarter: Using Smart Phones & Mobile Work Dino Martinez Student Involvement & Leadership Development.
Advertisements

Whiteboard Content Sharing Audio Video PollsRecordingMeet Now Skype Integration MS Lync 2013 Tools & Tips for facilitators… Limitations Alternatives One.
Search for personal information using Yahoo BOSS by Evgeny Dosychev Dmitry Kichin Supervisor: Eddie Bortnikov.
4.5 Multimedia Production
© 2007 IBM Corporation IBM Emerging Technologies Enabling an Accessible Web 2.0 Becky Gibson Web Accessibility Architect.
Search Engines and Information Retrieval
Crowdsourcing research data UMBC ebiquity,
Academic Advisor: Prof. Ronen Brafman Team Members: Ran Isenberg Mirit Markovich Noa Aharon Alon Furman.
Development of mobile applications using PhoneGap and HTML 5
SOCIAL NETWORKING APP FACEBOOK. WHAT IS FACEBOOK Facebook was created in 2004 by Mark Zuckerburg and was first used on computers. It was one of the first.
Project Proposal: Academic Job Market and Application Tracker Website Project designed by: Cengiz Gunay Client: Cengiz Gunay Audience: PhD candidates and.
TO ENABLE THE CROSS-BORDER EXCHANGE OF CULTURE BY PROVIDING AN INNOVATIVE, MULTILINGUAL IT PLATFORM, BASED ON AVAILABLE OPEN SOURCE SOCIAL PLATFORM SOLUTIONS.
Approximated Provenance for Complex Applications
Utilizing Social Media & Multimedia Communications.
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
DATA-CENTERED CROWDSOURCING WORKSHOP PROF. TOVA MILO SLAVA NOVGORODOV TEL AVIV UNIVERSITY 2014/2015.
Web 2.0 meet Semantic Web at Yahoo! Dave Beckett Yahoo! Media Group November 8, 2006.
Why I LIKE the Facebook Database… Sharon Viente May 2010.
Search - on the Web and Locally Related directly to Web Search Engines: Part 1 and Part 2. IEEE Computer. June & August 2006.
Data, Docs, Maps and … Text: The functional side(s) of new media Andrew Chavez Associate Director / Digital Initiatives Texas Center for Community Journalism.
Proposal for Term Project J. H. Wang Mar. 2, 2015.
Big6 Overview Big6™ Trainers Program McDowell County Schools.
Live Demo Augmented reality – lets see some pictures flying…Augmented reality – lets see some pictures flying… Facebook -Facebook -
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
SEO Who knew 3 letters could mean so much?. What is SEO? Search Engine Optimization (SEO) is the practice of improving and promoting a web site in order.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Presentation e-Learning Basics Author: Mary Frentzou )
Web Fundamentals (HTML and CSS) Course Introduction Svetlin Nakov Technical Trainer Software University
Web Fundamentals (HTML and CSS)
Capstone Project Fall Course Information Instructor Ye Zhao –Office: MSB 220 – Fall 2015 (MSB162) –Time: Tue, Thu 10:45am.
Tshilidzi Tshiredo. Introduction Long time ago even before technologies, social networking platforms and mobile devices, Dewey, J.( ) stated that.
Social Information Processing March 26-28, 2008 AAAI Spring Symposium Stanford University
E-Hoop Learning Platform Functionalities Prof. Michalis Xenos Dr. Lefteris Kozanidis Eleni Chatzidaki, MSc Hellenic Open University (HOU) “Unified approach.
Getting Started Telligent or SharePoint (or Hybrid)?
NEALLT 2016 Motivating Students with Media, Games, and Style Gettysburg College March Luba Iskold Muhlenberg College.
Presentation by Giorgos Theodoridis. WordPress is a free web software you can use to create a beautiful website, blog, or app, (CMS) based on PHP and.
Genie Pal A Versatile Intelligent Assistant To Help Both Work And Personal life.
How To Market Disaster Restoration Services in The Internet Era
Search Engine Optimization
Information Retrieval in Practice
Prediction Games Players compete by making predictions about upcoming event/observation in the real world Predictions are scored after event At TAMU, we.
SEARCH ENGINE OPTIMIZATION.
Research in Collective Intelligence through Horse Racing in Hong Kong
IS1500: Introduction to Web Development
CONNECT.
Computer Software Digital Literacy.
Information Retrieval (in Practice)
INTERNET IN EDUCATION UNIT- 5
globallab Where curiosity rules!
Proposal for Term Project
Digital Media plan EQ Solution.
Web Fundamentals (HTML and CSS)
GeneXus 9.0: Web applications at their higher power
Computer Software Digital Literacy.
Teaching a Workshop Kent Schroeder SIL AFA.
How to use By Zainab Muman
Web 2.0.
Technology Woes?.
Leveraging SNAP-Ed Connection for Policy and Programs
The Top 10 Reasons Why Federated Can’t Succeed
H5P: Using an Interactive Assessment Tool in Moodle
American Library Association Online Resource Center
Louisiana: Our History.
Derek Reinelt Team Manager Angela Lam Design Jake Sanders Development
Metrics Stats n’ Stuff.
Ben Jones - S Rebecca Hunter - S
Digital Marketing Starter Course
Producing Web Course Material with IBM Knowledge Factory Team
Slides prepared by Sarah Benis Scheier-Dolberg
Perspective on Web Game Tech from the Instant Games team
Presentation transcript:

Data-Centered Crowdsourcing Workshop Prof. Tova Milo Slava Novgorodov Tel Aviv University 2016/2017

Projects The team size is around 4 students and should be decided by the next week The projects should be done during the semester There is one “touch-base” meeting in a month and submission at the last week of the semester (demo day) The topic and the technology is up to you (we teach web programming using PHP, JavaScript and HTML and provide an SDK) The project should be interesting and solve real problem in the world of crowdsouring

Projects In general there are two possible options for the projects: Application (Web or Mobile) – an app that solves concrete problem using crowdsourcing (for example: Application for text translation) Infrastructure – provides an option for application to integrate with it, and solves more general crowdsourcing problem (for example: Platform for matching users to questions/tasks) In both cases, students should prepare an end-to-end demonstration of their projects

Crowdsourcing application Possible options for applications projects: Tagging images (or in general: Adding metadata to multimedia content) using crowd Text translation, adaptation, shortening, improvement (TL;DR) Answering questions/queries using crowd Planning and scheduling with the crowd Geo-spartial applications based on crowd (use crowd locations to improve your data) Building knowledge base (incl. cleaning) …

Crowdsourcing INFRASTRUCTURE Possible options for infrastructure projects: Crowdsourcing platform design and implementation (AMTurk, CrowdFlower, …) Matching crowd users to questions/tasks (based on their preferences/test questions/performance) Payment and incentives model for crowd (solving the problem of trade-off between budget and quality) Budget management and payment strategies Ordering/Sequences of questions to crowd (session model and context) Spamming detection Load balancing

Collecting meta-data from multimedia Existing projects: The ESP game – tagging images Tag a Tune – audio annotation Squigl – detect objects on images Verbosity – a common knowledge trainer (game) More projects – Games with a Purpose More ideas for projects: Video tagging, events detection using crowd

The esp game

Tag a tune

SQUIGL

verbosity

Text translation Existing projects: Paper from UPenn by Zaidan & Callison-Burch Facebook localization was partially done by crowd GetLocalization.com TL;DR community Google translate “improve translation” feature More ideas for projects: Projects like English Wikipedia  Simple English Wiki

Get localization

TL;DR Facebook community

Google translate

Answering questions and queries with crowd Existing projects: StackOverflow TripAdvisor Yahoo! Answers A lot of “academical” research to make such projects more structured and focused (OASSIS, AskIt, …) More ideas for projects: Free text to queries translation (using crowd) Meta-queries

Planning with crowd Existing projects: CrowdPlannr (academical research, TAU) CrowdCierge (also academical, MIT) More ideas for projects: Events planning Schedule Suggestions Work split and sharing Better (more general) CrowdPlannr…

crowdcierge

GEOSPATRIAL CROWD APPS Existing projects: Scores & Video city maps – soccer fans visualization Waze OpenStreetMaps – collaborative mapping More ideas for projects: Wikipedia missing photos If there is a missing photo of some place, you can ask user to upload it (if he is currently there) Geo-based tasks and job sequence planning Crowd-based layers over existing maps

Scores&video city maps

Waze

Openstreetmaps

Tzevaadom.com

Building knowledgebase Existing projects: YAGO (academical, not crowdsourcing) OASSIS (academical, crowd minning) QOCO (academical, query-oriented) More ideas for projects: Building database of popular facts (common knowledge) Building database of “rare”/not documented facts Enrich datasets with semantics Combine multipledatasets

OASSIS

ASKIT

TEASERS

GUESS & Trivia

Crowdsourcing INFRASTRUCTURE Possible options for infrastructure projects: Crowdsourcing platform design and implementation (AMTurk, CrowdFlower, …) Matching crowd users to questions/tasks (based on their preferences/test questions/performance) Payment and incentives model for crowd (solving the problem of trade-off between budget and quality) Budget management and payment strategies Ordering/Sequences of questions to crowd (session model and context) Spamming detection Load balancing

Crowdsourcing PLATFORM Existing projects: Amazon Mechanical Turk (microtasks) CrowdFlower oDesk (for big projects) More ideas for projects: Better management panel Better connection to users

Mechanical turk

crowdflower

Matching crowd users to questions/tasks The matching can be done based on: User preference Requester preference (“Give me 5 people from US, that are experts in baseball”) Qualification tests More?

Payment and incentives Create the real “free” market: Bidding Monopoles and workers unions Long-term contracts Quality / Payment tradeoff …

Budget management Easier way to manage your workers Algorithmic budget allocation Manual preferences Better UI (control panel) …

Management panel

Ordering of tasks Crowd users have different preferences Many tasks of the same type Many tasks in the same context Different tasks “Easy-Only” (less money per tasks – more tasks) “Difficult-Only” Mixed …

Spamming detection Detect spammers Don’t hurt real crowd (can cost you reputation) Provide high quality and low latency …

Load balancing The tasks should be done by all the crowd… There are many “strong” crowd users, and many requesters willing to get them Not always available …

Questions?