
Pick-A-Crowd: Tell Me What You Like, and I'll Tell You What to Do

A Crowdsourcing Platform for Personalized Human Intelligence Task Assignment Based on Social Networks

Djellel E. Difallah, Gianluca Demartini, Philippe Cudré-Mauroux
eXascale Infolab, University of Fribourg, Switzerland

15th May 2013, WWW 2013, Rio de Janeiro, Brazil


Crowdsourcing

• Exploit human intelligence to solve tasks that are simple for humans but complex for machines

• Examples: Wikipedia, reCAPTCHA, Duolingo

• Incentives: financial, fun, visibility


Motivation

• The pull methodology is suboptimal: workers browse an open list of tasks and self-select, so the workers who actually complete a task are not necessarily those who would be most effective at it

[Venn diagram: the set of actual workers only partially overlaps the set of effective workers; the goal is to maximize this overlap]


Motivation

• The push methodology works as a Task-to-Worker recommender system: the platform proactively assigns each task to the workers best suited to answer it


Contribution and Claim

• Pick-A-Crowd: a system architecture that performs Task-to-Worker matching based on:
  – The worker's social profile
  – The task context

• Workers can provide higher quality answers on tasks they relate to


Worker Social Profiling

“You Are What You Like”: a worker's social profile is built from the Facebook pages they like


Problem Definition (1) – The Human Intelligence Task (HIT)

Example HIT types: Image Tagging, Data Collection, Survey, Categorization

Batch of tasks (sketched in code below):
- Title
- Batch instruction
- Specific task instruction*
- Task data: text, options, additional data (image, URL)
- List of categories*
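For concreteness, a minimal sketch of this task data model in Python; the class and field names are illustrative (not taken from the paper), and the fields marked * above are modeled as optional:

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class Task:
    text: str                                           # the question shown to the worker
    options: List[str] = field(default_factory=list)    # candidate answers, if multiple choice
    attachment_url: Optional[str] = None                 # additional data, e.g. an image URL

@dataclass
class TaskBatch:
    title: str
    batch_instruction: str
    specific_instruction: Optional[str] = None            # optional (*)
    categories: List[str] = field(default_factory=list)   # optional (*), used by category-based matching
    tasks: List[Task] = field(default_factory=list)

# Hypothetical example batch, in the spirit of the evaluation topics later in the deck.
batch = TaskBatch(
    title="Soccer player tagging",
    batch_instruction="Identify the player shown in each image.",
    categories=["Athlete", "Sports Team"],
    tasks=[Task(text="Who is shown in this photo?",
                options=["Messi", "Ronaldo", "Neymar", "Xavi", "Iniesta"],
                attachment_url="http://example.org/img1.jpg")],
)
```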


Problem Definition (2) – The Worker

Marketplace profile (e.g., AMT):
- Completed HITs: 256
- Approval rate: 96%
- Qualification types, generic qualifications

Social profile: the Facebook pages the worker likes, each with
- Title
- Category
- Description
- Feed, etc.


Problem Definition (3) – Task-to-Worker Matching

Inputs: a batch of tasks (title, instructions, task data, list of categories) and, for each worker, the pages they like (title, category, description, feed, etc.)

Matching proceeds in two steps:
1. Task-to-Page matching function: category based, expert finding, or semantic
2. Worker ranking


Matching Models (1/3) – Category Based

• The requester provides a list of categories related to the batch
• We build the subset of pages whose category is in the batch's category list
• Rank the workers by the number of liked pages in the subset (see sketch below)
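A minimal sketch of this category-based ranking, assuming workers are represented by the pages they like and each page carries a category field, as in the data model above; names are illustrative, not the authors' implementation:

```python
from typing import Dict, List, Set, Tuple

def rank_workers_by_category(
    batch_categories: Set[str],
    worker_likes: Dict[str, List[dict]],   # worker id -> liked pages, each with a "category" field
) -> List[Tuple[str, int]]:
    """Rank workers by how many of their liked pages fall into the batch's categories."""
    scores = {}
    for worker, pages in worker_likes.items():
        # Count the liked pages whose category matches one of the batch categories.
        scores[worker] = sum(1 for page in pages if page.get("category") in batch_categories)
    # Highest score first: these are the workers the batch gets pushed to.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

workers = {
    "w1": [{"title": "FC Barcelona", "category": "Sports Team"},
           {"title": "Lionel Messi", "category": "Athlete"}],
    "w2": [{"title": "Python", "category": "Software"}],
}
print(rank_workers_by_category({"Athlete", "Sports Team"}, workers))
# [('w1', 2), ('w2', 0)]
```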


Matching Models (2/3) – Expert Finding

• Build an inverted index over the pages' titles and descriptions
• Use the title/description of the tasks as a keyword query against the inverted index to obtain a subset of pages
• Rank the workers by the number of liked pages in the subset (see sketch below)
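A minimal sketch of the expert-finding variant with a toy in-memory inverted index; a real deployment would use a proper retrieval engine, and all names here are illustrative:

```python
import re
from collections import defaultdict
from typing import Dict, List, Set, Tuple

def tokenize(text: str) -> List[str]:
    return re.findall(r"[a-z0-9]+", text.lower())

def build_inverted_index(pages: Dict[str, str]) -> Dict[str, Set[str]]:
    """Map each term to the set of page ids whose title/description contain it."""
    index = defaultdict(set)
    for page_id, text in pages.items():
        for term in tokenize(text):
            index[term].add(page_id)
    return index

def rank_workers_expert_finding(task_text: str,
                                pages: Dict[str, str],
                                worker_likes: Dict[str, List[str]]) -> List[Tuple[str, int]]:
    index = build_inverted_index(pages)
    # Pages matching at least one query term form the relevant subset.
    relevant = set().union(*(index.get(term, set()) for term in tokenize(task_text)))
    scores = {w: len(set(liked) & relevant) for w, liked in worker_likes.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

pages = {"p1": "FC Barcelona official page", "p2": "Python programming language"}
worker_likes = {"w1": ["p1"], "w2": ["p2"]}
print(rank_workers_expert_finding("Tag photos of Barcelona players", pages, worker_likes))
# [('w1', 1), ('w2', 0)]
```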

Matching Models (3/3) – Semantic Based

• Link the task context to an external knowledge base (e.g., DBpedia)
• Exploit the underlying graph structure to determine the similarity between HITs and pages
  – Assumption: a worker who likes a page is able to answer questions about related entities
  – Assumption: a worker who likes a page is able to answer questions about entities of the same type
• Rank the workers by the number of liked pages in the subset (see sketch below)
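A minimal sketch of the type-similarity flavour, assuming entity linking has already been done offline (task content and liked pages mapped to knowledge-base entities) and that entity_types is a precomputed lookup of each entity's types; the relatedness flavour would swap the type check for a graph-distance measure. All names are illustrative:

```python
from typing import Dict, List, Set, Tuple

def rank_workers_type_similarity(
    task_entities: Set[str],                       # KB entities linked from the task content
    entity_types: Dict[str, Set[str]],             # precomputed: entity -> its KB types
    worker_liked_entities: Dict[str, List[str]],   # worker -> entities behind the pages they like
) -> List[Tuple[str, int]]:
    # Types of all entities mentioned by the task.
    task_types = set().union(*(entity_types.get(e, set()) for e in task_entities))
    scores = {}
    for worker, liked in worker_liked_entities.items():
        # A liked page counts if its entity shares at least one type with the task entities.
        scores[worker] = sum(1 for e in liked if entity_types.get(e, set()) & task_types)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

entity_types = {
    "Lionel_Messi": {"SoccerPlayer", "Athlete"},
    "FC_Barcelona": {"SoccerClub"},
    "Python_(programming_language)": {"ProgrammingLanguage"},
}
workers = {"w1": ["Lionel_Messi", "FC_Barcelona"], "w2": ["Python_(programming_language)"]}
print(rank_workers_type_similarity({"Lionel_Messi"}, entity_types, workers))
# [('w1', 1), ('w2', 0)]
```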


[Diagram: HIT content and Facebook pages are linked through the knowledge base via two measures: relatedness and type-similarity]


Pick-A-Crowd Architecture


Experimental Evaluation

• The Facebook app OpenTurk implements part of the Pick-A-Crowd architecture:
  – More than 170 registered workers participated
  – Over 12k pages crawled

• Covered both multiple-choice and open-ended questions:
  – 50 images with a multiple-choice question and 5 candidate answers (topics: Soccer, Actors, Music, Authors, Movies, Anime)
  – 20 open-ended questions related to one topic (Cricket)


OpenTurk app


Evaluation – Correlation between the crowd accuracy and the number of relevant likes (Category Based)

[Plot: worker precision (y-axis) vs. number of relevant likes (x-axis)]


Evaluation (Baseline) – Amazon Mechanical Turk (AMT)

AMT 3 = majority vote of 3 workers
AMT 5 = majority vote of 5 workers
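For reference, a minimal sketch of the majority-vote aggregation behind these baselines (the slides do not specify tie handling; here a tie falls back to the first answer seen):

```python
from collections import Counter
from typing import List

def majority_vote(answers: List[str]) -> str:
    """Return the answer chosen by the largest number of workers."""
    # most_common(1) yields the (answer, count) pair with the highest count;
    # on a tie it keeps the first answer encountered.
    return Counter(answers).most_common(1)[0][0]

# AMT 3: aggregate the answers of 3 workers on one question.
print(majority_vote(["Messi", "Ronaldo", "Messi"]))  # Messi
```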


Evaluation – HIT Assignment Models

[Results chart: category-based approach]


Evaluation – HIT Assignment Models

[Results charts: expert-finding approach, matching on title/instruction content]


Evaluation – HIT Assignment Models

[Results charts: semantic-based approach, type relatedness]


Evaluation – Comparison with Mechanical Turk

[Chart: answer quality of AMT vs. Pick-A-Crowd]


Conclusions and Future Work

• Pull vs. push methodologies in crowdsourcing
• Pick-A-Crowd system architecture with Task-to-Worker recommendation
• Experimental comparison with AMT shows a consistent quality improvement: “Workers know what they like”
• Future work: exploit more of the social activity and handle content-less tasks


Next Step

• We are building a crowdsourcing platform for the research community

• Pre-register on www.openturk.com

Thank You!
