wikimania 2009 - yahoo answers community moderation

36
Yahoo! Answers Community Moderation Micah Alpern, Director of User Experience Yahoo! Search

Upload: micah-alpern

Post on 02-Nov-2014

5.357 views

Category:

Technology


3 download

DESCRIPTION

Yahoo! Answers is the largest Q&A service with hundreds of thousands of questions asked and answered every day. Traditional moderation systems failed to scale with the products growth. To address these challenges the team deployed a Community Moderation system which empowering trusted Answers members to report and remove abusive content automatically. One of the challenges to making the system work was to understand who to trust. The Answers team built a rich reputation model based on an analysis of over a dozen different system actions. Creating a successful community moderation system required changes in technology, community guidelines & policy, and user experience design.

TRANSCRIPT

Page 1: Wikimania 2009 - Yahoo Answers Community Moderation

Yahoo! AnswersCommunity Moderation

Micah Alpern, Director of User ExperienceYahoo! Search

Page 2: Wikimania 2009 - Yahoo Answers Community Moderation

•Yahoo! AnswersAsk, Answer, Discover

• 1.2 million Q&A per day

• Experiential Knowledge

Page 3: Wikimania 2009 - Yahoo Answers Community Moderation

•Yahoo! AnswersAsk, Answer, Discover

• 1.2 million Q&A per day

• Experiential Knowledge

Page 4: Wikimania 2009 - Yahoo Answers Community Moderation

Activity

Page 5: Wikimania 2009 - Yahoo Answers Community Moderation

Activity

Page 6: Wikimania 2009 - Yahoo Answers Community Moderation
Page 7: Wikimania 2009 - Yahoo Answers Community Moderation

Answers is a marketplace

•People want lots of good answers to their questions quickly.

This means wide distribution of your freshest content.

Higher visibility for abuse in Questions than Answers.

“If you don't fix it right away, it's not worth fixing.”

by getxsickphotography

Page 8: Wikimania 2009 - Yahoo Answers Community Moderation

Traditional Moderation Failed

•As Answers grew traditional moderation methods failed to scale with the community.

•Manual customer care systems: Had slow response times

Treated all abuse reports the same

Had high false positive and false negative rates

Were high cost and scaled up with traffic

Page 9: Wikimania 2009 - Yahoo Answers Community Moderation

Machine learning?

A team of PHDs worked on a black box abuse classifier.

Pre-filter of bad content to be reviewed by Customer Care.

Could only handle questions, not the answers

After a while that broke as well

60% of reports were fake abuse by users trying to report others.

Page 10: Wikimania 2009 - Yahoo Answers Community Moderation

Community Moderation

• Deployed a newCommunity Moderation system  Empowered trusted Answers users to help moderate content by allowing their report abuse actions to automatically delete content.

 System didn’t reveal reputation scores to users,

Encouraged them to report accurately so they could gain more community influence.

Illustration by Bryce Glass

To address these challenges:

Page 11: Wikimania 2009 - Yahoo Answers Community Moderation

Reputation System

Page 12: Wikimania 2009 - Yahoo Answers Community Moderation

Answers Moderation Model

Hide Content

Item?

ContentReporter

Author

Page 13: Wikimania 2009 - Yahoo Answers Community Moderation

Answers Moderation ModelREPORTER

Abuse Suspicion Reputation

REPORTER Community InvestmentReputation

User’s Abusive

Reputation

Confirmed Abuse

Reporter Reputation

Abuse Reporter

Bootstrap Reputation

AbuseReporter

Reputation

CONTENT ITEM AbuseReputation

AUTHOROverall Asker

Reputation

AUTHOROverall Answerer

Reputation

AUTHOROverall Asker

Reputation

Question Quality

Reputation

AUTHOROverall

Answerer Reputation

Answer Quality

Reputation

Hide Content

Item?

Page 14: Wikimania 2009 - Yahoo Answers Community Moderation

the story of a report

an abuse report comes in

We need to decided: Who do we believe? Do we hide the content?

A question is asked..

Author Reporter

Page 15: Wikimania 2009 - Yahoo Answers Community Moderation
Page 16: Wikimania 2009 - Yahoo Answers Community Moderation
Page 17: Wikimania 2009 - Yahoo Answers Community Moderation

The content

A question is asked..

What do we know about the content? Have others reported it

Machined learned “Junk detector” score

Language specific

Requires tons of training data

!= a quality detector

CONTENT ITEM

AbuseReputation

Page 18: Wikimania 2009 - Yahoo Answers Community Moderation

the reporter

What do we know about the reporter?

Reporter

08.17.08 4:41pm

08.20.08 6:23 pm

10.04.08 1:08 pm

11.06.08 7:03 am

Past reporting behavior

hiddennot

hiddenhide overturned

Outcome

not hidden

Confirmed Abuse

Reporter Reputation CONTENT

ITEM AbuseReputation

AbuseReporter

Reputation Past reporting performance

and it’s outcome

For many reporters we don’t have enough feedback.

Page 19: Wikimania 2009 - Yahoo Answers Community Moderation

How do we solve the cold start problem?

Page 20: Wikimania 2009 - Yahoo Answers Community Moderation

Cold Start:Reporter Reputation

Reporter

Suspicion: Implied Karma

Hard evidence

Examine signals across synonymous identities

Confirmed Abuse

Reporter Reputation Abuse

Reporter Reputation

CONTENT ITEM AbuseReputation

Take this result and update the running reputation of the content reported.

Reporter’s Abuse

Suspicion Reputation

Reporter’s Community InvestmentReputation

User’s Abusive

Reputation

Abuse Reporter

Bootstrap Reputation

Page 21: Wikimania 2009 - Yahoo Answers Community Moderation

Compared to

Illustration by Bryce Glass

Page 22: Wikimania 2009 - Yahoo Answers Community Moderation

Do we trust the author?How good are they at Asking questions?How good are they at Answering?Do they have a history of abuse?

AUTHOROverall Asker

Reputation

AUTHOROverall Answerer

Reputation

AUTHOROverall Asker

Reputation

Question Quality

Reputation

AUTHOROverall

Answerer Reputation

Answer Quality

Reputation

Users Abusive

Reputation

Page 23: Wikimania 2009 - Yahoo Answers Community Moderation

we make a decision

an abuse report comes in

A question is asked..

Author Reporter

Page 24: Wikimania 2009 - Yahoo Answers Community Moderation
Page 25: Wikimania 2009 - Yahoo Answers Community Moderation
Page 26: Wikimania 2009 - Yahoo Answers Community Moderation

decisions can be revised

an abuse report comes in

A question is asked..

Author Reporter

Trust level is re-instated andtemporary protection assigned.

Trust level for Reporting is decreased.

Page 27: Wikimania 2009 - Yahoo Answers Community Moderation

CraigsListAfter 3 flags the post is deletedOther systems: Ignore new user flags

In Answers:• Its harder to remove content from good contributors.

• Easier for a even a new person to remove a spammer.

Naive moderation

Page 28: Wikimania 2009 - Yahoo Answers Community Moderation

So how did we do?

Page 29: Wikimania 2009 - Yahoo Answers Community Moderation

Speed

Page 30: Wikimania 2009 - Yahoo Answers Community Moderation

Accuracy

Page 31: Wikimania 2009 - Yahoo Answers Community Moderation

Concussion

• Reputation system helped Answers:

Scale & empower the community to mange abuse

Provide a better experience to Askers and Answerers

Page 32: Wikimania 2009 - Yahoo Answers Community Moderation

Questions?

Bryce Glass

Randy Farmer

Ori Zaltzman

http://buildingreputation.com/

Micah Alpern

[email protected]

twitter:malpern

Page 33: Wikimania 2009 - Yahoo Answers Community Moderation
Page 34: Wikimania 2009 - Yahoo Answers Community Moderation

SPAM

•  “We have other methods for identifying and dealing with spam, but as a very popular site we are a target for spammers so are continually upping our game to find new ways to identify them quicker and remove their content faster (though we shouldn’t share the methods we take). We recognize spam damages the experience for our users and so are committed to reducing spam on Answers as much as possible.”

Page 35: Wikimania 2009 - Yahoo Answers Community Moderation

Chat and Low quality content

• “You mention this a bit in the presentation, but generally we acknowledge there are different types of questions being asked and answered on the site and accept that currently we’re limited in how we are identifying and highlighting knowledge based Q&A vs Social Q&A. However, we are looking at ways to easier promote the type of content a user wants to see.”

Page 36: Wikimania 2009 - Yahoo Answers Community Moderation

Accuracy

• “focus is on experiential knowledge, practical help based on someone’s opinion or experience. Different type of knowledge than covered by wikipedia. We don’t want to publish content which is misleading, incorrect or harmful but use the “report abuse” system and our trusted users to balance any dubious advice.