open dataopensource conf may2015
TRANSCRIPT
4 CrowdFlower, Inc. – Proprietary and Confidential
The Effect of Better Algorithms
[Chart. X-axis: Naïve Bayes, Maximum Entropy, SVM. Y-axis: Classifier Error Rate, 0% to 25%.]
The Effect of Better Features
[Chart. X-axis: Unigrams, Bigrams, Unigrams+Bigrams. Y-axis: Classifier Error Rate, 0% to 30%.]
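As a rough illustration of the three feature sets named on this slide, here is a minimal sketch, assuming lowercase whitespace tokenization (the deck does not say how text was tokenized). The function name `ngram_features` is hypothetical.

```python
from collections import Counter


def ngram_features(text, use_unigrams=True, use_bigrams=True):
    """Count unigram and/or bigram features in a whitespace-tokenized text.

    Assumption: tokens are lowercase, whitespace-separated words.
    """
    tokens = text.lower().split()
    features = Counter()
    if use_unigrams:
        features.update(tokens)
    if use_bigrams:
        # Join each adjacent token pair so a bigram is a single feature key.
        features.update(" ".join(pair) for pair in zip(tokens, tokens[1:]))
    return features


example = "not good at all"
unigrams = ngram_features(example, use_bigrams=False)
bigrams = ngram_features(example, use_unigrams=False)
both = ngram_features(example)
```

Here `unigrams` holds the four single words, `bigrams` holds pairs such as "not good", and `both` is the union of the two sets, which lets a classifier see word order that unigrams alone discard.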
The Effect of More Data
[Chart. X-axis: N, 2N, 4N. Y-axis: Classifier Error Rate, 0% to 14%.]
The Effect of Cleaner Data
[Chart. X-axis: 90% Accurate Data, 95% Accurate Data, 100% Accurate Data. Y-axis: Classifier Error Rate, 0% to 14%.]
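A minimal sketch of how training labels of a given accuracy could be simulated, assuming noise is introduced by replacing a correct label with a different class at random. The deck does not describe how its 90% and 95% accurate data sets were produced, so `corrupt_labels` and `error_rate` are hypothetical helpers, not the author's method.

```python
import random


def corrupt_labels(labels, accuracy, classes, seed=0):
    """Return a copy of labels where each one stays correct with
    probability `accuracy` and otherwise becomes a different class.

    Assumption: errors are independent and uniform over the wrong classes.
    """
    rng = random.Random(seed)
    noisy = []
    for label in labels:
        if rng.random() < accuracy:
            noisy.append(label)
        else:
            wrong_choices = [c for c in classes if c != label]
            noisy.append(rng.choice(wrong_choices))
    return noisy


def error_rate(predicted, actual):
    """Fraction of positions where predicted and actual labels differ."""
    wrong = sum(p != a for p, a in zip(predicted, actual))
    return wrong / len(actual)


clean = ["pos", "neg"] * 500
noisy_90 = corrupt_labels(clean, accuracy=0.90, classes=["pos", "neg"])
label_noise = error_rate(noisy_90, clean)  # roughly 0.10 for 90% accurate labels
```

Training the same classifier on `clean` and on `noisy_90`, then scoring both against held-out clean labels with `error_rate`, would give the kind of comparison this slide plots.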