articleApr 3, 2017GOLD OA

Ex Machina

Wikimedia Foundation

Indexed incrossref

Abstract

The damage personal attacks cause to online discourse motivates many platforms to try to curb the phenomenon. However, understanding the prevalence and impact of personal attacks in online platforms at scale remains surprisingly difficult. The contribution of this paper is to develop and illustrate a method that combines crowdsourcing and machine learning to analyze personal attacks at scale. We show an evaluation method for a classifier in terms of the aggregated number of crowd-workers it can approximate. We apply our methodology to English Wikipedia, generating a corpus of over 100k high quality human-labeled comments and 63M machine-labeled ones from a classifier that is as good as the aggregate of 3…

Citation impact

638
total citations
FWCI
50.98
Percentile
100%
References
39
Citations per year

Authors

3

Topics & keywords

Keywords
  • Crowdsourcing
  • Computer science
  • Classifier (UML)
  • Personally identifiable information
  • Aggregate (composite)
  • Artificial intelligence
  • Machine learning
  • Data science
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.