Hate Speech Detection with Comment Embeddings

Djuric, Nemanja; Zhou, Jing; Morris, Robin K.; Grbovic, Mihajlo; Radosavljević, Vladan; Bhamidipati, Narayan

doi:10.1145/2740908.2742760

Article · May 18, 2015 · Closed access

Yahoo (United States) · Yahoo (United Kingdom)

Indexed in Crossref

Abstract

We address the problem of hate speech detection in online user comments. Hate speech, defined as "abusive speech targeting specific group characteristics, such as ethnicity, religion, or gender", is an important problem plaguing websites that allow users to leave feedback, having a negative impact on their online business and overall user experience. We propose to learn distributed low-dimensional representations of comments using recently proposed neural language models, which can then be fed as inputs to a classification algorithm. Our approach addresses issues of high dimensionality and sparsity that impact the current state-of-the-art, resulting in highly efficient and effective hate speech detectors.
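
The two-stage pipeline described in the abstract (learn a dense, low-dimensional vector per comment, then feed those vectors to a classifier) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not name the model or the classifier, so the sketch assumes a paragraph-vector-style model (a toy PV-DBOW variant with a full softmax) and logistic regression. The function names, hyperparameters, comments, and labels are all hypothetical.

```python
import math
import random


def train_comment_embeddings(comments, dim=8, epochs=50, lr=0.1, seed=0):
    """Toy PV-DBOW (assumed model): each comment vector learns to predict its own words."""
    rng = random.Random(seed)
    vocab = sorted({w for c in comments for w in c.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    # One dense vector per comment, one output vector per vocabulary word.
    docs = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in comments]
    out = [[0.0] * dim for _ in vocab]
    for _ in range(epochs):
        for d, comment in enumerate(comments):
            for word in comment.lower().split():
                target = index[word]
                # Full softmax over the (tiny) vocabulary.
                scores = [sum(a * b for a, b in zip(docs[d], o)) for o in out]
                top = max(scores)
                exps = [math.exp(s - top) for s in scores]
                total = sum(exps)
                grad_doc = [0.0] * dim
                for j, o in enumerate(out):
                    err = exps[j] / total - (1.0 if j == target else 0.0)
                    for k in range(dim):
                        grad_doc[k] += err * o[k]      # uses o[k] before its update
                        o[k] -= lr * err * docs[d][k]
                for k in range(dim):
                    docs[d][k] -= lr * grad_doc[k]
    return docs, vocab


def sigmoid(z):
    # Clamp to avoid overflow in math.exp for extreme scores.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic_regression(vectors, labels, epochs=200, lr=0.5):
    """Plain SGD logistic regression on the dense comment vectors (assumed classifier)."""
    dim = len(vectors[0])
    weights, bias = [0.0] * dim, 0.0
    for _ in range(epochs):
        for x, y in zip(vectors, labels):
            p = sigmoid(bias + sum(w * v for w, v in zip(weights, x)))
            for k in range(dim):
                weights[k] -= lr * (p - y) * x[k]
            bias -= lr * (p - y)
    return weights, bias


# Hypothetical toy data: 1 = abusive, 0 = clean.
comments = [
    "those people are awful and should leave",
    "people like them are awful",
    "great article thanks for sharing",
    "thanks for the helpful article",
]
labels = [1, 1, 0, 0]

docs, vocab = train_comment_embeddings(comments, dim=8)
weights, bias = train_logistic_regression(docs, labels)
probs = [sigmoid(bias + sum(w * v for w, v in zip(weights, x))) for x in docs]

print(f"vocabulary size (bag-of-words dimension): {len(vocab)}")
print(f"embedding dimension: {len(docs[0])}")
```

Each comment is represented by 8 dense values rather than one sparse count per vocabulary word, which is the dimensionality and sparsity contrast the abstract points to. The sketch only embeds the comments it was trained on; scoring an unseen comment would need an extra inference step that is not shown here.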

Citation impact

Total citations: 721

FWCI: 64.10
Percentile: 100%
References: 9

Authors: 6

Topics & keywords

Keywords

Computer science
Voice activity detection
Curse of dimensionality
Speech recognition
Detector
Free speech
Artificial intelligence
Machine learning

UN Sustainable Development Goals

Gender equality
