Using Convolutional Neural Networks to Classify Hate-Speech

Gambäck, Björn; Sikdar, Utpal Kumar

doi:10.18653/v1/w17-3013

articleJan 1, 2017GOLD OA

Using Convolutional Neural Networks to Classify Hate-Speech

BGBjörn Gambäck UKUtpal Kumar Sikdar

Norwegian University of Science and Technology

Indexed incrossref

Abstract

The paper introduces a deep learningbased Twitter hate-speech text classification system. The classifier assigns each tweet to one of four predefined categories: racism, sexism, both (racism and sexism) and non-hate-speech. Four Convolutional Neural Network models were trained on resp. character 4-grams, word vectors based on semantic information built using word2vec, randomly generated word vectors, and word vectors combined with character n-grams. The feature set was down-sized in the networks by maxpooling, and a softmax function used to classify tweets. Tested by 10-fold crossvalidation, the model based on word2vec embeddings performed best, with higher precision than recall, and a 78.3% F-score.

Citation impact

529

total citations

FWCI: 43.72
Percentile: 100%
References: 22

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Word2vec
Softmax function
Computer science
Artificial intelligence
Convolutional neural network
Pooling
Word (group theory)
Natural language processing

UN Sustainable Development Goals

Gender equality

No related works found for this paper.