Hate speech detection in Twitter using hybrid embeddings and improved cuckoo search-based neural networks | Emerald Insight

To read this content please select one of the options below:

Hate speech detection in Twitter using hybrid embeddings and improved cuckoo search-based neural networks

Femi Emmanuel Ayo (Computer Science, College of Natural and Applied Sciences, McPherson University, Seriki Sotayo, Nigeria)
Olusegun Folorunso (Computer Science, College of Natural Sciences, Federal University of Agriculture Abeokuta, Abeokuta, Nigeria)
Friday Thomas Ibharalu (Computer Science, Federal University of Agriculture Abeokuta, Abeokuta, Nigeria)
Idowu Ademola Osinuga (Mathematics, Federal University of Agriculture Abeokuta, Abeokuta, Nigeria)

International Journal of Intelligent Computing and Cybernetics

ISSN: 1756-378X

Article publication date: 3 November 2020

Issue publication date: 13 November 2020

547

Abstract

Purpose

Hate speech is an expression of intense hatred. Twitter has become a popular analytical tool for the prediction and monitoring of abusive behaviors. Hate speech detection with social media data has witnessed special research attention in recent studies, hence, the need to design a generic metadata architecture and efficient feature extraction technique to enhance hate speech detection.

Design/methodology/approach

This study proposes a hybrid embeddings enhanced with a topic inference method and an improved cuckoo search neural network for hate speech detection in Twitter data. The proposed method uses a hybrid embeddings technique that includes Term Frequency-Inverse Document Frequency (TF-IDF) for word-level feature extraction and Long Short Term Memory (LSTM) which is a variant of recurrent neural networks architecture for sentence-level feature extraction. The extracted features from the hybrid embeddings then serve as input into the improved cuckoo search neural network for the prediction of a tweet as hate speech, offensive language or neither.

Findings

The proposed method showed better results when tested on the collected Twitter datasets compared to other related methods. In order to validate the performances of the proposed method, t-test and post hoc multiple comparisons were used to compare the significance and means of the proposed method with other related methods for hate speech detection. Furthermore, Paired Sample t-Test was also conducted to validate the performances of the proposed method with other related methods.

Research limitations/implications

Finally, the evaluation results showed that the proposed method outperforms other related methods with mean F1-score of 91.3.

Originality/value

The main novelty of this study is the use of an automatic topic spotting measure based on naïve Bayes model to improve features representation.

Keywords

Acknowledgements

Declaration of competing interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Citation

Ayo, F.E., Folorunso, O., Ibharalu, F.T. and Osinuga, I.A. (2020), "Hate speech detection in Twitter using hybrid embeddings and improved cuckoo search-based neural networks", International Journal of Intelligent Computing and Cybernetics, Vol. 13 No. 4, pp. 485-525. https://doi.org/10.1108/IJICC-06-2020-0061

Publisher

:

Emerald Publishing Limited

Copyright © 2020, Emerald Publishing Limited

Related articles