Analysis of Hate Speech Detection based on Obstacles and Solutions using Deeplearning Methods

Shayuti Rohmah, Hopa Kapi

Abstract


Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation. With online hate speech on the rise, its automatic detection as a natural language processing task is gaining increasing interest. However, it is only recently that it has been shown that existing models generalise poorly to unseen data. This survey paper attempts to summarise how generalisable existing hate speech detection models are and the reasons why hate speech models struggle to generalise, sums up existing attempts at addressing the main obstacles, and then proposes directions of future research to improve generalisation in hate speech detection.

Full Text:

PDF

References


Agrawal S, Awekar A. 2018. Deep learning for detecting cyberbullying across multiple social media platforms. In: European conference on information retrieval. Grenoble, France. Springer. 141-153

Al-Hassan A, Al-Dossari H. 2019. Detection of hate speech in social networks: a survey on multilingual corpus. In: Computer Science & Information Technology (CS & IT). Chennai, India. AIRCC Publishing Corporation. 83-100

Alatawi HS, Alhothali AM, Moria KM. 2020. Detecting white supremacist hate speech using domain specific word embedding with deep learning and BERT.

Alorainy W, Burnap P, Liu H, Williams ML. 2019. The enemy among us: detecting cyber hate speech with threats-based othering language embeddings. ACM Transactions on the Web 13(3):1-26

Arango A, Pérez J, Poblete B. 2020. Hate speech detection is not as easy as you may think: a closer look at model validation (extended version) Information Systems Epub ahead of print 2020 30 June

Badjatiya P, Gupta M, Varma V. 2019. Stereotypical bias removal for hate speech detection task using knowledge-based generalizations. In: Liu L, White RW, Mantrach A, Silvestri F, McAuley JJ, Baeza-Yates R, Zia L, eds. The World Wide Web conference, WWW 2019, San Francisco, CA, USA, May 13-17, 2019. ACM. 49-59

Badjatiya P, Gupta S, Gupta M, Varma V. 2017. Deep learning for hate speech detection in tweets. In: Proceedings of the 26th international conference on World Wide Web companion. 759-760

Banko M, MacKeen B, Ray L. 2020. A unified taxonomy of harmful content. In: Proceedings of the fourth workshop on online abuse and harms. Association for Computational Linguistics. 125-137

Basile V. 2020. It’s the end of the gold standard as we know itOn the impact of pre-aggregation on the evaluation of highly subjective tasks. In: CEUR workshop proceedings. 10

Basile V, Bosco C, Fersini E, Nozza D, Patti V, Rangel Pardo FM, Rosso P, Sanguinetti M. 2019. SemEval-2019 Task 5: multilingual detection of hate speech against immigrants and women in twitter. In: Proceedings of the 13th international workshop on semantic evaluation. Minneapolis, Minnesota, USA. Association for Computational Linguistics. 54-63

Baziotis C, Pelekis N, Doulkeridis C. 2017. DataStories at SemEval-2017 Task 4: Deep LSTM with attention for message-level and topic-based sentiment analysis. In: Proceedings of the 11th international workshop on semantic evaluation (SemEval-2017). Vancouver, Canada. Association for Computational Linguistics. 747-754

Blei DM, Ng AY, Jordan MI. 2003. Latent dirichlet allocation. Journal of Machine Learning Research 3(Jan):993-1022

Blodgett SL, Green L, OConnor B. 2016. Demographic dialectal variation in social media: a case study of African-American English. In: Proceedings of the 2016 conference on empirical methods in natural language processing. 1119-1130

Blodgett SL, O’Connor B. 2017. Racial disparity in natural language processing: a case study of social media African-American English.

Bodapati S, Gella S, Bhattacharjee K, Al-Onaizan Y. 2019. Neural word decomposition models for abusive language detection. In: Proceedings of the third workshop on abusive language online. Florence, Italy. Association for Computational Linguistics. 135-145

Bojanowski P, Grave E, Joulin A, Mikolov T. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5:135-146

Bolukbasi T, Chang K, Zou JY, Saligrama V, Kalai AT. 2016. Man is to computer programmer as woman is to homemaker? Debiasing word embeddings. In: Lee DD, Sugiyama M, von Luxburg U, Guyon I, Garnett R, eds. Advances in neural information processing systems 29: Annual conference on neural information processing systems 2016, December 5-10, 2016, Barcelona, Spain. 4349-4357

Breitfeller L, Ahn E, Jurgens D, Tsvetkov Y. 2019. Finding microaggressions in the wild: a case for locating elusive phenomena in social media posts. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). Hong Kong, China. Association for Computational Linguistics. 1664-1674

Buolamwini J, Gebru T. 2018. Gender shades: Intersectional accuracy disparities in commercial gender classification. In: Conference on fairness, accountability and transparency. 77-91

Cao R, Lee RK.-W, Hoang T.-A. 2020. DeepHate: hate speech detection via multi-faceted text representations. In: 12th ACM conference on web science, WebSci ’20. New York, NY, USA. Association for Computing Machinery. 11-20

Caruana R. 1997. Multitask learning. Machine Learning 28(1):41-75

Caselli T, Basile V, Mitrović J, Kartoziya I, Granitzer M. 2020. I feel offended, don’t be abusive! implicit/explicit messages in offensive and abusive language. In: Proceedings of the 12th language resources and evaluation conference. Marseille, France. European Language Resources Association. 6193-6202

Caselli T, Basile V, Mitrovi J, Granitzer M. 2021. HateBERT: retraining BERT for abusive language detection in english.

Cer D, Yang Y, Kong S-y, Hua N, Limtiaco N, StJohn R, Constant N, Guajardo-Cespedes M, Yuan S, Tar C+2 more. 2018. Universal sentence encoder for english. In: Proceedings of the 2018 conference on empirical methods in natural language processing: system demonstrations. Brussels, Belgium. Association for Computational Linguistics. 169-174

Chen H, McKeever S, Delany SJ. 2018. A comparison of classical versus deep learning techniques for abusive content detection on social media sites. In: Staab S, Koltsova O, Ignatov DI, eds. Social informatics, Lecture notes in computer science. Cham: Springer International Publishing. 117-133

Chen H, McKeever S, Delany SJ. 2019. The use of deep learning distributed representations in the identification of abusive text. Proceedings of the International AAAI Conference on Web and Social Media 13:125-133

Chung Y-L, Kuzmenko E, Tekiroglu SS, Guerini M. 2019. CONAN - Counter narratives through nichesourcing: a multilingual dataset of responses to fight online hate speech. In: Proceedings of the 57th annual meeting of the association for computational linguistics. Florence, Italy. Association for Computational Linguistics. 2819-2829


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.