Research Article

TRI-EMBEDDINGS: A NOVEL APPROACH FOR DETECTING ABUSIVE LANGUAGE ON SOCIAL MEDIA

Volume: 29 Number: 1 March 3, 2026
TR EN

TRI-EMBEDDINGS: A NOVEL APPROACH FOR DETECTING ABUSIVE LANGUAGE ON SOCIAL MEDIA

Abstract

The rise in global digital communication has led to an increase in hate speech and offensive language, posing significant threats to societal well-being. While AI presents cybersecurity challenges, it also plays a crucial role in addressing these issues. Researchers must develop AI with a multidisciplinary approach, mitigating algorithmic misuse and ensuring cybersecurity. This study introduces Tri-Embeddings, an innovative method for detecting abusive language using AI-powered text analysis, applied to Twitter data. The method combines pre-trained models such as Word2Vec, FastText, and Universal Sentence Encoder (USE). Additionally, this study explores the impact of integrating a large language model, DistilBERT, into the proposed unified embedding framework. The findings demonstrate the method’s effectiveness, with high precision, recall, and F1 scores, showing its potential to reduce the spread of offensive and hateful language. This approach helps mitigate ethical breaches and creates a safer, more inclusive online space.

Keywords

References

  1. Al-Saqqa, S., & Awajan, A. (2019). The use of word2vec model in sentiment analysis: A survey. Proceedings of the 2019 International Conference on Artificial Intelligence, Robotics and Control, 39–43. https://doi.org/10.1145/3388218.3388229
  2. Barbieri, F., Camacho-Collados, J., Neves, L., & Espinosa-Anke, L. (2020). TweetEval: Unified benchmark and comparative evaluation for tweet classification. ArXiv Preprint ArXiv:2010.12421. https://doi.org/10.18653/v1/2020.findings-emnlp.148
  3. Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2017). Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5, 135–146. https://doi.org/10.1162/tacl_a_00051
  4. Cécillon, N., Labatut, V., Dufour, R., & Linares, G. (2021). Graph embeddings for abusive language detection. SN Computer Science, 2, 1–15. https://doi.org/10.1007/s42979-020-00413-7
  5. Cer, D., Yang, Y., Kong, S., Hua, N., Limtiaco, N., John, R. S., Constant, N., Guajardo-Cespedes, M., Yuan, S., & Tar, C. (2018). Universal sentence encoder. ArXiv Preprint ArXiv:1803.11175. https://doi.org/10.18280/ria.350404
  6. Davidson, T., Bhattacharya, D., & Weber, I. (2019a). Racial bias in hate speech and abusive language detection datasets. ArXiv Preprint ArXiv:1905.12516. https://doi.org/10.18653/v1/w19-3504
  7. Davidson, T., Warmsley, D., Macy, M., & Weber, I. (2017). Automated hate speech detection and the problem of offensive language. Proceedings of the International AAAI Conference on Web and Social Media, 11(1), 512–515. https://doi.org/10.1609/icwsm.v11i1.14955
  8. Dorris, W., Hu, R., Vishwamitra, N., Luo, F., & Costello, M. (2020). Towards automatic detection and explanation of hate speech and offensive language. Proceedings of the Sixth International Workshop on Security and Privacy Analytics, 23–29. https://doi.org/10.1145/3375708.3380312

Details

Primary Language

English

Subjects

Natural Language Processing

Journal Section

Research Article

Publication Date

March 3, 2026

Submission Date

April 18, 2025

Acceptance Date

December 5, 2025

Published in Issue

Year 2026 Volume: 29 Number: 1

APA
Bakır, R. (2026). TRI-EMBEDDINGS: A NOVEL APPROACH FOR DETECTING ABUSIVE LANGUAGE ON SOCIAL MEDIA. Kahramanmaraş Sütçü İmam Üniversitesi Mühendislik Bilimleri Dergisi, 29(1), 278-294. https://doi.org/10.17780/ksujes.1679361