Araştırma Makalesi

TRI-EMBEDDINGS: A NOVEL APPROACH FOR DETECTING ABUSIVE LANGUAGE ON SOCIAL MEDIA

Cilt: 29 Sayı: 1 3 Mart 2026
PDF İndir
TR EN

TRI-EMBEDDINGS: A NOVEL APPROACH FOR DETECTING ABUSIVE LANGUAGE ON SOCIAL MEDIA

Abstract

The rise in global digital communication has led to an increase in hate speech and offensive language, posing significant threats to societal well-being. While AI presents cybersecurity challenges, it also plays a crucial role in addressing these issues. Researchers must develop AI with a multidisciplinary approach, mitigating algorithmic misuse and ensuring cybersecurity. This study introduces Tri-Embeddings, an innovative method for detecting abusive language using AI-powered text analysis, applied to Twitter data. The method combines pre-trained models such as Word2Vec, FastText, and Universal Sentence Encoder (USE). Additionally, this study explores the impact of integrating a large language model, DistilBERT, into the proposed unified embedding framework. The findings demonstrate the method’s effectiveness, with high precision, recall, and F1 scores, showing its potential to reduce the spread of offensive and hateful language. This approach helps mitigate ethical breaches and creates a safer, more inclusive online space.

Keywords

Kaynakça

  1. Al-Saqqa, S., & Awajan, A. (2019). The use of word2vec model in sentiment analysis: A survey. Proceedings of the 2019 International Conference on Artificial Intelligence, Robotics and Control, 39–43. https://doi.org/10.1145/3388218.3388229
  2. Barbieri, F., Camacho-Collados, J., Neves, L., & Espinosa-Anke, L. (2020). TweetEval: Unified benchmark and comparative evaluation for tweet classification. ArXiv Preprint ArXiv:2010.12421. https://doi.org/10.18653/v1/2020.findings-emnlp.148
  3. Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2017). Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5, 135–146. https://doi.org/10.1162/tacl_a_00051
  4. Cécillon, N., Labatut, V., Dufour, R., & Linares, G. (2021). Graph embeddings for abusive language detection. SN Computer Science, 2, 1–15. https://doi.org/10.1007/s42979-020-00413-7
  5. Cer, D., Yang, Y., Kong, S., Hua, N., Limtiaco, N., John, R. S., Constant, N., Guajardo-Cespedes, M., Yuan, S., & Tar, C. (2018). Universal sentence encoder. ArXiv Preprint ArXiv:1803.11175. https://doi.org/10.18280/ria.350404
  6. Davidson, T., Bhattacharya, D., & Weber, I. (2019a). Racial bias in hate speech and abusive language detection datasets. ArXiv Preprint ArXiv:1905.12516. https://doi.org/10.18653/v1/w19-3504
  7. Davidson, T., Warmsley, D., Macy, M., & Weber, I. (2017). Automated hate speech detection and the problem of offensive language. Proceedings of the International AAAI Conference on Web and Social Media, 11(1), 512–515. https://doi.org/10.1609/icwsm.v11i1.14955
  8. Dorris, W., Hu, R., Vishwamitra, N., Luo, F., & Costello, M. (2020). Towards automatic detection and explanation of hate speech and offensive language. Proceedings of the Sixth International Workshop on Security and Privacy Analytics, 23–29. https://doi.org/10.1145/3375708.3380312

Ayrıntılar

Birincil Dil

İngilizce

Konular

Doğal Dil İşleme

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

3 Mart 2026

Gönderilme Tarihi

18 Nisan 2025

Kabul Tarihi

5 Aralık 2025

Yayımlandığı Sayı

Yıl 2026 Cilt: 29 Sayı: 1

Kaynak Göster

APA
Bakır, R. (2026). TRI-EMBEDDINGS: A NOVEL APPROACH FOR DETECTING ABUSIVE LANGUAGE ON SOCIAL MEDIA. Kahramanmaraş Sütçü İmam Üniversitesi Mühendislik Bilimleri Dergisi, 29(1), 278-294. https://doi.org/10.17780/ksujes.1679361