Skip to main content

Research Repository

Advanced Search

Towards a Cyberbullying Detection Approach: Fine-Tuned Contrastive Self- Supervised Learning for Data Augmentation

Alharigy, Lulwah; Alnuaim, Hana; Moradpoor, Naghmeh; Tan, Thomas

Authors

Lulwah Alharigy

Hana Alnuaim



Abstract

Cyberbullying on social media platforms is pervasive and challenging to detect due to linguistic subtleties and the need for extensive data annotation. We introduce a Deep Contrastive Self-Supervised Learning (DCSSL) model that integrates a Natural Language Inference (NLI) dataset, a fine-tuned sentence encoder, and data augmentation to enhance the understanding of cyberbullying's nuanced semantics and offensiveness. The DCSSL model effectively captures contextual dependencies and the varied semantic implications inherent in cyberbullying instances, addressing the limitations of manual data annotation processes when compared against established models such as BERT and Bi-LSTM. Our proposed model registers a significant improvement, achieving a macro average F1 score of 0.9231 on cyberbullying datasets, highlighting its applicability in environments where manual annotation is impractical or unavailable.

Citation

Alharigy, L., Alnuaim, H., Moradpoor, N., & Tan, T. (online). Towards a Cyberbullying Detection Approach: Fine-Tuned Contrastive Self- Supervised Learning for Data Augmentation. International Journal of Data Science and Analytics, https://doi.org/10.1007/s41060-024-00607-9

Journal Article Type Article
Acceptance Date Jul 3, 2024
Online Publication Date Jul 17, 2024
Deposit Date Jul 5, 2024
Publicly Available Date Jul 18, 2024
Print ISSN 2364-415X
Electronic ISSN 2364-4168
Publisher Springer
Peer Reviewed Peer Reviewed
DOI https://doi.org/10.1007/s41060-024-00607-9
Keywords Cyberbullying Detection, Deep Contrastive Self-Supervised Learning, Data Augmentation, Natural Language Inference, Offensive Content Detection

Files





You might also like



Downloadable Citations