Semantics Squad at BLP-2023 Task 1: Violence Inciting Bangla Text Detection with Fine-Tuned Transformer-Based Models

Krishno Dey; Prerona Tarannum; Md. Arid Hasan; Francis Palma

doi:10.18653/v1/2023.banglalp-1.28

Abstract

This study investigates the application of Transformer-based models for violence threat identification. We participated in BLP-2023 Shared Task 1; in our initial submission, BanglaBERT large achieved 5th position on the leaderboard with a macro F1 score of 0.7441, approaching the highest baseline of 0.7879 established for this task. In contrast, the top-performing system on the leaderboard achieved an F1 score of 0.7604. Subsequent experiments with m-BERT, XLM-RoBERTa base, XLM-RoBERTa large, BanglishBERT, BanglaBERT, and BanglaBERT large revealed that BanglaBERT achieved an F1 score of 0.7441, which closely approximated the baseline. m-BERT and XLM-RoBERTa base also approximated the baseline, with macro F1 scores of 0.6584 and 0.6968, respectively. A notable finding of our study is the underperformance of the larger models on the shared task dataset, which requires further investigation. Our findings underscore the potential of Transformer-based models for identifying violence threats, offering valuable insights for enhancing safety measures on online platforms.
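The macro F1 score quoted throughout the abstract is the unweighted mean of the per-class F1 scores, so every class counts equally regardless of how many examples it has. The following is a minimal sketch of that computation in plain Python; the function name `macro_f1` and the integer labels in the usage example are illustrative assumptions, not the shared task's official scorer or label set.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in either list.

    Illustrative helper (hypothetical name), not the task's official scorer.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class gets a false positive
            fn[truth] += 1   # true class gets a false negative

    labels = sorted(set(y_true) | set(y_pred))
    per_class = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        per_class.append(2 * tp[label] / denom if denom else 0.0)
    return sum(per_class) / len(per_class)


if __name__ == "__main__":
    # Toy labels (assumed for illustration only): three classes 0, 1, 2.
    gold = [0, 0, 1, 1, 2, 2]
    predicted = [0, 1, 1, 1, 2, 0]
    print(round(macro_f1(gold, predicted), 4))
```

Because the mean is unweighted, a model that does well on a frequent class but poorly on a rare one is penalized more than it would be under accuracy or micro-averaged F1.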

Anthology ID: 2023.banglalp-1.28
Volume: Proceedings of the First Workshop on Bangla Language Processing (BLP-2023)
Month: December
Year: 2023
Address: Singapore
Editors: Firoj Alam, Sudipta Kar, Shammur Absar Chowdhury, Farig Sadeque, Ruhul Amin
Venue: BanglaLP
Publisher: Association for Computational Linguistics
Pages: 225–229
URL: https://aclanthology.org/2023.banglalp-1.28
DOI: 10.18653/v1/2023.banglalp-1.28
Cite (ACL): Krishno Dey, Prerona Tarannum, Md. Arid Hasan, and Francis Palma. 2023. Semantics Squad at BLP-2023 Task 1: Violence Inciting Bangla Text Detection with Fine-Tuned Transformer-Based Models. In Proceedings of the First Workshop on Bangla Language Processing (BLP-2023), pages 225–229, Singapore. Association for Computational Linguistics.
Cite (Informal): Semantics Squad at BLP-2023 Task 1: Violence Inciting Bangla Text Detection with Fine-Tuned Transformer-Based Models (Dey et al., BanglaLP 2023)
PDF: https://aclanthology.org/2023.banglalp-1.28.pdf