LowResourceNLU at BLP-2023 Task 1 & 2: Enhancing Sentiment Classification and Violence Incitement Detection in Bangla Through Aggregated Language Models
Veeramani, Hariram, Thapa, Surendrabikram, and Naseem, Usman (2023) LowResourceNLU at BLP-2023 Task 1 & 2: Enhancing Sentiment Classification and Violence Incitement Detection in Bangla Through Aggregated Language Models. In: Proceedings of the 1st Workshop on Bangla Language Processing. pp. 273-278. From: BLP 2023: First Workshop on Bangla Language Processing, 7 December 2023, Singapore.
|
PDF (Published Version)
- Published Version
Available under License Creative Commons Attribution. Download (189kB) | Preview |
Abstract
Violence incitement detection and sentiment analysis hold significant importance in the field of natural language processing. However, in the case of the Bangla language, there are unique challenges due to its low-resource nature. In this paper, we address these challenges by presenting an innovative approach that leverages aggregated BERT models for two tasks at the BLP workshop in EMNLP 2023, specifically tailored for Bangla. Task 1 focuses on violence-inciting text detection, while task 2 centers on sentiment analysis. Our approach combines fine-tuning with textual entailment (utilizing BanglaBERT), Masked Language Model (MLM) training (making use of BanglaBERT), and the use of standalone Multilingual BERT. This comprehensive framework significantly enhances the accuracy of sentiment classification and violence incitement detection in Bangla text. Our method achieved the 11th rank in task 1 with an F1-score of 73.47 and the 4th rank in task 2 with an F1-score of 71.73. This paper provides a detailed system description along with an analysis of the impact of each component of our framework.
Item ID: | 82404 |
---|---|
Item Type: | Conference Item (Research - E1) |
ISBN: | 9798891760585 |
Copyright Information: | Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License. |
Date Deposited: | 14 Mar 2024 02:18 |
FoR Codes: | 46 INFORMATION AND COMPUTING SCIENCES > 4602 Artificial intelligence > 460208 Natural language processing @ 100% |
SEO Codes: | 22 INFORMATION AND COMMUNICATION SERVICES > 2204 Information systems, technologies and services > 220403 Artificial intelligence @ 100% |
Downloads: |
Total: 40 Last 12 Months: 12 |
More Statistics |