NEHATE: Large-Scale Annotated Data Shedding Light on Hate Speech in Nepali Local Election Discourse
Thapa, Surendrabikram, Rauniyar, Kritesh, Shiwakoti, Shuvam, Poudel, Sweta, Naseem, Usman, and Nasim, Mehwish (2023) NEHATE: Large-Scale Annotated Data Shedding Light on Hate Speech in Nepali Local Election Discourse. In: Frontiers in Artificial Intelligence and Applications (372) pp. 2346-2353. From: ECAI 2020: 26th European Conference on Artificial Intelligence, 30 September - 4 October 2023, Kraków, Poland.
|
PDF (Published Version)
- Published Version
Available under License Creative Commons Attribution Non-commercial. Download (1MB) | Preview |
Abstract
The use of social media during election campaigns has become increasingly popular. However, the unbridled nature of online discourse can lead to the propagation of hate speech, which has far-reaching implications for the democratic process. Natural Language Processing (NLP) techniques are being used to counteract the spread of hate speech and promote healthy online discourse. Despite the increasing need for NLP techniques to combat hate speech, research on low-resource languages such as Nepali is limited, posing a challenge to the realization of the United Nations' Leave No One Behind principle, which calls for inclusive development that benefits all individuals and communities, regardless of their backgrounds or circumstances. To bridge this gap, we introduce NEHATE, a large-scale manually annotated dataset of hate speech and its targets in Nepali local election discourse. The dataset comprises 13,505 tweets, annotated for hate speech with further sub-categorization of hate speech into targets such as community, individual, and organization. Benchmarking of the dataset with various algorithms has shown potential for performance improvement. We have made the dataset publicly available at https://github.com/shucoll/NEHate to promote further research and development, while also contributing to the UN SDGs aimed at fostering peaceful, inclusive societies, and justice and strong institutions.
Item ID: | 81075 |
---|---|
Item Type: | Conference Item (Research - E1) |
ISBN: | 9781643684369 |
Copyright Information: | © 2023 The Authors. This article is published online with Open Access by IOS Press and distributed under the terms of the Creative Commons Attribution Non-Commercial License 4.0 (CC BY-NC 4.0). |
Date Deposited: | 21 Nov 2023 01:36 |
FoR Codes: | 46 INFORMATION AND COMPUTING SCIENCES > 4602 Artificial intelligence > 460208 Natural language processing @ 100% |
SEO Codes: | 22 INFORMATION AND COMMUNICATION SERVICES > 2204 Information systems, technologies and services > 220402 Applied computing @ 100% |
Downloads: |
Total: 140 Last 12 Months: 16 |
More Statistics |