Abstract
Hate speech detection is a challenging task, especially in the context of real-time monitoring on the internet. Manual detection is both exhausting and impractical due to the high volume and frequency of online data. This paper proposes a system called NoHateS. This system is made of multiple components, the main one is BETO-CNN, a Transformers-based model trained on a Spanish corpus, which is designed to actually detect whether a text contains hate speech or not. The second component is developed to ensure accessibility. This includes an API to allow seamless integration of the model into various applications, and a Discord Bot developed for easy manipulation of the aforementioned API in order to help users detect hate speech in text channels. This paper also includes tests with imbalanced data and applies data augmentation in order to deal with it and make more robust models. The results demonstrate the effectiveness of NoHateS in detecting hate speech and provide recommendations for future research in this domain as it achieves 72.63% and 72.94% F1-score on the non-augmented and augmented dataset respectively.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2023 IEEE 30th International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2023 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9798350315578 |
| DOIs | |
| State | Published - 2023 |
| Event | 30th IEEE International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2023 - Lima, Peru Duration: 2 Nov 2023 → 4 Nov 2023 |
Publication series
| Name | Proceedings of the 2023 IEEE 30th International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2023 |
|---|
Conference
| Conference | 30th IEEE International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2023 |
|---|---|
| Country/Territory | Peru |
| City | Lima |
| Period | 2/11/23 → 4/11/23 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- BERT
- BETO
- Hate speech
- Transformer
Fingerprint
Dive into the research topics of 'NoHateS: A Transformers-based Approach for Real-Time Hate Speech Detection in Spanish'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver