Skip to main navigation Skip to search Skip to main content

NoHateS: A Transformers-based Approach for Real-Time Hate Speech Detection in Spanish

  • Alessandro Carhuancho-Bazan
  • , Sergio Nunez-Lazo
  • , Willy Ugarte
  • Universidad Peruana de Ciencias Aplicadas

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Hate speech detection is a challenging task, especially in the context of real-time monitoring on the internet. Manual detection is both exhausting and impractical due to the high volume and frequency of online data. This paper proposes a system called NoHateS. This system is made of multiple components, the main one is BETO-CNN, a Transformers-based model trained on a Spanish corpus, which is designed to actually detect whether a text contains hate speech or not. The second component is developed to ensure accessibility. This includes an API to allow seamless integration of the model into various applications, and a Discord Bot developed for easy manipulation of the aforementioned API in order to help users detect hate speech in text channels. This paper also includes tests with imbalanced data and applies data augmentation in order to deal with it and make more robust models. The results demonstrate the effectiveness of NoHateS in detecting hate speech and provide recommendations for future research in this domain as it achieves 72.63% and 72.94% F1-score on the non-augmented and augmented dataset respectively.

Original languageEnglish
Title of host publicationProceedings of the 2023 IEEE 30th International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350315578
DOIs
StatePublished - 2023
Event30th IEEE International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2023 - Lima, Peru
Duration: 2 Nov 20234 Nov 2023

Publication series

NameProceedings of the 2023 IEEE 30th International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2023

Conference

Conference30th IEEE International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2023
Country/TerritoryPeru
CityLima
Period2/11/234/11/23

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • BERT
  • BETO
  • Hate speech
  • Transformer

Fingerprint

Dive into the research topics of 'NoHateS: A Transformers-based Approach for Real-Time Hate Speech Detection in Spanish'. Together they form a unique fingerprint.

Cite this