TY - GEN
T1 - A Whisper and BETO-Bert Based Web Application for Classification of Emergency Calls
AU - Rojas, Anthony
AU - Calcin, Kevin
AU - Castaneda, Pedro
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - This study presents a comprehensive web-based solution for the classification of emergency calls using state-of-the-art deep learning models, aimed at enhancing the accuracy and speed of emergency response systems. The primary aim is to categorize emergencies more effectively, particularly those related to safety, enabling rapid identification and response in call centers. The methodology is structured into key phases: first, the transcription phase employs the Whisper model for precise speech-to-text conversion; then, data preprocessing ensures the removal of irrelevant characters, numerical data, and common phrases to refine the input. In the translation phase, careful attention is given to maintaining linguistic consistency between English and Spanish. During the segmentation phase, tokenization and attention masking are applied to enhance text structure. Finally, the classification phase utilizes the BETO model-a BERT variant fine-tuned for Spanish-to classify calls into specific emergency types, including 'Accident,' 'Crime,' and 'Violence.' The proposed solution achieved a classification accuracy of 95.7%, supported by a learning rate optimization process.
AB - This study presents a comprehensive web-based solution for the classification of emergency calls using state-of-the-art deep learning models, aimed at enhancing the accuracy and speed of emergency response systems. The primary aim is to categorize emergencies more effectively, particularly those related to safety, enabling rapid identification and response in call centers. The methodology is structured into key phases: first, the transcription phase employs the Whisper model for precise speech-to-text conversion; then, data preprocessing ensures the removal of irrelevant characters, numerical data, and common phrases to refine the input. In the translation phase, careful attention is given to maintaining linguistic consistency between English and Spanish. During the segmentation phase, tokenization and attention masking are applied to enhance text structure. Finally, the classification phase utilizes the BETO model-a BERT variant fine-tuned for Spanish-to classify calls into specific emergency types, including 'Accident,' 'Crime,' and 'Violence.' The proposed solution achieved a classification accuracy of 95.7%, supported by a learning rate optimization process.
KW - BETO
KW - Whisper
KW - classification
KW - deep learning
KW - diarization
KW - emergency calls
KW - safety
UR - https://www.scopus.com/pages/publications/105010757390
U2 - 10.1109/ICICT64582.2025.00043
DO - 10.1109/ICICT64582.2025.00043
M3 - Contribución a la conferencia
AN - SCOPUS:105010757390
T3 - Proceedings - 2025 8th International Conference on Information and Computer Technologies, ICICT 2025
SP - 236
EP - 241
BT - Proceedings - 2025 8th International Conference on Information and Computer Technologies, ICICT 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 8th International Conference on Information and Computer Technologies, ICICT 2025
Y2 - 14 March 2025 through 16 March 2025
ER -