Speech to Text Recognition for Videogame Controlling with Convolutional Neural Networks

Joaquin Aguirre-Peralta, Marek Rivas-Zavala, Willy Ugarte

Research output: Chapter in book/report/conference proceeding; Conference contribution; peer-reviewed

3 Citations (Scopus)

Abstract

Disability has been present throughout human history and affects every nation. With communication and interaction through technology now more important than ever, people with disabilities are the most affected, because they face a physical gap. There are still few tools that they can use to interact more easily with different types of hardware; we therefore want to provide a playful and medical tool that adapts to their needs and allows them to interact more with the people around them. In this context, we focus on people with motor disabilities of the upper limbs and propose applying gamification in the area of NLP (Natural Language Processing) by developing a videogame consisting of three voice-operated minigames. This work has four stages: analysis (benchmarking), design, development, and validation. In the first stage, we benchmark the models. In the second stage, we describe the implementation of CNNs (Convolutional Neural Networks), together with methods such as gamification and NLP for problem solving. In the third stage, we describe the minigames that compose the videogame and their characteristics. Finally, in the last stage, the videogame was validated with experts in physiotherapy. Our results show that, with the training performed, the prediction of words with noise improved from 43.49% to 74.50% and of words without noise from 63.87% to 96.36%.

Original language: English
Title of host publication: ICPRAM 2023 - Proceedings of the 12th International Conference on Pattern Recognition Applications and Methods, Volume 1
Editors: Maria De Marsico, Gabriella Sanniti di Baja, Ana L.N. Fred
Publisher: Science and Technology Publications, Lda
Pages: 948-955
Number of pages: 8
ISBN (print): 9789897586262
DOI
Status: Published - 2023
Event: 12th International Conference on Pattern Recognition Applications and Methods, ICPRAM 2023 - Lisbon, Portugal
Duration: 22 Feb 2023 to 24 Feb 2023

Publication series

Name: International Conference on Pattern Recognition Applications and Methods
Volume: 1
ISSN (electronic): 2184-4313

Conference

Conference: 12th International Conference on Pattern Recognition Applications and Methods, ICPRAM 2023
Country/Territory: Portugal
City: Lisbon
Period: 22/02/23 to 24/02/23
