FormalStyler: GPT based Model for Formal Style Transfer based on Formality and Meaning Preservation

Mariano De Rivero, Cristhiam Tirado, Willy Ugarte

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

4 Citas (Scopus)

Resumen

Style transfer is a natural language processing generation task, it consists of substituting one given writing style for another one. In this work, we seek to perform informal-to-formal style transfers in the English language. This process is shown in our web interface where the user input a informal message by text or voice. This project's target audience are students and professionals in the need to improve the quality of their work by formalizing their texts. A style transfer is considered successful when the original semantic meaning of the message is preserved after the independent style has been replaced. This task is hindered by the scarcity of training and evaluation datasets alongside the lack of metrics. To accomplish this task we opted to utilize OpenAI's GPT-2 Transformer-based pre-trained model. To adapt the GPT-2 to our research, we finetuned the model with a parallel corpus containing informal text entries paired with the equivalent formal ones. We evaluate the fine-tuned model results with two specific metrics, formality and meaning preservation. To further fine-tune the model we integrate a human-based feedback system where the user selects the best formal sentence out of the ones generated by the model. The resulting evaluations of our solution exhibit similar to improved scores in formality and meaning preservation to state-of-the-art approaches.

Idioma originalInglés
Título de la publicación alojada13th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2021 as part of IC3K 2021 - Proceedings of the 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management
EditoresRita Cucchiara, Ana Fred, Joaquim Filipe
EditorialScience and Technology Publications, Lda
Páginas48-56
Número de páginas9
ISBN (versión digital)9789897585333
DOI
EstadoPublicada - 2021
Evento13th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2021 as part of 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2021 - Virtual, Online
Duración: 25 oct. 202227 oct. 2022

Serie de la publicación

NombreInternational Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings
Volumen1
ISSN (versión digital)2184-3228

Conferencia

Conferencia13th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2021 as part of 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2021
CiudadVirtual, Online
Período25/10/2227/10/22

Huella

Profundice en los temas de investigación de 'FormalStyler: GPT based Model for Formal Style Transfer based on Formality and Meaning Preservation'. En conjunto forman una huella única.

Citar esto