Skip to main navigation Skip to search Skip to main content

FormalStyler: GPT based Model for Formal Style Transfer based on Formality and Meaning Preservation

  • Universidad Peruana de Ciencias Aplicadas

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Style transfer is a natural language processing generation task, it consists of substituting one given writing style for another one. In this work, we seek to perform informal-to-formal style transfers in the English language. This process is shown in our web interface where the user input a informal message by text or voice. This project's target audience are students and professionals in the need to improve the quality of their work by formalizing their texts. A style transfer is considered successful when the original semantic meaning of the message is preserved after the independent style has been replaced. This task is hindered by the scarcity of training and evaluation datasets alongside the lack of metrics. To accomplish this task we opted to utilize OpenAI's GPT-2 Transformer-based pre-trained model. To adapt the GPT-2 to our research, we finetuned the model with a parallel corpus containing informal text entries paired with the equivalent formal ones. We evaluate the fine-tuned model results with two specific metrics, formality and meaning preservation. To further fine-tune the model we integrate a human-based feedback system where the user selects the best formal sentence out of the ones generated by the model. The resulting evaluations of our solution exhibit similar to improved scores in formality and meaning preservation to state-of-the-art approaches.

Original languageEnglish
Title of host publication13th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2021 as part of IC3K 2021 - Proceedings of the 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management
EditorsRita Cucchiara, Ana Fred, Joaquim Filipe
PublisherScience and Technology Publications, Lda
Pages48-56
Number of pages9
ISBN (Electronic)9789897585333
DOIs
StatePublished - 2021
Event13th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2021 as part of 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2021 - Virtual, Online
Duration: 25 Oct 202227 Oct 2022

Publication series

NameInternational Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings
Volume1
ISSN (Electronic)2184-3228

Conference

Conference13th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2021 as part of 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2021
CityVirtual, Online
Period25/10/2227/10/22

Keywords

  • Formalization
  • GPT-2
  • Meaning Preservation
  • Natural Language Processing
  • Style Transfer
  • Transformer

Fingerprint

Dive into the research topics of 'FormalStyler: GPT based Model for Formal Style Transfer based on Formality and Meaning Preservation'. Together they form a unique fingerprint.

Cite this