Skip to main navigation Skip to search Skip to main content

Story visualization using image-text matching architecture for digital storytelling

  • Arian Yturrizaga-Aguirre
  • , Camilo Silva-Olivares
  • , Willy Ugarte
  • Universidad Peruana de Ciencias Aplicadas

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Currently, the techniques for generating images from text used to visualize stories have serious limitations in terms of image quality, which prevents quantifying their impact in real life scenarios. An example of this occurs in the field of education, where digital storytelling is used as a tool to incite teaching. For this reason, we propose to design a web interface that allows primary school children to write a short story and obtain, as a result, a sequence of coherent and representative images of said content, emulating a conventional process of educational digital storytelling. We describe the use of an Image-text matching architecture based on NLP and Image Retrieval for the story visualization task focused on digital storytelling. To evaluate the performance of the architecture, the quantitative metrics: WuPalmer and cosine similarity were used, in addition to qualitative metrics.

Original languageEnglish
Title of host publicationProceedings of the 2022 IEEE Engineering International Research Conference, EIRCON 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665450829
DOIs
StatePublished - 2022
Event2022 IEEE Engineering International Research Conference, EIRCON 2022 - Lima, Peru
Duration: 26 Oct 202228 Oct 2022

Publication series

NameProceedings of the 2022 IEEE Engineering International Research Conference, EIRCON 2022

Conference

Conference2022 IEEE Engineering International Research Conference, EIRCON 2022
Country/TerritoryPeru
CityLima
Period26/10/2228/10/22

Keywords

  • Deep Learning
  • Story Generative model

Fingerprint

Dive into the research topics of 'Story visualization using image-text matching architecture for digital storytelling'. Together they form a unique fingerprint.

Cite this