Skip to main navigation Skip to search Skip to main content

Exploring Deep Neural Networks and Decision Tree for Spanish Text Classification

  • Pedro Shiguihara
  • , Lilian Berton
  • obtuvo un doctorado en la de Maryland y realizó un postdoctorado de la Universidad de Toronto. Es docente-investigador en la Universidad San Ignacio de Loyola
  • Universidade Federal de São Paulo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Nowadays, huge amounts of information are available on social networks, blogs, websites, and digital libraries. Most of this information is in unstructured text format, so text mining approaches have become increasingly studied to process all this data. Text classification aims to automatically classify documents into predetermined categories, applying machine learning (ML) algorithms. In this paper, we collected a dataset set related to reviews of a food store in Peru and compared different vectorization models, such as Term Frequency Inverse Document Frequency (TF-IDF), Bag of Words (BoW), and classification algorithms, such as traditional ML classifiers SVM, Decision Tree, MLP, KNN, Naive Bayes and a recent approach "deep jointly informed neural networks"(DJINN) that initialize deep feedforward neural networks based on decision trees. The results show DJINN gets a F1-score higher than traditional ML, being a promising technique for text classification.

Original languageEnglish
Title of host publicationProceedings of the 2022 IEEE 29th International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665486361
DOIs
StatePublished - 2022
Externally publishedYes
Event29th IEEE International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2022 - Lima, Peru
Duration: 11 Aug 202213 Aug 2022

Publication series

NameProceedings of the 2022 IEEE 29th International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2022

Conference

Conference29th IEEE International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2022
Country/TerritoryPeru
CityLima
Period11/08/2213/08/22

Keywords

  • Classification
  • Machine Learning
  • Text mining

Fingerprint

Dive into the research topics of 'Exploring Deep Neural Networks and Decision Tree for Spanish Text Classification'. Together they form a unique fingerprint.

Cite this