Modelo predictivo para reducir el índice de deserción de estudiantes universitarios en el Perú: Redes Bayesianas vs. Árboles de Decisión

Translated title of the contribution: Predictive model to reduce the dropout rate of university students in Perú: Bayesian Networks vs. Decision Trees

Erik Cevallos Medina, Claudio Barahona Chunga, Jimmy Armas-Aguirre, Elizabeth E. Grandon

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Scopus citations

Abstract

This research proposes a prediction model that might help reducing the dropout rate of university students in Peru. For this, a three-phase predictive analysis model was designed which was combined with the stages proposed by the IBM SPSS Modeler methodology. Bayesian network techniques was compared with decision trees for their level of accuracy over other algorithms in an Educational Data Mining (EDM) scenario. Data were collected from 500 undergraduate students from a private university in Lima. The results indicate that Bayesian networks behave better than decision trees based on metrics of precision, accuracy, specificity, and error rate. Particularly, the accuracy of Bayesian networks reaches 67.10% while the accuracy for decision trees is 61.92% in the training sample for iteration with 8:2 rate. On the other hand, the variables athletic person (0.30%), own house (0.21%), and high school grades (0.13%) are the ones that contribute most to the prediction model for both Bayesian networks and decision trees.

Translated title of the contributionPredictive model to reduce the dropout rate of university students in Perú: Bayesian Networks vs. Decision Trees
Original languageSpanish
Title of host publicationProceedings of CISTI 2020 - 15th Iberian Conference on Information Systems and Technologies
EditorsAlvaro Rocha, Bernabe Escobar Perez, Francisco Garcia Penalvo, Maria del Mar Miras, Ramiro Goncalves
PublisherIEEE Computer Society
ISBN (Electronic)9789895465903
DOIs
StatePublished - Jun 2020
Event15th Iberian Conference on Information Systems and Technologies, CISTI 2020 - Seville, Spain
Duration: 24 Jun 202027 Jun 2020

Publication series

NameIberian Conference on Information Systems and Technologies, CISTI
Volume2020-June
ISSN (Print)2166-0727
ISSN (Electronic)2166-0735

Conference

Conference15th Iberian Conference on Information Systems and Technologies, CISTI 2020
Country/TerritorySpain
CitySeville
Period24/06/2027/06/20

Fingerprint

Dive into the research topics of 'Predictive model to reduce the dropout rate of university students in Perú: Bayesian Networks vs. Decision Trees'. Together they form a unique fingerprint.

Cite this