TY - GEN
T1 - Modelo predictivo para reducir el índice de deserción de estudiantes universitarios en el Perú
T2 - 15th Iberian Conference on Information Systems and Technologies, CISTI 2020
AU - Medina, Erik Cevallos
AU - Chunga, Claudio Barahona
AU - Armas-Aguirre, Jimmy
AU - Grandon, Elizabeth E.
N1 - Publisher Copyright:
© 2020 AISTI.
PY - 2020/6
Y1 - 2020/6
N2 - This research proposes a prediction model that might help reducing the dropout rate of university students in Peru. For this, a three-phase predictive analysis model was designed which was combined with the stages proposed by the IBM SPSS Modeler methodology. Bayesian network techniques was compared with decision trees for their level of accuracy over other algorithms in an Educational Data Mining (EDM) scenario. Data were collected from 500 undergraduate students from a private university in Lima. The results indicate that Bayesian networks behave better than decision trees based on metrics of precision, accuracy, specificity, and error rate. Particularly, the accuracy of Bayesian networks reaches 67.10% while the accuracy for decision trees is 61.92% in the training sample for iteration with 8:2 rate. On the other hand, the variables athletic person (0.30%), own house (0.21%), and high school grades (0.13%) are the ones that contribute most to the prediction model for both Bayesian networks and decision trees.
AB - This research proposes a prediction model that might help reducing the dropout rate of university students in Peru. For this, a three-phase predictive analysis model was designed which was combined with the stages proposed by the IBM SPSS Modeler methodology. Bayesian network techniques was compared with decision trees for their level of accuracy over other algorithms in an Educational Data Mining (EDM) scenario. Data were collected from 500 undergraduate students from a private university in Lima. The results indicate that Bayesian networks behave better than decision trees based on metrics of precision, accuracy, specificity, and error rate. Particularly, the accuracy of Bayesian networks reaches 67.10% while the accuracy for decision trees is 61.92% in the training sample for iteration with 8:2 rate. On the other hand, the variables athletic person (0.30%), own house (0.21%), and high school grades (0.13%) are the ones that contribute most to the prediction model for both Bayesian networks and decision trees.
KW - Bayesian Networks
KW - Decision Trees
KW - Educational Data Mining
KW - predictive analysis
KW - university dropout
UR - https://www.scopus.com/pages/publications/85089025364
U2 - 10.23919/CISTI49556.2020.9141095
DO - 10.23919/CISTI49556.2020.9141095
M3 - Contribución a la conferencia
AN - SCOPUS:85089025364
T3 - Iberian Conference on Information Systems and Technologies, CISTI
BT - Proceedings of CISTI 2020 - 15th Iberian Conference on Information Systems and Technologies
A2 - Rocha, Alvaro
A2 - Perez, Bernabe Escobar
A2 - Penalvo, Francisco Garcia
A2 - del Mar Miras, Maria
A2 - Goncalves, Ramiro
PB - IEEE Computer Society
Y2 - 24 June 2020 through 27 June 2020
ER -