Open Access
Data Science and Management, volume 8, issue 1, pages 72-85
A Model for Predicting Dropout of Higher Education Students
Anaíle Mendes Rabelo
1
,
Luis A. Zarate
1
1
Department of Computer Science, Applied Computational Intelligence Laboratory-LICAP, Pontifical Catholic University of Minas Gerais, Av. Dom José Gaspar, 500, Prédio 20, Coração Eucarístico, Belo Horizonte, Minas Gerais, Brazil
Publication type: Journal Article
Publication date: 2025-03-01
Journal:
Data Science and Management
scimago Q1
SJR: 1.432
CiteScore: 7.5
Impact factor: —
ISSN: 26667649
Abstract
Higher education institutions are becoming increasingly concerned with the retention of their students. This work is motivated by the interest in predicting and reducing student dropout, and consequently in reducing the financial loss of said institutions. From the characterization of the dropout problem, and application of a knowledge discovery process, a model (ensemble) to improve dropout prediction is proposed. The ensemble model combines the results of three models: Logistic Regression, Neural Networks, and Decision Tree. As a result, the model can correctly classify 89% of the students (as enrolled and dropped) and accurately identify 98.1% dropouts. When the proposed model is compared with the ensemble method, Random Forest, the proposed model presented desirable characteristics to assist the management in proposing actions to retain students.
Found
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.