Early detection of university students with potential difficulties

作者:

Highlights:

• A decision support system to identify at registration time students who are at-risk.

• Design of algorithms to improve classification accuracy.

• Comparison of several improved machine learning algorithms on a real case-study.

• What-if analysis to increase student success and retention.

摘要

Using data mining methods, this paper presents a new means of identifying freshmen's profiles likely to face major difficulties to complete their first academic year. Academic failure is a relevant issue at a time when post-secondary education is ever more critical to economic success. We aim at early detection of potential failure using student data available at registration, i.e. school records and environmental factors, with a view to timely and efficient remediation and/or study reorientation. We adapt three data mining methods, namely random forest, logistic regression and artificial neural network algorithms. We design algorithms to increase the accuracy of the prediction when some classes are of major interest. These algorithms are context independent and can be used in different fields. Real data pertaining to undergraduates at the University of Liège (Belgium), illustrates our methodology.

论文关键词:Student attrition,Machine learning,Prediction,Classification,Accuracy,Remediation

论文评审过程:Received 17 May 2016, Revised 4 May 2017, Accepted 4 May 2017, Available online 7 May 2017, Version of Record 19 August 2017.

论文官网地址:https://doi.org/10.1016/j.dss.2017.05.003