journal article Open Access Dec 13, 2024

Predictive Models for Educational Purposes: A Systematic Review

View at Publisher Save 10.3390/bdcc8120187
Abstract
This systematic literature review evaluates predictive models in education, focusing on their role in forecasting student performance, identifying at-risk students, and personalising learning experiences. The review compares the effectiveness of machine learning (ML) algorithms such as Support Vector Machines (SVMs), Artificial Neural Networks (ANNs), and Decision Trees with traditional statistical models, assessing their ability to manage complex educational data and improve decision-making. The search, conducted across databases including ScienceDirect, IEEE Xplore, ACM Digital Library, and Google Scholar, yielded 400 records. After screening and removing duplicates, 124 studies were included in the final review. The findings show that ML algorithms consistently outperform traditional models due to their capacity to handle large, non-linear datasets and continuously enhance predictive accuracy as new patterns emerge. These models effectively incorporate socio-economic, demographic, and academic data, making them valuable tools for improving student retention and performance. However, the review also identifies key challenges, including the risk of perpetuating biases present in historical data, issues of transparency, and the complexity of interpreting AI-driven decisions. In addition, reliance on varying data processing methods across studies reduces the generalisability of current models. Future research should focus on developing more transparent, interpretable, and equitable models while standardising data collection and incorporating non-traditional variables, such as cognitive and motivational factors. Ensuring transparency and ethical standards in handling student data is essential for fostering trust in AI-driven models.
Topics

No keywords indexed for this article. Browse by subject →

References
125
[1]
Mimar "ComPRePS: An automated cloud-based image analysis tool to democratize AI in digital pathology" Proc. SPIE Int. Soc. Opt. Eng. (2024)
[2]
Sokkhey, P., and Okazaki, T. (2020). Hybrid machine learning algorithms for predicting academic performance. Int. J. Adv. Comput. Sci. Appl., 11. 10.14569/ijacsa.2020.0110104
[3]
Sunar "The artificial bee colony algorithm in training artificial neural network for oil spill detection" Neural Netw. World (2011) 10.14311/nnw.2011.21.028
[4]
Liu, J., Li, L., and Ye, H. (2023, January 29–31). A prediction model combining convolutional neural network and LSTM neural network. Proceedings of the 2023 2nd International Conference on Artificial Intelligence and Autonomous Robot Systems (AIARS), Bristol, UK. 10.1109/aiars59518.2023.00071
[5]
Bala, P. (2021). Predictive modelling for future technology development. Handbook of Research on Future Opportunities for Technology Management Education, IGI Global. 10.4018/978-1-7998-8327-2.ch027
[6]
Pencina "Prediction models—Development, evaluation, and clinical application" N. Engl. J. Med. (2020) 10.1056/nejmp2000589
[7]
Ranstam "Clinical prediction models" Br. J. Surg. (2016) 10.1002/bjs.10242
[8]
Chen "A comparative study on the results of College English Grade 4 based on multi-model prediction" J. Electr. Syst. (2024) 10.52783/jes.2660
[9]
Alamri "Explainable student performance prediction models: A systematic review" IEEE Access (2021) 10.1109/access.2021.3061368
[10]
Zeineddine "Enhancing prediction of student success: Automated machine learning approach" Comput. Electr. Eng. (2021) 10.1016/j.compeleceng.2020.106903
[11]
Waheed "Predicting academic performance of students from VLE big data using deep learning models" Comput. Hum. Behav. (2020) 10.1016/j.chb.2019.106189
[12]
Yang "Machine learning-based student modeling methodology for intelligent tutoring systems" J. Educ. Comput. Res. (2021) 10.1177/0735633120986256
[13]
Yin, C., Tang, D., Zhang, F., Tang, Q., Feng, Y., and He, Z. (2023). Students learning performance prediction based on feature extraction algorithm and attention-based bidirectional gated recurrent unit network. PLoS ONE, 18. 10.1371/journal.pone.0286156
[14]
Fahim "Hybrid LSTM self-attention mechanism model for forecasting the reform of scientific research in Morocco" Comput. Intell. Neurosci. (2021) 10.1155/2021/6689204
[15]
Hooshyar, M., Pedaste, M., and Yang, Y. (2019). Mining educational data to predict students’ performance through procrastination behavior. Entropy, 22. 10.3390/e22010012
[16]
Systematic literature reviews in software engineering – A systematic literature review

Barbara Kitchenham, O. Pearl Brereton, David Budgen et al.

Information and Software Technology 2009 10.1016/j.infsof.2008.09.009
[17]
Ramaswami, M., and Bhaskaran, R. (2010). A CHAID based performance prediction model in educational data mining. arXiv.
[18]
Ojajuni, O., Ayeni, F., Akodu, O., Ekanoye, F., Adewole, S., Ayo, T., Misra, S., and Mbarika, V. (2021). Predicting student academic performance using machine learning. Computational Science and Its Applications—ICCSA 2021, Springer. 10.1007/978-3-030-87013-3_36
[19]
Maheshwari, A., Malhotra, B.S., Hada, M., Ranka, M., and Basha, M.S.A. (2024, January 3–4). Comparative analysis of machine learning models in predicting academic outcomes: Insights and implications for educational data analytics. Proceedings of the 2024 International Conference on Smart Systems for Applications in Electrical Sciences (ICSSES), Tumakuru, India. 10.1109/icsses62373.2024.10561260
[20]
Shabnam Ara, S.J., Tanuja, R., and Manjula, S.H. (2023, January 24–26). Regression-driven predictive model to estimate learners’ performance through multisource data. Proceedings of the 2023 2nd International Conference on Futuristic Technologies (INCOFT), Belagavi, India.
[21]
Hamoud "A prediction model based on machine learning algorithms with feature selection approaches over imbalanced dataset" Indones. J. Electr. Eng. Comput. Sci. (2022)
[22]
Harvey, J.L., and Kumar, S.A.P. (2019, January 6–9). A practical model for educators to predict student performance in K-12 education using machine learning. Proceedings of the 2019 IEEE Symposium Series on Computational Intelligence (SSCI), Xiamen, China. 10.1109/ssci44817.2019.9003147
[23]
Kaplan "Bayesian probabilistic forecasting with large-scale educational trend data: A case study using NAEP" Large-Scale Assess. Educ. (2021) 10.1186/s40536-021-00108-2
[24]
Lockwood "Improving accuracy and stability of aggregate student growth measures using empirical best linear prediction" J. Educ. Behav. Stat. (2022) 10.3102/10769986221101624
[25]
Sekeroglu, B., Abiyev, R., Ilhan, A., Arslan, M., and Idoko, J.B. (2021). Systematic literature review on machine learning and student performance prediction: Critical gaps and possible remedies. Appl. Sci., 11. 10.3390/app112210907
[26]
Artificial intelligence in education: Addressing ethical challenges in K-12 settings

Selin Akgun, Christine Greenhow

AI and Ethics 2022 10.1007/s43681-021-00096-7
[27]
Bai "Educational big data: Predictions, applications and challenges" Big Data Res. (2021) 10.1016/j.bdr.2021.100270
[28]
Acharya "Early prediction of students performance using machine learning techniques" Int. J. Comput. Appl. (2014)
[29]
Adejo "Predicting student academic performance using multi-model heterogeneous ensemble approach" J. Appl. Res. High. Educ. (2018) 10.1108/jarhe-09-2017-0113
[30]
Alkhasawneh "Developing a hybrid model to predict student first year retention in STEM disciplines using machine learning techniques" J. STEM Educ. Innov. Res. (2014)
[31]
Altabrawee "Predicting students’ performance using machine learning techniques" J. Univ. Babylon Pure Appl. Sci. (2019)
[32]
Miraz, M., Excell, P., Ware, A., Soomro, S., and Ali, M. (2019). Accuracy comparison of machine learning algorithms for predictive analytics in higher education. Emerging Technologies in Computing, Springer. 10.1007/978-3-030-23943-5
[33]
Brooks, C.A., Thompson, C., and Teasley, S.D. (2014, January 24–28). Towards a general method for building predictive models of learner success using educational time series data. Proceedings of the LAK Workshops, Indianapolis, IN, USA.
[34]
Chitti, M., Chitti, P., and Jayabalan, M. (2020, January 14–17). Need for Interpretable Student Performance Prediction. Proceedings of the 2020 13th International Conference on Developments in eSystems Engineering (DeSE), Liverpool, UK. 10.1109/dese51703.2020.9450735
[35]
Francis "Predicting Academic Performance of Students Using a Hybrid Data Mining Approach" J. Med. Syst. (2019) 10.1007/s10916-019-1295-4
[36]
Huang "Predicting student academic performance in an engineering dynamics course: A comparison of four types of predictive mathematical models" Comput. Educ. (2013) 10.1016/j.compedu.2012.08.015
[37]
Khan "A Conceptual Framework to Aid Attribute Selection in Machine Learning Student Performance Prediction Models" Int. J. Interact. Mob. Technol. (IJIM) (2021) 10.3991/ijim.v15i15.20019
[38]
Khosravi "Leveraging educational data mining: XGBoost and random forest for predicting student achievement" Int. J. Data Sci. Adv. Anal. (2024)
[39]
Khuma "Factors that influence academic performance of students: An empirical study" Seybold Rep. (2023)
[40]
Liz-Domínguez, M., Caeiro-Rodríguez, M., Llamas-Nistal, M., and Mikic-Fonte, F.A. (2019). Systematic Literature Review of Predictive Analysis Tools in Higher Education. Appl. Sci., 9. 10.3390/app9245569
[41]
"Early prediction of student learning performance through data mining: A systematic review" Psicothema (2021)
[42]
Matzavela "Decision tree learning through a Predictive Model for Student Academic Performance in Intelligent M-Learning environments" Comput. Educ. Artif. Intell. (2021) 10.1016/j.caeai.2021.100035
[43]
Mduma, N. (2023). Data Balancing Techniques for Predicting Student Dropout Using Machine Learning. Data, 8. 10.3390/data8030049
[44]
Morais, M., Araújo, J., and Costa, J. (2014, January 22–25). Monitoring student performance using data clustering and predictive modelling. Proceedings of the 2014 IEEE Frontiers in Education Conference (FIE) Proceedings, Madrid, Spain.
[45]
Naito, T., Baba, A., Kashima, H., Takaki, T., and Funo, Y. (2018, January 3–4). Predictive Modeling of Learning Continuation in Preschool Education Using Temporal Patterns of Development Tests. Proceedings of the Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, New Orleans, LA, USA. 10.1609/aaai.v32i1.11393
[46]
Payzan-LeNestour, E., and Bossaerts, P. (2011). Risk, unexpected uncertainty, and estimation uncertainty: Bayesian learning in unstable settings. PLoS Comput. Biol., 7. 10.1371/journal.pcbi.1001048
[47]
Pereira "Explaining Individual and Collective Programming Students’ Behavior by Interpreting a Black-Box Predictive Model" IEEE Access (2021) 10.1109/access.2021.3105956
[48]
Ramaswami, G., Susnjak, T., and Mathrani, A. (2022). On developing generic models for predicting student outcomes in educational data mining. Big Data Cogn. Comput., 6. 10.3390/bdcc6010006
[49]
Delen "Predicting and analyzing secondary education placement-test scores: A data mining approach" Expert Syst. Appl. (2012) 10.1016/j.eswa.2012.02.112
[50]
Siram "Towards a framework for performance management and machine learning in a higher education institution" J. Inform. Educ. Res. (2024)

Showing 50 of 125 references

Metrics
48
Citations
125
References
Details
Published
Dec 13, 2024
Vol/Issue
8(12)
Pages
187
License
View
Cite This Article
Ahlam Almalawi, Ben Soh, Alice Li, et al. (2024). Predictive Models for Educational Purposes: A Systematic Review. Big Data and Cognitive Computing, 8(12), 187. https://doi.org/10.3390/bdcc8120187
Related

You May Also Like

Fog Computing and the Internet of Things: A Review

Hany Atlam, Robert Walters · 2018

441 citations

Artificial Intelligence in Pharmaceutical and Healthcare Research

Subrat Kumar Bhattamisra, Priyanka Banerjee · 2023

202 citations