Predicting Students’ Performance in Mathematics Subjects at Kolej MARA Banting using Machine Learning Methods

Authors

  • Ahmad Akif Ibrahim Mathematics Department, Kolej MARA Banting, Jalan Labohan Dagang, Bukit Changgang, 42700 Banting, Selangor, Malaysia
  • Nor Azuana Ramli Centre for Mathematical Sciences, Universiti Malaysia Pahang Al-Sultan Abdullah, Lebuh Persiaran Tun Khalil Yaakob, 26300 Kuantan, Pahang, Malaysia
  • Sahimel Azwal Sulaiman Mathematics Department, Kolej MARA Banting, Jalan Labohan Dagang, Bukit Changgang, 42700 Banting, Selangor, Malaysia

DOI:

https://doi.org/10.37134/jpsmm.vol15.1.2.2025

Keywords:

Machine Learning, Students’ Performance, Mathematics Subjects, International Baccalaureate, Predictive Modelling

Abstract

Predicting students’ performance is crucial for personalised and educational success for individuals. However, no standard procedure or method considers external factors to predict students’ performance in mathematics at Kolej MARA Banting (KMB). This research aims to address this problem by exploring the potential of machine learning methods for predicting students’ performance in mathematics at KMB. The study follows a machine learning process: data collection, attribute selection, pre-processing, model training, and evaluation. A sample of 703 data points on students’ demographics, academic records, and mathematics performance were collected and pre-processed. Machine learning models such as support vector machine, decision tree, k-nearest neighbours, Naïve Bayes, Random Forest, AdaBoost, and stacking model were applied in this study. The accuracy and performance of these models were assessed to determine which model outperformed the others and its effectiveness in predicting students’ mathematics performance. The study findings demonstrate that the stacking model exhibited superior performance in accuracy (71.43%), precision (68.73%), recall (71.43%), and F1-score (69.80%) compared to the other models. Nevertheless, it is essential to note that the stacking model achieved moderate accuracy. This could be attributed to the inherent difficulties in constructing a precise predictive model for student performance, such as the models failing to sufficiently reflect the complexities within the dataset, resulting in underfitting. Additionally, the target attribute, International Baccalaureate (IB) grade, is imbalanced, with more high performers than low performers, causing the models to be biased towards the majority class and impacting overall accuracy. The performance of the models in this study could be improved by adding more features related to students’ performance, such as anxiety, depression, well-being, and others, to capture enough complexity in the data. It is also suggested that samples from other colleges with a balanced grade distribution be obtained compared to students at KMB.

References

Abdullah, N. A. H., Mohamad, M. S., Abdullah, S. S., & Ahmad, N. (2018). Predicting students' academic performance in mathematics using data mining techniques. Journal of Telecommunication, Electronic and Computer Engineering, 10(1-8), 123-127.

Alija, S., Beqiri, E., Gaafar, A. S., & Hamoud, A. K. (2023). Predicting students performance using supervised machine learning based on imbalanced dataset and wrapper feature selection. Informatica. https://doi.org/10.31449/inf.v47i1.4519

Alloghani, M., Aljaaf, A., Hussain, A. et al. (2020). Correction to: implementation of machine learning algorithms to create diabetic patient re-admission profiles. BMC Med Inform Decis Mak 20, 93. https://doi.org/10.1186/s12911-020-1102-7

Alsariera, Y. A., Baashar, Y., Alkawsi, G., Mustafa, A., Alkahtani, A. A., & Ali, N. (2022). Assessment and evaluation of different machine learning algorithms for predicting student performance. Computational Intelligence and Neuroscience, 2022, 1–11. https://doi.org/10.1155/2022/4151487

Altabrawee, H., Ali, O. A. J., & Ajmi, S. Q. (2019). Predicting students’ performance using machine learning techniques. Journal of University of Babylon for Pure and Applied Sciences, 27(1), 194–205. https://doi.org/10.29196/jubpas.v27i1.2108

Ashfaq, U., P. M., D. B., & Mafas, R. (2020). managing student performance: a predictive analytics using imbalanced data. International Journal of Recent Technology and Engineering (IJRTE). 8(6), 2277–2283. https://doi.org/10.35940/ijrte.e7008.038620

Basheer, M. Y. I., Mutalib, S., Hamid, N. H. A., Abdul-Rahman, S., & Malik, A. M. A. (2019). Predictive analytics of university student intake using supervised methods. IAES International Journal of Artificial Intelligence, 8(4), 367–374. https://doi.org/10.11591/ijai.v8.i4.pp367-374

Bujang,A., S. D., Selamat, A., & Krejcar, O. (2021). A predictive analytics model for students’ grade prediction by supervised machine learning. IOP Conference Series: Materials Science and Engineering, 1051(1), 012005. https://doi.org/10.1088/1757-899x/1051/1/012005

Draelos, V. A. P. B. R. (2020). Measuring performance: auc (auroc). Glass Box. https://glassboxmedicine.com/2019/02/23/measuring-performance-auc-auroc/

Dhilipan, J., Vijayalakshmi, N., Suriya, S., & Christopher, A. (2021). Prediction of students performance using machine learning. IOP Conference Series: Materials Science and Engineering, 1055(1), 012122. https://doi.org/10.1088/1757-899x/1055/1/012122

El-Hajj, M., Yassine, A., Mourad, A., & Hamadé, R. (2020). Predicting academic performance using artificial neural network: The case of Lebanese secondary schools. International Journal of Emerging Technologies in Learning, 15(18), 113-128.

Fatima, R., Saleem, M., & Hamid, M. (2019). Predicting academic performance of secondary school students using discriminant analysis. International Journal of Educational Research and Development, 8(1), 27-37.

Halim, N. D. A., Ismail, N. A. A., Kamarudin, S., & Mohamad, A. S. (2020). Predicting students' academic performance in english using demographic data, self-efficacy and attitude. International Journal of Advanced Science and Technology, 29(6), 3896-3903.

IB, Find countries and universities that recognise the IB. (n.d.). Retrieved (2022) from https://www.ibo.org/university-admission/find-countries-and-universities-that-recognize-the-ib/

Khan, S. A., & Ahmad, R. (2019). Predicting academic performance using decision tree and support vector machine algorithms. Journal of Education and Learning, 8(5), 382-391.

Lenin, T., & Chandrasekaran, N. (2021). Learning from imbalanced educational data using ensemble machine learning algorithms. Webology, 18(Special Issue 01), 183–195. https://doi.org/10.14704/web/v18si01/web18053

M., R., F., N., & A., A. (2018). Predicting and analysis of students’ academic performance using data mining techniques. International Journal of Computer Applications, 182(32), 1–6. https://doi.org/10.5120/ijca2018918250

Md Salleh, N., & Othman, I. (2019). Integrating Emotional Intelligence In Curriculum: Do Female Students Perform Better Than Male Students? (70 - 80). Jurnal Pendidikan Sains Dan Matematik Malaysia, 4(1), 70-80.

Mduma, N. (2023). Data balancing techniques for predicting student dropout using machine learning. Data, 8(3), 49. https://doi.org/10.3390/data8030049

Muhammat Pazil, N. S., Mahmud, N., & Azman, N. A. N. (2022). The Impact of COVID-19 on Academic Performance of Bachelor’s Degree Students. Jurnal Pendidikan Sains Dan Matematik Malaysia, 12(1), 93-100.

Norhuda, M. S., Zulkifli, N. N., Rosli, N. M., & Jali, M. F. (2021). Predicting academic performance in bahasa melayu subject among secondary school students. International Journal of Advanced Science and Technology, 30(5), 1545-1554.

Yaacob, W. F. W., Nasir, S. A. M., Yaacob, W. F. W., & Sobri, N. M. (2019). Supervised data mining approach for predicting student performance. Indonesian Journal of Electrical Engineering and Computer Science, 16(3), 1584. https://doi.org/10.11591/ijeecs.v16.i3.pp1584-1592

Yağcı, M. (2022). Educational data mining: prediction of students’ academic performance using machine learning algorithms. Smart Learning Environments, 9(1). https://doi.org/10.1186/s40561-022-00192-z

Zohair, L. M. A. (2019). Prediction of student’s performance by modelling small dataset size. International Journal of Educational Technology in Higher Education. https://doi.org/10.1186/s41239-019-0160-3

Downloads

Published

2025-04-28

How to Cite

Ibrahim, A. A. ., Ramli, N. A. ., & Sulaiman, S. A. . (2025). Predicting Students’ Performance in Mathematics Subjects at Kolej MARA Banting using Machine Learning Methods. Jurnal Pendidikan Sains Dan Matematik Malaysia, 15(1), 19-31. https://doi.org/10.37134/jpsmm.vol15.1.2.2025