{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,11,19]],"date-time":"2024-11-19T18:51:18Z","timestamp":1732042278967},"reference-count":46,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,12,23]],"date-time":"2022-12-23T00:00:00Z","timestamp":1671753600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,12,23]],"date-time":"2022-12-23T00:00:00Z","timestamp":1671753600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Smart Learn. Environ."],"abstract":"Abstract<\/jats:title>The tremendous growth in electronic educational data creates the need to have meaningful information extracted from it. Educational Data Mining (EDM) is an exciting research area that can reveal valuable knowledge from educational databases. This knowledge can be used for many purposes, including identifying dropouts or weak students who need special attention and discovering extraordinary students who can be offered lifetime opportunities. Although former studies in EDM used an extensive range of features for predicting students\u2019 academic achievement (in terms of (i) achieved grades or (ii) passing and failing), those features are sometimes not obtainable for practical usage, and therefore, the prediction models are not feasible for employment. This study uses data mining (DM) algorithms to predict the academic performance of master\u2019 s students by using a non-extensive data set and including only the features that are easy to collect at the beginning of a studying program. To perform this study, we have collected over 700 students' records from 2010 to 2018 from the Faculty of Business Informatics and Mathematics at the University of Mannheim in Germany. Those records include demographics and post-enrollment features such as semester grades. The empirical results show the following: (i) the most significant features for predicting students' academic achievements are the students\u2019 grades in each semester (importance rate between 14 and 36%), followed by the distance from students\u2019 accommodation to university (importance rate between 6 and 18%) and culture (importance rate between 7 and 17%). On the other hand, gender, age, the numbers of failed courses, and the number of registered and unregistered exams per semester are less significant for the predictions. (ii) As expected, predictions performed after the second semester is more accurate than those performed after the first semester. (iii) Unsurprisingly, models that predict two classes yield better results than those that predict three. (iv) Random Forest classifier performs the best in all prediction models (0.77\u20130.94 accuracy), and using oversampling methods to deal with imbalanced data can significantly improve the performance of DM methods. For future work, we recommend testing the predictive models on other master programs and a larger datasets. Furthermore, we recommend investigating other oversampling approaches.<\/jats:p>","DOI":"10.1186\/s40561-022-00220-y","type":"journal-article","created":{"date-parts":[[2022,12,23]],"date-time":"2022-12-23T13:04:59Z","timestamp":1671800699000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Predicting Master\u2019s students\u2019 academic performance: an empirical study in Germany"],"prefix":"10.1186","volume":"9","author":[{"ORCID":"http:\/\/orcid.org\/0000-0003-4264-1490","authenticated-orcid":false,"given":"Sarah","family":"Alturki","sequence":"first","affiliation":[]},{"given":"Lea","family":"Cohausz","sequence":"additional","affiliation":[]},{"given":"Heiner","family":"Stuckenschmidt","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,12,23]]},"reference":[{"issue":"1","key":"220_CR1","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/S41239-019-0160-3\/FIGURES\/13","volume":"16","author":"LMZ Abu","year":"2019","unstructured":"Abu, L. M. Z. (2019). Prediction of student\u2019s performance by modelling small dataset size. International Journal of Educational Technology in Higher Education, 16(1), 1\u201318. https:\/\/doi.org\/10.1186\/S41239-019-0160-3\/FIGURES\/13","journal-title":"International Journal of Educational Technology in Higher Education"},{"issue":"2","key":"220_CR2","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1080\/07294360.2019.1664999","volume":"39","author":"R Ajjawi","year":"2020","unstructured":"Ajjawi, R., Dracup, M., Zacharias, N., Bennett, S., & Boud, D. (2020). Persisting students\u2019 explanations of and emotional responses to academic failure. Higher Education Research and Development, 39(2), 185\u2013199. https:\/\/doi.org\/10.1080\/07294360.2019.1664999","journal-title":"Higher Education Research and Development"},{"issue":"04","key":"220_CR3","first-page":"666","volume":"4","author":"YM Alemu","year":"2015","unstructured":"Alemu, Y. M. (2015). Application of data mining techniques for student success and failure prediction (the case of Debre_Markos University). International Journal of Scientific and Technology Research, 4(04), 666.","journal-title":"International Journal of Scientific and Technology Research"},{"key":"220_CR4","first-page":"121","volume":"20","author":"S Alturki","year":"2021","unstructured":"Alturki, S., & Alturki, N. (2021). Using educational data mining to predict Students\u2019 academic performance for applying early interventions. Journal of Information Technology Education: Innovations in Practice, 20, 121\u2013137.","journal-title":"Journal of Information Technology Education: Innovations in Practice"},{"key":"220_CR5","doi-asserted-by":"publisher","DOI":"10.1007\/s10758-020-09476-0","author":"S Alturki","year":"2020","unstructured":"Alturki, S., Hulpus, I., & Stuckenschmidt, H. (2020). Predicting academic outcomes: A survey from 2007 till 2018. Technology, Knowledge and Learning. https:\/\/doi.org\/10.1007\/s10758-020-09476-0","journal-title":"Technology, Knowledge and Learning"},{"key":"220_CR6","doi-asserted-by":"publisher","DOI":"10.1108\/JARHE-01-2021-0034\/FULL\/XML","author":"S Alturki","year":"2021","unstructured":"Alturki, S., & Stuckenschmidt, H. (2021). Assessing students\u2019 self-assessment ability in an interdisciplinary domain. Journal of Applied Research in Higher Education, ahead-of-print. https:\/\/doi.org\/10.1108\/JARHE-01-2021-0034\/FULL\/XML","journal-title":"Journal of Applied Research in Higher Education, ahead-of-print"},{"key":"220_CR7","doi-asserted-by":"publisher","first-page":"177","DOI":"10.1016\/j.compedu.2017.05.007","volume":"113","author":"R Asif","year":"2017","unstructured":"Asif, R., Merceron, A., Abbas, A. S., & Ghani, H. N. E. D. N. (2017). Analyzing undergraduate students\u2019 performance using educational data mining. Computers and Education, 113, 177\u2013194. https:\/\/doi.org\/10.1016\/j.compedu.2017.05.007","journal-title":"Computers and Education"},{"key":"220_CR8","unstructured":"Aulck, L., Velagapudi, N., Blumenstock, J. & West, J. (2017). Predicting Student Dropout in Higher Education. https:\/\/arxiv.org\/pdf\/1606.06364.pdf."},{"key":"220_CR9","doi-asserted-by":"publisher","first-page":"80","DOI":"10.1016\/j.procs.2016.04.012","volume":"82","author":"G Badr","year":"2016","unstructured":"Badr, G., Algobail, A., Almutairi, H., & Almutery, M. (2016). Predicting students\u2019 performance in University courses: A case study and tool in KSU Mathematics Department. Procedia Computer Science, 82, 80\u201389. https:\/\/doi.org\/10.1016\/j.procs.2016.04.012","journal-title":"Procedia Computer Science"},{"issue":"1","key":"220_CR10","first-page":"3","volume":"1","author":"R Baker","year":"2009","unstructured":"Baker, R., & Yacef, K. (2009). The state of educational data mining in 2009: A review and future visions. Journal of Educational Data Mining, 1(1), 3\u201317.","journal-title":"Journal of Educational Data Mining"},{"key":"220_CR11","doi-asserted-by":"publisher","DOI":"10.5753\/rbie.2011.19.02.03","author":"R Baker","year":"2011","unstructured":"Baker, R., Isotani, S., & Carvalho, A. (2011). Minera\u00e7\u00e3o de Dados Educacionais: Oportunidades Para o Brasil. Revista Brasileira de Inform\u00e1tica Na Educa\u00e7\u00e3o. https:\/\/doi.org\/10.5753\/rbie.2011.19.02.03","journal-title":"Revista Brasileira de Inform\u00e1tica Na Educa\u00e7\u00e3o"},{"key":"220_CR12","doi-asserted-by":"crossref","unstructured":"Baradwaj, B. K. & Pal, S. (2011). Mining educational data to analyze students\u201d performance. In IJACSA) international journal of advanced computer science and applications (Vol. 2, Issue 6). www.ijacsa.thesai.org.","DOI":"10.14569\/IJACSA.2011.020609"},{"issue":"1","key":"220_CR13","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5\u201332. https:\/\/doi.org\/10.1023\/A:1010933404324.","journal-title":"Machine Learning"},{"issue":"4","key":"220_CR14","first-page":"501","volume":"50","author":"F Calisir","year":"2016","unstructured":"Calisir, F., Basak, E., & Comertoglu, S. (2016). Predicting academic performance of Master\u2019s Students in engineering management. College Student Journal, 50(4), 501\u2013512.","journal-title":"College Student Journal"},{"key":"220_CR15","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1613\/JAIR.953","volume":"16","author":"NV Chawla","year":"2002","unstructured":"Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321\u2013357. https:\/\/doi.org\/10.1613\/JAIR.953.","journal-title":"Journal of Artificial Intelligence Research"},{"key":"220_CR16","unstructured":"Clark, M. (2013). An introduction to machine learning with applications in R. Retrieved 17 October, 2022 from http:\/\/web.ipac.caltech.edu\/staff\/fmasci\/home\/astro_refs\/ML_inR.pdf."},{"key":"220_CR17","doi-asserted-by":"publisher","unstructured":"Daud, A., Aljohani, N. R., Ayaz, A. R., Lytras, M. D., Abbas, F. & Alowibdi, J. S. (2017). Predicting student performance using advanced learning analytics. In WWW 17 Companion: Proceedings of the 26th International Conference on World Wide Web Companion. https:\/\/doi.org\/10.1145\/3041021.3054164.","DOI":"10.1145\/3041021.3054164"},{"key":"220_CR18","unstructured":"Geng, M. 2006. A comparison of logistic regression to random forests for exploring differences in risk factors associated with stage at diagnosis between black and white colon cancer patients. Master's Thesis, University of Pittsburgh. Retrieved 17 October, 2022 from http:\/\/d-scholarship.pitt.edu\/id\/eprint\/7034."},{"key":"220_CR19","unstructured":"Han, J., Kamber, M. & Pei, J. (2011). Data Mining. Concepts and Techniques, 3rd Edition (The Morgan Kaufmann Series in Data Management Systems)."},{"key":"220_CR20","doi-asserted-by":"publisher","DOI":"10.1016\/j.compedu.2012.08.015","author":"S Huang","year":"2013","unstructured":"Huang, S., & Fang, N. (2013). Predicting Student Academic Performance in an Engineering Dynamics Course: A Comparison of Four Types of Predictive Mathematical Models. https:\/\/doi.org\/10.1016\/j.compedu.2012.08.015","journal-title":"Predicting Student Academic Performance in an Engineering Dynamics Course: A Comparison of Four Types of Predictive Mathematical Models."},{"key":"220_CR21","doi-asserted-by":"publisher","DOI":"10.1080\/01443410.2018.1502412","author":"LM Jeno","year":"2018","unstructured":"Jeno, L. M., Danielsen, A. G., & Raaheim, A. (2018). Educational Psychology an International Journal of Experimental Educational Psychology A Prospective Investigation of Students\u2019 Academic Achievement and Dropout in Higher Education: A Self-Determination Theory Approach. https:\/\/doi.org\/10.1080\/01443410.2018.1502412","journal-title":"Educational Psychology an International Journal of Experimental Educational Psychology A Prospective Investigation of Students' Academic Achievement and Dropout in Higher Education: A Self-Determination Theory Approach."},{"issue":"1","key":"220_CR22","doi-asserted-by":"publisher","first-page":"61","DOI":"10.2478\/cait-2013-0006","volume":"13","author":"D Kabakchieva","year":"2013","unstructured":"Kabakchieva, D. (2013). Predicting student performance by using data mining methods for classification. Cybernetics and Information Technologies, 13(1), 61\u201372. https:\/\/doi.org\/10.2478\/cait-2013-0006","journal-title":"Cybernetics and Information Technologies"},{"issue":"2","key":"220_CR23","doi-asserted-by":"publisher","first-page":"147","DOI":"10.1556\/063.9.2019.1.18","volume":"9","author":"BM Kehm","year":"2019","unstructured":"Kehm, B. M., Larsen, M. R., & Sommersel, H. B. (2019). Student dropout from universities in Europe: A review of empirical literature. Hungarian Educational Research Journal, 9(2), 147\u2013164. https:\/\/doi.org\/10.1556\/063.9.2019.1.18","journal-title":"Hungarian Educational Research Journal"},{"key":"220_CR24","unstructured":"Kercher, J. (2018). Academic success and dropout among international students in Germany and other major host countries."},{"key":"220_CR25","unstructured":"Kim, U. 1995. Individualism and collectivism a psychological, cultural and ecological analysis. http:\/\/eurasia.nias.ku.dk\/publications\/."},{"issue":"2","key":"220_CR26","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1504\/ijkesdp.2009.022718","volume":"1","author":"S Kotsiantis","year":"2009","unstructured":"Kotsiantis, S. (2009). Educational data mining: A case study for predicting dropout-prone students. International Journal of Knowledge Engineering and Soft Data Paradigms, 1(2), 101. https:\/\/doi.org\/10.1504\/ijkesdp.2009.022718","journal-title":"International Journal of Knowledge Engineering and Soft Data Paradigms"},{"key":"220_CR27","unstructured":"Kova\u010di\u0107, Z. J. (2010). Early prediction of student success: Mining students enrolment data. In Proceedings of informing science and IT education conference (pp. 647\u2013665)."},{"issue":"1","key":"220_CR28","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1525\/collabra.153","volume":"4","author":"DA Moore","year":"2018","unstructured":"Moore, D. A., Dev, A. S., & Goncharova, E. Y. (2018). Overconfidence across cultures. Collabra: Psychology, 4(1), 36. https:\/\/doi.org\/10.1525\/collabra.153.","journal-title":"Collabra: Psychology"},{"issue":"2","key":"220_CR29","first-page":"190","volume":"12","author":"D Mouton","year":"2020","unstructured":"Mouton, D., Zhang, H., & Ertl, B. (2020). German university student\u2019s reasons for dropout. Identifying latent classes. Journal for Educational Research, 12(2), 190\u2013224.","journal-title":"Journal for Educational Research"},{"issue":"3","key":"220_CR30","doi-asserted-by":"publisher","first-page":"1821","DOI":"10.30534\/ijatcse\/2021\/461032021","volume":"10","author":"M Nadeem","year":"2021","unstructured":"Nadeem, M., Palaniappan, S., & Haider, W. (2021). Impact of Postgraduate Students dropout and delay in University: Analysis using machine learning algorithms. International Journal of Advanced Trends in Computer Science and Engineering, 10(3), 1821\u20131826. https:\/\/doi.org\/10.30534\/ijatcse\/2021\/461032021","journal-title":"International Journal of Advanced Trends in Computer Science and Engineering"},{"key":"220_CR31","doi-asserted-by":"publisher","unstructured":"Nguyen, T. N., Janecek, P. & Haddawy, P. (2007). A comparative analysis of techniques for predicting academic performance. In 2007 37th annual frontiers in education conference-global engineering: Knowledge without borders, opportunities without passports, T2G-7-T2G-12. https:\/\/doi.org\/10.1109\/FIE.2007.4417993.","DOI":"10.1109\/FIE.2007.4417993"},{"issue":"1","key":"220_CR32","first-page":"958","volume":"10","author":"E Osmanbegovi\u0107","year":"2012","unstructured":"Osmanbegovi\u0107, E., Suljic, M., & Sulji\u0107, M. (2012). Data mining approach for predicting student performance. Journal of Economics and Business, 10(1), 958.","journal-title":"Journal of Economics and Business"},{"issue":"5","key":"220_CR33","first-page":"2278","volume":"4","author":"AK Pal","year":"2013","unstructured":"Pal, A. K., & Pal, S. (2013). Analysis and mining of educational data for predicting the performance of students. International Journal of Electronics Communication and Computer Engineering, 4(5), 2278\u20134209.","journal-title":"International Journal of Electronics Communication and Computer Engineering"},{"issue":"5","key":"220_CR34","doi-asserted-by":"publisher","first-page":"26","DOI":"10.5120\/ijca2015905328","volume":"123","author":"A Pradeep","year":"2015","unstructured":"Pradeep, A., & Thomas, J. (2015). Predicting college students dropout using EDM techniques. International Journal of Computer Applications, 123(5), 26\u201334.","journal-title":"International Journal of Computer Applications"},{"issue":"33","key":"220_CR35","doi-asserted-by":"publisher","first-page":"134","DOI":"10.1016\/j.eswa.2006.04.005","volume":"33","author":"C Romero","year":"2007","unstructured":"Romero, C., & Ventura, S. (2007). Educational data mining: A survey from 1995 to 2005. ScienceDirect, 33(33), 134\u2013146. https:\/\/doi.org\/10.1016\/j.eswa.2006.04.005","journal-title":"ScienceDirect"},{"issue":"5","key":"220_CR36","doi-asserted-by":"publisher","first-page":"1070","DOI":"10.1080\/07294360.2020.1799951","volume":"40","author":"N Rotem","year":"2020","unstructured":"Rotem, N., Yair, G., & Shustak, E. (2020). Dropping out of Master\u2019s Degrees: Objective Predictors and Subjective Reasons., 40(5), 1070\u20131084. https:\/\/doi.org\/10.1080\/07294360.2020.1799951","journal-title":"Dropping out of Master's Degrees: Objective Predictors and Subjective Reasons."},{"key":"220_CR37","first-page":"110","volume":"6","author":"S Sembiring","year":"2011","unstructured":"Sembiring, S., Zarlis, M., Hartama, D., Wani, E., & Magister, P. (2011). Prediction of student academic performance by an application of data mining techniques. International Conference on Management and Artificial Intelligence, 6, 110\u2013114.","journal-title":"International Conference on Management and Artificial Intelligence"},{"key":"220_CR38","unstructured":"Shakeel, K. & Anwer, B. N. (2015). Educational data mining to reduce student dropout rate by using classification. In 253rd OMICS international conference on big data analysis & data mining."},{"key":"220_CR39","unstructured":"Simpeh, F. & Akinlolu, M. (2018). Importance level of on-campus student housing facility spaces: Perception of Postgraduate Students. In 10th Cidb postgraduate conference"},{"key":"220_CR40","unstructured":"Staiculescu, C. & Richiteanu, N. E. R. (2018). University dropout. Causes and solution. In Mental health global challenges XXI Century conference proceedings."},{"issue":"1","key":"220_CR41","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1080\/17421772.2017.1369146","volume":"13","author":"C Vieira","year":"2018","unstructured":"Vieira, C., Vieira, I., & Raposo, L. (2018). Distance and academic performance in higher education. Spatial Economic Analysis, 13(1), 60\u201379. https:\/\/doi.org\/10.1080\/17421772.2017.1369146","journal-title":"Spatial Economic Analysis"},{"issue":"12","key":"220_CR42","first-page":"13","volume":"1","author":"SK Yadav","year":"2011","unstructured":"Yadav, S. K., Bharadwaj, B., & Pal, S. (2011). Data mining applications: A comparative study for predicting student\u2019s performance. International Journal of Innovative Technology and Creative Engineering, 1(12), 13\u201319.","journal-title":"International Journal of Innovative Technology and Creative Engineering"},{"issue":"2","key":"220_CR43","first-page":"51","volume":"2","author":"SK Yadav","year":"2012","unstructured":"Yadav, S. K., & Pal, S. (2012). Data mining: A prediction for performance improvement of engineering students using classification. World of Computer Science and Information Technology Journal (WCSIT), 2(2), 51\u201356.","journal-title":"World of Computer Science and Information Technology Journal (WCSIT)"},{"key":"220_CR44","unstructured":"Zhao, Y., Qiangwen, X., Ming, C., & Gary, M. W. (2020). Predicting student performance in a Master of data science program using admissions data. In Proceedings of The 13th international conference on educational data mining (pp. 325\u201333)."},{"key":"220_CR45","unstructured":"Zimmermann, J., Brodersen, K. H., Pellet, J.-P., August, E. & Buhmann, J. M. (2011). Predicting graduate-level performance from undergraduate achievements. In Proceedings of the 4th international conference on educational data mining. https:\/\/www.researchgate.net\/publication\/221570422_Predicting_Graduate-level_Performance_from_Undergraduate_Achievements."},{"issue":"9\/10","key":"220_CR46","doi-asserted-by":"publisher","first-page":"1","DOI":"10.17159\/SAJS.2015\/20140298","volume":"111","author":"T Zewotir","year":"2015","unstructured":"Zewotir, T., North, D., & Murray, M. (2015). The time to degree or dropout amongst full-time Master\u2019s students at University of KwaZulu-Natal. South African Journal of Science, 111(9\/10), 1\u20136. https:\/\/doi.org\/10.17159\/SAJS.2015\/20140298","journal-title":"South African Journal of Science"}],"container-title":["Smart Learning Environments"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40561-022-00220-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40561-022-00220-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40561-022-00220-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,23]],"date-time":"2022-12-23T13:08:51Z","timestamp":1671800931000},"score":1,"resource":{"primary":{"URL":"https:\/\/slejournal.springeropen.com\/articles\/10.1186\/s40561-022-00220-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,23]]},"references-count":46,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["220"],"URL":"https:\/\/doi.org\/10.1186\/s40561-022-00220-y","relation":{},"ISSN":["2196-7091"],"issn-type":[{"value":"2196-7091","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,23]]},"assertion":[{"value":"2 September 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 December 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 December 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that there is no competing interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"38"}}