Journal article
Machine Learning Approaches for Predicting U.S. Students’ Scientific Literacy: An Analysis of Key Factors Across Performance Levels and Socioeconomic Statuses
International journal of science and mathematics education, Vol.23(7), pp.2755-2783
10/2025
DOI: 10.1007/s10763-025-10545-y
Abstract
Many scholars have traditionally examined features linked to students’ science performance by using classical statistical methods. However, there is a dearth of research on predicting student achievements by using machine learning (ML) models in science education. We aim to address this gap using the 2006 and 2015 Program for International Student Assessment datasets to construct prediction models for students’ scientific literacy. We applied three models (traditional regression, XGBoost, and random forest) to compare their accuracy. The two ML models showed more accuracy and interpretability than the regression model. For the overall group, the results of two ML models indicated that grade, enjoyment, and self-perceived scientific literacy were the most important predictors of scientific literacy. For both low and high performers, the results of the two ML models were similar to the overall group regardless of socioeconomic status. In both ML models, motivation was the least important variable overall, yet it became the most significant predictor in the regression model. Enjoyment, which exerted the highest influence in the two ML models, was the least significant predictor in the regression model. This study highlights the benefits of integrating ML models into predictive analyses in science education.
Details
- Title: Subtitle
- Machine Learning Approaches for Predicting U.S. Students’ Scientific Literacy: An Analysis of Key Factors Across Performance Levels and Socioeconomic Statuses
- Creators
- Hyesun YouMinju HongLi ZhuZhenhan Fang
- Resource Type
- Journal article
- Publication Details
- International journal of science and mathematics education, Vol.23(7), pp.2755-2783
- DOI
- 10.1007/s10763-025-10545-y
- ISSN
- 1571-0068
- eISSN
- 1573-1774
- Publisher
- SPRINGER
- Language
- English
- Electronic publication date
- 01/24/2025
- Date published
- 10/2025
- Academic Unit
- Teaching and Learning
- Record Identifier
- 9984781271202771
Metrics
8 Record Views