A machine learning approach to predict university enrolment choices through students' high school background in Italy (2403.13819v1)
Abstract: This paper explores the influence of Italian high school students' proficiency in mathematics and the Italian language on their university enrolment choices, specifically focusing on STEM (Science, Technology, Engineering, and Mathematics) courses. We distinguish between students from scientific and humanistic backgrounds in high school, providing valuable insights into their enrolment preferences. Furthermore, we investigate potential gender differences in response to similar previous educational choices and achievements. The study employs gradient boosting methodology, known for its high predicting performance and ability to capture non-linear relationships within data, and adjusts for variables related to the socio-demographic characteristics of the students and their previous educational achievements. Our analysis reveals significant differences in the enrolment choices based on previous high school achievements. The findings shed light on the complex interplay of academic proficiency, gender, and high school background in shaping students' choices regarding university education, with implications for educational policy and future research endeavours.
- Visualizing the effects of predictor variables in black box supervised learning models. Journal of the Royal Statistical Society Series B: Statistical Methodology, 82(4):1059–1086.
- Science aspirations, capital, and family habitus: How families shape children’s engagement and identification with science. American educational research journal, 49(5):881–908.
- Gender segregation in higher education: an empirical test of seven explanations. Higher Education, 79(1):55–78.
- Nudging gender desegregation: A field experiment on the causal effect of information barriers on gender inequalities in higher education. European Societies, 21(3):356–377.
- Gender stereotypes about intellectual ability emerge early and influence children’s interests. Science, 355(6323):389–391.
- Schools and inequality: A multilevel analysis of coleman’s equality of educational opportunity data. Teachers College Record, 112(5):1201–1246.
- Boudon, R. (1974). Education, opportunity, and social inequality: Changing prospects in western society.
- Cheryan, S. (2012). Understanding the paradox in math-related fields: Why do some gender gaps remain while others do not? Sex roles, 66(3):184–190.
- Why are some stem fields more gender-balanced than others? Psychological bulletin, 143(1):1.
- Gender differences in high school choices: Do math and language skills play a role?
- Between formal openness and stratification in secondary education: Implications for social inequalities in italy. Models of secondary education and social inequality: An international comparison, pages 305–322.
- Correll, S. J. (2001). Gender and the career choice process: The role of biased self-assessments. American journal of Sociology, 106(6):1691–1730.
- Math–gender stereotypes in elementary school children. Child development, 82(3):766–779.
- The effect of high school rank in english and math on college major choice.
- Educational data mining: Predictive analysis of academic performance of public school students in the capital of brazil. Journal of business research, 94:335–343.
- Additive logistic regression: a statistical view of boosting (with discussion and a rejoinder by the authors). The annals of statistics, 28(2):337–407.
- Friedman, J. H. (2001). Greedy function approximation: a gradient boosting machine. Annals of statistics, pages 1189–1232.
- Friedman, J. H. (2002). Stochastic gradient boosting. Computational statistics & data analysis, 38(4):367–378.
- Gender streaming and prior achievement in high school science and mathematics. Economics of Education Review, 53:230–253.
- Gender differences in fields of study: the role of significant others and rational choice motivations. European Sociological Review, 31(3):284–297.
- School size and students’ achievement. empirical evidences from pisa survey data. Socio-Economic Planning Sciences, 64:66–77.
- Girls in stem: Is it a female role-model thing? Frontiers in psychology, 11:2204.
- Grömping, U. (2020). Model-agnostic effects plots for interpreting machine learning models. Reports in Mathematics, Physics and Chemistry, Department II, Beuth University of Applied Sciences Berlin Report, 1:2020.
- Hadjar, A. (2019). Educational expansion and inequalities: how did inequalities by social origin and gender decrease in modern industrial societies. Research handbook on the sociology of education. Cheltenham, UK: Edward Elgar Publishing, pages 173–192.
- Educational expansion: Expected and unexpected consequences. Expected and Unexpected Consequences of the Educational Expansion in Europe and the US. Bern: Haupt, pages 9–23.
- Education systems and gender inequalities in educational attainment. In Education systems and inequalities, pages 159–184. Policy Press.
- Machine learning for the educational sciences. Review of Education, 9(3):e3310.
- ISTAT (2023). Rapporto BES 2022, “il benessere equo e sostenibile in italia”’.
- Kromydas, T. (2017). Rethinking higher education and its relationship with social inequalities: past knowledge, present state and future potential. Palgrave communications, 3(1):1–12.
- School context and the gender gap in educational achievement. American Sociological Review, 77(3):463–485.
- Gender differences in higher education from a life course perspective: transitions and social inequality between enrolment and first post-doc position. Higher Education, 77:381–402.
- Horizontal and vertical gender segregation in higher education: Eu 28 under scrutiny. Managerial Challenges of the Contemporary Society. Proceedings, 8(1):162.
- The gender gap in stem fields: The impact of the gender stereotype of math and science on secondary students’ career aspirations. In Frontiers in Education, page 60. Frontiers.
- McNally, S. (2020). Gender differences in tertiary education: what explains stem participation? Technical report, IZA Policy Paper.
- MOBYSU.IT (2017). Database MOBYSU.IT, Mobilità degli studi universitari italiani, Research Protocol MUR - Universities of Cagliari, Palermo, Siena, Torino, Sassari, Firenze, Cattolica and Napoli Federico II, Scientific Coordinator Massimo Attanasio (UNIPA), Data Source ANS-MUR/CINECA.
- Molnar, C. (2020). Interpretable machine learning. Lulu. com.
- OECD (2022). Education at a Glance 2022.
- Social inequalities in the choice of secondary school: Long-term trends during educational expansion and reforms in italy. European Societies, 16(5):666–693.
- Estimating the peers effect on students’ university choices. In Book of short papers. IES 2022 Innovation & society 5.0: statistical and economic methodologies for quality assessment, pages 134–139. PKE srl.
- Unveiling gender disparities in university pathways: insights from italy’s master’s level. European Journal of Higher Education, pages 1–24.
- Does taking additional maths classes in high school affect academic outcomes? Socio-Economic Planning Sciences, page 101674.
- Attitudes, interest and factors influencing stem enrolment behaviour: An overview of relevant literature. Understanding student participation and choice in science and technology education, pages 63–88.
- Ridgeway, G. (2007). Generalized boosted models: A guide to the gbm package. Update, 1(1):2007.
- Gender differences in higher education choices. italian girls in the corner? The Education of Gender The Gender of Education, page 61.
- Salmieri, L. (2022). Students, parents and school-choices. gendered trajectories in the italian education system. Italian Journal of Sociology of Education, 14(2):99–119.
- Gender differences in tertiary educational attainment and the intergenerational transmission of cultural capital in italy. The Education of Gender The Gender of Education, page 77.
- Sherman, J. (1980). Mathematics, spatial visualization, and related factors: Changes in girls and boys, grades 8–11. Journal of Educational psychology, 72(4):476.
- Is there a shortage of scientists? a re-analysis of supply for the uk. British Journal of Educational Studies, 59(2):159–177.
- The gender-equality paradox in science, technology, engineering, and mathematics education. Psychological science, 29(4):581–593.
- Prediction of differential performance between advanced placement exam scores and class grades using machine learning. In Frontiers in Education, volume 7, page 1007779. Frontiers.
- Gender gap in stem education and career choices: what matters? Journal of Applied Research in Higher Education.
- Teacher, parental and friend influences on stem interest and career choice intention. Issues in Educational Research, 30(4):1558–1575.
- Yağcı, M. (2022). Educational data mining: prediction of students’ academic performance using machine learning algorithms. Smart Learning Environments, 9(1):11.
- Andrea Priulla (1 paper)
- Alessandro Albano (2 papers)
- Nicoletta D'Angelo (15 papers)
- Massimo Attanasio (1 paper)