Predicting Learning Performance with Large Language Models: A Study in Adult Literacy (2403.14668v1)
Abstract: Intelligent Tutoring Systems (ITSs) have significantly enhanced adult literacy training, a key factor for societal participation, employment opportunities, and lifelong learning. Our study investigates the application of advanced AI models, including LLMs like GPT-4, for predicting learning performance in adult literacy programs in ITSs. This research is motivated by the potential of LLMs to predict learning performance based on its inherent reasoning and computational capabilities. By using reading comprehension datasets from the ITS, AutoTutor, we evaluate the predictive capabilities of GPT-4 versus traditional machine learning methods in predicting learning performance through five-fold cross-validation techniques. Our findings show that the GPT-4 presents the competitive predictive abilities with traditional machine learning methods such as Bayesian Knowledge Tracing, Performance Factor Analysis, Sparse Factor Analysis Lite (SPARFA-Lite), tensor factorization and eXtreme Gradient Boosting (XGBoost). While XGBoost (trained on local machine) outperforms GPT-4 in predictive accuracy, GPT-4-selected XGBoost and its subsequent tuning on the GPT-4 platform demonstrates superior performance compared to local machine execution. Moreover, our investigation into hyper-parameter tuning by GPT-4 versus grid-search suggests comparable performance, albeit with less stability in the automated approach, using XGBoost as the case study. Our study contributes to the field by highlighting the potential of integrating LLMs with traditional machine learning models to enhance predictive accuracy and personalize adult literacy education, setting a foundation for future research in applying LLMs within ITSs.
- National Research Council “Improving adult literacy instruction: Options for practice and research” National Academies Press, 2012
- “Literacy, lives and learning” Routledge, 2012
- “Reading comprehension” Springer, 2011
- “Diagnostic Assessment of Adults’ Reading Deficiencies in an Intelligent Tutoring System.” In ITS Workshops, 2018, pp. 105–112
- “A conversation-based intelligent tutoring system benefits adult readers with Low literacy skills” In Adaptive Instructional Systems: First International Conference, AIS 2019, Held as Part of the 21st HCI International Conference, HCII 2019, Orlando, FL, USA, July 26–31, 2019, Proceedings 21, 2019, pp. 604–614 Springer
- “AutoTutor: A tutor with dialogue in natural language” In Behavior Research Methods, Instruments, & Computers 36 Springer, 2004, pp. 180–192
- Benjamin D Nye, Arthur C Graesser and Xiangen Hu “AutoTutor and family: A review of 17 years of natural language tutoring” In International Journal of Artificial Intelligence in Education 24 Springer, 2014, pp. 427–469
- “Using AutoTutor to track performance and engagement in a reading comprehension intervention for adult literacy students” In Revista Signos. Estudios de Lingüística 54.107, 2021
- Albert T Corbett, Kenneth R Koedinger and John R Anderson “Intelligent tutoring systems” In Handbook of human-computer interaction Elsevier, 1997, pp. 849–874
- Arthur C Graesser, Mark W Conley and Andrew Olney “Intelligent tutoring systems.” American Psychological Association, 2012
- Albert T Corbett and John R Anderson “Knowledge tracing: Modeling the acquisition of procedural knowledge” In User modeling and user-adapted interaction 4 Springer, 1994, pp. 253–278
- Philip I Pavlik, Luke G Eglington and Leigh M Harrell-Williams “Logistic knowledge tracing: A constrained framework for learner modeling” In IEEE Transactions on Learning Technologies 14.5 IEEE, 2021, pp. 624–639
- Shima Imani, Liang Du and Harsh Shrivastava “Mathprompter: Mathematical reasoning using large language models” In arXiv preprint arXiv:2303.05398, 2023
- “Large Language Models for Mathematical Reasoning: Progresses and Challenges” In arXiv preprint arXiv:2402.00157, 2024
- “Time-llm: Time series forecasting by reprogramming large language models” In arXiv preprint arXiv:2310.01728, 2023
- “Large language models are zero-shot time series forecasters” In arXiv preprint arXiv:2310.07820, 2023
- “Large Language Models for Time Series: A Survey” In arXiv preprint arXiv:2402.01801, 2024
- “Open-ended knowledge tracing for computer science education” In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022, pp. 3849–3862
- Teo Susnjak “Beyond Predictive Learning Analytics Modelling and onto Explainable Artificial Intelligence with Prescriptive Analytics and ChatGPT” In International Journal of Artificial Intelligence in Education Springer, 2023, pp. 1–31
- “Gpt-4 technical report” In arXiv preprint arXiv:2303.08774, 2023
- Michael V Yudelson, Kenneth R Koedinger and Geoffrey J Gordon “Individualized bayesian knowledge tracing models” In Artificial Intelligence in Education: 16th International Conference, AIED 2013, Memphis, TN, USA, July 9-13, 2013. Proceedings 16, 2013, pp. 171–180 Springer
- Philip I Pavlik Jr, Hao Cen and Kenneth R Koedinger “Performance Factors Analysis–A New Alternative to Knowledge Tracing.” In Online Submission ERIC, 2009
- Andrew S Lan, Christoph Studer and Richard G Baraniuk “Quantized matrix completion for personalized learning” In arXiv preprint arXiv:1412.5968, 2014
- “Rank-based tensor factorization for student performance prediction” In 12th International Conference on Educational Data Mining (EDM), 2019
- “Xgboost: extreme gradient boosting” In R package version 0.4-2 1.4, 2015, pp. 1–4
- Amal Asselman, Mohamed Khaldi and Souhaib Aammou “Enhancing the prediction of student performance based on the machine learning XGBoost algorithm” In Interactive Learning Environments 31.6 Taylor & Francis, 2023, pp. 3360–3379
- “Reading comprehension lessons in AutoTutor for the Center for the Study of Adult Literacy” In Adaptive educational technologies for literacy instruction Routledge, 2016, pp. 288–293
- “Educational technologies that support reading comprehension for adults who have low literacy skills” In The Wiley handbook of adult literacy Wiley Online Library, 2019, pp. 471–493
- Arthur C Graesser, Carol M Forsyth and Blair A Lehman “Two heads may be better than one: Learning from computer agents in conversational trialogues” In Teachers College Record 119.3 SAGE Publications Sage CA: Los Angeles, CA, 2017, pp. 1–20
- “Patterns of adults with low literacy skills interacting with an intelligent tutoring system” In International Journal of Artificial Intelligence in Education Springer, 2022, pp. 1–26
- “Clustering the Learning Patterns of Adults with Low Literacy Skills Interacting with an Intelligent Tutoring System.” In Grantee Submission ERIC, 2018
- “Using an adaptive intelligent tutoring system to promote learning affordances for adults with low literacy skills” In Adaptive Instructional Systems: First International Conference, AIS 2019, Held as Part of the 21st HCI International Conference, HCII 2019, Orlando, FL, USA, July 26–31, 2019, Proceedings 21, 2019, pp. 327–339 Springer
- “Technology use and integration in adult education and literacy classrooms”, 2019
- “Assessment with computer agents that engage in conversational dialogues and trialogues with learners” In Computers in Human Behavior 76 Elsevier, 2017, pp. 607–616
- Arthur C Graesser, Xiangen Hu and Danielle S McNamara “Computerized Learning Environments That Incorporate Research in Discourse Psychology, Cognitive Science, and Computational Linguistics.” American Psychological Association, 2005
- “Exploring an intelligent tutoring system as a conversation-based assessment tool for reading comprehension” In Behaviormetrika 45 Springer, 2018, pp. 615–633
- “Exploring the Individual Differences in Multidimensional Evolution of Knowledge States of Learners” In International Conference on Human-Computer Interaction, 2023, pp. 265–284 Springer
- Joseph E Beck and Yue Gong “Wheel-spinning: Students who fail to master a skill” In Artificial Intelligence in Education: 16th International Conference, AIED 2013, Memphis, TN, USA, July 9-13, 2013. Proceedings 16, 2013, pp. 431–440 Springer
- Yue Gong “Student modeling in intelligent tutoring systems”, 2014
- Michel C Desmarais and Ryan SJ d Baker “A review of recent advances in learner and skill modeling in intelligent learning environments” In User Modeling and User-Adapted Interaction 22 Springer, 2012, pp. 9–38
- “A Review of Learner Models Used in Intelligent Tutoring Systems” In Design Recommendations for Intelligent Tutoring Systems: Volume 1-Learner Modeling 1 US Army Research Laboratory, 2013, pp. 39
- Philip Irvin Pavlik Jr and Luke G Eglington “Automated Search Improves Logistic Knowledge Tracing, Surpassing Deep Learning in Accuracy and Explainability” In Journal of Educational Data Mining 15.3, 2023, pp. 58–86
- “Instructional Factors Analysis: A Cognitive Model For Multiple Instructional Interventions.” In EDM 2011 Citeseer, 2011, pp. 61–70
- Radek Pelánek “Bayesian knowledge tracing, logistic models, and beyond: an overview of learner modeling techniques” In User Modeling and User-Adapted Interaction 27 Springer, 2017, pp. 313–350
- “Sparse factor analysis for learning and content analytics” In arXiv preprint arXiv:1303.5685, 2013
- Jie Huang and Kevin Chen-Chuan Chang “Towards reasoning in large language models: A survey” In arXiv preprint arXiv:2212.10403, 2022
- “Chain-of-thought prompting elicits reasoning in large language models” In Advances in Neural Information Processing Systems 35, 2022, pp. 24824–24837
- “Can large language models provide feedback to students? A case study on ChatGPT” In 2023 IEEE International Conference on Advanced Learning Technologies (ICALT), 2023, pp. 323–325 IEEE
- “Ruffle&Riley: Towards the Automated Induction of Conversational Tutoring Systems” In arXiv preprint arXiv:2310.01420, 2023
- “Improving Assessment of Tutoring Practices using Retrieval-Augmented Generation” In arXiv preprint arXiv:2402.14594, 2024
- Chee Wei Tan “Large Language Model-Driven Classroom Flipping: Empowering Student-Centric Peer Questioning with Flipped Interaction” In arXiv preprint arXiv:2311.14708, 2023
- “Evaluating reading comprehension exercises generated by LLMs: A showcase of ChatGPT in education applications” In Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023), 2023, pp. 610–625
- “Evaluating the logical reasoning ability of chatgpt and gpt-4” In arXiv preprint arXiv:2304.03439, 2023
- Ghodai Abdelrahman, Qing Wang and Bernardo Nunes “Knowledge tracing: A survey” In ACM Computing Surveys 55.11 ACM New York, NY, 2023, pp. 1–37
- Zachary A Pardos and Neil T Heffernan “Modeling individualization in a bayesian networks implementation of knowledge tracing” In International conference on user modeling, adaptation, and personalization, 2010, pp. 255–266 Springer
- Zachary A Pardos and Neil T Heffernan “KT-IDEM: Introducing item difficulty to the knowledge tracing model” In User Modeling, Adaption and Personalization: 19th International Conference, UMAP 2011, Girona, Spain, July 11-15, 2011. Proceedings 19, 2011, pp. 243–254 Springer
- Michael Yudelson, Philip I Pavlik and Kenneth R Koedinger “User modeling–a notoriously black art” In User Modeling, Adaption and Personalization: 19th International Conference, UMAP 2011, Girona, Spain, July 11-15, 2011. Proceedings 19, 2011, pp. 317–328 Springer
- Luke G Eglington and Philip I Pavlik Jr “How to optimize student learning using student models that adapt rapidly to individual differences” In International Journal of Artificial Intelligence in Education Springer, 2022, pp. 1–22
- Yue Gong, Joseph E Beck and Neil T Heffernan “Comparing knowledge tracing and performance factor analysis by using multiple model fitting procedures” In Intelligent Tutoring Systems: 10th International Conference, ITS 2010, Pittsburgh, PA, USA, June 14-18, 2010, Proceedings, Part I 10, 2010, pp. 35–44 Springer
- “Knowledge tracing for complex problem solving: Granular rank-based tensor factorization” In Proceedings of the 29th ACM conference on user modeling, adaptation and personalization, 2021, pp. 179–188
- “3DG: A Framework for Using Generative AI for Handling Sparse Learner Performance Data From Intelligent Tutoring Systems” In arXiv preprint arXiv:2402.01746, 2024
- “An XGBoost-Based Knowledge Tracing Modelf” In International Journal of Computational Intelligence Systems 16.1 Springer, 2023, pp. 13
- “Xgboost: A scalable tree boosting system” In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 2016, pp. 785–794
- “Going deeper with deep knowledge tracing.” In International Educational Data Mining Society ERIC, 2016
- “When is deep learning the best approach to knowledge tracing?” In Journal of Educational Data Mining 12.3, 2020, pp. 31–54
- Rob J Hyndman and Anne B Koehler “Another look at measures of forecast accuracy” In International journal of forecasting 22.4 Elsevier, 2006, pp. 679–688
- “Deep knowledge tracing” In Advances in neural information processing systems 28, 2015
- “A self-attentive model for knowledge tracing” In arXiv preprint arXiv:1907.06837, 2019
- “Dynamic key-value memory networks for knowledge tracing” In Proceedings of the 26th international conference on World Wide Web, 2017, pp. 765–774
- Kenneth R Koedinger, Albert T Corbett and Charles Perfetti “The Knowledge-Learning-Instruction framework: Bridging the science-practice chasm to enhance robust student learning” In Cognitive science 36.5 Wiley Online Library, 2012, pp. 757–798
- Philip I Pavlik Jr, Luke G Eglington and Liang Zhang “Automatic Domain Model Creation and Improvement.” In Grantee Submission ERIC, 2021
- Faruk Ahmed, Keith Shubeck and Xiangen Hu “Chatgpt in the generalized intelligent framework for tutoring” In Proceedings of the 11th Annual Generalized Intelligent Framework for Tutoring (GIFT) Users Symposium (GIFTSym11), 2023, pp. 109 US Army Combat Capabilities Development Command–Soldier Center
- Liang Zhang (357 papers)
- Jionghao Lin (36 papers)
- Conrad Borchers (27 papers)
- John Sabatini (4 papers)
- John Hollander (4 papers)
- Meng Cao (107 papers)
- Xiangen Hu (11 papers)