Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Predictive, scalable and interpretable knowledge tracing on structured domains (2403.13179v1)

Published 19 Mar 2024 in cs.LG, cs.CY, and stat.ML

Abstract: Intelligent tutoring systems optimize the selection and timing of learning materials to enhance understanding and long-term retention. This requires estimates of both the learner's progress (''knowledge tracing''; KT), and the prerequisite structure of the learning domain (''knowledge mapping''). While recent deep learning models achieve high KT accuracy, they do so at the expense of the interpretability of psychologically-inspired models. In this work, we present a solution to this trade-off. PSI-KT is a hierarchical generative approach that explicitly models how both individual cognitive traits and the prerequisite structure of knowledge influence learning dynamics, thus achieving interpretability by design. Moreover, by using scalable Bayesian inference, PSI-KT targets the real-world need for efficient personalization even with a growing body of learners and learning histories. Evaluated on three datasets from online learning platforms, PSI-KT achieves superior multi-step predictive accuracy and scalable inference in continual-learning settings, all while providing interpretable representations of learner-specific traits and the prerequisite structure of knowledge that causally supports learning. In sum, predictive, scalable and interpretable knowledge tracing with solid knowledge mapping lays a key foundation for effective personalized learning to make education accessible to a broad, global audience.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (70)
  1. Terry A Ackerman. Multidimensional item response theory models. Wiley StatsRef: Statistics Reference Online, 2014.
  2. Hagai Attias. A variational bayesian framework for graphical models. Advances in neural information processing systems, 12, 1999.
  3. The form of the forgetting curve and the fate of memories. Journal of Mathematical Psychology, 55:25–35, 02 2011.
  4. More accurate student modeling through contextual estimation of slip and guess probabilities in bayesian knowledge tracing. In Intelligent Tutoring Systems: 9th International Conference, ITS 2008, Montreal, Canada, June 23-27, 2008 Proceedings 9, pp.  406–415. Springer, 2008.
  5. A network neuroscience of human learning: potential to inform quantitative theories of brain and behavior. Trends in cognitive sciences, 21(4):250–264, 2017.
  6. The variational bayesian em algorithm for incomplete data: with application to scoring graphical model structures. Bayesian statistics, 7:453–464, 2003.
  7. Representation learning: A review and new perspectives. IEEE transactions on pattern analysis and machine intelligence, 35(8):1798–1828, 2013.
  8. Variational inference: A review for statisticians. Journal of the American statistical Association, 112(518):859–877, 2017.
  9. Exploration beyond bandits. In The Drive for Knowledge: The Science of Human Information-Seeking, pp.  147–168. Cambridge University Press, 2022.
  10. Empirical analysis of predictive algorithms for collaborative filtering. arXiv preprint arXiv:1301.7363, 2013.
  11. Leslie Burkholder. Equipoise and ethics in educational research. Theory and Research in Education, 19(1):65–77, 2021. doi: 10.1177/14778785211009105.
  12. Modeling exercise relationships in e-learning: A unified approach. In EDM, pp.  532–535, 2015.
  13. Improving interpretability of deep sequential knowledge tracing models with question-centric cognitive representations. arXiv preprint arXiv:2302.06885, 2023.
  14. Towards an appropriate query, key, and value computation for knowledge tracing. In Proceedings of the seventh ACM conference on learning@ scale, pp.  341–344, 2020.
  15. Knowledge tracing: Modeling the acquisition of procedural knowledge. User modeling and user-adapted interaction, 4:253–278, 1994.
  16. Curriculum learning for human compositional generalization. Proceedings of the National Academy of Sciences, 119(41):e2205582119, 2022.
  17. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (methodological), pp.  1–38, 1977.
  18. John Dewey. The child and the curriculum. University of Chicago Press Chicago, 1910.
  19. Deep unsupervised clustering with gaussian mixture variational autoencoders. arXiv preprint arXiv:1611.02648, 2016.
  20. H. Ebbinghaus. Über das Gedächtnis: Untersuchungen zur experimentellen Psychologie. Duncker & Humblot, Leipzig, 1885.
  21. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487, 2022.
  22. When is deep learning the best approach to knowledge tracing? Journal of Educational Data Mining, 12(3):31–54, 2020.
  23. Context-aware attentive knowledge tracing. In Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pp.  2330–2339, 2020.
  24. Review of causal discovery methods based on graphical models. Frontiers in genetics, 10:524, 2019.
  25. Structure and strength in causal induction. Cognitive psychology, 51(4):334–384, 2005.
  26. Theory-based causal induction. Psychological review, 116(4):661, 2009.
  27. Lilian H Hill. Concept mapping to encourage meaningful student learning. Adult Learning, 16(3-4):7–13, 2005.
  28. Long short-term memory. Neural computation, 9(8):1735–1780, 1997.
  29. Categorical reparameterization with gumbel-softmax. arXiv preprint arXiv:1611.01144, 2016.
  30. Local patterns to global architectures: influences of network topology on human learning. Trends in cognitive sciences, 20(8):629–640, 2016.
  31. Dynamic bayesian networks for student modeling. IEEE Transactions on Learning Technologies, 10(4):450–462, 2017.
  32. Disentangling by factorising. In International Conference on Machine Learning, pp. 2649–2658. PMLR, 2018.
  33. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
  34. Auto-Encoding Variational Bayes. In 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings, 2014.
  35. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907, 2016.
  36. Concept networks of students’ knowledge of relationships between physics concepts: Finding key concepts and their epistemic support. Applied network science, 3(1):1–21, 2018.
  37. Improving students’ long-term knowledge retention through personalized review. Psychological science, 25(3):639–647, 2014.
  38. Efficient neural causal discovery without acyclicity constraints. arXiv preprint arXiv:2107.10483, 2021.
  39. simplekt: a simple but tough-to-beat baseline for knowledge tracing. arXiv preprint arXiv:2302.06881, 2023.
  40. Tracing knowledge state with individual cognition and acquisition estimation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.  173–182, 2021.
  41. Generalized variational continual learning. arXiv preprint arXiv:2011.12328, 2020.
  42. Frederic M Lord. Applications of item response theory to practical testing problems. Routledge, 2012.
  43. How humans learn and represent networks. Proceedings of the National Academy of Sciences, 117(47):29407–29415, 2020.
  44. Consolidation of long-term memory: evidence and alternatives. Psychological Bulletin, 130(6):843, 2004.
  45. Interpretable knowledge tracing: Simple and efficient student modeling with causal relations. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pp.  12810–12818, 2022.
  46. Augmenting knowledge tracing by considering forgetting behavior. In The world wide web conference, pp.  3101–3107, 2019.
  47. Graph-based knowledge tracing: modeling student proficiency using graph neural network. In IEEE/WIC/ACM International Conference on Web Intelligence, pp.  156–163, 2019.
  48. Variational continual learning. arXiv preprint arXiv:1710.10628, 2017.
  49. A self-attentive model for knowledge tracing. arXiv preprint arXiv:1907.06837, 2019.
  50. Logistic knowledge tracing: A constrained framework for learner modeling. IEEE Transactions on Learning Technologies, 14(5):624–639, 2021.
  51. Performance factors analysis–a new alternative to knowledge tracing. Online Submission, 2009.
  52. Jean Piaget. Science of education and the psychology of the child. trans. d. coltman. 1970.
  53. Deep knowledge tracing. Advances in neural information processing systems, 28, 2015.
  54. Sidney L Pressey. A simple apparatus which gives tests and scores-and teaches. Sch. & Soc., 23:373–376, 1926.
  55. David E Rumelhart. Schemata: The building blocks of cognition. In Theoretical issues in reading comprehension, pp.  33–58. Routledge, 2017.
  56. Incorporating scaffolding and tutor context into bayesian knowledge tracing to predict inquiry skill acquisition. In Educational Data Mining 2013. Citeseer, 2013.
  57. Applied stochastic differential equations, volume 10. Cambridge University Press, 2019.
  58. Assistments dataset from multiple randomized controlled experiments. In Proceedings of the Third (2016) ACM Conference on Learning@ Scale, pp.  181–184, 2016.
  59. A trainable spaced repetition model for language learning. In Proceedings of the 54th annual meeting of the association for computational linguistics (volume 1: long papers), pp.  1848–1858, 2016.
  60. Learning process-consistent knowledge tracing. In Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining, pp.  1452–1460, 2021.
  61. Variational mixture-of-experts autoencoders for multi-modal deep generative models. Advances in Neural Information Processing Systems, 32, 2019.
  62. Saint+: Integrating temporal features for ednet correctness prediction. In LAK21: 11th International Learning Analytics and Knowledge Conference, pp.  490–496, 2021.
  63. Structure-based knowledge tracing: An influence propagation view. In 2020 IEEE international conference on data mining (ICDM), pp.  541–550. IEEE, 2020.
  64. Attention is all you need. Advances in neural information processing systems, 30, 2017.
  65. Evaluating the theoretic adequacy and applied potential of computational models of the spacing effect. Cognitive science, 42:644–691, 2018.
  66. Temporal cross-effects in knowledge tracing. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pp.  517–525, 2021.
  67. Diagnostic questions: The neurips 2020 education challenge. arXiv preprint arXiv:2007.12061, 2020.
  68. Genuine power curves in forgetting: A quantitative analysis of individual subject forgetting functions. Memory & cognition, 25:731–739, 1997.
  69. Embedding entities and relations for learning and inference in knowledge bases. arXiv preprint arXiv:1412.6575, 2014.
  70. Individualized bayesian knowledge tracing models. In Artificial Intelligence in Education: 16th International Conference, AIED 2013, Memphis, TN, USA, July 9-13, 2013. Proceedings 16, pp.  171–180. Springer, 2013.
Citations (5)

Summary

We haven't generated a summary for this paper yet.