A Question-centric Multi-experts Contrastive Learning Framework for Improving the Accuracy and Interpretability of Deep Sequential Knowledge Tracing Models (2403.07322v3)
Abstract: Knowledge tracing (KT) plays a crucial role in predicting students' future performance by analyzing their historical learning processes. Deep neural networks (DNNs) have shown great potential in solving the KT problem. However, there still exist some important challenges when applying deep learning techniques to model the KT process. The first challenge lies in taking the individual information of the question into modeling. This is crucial because, despite questions sharing the same knowledge component (KC), students' knowledge acquisition on homogeneous questions can vary significantly. The second challenge lies in interpreting the prediction results from existing deep learning-based KT models. In real-world applications, while it may not be necessary to have complete transparency and interpretability of the model parameters, it is crucial to present the model's prediction results in a manner that teachers find interpretable. This makes teachers accept the rationale behind the prediction results and utilize them to design teaching activities and tailored learning strategies for students. However, the inherent black-box nature of deep learning techniques often poses a hurdle for teachers to fully embrace the model's prediction results. To address these challenges, we propose a Question-centric Multi-experts Contrastive Learning framework for KT called Q-MCKT. We have provided all the datasets and code on our website at https://github.com/rattlesnakey/Q-MCKT.
- Ghodai Abdelrahman and Qing Wang. 2019. Knowledge tracing with sequential key-value memory networks. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 175–184.
- A mixture-of-experts model for learning multi-facet entity embeddings. In Proceedings of the 28th International Conference on Computational Linguistics. 5124–5135.
- Efficient large scale language modeling with mixtures of experts. arXiv preprint arXiv:2112.10684 (2021).
- More accurate student modeling through contextual estimation of slip and guess probabilities in bayesian knowledge tracing. In Intelligent Tutoring Systems: 9th International Conference, ITS 2008, Montreal, Canada, June 23-27, 2008 Proceedings 9. Springer, 406–415.
- Learning factors analysis–a general method for cognitive model evaluation and improvement. In International conference on intelligent tutoring systems. Springer, 164–175.
- Improving interpretability of deep sequential knowledge tracing models with question-centric cognitive representations. arXiv preprint arXiv:2302.06885 (2023).
- Prerequisite-driven deep knowledge tracing. In 2018 IEEE International Conference on Data Mining. IEEE, 39–48.
- Towards an appropriate query, key, and value computation for knowledge tracing. In Proceedings of the Seventh ACM Conference on Learning@ Scale. 341–344.
- Ednet: A large-scale hierarchical dataset in education. In Artificial Intelligence in Education: 21st International Conference, AIED 2020, Ifrane, Morocco, July 6–10, 2020, Proceedings, Part II 21. Springer, 69–73.
- Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 (2014).
- Incorporating item response theory into knowledge tracing. In International Conference on Artificial Intelligence in Education. Springer, 114–118.
- Albert T Corbett and John R Anderson. 1994a. Knowledge tracing: Modeling the acquisition of procedural knowledge. User modeling and user-adapted interaction 4 (1994), 253–278.
- Albert T Corbett and John R Anderson. 1994b. Knowledge tracing: Modeling the acquisition of procedural knowledge. User modeling and user-adapted interaction 4 (1994), 253–278.
- Glam: Efficient scaling of language models with mixture-of-experts. In International Conference on Machine Learning. PMLR, 5547–5569.
- Switch transformers: Scaling to trillion parameter models with simple and efficient sparsity. The Journal of Machine Learning Research 23, 1 (2022), 5232–5270.
- Simcse: Simple contrastive learning of sentence embeddings. arXiv preprint arXiv:2104.08821 (2021).
- Context-aware attentive knowledge tracing. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2330–2339.
- Supervised contrastive learning for pre-trained language model fine-tuning. arXiv preprint arXiv:2011.01403 (2020).
- Enhancing Knowledge Tracing via Adversarial Training. In Proceedings of the 29th ACM International Conference on Multimedia. 367–375.
- Learning Bayesian knowledge tracing parameters with a knowledge heuristic and empirical probabilities. In Intelligent Tutoring Systems: 12th International Conference, ITS 2014, Honolulu, HI, USA, June 5-9, 2014. Proceedings 12. Springer, 150–155.
- Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 9729–9738.
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735–1780.
- Towards Robust Knowledge Tracing Models via k-Sparse Attention. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2441–2445.
- Adaptive mixtures of local experts. Neural computation 3, 1 (1991), 79–87.
- Dynamic Bayesian networks for student modeling. IEEE Transactions on Learning Technologies 10, 4 (2017), 450–462.
- Diederik P Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In International Conference on Learning Representations.
- Andrew S Lan and Richard G Baraniuk. 2016. A Contextual Bandits Framework for Personalized Learning Action Selection.. In EDM. 424–429.
- Adaptive gamification for learning environments. IEEE Transactions on Learning Technologies 12, 1 (2018), 16–28.
- Jinseok Lee and Dit-Yan Yeung. 2019. Knowledge query network for knowledge tracing: How knowledge interacts with skills. In Proceedings of the 9th International Conference on Learning Analytics & Knowledge. 491–500.
- Contrastive learning for knowledge tracing. In Proceedings of the ACM Web Conference 2022. 2330–2338.
- Gshard: Scaling giant models with conditional computation and automatic sharding. arXiv preprint arXiv:2006.16668 (2020).
- Base layers: Simplifying training of large, sparse models. In International Conference on Machine Learning. PMLR, 6265–6274.
- Oscar: Object-semantics aligned pre-training for vision-language tasks. In European Conference on Computer Vision. Springer, 121–137.
- Credit risk and limits forecasting in e-commerce consumer lending service via multi-view-aware mixture-of-experts nets. In Proceedings of the 14th ACM international conference on web search and data mining. 229–237.
- Ekt: Exercise-aware knowledge tracing for student performance prediction. IEEE Transactions on Knowledge and Data Engineering 33, 1 (2019), 100–115.
- Enhancing deep knowledge tracing with auxiliary tasks. In Proceedings of the ACM Web Conference 2023. 4178–4187.
- simpleKT: a simple but tough-to-beat baseline for knowledge tracing. arXiv preprint arXiv:2302.06881 (2023).
- pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models. In Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track.
- Tracing Knowledge State with Individual Cognition and Acquisition Estimation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 173–182.
- Tracing knowledge state with individual cognition and acquisition estimation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 173–182.
- Interpreting Deep Learning Models for Knowledge Tracing. International Journal of Artificial Intelligence in Education (2022), 1–24.
- Towards interpretable deep learning models for knowledge tracing. In International Conference on Artificial Intelligence in Education. Springer, 185–190.
- Deep knowledge tracing and dynamic student classification for knowledge tracing. In 2018 IEEE International Conference on Data Mining. IEEE, 1182–1187.
- Revealing the learning in learning curves. In International Conference on Artificial Intelligence in Education. Springer, 473–482.
- Augmenting knowledge tracing by considering forgetting behavior. In The World Wide Web Conference. 3101–3107.
- Augmenting knowledge tracing by considering forgetting behavior. In The world wide web conference. 3101–3107.
- Graph-based knowledge tracing: modeling student proficiency using graph neural network. In 2019 IEEE/WIC/ACM International Conference on Web Intelligence. IEEE, 156–163.
- Shalini Pandey and George Karypis. 2019. A self-attentive model for knowledge tracing. In 12th International Conference on Educational Data Mining. International Educational Data Mining Society, 384–389.
- Shalini Pandey and Jaideep Srivastava. 2020a. RKT: relation-aware self-attention for knowledge tracing. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 1205–1214.
- Shalini Pandey and Jaideep Srivastava. 2020b. RKT: relation-aware self-attention for knowledge tracing. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 1205–1214.
- Zachary A Pardos and Neil T Heffernan. 2011. KT-IDEM: Introducing item difficulty to the knowledge tracing model. In User Modeling, Adaption and Personalization: 19th International Conference, UMAP 2011, Girona, Spain, July 11-15, 2011. Proceedings 19. Springer, 243–254.
- Deep knowledge tracing. Advances in Neural Information Processing Systems 28 (2015).
- Learning program embeddings to propagate feedback on student code. In International conference on machine Learning. PMLR, 1093–1102.
- Deep knowledge tracing with transformers. In International Conference on Artificial Intelligence in Education. Springer, 252–256.
- EAKT: Embedding Cognitive Framework with Attention for Interpretable Knowledge Tracing. Scientific Reports (2022).
- Learning transferable visual models from natural language supervision. In International Conference on Machine Learning. PMLR, 8748–8763.
- Georg Rasch. 1993. Probabilistic models for some intelligence and attainment tests. ERIC.
- Scaling vision with sparse mixture of experts. Advances in Neural Information Processing Systems 34 (2021), 8583–8595.
- Max Ryabinin and Anton Gusev. 2020. Towards crowdsourced training of large neural networks using decentralized mixture-of-experts. Advances in Neural Information Processing Systems 33 (2020), 3659–3672.
- Learning Process-consistent Knowledge Tracing. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 1452–1460.
- qDKT: Question-centric deep knowledge tracing. In Proceedings of The 13th International Conference on Educational Data Mining (EDM 2020). 677–681.
- Exercise-enhanced sequential modeling for student performance prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.
- Factorization techniques for predicting student performance. In Educational recommender systems and technologies: Practices and challenges. IGI Global, 129–153.
- Structure-based Knowledge Tracing: An Influence Propagation View. In 2020 IEEE International Conference on Data Mining. IEEE, 541–550.
- Jill-Jênn Vie and Hisashi Kashima. 2019. Knowledge tracing machines: Factorization machines for knowledge tracing. In Proceedings of the AAAI conference on artificial intelligence, Vol. 33. 750–757.
- Neural cognitive diagnosis for intelligent education systems. In Proceedings of the AAAI Conference on Artificial Intelligence. 6153–6161.
- Instructions and Guide for Diagnostic Questions: The NeurIPS 2020 Education Challenge. ArXiv preprint abs/2007.12061 (2020). https://arxiv.org/abs/2007.12061
- Beverly Park Woolf. 2010. Building intelligent interactive tutors: Student-centered strategies for revolutionizing e-learning. Morgan Kaufmann.
- Exercise recommendation based on knowledge concept prediction. Knowledge-Based Systems 210 (2020), 106481.
- GIKT: a graph-based interaction model for knowledge tracing. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 299–315.
- Chun-Kit Yeung. 2019a. Deep-IRT: Make deep learning based knowledge tracing explainable using item response theory. In Proceedings of The 12th International Conference on Educational Data Mining (EDM 2019). 683–686.
- Chun-Kit Yeung. 2019b. Deep-IRT: Make deep learning based knowledge tracing explainable using item response theory. arXiv preprint arXiv:1904.11738 (2019).
- Chun-Kit Yeung and Dit-Yan Yeung. 2018a. Addressing two problems in deep knowledge tracing via prediction-consistent regularization. In Proceedings of the Fifth Annual ACM Conference on Learning at Scale. 1–10.
- Chun-Kit Yeung and Dit-Yan Yeung. 2018b. Addressing two problems in deep knowledge tracing via prediction-consistent regularization. In Proceedings of the Fifth Annual ACM Conference on Learning at Scale. 1–10.
- Individualized bayesian knowledge tracing models. In Artificial Intelligence in Education: 16th International Conference, AIED 2013, Memphis, TN, USA, July 9-13, 2013. Proceedings 16. Springer, 171–180.
- Assisting Language Learners: Automated Trans-Lingual Definition Generation via Contrastive Prompt Learning. arXiv preprint arXiv:2306.06058 (2023).
- Fine-grained Contrastive Learning for Definition Generation. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing. 1001–1012.
- Dynamic key-value memory networks for knowledge tracing. In Proceedings of the 26th International Conference on World Wide Web. 765–774.
- Multi-Factors Aware Dual-Attentional Knowledge Tracing. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 2588–2597.
- Interpretable personalized knowledge tracing and next learning activity recommendation. In Proceedings of the Seventh ACM Conference on Learning@Scale. 325–328.
- Hengyuan Zhang (34 papers)
- Zitao Liu (76 papers)
- Chenming Shang (9 papers)
- Dawei Li (75 papers)
- Yong Jiang (195 papers)