Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
184 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Zero-1-to-3: Domain-level Zero-shot Cognitive Diagnosis via One Batch of Early-bird Students towards Three Diagnostic Objectives (2312.13434v3)

Published 20 Dec 2023 in cs.AI and cs.IR

Abstract: Cognitive diagnosis seeks to estimate the cognitive states of students by exploring their logged practice quiz data. It plays a pivotal role in personalized learning guidance within intelligent education systems. In this paper, we focus on an important, practical, yet often underexplored task: domain-level zero-shot cognitive diagnosis (DZCD), which arises due to the absence of student practice logs in newly launched domains. Recent cross-domain diagnostic models have been demonstrated to be a promising strategy for DZCD. These methods primarily focus on how to transfer student states across domains. However, they might inadvertently incorporate non-transferable information into student representations, thereby limiting the efficacy of knowledge transfer. To tackle this, we propose Zero-1-to-3, a domain-level zero-shot cognitive diagnosis framework via one batch of early-bird students towards three diagnostic objectives. Our approach initiates with pre-training a diagnosis model with dual regularizers, which decouples student states into domain-shared and domain-specific parts. The shared cognitive signals can be transferred to the target domain, enriching the cognitive priors for the new domain, which ensures the cognitive state propagation objective. Subsequently, we devise a strategy to generate simulated practice logs for cold-start students through analyzing the behavioral patterns from early-bird students, fulfilling the domain-adaption goal. Consequently, we refine the cognitive states of cold-start students as diagnostic outcomes via virtual data, aligning with the diagnosis-oriented goal. Finally, extensive experiments on six real-world datasets highlight the efficacy of our model for DZCD and its practical application in question recommendation. The code is publicly available at https://github.com/bigdata-ustc/Zero-1-to-3.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (35)
  1. BETA-CD: A Bayesian Meta-Learned Cognitive Diagnosis Framework for Personalized Learning. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, 5018–5026.
  2. Prerequisite-driven deep knowledge tracing. In 2018 IEEE International Conference on Data Mining (ICDM), 39–48. IEEE.
  3. Cerberus transformer: Joint semantic, affordance and attribute parsing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 19649–19658.
  4. Disentangling Cognitive Diagnosis with Limited Exercise Labels. In Thirty-seventh Conference on Neural Information Processing Systems.
  5. De La Torre, J. 2009. DINA model and parameter estimation: A didactic. Journal of educational and behavioral statistics, 34(1): 115–130.
  6. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.
  7. Item response theory. Psychology Press.
  8. Rcd: Relation map driven cognitive diagnosis for intelligent education systems. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 501–510.
  9. Leveraging Transferable Knowledge Concept Graph Embedding for Cold-Start Cognitive Diagnosis. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 983–992.
  10. Conet: Collaborative cross networks for cross-domain recommendation. In Proceedings of the 27th ACM international conference on information and knowledge management, 667–676.
  11. Exploring multi-objective exercise recommendations in online education systems. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 1261–1270.
  12. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
  13. Meta Multi-Agent Exercise Recommendation: A Game Application Perspective.
  14. Ekt: Exercise-aware knowledge tracing for student performance prediction. IEEE Transactions on Knowledge and Data Engineering, 33(1): 100–115.
  15. Homogeneous Cohort-Aware Group Cognitive Diagnosis: A Multi-grained Modeling Perspective. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 4094–4098.
  16. Improving knowledge tracing with collaborative information. In Proceedings of the fifteenth ACM international conference on web search and data mining, 599–607.
  17. Nguyen, T. 2015. The effectiveness of online learning: Beyond no significant difference and future horizons. MERLOT Journal of online learning and teaching, 11(2): 309–319.
  18. Reckase, M. D. 2009. Multidimensional item response theory models. In Multidimensional item response theory, 79–112. Springer.
  19. BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, 452–461.
  20. Transferable Student Performance Modeling for Intelligent Tutoring Systems. arXiv preprint arXiv:2202.03980.
  21. Incremental Cognitive Diagnosis for Intelligent Education. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 1760–1770.
  22. Deep-IRT with Independent Student and Item Networks. International Educational Data Mining Society.
  23. Visualizing data using t-SNE. Journal of machine learning research, 9(11).
  24. A Preference Learning Decoupling Framework for User Cold-Start Recommendation. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 1168–1177.
  25. Neural cognitive diagnosis for intelligent education systems. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, 6153–6161.
  26. Hypersorec: Exploiting hyperbolic user and item representations with multiple aspects for social-aware recommendation. ACM Transactions on Information Systems (TOIS), 40(2): 1–28.
  27. Corporate Relative Valuation Using Heterogeneous Multi-Modal Graph Neural Network. IEEE Trans. Knowl. Data Eng., 35(1): 211–224.
  28. Deep Learning for Fixed Model Reuse. In Proceedings of the 31 Conference on Artificial Intelligence, 2831–2837. San Francisco, California.
  29. Exploiting non-interactive exercises in cognitive diagnosis. Interaction, 100(200): 300.
  30. APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM ’23, 3009–3019. New York, NY, USA: Association for Computing Machinery. ISBN 9798400701245.
  31. FedJudge: Federated Legal Large Language Model. arXiv preprint arXiv:2309.08173.
  32. FairLISA: Fair User Modeling with Limited Sensitive Attributes Information. In Thirty-seventh Conference on Neural Information Processing Systems.
  33. Physics inspired optimization on semantic transfer features: An alternative method for room layout estimation. In Proceedings of the IEEE conference on computer vision and pattern recognition, 10–18.
  34. Domain Disentanglement with Interpolative Data Augmentation for Dual-Target Cross-Domain Recommendation. arXiv preprint arXiv:2307.13910.
  35. A Bounded Ability Estimation for Computerized Adaptive Testing. In Thirty-seventh Conference on Neural Information Processing Systems.
Citations (7)

Summary

We haven't generated a summary for this paper yet.