Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
173 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Unified Uncertainty Estimation for Cognitive Diagnosis Models (2403.14676v1)

Published 9 Mar 2024 in cs.CY, cs.AI, and cs.LG

Abstract: Cognitive diagnosis models have been widely used in different areas, especially intelligent education, to measure users' proficiency levels on knowledge concepts, based on which users can get personalized instructions. As the measurement is not always reliable due to the weak links of the models and data, the uncertainty of measurement also offers important information for decisions. However, the research on the uncertainty estimation lags behind that on advanced model structures for cognitive diagnosis. Existing approaches have limited efficiency and leave an academic blank for sophisticated models which have interaction function parameters (e.g., deep learning-based models). To address these problems, we propose a unified uncertainty estimation approach for a wide range of cognitive diagnosis models. Specifically, based on the idea of estimating the posterior distributions of cognitive diagnosis model parameters, we first provide a unified objective function for mini-batch based optimization that can be more efficiently applied to a wide range of models and large datasets. Then, we modify the reparameterization approach in order to adapt to parameters defined on different domains. Furthermore, we decompose the uncertainty of diagnostic parameters into data aspect and model aspect, which better explains the source of uncertainty. Extensive experiments demonstrate that our method is effective and can provide useful insights into the uncertainty of cognitive diagnosis.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (38)
  1. A review of uncertainty quantification in deep learning: Techniques, applications and challenges. Information Fusion 76 (2021), 243–297.
  2. Quality meets diversity: A model-agnostic framework for computerized adaptive testing. In 2020 IEEE International Conference on Data Mining (ICDM). IEEE, 42–51.
  3. Weight uncertainty in neural network. In International conference on machine learning. PMLR, 1613–1622.
  4. Léon Bottou. 2010. Large-scale machine learning with stochastic gradient descent. In Proceedings of COMPSTAT’2010. Springer, 177–186.
  5. Real-time prediction intervals for intra-hour DNI forecasts. Renewable energy 83 (2015), 234–244.
  6. Jimmy De La Torre. 2009. DINA model and parameter estimation: A didactic. Journal of educational and behavioral statistics 34, 1 (2009), 115–130.
  7. Jimmy De La Torre and Jeffrey A Douglas. 2004. Higher-order latent trait models for cognitive diagnosis. Psychometrika 69, 3 (2004), 333–353.
  8. Gpirt: A Gaussian process model for item response theory. In Conference on uncertainty in artificial intelligence. PMLR, 520–529.
  9. Susan E Embretson and Steven P Reise. 2013. Item response theory. Psychology Press.
  10. Sentiment analysis: Bayesian ensemble learning. Decision support systems 68 (2014), 26–38.
  11. Ethan Goan and Clinton Fookes. 2020. Bayesian neural networks: An introduction and survey. Case Studies in Applied Bayesian Data Science: CIRM Jean-Morlet Chair, Fall 2018 (2020), 45–87.
  12. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In Proceedings of the IEEE international conference on computer vision. 1026–1034.
  13. Convolutional Gaussian Embeddings for Personalized Recommendation with Uncertainty. In Proceedings of the 28th International Joint Conference on Artificial Intelligence (Macao, China) (IJCAI’19). AAAI Press, 2642–2648.
  14. Neural network-based uncertainty quantification: A survey of methodologies and applications. IEEE access 6 (2018), 36218–36234.
  15. Diederik P Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In International Conference on Learning Representations.
  16. Solomon Kullback and Richard A Leibler. 1951. On information and sufficiency. The annals of mathematical statistics 22, 1 (1951), 79–86.
  17. Simple and scalable predictive uncertainty estimation using deep ensembles. Advances in neural information processing systems 30 (2017).
  18. Won-Chan Lee and Guemin Lee. 2018. IRT linking and equating. The Wiley handbook of psychometric testing: A multidisciplinary reference on survey, scale and test development (2018), 639–673.
  19. Jacqueline Leighton and Mark Gierl. 2007. Cognitive diagnostic assessment for education: Theory and applications. Cambridge University Press.
  20. HierCDF: A Bayesian Network-based Hierarchical Cognitive Diagnosis Framework. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 904–913.
  21. Classification-oriented dawid skene model for transferring intelligence from crowds to machines. Frontiers of Computer Science 17, 5 (2023), 175332.
  22. I Lira and D Grientschnig. 2010. Bayesian assessment of uncertainty in metrology: a tutorial. Metrologia 47, 3 (2010), R1.
  23. Fuzzy cognitive diagnosis for modelling examinee performance. ACM Transactions on Intelligent Systems and Technology (TIST) 9, 4 (2018), 1–26.
  24. Knowledge-Sensed Cognitive Diagnosis for Intelligent Education Platforms. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 1451–1460.
  25. Bootstrap standard errors for maximum likelihood ability estimates when item parameters are unknown. Educational and Psychological Measurement 74, 4 (2014), 697–712.
  26. Richard J Patz and Brian W Junker. 1999. A straightforward approach to Markov chain Monte Carlo methods for item response models. Journal of educational and behavioral Statistics 24, 2 (1999), 146–178.
  27. Mark D Reckase. 2009. Multidimensional item response theory models. In Multidimensional Item Response Theory. Springer, 79–112.
  28. Modern psychometric methodology: Applications of item response theory. Rehabilitation Counseling Bulletin 50, 3 (2007), 177–188.
  29. Addressing model uncertainty in item response theory person scores through model averaging. Behaviormetrika 45, 2 (2018), 495–503.
  30. The k-tied normal distribution: A compact parameterization of Gaussian mean field posteriors in Bayesian neural networks. In International Conference on Machine Learning. PMLR, 9289–9299.
  31. Michael L Thomas. 2011. The value of item response theory in clinical assessment: a review. Assessment 18, 3 (2011), 291–307.
  32. Neural cognitive diagnosis for intelligent education systems. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 6153–6161.
  33. NeuralCD: A General Framework for Cognitive Diagnosis. IEEE Transactions on Knowledge and Data Engineering (2022).
  34. Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks. Neurocomputing 338 (2019), 34–45.
  35. Using Knowledge Concept Aggregation towards Accurate Cognitive Diagnosis. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 2010–2019.
  36. Whose vote should count more: Optimal integration of labels from labelers of unknown expertise. Advances in neural information processing systems 22 (2009).
  37. Characterizing sources of uncertainty in item response theory scale scores. Educational and psychological measurement 72, 2 (2012), 264–290.
  38. Jerrold H Zar. 2005. Spearman rank correlation. Encyclopedia of biostatistics 7 (2005).
Citations (1)

Summary

We haven't generated a summary for this paper yet.