Unsupervised ID-based Algorithms for Comparing Models, Task Complexity, and Generated Text
Develop an unsupervised algorithm that leverages intrinsic dimension (ID) estimates of hidden representations in transformer-based large language models to perform (i) model comparison, (ii) task complexity comparison across datasets, and (iii) generated text comparison, operationalizing ID as the core metric for these comparative analyses.
Sponsor
References
- Though our work highlights the ID estimates showing a strong relation with model generalization, exploiting them to develop a concrete unsupervised algorithm for model comparison/task complexity comparison and generated text comparison remains open for future avenues.
— Geometry of Decision Making in Language Models
(2511.20315 - Joshi et al., 25 Nov 2025) in Appendix, Section "Additional Results, Discussion and Future Directions" — Future Directions, Item 6