Justify the use of exactly two latent features in HRM
Determine and rigorously justify the design choice of using exactly two latent feature vectors (z_L and z_H) in the Hierarchical Reasoning Model (HRM), rather than one, three, or more latent features, by establishing theoretical criteria or empirical evidence that specify when two features are optimal and how alternative numbers of features affect performance and learning dynamics.
Sponsor
References
Furthermore, it is not clear why they use two latent features rather than other combinations of features.
— Less is More: Recursive Reasoning with Tiny Networks
(2510.04871 - Jolicoeur-Martineau, 6 Oct 2025) in Section 3.3 (Hierarchical interpretation based on complex biological arguments)