Objective, algorithmic classification of natural-language rules
Develop an objective, algorithmic method to classify human- and machine-generated natural-language rules for ConceptARC tasks into the categories correct-intended, correct-unintended, and incorrect, so that the classification no longer depends on manual, partly subjective human judgment.
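To make the target of such a method concrete, the three-way labeling can be sketched as a small decision procedure. This is only an illustrative sketch under an assumed reading of the categories: a rule is incorrect if it does not account for the task's demonstrations, correct-intended if it does and also captures the intended concept, and correct-unintended if it accounts for the demonstrations by some other means. The names `RuleCategory`, `RuleJudgment`, `fits_demonstrations`, `captures_intended_concept`, and `classify_rule` are hypothetical and do not come from the source. Producing the two boolean inputs objectively is precisely the part that remains open; the sketch only shows how they would combine once available.

```python
from dataclasses import dataclass
from enum import Enum


class RuleCategory(Enum):
    """The three categories named in the problem statement."""
    CORRECT_INTENDED = "correct-intended"
    CORRECT_UNINTENDED = "correct-unintended"
    INCORRECT = "incorrect"


@dataclass(frozen=True)
class RuleJudgment:
    """Hypothetical inputs to the labeling step (assumptions, not from the source).

    In the source these judgments were made manually; an objective method
    would have to compute them algorithmically.
    """
    fits_demonstrations: bool        # does the rule account for the task's demonstrations?
    captures_intended_concept: bool  # does the rule express the intended abstraction?


def classify_rule(judgment: RuleJudgment) -> RuleCategory:
    """Combine the two judgments into one of the three categories."""
    if not judgment.fits_demonstrations:
        # A rule that fails on the demonstrations is incorrect,
        # regardless of which concept it mentions.
        return RuleCategory.INCORRECT
    if judgment.captures_intended_concept:
        return RuleCategory.CORRECT_INTENDED
    # Accounts for the demonstrations, but not via the intended concept.
    return RuleCategory.CORRECT_UNINTENDED
```

For example, `classify_rule(RuleJudgment(fits_demonstrations=True, captures_intended_concept=False))` yields `RuleCategory.CORRECT_UNINTENDED`. The ordering of the checks reflects the assumption that correctness on the demonstrations is decided first and intendedness only distinguishes among correct rules.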
References
Our classification of human- and machine-generated rules was done manually, and involved some subjectivity; we do not know of any objective or algorithmic means to usefully classify these natural-language rules into our various categories.
Source: Beger et al., "Do AI Models Perform Human-like Abstract Reasoning Across Modalities?" (arXiv:2510.02125, 2 Oct 2025), Section: Limitations