Identify the causes of failure episodes labeled as “Unknown” in BEHAVIOR-1K evaluation
Identify the underlying causes of failure episodes categorized as “Unknown” in the authors’ labeled subset of BEHAVIOR-1K evaluation tasks, where failures could not be attributed to any specific problem, to improve diagnosis and targeted remediation strategies.
Sponsor
References
Unknown: Failures that we could not attribute to any specific problem.
— Task adaptation of Vision-Language-Action model: 1st Place Solution for the 2025 BEHAVIOR Challenge
(2512.06951 - Larchenko et al., 7 Dec 2025) in Section 6.2, Failure Mode Analysis