Isolate the contribution of each component in the Pi0.5-based BEHAVIOR-1K solution
Determine the individual performance contribution of each component introduced in the Pi0.5-based vision-language-action policy for the 2025 BEHAVIOR Challenge—specifically correlated noise with beta shrinkage for flow matching, KV cache transformation for mixed-layer attention, System 2 stage prediction with voting, custom attention masks, delta action space with per-timestamp normalization, multi-sample flow matching, FAST auxiliary training, correlation-aware inpainting, action compression via cubic splines, and task-specific correction rules—through rigorous ablation experiments on BEHAVIOR-1K tasks measured by q-score and binary success.
Sponsor
References
Due to resource constraints, we could not isolate the contribution of each component. Rigorous ablation studies would be valuable to identify which innovations actually matter and which are redundant.