Evaluation of Robo-Dopamine on RoboRewardBench
Evaluate the performance of the Robo-Dopamine checkpoints using the RoboRewardBench benchmark once the dataset and checkpoints are released, to assess their reward prediction accuracy relative to other vision-language reward models.
Sponsor
References
At the time of writing, their dataset and checkpoints have not been released, so we leave evaluating the Robo-Dopamine checkpoints with RoboRewardBench to future work.
— RoboReward: General-Purpose Vision-Language Reward Models for Robotics
(2601.00675 - Lee et al., 2 Jan 2026) in Section 2, Related Work