When and how to incorporate additional Recursive Reward Modeling steps beyond critique
Determine at what point supervising large language models calls for further Recursive Reward Modeling (RRM) steps beyond a single critique stage, and whether the critique method can be used effectively within RRM.
References
The critique approach is also only the first step of recursive reward modeling (RRM), and we do not know the point at which an additional RRM step is appropriate or whether critique can be used for RRM effectively.
Source: LLM Critics Help Catch LLM Bugs (2407.00215, McAleese et al., 28 Jun 2024), Section “Discussion and Limitations”