Efficient Controlled Language Generation with Low-Rank Autoregressive Reward Models (2407.04615v2)

Published 5 Jul 2024 in cs.CL

Abstract: LLMs trained on large amounts of data are known to produce inappropriate content in some cases and require careful tuning to be used in the real world. We revisit the reward augmented decoding (RAD) approach to control the generation from a LLM using the scores from a task-specific reward model. We investigate the training objective of RAD, and reformulate it as a task of learning a reward matrix. We show that RAD is designed to support high flexibility when representing the reward matrices, which leads to a higher computational costs during decoding. However, we demonstrate that RAD does not use its full flexibility. Motivated by this, we propose a simpler but more efficient low-rank parametrization of the reward model enabling fast and effective guided decoding. For the detoxification and sentiment control tasks, we show that our low-rank reward model performs on par with the more flexible RAD parametrization, while requiring only a single reward model call per generated token.

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/stateof_ai/status/1810438226236023058

Efficient Controlled Language Generation with Low-Rank Autoregressive Reward Models (2407.04615v2)

Summary

Related Papers

Tweets