Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning (2407.15762v2)

Published 22 Jul 2024 in cs.LG, cs.AI, and cs.CL

Abstract: Reward-based finetuning is crucial for aligning language policies with intended behaviors (e.g., creativity and safety). A key challenge is to develop steerable LLMs that trade off multiple (conflicting) objectives in a flexible and efficient manner. This paper presents Conditional Language Policy (CLP), a general framework for finetuning LLMs on multiple objectives. Building on techniques from multi-task training and parameter-efficient finetuning, CLP learns steerable models that effectively trade off conflicting objectives at inference time. Notably, this does not require training or maintaining multiple models to achieve different trade-offs between the objectives. Through extensive experiments and ablations on two summarization datasets, we show that CLP learns steerable LLMs that outperform and Pareto-dominate the existing approaches for multi-objective finetuning.
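
To make the general recipe concrete, below is a minimal, self-contained sketch (not the authors' code) of the core idea the abstract describes: a single policy with parameter-efficient components that are mixed according to a reward-weighting w, trained by sampling w and maximizing the correspondingly scalarized reward, and steered at inference time simply by choosing w. All names here (ConditionalLoRALinear, reward_a, reward_b) and the REINFORCE-style update are illustrative assumptions standing in for the paper's actual architecture and RL finetuning procedure.

```python
import torch
import torch.nn as nn


class ConditionalLoRALinear(nn.Module):
    """Linear layer whose two low-rank adapters are mixed by a trade-off weight w in [0, 1]."""

    def __init__(self, d_in: int, d_out: int, rank: int = 4):
        super().__init__()
        self.base = nn.Linear(d_in, d_out)
        # One low-rank adapter per objective (two objectives here); LoRA-style init.
        self.down = nn.ParameterList([nn.Parameter(torch.randn(rank, d_in) * 0.01) for _ in range(2)])
        self.up = nn.ParameterList([nn.Parameter(torch.zeros(d_out, rank)) for _ in range(2)])

    def forward(self, x: torch.Tensor, w: float) -> torch.Tensor:
        # Interpolate the adapters: w favors objective 0, (1 - w) favors objective 1.
        delta = w * (self.up[0] @ self.down[0]) + (1.0 - w) * (self.up[1] @ self.down[1])
        return self.base(x) + x @ delta.T


# Toy "policy head" over a small vocabulary, conditioned on w.
vocab_size, d_model = 16, 32
policy = ConditionalLoRALinear(d_model, vocab_size)
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)


# Stand-in rewards for two conflicting objectives (purely illustrative).
def reward_a(tokens):  # e.g. "conciseness"
    return (tokens < vocab_size // 2).float()


def reward_b(tokens):  # e.g. "informativeness"
    return (tokens >= vocab_size // 2).float()


for step in range(200):
    w = torch.rand(()).item()            # sample a trade-off weight for this update
    h = torch.randn(8, d_model)          # toy context representations
    dist = torch.distributions.Categorical(logits=policy(h, w))
    tokens = dist.sample()
    # Scalarize the rewards with the sampled weighting; REINFORCE-style update.
    reward = w * reward_a(tokens) + (1.0 - w) * reward_b(tokens)
    loss = -(dist.log_prob(tokens) * (reward - reward.mean())).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# At inference time, the user picks w to steer the trade-off; no retraining or
# per-trade-off model is needed.
```

The property this sketch is meant to illustrate is the one highlighted in the abstract: every trade-off shares one set of weights, so reaching a different point on the Pareto front is a matter of changing the conditioning weight rather than training or storing another model.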
