Source of Positive Transfer in RL Fine-Tuning
Determine the origins of the positive transfer observed when fine-tuning reinforcement learning agents with forgetting mitigation by disentangling and quantifying the respective contributions of reused pretrained representations and initialized policy parameters to improved learning efficiency and performance.
References
Although we see significant positive transfer once the forgetting problem is addressed, it remains an open question where this impact comes from.
— Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
(2402.02868 - Wołczyk et al., 2024) in Appendix, Section "Analysis of forgetting in robotic manipulation tasks", subsection "Impact of representation vs policy on transfer"