Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Prompt-based Visual Alignment for Zero-shot Policy Transfer (2406.03250v1)

Published 5 Jun 2024 in cs.CV and cs.AI

Abstract: Overfitting in RL has become one of the main obstacles to applications in reinforcement learning(RL). Existing methods do not provide explicit semantic constrain for the feature extractor, hindering the agent from learning a unified cross-domain representation and resulting in performance degradation on unseen domains. Besides, abundant data from multiple domains are needed. To address these issues, in this work, we propose prompt-based visual alignment (PVA), a robust framework to mitigate the detrimental domain bias in the image for zero-shot policy transfer. Inspired that Visual-LLM (VLM) can serve as a bridge to connect both text space and image space, we leverage the semantic information contained in a text sequence as an explicit constraint to train a visual aligner. Thus, the visual aligner can map images from multiple domains to a unified domain and achieve good generalization performance. To better depict semantic information, prompt tuning is applied to learn a sequence of learnable tokens. With explicit constraints of semantic information, PVA can learn unified cross-domain representation under limited access to cross-domain data and achieves great zero-shot generalization ability in unseen domains. We verify PVA on a vision-based autonomous driving task with CARLA simulator. Experiments show that the agent generalizes well on unseen domains under limited access to multi-domain data.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (16)
  1. Haihan Gao (1 paper)
  2. Rui Zhang (1138 papers)
  3. Qi Yi (18 papers)
  4. Hantao Yao (23 papers)
  5. Haochen Li (42 papers)
  6. Jiaming Guo (37 papers)
  7. Shaohui Peng (20 papers)
  8. Yunkai Gao (5 papers)
  9. Xing Hu (122 papers)
  10. Yuanbo Wen (19 papers)
  11. Zihao Zhang (75 papers)
  12. Zidong Du (41 papers)
  13. Ling Li (112 papers)
  14. Qi Guo (237 papers)
  15. Yunji Chen (51 papers)
  16. Qicheng Wang (4 papers)

Summary

We haven't generated a summary for this paper yet.