Stable and efficient learning of long‑horizon navigation with low‑level control in the loop
Develop reinforcement learning methods that can stably and efficiently learn long‑horizon navigation policies while incorporating low‑level locomotion control during training, addressing sparse rewards, contact dynamics, and training instability.
References
Yet, involving locomotion complicates the training of long-horizon navigation policies, which requires future developments to stabilize learning.
— Deep Reinforcement Learning for Robotics: A Survey of Real-World Successes
(2408.03539 - Tang et al., 7 Aug 2024) in Trends and Challenges in Navigation (Subsection "Navigation")