Learning to navigate efficiently and precisely in real environments
Abstract: In the context of autonomous navigation of terrestrial robots, the creation of realistic models for agent dynamics and sensing is a widespread habit in the robotics literature and in commercial applications, where they are used for model based control and/or for localization and mapping. The more recent Embodied AI literature, on the other hand, focuses on modular or end-to-end agents trained in simulators like Habitat or AI-Thor, where the emphasis is put on photo-realistic rendering and scene diversity, but high-fidelity robot motion is assigned a less privileged role. The resulting sim2real gap significantly impacts transfer of the trained models to real robotic platforms. In this work we explore end-to-end training of agents in simulation in settings which minimize the sim2real gap both, in sensing and in actuation. Our agent directly predicts (discretized) velocity commands, which are maintained through closed-loop control in the real robot. The behavior of the real robot (including the underlying low-level controller) is identified and simulated in a modified Habitat simulator. Noise models for odometry and localization further contribute in lowering the sim2real gap. We evaluate on real navigation scenarios, explore different localization and point goal calculation methods and report significant gains in performance and robustness compared to prior work.
- On evaluation of embodied navigation agents. arXiv preprint, 2018.
- Combining optimal control and learning for visual navigation in novel environments. In CoRL, 2020b.
- Learning to reason on uncertain topological maps. In ECCV, 2020a.
- Egomap: Projective mapping and structured egocentric memory for deep RL. In ECML-PKDD, 2020b.
- Learning to plan with uncertain topological maps. In ECCV, 2020c.
- Graph augmented Deep Reinforcement Learning in the GameRLand3D environment. In AAAI Workshop on Reinforcement Learning in Games, 2022.
- End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon,. In ICLR, 2024a.
- Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction. In ICLR, 2024b.
- Simultaneous localization and mapping: A survey of current trends in autonomous driving. IEEE Transactions on Intelligent Vehicles, 2017.
- The interactive museum tour-guide robot. In Aaai/iaai, pages 11–18, 1998.
- Object goal navigation using goal-oriented semantic exploration. In NeurIPS, 2020a.
- Learning to explore using active neural slam. In ICLR, 2020b.
- Semantic curiosity for active visual learning. In ECCV, 2020c.
- Neural topological slam for visual navigation. In CVPR, 2020d.
- Robustnav: Towards benchmarking robustness in embodied navigation. CoRR, 2106.04531, 2021.
- Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation. arXiv:2202.11742, 2022.
- Deep reinforcement learning of navigation in a complex and crowded environment with a limited field of view. In ICRA, 2019.
- Hybrid imitative planning with geometric and predictive costs in off-road environments. In ICRA, 2022.
- Learning whom to trust in navigation: dynamically switching between classical and neural planning. In IROS, 2023.
- Goal-conditioned imitation learning. In NeurIPS, 2019.
- PALM-E: An Embodied Multimodal Language Model. In ICML, 2023.
- Vtnet: Visual transformer network for object goal navigation. arXiv preprint arXiv:2105.09447, 2021.
- Off-dynamics reinforcement learning: Training for transfer with domain classifiers. In ICLR, 2021.
- Scene memory transformer for embodied agents in long-horizon tasks. In CVPR, 2019.
- The dynamic window approach to collision avoidance. IEEE Robotics & Automation Magazine, 4(1):23–33, 1997.
- Navigating to objects in the real world. Science Robotics, 8(79), 2023.
- Cognitive mapping and planning for visual nav. In CVPR, 2017.
- Mapnet: An allocentric spatial memory for mapping environments. In CVPR, 2018.
- Perspectives on sim2real transfer for robotics: A summary of the R: SS 2020 workshop, 2020.
- Inner Monologue: Embodied Reasoning through Planning with Language Models. In CoRL, 2022.
- Reinforcement learning with unsupervised auxiliary tasks. In ICLR, 2017.
- Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance? IEEE Robotics and Automation Letters, 5(4):6670–6677, 2020.
- All-in-one: A DRL-based control switch combining state-of-the-art navigation planners. In ICRA, 2022.
- AI2-THOR: An Interactive 3D Environment for Visual AI. CoRR, 1712.05474, 2017.
- Kurt Konolige. A gradient method for realtime robot control. In IROS, 2000.
- RTAB-Map as an open-source lidar and visual simultaneous localization and mapping library for large-scale and long-term online operation. Journal of Field Robotics, 36(2):416–446, 2019.
- World model based sim2real transfer for visual navigation. In NeurIPS Robot Learning Workshop, 2023.
- Active Mapping and Robot Exploration: A Survey. Sensors, 21(7):2445, 2021.
- The marathon 2: A navigation system. In IROS, 2020.
- Where are we in the search for an artificial visual cortex for embodied intelligence? In arXiv:2303.18240, 2023.
- The office marathon: Robust navigation in an indoor office environment. In ICRA, 2010.
- Multi-Object Navigation with dynamically learned neural implicit representations. In ICCV, 2023.
- Learning to navigate in complex environments. In ICLR, 2017a.
- Learning to navigate in complex environments. In ICLR, 2017b.
- Katsuhiko Ogata. Modern Control Engineering. Prentice Hall, 2010.
- Neural map: Structured memory for deep reinforcement learning. In ICLR, 2018.
- Is mapping necessary for realistic pointgoal navigation? In CVPR, 2022.
- Sim-to-real transfer of robotic control with dynamics randomization. In ICRA, 2018.
- Habitat-matterport 3D dataset (HM3D): 1000 large-scale 3d environments for embodied AI. In NeurIPS Datasets and Benchmarks Track, 2021.
- A Generalist Agent. arXiv:2205.06175, 2022. arXiv: 2205.06175.
- Timed-elastic-bands for time-optimal point-to-point nonlinear model predictive control. In European Control Conference (ECC), 2015.
- An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments. In ICRA, 2022.
- Habitat: A platform for embodied ai research. In ICCV, 2019.
- Proximal policy optimization algorithms. arXiv preprint, 2017.
- James A Sethian. A fast marching level set method for monotonically advancing fronts. PNAS, 93(4):1591–1595, 1996.
- ViKiNG: Vision-based kilometer-scale navigation with geographic hints. In RSS, 2022.
- ViNT: A foundation model for visual navigation. In CoRL, 2023.
- Sim-to-real: Learning agile locomotion for quadruped robots. In RSS, 2018.
- Probabilistic robotics, 2005.
- Bi-directional Domain Adaptation for Sim2Real Transfer of Embodied Navigation Agents. IEEE Robotics and Automation Letters, 6(2), 2021.
- Rethinking sim2real: Lower fidelity simulation leads to higher sim2real transfer in navigation. In CoRL, pages 859–870, 2022.
- Sim-to-real strategy for spatially aware robot navigation in uneven outdoor environments. In ICRA Workshop on Releasing Robots into the Wild, 2022.
- Dd-ppo: Learning near-perfect pointgoal navigators from 2.5 billion frames. In ICLR, 2019.
- OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav. In arXiv:2303.07798, 2023.
- Success weighted by completion time: A dynamics-aware evaluation criteria for embodied navigation. In IROS, 2021.
- Learning robust agents for visual navigation in dynamic environments: The winning entry of igibson challenge 2021. In IROS, pages 77–83, 2022.
- Path planning using neural A* search. In ICML, 2021.
- Adversarial discriminative sim-to-real transfer of visuo-motor policies. Int. J. Robotics Res., 38(10-11), 2019.
- Adapting object detectors via selective cross-domain alignment. In CVPR, 2019.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.