Never-Ending Behavior-Cloning Agent for Robotic Manipulation (2403.00336v2)
Abstract: Relying on multi-modal observations, embodied robots could perform multiple robotic manipulation tasks in unstructured real-world environments. However, most language-conditioned behavior-cloning agents still face existing long-standing challenges, i.e., 3D scene representation and human-level task learning, when adapting into new sequential tasks in practical scenarios. We here investigate these above challenges with NBAgent in embodied robots, a pioneering language-conditioned Never-ending Behavior-cloning Agent. It can continually learn observation knowledge of novel 3D scene semantics and robot manipulation skills from skill-shared and skill-specific attributes, respectively. Specifically, we propose a skill-sharedsemantic rendering module and a skill-shared representation distillation module to effectively learn 3D scene semantics from skill-shared attribute, further tackling 3D scene representation overlooking. Meanwhile, we establish a skill-specific evolving planner to perform manipulation knowledge decoupling, which can continually embed novel skill-specific knowledge like human from latent and low-rank space. Finally, we design a never-ending embodied robot manipulation benchmark, and expensive experiments demonstrate the significant performance of our method. Visual results, code, and dataset are provided at: https://neragent.github.io.
- Few-shot continual active learning by a robot. In NeurIPS, volume 35, pp. 30612–30624, 2022.
- Cbcl-pr: A cognitively inspired model for class-incremental learning in robotics. IEEE Transactions on Cognitive and Developmental Systems, 2023.
- Continual learning through human-robot interaction–human perceptions of a continual learning robot in repeated interactions. arXiv preprint arXiv:2305.16332, 2023.
- Rainbow memory: Continual learning with a memory of diverse samples. In CVPR, pp. 8218–8227, 2021.
- Rt-1: Robotics transformer for real-world control at scale. arXiv preprint arXiv:2212.06817, 2022.
- Do as i can, not as i say: Grounding language in robotic affordances. In CoRL, pp. 287–318. PMLR, 2023.
- On tiny episodic memories in continual learning. arXiv preprint arXiv:1902.10486, 2019.
- Palm: Scaling language modeling with pathways. Journal of Machine Learning Research, 24(240):1–113, 2023.
- Kernel continual learning. In ICML, pp. 2621–2631. PMLR, 2021.
- Heterogeneous forgetting compensation for class-incremental learning. In ICCV, pp. 11742–11751, 2023.
- Podnet: Pooled outputs distillation for small-tasks incremental learning. In ECCV, pp. 86–102. Springer, 2020.
- Reinforcement learning with neural radiance fields. In NeurIPS, volume 35, pp. 16931–16945, 2022.
- Palm-e: An embodied multimodal language model. arXiv preprint arXiv:2303.03378, 2023.
- Cril: Continual robot imitation learning via generative and prediction model. In IROS, pp. 6747–5754. IEEE, 2021.
- Ifor: Iterative flow minimization for robotic object rearrangement. In CVPR, pp. 14787–14797, 2022.
- Rvt: Robotic view transformer for 3d object manipulation. arXiv preprint arXiv:2306.14896, 2023.
- Continual robot learning using self-supervised task inference. IEEE Transactions on Cognitive and Developmental Systems, 2023.
- Lora: Low-rank adaptation of large language models. arXiv preprint arXiv:2106.09685, 2021.
- Inner monologue: Embodied reasoning through planning with language models. arXiv preprint arXiv:2207.05608, 2022.
- Perceiver io: A general architecture for structured inputs & outputs. arXiv preprint arXiv:2107.14795, 2021.
- Rlbench: The robot learning benchmark & learning environment. IEEE Robotics and Automation Letters, 5(2):3019–3026, 2020.
- Coarse-to-fine q-attention: Efficient learning for visual robotic manipulation via discretisation. In CVPR, pp. 13739–13748, 2022.
- Vima: General robot manipulation with multimodal prompts. arXiv, 2022.
- Continual learning with node-importance based adaptive group sparse regularization. In NeurIPS, volume 33, pp. 3647–3658, 2020.
- Langley, P. Crafting papers on machine learning. In Langley, P. (ed.), ICML, pp. 1207–1216, Stanford, CA, 2000. Morgan Kaufmann.
- Your diffusion model is secretly a zero-shot classifier. In ICCV, pp. 2206–2217, October 2023.
- Continual few-shot intent detection. In COLING, pp. 333–343, 2022.
- Learning without forgetting. IEEE transactions on pattern analysis and machine intelligence, 40(12):2935–2947, 2017.
- Nerf: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM, 65(1):99–106, 2021.
- Learning transferable visual models from natural language supervision. In ICML, pp. 8748–8763. PMLR, 2021.
- icarl: Incremental classifier and representation learning. In CVPR, pp. 2001–2010, 2017.
- V-rep: A versatile and scalable robot simulation framework. In IROS, pp. 1321–1326. IEEE, 2013.
- High-resolution image synthesis with latent diffusion models. In CVPR, pp. 10684–10695, 2022.
- Lm-nav: Robotic navigation with large pre-trained models of language, vision, and action. In CoRL, pp. 492–504. PMLR, 2023.
- Snerl: Semantic-aware neural radiance fields for reinforcement learning. In ICML. PMLR, 2023.
- Cliport: What and where pathways for robotic manipulation. In CoRL, pp. 894–906. PMLR, 2022.
- Perceiver-actor: A multi-task transformer for robotic manipulation. In CoRL, pp. 785–799. PMLR, 2023.
- Create your world: Lifelong text-to-image diffusion. arXiv preprint arXiv:2309.04430, 2023.
- Exploring example influence in continual learning. In NeurIPS, volume 35, pp. 27075–27086, 2022.
- Gcr: Gradient coreset based replay buffer selection for continual learning. In CVPR, pp. 99–108, 2022.
- Bring evanescent representations to life in lifelong class incremental learning. In CVPR, pp. 16732–16741, 2022.
- Lotus: Continual imitation learning for robot manipulation through unsupervised skill discovery. arXiv preprint arXiv:2311.02058, 2023.
- Coscl: Cooperation of small continual learners is stronger than a big one. In ECCV, pp. 254–271. Springer, 2022.
- A comprehensive survey of continual learning: Theory, method and application. arXiv preprint arXiv:2302.00487, 2023.
- Incremental learning via rate reduction. In CVPR, pp. 1125–1133, 2021.
- Incremental learning using conditional adversarial networks. In ICCV, pp. 6619–6628, 2019.
- Open-vocabulary panoptic segmentation with text-to-image diffusion models. In CVPR, pp. 2955–2966, 2023.
- Continual object detection via prototypical task correlation guided gating mechanism. In CVPR, pp. 9255–9264, 2022.
- Large batch optimization for deep learning: Training bert in 76 minutes. arXiv preprint arXiv:1904.00962, 2019.
- pixelnerf: Neural radiance fields from one or few images. In CVPR, pp. 4578–4587, 2021.
- Gnfactor: Multi-task real robot learning with generalizable neural feature fields. In CoRL, pp. 284–301. PMLR, 2023.
- Rt-2: Vision-language-action models transfer web knowledge to robotic control. In CoRL, pp. 2165–2183. PMLR, 2023.
Collections
Sign up for free to add this paper to one or more collections.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.