On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline (2212.05749v2)

Published 12 Dec 2022 in cs.LG, cs.CV, and cs.RO

Abstract: In this paper, we examine the effectiveness of pre-training for visuo-motor control tasks. We revisit a simple Learning-from-Scratch (LfS) baseline that incorporates data augmentation and a shallow ConvNet, and find that this baseline is surprisingly competitive with recent approaches (PVR, MVP, R3M) that leverage frozen visual representations trained on large-scale vision datasets -- across a variety of algorithms, task domains, and metrics in simulation and on a real robot. Our results demonstrate that these methods are hindered by a significant domain gap between the pre-training datasets and current benchmarks for visuo-motor control, which is alleviated by finetuning. Based on our findings, we provide recommendations for future research in pre-training for control and hope that our simple yet strong baseline will aid in accurately benchmarking progress in this area.
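
The abstract attributes the baseline's strength to two ingredients: a shallow ConvNet encoder trained from scratch and image data augmentation. Below is a minimal sketch of those two ingredients in PyTorch. It is not the authors' released code; the layer sizes, feature dimension, input resolution, and pad width are illustrative assumptions, and the augmentation shown is the common random-shift (pad-and-crop) variant used in pixel-based RL.

```python
# Minimal sketch (assumed details, not the paper's exact architecture):
# a shallow ConvNet encoder plus random-shift image augmentation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ShallowConvEncoder(nn.Module):
    """4-layer ConvNet mapping an 84x84 frame stack to a feature vector."""
    def __init__(self, in_channels=9, feature_dim=50):  # 3 stacked RGB frames
        super().__init__()
        self.convs = nn.Sequential(
            nn.Conv2d(in_channels, 32, 3, stride=2), nn.ReLU(),
            nn.Conv2d(32, 32, 3, stride=1), nn.ReLU(),
            nn.Conv2d(32, 32, 3, stride=1), nn.ReLU(),
            nn.Conv2d(32, 32, 3, stride=1), nn.ReLU(),
        )
        # With 84x84 inputs and the strides above, the spatial map is 35x35.
        self.fc = nn.Linear(32 * 35 * 35, feature_dim)

    def forward(self, obs):
        x = self.convs(obs / 255.0)  # scale raw pixels to [0, 1]
        return self.fc(x.flatten(1))

def random_shift(imgs, pad=4):
    """Pad with edge replication, then randomly crop back to the original
    size: a standard augmentation for visuo-motor control from pixels."""
    imgs = imgs.float()
    n, _, h, w = imgs.shape
    padded = F.pad(imgs, (pad, pad, pad, pad), mode="replicate")
    out = torch.empty_like(imgs)
    for i in range(n):
        top = torch.randint(0, 2 * pad + 1, (1,)).item()
        left = torch.randint(0, 2 * pad + 1, (1,)).item()
        out[i] = padded[i, :, top:top + h, left:left + w]
    return out

# Usage: augment each sampled batch before encoding, then feed the
# features to the downstream RL or imitation objective.
obs = torch.randint(0, 256, (8, 9, 84, 84), dtype=torch.uint8)
features = ShallowConvEncoder()(random_shift(obs))  # shape (8, 50)
```

In this setup the encoder is optimized jointly with the control objective, so no frozen pre-trained representation is involved; that is the Learning-from-Scratch recipe the paper benchmarks against PVR, MVP, and R3M.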

References (45)
  1. Flamingo: a visual language model for few-shot learning. arXiv preprint arXiv:2204.14198, 2022.
  2. RT-1: Robotics Transformer for real-world control at scale. arXiv preprint arXiv:2212.06817, 2022.
  3. Language models are few-shot learners. arXiv preprint arXiv:2005.14165, 2020.
  4. When vision transformers outperform ResNets without pretraining or strong data augmentations. arXiv preprint arXiv:2106.01548, 2021.
  5. PaLM: Scaling language modeling with Pathways. arXiv preprint arXiv:2204.02311, 2022.
  6. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2019.
  7. Unsupervised visual representation learning by context prediction. In IEEE International Conference on Computer Vision (ICCV), pp. 1422–1430, 2015.
  8. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2021.
  9. Ego4D: Around the world in 3,000 hours of egocentric video. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 18995–19012, 2022.
  10. Generalization in reinforcement learning by soft data augmentation. In International Conference on Robotics and Automation (ICRA), 2021.
  11. Stabilizing deep Q-learning with ConvNets and vision transformers under data augmentation. In NeurIPS, 2021.
  12. MoDem: Accelerating visual model-based reinforcement learning with demonstrations. arXiv preprint, 2022.
  13. Temporal difference learning for model predictive control. In ICML, 2022.
  14. Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778, 2016.
  15. Momentum contrast for unsupervised visual representation learning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9726–9735, 2020.
  16. Masked autoencoders are scalable vision learners. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 15979–15988, 2022.
  17. Image augmentation is all you need: Regularizing deep reinforcement learning from pixels. arXiv preprint arXiv:2004.13649, 2021.
  18. Reinforcement learning with augmented data. arXiv preprint arXiv:2004.14990, 2020.
  19. A simple randomization technique for generalization in deep reinforcement learning. arXiv preprint arXiv:1910.05396, 2019.
  20. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971, 2016.
  21. A comprehensive survey of data augmentation in visual reinforcement learning. arXiv preprint arXiv:2210.04561, 2022.
  22. R3M: A universal visual representation for robot manipulation. arXiv preprint arXiv:2203.12601, 2022.
  23. The (un)surprising effectiveness of pre-trained vision models for control. In ICML, 2022.
  24. Asymmetric actor critic for image-based robot learning. arXiv preprint arXiv:1710.06542, 2017.
  25. Learning transferable visual models from natural language supervision. In International Conference on Machine Learning (ICML), pp. 8748–8763, 2021.
  26. Automatic data augmentation for generalization in deep reinforcement learning. arXiv preprint arXiv:2006.12862, 2020.
  27. Learning complex dexterous manipulation with deep reinforcement learning and demonstrations. In Proceedings of Robotics: Science and Systems (RSS), 2018.
  28. ImageNet large scale visual recognition challenge. International Journal of Computer Vision, 115:211–252, 2015.
  29. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017.
  30. Data-efficient reinforcement learning with self-predictive representations. In ICLR, 2021.
  31. Unsupervised perceptual rewards for imitation learning. arXiv preprint arXiv:1612.06699, 2016.
  32. RRL: ResNet as representation for reinforcement learning. arXiv preprint arXiv:2107.03380, 2021.
  33. Reinforcement learning with latent flow. In Neural Information Processing Systems (NeurIPS), 2021.
  34. CURL: Contrastive unsupervised representations for reinforcement learning. In ICML, 2020.
  35. Stanford Alpaca: An instruction-following LLaMA model. https://github.com/tatsu-lab/stanford_alpaca, 2023.
  36. DeepMind Control Suite. Technical report, DeepMind, 2018.
  37. Domain randomization for transferring deep neural networks from simulation to the real world. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017.
  38. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748, 2018.
  39. VRL3: A data-driven framework for visual deep reinforcement learning. arXiv preprint arXiv:2202.10324, 2022.
  40. Masked visual pre-training for motor control. arXiv preprint arXiv:2203.06173, 2022.
  41. On the feasibility of cross-task transfer with model-based reinforcement learning. arXiv preprint arXiv:2210.10763, 2022.
  42. Improving sample efficiency in model-free reinforcement learning from images. 2019.
  43. Mastering visual continuous control: Improved data-augmented reinforcement learning. arXiv preprint arXiv:2107.09645, 2021.
  44. Pre-trained image encoder for generalizable visual reinforcement learning. arXiv preprint arXiv:2212.08860, 2022.
  45. Visual reinforcement learning with self-supervised 3D representations. arXiv preprint arXiv:2210.07241, 2022.
Authors (8)
  1. Nicklas Hansen
  2. Zhecheng Yuan
  3. Yanjie Ze
  4. Tongzhou Mu
  5. Aravind Rajeswaran
  6. Hao Su
  7. Huazhe Xu
  8. Xiaolong Wang
Citations (55)
