CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot Applications (2405.01107v3)

Published 2 May 2024 in cs.RO, cs.MA, cs.SY, and eess.SY

Abstract: Autonomous robot operation in unstructured environments is often underpinned by spatial understanding through vision. Systems composed of multiple concurrently operating robots additionally require access to frequent, accurate and reliable pose estimates. In this work, we propose CoViS-Net, a decentralized visual spatial foundation model that learns spatial priors from data, enabling pose estimation as well as spatial comprehension. Our model is fully decentralized, platform-agnostic, executable in real-time using onboard compute, and does not require existing networking infrastructure. CoViS-Net provides relative pose estimates and a local bird's-eye-view (BEV) representation, even without camera overlap between robots (in contrast to classical methods). We demonstrate its use in a multi-robot formation control task across various real-world settings. We provide code, models and supplementary material online at https://proroklab.github.io/CoViS-Net/

Authors (4)

Jan Blumenkamp
Steven Morad
Jennifer Gielis
Amanda Prorok
