Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

S2R-ViT for Multi-Agent Cooperative Perception: Bridging the Gap from Simulation to Reality (2307.07935v4)

Published 16 Jul 2023 in cs.CV

Abstract: Due to the lack of enough real multi-agent data and time-consuming of labeling, existing multi-agent cooperative perception algorithms usually select the simulated sensor data for training and validating. However, the perception performance is degraded when these simulation-trained models are deployed to the real world, due to the significant domain gap between the simulated and real data. In this paper, we propose the first Simulation-to-Reality transfer learning framework for multi-agent cooperative perception using a novel Vision Transformer, named as S2R-ViT, which considers both the Deployment Gap and Feature Gap between simulated and real data. We investigate the effects of these two types of domain gaps and propose a novel uncertainty-aware vision transformer to effectively relief the Deployment Gap and an agent-based feature adaptation module with inter-agent and ego-agent discriminators to reduce the Feature Gap. Our intensive experiments on the public multi-agent cooperative perception datasets OPV2V and V2V4Real demonstrate that the proposed S2R-ViT can effectively bridge the gap from simulation to reality and outperform other methods significantly for point cloud-based 3D object detection.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Jinlong Li (50 papers)
  2. Runsheng Xu (40 papers)
  3. Xinyu Liu (123 papers)
  4. Baolu Li (12 papers)
  5. Qin Zou (32 papers)
  6. Jiaqi Ma (83 papers)
  7. Hongkai Yu (50 papers)
Citations (12)

Summary

We haven't generated a summary for this paper yet.