
Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations (2403.13261v2)

Published 20 Mar 2024 in cs.CV

Abstract: The perception of motion behavior in a dynamic environment holds significant importance for autonomous driving systems, wherein class-agnostic motion prediction methods directly predict the motion of the entire point cloud. While most existing methods rely on fully-supervised learning, the manual labeling of point cloud data is laborious and time-consuming. Therefore, several annotation-efficient methods have been proposed to address this challenge. Although effective, these methods rely on weak annotations or additional multi-modal data like images, and the potential benefits inherent in the point cloud sequence are still underexplored. To this end, we explore the feasibility of self-supervised motion prediction with only unlabeled LiDAR point clouds. Initially, we employ an optimal transport solver to establish coarse correspondences between current and future point clouds as the coarse pseudo motion labels. Training models directly using such coarse labels leads to noticeable spatial and temporal prediction inconsistencies. To mitigate these issues, we introduce three simple spatial and temporal regularization losses, which facilitate the self-supervised training process effectively. Experimental results demonstrate the significant superiority of our approach over the state-of-the-art self-supervised methods.
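The coarse pseudo-labelling step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which optimal transport solver is used, so everything below is an assumption made for illustration: entropy-regularized Sinkhorn iterations, uniform point masses, a squared-Euclidean cost, a hard argmax assignment, and the hypothetical names `sinkhorn_plan`, `coarse_pseudo_motion`, `eps`, and `n_iters`. It is not the authors' implementation.

```python
import numpy as np


def sinkhorn_plan(cost, eps=0.01, n_iters=200):
    """Entropy-regularized transport plan between two uniform point sets.

    Assumption: uniform mass on every point; eps and n_iters are
    illustrative values, not taken from the paper.
    """
    n, m = cost.shape
    a = np.full(n, 1.0 / n)   # mass on current points
    b = np.full(m, 1.0 / m)   # mass on future points
    K = np.exp(-cost / eps)   # Gibbs kernel
    u = np.ones(n)
    v = np.ones(m)
    for _ in range(n_iters):
        u = a / (K @ v)       # rescale rows to match a
        v = b / (K.T @ u)     # rescale columns to match b
    return u[:, None] * K * v[None, :]


def coarse_pseudo_motion(current, future, eps=0.01, n_iters=200):
    """Per-point displacement from hard correspondences (hypothetical helper).

    current: (N, 3) array, future: (M, 3) array.
    Returns an (N, 3) array of coarse pseudo motion labels.
    """
    diff = current[:, None, :] - future[None, :, :]
    cost = np.sum(diff ** 2, axis=-1)        # squared Euclidean cost
    cost = cost / max(cost.max(), 1e-12)     # normalise so eps is scale-free
    plan = sinkhorn_plan(cost, eps, n_iters)
    match = plan.argmax(axis=1)              # most likely future point
    return future[match] - current
```

On a toy input where well-separated points are all shifted by the same small offset, the returned labels should approximate that offset. On real LiDAR sweeps such correspondences are noisy, which is the inconsistency the abstract's regularization losses are meant to address.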

Authors (8)
  1. Kewei Wang
  2. Yizheng Wu
  3. Jun Cen
  4. Zhiyu Pan
  5. Xingyi Li
  6. Zhe Wang
  7. Zhiguo Cao
  8. Guosheng Lin
