DCDet: Dynamic Cross-based 3D Object Detector (2401.07240v2)
Abstract: Recently, significant progress has been made in the research of 3D object detection. However, most prior studies have focused on the utilization of center-based or anchor-based label assignment schemes. Alternative label assignment strategies remain unexplored in 3D object detection. We find that the center-based label assignment often fails to generate sufficient positive samples for training, while the anchor-based label assignment tends to encounter an imbalanced issue when handling objects of varying scales. To solve these issues, we introduce a dynamic cross label assignment (DCLA) scheme, which dynamically assigns positive samples for each object from a cross-shaped region, thus providing sufficient and balanced positive samples for training. Furthermore, to address the challenge of accurately regressing objects with varying scales, we put forth a rotation-weighted Intersection over Union (RWIoU) metric to replace the widely used L1 metric in regression loss. Extensive experiments demonstrate the generality and effectiveness of our DCLA and RWIoU-based regression loss. The Code will be available at https://github.com/Say2L/DCDet.git.
- Voxel r-cnn: Towards high performance voxel-based 3d object detection. In AAAI, 2021.
- Embracing single stride 3d object detector with sparse transformer. In CVPR, 2022.
- Fully sparse 3d object detection. In NeurIPS, 2022.
- Afdet: Anchor free one stage 3d object detection. arXiv preprint arXiv:2006.12671, 2020.
- Yolox: Exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430, 2021.
- Are we ready for autonomous driving? the kitti vision benchmark suite. In CVPR, 2012.
- Voxel set transformer: A set-to-set approach to 3d object detection from point clouds. In CVPR, 2022.
- Afdetv2: Rethinking the necessity of the second stage for object detection from point clouds. In AAAI, 2022.
- Pointpillars: Fast encoders for object detection from point clouds. In CVPR, 2019.
- Lidar r-cnn: An efficient and universal 3d object detector. In CVPR, 2021.
- Pillarnext: Rethinking network designs for 3d object detection in lidar point clouds. In CVPR, 2023.
- Focal loss for dense object detection. In ICCV, 2017.
- Pointnet: Deep learning on point sets for 3d classification and segmentation. In CVPR, 2017.
- Pointnet++: Deep hierarchical feature learning on point sets in a metric space. In NeurIPS, 2017.
- Generalized intersection over union: A metric and a loss for bounding box regression. In CVPR, 2019.
- Rethinking iou-based optimization for single-stage 3d object detection. In ECCV, 2022.
- Pointrcnn: 3d object proposal generation and detection from point cloud. In CVPR, 2019.
- Pv-rcnn: Point-voxel feature set abstraction for 3d object detection. In CVPR, 2020.
- From points to parts: 3d object detection from point cloud with part-aware and part-aggregation network. TPAMI, 2020.
- Pillarnet: Real-time and high-performance pillar-based 3d object detection. In ECCV, 2022.
- Pv-rcnn++: Point-voxel feature set abstraction with local vector representation for 3d object detection. IJCV, 2023.
- Scalability in perception for autonomous driving: Waymo open dataset. In CVPR, 2020.
- Swformer: Sparse window transformer for 3d object detection in point clouds. In ECCV, 2022.
- OpenPCDet Development Team. Openpcdet: An open-source toolbox for 3d object detection from point clouds. https://github.com/open-mmlab/OpenPCDet, 2020.
- Fcos: Fully convolutional one-stage object detection. In ICCV, 2019.
- 3d-centernet: 3d object detection network for point clouds with center estimation priority. Pattern Recognition, 2021.
- Dsvt: Dynamic sparse voxel transformer with rotated sets. In CVPR, 2023.
- Behind the curtain: Learning occluded shapes for 3d object detection. In AAAI, 2022.
- Second: Sparsely embedded convolutional detection. Sensors, 2018.
- 3dssd: Point-based 3d single stage object detector. In CVPR, 2020.
- Center-based 3d object detection and tracking. In CVPR, 2021.
- Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. In CVPR, 2020.
- Focal and efficient iou loss for accurate bounding box regression. Neurocomputing, 2022.
- Not all points are equal: Learning highly efficient point-based detectors for 3d lidar point clouds. In CVPR, 2022.
- Distance-iou loss: Faster and better learning for bounding box regression. In AAAI, 2020.
- Cia-ssd: Confident iou-aware single-stage object detector from point cloud. In AAAI, 2021.
- Voxelnet: End-to-end learning for point cloud based 3d object detection. In CVPR, 2018.
- Iou loss for 2d/3d object detection. In 3DV, 2019.
- Objects as points. arXiv preprint arXiv:1904.07850, 2019.
- Centerformer: Center-based transformer for 3d object detection. In ECCV, 2022.
- Autoassign: Differentiable label assignment for dense object detection. arXiv preprint arXiv:2007.03496, 2020.