
Hybrid Classification-Regression Adaptive Loss for Dense Object Detection (2408.17182v1)

Published 30 Aug 2024 in cs.CV

Abstract: For object detectors, enhancing model performance hinges on the ability to simultaneously consider inconsistencies across tasks and focus on difficult-to-train samples. Achieving this requires incorporating information from both the classification and regression tasks. However, prior work tends to either emphasize difficult-to-train samples within their respective tasks or simply compute classification scores with IoU, often leading to suboptimal model performance. In this paper, we propose a Hybrid Classification-Regression Adaptive Loss, termed HCRAL. Specifically, we introduce the Residual of Classification and IoU (RCI) module for cross-task supervision, addressing task inconsistencies, and the Conditioning Factor (CF) to focus on difficult-to-train samples within each task. Furthermore, we introduce a new strategy named Expanded Adaptive Training Sample Selection (EATSS) to provide additional samples that exhibit classification and regression inconsistencies. To validate the effectiveness of the proposed method, we conduct extensive experiments on COCO test-dev. Experimental evaluations demonstrate the superiority of our approach. Additionally, we design experiments that separately combine the classification and regression losses with regular loss functions in popular one-stage models, demonstrating improved performance.
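The abstract gives no equations, so the following is only an illustrative sketch of what "classification and regression inconsistency" can look like numerically, not the paper's RCI or CF formulation. It assumes the inconsistency of a sample is measured as the gap between its predicted class score and the IoU of its predicted box with the ground truth, and it swaps in a weighting of the same form as the modulating factor in quality focal loss (Generalized Focal Loss). The function names and the `beta` parameter are hypothetical.

```python
import math


def box_iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def residual_weighted_bce(score, iou, beta=2.0, eps=1e-12):
    """Cross-entropy against an IoU soft target, scaled by |score - iou|**beta.

    Assumption for illustration only: samples whose class score disagrees
    with their localization quality receive a larger loss.
    """
    residual = abs(score - iou)
    bce = -(iou * math.log(score + eps) + (1.0 - iou) * math.log(1.0 - score + eps))
    return (residual ** beta) * bce


pred_box, gt_box = (0, 0, 2, 2), (1, 1, 3, 3)
iou = box_iou(pred_box, gt_box)

# A confident score on a poorly localized box is penalized far more
# than the same score on a well-localized one.
loss_inconsistent = residual_weighted_bce(0.9, iou)
loss_consistent = residual_weighted_bce(0.9, 0.85)
```

Under this assumed weighting, a sample whose score already matches its IoU contributes zero loss, which is one simple way cross-task information can steer training toward inconsistent samples.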

