Single-Point Supervised High-Resolution Dynamic Network for Infrared Small Target Detection (2408.01976v2)
Abstract: Infrared small target detection (IRSTD) tasks are extremely challenging for two main reasons: 1) it is difficult to obtain accurate labelling information that is critical to existing methods, and 2) infrared (IR) small target information is easily lost in deep networks. To address these issues, we propose a single-point supervised high-resolution dynamic network (SSHD-Net). In contrast to existing methods, we achieve state-of-the-art (SOTA) detection performance using only single-point supervision. Specifically, we first design a high-resolution cross-feature extraction module (HCEM), that achieves bi-directional feature interaction through stepped feature cascade channels (SFCC). It balances network depth and feature resolution to maintain deep IR small-target information. Secondly, the effective integration of global and local features is achieved through the dynamic coordinate fusion module (DCFM), which enhances the anti-interference ability in complex backgrounds. In addition, we introduce the high-resolution multilevel residual module (HMRM) to enhance the semantic information extraction capability. Finally, we design the adaptive target localization detection head (ATLDH) to improve detection accuracy. Experiments on the publicly available datasets NUDT-SIRST and IRSTD-1k demonstrate the effectiveness of our method. Compared to other SOTA methods, our method can achieve better detection performance with only a single point of supervision.
- Y. Sun, J. Yang, and W. An, “Infrared dim and small target detection via multiple subspace learning and spatial-temporal patch-tensor model,” IEEE Transactions on Geoscience and Remote Sensing, vol. 59, no. 5, pp. 3737–3752, 2020.
- X. Ying, Y. Wang, L. Wang, W. Sheng, L. Liu, Z. Lin, and S. Zhou, “Mocopnet: Exploring local motion and contrast priors for infrared small target super-resolution,” arXiv e-prints, pp. arXiv–2201, 2022.
- M. Teutsch and W. Krüger, “Classification of small boats in infrared images for maritime surveillance,” in 2010 International WaterSide Security Conference. IEEE, 2010, pp. 1–7.
- Z. Wang, T. Zang, Z. Fu, H. Yang, and W. Du, “Rlpgb-net: Reinforcement learning of feature fusion and global context boundary attention for infrared dim small target detection,” IEEE Transactions on Geoscience and Remote Sensing, 2023.
- W. Zhang, M. Cong, and L. Wang, “Algorithms for optical weak small targets detection and tracking,” in International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003, vol. 1. IEEE, 2003, pp. 643–647.
- B. Li, C. Xiao, L. Wang, Y. Wang, Z. Lin, M. Li, W. An, and Y. Guo, “Dense nested attention network for infrared small target detection,” IEEE Transactions on Image Processing, vol. 32, pp. 1745–1758, 2022.
- C.-s. Zhang, Z. Shi, B. Feng, and B.-s. Xu, “Infrared lens thermal effect: equivalent focal shift and calculating model,” in International Symposium on Optoelectronic Technology and Application 2014: Infrared Technology and Applications, vol. 9300. SPIE, 2014, pp. 451–458.
- B. Yang, X. Zhang, J. Zhang, J. Luo, M. Zhou, and Y. Pi, “Eflnet: Enhancing feature learning network for infrared small target detection,” IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1–11, 2024.
- S. D. Deshpande, M. H. Er, R. Venkateswarlu, and P. Chan, “Max-mean and max-median filters for detection of small targets,” in Signal and Data Processing of Small Targets 1999, vol. 3809. SPIE, 1999, pp. 74–83.
- H. Wang and Y. Xin, “Wavelet-based contourlet transform and kurtosis map for infrared small target detection in complex background,” Sensors, vol. 20, no. 3, p. 755, 2020.
- T. M. Hussain, A. M. Baig, T. N. Saadawi, and S. A. Ahmed, “Infrared pyroelectric sensor for detection of vehicular traffic using digital signal processing techniques,” IEEE transactions on vehicular technology, vol. 44, no. 3, pp. 683–689, 1995.
- Y. Wei, X. You, and H. Li, “Multiscale patch-based contrast measure for small infrared target detection,” Pattern Recognition, vol. 58, pp. 216–226, 2016.
- Y. Qin, L. Bruzzone, C. Gao, and B. Li, “Infrared small target detection based on facet kernel and random walker,” IEEE Transactions on Geoscience and Remote Sensing, vol. 57, no. 9, pp. 7104–7118, 2019.
- Z. Qiu, Y. Ma, F. Fan, J. Huang, and M. Wu, “Adaptive scale patch-based contrast measure for dim and small infrared target detection,” IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1–5, 2022.
- C. Gao, D. Meng, Y. Yang, Y. Wang, X. Zhou, and A. G. Hauptmann, “Infrared patch-image model for small target detection in a single image,” IEEE transactions on image processing, vol. 22, no. 12, pp. 4996–5009, 2013.
- Y. Dai and Y. Wu, “Reweighted infrared patch-tensor model with both nonlocal and local priors for single-frame small target detection,” IEEE journal of selected topics in applied earth observations and remote sensing, vol. 10, no. 8, pp. 3752–3767, 2017.
- L. Zhang and Z. Peng, “Infrared small target detection based on partial sum of the tensor nuclear norm,” Remote Sensing, vol. 11, no. 4, p. 382, 2019.
- Y. Dai, Y. Wu, F. Zhou, and K. Barnard, “Attentional local contrast networks for infrared small target detection,” IEEE transactions on geoscience and remote sensing, vol. 59, no. 11, pp. 9813–9824, 2021.
- F. Liu, C. Gao, F. Chen, D. Meng, W. Zuo, and X. Gao, “Infrared small and dim target detection with transformer under complex backgrounds,” IEEE Transactions on Image Processing, vol. 32, pp. 5921–5932, 2023.
- G. Chen, W. Wang, and S. Tan, “Irstformer: A hierarchical vision transformer for infrared small target detection,” Remote Sensing, vol. 14, no. 14, p. 3258, 2022.
- H. Wang, L. Zhou, and L. Wang, “Miss detection vs. false alarm: Adversarial learning for small object segmentation in infrared images,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 8509–8518.
- Y. Dai, Y. Wu, F. Zhou, and K. Barnard, “Asymmetric contextual modulation for infrared small target detection,” in Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2021, pp. 950–959.
- Y. Dai, X. Li, F. Zhou, Y. Qian, Y. Chen, and J. Yang, “One-stage cascade refinement networks for infrared small target detection,” IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1–17, 2023.
- X. Ying, L. Liu, Y. Wang, R. Li, N. Chen, Z. Lin, W. Sheng, and S. Zhou, “Mapping degeneration meets label evolution: Learning infrared small target detection with single point supervision,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 15 528–15 538.
- M. Zhang, R. Zhang, Y. Yang, H. Bai, J. Zhang, and J. Guo, “Isnet: Shape matters for infrared small target detection,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 877–886.
- X. Bai and F. Zhou, “Analysis of new top-hat transformation and the application for infrared dim small target detection,” Pattern Recognition, vol. 43, no. 6, pp. 2145–2156, 2010.
- Z. Qiu, Y. Ma, F. Fan, J. Huang, M. Wu, and X. Mei, “A pixel-level local contrast measure for infrared small target detection,” Defence Technology, vol. 18, no. 9, pp. 1589–1601, 2022.
- L. Chen, T. Wu, S. Zheng, Z. Qiu, and F. Huang, “Robust unsupervised multi-feature representation for infrared small target detection,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024.
- M. Mirza and S. Osindero, “Conditional generative adversarial nets,” arXiv preprint arXiv:1411.1784, 2014.
- J. Du, H. Lu, L. Zhang, M. Hu, S. Chen, Y. Deng, X. Shen, and Y. Zhang, “A spatial-temporal feature-based detection framework for infrared dim small target,” IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–12, 2021.
- M. Ju, J. Luo, G. Liu, and H. Luo, “Istdet: An efficient end-to-end neural network for infrared small target detection,” Infrared Physics & Technology, vol. 114, p. 103659, 2021.
- X. Zhou, L. Jiang, C. Hu, S. Lei, T. Zhang, and X. Mou, “Yolo-sase: An improved yolo algorithm for the small targets detection in complex backgrounds,” Sensors, vol. 22, no. 12, p. 4600, 2022.
- K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778.
- A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam, “Mobilenets: Efficient convolutional neural networks for mobile vision applications,” arXiv preprint arXiv:1704.04861, 2017.
- K. Sun, B. Xiao, D. Liu, and J. Wang, “Deep high-resolution representation learning for human pose estimation,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 5693–5703.
- C. Li, A. Zhou, and A. Yao, “Omni-dimensional dynamic convolution,” arXiv preprint arXiv:2209.07947, 2022.
- Q. Hou, D. Zhou, and J. Feng, “Coordinate attention for efficient mobile network design,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 13 713–13 722.
- T. Zhang, L. Li, S. Cao, T. Pu, and Z. Peng, “Attention-guided pyramid context networks for detecting infrared small target under complex background,” IEEE Transactions on Aerospace and Electronic Systems, 2023.
- X. Wu, D. Hong, and J. Chanussot, “Uiu-net: U-net in u-net for infrared small object detection,” IEEE Transactions on Image Processing, vol. 32, pp. 364–376, 2022.