Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
129 tokens/sec
GPT-4o
28 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Improved Dense Nested Attention Network Based on Transformer for Infrared Small Target Detection (2311.08747v3)

Published 15 Nov 2023 in cs.CV

Abstract: Infrared small target detection based on deep learning offers unique advantages in separating small targets from complex and dynamic backgrounds. However, the features of infrared small targets gradually weaken as the depth of convolutional neural network (CNN) increases. To address this issue, we propose a novel method for detecting infrared small targets called improved dense nested attention network (IDNANet), which is based on the transformer architecture. We preserve the dense nested structure of dense nested attention network (DNANet) and introduce the Swin-transformer during feature extraction stage to enhance the continuity of features. Furthermore, we integrate the ACmix attention structure into the dense nested structure to enhance the features of intermediate layers. Additionally, we design a weighted dice binary cross-entropy (WD-BCE) loss function to mitigate the negative impact of foreground-background imbalance in the samples. Moreover, we develop a dataset specifically for infrared small targets, called BIT-SIRST. The dataset comprises a significant amount of real-world targets and manually annotated labels, as well as synthetic data and corresponding labels. We have evaluated the effectiveness of our method through experiments conducted on public datasets. In comparison to other state-of-the-art methods, our approach outperforms in terms of probability of detection ($P_d$), false-alarm rate ($F_a$), and mean intersection of union ($mIoU$). The $mIoU$ reaches 90.89\% on the NUDT-SIRST dataset and 79.72\% on the SIRST dataset. The BIT-SIRST dataset and codes are available openly at \href{https://github.com/EdwardBao1006/bit\_sirst}{\color[HTML]{B22222}{https://github.com/EdwardBao1006/bit\_sirst}}.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (45)
  1. An analysis of the softmax cross entropy loss for learning-to-rank with binary relevance, in: Proc. ACM SIGIR Int. Conf. Theory Inform. Retr., pp. 75–78.
  2. A local contrast method for small infrared target detection. IEEE Trans. Geosci. Remote Sensing 52, 574–581.
  3. Local patch network with global attention for infrared small target detection. IEEE Trans. Aerosp. Electron. Syst. 58, 3979–3991.
  4. One-stage cascade refinement networks for infrared small target detection. IEEE Trans. Geosci. Remote Sensing 61, 1–17.
  5. Attentional local contrast networks for infrared small target detection. IEEE Trans. Geosci. Remote Sensing 59, 9813–9824.
  6. Ieee winter conf. appl. comput. vis., in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 950–959.
  7. Max-mean and max-median filters for detection of small targets, in: Signal and Data Processing of Small Targets 1999, SPIE. pp. 74–83.
  8. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 .
  9. A spatial-temporal feature-based detection framework for infrared dim small target. IEEE Trans. Geosci. Remote Sensing 60, 1–12.
  10. Adaptive subgradient methods for online learning and stochastic optimization. J. Mach. Learn. Res. 12.
  11. Infrared patch-image model for small target detection in a single image. IEEE Trans. Image Process. 22, 4996–5009.
  12. Fast r-cnn, in: Proc. IEEE Int. Conf. Comput. Vision, pp. 1440–1448.
  13. Infrared small target detection utilizing the multiscale relative local contrast measure. IEEE Geosci. Remote Sens. Lett. 15, 612–616.
  14. A robust infrared small target detection algorithm based on human visual system. IEEE Geosci. Remote Sens. Lett. 11, 2168–2172.
  15. A local contrast method for infrared small-target detection utilizing a tri-layer window. IEEE Geosci. Remote Sens. Lett. 17, 1822–1826.
  16. Kcpnet: Knowledge-driven context perception networks for ship detection in infrared imagery. IEEE Trans. Geosci. Remote Sensing 61, 1–19.
  17. Infrared small target segmentation networks: A survey. Pattern Recognit. 143, 109788.
  18. Dense nested attention network for infrared small target detection. IEEE Trans. Image Process. 32, 1745–1758.
  19. Dice loss for data-imbalanced nlp tasks. arXiv preprint arXiv:1911.02855 .
  20. Tiny and dim infrared target detection based on weighted local contrast. IEEE Geosci. Remote Sens. Lett. 15, 1780–1784.
  21. Ssd: Single shot multibox detector, in: Eur. Conf. Comput. Vis., Springer. pp. 21–37.
  22. Swin transformer v2: Scaling up capacity and resolution, in: IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recogn., pp. 12009–12019.
  23. Infrared small target detection based on joint local contrast measures. Optik 273, 170437.
  24. A false-alarm aware methodology to develop robust and efficient multi-scale infrared small target detection algorithm. Infrared Phys. Technol. 89, 387–397.
  25. Local contrast attention guide network for detecting infrared small targets. IEEE Trans. Geosci. Remote Sensing .
  26. Abc: Attention with bilinear correlation for infrared small target detection. arXiv preprint arXiv:2303.10321 .
  27. On the integration of self-attention and convolution, in: Proc. IEEE Comput. Soc. Conf. Comput. Vision Pattern. Recognit., pp. 815–825.
  28. Pytorch: An imperative style, high-performance deep learning library. Adv. Neural Inf. Process. Syst. 32.
  29. Faster r-cnn: Towards real-time object detection with region proposal networks. Adv. neural inf. proces. syst. 28.
  30. Detection of dim targets in digital infrared imagery by morphological image processing. Opt. Eng. 35, 1886–1893.
  31. Yolov7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors, in: Proc. IEEE Comput. Soc. Conf. Comput. Vision Pattern. Recognit., pp. 7464–7475.
  32. Miss detection vs. false alarm: Adversarial learning for small object segmentation in infrared images, in: Proc. IEEE Int. Conf. Comput. Vision, pp. 8509–8518.
  33. Interior attention-aware network for infrared small target detection. IEEE Trans. Geosci. Remote Sensing 60, 1–13.
  34. Ship detection in spaceborne infrared image based on lightweight cnn and multisource feature cascade decision. IEEE Trans. Geosci. Remote Sensing 59, 4324–4339.
  35. Multiscale patch-based contrast measure for small infrared target detection. Pattern Recognit. 58, 216–226.
  36. Srcanet: Stacked residual coordinate attention network for infrared ship detection. IEEE Trans. Geosci. Remote Sensing 60, 1–14.
  37. Mtu-net: Multilevel transunet for space-based infrared tiny ship detection. IEEE Trans. Geosci. Remote Sensing 61, 1–15.
  38. Uiu-net: U-net in u-net for infrared small object detection. IEEE Trans. Image Process. 32, 364–376.
  39. Infrared small target detection based on local contrast-weighted multidirectional derivative. IEEE Trans. Geosci. Remote Sensing 61, 1–16.
  40. Mapping degeneration meets label evolution: Learning infrared small target detection with single point supervision, in: IEEE Conf. Comput. Vis. Pattern Recognit., pp. 15528–15538.
  41. Infrared small target detection via non-convex rank approximation minimization joint l 2, 1 norm. Remote Sens. 10, 1821.
  42. Infrared small target detection based on partial sum of the tensor nuclear norm. Remote Sens. 11, 382.
  43. Isnet: Shape matters for infrared small target detection, in: Proc. IEEE Conf. Comput. Vis. Pattern Recognit., pp. 877–886.
  44. Agpcnet: Attention-guided pyramid context networks for infrared small target detection. arXiv preprint arXiv:2111.03580 .
  45. Infrared small target detection based on gradient correlation filtering and contrast measurement. IEEE Trans. Geosci. Remote Sensing 61, 1–12.
Citations (4)

Summary

We haven't generated a summary for this paper yet.