DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection

Published 23 Oct 2024 in cs.CV | (2410.17822v1)

Abstract: Object detection algorithms are pivotal components of unmanned aerial vehicle (UAV) imaging systems and are widely deployed in complex application fields. However, images captured by high-mobility UAVs often suffer from motion blur, which significantly degrades the performance of advanced object detection algorithms. To address this challenge, we propose an object detection algorithm specifically designed for blurry images, named DREB-Net (Dual-stream Restoration Embedding Blur-feature Fusion Network). First, DREB-Net addresses the particularities of the blurry-image object detection problem by incorporating a Blurry image Restoration Auxiliary Branch (BRAB) during the training phase. Second, it fuses the extracted shallow features via a Multi-level Attention-Guided Feature Fusion (MAGFF) module to obtain richer features. The MAGFF module comprises local attention modules and global attention modules, which assign different weights to the branches. Then, during the inference phase, the deep feature extraction of the BRAB can be removed to reduce computational complexity and improve detection speed. For the loss function, a combined MSE and SSIM loss is applied to the BRAB to supervise the restoration of blurry images. Finally, DREB-Net introduces the Fast Fourier Transform in the early stages of feature extraction, via a Learnable Frequency domain Amplitude Modulation Module (LFAMM), to adjust feature amplitude and enhance feature processing capability. Experimental results indicate that DREB-Net can still perform object detection effectively when the captured images are motion-blurred, showcasing excellent performance and broad application prospects. Our source code will be available at https://github.com/EEIC-Lab/DREB-Net.git.
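
The abstract states that the BRAB is trained with a combined MSE and SSIM loss but does not give the exact formulation. The following is a minimal illustrative sketch, not the authors' implementation. It assumes images are flat sequences of pixel values in [0, 1], computes SSIM over the whole image as a single window rather than over local windows, and combines the terms as MSE + `ssim_weight` × (1 − SSIM). The function names, the `ssim_weight` parameter, and this way of combining the two terms are all assumptions made for illustration.

```python
def mse(x, y):
    """Mean squared error between two equally sized pixel sequences."""
    return sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)


def global_ssim(x, y, data_range=1.0):
    """SSIM with the whole image treated as one window (a simplification;
    SSIM is usually averaged over local windows)."""
    c1 = (0.01 * data_range) ** 2  # standard SSIM stabilising constants
    c2 = (0.03 * data_range) ** 2
    n = len(x)
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


def restoration_loss(restored, sharp, ssim_weight=1.0):
    """Hypothetical combined loss: MSE + ssim_weight * (1 - SSIM).
    The weighting scheme is an assumption, not taken from the paper."""
    if len(restored) != len(sharp):
        raise ValueError("images must have the same number of pixels")
    return mse(restored, sharp) + ssim_weight * (1.0 - global_ssim(restored, sharp))


if __name__ == "__main__":
    sharp = [0.0, 0.2, 0.9, 1.0, 0.1, 0.0]
    blurred = [0.1, 0.3, 0.7, 0.8, 0.2, 0.1]  # flattened contrast, same mean
    print("loss(sharp, sharp)   =", restoration_loss(sharp, sharp))
    print("loss(blurred, sharp) =", restoration_loss(blurred, sharp))
```

In this sketch the MSE term penalises per-pixel error while the (1 − SSIM) term penalises loss of contrast and structure, so a restored output that matches the sharp target drives both terms toward zero.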
