
RIDERS: Radar-Infrared Depth Estimation for Robust Sensing (2402.02067v1)

Published 3 Feb 2024 in cs.CV

Abstract: Dense depth recovery is crucial in autonomous driving, serving as a foundational element for obstacle avoidance, 3D object detection, and local path planning. Adverse weather conditions, including haze, dust, rain, snow, and darkness, make accurate dense depth estimation significantly harder and therefore pose substantial safety risks in autonomous driving. These challenges are particularly pronounced for traditional depth estimation methods that rely on short-wavelength electromagnetic sensors, such as visible-spectrum cameras and near-infrared LiDAR, because these sensors are susceptible to diffraction noise and occlusion in such environments. To fundamentally overcome this issue, we present a novel approach for robust metric depth estimation that fuses a millimeter-wave Radar with a monocular infrared thermal camera, both of which can penetrate atmospheric particles and are unaffected by lighting conditions. Our proposed Radar-Infrared fusion method achieves highly accurate and finely detailed dense depth estimation through three stages: monocular depth prediction with global scale alignment, quasi-dense Radar augmentation by learning Radar-pixel correspondences, and local scale refinement of dense depth using a scale map learner. Our method achieves exceptional visual quality and accurate metric estimation by addressing the ambiguity and misalignment that arise from directly fusing multi-modal long-wave features. We evaluate the performance of our approach on the NTU4DRadLM dataset and our self-collected, challenging ZJU-Multispectrum dataset. Especially noteworthy is the unprecedented robustness our method demonstrates in smoky scenarios. Our code will be released at https://github.com/MMOCKING/RIDERS.
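
The first stage named in the abstract, global scale alignment, can be illustrated with a minimal sketch. The abstract does not say how the scale is computed, so the closed-form least-squares fit below is an assumption chosen for illustration, not the authors' procedure. The function names `global_scale` and `align` and the flat-list depth representation are hypothetical.

```python
def global_scale(mono_depth, radar_depth):
    """Return the scale s that minimizes sum((s * m - r)^2).

    mono_depth: scale-free monocular depth values, one per pixel.
    radar_depth: metric Radar depth at the same pixels, or None where
    no Radar point projects onto that pixel.
    NOTE: a least-squares fit is an illustrative assumption; the paper
    may estimate the global scale differently.
    """
    # Keep only pixels that carry a valid Radar return and positive depths.
    pairs = [(m, r) for m, r in zip(mono_depth, radar_depth)
             if r is not None and m > 0 and r > 0]
    if not pairs:
        raise ValueError("no valid Radar/monocular depth pairs")
    # Closed-form solution of the one-parameter least-squares problem.
    num = sum(m * r for m, r in pairs)
    den = sum(m * m for m, _ in pairs)
    return num / den


def align(mono_depth, radar_depth):
    """Scale the whole monocular prediction by the single global factor."""
    s = global_scale(mono_depth, radar_depth)
    return [s * m for m in mono_depth]


if __name__ == "__main__":
    mono = [1.0, 2.0, 4.0]       # relative depth from a monocular network
    radar = [2.5, None, 10.0]    # sparse metric Radar depth (metres)
    print(align(mono, radar))
```

A single global factor cannot correct errors that vary across the image, which is consistent with the abstract following this stage with Radar augmentation and a local scale refinement.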
