RIDERS: Radar-Infrared Depth Estimation for Robust Sensing (2402.02067v1)
Abstract: Dense depth recovery is crucial in autonomous driving, serving as a foundational element for obstacle avoidance, 3D object detection, and local path planning. Adverse weather conditions, including haze, dust, rain, snow, and darkness, introduce significant challenges to accurate dense depth estimation, thereby posing substantial safety risks in autonomous driving. These challenges are particularly pronounced for traditional depth estimation methods that rely on short-wavelength electromagnetic sensors, such as visible-spectrum cameras and near-infrared LiDAR, because these sensors are susceptible to diffraction noise and occlusion in such environments. To fundamentally overcome this issue, we present a novel approach for robust metric depth estimation that fuses a millimeter-wave Radar and a monocular infrared thermal camera, which can penetrate atmospheric particles and are unaffected by lighting conditions. Our proposed Radar-Infrared fusion method achieves highly accurate and finely detailed dense depth estimation through three stages: monocular depth prediction with global scale alignment, quasi-dense Radar augmentation by learning Radar-pixel correspondences, and local scale refinement of dense depth using a scale map learner. Our method achieves exceptional visual quality and accurate metric estimation by addressing the challenges of ambiguity and misalignment that arise from directly fusing multi-modal long-wave features. We evaluate the performance of our approach on the NTU4DRadLM dataset and our self-collected challenging ZJU-Multispectrum dataset. Especially noteworthy is the unprecedented robustness our method demonstrates in smoky scenarios. Our code will be released at https://github.com/MMOCKING/RIDERS.
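As a rough illustration of the first stage named in the abstract (global scale alignment), the sketch below fits a single scale factor that maps scale-free monocular depth onto sparse metric Radar depth. It is a minimal sketch under stated assumptions, not the authors' implementation: it assumes the monocular prediction is proportional to true depth, that Radar points have already been projected to pixels, and it uses a closed-form least-squares fit. The names `global_scale` and `apply_scale` are hypothetical.

```python
from typing import List, Sequence


def global_scale(mono_depth: Sequence[float], radar_depth: Sequence[float]) -> float:
    """Return the scale s minimizing sum((s * m_i - r_i) ** 2).

    mono_depth: scale-free monocular depth sampled at Radar pixels (assumed input).
    radar_depth: metric Radar depth at the same pixels (assumed input).
    """
    if len(mono_depth) != len(radar_depth) or len(mono_depth) == 0:
        raise ValueError("need equally sized, non-empty samples")
    # Closed-form least squares: s = sum(m * r) / sum(m * m).
    num = sum(m * r for m, r in zip(mono_depth, radar_depth))
    den = sum(m * m for m in mono_depth)
    if den == 0.0:
        raise ValueError("monocular depths are all zero")
    return num / den


def apply_scale(dense_mono: Sequence[Sequence[float]], scale: float) -> List[List[float]]:
    """Multiply every pixel of a dense depth map by the global scale."""
    return [[scale * value for value in row] for row in dense_mono]


if __name__ == "__main__":
    s = global_scale([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
    print(s)
    print(apply_scale([[1.0, 2.0], [3.0, 4.0]], s))
```

A single global factor cannot correct region-dependent errors, which is consistent with the abstract following this stage with a local scale refinement.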