Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ADCNet: Learning from Raw Radar Data via Distillation (2303.11420v3)

Published 21 Mar 2023 in eess.SP, cs.AI, and cs.CV

Abstract: As autonomous vehicles and advanced driving assistance systems have entered wider deployment, there is an increased interest in building robust perception systems using radars. Radar-based systems are lower cost and more robust to adverse weather conditions than their LiDAR-based counterparts; however the point clouds produced are typically noisy and sparse by comparison. In order to combat these challenges, recent research has focused on consuming the raw radar data, instead of the final radar point cloud. We build on this line of work and demonstrate that by bringing elements of the signal processing pipeline into our network and then pre-training on the signal processing task, we are able to achieve state of the art detection performance on the RADIal dataset. Our method uses expensive offline signal processing algorithms to pseudo-label data and trains a network to distill this information into a fast convolutional backbone, which can then be finetuned for perception tasks. Extensive experiment results corroborate the effectiveness of the proposed techniques.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (40)
  1. The Oxford radar robotcar dataset: A radar extension to the oxford robotcar dataset. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Paris, 2020.
  2. Graham M Brooker. Understanding millimetre wave FMCW radars. In 1st International Conference on Sensing Technology, 2005.
  3. Language models are few-shot learners. Advances in Neural Information Processing Systems, 33:1877–1901, 2020.
  4. A simple framework for contrastive learning of visual representations. In Proceedings of the 37th International Conference on Machine Learning, pages 1597–1607. PMLR, 2020.
  5. Generative pre-training for speech with autoregressive predictive coding. In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 3497–3501. IEEE, 2020.
  6. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018.
  7. Distil-whisper: Robust knowledge distillation via large-scale pseudo labelling, 2023.
  8. T-fftradnet: Object detection with swin vision transformers from raw adc radar signals, 2023.
  9. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9729–9738, 2020.
  10. Peter J Huber. Robust estimation of a location parameter. Breakthroughs in statistics: Methodology and distribution, pages 492–518, 1992.
  11. CramNet: Camera-radar fusion with ray-constrained cross-attention for robust 3d object detection. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXVIII, pages 388–405. Springer, 2022.
  12. Cross-modal supervision-based multitask learning with automotive radar raw data. IEEE Transactions on Intelligent Vehicles, PP:1–15, 2023.
  13. Jian Li. Over a century of array signal processing. https://ieee-aess.org/files/ieeeaess/slides/2023AESSPowerPointDLJianLi.pdf. Accessed 2023-11-17.
  14. Contrastive unsupervised learning for speech emotion recognition. In 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 6329–6333. IEEE, 2021.
  15. Exploiting temporal relations on radar perception for autonomous driving. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 17071–17080, 2022a.
  16. Modality-agnostic learning for radar-lidar fusion in vehicle detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 918–927, 2022b.
  17. Focal loss for dense object detection. In Proceedings of the IEEE International Conference on Computer Vision, pages 2980–2988, 2017.
  18. Echoes beyond points: Unleashing the power of raw radar data in multi-modality fusion. In Thirty-seventh Conference on Neural Information Processing Systems, 2023.
  19. Graph convolutional networks for 3d object detection on radar data. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 3060–3069, 2021.
  20. Centerfusion: Center-based radar and camera fusion for 3d object detection. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 1527–1536, 2021.
  21. Polarnet: Accelerated deep open space segmentation using automotive radar in polar domain. arXiv preprint arXiv:2103.03387, 2021.
  22. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748, 2018.
  23. K-radar: 4d radar object detection for autonomous driving in various weather conditions. In Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2022.
  24. CNN based road user detection using the 3d radar cube. IEEE Robotics and Automation Letters, 5(2):1263–1270, 2020.
  25. Robust multimodal vehicle detection in foggy weather using complementary lidar and radar signals. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 444–453, 2021.
  26. Sandeep Rao. MIMO radar. https://www.ti.com/lit/an/swra554a/swra554a.pdf?ts=1675435882950\&ref_url=https\%253A\%252F\%252Fwww.ti.com\%252Fproduct\%252FIWR6843. Accessed: 2023-02-19.
  27. Raw high-definition radar for multi-task learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 17021–17030, 2022.
  28. Iterative adaptive approaches to mimo radar imaging. IEEE Journal of Selected Topics in Signal Processing, 4(1):5–20, 2010.
  29. ESPRIT-estimation of signal parameters via rotational invariance techniques. IEEE Transactions on Acoustics, Speech, and Signal Processing, 37(7):984–995, 1989.
  30. Liranet: End-to-end trajectory prediction using spatio-temporal radar fusion. arXiv preprint arXiv:2010.00731, 2020.
  31. RADIATE: A radar dataset for automotive perception in bad weather. In 2021 IEEE International Conference on Robotics and Automation (ICRA), pages 1–7. IEEE, 2021.
  32. Mimo radar for advanced driver-assistance systems and autonomous driving: Advantages and challenges. IEEE Signal Processing Magazine, 37(4):98–117, 2020.
  33. Harry L Van Trees. Optimum array processing: Part IV of detection, estimation, and modulation theory. John Wiley & Sons, 2002.
  34. Zeroflow: Scalable scene flow via distillation, 2023.
  35. RODNet: Radar object detection using cross-modal supervision. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pages 504–513, 2021a.
  36. Rethinking of radar’s role: A camera-radar dataset and systematic annotator via coordinate alignment. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2815–2824, 2021b.
  37. Radarnet: Exploiting radar for robust perception of dynamic objects. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XVIII 16, pages 496–512. Springer, 2020.
  38. Learning to detect mobile objects from lidar scans without labels. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 1120–1130, Los Alamitos, CA, USA, 2022. IEEE Computer Society.
  39. BigSSL: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition. IEEE Journal of Selected Topics in Signal Processing, 16(6):1519–1532, 2022.
  40. Cubelearn: End-to-end learning for human motion recognition from raw mmwave radar signals. IEEE Internet of Things Journal, 2023.
Citations (4)

Summary

We haven't generated a summary for this paper yet.