Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
125 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Real-Time Dense Stereo Embedded in A UAV for Road Inspection (1904.06017v1)

Published 12 Apr 2019 in cs.CV, cs.RO, and eess.IV

Abstract: The condition assessment of road surfaces is essential to ensure their serviceability while still providing maximum road traffic safety. This paper presents a robust stereo vision system embedded in an unmanned aerial vehicle (UAV). The perspective view of the target image is first transformed into the reference view, and this not only improves the disparity accuracy, but also reduces the algorithm's computational complexity. The cost volumes generated from stereo matching are then filtered using a bilateral filter. The latter has been proved to be a feasible solution for the functional minimisation problem in a fully connected Markov random field model. Finally, the disparity maps are transformed by minimising an energy function with respect to the roll angle and disparity projection model. This makes the damaged road areas more distinguishable from the road surface. The proposed system is implemented on an NVIDIA Jetson TX2 GPU with CUDA for real-time purposes. It is demonstrated through experiments that the damaged road areas can be easily distinguished from the transformed disparity maps.

Citations (54)

Summary

  • The paper introduces an innovative perspective transformation method that refines disparity accuracy while reducing computational complexity.
  • It uses an NVIDIA Jetson TX2 with CUDA for parallel processing, enabling robust real-time detection of road surface damage.
  • The approach outperforms methods like PSMNet under strict error tolerance, paving the way for automated, objective road monitoring.

Overview of Real-Time Dense Stereo Embedded in A UAV for Road Inspection

This paper by Fan et al. presents a technically sophisticated stereo vision system designed for road surface condition assessment using Unmanned Aerial Vehicles (UAVs). The system is embedded onto a UAV and leverages advanced algorithms to provide real-time insights about road safety conditions. The primary focus of the paper is on improving the precision and efficiency of stereo disparity maps, essential for detecting anomalies such as cracks and potholes on road surfaces.

Key Contributions

The authors introduce a perspective transformation method that refines disparity accuracy while decreasing the computational complexity of the algorithm. This approach mitigates the inherent distortions during image capture from different perspectives, which typically degrade stereo matching performance. By transforming the perspective in the target image to match the reference image, the system effectively reduces algorithmic demands and enhances matching fidelity.

A significant portion of the system's computational load revolves around a fully connected Markov random field (MRF) model, leveraged through bilateral filtering—a method previously validated as a feasible solution for energy minimization problems in dense stereo vision tasks.

Implementation and Results

The proposed system is executed on an NVIDIA Jetson TX2 GPU, utilizing CUDA for parallel processing efficiencies, thereby enabling real-time operation. The UAV equipped with this system demonstrates robust performance in distinguishing damaged areas from the road surface, as verified by both qualitative and quantitative experimentation. The system's algorithm achieves an impressive execution speed, measured in processed millions of disparity evaluations per second (Mde/s), attesting to its efficiency under practical constraints.

Comparison with existing methods such as PSMNet highlights the proposed system's proficiency. It outperforms PSMNet when error tolerance is refined, though PSMNet shows marginal advantages under broader constraints. Moreover, the system’s capability to handle motion-blurred images further underscores its robustness, particularly for UAV applications where jitter and movement are routine challenges.

Theoretical and Practical Implications

The approach taken in this paper advances the application of dense stereo vision technologies in real-world conditions where real-time feedback is vital, such as infrastructure monitoring and maintenance. From a theoretical perspective, it offers an efficient solution to mitigate the common trade-off between computational overhead and disparity map accuracy in local and global algorithms.

This research opens pathways for enhanced UAV functionalities, paving the way for automated, objective road inspection processes free from limitations endemic to manual assessments. Furthermore, the precision of results allows government bodies to allocate resources more strategically, based on accurate and timely assessments.

Future Trajectories

Future work could focus on integrating these disparity maps into Simultaneous Localization and Mapping (SLAM) algorithms for spatially accurate road condition mapping. Additionally, the extension of this system to automated defect classification via Convolutional Neural Networks (CNNs) suggests a promising direction for more nuanced, intelligent defect assessment.

In summary, this paper delivers a pragmatic and technically sound method for road inspection via UAVs, with substantive implications for both the academic and applied realms of autonomous visual inspection technology.

Youtube Logo Streamline Icon: https://streamlinehq.com