Rig-Aware 3D Reconstruction of Vehicle Undercarriages using Gaussian Splatting

Published 20 Jan 2026 in cs.CV, cs.GR, and cs.LG | (2601.14208v1)

Abstract: Inspecting the undercarriage of used vehicles is a labor-intensive task that requires inspectors to crouch or crawl underneath each vehicle to thoroughly examine it. Additionally, online buyers rarely see undercarriage photos. We present an end-to-end pipeline that utilizes a three-camera rig to capture videos of the undercarriage as the vehicle drives over it, and produces an interactive 3D model of the undercarriage. The 3D model enables inspectors and customers to rotate, zoom, and slice through the undercarriage, allowing them to detect rust, leaks, or impact damage in seconds, thereby improving both workplace safety and buyer confidence. Our primary contribution is a rig-aware Structure-from-Motion (SfM) pipeline specifically designed to overcome the challenges of wide-angle lens distortion and low-parallax scenes. Our method overcomes the challenges of wide-angle lens distortion and low-parallax scenes by integrating precise camera calibration, synchronized video streams, and strong geometric priors from the camera rig. We use a constrained matching strategy with learned components, the DISK feature extractor, and the attention-based LightGlue matcher to generate high-quality sparse point clouds that are often unattainable with standard SfM pipelines. These point clouds seed the Gaussian splatting process to generate photorealistic undercarriage models that render in real-time. Our experiments and ablation studies demonstrate that our design choices are essential to achieve state-of-the-art quality.

Abstract PDF Upgrade to Chat

Authors (8)

Summary

The paper introduces a rig-aware 3D reconstruction pipeline that uses Gaussian splatting to generate photorealistic vehicle undercarriage models.
It employs precise calibration with ChArUco boards and adaptive video synchronization to overcome wide-angle distortion and low-parallax challenges.
The method achieves high photometric fidelity (PSNR 30.66 dB, SSIM 0.92) and real-time rendering (>130 FPS) for interactive vehicle inspections.

Rig-Aware 3D Reconstruction of Vehicle Undercarriages using Gaussian Splatting

Introduction

The paper "Rig-Aware 3D Reconstruction of Vehicle Undercarriages using Gaussian Splatting" (2601.14208) advances the state of automated vehicle inspection by introducing a robust, efficient pipeline for reconstructing interactive 3D models of vehicle undercarriages. Centered on a three-camera rig with extreme wide-angle lenses, the system tackles domain-specific challenges such as severe lens distortion, low parallax, and spatial-temporal alignment, culminating in photorealistic reconstructions via 3D Gaussian splatting. This approach enables detailed remote inspections, facilitating defect identification (e.g., rust, leaks), improving operational safety, and enhancing transparency for online automotive marketplaces.

Figure 1: End-to-end 3D reconstruction pipeline—from calibration and synchronization to SfM and real-time Gaussian splatting.

Technical Contributions

Precise Camera Calibration

Critical to geometric fidelity, the system begins with a one-time, high-precision calibration using a ChArUco board. By fitting an eight-parameter Full OpenCV distortion model and leveraging robust sub-pixel corner detection, the pipeline quantifies and rectifies severe wide-angle distortions. The calibration process greatly lowers RMS reprojection error, ensuring subsequent SfM accuracy.

Figure 2: ChArUco board used for accurate camera intrinsic calibration.

Evaluation shows that undistortion is essential:

Figure 3: Left: raw calibration image; Right: undistorted image using Full OpenCV model.

Figure 4: Left: raw undercarriage image; Right: undistorted image using Full OpenCV model.

Video Synchronization

Hardware triggers cannot eliminate multi-camera latency; consequently, temporal misalignment must be corrected via global vertical motion estimation and adaptive offset optimization. The pipeline employs phase correlation and L1 loss minimization for millisecond-level synchronization, producing well-aligned sequences necessary for reliable multi-view geometry.

Figure 5: Vertical motion shift curves for unsynchronized videos.

Rig-Aware Structure-from-Motion

Frame Selection and Feature Extraction

To maximize scene sharpness and minimize motion blur, the system samples triplets of the sharpest frames using Laplacian variance scoring. The images undergo CLAHE preprocessing, substantially boosting detectable features in underexposed regions.

A learned descriptor, DISK, is adopted for feature extraction in challenging undercarriage scenes; this is paired with LightGlue, an attention-based matcher, to robustly compute correspondences even under extreme distortion and repetitive scene structure.

Constrained Feature Matching and Geometric Priors

Feature matching is efficiently limited to spatiotemporally proximate frame pairs and stereo baselines suitable for the rig configuration. The pipeline omits impractical cross-baseline matchings and employs COLMAP for geometric verification. Critically, bundle adjustment incorporates explicit rig priors on inter-camera translation, mitigating drift and degenerate solutions under low parallax.

Figure 6: Dense sparse point cloud representing vehicle undercarriage geometry.

Ablation results highlight the necessity of each component. Removal of calibration, custom matching, or priors consistently produces noisy or incomplete reconstructions.

Figure 7: Degenerate, noisy renders result from omitted calibration, matching, or pose prior components.

3D Gaussian Splatting and Rendering

Gaussian splatting explicitly models the scene using optimized anisotropic 3D Gaussians, enabling radiance field rendering at interactive rates. The point cloud from rig-aware SfM initializes means, with spherical harmonics for color and covariance adapted to local density. The optimization converges efficiently, leveraging the geometric accuracy of the SfM priors.

Resulting models exhibit strong photometric fidelity (mean PSNR 30.66 dB, SSIM 0.92, LPIPS 0.19) and real-time rendering performance ( $>$ 130 FPS). Diagnostic features, including rust patterns, bolt heads, and fluid marks, are consistently resolved.

Figure 8: Gaussian splat render—fine-grained diagnostic texture, significant rust accurately captured.

Figure 9: Photorealistic 3D undercarriage model—multiple perspectives reveal comprehensive geometry and details.

Practical and Theoretical Implications

This methodology directly enables reliable, interactive undercarriage inspections in online marketplaces, supporting transparency and informed purchasing decisions. The approach sets a computational benchmark for real-time visual inspection, paving the way for automated damage detection, longitudinal fleet monitoring, and scalable insurance claim evaluation.

Theoretical contributions include the integration of strong physical priors into multi-view geometry under extreme conditions and the establishment of a reproducible pipeline for SfM-seeded explicit radiance field optimization. The demonstrated fusion of learned matchers and constrained correspondence strategies suggests extensions into other domains with low-parallax, wide-angle capture geometries.

Future research avenues include automating condition assessment via multimodal learning (e.g., defect segmentation), generalizing the pipeline for arbitrary industrial inspection scenarios, and integrating uncertainty quantification in the geometric reconstruction and rendering stages.

Conclusion

The rig-aware pipeline for 3D vehicle undercarriage reconstruction exemplifies effective synergy between calibrated multi-view geometry, robust learned feature matching, and efficient explicit radiance modeling. Achieving superior real-time, interactive renderings, the system not only fulfills practical requirements of remote automotive inspection but also provides a blueprint for robust 3D reconstruction in challenging environments. Extensions of this work could transform automated inspection across domains, leveraging both geometric and photometric precision for condition assessment and anomaly detection.

Markdown Report Issue