Edge Detection-Based ICP (ED-ICP) Algorithm
- Edge Detection-Based ICP (ED-ICP) is a method for robust monocular visual odometry that integrates edge alignment within an ICP framework.
- The algorithm employs a coarse-to-fine, uncertainty-driven edge-guided data association along with adaptive patch sizing for precise real-time pose estimation.
- Quantitative evaluations on synthetic and real-world datasets demonstrate reduced drift and failure rates compared to classical methods under challenging illumination and motion.
Edge Detection-Based ICP (ED-ICP) is a methodology for monocular visual odometry that exploits edge feature alignment within an Iterative Closest Point (ICP) framework, augmented with a coarse-to-fine, uncertainty-driven edge-guided data association scheme. The approach is designed for robust camera tracking under challenging illumination and large motion, enabling resilient and accurate real-time pose estimation with improved performance over classical point-based and direct methods (Wu et al., 2019).
1. ICP-Based Edge Registration: Front-End
The front-end initiates by extracting a set of prominent 2D edges from the incoming image using detectors such as Canny, Structured Forest (SE), or Holistically-Nested Edge Detection (HED). Each reference-frame edge pixel is characterized by its position and local gradient direction , locally representing the edge as a 1D curve with normal and tangent .
Given a relative pose estimate between frames, each reference edge pixel is back-projected using its inverse-depth and re-projected into the current frame with
The nearest-neighbor edge pixel in the current frame is found by minimizing the Euclidean distance. The matching cost employs a point-to-tangent residual: where is the local gradient direction at the matched pixel. The Huber-weighted ICP objective to be minimized is
The optimization proceeds in a multi-scale image pyramid, using Gauss-Newton updates with pose composition , providing robust but coarse motion estimation and initial data association.
2. Coarse-to-Fine Edge-Guided Data Association
Following the ICP phase, refined edge correspondence is conducted to address the partial observability induced by lack of unique edge descriptors. This proceeds as a 1D template-matching search along the tangent direction , but the search interval is bounded by a probabilistic uncertainty-driven radius.
The search length is set as
where is the angle between the epipolar line and edge normal, is the projected uncertainty along , is the depth-disparity uncertainty, and , are empirically tuned gains.
Candidate matches are extracted along within , each pre-warped according to the current pose and depth hypothesis, with a patch of image-gradient magnitudes extracted. The L cost for each candidate is
and the match is selected as the refined correspondence.
3. Geometric Uncertainty Analysis and Dynamic Search Bounding
The search interval and depth uncertainty metrics are analytically derived via point-to-edge geometric uncertainty analysis. The intersection of the edge direction and the epipolar line yields: and the variance: with an upper bound .
The disparity and its variance are: where the directional uncertainties are decomposed using the principal axes and variances . The depth confidence is defined by .
4. Match Confidence and Adaptive Patch Sizing
To resolve ambiguities on low-texture or flat edges, the algorithm computes the Attainable Maximum Likelihood (AML) confidence
where is the minimal patch cost in the current search. If , the patch size is incremented to include additional gradient structure and re-evaluate the candidates. If confidence remains insufficient, matching falls back to the initial ICP result, and for the lowest confidence (small ), the corresponding depth estimate is held fixed in bundle adjustment.
5. Bundle Adjustment and Joint Optimization
Refined correspondences with associated depth and confidence metrics are aggregated in a local window for a joint optimization over all poses and depths . The cost per correspondence incorporates:
- the point-to-tangent residual,
- a reprojection residual:
- an optional photometric gradient-consistency term: The objective minimized is: using frameworks such as g2o or custom Levenberg-Marquardt solvers. Corrrespondences with low depth-confidence are included with depths fixed to preserve mapping integrity.
6. Quantitative Evaluation and Comparative Performance
ED-ICP was evaluated on synthetic and real-world datasets with varying photometric noise and illumination. On vKITTI with day/night variance, it attained the lowest translational drift (approximately $1.5$–$2$ cm/m), with core processing times of ms (matching) and ms (ICP) per frame. Classical Lucas-Kanade and census-based optical flows were outperformed both in accuracy and speed, especially under high inter-frame displacement.
On the real-world Symphony-Lake dataset (1.5M images, strong sun-glare, and auto-exposure shifts), ED-ICP achieved the lowest failure rate (1–3% per survey) and low drift (as little as $6$ cm/m in winter). Under frame subsampling (fast motion), drift degraded only slightly, while other approaches displayed $2$– increased failure. Full system runtime was approximately $80$ ms/frame for tracking plus $200$ ms/frame for mapping with a laptop and GPU edge-detector.
KITTI benchmark results (after scale correction) reported mean Absolute Trajectory Error (ATE) of $11.5$ m over sequences $00$–$09$ (matching ORB-SLAM2 and outperforming DSO; $11.5$ m vs $12.1$ m ATE).
7. Significance and Methodological Contributions
ED-ICP integrates ICP-based edge registration with a fast, analytically bounded 1D photometric refinement to maximize robustness against illumination and geometric perturbations in monocular visual odometry. Its coarse-to-fine data association pipeline, guided by geometric and photometric uncertainty analysis, yields superior performance in complex conditions (illumination change, large motion). This suggests a broader applicability in low-texture or challenging lighting regimes.
By coupling uncertainty-driven correspondence search with dynamic patch size adaptation and rigorous back-end optimization, ED-ICP achieves resilience and efficiency not available in classical direct or feature-based pipelines (Wu et al., 2019). This hybrid approach is significant for real-time VO systems requiring high reliability under adverse environmental variation.