Papers
Topics
Authors
Recent
2000 character limit reached

Edge Detection-Based ICP (ED-ICP) Algorithm

Updated 19 December 2025
  • Edge Detection-Based ICP (ED-ICP) is a method for robust monocular visual odometry that integrates edge alignment within an ICP framework.
  • The algorithm employs a coarse-to-fine, uncertainty-driven edge-guided data association along with adaptive patch sizing for precise real-time pose estimation.
  • Quantitative evaluations on synthetic and real-world datasets demonstrate reduced drift and failure rates compared to classical methods under challenging illumination and motion.

Edge Detection-Based ICP (ED-ICP) is a methodology for monocular visual odometry that exploits edge feature alignment within an Iterative Closest Point (ICP) framework, augmented with a coarse-to-fine, uncertainty-driven edge-guided data association scheme. The approach is designed for robust camera tracking under challenging illumination and large motion, enabling resilient and accurate real-time pose estimation with improved performance over classical point-based and direct methods (Wu et al., 2019).

1. ICP-Based Edge Registration: Front-End

The front-end initiates by extracting a set of prominent 2D edges from the incoming image using detectors such as Canny, Structured Forest (SE), or Holistically-Nested Edge Detection (HED). Each reference-frame edge pixel prp_r is characterized by its position prΩR2p_r \in \Omega \subset \mathbb{R}^2 and local gradient direction g(pr)g(p_r), locally representing the edge as a 1D curve with normal g(pr)g(p_r) and tangent gg_\perp.

Given a relative pose estimate ξse(3)\xi \in \mathfrak{se}(3) between frames, each reference edge pixel is back-projected using its inverse-depth drd_r and re-projected into the current frame with

pkr=π(Rkrπ1(pr,dr)+tkr).p_{kr} = \pi(R_{kr} \pi^{-1}(p_r, d_r) + t_{kr}).

The nearest-neighbor edge pixel n(pkr)n(p_{kr}) in the current frame is found by minimizing the Euclidean distance. The matching cost employs a point-to-tangent residual: ri(ξ)=g(n(pkr))[pkrn(pkr)],r_i(\xi) = g(n(p_{kr}))^\top \left[p_{kr} - n(p_{kr})\right], where g(n(pkr))g(n(p_{kr})) is the local gradient direction at the matched pixel. The Huber-weighted ICP objective to be minimized is

EkrE(ξ)=prSrEwprρ(g(n(pkr))[pkrn(pkr)]).E^E_{kr}(\xi) = \sum_{p_r \in S_r^E} w_{p_r} \rho\left( g(n(p_{kr}))^\top [p_{kr} - n(p_{kr})] \right).

The optimization proceeds in a multi-scale image pyramid, using Gauss-Newton updates δηR6\delta\eta \in \mathbb{R}^6 with pose composition ξexp(δη)ξ\xi \leftarrow \exp(\delta\eta) \circ \xi, providing robust but coarse motion estimation and initial data association.

2. Coarse-to-Fine Edge-Guided Data Association

Following the ICP phase, refined edge correspondence is conducted to address the partial observability induced by lack of unique edge descriptors. This proceeds as a 1D template-matching search along the tangent direction gg_\perp, but the search interval is bounded by a probabilistic uncertainty-driven radius.

The search length is set as

λ1/2=kpσpg+kμσμsinθ,\lambda_{1/2} = k_p \sigma_{p\perp g} + k_\mu \sigma_\mu |\sin\theta|,

where θ\theta is the angle between the epipolar line and edge normal, σpg\sigma_{p\perp g} is the projected uncertainty along gg_\perp, σμ\sigma_\mu is the depth-disparity uncertainty, and kpk_p, kμ1k_\mu \approx 1 are empirically tuned gains.

Candidate matches qq_\ell are extracted along gg_\perp within [±λ1/2][\pm \lambda_{1/2}], each pre-warped according to the current pose and depth hypothesis, with a patch of image-gradient magnitudes FF extracted. The L2_2 cost for each candidate is

c=Fk(q)Fr(pr)22,c_\ell = \| F_k(q_\ell) - F_r(p_r) \|_2^2,

and the match q=argmincq^* = {\arg\min}_\ell c_\ell is selected as the refined correspondence.

3. Geometric Uncertainty Analysis and Dynamic Search Bounding

The search interval and depth uncertainty metrics are analytically derived via point-to-edge geometric uncertainty analysis. The intersection of the edge direction S={p+λg}S = \{ p^* + \lambda g_\perp \} and the epipolar line L={p+μl}L = \{ p + \mu l \} yields: λ(p,μ)=pp,g+μl,g=epg+μsinθ,\lambda(p, \mu) = \langle p^* - p, g_\perp \rangle + \mu \langle l, g_\perp \rangle = e_{p\perp g} + \mu \sin\theta, and the variance: σλ2=σpg2+σμ2sin2θ,\sigma_\lambda^2 = \sigma_{p\perp g}^2 + \sigma_\mu^2 \sin^2\theta, with an upper bound σλσpg+σμsinθ\sigma_\lambda \leq \sigma_{p\perp g} + \sigma_\mu |\sin\theta|.

The disparity μ\mu and its variance are: μ(p)=pp,gg,l=epgcosθ,σμ2=σpg2cos2θ,\mu(p) = \frac{\langle p^* - p, g \rangle}{\langle g, l \rangle} = \frac{e_{p \parallel g}}{\cos\theta}, \qquad \sigma_\mu^2 = \frac{\sigma_{p\parallel g}^2}{\cos^2\theta}, where the directional uncertainties are decomposed using the principal axes (v1,v2)(v_1, v_2) and variances (σ1,σ2)(\sigma_1, \sigma_2). The depth confidence is defined by Cd=1/σμC_d = 1/\sigma_\mu.

4. Match Confidence and Adaptive Patch Sizing

To resolve ambiguities on low-texture or flat edges, the algorithm computes the Attainable Maximum Likelihood (AML) confidence

Cm=1(cc)2,C_m = \frac{1}{\sum_\ell (c_\ell - c_*)^2},

where cc_* is the minimal patch cost in the current search. If Cm<τmC_m < \tau_m, the patch size is incremented Spmin(Sp+2,Smax)S_p \leftarrow \min(S_p + 2, S_{\max}) to include additional gradient structure and re-evaluate the candidates. If confidence remains insufficient, matching falls back to the initial ICP result, and for the lowest confidence (small CdC_d), the corresponding depth estimate is held fixed in bundle adjustment.

5. Bundle Adjustment and Joint Optimization

Refined correspondences with associated depth and confidence metrics are aggregated in a local window for a joint optimization over all poses {ξi}\{ \xi_i \} and depths {dj}\{ d_j \}. The cost per correspondence incorporates:

  • the point-to-tangent residual,
  • a reprojection residual: EijR=ρ(π(Rijπ1(pr,dr)+tij)q2),E^R_{ij} = \rho\left(\| \pi(R_{ij} \pi^{-1}(p_r, d_r) + t_{ij}) - q^* \|_2\right),
  • an optional photometric gradient-consistency term: EijP=ρ(Fj(π(...))Fr(pr)2).E^P_{ij} = \rho(\| F_j(\pi(...)) - F_r(p_r) \|_2). The objective minimized is: i,rwirEEirE+i,rwirREirR+i,rwirPEirP,\sum_{i,r} w_{ir}^E E^E_{ir} + \sum_{i,r} w_{ir}^R E^R_{ir} + \sum_{i,r} w_{ir}^P E^P_{ir}, using frameworks such as g2o or custom Levenberg-Marquardt solvers. Corrrespondences with low depth-confidence are included with depths fixed to preserve mapping integrity.

6. Quantitative Evaluation and Comparative Performance

ED-ICP was evaluated on synthetic and real-world datasets with varying photometric noise and illumination. On vKITTI with day/night variance, it attained the lowest translational drift (approximately $1.5$–$2$ cm/m), with core processing times of 20\sim20 ms (matching) and 50\sim50 ms (ICP) per frame. Classical Lucas-Kanade and census-based optical flows were outperformed both in accuracy and speed, especially under high inter-frame displacement.

On the real-world Symphony-Lake dataset (1.5M images, strong sun-glare, and auto-exposure shifts), ED-ICP achieved the lowest failure rate (1–3% per survey) and low drift (as little as $6$ cm/m in winter). Under 3×3\times frame subsampling (fast motion), drift degraded only slightly, while other approaches displayed $2$–3×3\times increased failure. Full system runtime was approximately $80$ ms/frame for tracking plus $200$ ms/frame for mapping with a laptop and GPU edge-detector.

KITTI benchmark results (after scale correction) reported mean Absolute Trajectory Error (ATE) of $11.5$ m over sequences $00$–$09$ (matching ORB-SLAM2 and outperforming DSO; $11.5$ m vs $12.1$ m ATE).

7. Significance and Methodological Contributions

ED-ICP integrates ICP-based edge registration with a fast, analytically bounded 1D photometric refinement to maximize robustness against illumination and geometric perturbations in monocular visual odometry. Its coarse-to-fine data association pipeline, guided by geometric and photometric uncertainty analysis, yields superior performance in complex conditions (illumination change, large motion). This suggests a broader applicability in low-texture or challenging lighting regimes.

By coupling uncertainty-driven correspondence search with dynamic patch size adaptation and rigorous back-end optimization, ED-ICP achieves resilience and efficiency not available in classical direct or feature-based pipelines (Wu et al., 2019). This hybrid approach is significant for real-time VO systems requiring high reliability under adverse environmental variation.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Whiteboard

Topic to Video (Beta)

Follow Topic

Get notified by email when new papers are published related to Edge Detection-Based ICP (ED-ICP) Algorithm.