Papers
Topics
Authors
Recent
Search
2000 character limit reached

Localisation Recall Precision (LRP) Error

Updated 29 May 2026
  • Localisation Recall Precision (LRP) Error is a comprehensive metric that integrates localisation, recall, and precision to assess detector performance.
  • It explicitly decomposes errors into localisation, false positive, and false negative components, offering actionable insights for detector tuning.
  • Extensions such as oLRP and aLRP loss enable optimal threshold selection and unified loss design, enhancing visual detection training.

Localisation Recall Precision (LRP) Error quantifies the performance of visual detectors by combining localisation accuracy, recall, and precision into a unified, interpretable metric. LRP was proposed to directly address key deficiencies of Average Precision (AP) in object detection evaluation, such as its insensitivity to bounding box localisation errors and its limited interpretability for identifying sources of error. The metric provides a scalar value in [0, 1], with lower values indicating superior detector performance, and explicitly decomposes the contributions of localisation (bounding box tightness), false positives (precision), and false negatives (recall) (Oksuz et al., 2018). Subsequent generalizations extended LRP evaluation to a broad range of visual detection tasks, including instance segmentation, keypoint detection, and panoptic segmentation (Oksuz et al., 2020).

1. Formal Definition and Components

Let X={x1,…,x∣X∣}X = \{x_1, \dots, x_{|X|}\} denote the set of ground-truth boxes for one class, and Y={yj}Y = \{y_j\} the set of predicted boxes with confidence scores sj∈[0,1]s_j \in [0, 1]. For a chosen score threshold ss and Intersection-over-Union (IoU) threshold τ\tau, predictions with sj≥ss_j \geq s form YsY_s. Greedy one-to-one matching is performed between XX and YsY_s in decreasing score order, accepting only pairs with IoU ≥τ\geq \tau. The following notations are used:

  • Y={yj}Y = \{y_j\}0: Number of true positive matches
  • Y={yj}Y = \{y_j\}1: Number of false positives, Y={yj}Y = \{y_j\}2
  • Y={yj}Y = \{y_j\}3: Number of false negatives, Y={yj}Y = \{y_j\}4

The compact formulation of LRP error is: Y={yj}Y = \{y_j\}5

LRP can be decomposed into three components:

  • Localisation error:

Y={yj}Y = \{y_j\}6

Normalized such that Y={yj}Y = \{y_j\}7 is the mean IoU of matched detections.

  • False positive error:

Y={yj}Y = \{y_j\}8

  • False negative error:

Y={yj}Y = \{y_j\}9

The weighted sum yields the overall metric: sj∈[0,1]s_j \in [0, 1]0 (Oksuz et al., 2018, Oksuz et al., 2020).

2. Computation and Evaluation Procedure

The computation of LRP involves:

  1. Thresholding: Select an IoU threshold sj∈[0,1]s_j \in [0, 1]1 and a confidence threshold sj∈[0,1]s_j \in [0, 1]2; assemble sj∈[0,1]s_j \in [0, 1]3.
  2. Matching: Greedy one-to-one matching between sj∈[0,1]s_j \in [0, 1]4 and sj∈[0,1]s_j \in [0, 1]5 under the IoU and score criteria.
  3. Count TP/FP/FN: True positives correspond to matches, false positives are unmatched predictions, and false negatives are unmatched ground truths.
  4. Component calculation: Compute each of the three error terms as specified.
  5. Aggregate: Combine all error components into the final LRP score.

LRP is undefined only if both sj∈[0,1]s_j \in [0, 1]6 and sj∈[0,1]s_j \in [0, 1]7 are zero.

3. Optimal LRP (oLRP) and Mean oLRP (moLRP)

The LRP error depends on the chosen confidence threshold sj∈[0,1]s_j \in [0, 1]8. To identify a detector's best possible performance, the Optimal LRP (oLRP) is defined as the minimum LRP achievable over all possible score thresholds: sj∈[0,1]s_j \in [0, 1]9 The corresponding optimal threshold ss0 is the operating point minimizing LRP. For multi-class detectors, the mean oLRP (moLRP) across classes is: ss1 This enables evaluation and comparison of detectors at their optimal operational points (Oksuz et al., 2018).

AP is the integral under the precision-recall curve, disregarding the actual tightness of bounding boxes once an IoU threshold is met. AP cannot distinguish between detections with high and low localisation accuracy above ss2 and fails to provide operating thresholds or decomposable error sources.

LRP and oLRP directly address these issues:

  • LRP decomposes total error into localisation, false positives, and false negatives, providing interpretable diagnostics.
  • oLRP identifies per-class optimal thresholds ss3, enabling effective threshold selection without manual tuning.
  • LRP is a proper metric, satisfying the triangle inequality, and is sensitive to bounding box tightness.
  • When AP and moLRP disagree, the latter reveals differences in box tightness and precision-recall curve properties not captured by AP.

LRP also outperforms AP in stability under data scarcity, since LRP does not rely on curve interpolation (Oksuz et al., 2018, Oksuz et al., 2020, Oksuz et al., 2020).

5. Empirical Findings and Practical Implications

Multiple state-of-the-art detectors (e.g., SSD, Faster R-CNN, RetinaNet) evaluated on MS COCO ranked consistently by mAP and moLRP; however, moLRP uniquely quantifies box tightness and the sharpness of the precision-recall profile. Experimental evidence demonstrates:

  • Class-specific optimal thresholds ss4 vary widely, often deviating from common thresholds like ss5, confirming the inadequacy of one-size-fits-all score thresholds.
  • Per-class thresholding using ss6 obtained from oLRP reduces both LRP error and increases mAP, most significantly for classes with non-standard optimal thresholds.
  • Ablation with varying IoU ss7 provides insight: higher ss8 tightens localisation requirements and affects error trade-offs, summarizing performance under different strictness constraints (Oksuz et al., 2018).

6. Generalizations and Extensions: aLRP Loss

The LRP metric forms the basis for the average Localisation-Recall-Precision (aLRP) loss, a unified, bounded, balanced, ranking-based loss for both classification and localisation in object detection. aLRP extends LRP analogously to how AP Loss extends precision to a ranking-based framework, inheriting the decomposability and task-unification properties.

aLRP uses a single hyperparameter (the smoothing window ss9 for approximating the Heaviside step function) and unifies matching, ranking, and localisation error for efficient and balanced gradient updates during training (Oksuz et al., 2020).

In summary, LRP and its variants constitute a rigorous framework for evaluating and diagnosing visual detectors, with superior interpretability and operational utility compared to AP and IoU-only metrics. The explicit decomposition and optimizability of LRP support diagnostic insight, principled threshold selection, and balanced metric-based training loss design.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Localisation Recall Precision (LRP) Error.