Optimal LRP (oLRP) Metric
- Optimal LRP (oLRP) is a unified object detection error metric that integrates localization error, false positives, and false negatives to determine an optimal confidence threshold.
- It is computed by exhaustively searching over a discretized grid of thresholds to minimize the LRP error, yielding distinct error components for localization, false positives, and false negatives.
- oLRP provides enhanced interpretability and discrimination over Average Precision, enabling practical performance evaluation and tailored threshold selection in modern detectors.
Optimal LRP (oLRP) is a metric-theoretic object detection error that integrates localization error, false positives, and false negatives with respect to an optimally chosen confidence score threshold. Acting as a unified scalar error metric in , oLRP provides both a quantitative summary of detection performance and guidance on threshold selection, addressing limitations of metrics such as Average Precision (AP) by offering enhanced discrimination power and interpretability for modern visual detectors (Oksuz et al., 2018).
1. Foundations: LRP Error Definition
Localization Recall Precision (LRP) error evaluates object detection quality by combining three core error sources: bounding-box localization, false positives, and false negatives. For a given object class, denote the set of ground-truth boxes as and the set of detections (with confidence scores) as . Given Intersection-over-Union (IoU) threshold and confidence threshold , detections with scores below are discarded. The usual IoU-based matching assigns detections to ground truths, yielding counts (true positives), (false positives), and (false negatives).
The LRP error at threshold is computed via three normalized terms:
- Localization error:
0
where 1 denotes the matched ground-truth and detection pairs.
- False-positive rate:
2
- False-negative rate:
3
The combined LRP error is given by:
4
This formulation ensures LRP 5, with lower values indicating better performance. All terms admit a clear operational interpretation, and the metric remains well-defined whenever genuine detections or ground truths are present.
2. Definition and Computation of Optimal LRP
Optimal LRP (oLRP) for a given class searches for the threshold 6 minimizing the LRP error:
7
The optimization is typically performed over a discretized grid 8 by exhaustive search. For each 9, the error components 0, 1, and 2 are evaluated. The threshold 3 achieving
4
is selected, and its auxiliary components 5, 6, and 7 quantify the optimal breakdown of localization and classification errors.
A mean over classes (moLRP) is defined analogously to mean Average Precision (mAP):
8
where 9 is the set of evaluated classes.
3. Algorithmic Structure and Practical Computation
Core computation proceeds via the following for each class and IoU threshold:
- For each 0:
- Select detections 1.
- Assign detections to ground-truths using one-to-one IoU matching (2).
- Compute 3, 4, 5.
- Derive 6, 7, 8 as above.
- Calculate 9.
- If 0 is the minimum so far, store 1 and corresponding error components.
- Report 2 and the values at 3.
This process applies independently per class. The exhaustive thresholding, performed at a grid resolution of 0.01, is computationally modest given contemporary resources.
4. Comparison with AP and Other Evaluation Metrics
Average Precision (AP) computes the area under the precision-recall (PR) curve by integrating performance over all confidence thresholds. AP, however, is insensitive to PR-curve shapes that yield identical areas and disregards bounding-box localization accuracy, except for the IoU binary qualification of a match. oLRP, in contrast, reports the single best operating point on the PR curve, directly characterizes the operational trade-off at deployment, and incorporates localization error as a primary component.
Distinctive theoretical properties include:
- oLRP reports separate components for localization, false positive, and false negative errors at 4.
- LRP satisfies the triangle inequality and is a bounded metric in 5.
- oLRP remains valid even in degenerate cases with no true positives, so long as ground-truth or detected objects exist.
Unlike Panoptic Quality (PQ)—which combines recognition and segmentation quality but is specialized for panoptic segmentation—oLRP generalizes to all detection tasks and offers a principled means of threshold selection.
5. Empirical Findings and Deployment Implications
Empirical analysis on detection benchmarks (e.g., MS COCO-val2017) encompassing diverse contemporary detectors (e.g., SSD-300, SSD-512, Faster R-CNN + FPN, RetinaNet + FPN with ResNet-50/101) demonstrates:
- moLRP correlates closely with mAP rankings but often discriminates detectors with identical mAP by revealing differences in PR-curve sharpness or localization fidelity.
- Class-specific optimal thresholds 6 exhibit high variability (from 7 to 8), indicating the suboptimality of a global threshold.
- In video object detection, switching from a general default threshold (9) to class-specific 0 improved oLRP and mAP, especially in classes with very low/high optimal thresholds.
6. Limitations and Considerations
Potential limitations of oLRP include:
- Threshold discretization: Optimization over 1 is performed on a discretized grid, introducing small approximation error.
- Single IoU threshold: oLRP is computed at a fixed 2 (e.g., 3), though evaluation at multiple thresholds is straightforward.
- Summary at one operating point: Collapsing performance onto 4 may obscure detector behavior elsewhere on the PR curve.
Nevertheless, oLRP delivers a practical, interpretable single-value metric that unifies localization and classification error, provides actionable deployment thresholds, and supports nuanced comparative analysis of detection systems (Oksuz et al., 2018).