Re-ranking Person Re-identification with k-reciprocal Encoding (1701.08398v4)

Published 29 Jan 2017 in cs.CV

Abstract: When considering person re-identification (re-ID) as a retrieval process, re-ranking is a critical step to improve its accuracy. Yet in the re-ID community, limited effort has been devoted to re-ranking, especially those fully automatic, unsupervised solutions. In this paper, we propose a k-reciprocal encoding method to re-rank the re-ID results. Our hypothesis is that if a gallery image is similar to the probe in the k-reciprocal nearest neighbors, it is more likely to be a true match. Specifically, given an image, a k-reciprocal feature is calculated by encoding its k-reciprocal nearest neighbors into a single vector, which is used for re-ranking under the Jaccard distance. The final distance is computed as the combination of the original distance and the Jaccard distance. Our re-ranking method does not require any human interaction or any labeled data, so it is applicable to large-scale datasets. Experiments on the large-scale Market-1501, CUHK03, MARS, and PRW datasets confirm the effectiveness of our method.

Authors (4)

Zhun Zhong (60 papers)
Liang Zheng (181 papers)
Donglin Cao (6 papers)
Shaozi Li (30 papers)

Citations (1,459)

View on Semantic Scholar

Summary

Re-ranking Person Re-identification with $k$ -reciprocal Encoding

The paper "Re-ranking Person Re-identification with $k$ -reciprocal Encoding" presents a novel approach to enhance the accuracy of person re-identification (re-ID) through an unsupervised, automatic re-ranking methodology. This research concentrates on re-ranking as a pivotal step in re-ID, leveraging $k$ -reciprocal encoding to refine initial ranking lists generated in typical re-ID processes.

Methodology and Contributions

The core contribution of this paper is the introduction of $k$ -reciprocal neighbors and their encoding into feature vectors for re-ranking purposes. The hypothesis stipulates that gallery images which are reciprocally nearest neighbors of a probe image are more likely to be true matches. The approach is delineated in several steps:

$k$ -reciprocal Feature Encoding: For a given probe image, $k$ -reciprocal nearest neighbors are encoded into a single vector. This vector representation allows for straightforward comparison using the Jaccard distance metric.
Jaccard Distance Calculation: The Jaccard distance between $k$ -reciprocal feature vectors of the probe and gallery images is computed. This distance is then combined with the original distance to derive a final measure.
Local Query Expansion: To bolster the robustness of $k$ -reciprocal features, the method incorporates a local query expansion, which refines the feature vectors based on the nearest neighbors.
Weighted Distance Aggregation: The final re-ranking distance is an amalgamation of the original distance and the Jaccard distance, weighted to balance their contributions effectively.

The research posits that this method, requiring no human interaction or labeled data, is scalable and applicable to large datasets. Results are initially presented for large datasets such as Market-1501, CUHK03, MARS, and PRW.

Experimental Results

The paper presents extensive experiments that validate the proposed method:

Market-1501 Dataset: The method shows significant improvements in both rank-1 accuracy and mean average precision (mAP). For instance, using the IDE (ResNet-50) baseline, rank-1 accuracy improved from 72.54% to 74.85% and mAP increased from 46.00% to 59.87%.
CUHK03 Dataset: Despite showing modest improvements in single-shot settings, the new protocol (which splits the dataset ensuring multiple ground truths in the gallery) demonstrated a marked performance increase. Notably, the IDE (ResNet-50) + XQDA baseline, when combined with the proposed method, saw improvements in rank-1 accuracy from 32.0% to 38.1% and mAP from 29.6% to 40.3%.
MARS Dataset: On this video-based dataset, the approach yielded considerable enhancements. For the IDE (ResNet-50) + XQDA combination, rank-1 accuracy rose from 70.51% to 73.94%, and mAP from 55.12% to 68.45%.
PRW Dataset: Validating on an end-to-end re-ID dataset, the proposed re-ranking method consistently improved the performance metrics.

Implications and Future Directions

The proposed $k$ -reciprocal re-ranking method offers several implications:

Improved Re-ID Performance: This method enhances the ranking accuracy of initial results without requiring additional labeled data, making it useful for large-scale re-ID tasks.
Scalability and Usability: The approach’s unsupervised nature means it can be seamlessly integrated into existing systems and applied to future datasets without modification.
Foundation for Further Research: This method opens avenues for exploring other unsupervised re-ranking techniques that leverage the relationship between nearest neighbors.

Conclusions

The research firmly establishes $k$ -reciprocal encoding as an effective tool for re-ranking in person re-ID, demonstrating substantial improvements in several benchmark datasets. Future developments could focus on further optimizing the parameters and integrating additional contextual information to enhance robustness and accuracy.

The presented approach makes significant strides in the field of re-ID, offering a scalable, effective, and unsupervised method for improving retrieval accuracy in large and complex datasets.

PDF Markdown