Re-ranking Person Re-identification with k-reciprocal Encoding (1701.08398v4)

Published 29 Jan 2017 in cs.CV

Abstract: When considering person re-identification (re-ID) as a retrieval process, re-ranking is a critical step to improve its accuracy. Yet in the re-ID community, limited effort has been devoted to re-ranking, especially those fully automatic, unsupervised solutions. In this paper, we propose a k-reciprocal encoding method to re-rank the re-ID results. Our hypothesis is that if a gallery image is similar to the probe in the k-reciprocal nearest neighbors, it is more likely to be a true match. Specifically, given an image, a k-reciprocal feature is calculated by encoding its k-reciprocal nearest neighbors into a single vector, which is used for re-ranking under the Jaccard distance. The final distance is computed as the combination of the original distance and the Jaccard distance. Our re-ranking method does not require any human interaction or any labeled data, so it is applicable to large-scale datasets. Experiments on the large-scale Market-1501, CUHK03, MARS, and PRW datasets confirm the effectiveness of our method.

Authors (4)
  1. Zhun Zhong (60 papers)
  2. Liang Zheng (181 papers)
  3. Donglin Cao (6 papers)
  4. Shaozi Li (30 papers)
Citations (1,459)

Summary

Re-ranking Person Re-identification with k-reciprocal Encoding

The paper "Re-ranking Person Re-identification with k-reciprocal Encoding" presents a novel approach to enhance the accuracy of person re-identification (re-ID) through an unsupervised, automatic re-ranking methodology. This research concentrates on re-ranking as a pivotal step in re-ID, leveraging k-reciprocal encoding to refine initial ranking lists generated in typical re-ID processes.

Methodology and Contributions

The core contribution of this paper is the introduction of k-reciprocal neighbors and their encoding into feature vectors for re-ranking purposes. The hypothesis is that gallery images which are reciprocal nearest neighbors of a probe image are more likely to be true matches. The approach proceeds in several steps:

  1. k-reciprocal Feature Encoding: For a given probe image, k-reciprocal nearest neighbors are encoded into a single vector. This vector representation allows for straightforward comparison using the Jaccard distance metric.
  2. Jaccard Distance Calculation: The Jaccard distance between k-reciprocal feature vectors of the probe and gallery images is computed. This distance is then combined with the original distance to derive a final measure.
  3. Local Query Expansion: To bolster the robustness of k-reciprocal features, the method incorporates a local query expansion, which refines the feature vectors based on the nearest neighbors.
  4. Weighted Distance Aggregation: The final re-ranking distance is an amalgamation of the original distance and the Jaccard distance, weighted to balance their contributions effectively.
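
In symbols, the quantities in these steps can be written roughly as below. The notation is chosen here for illustration and is not quoted from the paper: N(p, k) denotes the k nearest neighbors of a probe p, d is the original distance, and λ in [0, 1] is an assumed weighting parameter. This set-based form leaves out the vector encoding and the local query expansion.

```latex
% k-reciprocal neighbors: g_i is in p's top-k AND p is in g_i's top-k
\mathcal{R}(p, k) = \{\, g_i \mid g_i \in N(p, k) \ \wedge\ p \in N(g_i, k) \,\}

% Jaccard distance between the two neighbor sets
d_J(p, g_i) = 1 - \frac{|\mathcal{R}(p, k) \cap \mathcal{R}(g_i, k)|}
                       {|\mathcal{R}(p, k) \cup \mathcal{R}(g_i, k)|}

% Final distance: weighted combination of Jaccard and original distance
d^{*}(p, g_i) = (1 - \lambda)\, d_J(p, g_i) + \lambda\, d(p, g_i)
```

With λ = 1 the ranking reduces to the original distance, and with λ = 0 it relies on the Jaccard distance alone.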

The authors argue that the method, which requires no human interaction or labeled data, scales to large datasets. Results are reported on the large-scale Market-1501, CUHK03, MARS, and PRW datasets.
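
The four steps can be sketched in a few lines of NumPy. This is a simplified illustration under stated assumptions, not the authors' implementation: the function names, the parameters `k`, `k2`, and `lam`, the exponential weighting of neighbors, and the row normalisation are choices made for this sketch, and any further refinement of the neighbor sets in the paper is omitted. It expects a symmetric pairwise distance matrix over the probe and gallery images with zeros on the diagonal, and it builds an n x n x n intermediate, so it is only suitable for small inputs.

```python
import numpy as np

def k_reciprocal_mask(dist, k):
    """mask[i, j] is True when j is in i's top-k AND i is in j's top-k."""
    n = dist.shape[0]
    # Top-k neighbor indices per row; each image counts as its own neighbor
    # because dist[i, i] == 0 is the smallest entry in row i.
    topk = np.argsort(dist, axis=1)[:, :k]
    knn = np.zeros((n, n), dtype=bool)
    knn[np.arange(n)[:, None], topk] = True
    return knn & knn.T

def rerank(dist, k=3, k2=1, lam=0.3):
    """Return a re-ranked distance matrix (illustrative sketch only)."""
    recip = k_reciprocal_mask(dist, k)
    # Step 1: encode each k-reciprocal set as a vector. Closer neighbors get
    # larger weights (assumed exp(-d) weighting), rows normalised to sum to 1.
    V = np.where(recip, np.exp(-dist), 0.0)
    V /= V.sum(axis=1, keepdims=True)
    # Step 3: local query expansion, averaging the vectors of the k2 nearest
    # neighbors of each image (k2 = 1 leaves V unchanged).
    if k2 > 1:
        nearest = np.argsort(dist, axis=1)[:, :k2]
        V = V[nearest].mean(axis=1)
    # Step 2: Jaccard distance generalised to weighted vectors via min / max.
    mins = np.minimum(V[:, None, :], V[None, :, :]).sum(axis=2)
    maxs = np.maximum(V[:, None, :], V[None, :, :]).sum(axis=2)
    jaccard = 1.0 - mins / maxs
    # Step 4: weighted combination of Jaccard and original distance.
    return (1.0 - lam) * jaccard + lam * dist

# Toy data: two well-separated 1-D clusters; index 0 plays the probe.
x = np.array([0.0, 0.1, 0.2, 5.0, 5.1, 5.2])
dist = np.abs(x[:, None] - x[None, :])
final = rerank(dist, k=3, k2=2, lam=0.3)
ranking = np.argsort(final[0, 1:]) + 1  # gallery indices ordered for probe 0
```

In this toy setup, images from the other cluster share no k-reciprocal neighbors with the probe, so their Jaccard distance is 1 and they are pushed further down the list than the original distance alone would place them.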

Experimental Results

The paper presents extensive experiments that validate the proposed method:

  1. Market-1501 Dataset: The method shows significant improvements in both rank-1 accuracy and mean average precision (mAP). For instance, using the IDE (ResNet-50) baseline, rank-1 accuracy improved from 72.54% to 74.85% and mAP increased from 46.00% to 59.87%.
  2. CUHK03 Dataset: Improvements are modest in the single-shot setting, but under the new protocol (which splits the dataset so that the gallery contains multiple ground truths) the gains are marked. Notably, the IDE (ResNet-50) + XQDA baseline, when combined with the proposed method, improved in rank-1 accuracy from 32.0% to 38.1% and in mAP from 29.6% to 40.3%.
  3. MARS Dataset: On this video-based dataset, the approach yielded considerable enhancements. For the IDE (ResNet-50) + XQDA combination, rank-1 accuracy rose from 70.51% to 73.94%, and mAP from 55.12% to 68.45%.
  4. PRW Dataset: Validating on an end-to-end re-ID dataset, the proposed re-ranking method consistently improved the performance metrics.
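
The figures above show mAP rising more than rank-1 accuracy. The sketch below uses the standard retrieval definitions of both metrics to illustrate one way this can happen: moving true matches up the list raises average precision even when the top result is unchanged. The match flags are invented toy values, not results from the paper, and the function names are hypothetical.

```python
def rank1(match_flags):
    """1.0 if the top-ranked gallery image is a true match, else 0.0."""
    return 1.0 if match_flags[0] else 0.0

def average_precision(match_flags):
    """Mean of precision@position taken at each true-match position."""
    hits = 0
    precisions = []
    for position, is_match in enumerate(match_flags, start=1):
        if is_match:
            hits += 1
            precisions.append(hits / position)
    return sum(precisions) / len(precisions) if precisions else 0.0

# Toy ranked lists for one probe with three true matches in the gallery.
before = [True, False, False, True, False, True]   # matches at ranks 1, 4, 6
after = [True, True, True, False, False, False]    # matches at ranks 1, 2, 3

print(rank1(before), average_precision(before))  # 1.0 and (1 + 2/4 + 3/6) / 3
print(rank1(after), average_precision(after))    # 1.0 and (1 + 2/2 + 3/3) / 3
```

Rank-1 is identical for both lists, while average precision moves from 2/3 to 1. mAP is the mean of this quantity over all probes.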

Implications and Future Directions

The proposed k-reciprocal re-ranking method has several implications:

  1. Improved Re-ID Performance: This method enhances the ranking accuracy of initial results without requiring additional labeled data, making it useful for large-scale re-ID tasks.
  2. Scalability and Usability: The approach’s unsupervised nature means it can be seamlessly integrated into existing systems and applied to future datasets without modification.
  3. Foundation for Further Research: This method opens avenues for exploring other unsupervised re-ranking techniques that leverage the relationship between nearest neighbors.

Conclusions

The research establishes k-reciprocal encoding as an effective tool for re-ranking in person re-ID, demonstrating substantial improvements on several benchmark datasets. Future developments could focus on further optimizing the parameters and integrating additional contextual information to enhance robustness and accuracy.

The presented approach makes significant strides in the field of re-ID, offering a scalable, effective, and unsupervised method for improving retrieval accuracy in large and complex datasets.