Person Re-identification in the Wild (1604.02531v2)

Published 9 Apr 2016 in cs.CV

Abstract: We present a novel large-scale dataset and comprehensive baselines for end-to-end pedestrian detection and person recognition in raw video frames. Our baselines address three issues: the performance of various combinations of detectors and recognizers, mechanisms for pedestrian detection to help improve overall re-identification accuracy and assessing the effectiveness of different detectors for re-identification. We make three distinct contributions. First, a new dataset, PRW, is introduced to evaluate Person Re-identification in the Wild, using videos acquired through six synchronized cameras. It contains 932 identities and 11,816 frames in which pedestrians are annotated with their bounding box positions and identities. Extensive benchmarking results are presented on this dataset. Second, we show that pedestrian detection aids re-identification through two simple yet effective improvements: a discriminatively trained ID-discriminative Embedding (IDE) in the person subspace using convolutional neural network (CNN) features and a Confidence Weighted Similarity (CWS) metric that incorporates detection scores into similarity measurement. Third, we derive insights in evaluating detector performance for the particular scenario of accurate person re-identification.

Authors (6)

Liang Zheng (181 papers)
Hengheng Zhang (6 papers)
Shaoyan Sun (4 papers)
Manmohan Chandraker (108 papers)
Yi Yang (856 papers)
Qi Tian (314 papers)

Citations (658)

View on Semantic Scholar

Summary

Person Re-identification in the Wild: An Overview

The paper "Person Re-identification in the Wild" introduces a comprehensive dataset and evaluation frameworks to address the challenges inherent in person re-identification (re-ID) in natural environments. This work presents significant advancements in pedestrian detection and person re-ID, bridging the gap often encountered when these tasks are studied in isolation.

Key Contributions

The authors make several noteworthy contributions:

PRW Dataset: The introduction of the PRW dataset marks a pivotal development for the field. Comprising 932 identities and 11,816 annotated frames across six near-synchronized cameras, this dataset supports simultaneous evaluation of detection and re-ID tasks. It fills a critical void left by existing datasets that either lack ID annotations or comprehensive video frames.
Detection and Re-ID Interplay: The paper elucidates how pedestrian detection can enhance re-ID accuracy. The authors propose a cascaded fine-tuning strategy and a Confidence Weighted Similarity (CWS) metric, which together incorporate detection data into re-ID frameworks, resulting in improved performance.
Evaluation of Detection Criteria: A new evaluation rule is proposed, suggesting that an intersection-over-union (IoU) threshold of 0.7, rather than the conventional 0.5, is more effective for assessing detector performance in the context of re-ID tasks.

Methodological Insights

Cascaded Fine-tuning: This strategy first trains a detection model and subsequently fine-tunes it for classification tasks. This approach leverages increased pedestrian detection data to refine CNN embeddings, enhancing their discriminative power.
Confidence Weighted Similarity (CWS): By integrating detection confidences into similarity measurements, CWS diminishes the negative impact of false positives in large galleries, offering a refined approach to similarity scoring.

Experimental Evaluation and Findings

The experimental results demonstrated superior re-ID performance using the novel IDE descriptor, notably when applied with the cascaded fine-tuning strategy. IDE $_{det}$ , the fine-tuned model, consistently exceeded the baseline IDE $_{imgnet}$ model in accuracy, underscoring the merit of pre-processing with varied detection data.

A comparative analysis of multiple detectors and recognizers further highlighted the impact of fine-tuning on re-ID performance. Particularly, detection accuracy under IoU $>$ 0.7 was shown to align closely with improved re-ID outcomes.

Implications and Future Directions

The implications of this research are multifaceted. Practically, this work sets a new benchmark for integrated end-to-end systems that encompass both detection and recognition tasks. The PRW dataset itself will likely serve as a fundamental resource in subsequent studies aiming to evaluate and optimize re-ID systems.

Theoretically, this paper opens up avenues for exploring improved detector localization methods and sophisticated re-weighting schemes within similarity measurement frameworks. Furthermore, the potential synergies between detection-enhanced feature learning and partial re-ID methodologies warrant further exploration.

Conclusion

The authors present a robust and well-structured contribution to the domain of person re-ID, particularly in challenging real-world conditions. By addressing both detection and recognition in a unified system, this research provides a strategic framework to drive future advancements in the field. The proposed methodologies, dataset, and subsequent findings lay a strong foundation for ongoing research aimed at more accurate and reliable person re-identification in complex environments.

PDF Markdown