Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Foreground-aware Pyramid Reconstruction for Alignment-free Occluded Person Re-identification (1904.04975v2)

Published 10 Apr 2019 in cs.CV

Abstract: Re-identifying a person across multiple disjoint camera views is important for intelligent video surveillance, smart retailing and many other applications. However, existing person re-identification (ReID) methods are challenged by the ubiquitous occlusion over persons and suffer from performance degradation. This paper proposes a novel occlusion-robust and alignment-free model for occluded person ReID and extends its application to realistic and crowded scenarios. The proposed model first leverages the full convolution network (FCN) and pyramid pooling to extract spatial pyramid features. Then an alignment-free matching approach, namely Foreground-aware Pyramid Reconstruction (FPR), is developed to accurately compute matching scores between occluded persons, despite their different scales and sizes. FPR uses the error from robust reconstruction over spatial pyramid features to measure similarities between two persons. More importantly, we design an occlusion-sensitive foreground probability generator that focuses more on clean human body parts to refine the similarity computation with less contamination from occlusion. The FPR is easily embedded into any end-to-end person ReID models. The effectiveness of the proposed method is clearly demonstrated by the experimental results (Rank-1 accuracy) on three occluded person datasets: Partial REID (78.30\%), Partial iLIDS (68.08\%) and Occluded REID (81.00\%); and three benchmark person datasets: Market1501 (95.42\%), DukeMTMC (88.64\%) and CUHK03 (76.08\%)

Review of "Foreground-aware Pyramid Reconstruction for Alignment-free Occluded Person Re-identification"

The paper presents an innovative methodology for addressing the challenge of occluded person re-identification (ReID) in scenarios involving multiple disjoint cameras. Traditional ReID systems often struggle with occlusion due to objects or other individuals, leading to a significant loss in accuracy. This paper proposes an occlusion-robust framework that does not rely on person alignment or external cues, setting itself apart from other approaches.

Key Contributions and Methodological Advancements

  • Foreground-aware Pyramid Reconstruction (FPR): The authors introduce FPR as a core component of their system. This alignment-free approach computes similarity scores between occluded individuals by leveraging pyramid pooling coupled with foreground-aware spatial reconstruction. This method effectively handles different scales and sizes of occluded persons, utilizing reconstruction errors from spatial pyramid features to measure similarity.
  • Occlusion-sensitive Foreground Probability Generator: The model integrates a novel generator that prioritizes unblemished human body parts, further refining similarity computation by minimizing the influence of occlusion.
  • Embedding in End-to-end Models: FPR is designed to be easily integratable into existing end-to-end ReID models, enhancing flexibility and adaptability within different system architectures.

Experimental Results

Experimental evaluation presents compelling evidence for the efficacy of the proposed approach:

  • Occluded Datasets: On Partial REID, Partial iLIDS, and Occluded REID datasets, the model achieves Rank-1 accuracy of 78.30%, 68.08%, and 81.00%, respectively. These results indicate significant progress over the current state-of-the-art methods, such as PCB and DSR.
  • Benchmark Datasets: On Market1501, DukeMTMC, and CUHK03 benchmark datasets, the method achieves competitive Rank-1 accuracy scores of 95.42%, 88.64%, and 76.08%, respectively. This reaffirms the model's applicability across both occluded and unoccluded scenarios.

Theoretical and Practical Implications

The model's alignment-free nature and reduced dependency on external cues imply a significant reduction in computational complexity and inference time. This approach could enable more real-time applications in surveillance and retailing where occlusions are prevalent, offering robust identity matching without requiring high-fidelity segmentation or pose estimation.

Future Directions

Looking ahead, the methodology could be extended and optimized for diverse applications beyond video surveillance and retail, such as autonomous systems and robotics. Further research might explore the integration of real-time adaptive mechanisms to dynamically adjust the foreground-aware components based on environmental and contextual cues.

In summary, this paper makes a substantive contribution to the field of occluded person re-identification, offering a fresh perspective that prioritizes computational efficiency, simplicity in integration, and robustness in highly occluded environments.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Lingxiao He (14 papers)
  2. Yinggang Wang (2 papers)
  3. Wu Liu (56 papers)
  4. Xingyu Liao (18 papers)
  5. He Zhao (117 papers)
  6. Zhenan Sun (81 papers)
  7. Jiashi Feng (295 papers)
Citations (191)